Science.gov

Sample records for accurate phylogenetic classification

  1. Accurate phylogenetic classification of DNA fragments based onsequence composition

    SciTech Connect

    McHardy, Alice C.; Garcia Martin, Hector; Tsirigos, Aristotelis; Hugenholtz, Philip; Rigoutsos, Isidore

    2006-05-01

    Metagenome studies have retrieved vast amounts of sequenceout of a variety of environments, leading to novel discoveries and greatinsights into the uncultured microbial world. Except for very simplecommunities, diversity makes sequence assembly and analysis a verychallenging problem. To understand the structure a 5 nd function ofmicrobial communities, a taxonomic characterization of the obtainedsequence fragments is highly desirable, yet currently limited mostly tothose sequences that contain phylogenetic marker genes. We show that forclades at the rank of domain down to genus, sequence composition allowsthe very accurate phylogenetic 10 characterization of genomic sequence.We developed a composition-based classifier, PhyloPythia, for de novophylogenetic sequence characterization and have trained it on adata setof 340 genomes. By extensive evaluation experiments we show that themethodis accurate across all taxonomic ranks considered, even forsequences that originate fromnovel organisms and are as short as 1kb.Application to two metagenome datasets 15 obtained from samples ofphosphorus-removing sludge showed that the method allows the accurateclassification at genus level of most sequence fragments from thedominant populations, while at the same time correctly characterizingeven larger parts of the samples at higher taxonomic levels.

  2. A Higher-Level Phylogenetic Classification of the Fungi

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A comprehensive phylogenetic classification of the Fungi is proposed, with reference to recent molecular phylogenetic analyses, and with input from members of the fungal taxonomic community. The classification includes 196 taxa, down to the level of order, of which 23 are described or are validated ...

  3. Accurate Phylogenetic Tree Reconstruction from Quartets: A Heuristic Approach

    PubMed Central

    Reaz, Rezwana; Bayzid, Md. Shamsuzzoha; Rahman, M. Sohel

    2014-01-01

    Supertree methods construct trees on a set of taxa (species) combining many smaller trees on the overlapping subsets of the entire set of taxa. A ‘quartet’ is an unrooted tree over taxa, hence the quartet-based supertree methods combine many -taxon unrooted trees into a single and coherent tree over the complete set of taxa. Quartet-based phylogeny reconstruction methods have been receiving considerable attentions in the recent years. An accurate and efficient quartet-based method might be competitive with the current best phylogenetic tree reconstruction methods (such as maximum likelihood or Bayesian MCMC analyses), without being as computationally intensive. In this paper, we present a novel and highly accurate quartet-based phylogenetic tree reconstruction method. We performed an extensive experimental study to evaluate the accuracy and scalability of our approach on both simulated and biological datasets. PMID:25117474

  4. Discriminative features and classification methods for accurate classification

    NASA Astrophysics Data System (ADS)

    Dessauer, Michael P.; Dua, Sumeet

    2010-04-01

    Automated classification and tracking approaches suffer from the high-dimensionality of the data and information space, which frequently rely upon both discriminative feature selection and efficient, accurate supervised classification strategies. Feature selection strategies have the benefit of representing the data in a modified reduced space to improve the efficacy of data mining, machine learning, and computer vision approaches. We have developed feature-selection methods involving feature ranking and assimilation to discover reduced feature sets that produce accurate results in classification for automated classifiers with significant specificity and sensitivity. We have tested a wide range of spatial, texture, and wavelet-based feature sets for electro-optical (EO) aerial imagery and infrared (IR) land-based image sequences on several machine-learning algorithms for classification for performance evaluation and comparison. A detailed experimental evaluation is provided for the classification efficacy of the features and classifiers on the particular data sets, and is accompanied by a discussion of the particular success or failure. In the second section, we detail our novel feature set that combines moment and edge descriptors and produces high, robust accuracy when evaluated for classification. Our method leverages information previously calculated in the detection stage, which includes wavelet decomposition and texture statistics. We demonstrate the results of our feature set implementation and discuss methods for creating classifier decision rules to choose a particular classification algorithm dependent on certain operating conditions or data types adaptively.

  5. Phylogenetic classification of short environmental DNA fragments

    PubMed Central

    Krause, Lutz; Diaz, Naryttza N.; Goesmann, Alexander; Kelley, Scott; Nattkemper, Tim W.; Rohwer, Forest; Edwards, Robert A.; Stoye, Jens

    2008-01-01

    Metagenomics is providing striking insights into the ecology of microbial communities. The recently developed massively parallel 454 pyrosequencing technique gives the opportunity to rapidly obtain metagenomic sequences at a low cost and without cloning bias. However, the phylogenetic analysis of the short reads produced represents a significant computational challenge. The phylogenetic algorithm CARMA for predicting the source organisms of environmental 454 reads is described. The algorithm searches for conserved Pfam domain and protein families in the unassembled reads of a sample. These gene fragments (environmental gene tags, EGTs), are classified into a higher-order taxonomy based on the reconstruction of a phylogenetic tree of each matching Pfam family. The method exhibits high accuracy for a wide range of taxonomic groups, and EGTs as short as 27 amino acids can be phylogenetically classified up to the rank of genus. The algorithm was applied in a comparative study of three aquatic microbial samples obtained by 454 pyrosequencing. Profound differences in the taxonomic composition of these samples could be clearly revealed. PMID:18285365

  6. Towards an integrated phylogenetic classification of the Tremellomycetes.

    PubMed

    Liu, X-Z; Wang, Q-M; Göker, M; Groenewald, M; Kachalkin, A V; Lumbsch, H T; Millanes, A M; Wedin, M; Yurkov, A M; Boekhout, T; Bai, F-Y

    2015-06-01

    Families and genera assigned to Tremellomycetes have been mainly circumscribed by morphology and for the yeasts also by biochemical and physiological characteristics. This phenotype-based classification is largely in conflict with molecular phylogenetic analyses. Here a phylogenetic classification framework for the Tremellomycetes is proposed based on the results of phylogenetic analyses from a seven-genes dataset covering the majority of tremellomycetous yeasts and closely related filamentous taxa. Circumscriptions of the taxonomic units at the order, family and genus levels recognised were quantitatively assessed using the phylogenetic rank boundary optimisation (PRBO) and modified general mixed Yule coalescent (GMYC) tests. In addition, a comprehensive phylogenetic analysis on an expanded LSU rRNA (D1/D2 domains) gene sequence dataset covering as many as available teleomorphic and filamentous taxa within Tremellomycetes was performed to investigate the relationships between yeasts and filamentous taxa and to examine the stability of undersampled clades. Based on the results inferred from molecular data and morphological and physiochemical features, we propose an updated classification for the Tremellomycetes. We accept five orders, 17 families and 54 genera, including seven new families and 18 new genera. In addition, seven families and 17 genera are emended and one new species name and 185 new combinations are proposed. We propose to use the term pro tempore or pro tem. in abbreviation to indicate the species names that are temporarily maintained. PMID:26955199

  7. Towards an integrated phylogenetic classification of the Tremellomycetes

    PubMed Central

    Liu, X.-Z.; Wang, Q.-M.; Göker, M.; Groenewald, M.; Kachalkin, A.V.; Lumbsch, H.T.; Millanes, A.M.; Wedin, M.; Yurkov, A.M.; Boekhout, T.; Bai, F.-Y.

    2016-01-01

    Families and genera assigned to Tremellomycetes have been mainly circumscribed by morphology and for the yeasts also by biochemical and physiological characteristics. This phenotype-based classification is largely in conflict with molecular phylogenetic analyses. Here a phylogenetic classification framework for the Tremellomycetes is proposed based on the results of phylogenetic analyses from a seven-genes dataset covering the majority of tremellomycetous yeasts and closely related filamentous taxa. Circumscriptions of the taxonomic units at the order, family and genus levels recognised were quantitatively assessed using the phylogenetic rank boundary optimisation (PRBO) and modified general mixed Yule coalescent (GMYC) tests. In addition, a comprehensive phylogenetic analysis on an expanded LSU rRNA (D1/D2 domains) gene sequence dataset covering as many as available teleomorphic and filamentous taxa within Tremellomycetes was performed to investigate the relationships between yeasts and filamentous taxa and to examine the stability of undersampled clades. Based on the results inferred from molecular data and morphological and physiochemical features, we propose an updated classification for the Tremellomycetes. We accept five orders, 17 families and 54 genera, including seven new families and 18 new genera. In addition, seven families and 17 genera are emended and one new species name and 185 new combinations are proposed. We propose to use the term pro tempore or pro tem. in abbreviation to indicate the species names that are temporarily maintained. PMID:26955199

  8. A Functional-Phylogenetic Classification System for Transmembrane Solute Transporters

    PubMed Central

    Saier, Milton H.

    2000-01-01

    A comprehensive classification system for transmembrane molecular transporters has been developed and recently approved by the transport panel of the nomenclature committee of the International Union of Biochemistry and Molecular Biology. This system is based on (i) transporter class and subclass (mode of transport and energy coupling mechanism), (ii) protein phylogenetic family and subfamily, and (iii) substrate specificity. Almost all of the more than 250 identified families of transporters include members that function exclusively in transport. Channels (115 families), secondary active transporters (uniporters, symporters, and antiporters) (78 families), primary active transporters (23 families), group translocators (6 families), and transport proteins of ill-defined function or of unknown mechanism (51 families) constitute distinct categories. Transport mode and energy coupling prove to be relatively immutable characteristics and therefore provide primary bases for classification. Phylogenetic grouping reflects structure, function, mechanism, and often substrate specificity and therefore provides a reliable secondary basis for classification. Substrate specificity and polarity of transport prove to be more readily altered during evolutionary history and therefore provide a tertiary basis for classification. With very few exceptions, a phylogenetic family of transporters includes members that function by a single transport mode and energy coupling mechanism, although a variety of substrates may be transported, sometimes with either inwardly or outwardly directed polarity. In this review, I provide cross-referencing of well-characterized constituent transporters according to (i) transport mode, (ii) energy coupling mechanism, (iii) phylogenetic grouping, and (iv) substrates transported. The structural features and distribution of recognized family members throughout the living world are also evaluated. The tabulations should facilitate familial and functional assignments of newly sequenced transport proteins that will result from future genome sequencing projects. PMID:10839820

  9. The ABC of ABCS: a phylogenetic and functional classification of ABC systems in living organisms.

    PubMed

    Dassa, E; Bouige, P

    2001-01-01

    ATP binding cassette (ABC) systems constitute one of the most abundant superfamilies of proteins. They are involved not only in the transport of a wide variety of substances, but also in many cellular processes and in their regulation. In this paper, we made a comparative analysis of the properties of ABC systems and we provide a phylogenetic and functional classification. This analysis will be helpful to accurately annotate ABC systems discovered during the sequencing of the genome of living organisms and to identify the partners of the ABC ATPases. PMID:11421270

  10. Phylogenetic and functional classification of ATP-binding cassette (ABC) systems.

    PubMed

    Bouige, Philippe; Laurent, David; Piloyan, Linda; Dassa, Elie

    2002-10-01

    ATP binding cassette (ABC) systems constitute one of the most abundant superfamilies of proteins. They are involved in the transport of a wide variety of substances, but also in many cellular processes and in their regulation. In this paper, we made a comparative analysis of the properties of ABC systems and we provide a phylogenetic and functional classification. This analysis will be helpful to accurately annotate ABC systems discovered during the sequencing of the genome of living organisms and to identify the partners of the ABC ATPases. PMID:12370001

  11. Phylogenetic classification of Cordyceps and the clavicipitaceous fungi

    PubMed Central

    Sung, Gi-Ho; Hywel-Jones, Nigel L.; Sung, Jae-Mo; Luangsa-ard, J. Jennifer; Shrestha, Bhushan; Spatafora, Joseph W.

    2007-01-01

    Cordyceps, comprising over 400 species, was historically classified in the Clavicipitaceae, based on cylindrical asci, thickened ascus apices and filiform ascospores, which often disarticulate into part-spores. Cordyceps was characterized by the production of well-developed often stipitate stromata and an ecology as a pathogen of arthropods and Elaphomyces with infrageneric classifications emphasizing arrangement of perithecia, ascospore morphology and host affiliation. To refine the classification of Cordyceps and the Clavicipitaceae, the phylogenetic relationships of 162 taxa were estimated based on analyses consisting of five to seven loci, including the nuclear ribosomal small and large subunits (nrSSU and nrLSU), the elongation factor 1α (tef1), the largest and the second largest subunits of RNA polymerase II (rpb1 and rpb2), β-tubulin (tub), and mitochondrial ATP6 (atp6). Our results strongly support the existence of three clavicipitaceous clades and reject the monophyly of both Cordyceps and Clavicipitaceae. Most diagnostic characters used in current classifications of Cordyceps (e.g., arrangement of perithecia, ascospore fragmentation, etc.) were not supported as being phylogenetically informative; the characters that were most consistent with the phylogeny were texture, pigmentation and morphology of stromata. Therefore, we revise the taxonomy of Cordyceps and the Clavicipitaceae to be consistent with the multi-gene phylogeny. The family Cordycipitaceae is validated based on the type of Cordyceps, C. militaris, and includes most Cordyceps species that possess brightly coloured, fleshy stromata. The new family Ophiocordycipitaceae is proposed based on Ophiocordyceps Petch, which we emend. The majority of species in this family produce darkly pigmented, tough to pliant stromata that often possess aperithecial apices. The new genus Elaphocordyceps is proposed for a subclade of the Ophiocordycipitaceae, which includes all species of Cordyceps that parasitize the fungal genus Elaphomyces and some closely related species that parasitize arthropods. The family Clavicipitaceae s. s. is emended and includes the core clade of grass symbionts (e.g., Balansia, Claviceps, Epichloë, etc.), and the entomopathogenic genus Hypocrella and relatives. In addition, the new genus Metacordyceps is proposed for Cordyceps species that are closely related to the grass symbionts in the Clavicipitaceae s. s. Metacordyceps includes teleomorphs linked to Metarhizium and other closely related anamorphs. Two new species are described, and lists of accepted names for species in Cordyceps, Elaphocordyceps, Metacordyceps and Ophiocordyceps are provided. PMID:18490993

  12. Towards a phylogenetic classification of Leptothecata (Cnidaria, Hydrozoa)

    PubMed Central

    Maronna, Maximiliano M.; Miranda, Thaís P.; Peña Cantero, Álvaro L.; Barbeitos, Marcos S.; Marques, Antonio C.

    2016-01-01

    Leptothecata are hydrozoans whose hydranths are covered by perisarc and gonophores and whose medusae bear gonads on their radial canals. They develop complex polypoid colonies and exhibit considerable morphological variation among species with respect to growth, defensive structures and mode of development. For instance, several lineages within this order have lost the medusa stage. Depending on the author, traditional taxonomy in hydrozoans may be either polyp- or medusa-oriented. Therefore, the absence of the latter stage in some lineages may lead to very different classification schemes. Molecular data have proved useful in elucidating this taxonomic challenge. We analyzed a super matrix of new and published rRNA gene sequences (16S, 18S and 28S), employing newly proposed methods to measure branch support and improve phylogenetic signal. Our analysis recovered new clades not recognized by traditional taxonomy and corroborated some recently proposed taxa. We offer a thorough taxonomic revision of the Leptothecata, erecting new orders, suborders, infraorders and families. We also discuss the origination and diversification dynamics of the group from a macroevolutionary perspective. PMID:26821567

  13. Towards a phylogenetic classification of Leptothecata (Cnidaria, Hydrozoa).

    PubMed

    Maronna, Maximiliano M; Miranda, Thaís P; Peña Cantero, Álvaro L; Barbeitos, Marcos S; Marques, Antonio C

    2016-01-01

    Leptothecata are hydrozoans whose hydranths are covered by perisarc and gonophores and whose medusae bear gonads on their radial canals. They develop complex polypoid colonies and exhibit considerable morphological variation among species with respect to growth, defensive structures and mode of development. For instance, several lineages within this order have lost the medusa stage. Depending on the author, traditional taxonomy in hydrozoans may be either polyp- or medusa-oriented. Therefore, the absence of the latter stage in some lineages may lead to very different classification schemes. Molecular data have proved useful in elucidating this taxonomic challenge. We analyzed a super matrix of new and published rRNA gene sequences (16S, 18S and 28S), employing newly proposed methods to measure branch support and improve phylogenetic signal. Our analysis recovered new clades not recognized by traditional taxonomy and corroborated some recently proposed taxa. We offer a thorough taxonomic revision of the Leptothecata, erecting new orders, suborders, infraorders and families. We also discuss the origination and diversification dynamics of the group from a macroevolutionary perspective. PMID:26821567

  14. SATe-II: very fast and accurate simultaneous estimation of multiple sequence alignments and phylogenetic trees.

    PubMed

    Liu, Kevin; Warnow, Tandy J; Holder, Mark T; Nelesen, Serita M; Yu, Jiaye; Stamatakis, Alexandros P; Linder, C Randal

    2012-01-01

    Highly accurate estimation of phylogenetic trees for large data sets is difficult, in part because multiple sequence alignments must be accurate for phylogeny estimation methods to be accurate. Coestimation of alignments and trees has been attempted but currently only SATé estimates reasonably accurate trees and alignments for large data sets in practical time frames (Liu K., Raghavan S., Nelesen S., Linder C.R., Warnow T. 2009b. Rapid and accurate large-scale coestimation of sequence alignments and phylogenetic trees. Science. 324:1561-1564). Here, we present a modification to the original SATé algorithm that improves upon SATé (which we now call SATé-I) in terms of speed and of phylogenetic and alignment accuracy. SATé-II uses a different divide-and-conquer strategy than SATé-I and so produces smaller more closely related subsets than SATé-I; as a result, SATé-II produces more accurate alignments and trees, can analyze larger data sets, and runs more efficiently than SATé-I. Generally, SATé is a metamethod that takes an existing multiple sequence alignment method as an input parameter and boosts the quality of that alignment method. SATé-II-boosted alignment methods are significantly more accurate than their unboosted versions, and trees based upon these improved alignments are more accurate than trees based upon the original alignments. Because SATé-I used maximum likelihood (ML) methods that treat gaps as missing data to estimate trees and because we found a correlation between the quality of tree/alignment pairs and ML scores, we explored the degree to which SATé's performance depends on using ML with gaps treated as missing data to determine the best tree/alignment pair. We present two lines of evidence that using ML with gaps treated as missing data to optimize the alignment and tree produces very poor results. First, we show that the optimization problem where a set of unaligned DNA sequences is given and the output is the tree and alignment of those sequences that maximize likelihood under the Jukes-Cantor model is uninformative in the worst possible sense. For all inputs, all trees optimize the likelihood score. Second, we show that a greedy heuristic that uses GTR+Gamma ML to optimize the alignment and the tree can produce very poor alignments and trees. Therefore, the excellent performance of SATé-II and SATé-I is not because ML is used as an optimization criterion for choosing the best tree/alignment pair but rather due to the particular divide-and-conquer realignment techniques employed. PMID:22139466

  15. Accurate reconstruction of a known HIV-1 transmission history by phylogenetic tree analysis.

    PubMed

    Leitner, T; Escanilla, D; Franzén, C; Uhlén, M; Albert, J

    1996-10-01

    Phylogenetic analyses are increasingly used in attempts to clarify transmission patterns of human immunodeficiency virus type 1 (HIV-1), but there is a continuing discussion about their validity because convergent evolution and transmission of minor HIV variants may obscure epidemiological patterns. Here we have studied a unique HIV-1 transmission cluster consisting of nine infected individuals, for whom the time and direction of each virus transmission was exactly known. Most of the transmissions occurred between 1981 and 1983, and a total of 13 blood samples were obtained approximately 2-12 years later. The p17 gag and env V3 regions of the HIV-1 genome were directly sequenced from uncultured lymphocytes. A true phylogenetic tree was constructed based on the knowledge about when the transmissions had occurred and when the samples were obtained. This complex, known HIV-1 transmission history was compared with reconstructed molecular trees, which were calculated from the DNA sequences by several commonly used phylogenetic inference methods [Fitch-Margoliash, neighbor-joining, minimum-evolution, maximum-likelihood, maximum-parsimony, unweighted pair group method using arithmetic averages (UPGMA), and a Fitch-Margoliash method assuming a molecular clock (KITSCH)]. A majority of the reconstructed trees were good estimates of the true phylogeny; 12 of 13 taxa were correctly positioned in the most accurate trees. The choice of gene fragment was found to be more important than the choice of phylogenetic method and substitution model. However, methods that are sensitive to unequal rates of change performed more poorly (such as UPGMA and KITSCH, which assume a constant molecular clock). The rapidly evolving V3 fragment gave better reconstructions than p17, but a combined data set of both p17 and V3 performed best. The accuracy of the phylogenetic methods justifies their use in HIV-1 research and argues against convergent evolution and selective transmission of certain virus variants. PMID:8855273

  16. International Standards for Neurological Classification of Spinal Cord Injury: Training Effect on Accurate Classification

    PubMed Central

    Chafetz, Ross S; Vogel, Lawrence C; Betz, Randal R; Gaughan, John P; Mulcahey, Mary Jane

    2008-01-01

    Objective: To evaluate the accuracy and agreement of International Standards for Neurological Classification of Spinal Cord Injury (ISCSCI) classification and to determine the effectiveness of formal training for pediatric clinicians. Study Population: Participants (N = 28) in a formal 90-minute classification training session. Outcome Measure: Pre/post-training examination of 10 case examples of a variety of neurological classifications. Results: Regardless of years of experience with the ISCSCI, a statistically significant improvement (P < 0.05) in classification was achieved after formal training. Before training, 27% (539 of 1,960) of the questions were answered incorrectly. After training, the percentage of incorrect classifications decreased to 11% (198 of 1,960) incorrect (P < 0.05). After training, the percentage of incorrect motor level classifications decreased by 23% (42% to 19% incorrect; P < 0.05). Post-training improvements were also demonstrated (P < 0.05) in classifying sensory levels (9% to 3% incorrect), neurological levels (31% to 6% incorrect), and severity of injury (9% to 0% incorrect). After training, reductions in classification errors (P < 0.05) were demonstrated in American Spinal Injury Association (ASIA) Impairment Scale (AIS) A (from 20% to 7%), B (50% to 11%), C (71% to 46%), and D (63% to 16%). Conclusions: This study demonstrated the benefits of formal, standardized training for accurate classification of the ISCSCI. Effective training programs must emphasize the guidelines and decision algorithms used to determine motor level and ASIA AIS designations because these remained problematic after training and are often a concern of patients/parents and are primary endpoints in clinical trials for neurological recovery. PMID:19086711

  17. A phylogenetic analysis of the mycoplasmas: basis for their classification.

    PubMed Central

    Weisburg, W G; Tully, J G; Rose, D L; Petzel, J P; Oyaizu, H; Yang, D; Mandelco, L; Sechrest, J; Lawrence, T G; Van Etten, J

    1989-01-01

    Small-subunit rRNA sequences were determined for almost 50 species of mycoplasmas and their walled relatives, providing the basis for a phylogenetic systematic analysis of these organisms. Five groups of mycoplasmas per se were recognized (provisional names are given): the hominis group (which included species such as Mycoplasma hominis, Mycoplasma lipophilum, Mycoplasma pulmonis, and Mycoplasma neurolyticum), the pneumoniae group (which included species such as Mycoplasma pneumoniae and Mycoplasma muris), the spiroplasma group (which included species such as Mycoplasma mycoides, Spiroplasma citri, and Spiroplasma apis), the anaeroplasma group (which encompassed the anaeroplasmas and acholeplasmas), and a group known to contain only the isolated species Asteroleplasma anaerobium. In addition to these five mycoplasma groups, a sixth group of variously named gram-positive, walled organisms (which included lactobacilli, clostridia, and other organisms) was also included in the overall phylogenetic unit. In each of these six primary groups, subgroups were readily recognized and defined. Although the phylogenetic units identified by rRNA comparisons are difficult to recognize on the basis of mutually exclusive phenotypic characters alone, phenotypic justification can be given a posteriori for a number of them. PMID:2592342

  18. Phylogenetic classification of the mitochondrial carrier family of Saccharomyces cerevisiae.

    PubMed

    el Moualij, B; Duyckaerts, C; Lamotte-Brasseur, J; Sluse, F E

    1997-05-01

    The screening of the open reading frames identified in the whole yeast genome has allowed us to discover 34 proteins belonging to the mitochondrial carrier family. By phylogenetic study, they can be divided into 27 subfamilies including ADP/ATP, phosphate and citrate carriers, putative oxoglutarate and GDC carriers and 22 new subfamilies. Topology predictions using the 'positive inside rule' approach have shown that the yeast carriers are similarly oriented with both extremities exposed to the cytosol. In each subfamily, a strict conservation of the charged residues in the six transmembrane alpha-helices is observed, suggesting a functional role for these residues and the existence of 27 functionally distinct carriers. PMID:9178508

  19. Automatic classification and accurate size measurement of blank mask defects

    NASA Astrophysics Data System (ADS)

    Bhamidipati, Samir; Paninjath, Sankaranarayanan; Pereira, Mark; Buck, Peter

    2015-07-01

    A blank mask and its preparation stages, such as cleaning or resist coating, play an important role in the eventual yield obtained by using it. Blank mask defects' impact analysis directly depends on the amount of available information such as the number of defects observed, their accurate locations and sizes. Mask usability qualification at the start of the preparation process, is crudely based on number of defects. Similarly, defect information such as size is sought to estimate eventual defect printability on the wafer. Tracking of defect characteristics, specifically size and shape, across multiple stages, can further be indicative of process related information such as cleaning or coating process efficiencies. At the first level, inspection machines address the requirement of defect characterization by detecting and reporting relevant defect information. The analysis of this information though is still largely a manual process. With advancing technology nodes and reducing half-pitch sizes, a large number of defects are observed; and the detailed knowledge associated, make manual defect review process an arduous task, in addition to adding sensitivity to human errors. Cases where defect information reported by inspection machine is not sufficient, mask shops rely on other tools. Use of CDSEM tools is one such option. However, these additional steps translate into increased costs. Calibre NxDAT based MDPAutoClassify tool provides an automated software alternative to the manual defect review process. Working on defect images generated by inspection machines, the tool extracts and reports additional information such as defect location, useful for defect avoidance[4][5]; defect size, useful in estimating defect printability; and, defect nature e.g. particle, scratch, resist void, etc., useful for process monitoring. The tool makes use of smart and elaborate post-processing algorithms to achieve this. Their elaborateness is a consequence of the variety and complexity of defects encountered. The variety arises due to factors such as defect nature, size, shape and composition; and the optical phenomena occurring around the defect. This paper focuses on preliminary characterization results, in terms of classification and size estimation, obtained by Calibre MDPAutoClassify tool on a variety of mask blank defects. It primarily highlights the challenges faced in achieving the results with reference to the variety of defects observed on blank mask substrates and the underlying complexities which make accurate defect size measurement an important and challenging task.

  20. Accurate mobile malware detection and classification in the cloud.

    PubMed

    Wang, Xiaolei; Yang, Yuexiang; Zeng, Yingzhi

    2015-01-01

    As the dominator of the Smartphone operating system market, consequently android has attracted the attention of s malware authors and researcher alike. The number of types of android malware is increasing rapidly regardless of the considerable number of proposed malware analysis systems. In this paper, by taking advantages of low false-positive rate of misuse detection and the ability of anomaly detection to detect zero-day malware, we propose a novel hybrid detection system based on a new open-source framework CuckooDroid, which enables the use of Cuckoo Sandbox's features to analyze Android malware through dynamic and static analysis. Our proposed system mainly consists of two parts: anomaly detection engine performing abnormal apps detection through dynamic analysis; signature detection engine performing known malware detection and classification with the combination of static and dynamic analysis. We evaluate our system using 5560 malware samples and 6000 benign samples. Experiments show that our anomaly detection engine with dynamic analysis is capable of detecting zero-day malware with a low false negative rate (1.16 %) and acceptable false positive rate (1.30 %); it is worth noting that our signature detection engine with hybrid analysis can accurately classify malware samples with an average positive rate 98.94 %. Considering the intensive computing resources required by the static and dynamic analysis, our proposed detection system should be deployed off-device, such as in the Cloud. The app store markets and the ordinary users can access our detection system for malware detection through cloud service. PMID:26543718

  1. Phylogenetic classification of yeasts and related taxa within Pucciniomycotina

    PubMed Central

    Wang, Q.-M.; Yurkov, A.M.; Göker, M.; Lumbsch, H.T.; Leavitt, S.D.; Groenewald, M.; Theelen, B.; Liu, X.-Z.; Boekhout, T.; Bai, F.-Y.

    2016-01-01

    Most small genera containing yeast species in the Pucciniomycotina (Basidiomycota, Fungi) are monophyletic, whereas larger genera including Bensingtonia, Rhodosporidium, Rhodotorula, Sporidiobolus and Sporobolomyces are polyphyletic. With the implementation of the “One Fungus = One Name” nomenclatural principle these polyphyletic genera were revised. Nine genera, namely Bannoa, Cystobasidiopsis, Colacogloea, Kondoa, Erythrobasidium, Rhodotorula, Sporobolomyces, Sakaguchia and Sterigmatomyces, were emended to include anamorphic and teleomorphic species based on the results obtained by a multi-gene phylogenetic analysis, phylogenetic network analyses, branch length-based methods, as well as morphological, physiological and biochemical comparisons. A new class Spiculogloeomycetes is proposed to accommodate the order Spiculogloeales. The new families Buckleyzymaceae with Buckleyzyma gen. nov., Chrysozymaceae with Chrysozyma gen. nov., Microsporomycetaceae with Microsporomyces gen. nov., Ruineniaceae with Ruinenia gen. nov., Symmetrosporaceae with Symmetrospora gen. nov., Colacogloeaceae and Sakaguchiaceae are proposed. The new genera Bannozyma, Buckleyzyma, Fellozyma, Hamamotoa, Hasegawazyma, Jianyunia, Rhodosporidiobolus, Oberwinklerozyma, Phenoliferia, Pseudobensingtonia, Pseudohyphozyma, Sampaiozyma, Slooffia, Spencerozyma, Trigonosporomyces, Udeniozyma, Vonarxula, Yamadamyces and Yunzhangia are proposed to accommodate species segregated from the genera Bensingtonia, Rhodosporidium, Rhodotorula, Sporidiobolus and Sporobolomyces. Ballistosporomyces is emended and reintroduced to include three Sporobolomyces species of the sasicola clade. A total of 111 new combinations are proposed in this study. PMID:26951631

  2. Phylogenetic classification of yeasts and related taxa within Pucciniomycotina.

    PubMed

    Wang, Q-M; Yurkov, A M; Göker, M; Lumbsch, H T; Leavitt, S D; Groenewald, M; Theelen, B; Liu, X-Z; Boekhout, T; Bai, F-Y

    2015-06-01

    Most small genera containing yeast species in the Pucciniomycotina (Basidiomycota, Fungi) are monophyletic, whereas larger genera including Bensingtonia, Rhodosporidium, Rhodotorula, Sporidiobolus and Sporobolomyces are polyphyletic. With the implementation of the "One Fungus = One Name" nomenclatural principle these polyphyletic genera were revised. Nine genera, namely Bannoa, Cystobasidiopsis, Colacogloea, Kondoa, Erythrobasidium, Rhodotorula, Sporobolomyces, Sakaguchia and Sterigmatomyces, were emended to include anamorphic and teleomorphic species based on the results obtained by a multi-gene phylogenetic analysis, phylogenetic network analyses, branch length-based methods, as well as morphological, physiological and biochemical comparisons. A new class Spiculogloeomycetes is proposed to accommodate the order Spiculogloeales. The new families Buckleyzymaceae with Buckleyzyma gen. nov., Chrysozymaceae with Chrysozyma gen. nov., Microsporomycetaceae with Microsporomyces gen. nov., Ruineniaceae with Ruinenia gen. nov., Symmetrosporaceae with Symmetrospora gen. nov., Colacogloeaceae and Sakaguchiaceae are proposed. The new genera Bannozyma, Buckleyzyma, Fellozyma, Hamamotoa, Hasegawazyma, Jianyunia, Rhodosporidiobolus, Oberwinklerozyma, Phenoliferia, Pseudobensingtonia, Pseudohyphozyma, Sampaiozyma, Slooffia, Spencerozyma, Trigonosporomyces, Udeniozyma, Vonarxula, Yamadamyces and Yunzhangia are proposed to accommodate species segregated from the genera Bensingtonia, Rhodosporidium, Rhodotorula, Sporidiobolus and Sporobolomyces. Ballistosporomyces is emended and reintroduced to include three Sporobolomyces species of the sasicola clade. A total of 111 new combinations are proposed in this study. PMID:26951631

  3. Phylogenetic classification of Aureobasidium pullulans strains for production of pullulan and xylanase

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This study tests the hypothesis that phylogenetic classification can predict whether A. pullulans strains will produce useful levels of the commercial polysaccharide, pullulan, or the valuable enzyme, xylanase. To test this hypothesis, 19 strains of A. pullulans with previously described phenotypes...

  4. Molecular classification and phylogenetic relationships of selected edible Basidiomycetes species.

    PubMed

    Avin, Farhat Ahmadi; Bhassu, Subha; Shin, Tan Yee; Sabaratnam, Vikineswary

    2012-07-01

    Morphological identification of edible mushrooms can sometimes prove troublesome, because phenotypic variation in fungi can be affected by substrate and environmental factors. One of the most important problems for mushroom breeders is the lack of a systematic consensus tool to distinguish different species, which are sometimes morphologically identical. Basidiomycetes as one of the largest groups of edible mushrooms have become more important in recent times for their medicinal and nutritional properties. Partial rDNA sequences, including the Internal Transcribed Spacer I-5.8SrDNA-Internal Transcribed Spacer II, were used in this study for molecular identification and assessment of phylogenetic relationships between selected edible species of the Basidiomycetes. Phylogenetic trees showed five distinct clades; each clade belonging to a separate family group. The first clade included all the species belonging to the Pleurotaceae (Pleurotus spp.) family; similarly, the second, third, fourth, and fifth clades consist of species from the Agaricaceae (Agaricus sp.), Lyophllaceae (Hypsigygus sp.), Marasmiaceae (Lentinula edodes sp.) and Physalacriaceae (Flammulina velutipes sp.) families, respectively. Moreover, different species of each family were clearly placed in a distinct sub-cluster and a total of 13 species were taken for analysis. Species differentiation was re-confirmed by AMOVA analysis (among the populations: 99.67%; within: 0.33%), nucleotide divergence, haplotyping and P value. Polymorphism occurred throughout the ITS regions due to insertion-deletion and point mutations, and can be clearly differentiated within the families as well as genera. Moreover, this study proves that the sequence of the ITS region is a superior molecular DNA barcode for taxonomic identification of Basidiomycetes. PMID:22327649

  5. Accurate, Rapid Taxonomic Classification of Fungal Large-Subunit rRNA Genes

    PubMed Central

    Liu, Kuan-Liang; Porras-Alfaro, Andrea; Eichorst, Stephanie A.

    2012-01-01

    Taxonomic and phylogenetic fingerprinting based on sequence analysis of gene fragments from the large-subunit rRNA (LSU) gene or the internal transcribed spacer (ITS) region is becoming an integral part of fungal classification. The lack of an accurate and robust classification tool trained by a validated sequence database for taxonomic placement of fungal LSU genes is a severe limitation in taxonomic analysis of fungal isolates or large data sets obtained from environmental surveys. Using a hand-curated set of 8,506 fungal LSU gene fragments, we determined the performance characteristics of a naïve Bayesian classifier across multiple taxonomic levels and compared the classifier performance to that of a sequence similarity-based (BLASTN) approach. The naïve Bayesian classifier was computationally more rapid (>460-fold with our system) than the BLASTN approach, and it provided equal or superior classification accuracy. Classifier accuracies were compared using sequence fragments of 100 bp and 400 bp and two different PCR primer anchor points to mimic sequence read lengths commonly obtained using current high-throughput sequencing technologies. Accuracy was higher with 400-bp sequence reads than with 100-bp reads. It was also significantly affected by sequence location across the 1,400-bp test region. The highest accuracy was obtained across either the D1 or D2 variable region. The naïve Bayesian classifier provides an effective and rapid means to classify fungal LSU sequences from large environmental surveys. The training set and tool are publicly available through the Ribosomal Database Project (http://rdp.cme.msu.edu/classifier/classifier.jsp). PMID:22194300

  6. Accurate, rapid taxonomic classification of fungal large-subunit rRNA genes.

    PubMed

    Liu, Kuan-Liang; Porras-Alfaro, Andrea; Kuske, Cheryl R; Eichorst, Stephanie A; Xie, Gary

    2012-03-01

    Taxonomic and phylogenetic fingerprinting based on sequence analysis of gene fragments from the large-subunit rRNA (LSU) gene or the internal transcribed spacer (ITS) region is becoming an integral part of fungal classification. The lack of an accurate and robust classification tool trained by a validated sequence database for taxonomic placement of fungal LSU genes is a severe limitation in taxonomic analysis of fungal isolates or large data sets obtained from environmental surveys. Using a hand-curated set of 8,506 fungal LSU gene fragments, we determined the performance characteristics of a naïve Bayesian classifier across multiple taxonomic levels and compared the classifier performance to that of a sequence similarity-based (BLASTN) approach. The naïve Bayesian classifier was computationally more rapid (>460-fold with our system) than the BLASTN approach, and it provided equal or superior classification accuracy. Classifier accuracies were compared using sequence fragments of 100 bp and 400 bp and two different PCR primer anchor points to mimic sequence read lengths commonly obtained using current high-throughput sequencing technologies. Accuracy was higher with 400-bp sequence reads than with 100-bp reads. It was also significantly affected by sequence location across the 1,400-bp test region. The highest accuracy was obtained across either the D1 or D2 variable region. The naïve Bayesian classifier provides an effective and rapid means to classify fungal LSU sequences from large environmental surveys. The training set and tool are publicly available through the Ribosomal Database Project. PMID:22194300

  7. Short interspersed elements (SINEs) in plants: origin, classification, and use as phylogenetic markers.

    PubMed

    Deragon, Jean-Marc; Zhang, Xiaoyu

    2006-12-01

    Short interspersed elements (SINEs) are a class of dispersed mobile sequences that use RNA as an intermediate in an amplification process called retroposition. The presence-absence of a SINE at a given locus has been used as a meaningful classification criterion to evaluate phylogenetic relations among species. We review here recent developments in the characterisation of plant SINEs and their use as molecular makers to retrace phylogenetic relations among wild and cultivated Oryza and Brassica species. In Brassicaceae, further use of SINE markers is limited by our partial knowledge of endogenous SINE families (their origin and evolution histories) and by the absence of a clear classification. To solve this problem, phylogenetic relations among all known Brassicaceae SINEs were analyzed and a new classification, grouping SINEs in 15 different families, is proposed. The relative age and size of each Brassicaceae SINE family was evaluated and new phylogenetically supported subfamilies were described. We also present evidence suggesting that new potentially active SINEs recently emerged in Brassica oleracea from the shuffling of preexisting SINE portions. Finally, the comparative evolution history of SINE families present in Arabidopsis thaliana and Brassica oleracea revealed that SINEs were in general more active in the Brassica lineage. The importance of these new data for the use of Brassicaceae SINEs as molecular markers in future applications is discussed. PMID:17345676

  8. Phylogenetics.

    PubMed

    Sleator, Roy D

    2011-04-01

    The recent rapid expansion in the DNA and protein databases, arising from large-scale genomic and metagenomic sequence projects, has forced significant development in the field of phylogenetics: the study of the evolutionary relatedness of the planet's inhabitants. Advances in phylogenetic analysis have greatly transformed our view of the landscape of evolutionary biology, transcending the view of the tree of life that has shaped evolutionary theory since Darwinian times. Indeed, modern phylogenetic analysis no longer focuses on the restricted Darwinian-Mendelian model of vertical gene transfer, but must also consider the significant degree of lateral gene transfer, which connects and shapes almost all living things. Herein, I review the major tree-building methods, their strengths, weaknesses and future prospects. PMID:21249334

  9. The phagotrophic origin of eukaryotes and phylogenetic classification of Protozoa.

    PubMed

    Cavalier-Smith, T

    2002-03-01

    Eukaryotes and archaebacteria form the clade neomura and are sisters, as shown decisively by genes fragmented only in archaebacteria and by many sequence trees. This sisterhood refutes all theories that eukaryotes originated by merging an archaebacterium and an alpha-proteobacterium, which also fail to account for numerous features shared specifically by eukaryotes and actinobacteria. I revise the phagotrophy theory of eukaryote origins by arguing that the essentially autogenous origins of most eukaryotic cell properties (phagotrophy, endomembrane system including peroxisomes, cytoskeleton, nucleus, mitosis and sex) partially overlapped and were synergistic with the symbiogenetic origin of mitochondria from an alpha-proteobacterium. These radical innovations occurred in a derivative of the neomuran common ancestor, which itself had evolved immediately prior to the divergence of eukaryotes and archaebacteria by drastic alterations to its eubacterial ancestor, an actinobacterial posibacterium able to make sterols, by replacing murein peptidoglycan by N-linked glycoproteins and a multitude of other shared neomuran novelties. The conversion of the rigid neomuran wall into a flexible surface coat and the associated origin of phagotrophy were instrumental in the evolution of the endomembrane system, cytoskeleton, nuclear organization and division and sexual life-cycles. Cilia evolved not by symbiogenesis but by autogenous specialization of the cytoskeleton. I argue that the ancestral eukaryote was uniciliate with a single centriole (unikont) and a simple centrosomal cone of microtubules, as in the aerobic amoebozoan zooflagellate Phalansterium. I infer the root of the eukaryote tree at the divergence between opisthokonts (animals, Choanozoa, fungi) with a single posterior cilium and all other eukaryotes, designated 'anterokonts' because of the ancestral presence of an anterior cilium. Anterokonts comprise the Amoebozoa, which may be ancestrally unikont, and a vast ancestrally biciliate clade, named 'bikonts'. The apparently conflicting rRNA and protein trees can be reconciled with each other and this ultrastructural interpretation if long-branch distortions, some mechanistically explicable, are allowed for. Bikonts comprise two groups: corticoflagellates, with a younger anterior cilium, no centrosomal cone and ancestrally a semi-rigid cell cortex with a microtubular band on either side of the posterior mature centriole; and Rhizaria [a new infrakingdom comprising Cercozoa (now including Ascetosporea classis nov.), Retaria phylum nov., Heliozoa and Apusozoa phylum nov.], having a centrosomal cone or radiating microtubules and two microtubular roots and a soft surface, frequently with reticulopodia. Corticoflagellates comprise photokaryotes (Plantae and chromalveolates, both ancestrally with cortical alveoli) and Excavata (a new protozoan infrakingdom comprising Loukozoa, Discicristata and Archezoa, ancestrally with three microtubular roots). All basal eukaryotic radiations were of mitochondrial aerobes; hydrogenosomes evolved polyphyletically from mitochondria long afterwards, the persistence of their double envelope long after their genomes disappeared being a striking instance of membrane heredity. I discuss the relationship between the 13 protozoan phyla recognized here and revise higher protozoan classification by updating as subkingdoms Lankester's 1878 division of Protozoa into Corticata (Excavata, Alveolata; with prominent cortical microtubules and ancestrally localized cytostome--the Parabasalia probably secondarily internalized the cytoskeleton) and Gymnomyxa [infrakingdoms Sarcomastigota (Choanozoa, Amoebozoa) and Rhizaria; both ancestrally with a non-cortical cytoskeleton of radiating singlet microtubules and a relatively soft cell surface with diffused feeding]. As the eukaryote root almost certainly lies within Gymnomyxa, probably among the Sarcomastigota, Corticata are derived. Following the single symbiogenetic origin of chloroplasts in a corticoflagellate host with cortical alveoli, this ancestral plant radiated rapidly into glaucophytes, green plants and red algae. Secondary symbiogeneses subsequently transferred plastids laterally into different hosts, making yet more complex cell chimaeras--probably only thrice: from a red alga to the corticoflagellate ancestor of chromalveolates (Chromista plus Alveolata), from green algae to a secondarily uniciliate cercozoan to form chlorarachneans and independently to a biciliate excavate to yield photosynthetic euglenoids. Tertiary symbiogenesis involving eukaryotic algal symbionts replaced peridinin-containing plastids in two or three dinoflagellate lineages, but yielded no major novel groups. The origin and well-resolved primary bifurcation of eukaryotes probably occurred in the Cryogenian Period, about 850 million years ago, much more recently than suggested by unwarranted backward extrapolations of molecular 'clocks' or dubious interpretations as 'eukaryotic' of earlier large microbial fossils or still more ancient steranes. The origin of chloroplasts and the symbiogenetic incorporation of a red alga into a corticoflagellate to create chromalveolates may both have occurred in a big bang after the Varangerian snowball Earth melted about 580 million years ago, thereby stimulating the ensuing Cambrian explosion of animals and protists in the form of simultaneous, poorly resolved opisthokont and anterokont radiations. PMID:11931142

  10. Phylogeny and phylogenetic classification of the antbirds, ovenbirds, woodcreepers, and allies (Aves: Passeriformes: Infraorder Furnariides)

    USGS Publications Warehouse

    Moyle, R.G.; Chesser, R.T.; Brumfield, R.T.; Tello, J.G.; Marchese, D.J.; Cracraft, J.

    2009-01-01

    The infraorder Furnariides is a diverse group of suboscine passerine birds comprising a substantial component of the Neotropical avifauna. The included species encompass a broad array of morphologies and behaviours, making them appealing for evolutionary studies, but the size of the group (ca. 600 species) has limited well-sampled higher-level phylogenetic studies. Using DNA sequence data from the nuclear RAG-1 and RAG-2 exons, we undertook a phylogenetic analysis of the Furnariides sampling 124 (more than 88%) of the genera. Basal relationships among family-level taxa differed depending on phylogenetic method, but all topologies had little nodal support, mirroring the results from earlier studies in which discerning relationships at the base of the radiation was also difficult. In contrast, branch support for family-rank taxa and for many relationships within those clades was generally high. Our results support the Melanopareidae and Grallariidae as distinct from the Rhinocryptidae and Formicariidae, respectively. Within the Furnariides our data contradict some recent phylogenetic hypotheses and suggest that further study is needed to resolve these discrepancies. Of the few genera represented by multiple species, several were not monophyletic, indicating that additional systematic work remains within furnariine families and must include dense taxon sampling. We use this study as a basis for proposing a new phylogenetic classification for the group and in the process erect new family-group names for clades having high branch support across methods. ?? 2009 The Willi Hennig Society.

  11. Molecular phylogenetic perspectives for character classification and convergence: Framing some issues with nematode vulval appendages and telotylenchid tail termini

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Characters flagged as convergent based on newer molecular phylogenetic trees inform both practical identification and more esoteric classification. Nematode morphological characters such as lateral lines, bullae and laciniae are quite independent structures from those similarly named in other organi...

  12. Accurate arrhythmia classification using auto-associative neural network.

    PubMed

    Chakroborty, Sandipan

    2013-01-01

    Currently about one in eighteen of the American population suffer from cardiac Arrhythmias that lead to Coronary Heart Diseases and this rate is steadily increasing. An early monitoring and diagnosis of Arrhythmia based on Electrocardiogram signals can help in reducing mortality. This paper primarily focuses on the application of Auto Associative Neural Network as a new classification approach, which does not require feature extraction task. The weights of a trained Neural Network are stored as class representative models that results in high compression gain with respect to the size of training data. The evaluation of the proposed technique is tested on segmented ECG beats of four different classes of Arrhythmia excluding normal pattern. These beats have been extracted from the MIT/BIH Arrhythmia database and compared against the state-of-the art template matching technique such as Dynamic Time Warping. The proposed technique yields an average accuracy of more than 97% and a relative compression gain of above 90%. PMID:24110670

  13. Classification of Phylogenetic Profiles for Protein Function Prediction: An SVM Approach

    NASA Astrophysics Data System (ADS)

    Kotaru, Appala Raju; Joshi, Ramesh C.

    Predicting the function of an uncharacterized protein is a major challenge in post-genomic era due to problems complexity and scale. Having knowledge of protein function is a crucial link in the development of new drugs, better crops, and even the development of biochemicals such as biofuels. Recently numerous high-throughput experimental procedures have been invented to investigate the mechanisms leading to the accomplishment of a protein’s function and Phylogenetic profile is one of them. Phylogenetic profile is a way of representing a protein which encodes evolutionary history of proteins. In this paper we proposed a method for classification of phylogenetic profiles using supervised machine learning method, support vector machine classification along with radial basis function as kernel for identifying functionally linked proteins. We experimentally evaluated the performance of the classifier with the linear kernel, polynomial kernel and compared the results with the existing tree kernel. In our study we have used proteins of the budding yeast saccharomyces cerevisiae genome. We generated the phylogenetic profiles of 2465 yeast genes and for our study we used the functional annotations that are available in the MIPS database. Our experiments show that the performance of the radial basis kernel is similar to polynomial kernel is some functional classes together are better than linear, tree kernel and over all radial basis kernel outperformed the polynomial kernel, linear kernel and tree kernel. In analyzing these results we show that it will be feasible to make use of SVM classifier with radial basis function as kernel to predict the gene functionality using phylogenetic profiles.

  14. Phylogenetic Analysis and Classification of the Fungal bHLH Domain

    PubMed Central

    Sailsbery, Joshua K.; Atchley, William R.; Dean, Ralph A.

    2012-01-01

    The basic Helix-Loop-Helix (bHLH) domain is an essential highly conserved DNA-binding domain found in many transcription factors in all eukaryotic organisms. The bHLH domain has been well studied in the Animal and Plant Kingdoms but has yet to be characterized within Fungi. Herein, we obtained and evaluated the phylogenetic relationship of 490 fungal-specific bHLH containing proteins from 55 whole genome projects composed of 49 Ascomycota and 6 Basidiomycota organisms. We identified 12 major groupings within Fungi (F1–F12); identifying conserved motifs and functions specific to each group. Several classification models were built to distinguish the 12 groups and elucidate the most discerning sites in the domain. Performance testing on these models, for correct group classification, resulted in a maximum sensitivity and specificity of 98.5% and 99.8%, respectively. We identified 12 highly discerning sites and incorporated those into a set of rules (simplified model) to classify sequences into the correct group. Conservation of amino acid sites and phylogenetic analyses established that like plant bHLH proteins, fungal bHLH–containing proteins are most closely related to animal Group B. The models used in these analyses were incorporated into a software package, the source code for which is available at www.fungalgenomics.ncsu.edu. PMID:22114358

  15. Phylogenetic analysis and classification of the fungal bHLH domain.

    PubMed

    Sailsbery, Joshua K; Atchley, William R; Dean, Ralph A

    2012-05-01

    The basic Helix-Loop-Helix (bHLH) domain is an essential highly conserved DNA-binding domain found in many transcription factors in all eukaryotic organisms. The bHLH domain has been well studied in the Animal and Plant Kingdoms but has yet to be characterized within Fungi. Herein, we obtained and evaluated the phylogenetic relationship of 490 fungal-specific bHLH containing proteins from 55 whole genome projects composed of 49 Ascomycota and 6 Basidiomycota organisms. We identified 12 major groupings within Fungi (F1-F12); identifying conserved motifs and functions specific to each group. Several classification models were built to distinguish the 12 groups and elucidate the most discerning sites in the domain. Performance testing on these models, for correct group classification, resulted in a maximum sensitivity and specificity of 98.5% and 99.8%, respectively. We identified 12 highly discerning sites and incorporated those into a set of rules (simplified model) to classify sequences into the correct group. Conservation of amino acid sites and phylogenetic analyses established that like plant bHLH proteins, fungal bHLH-containing proteins are most closely related to animal Group B. The models used in these analyses were incorporated into a software package, the source code for which is available at www.fungalgenomics.ncsu.edu. PMID:22114358

  16. Phylogenetic systematics and a revised generic classification of anthidiine bees (Hymenoptera: Megachilidae).

    PubMed

    Litman, Jessica R; Griswold, Terry; Danforth, Bryan N

    2016-07-01

    The bee tribe Anthidiini (Hymenoptera: Megachilidae) is a large, cosmopolitan group of solitary bees that exhibit intriguing nesting behavior. We present the first molecular-based phylogenetic analysis of relationships within Anthidiini using model-based methods and a large, multi-locus dataset (five nuclear genes, 5081 base pairs), as well as a combined analysis using our molecular dataset in conjunction with a previously published morphological matrix. We discuss the evolution of nesting behavior in Anthidiini and the relationship between nesting material and female mandibular morphology. Following an examination of the morphological characters historically used to recognize anthidiine genera, we recommend the use of a molecular-based phylogenetic backbone to define taxonomic groups prior to the assignment of diagnostic morphological characters for these groups. Finally, our results reveal the paraphyly of numerous genera and have significant consequences for anthidiine classification. In order to promote a classification system based on stable, monophyletic clades, we hereby make the following changes to Michener's (2007) classification: The subgenera Afranthidium (Zosteranthidium) Michener and Griswold, 1994, Afranthidium (Branthidium) Pasteels, 1969 and Afranthidium (Immanthidium) Pasteels, 1969 are moved into the genus Pseudoanthidium, thus forming the new combinations Pseudoanthidium (Zosteranthidium), Pseudoanthidium (Branthidium), and Pseudoanthidium (Immanthidium). The genus Neanthidium Pasteels, 1969 is also moved into the genus Pseudoanthidium, thus forming the new combination Pseudoanthidium (Neanthidium). Based on morphological characters shared with our new definition of the genus Pseudoanthidium, the subgenus Afranthidium (Mesanthidiellum) Pasteels, 1969 and the genus Gnathanthidium Pasteels, 1969 are also moved into the genus Pseudoanthidium, thus forming the new combinations Pseudoanthidium (Mesanthidiellum) and Pseudoanthidium (Gnathanthidium). PMID:26988413

  17. Rapid phylogenetic and functional classification of short genomic fragments with signature peptides

    PubMed Central

    2012-01-01

    Background Classification is difficult for shotgun metagenomics data from environments such as soils, where the diversity of sequences is high and where reference sequences from close relatives may not exist. Approaches based on sequence-similarity scores must deal with the confounding effects that inheritance and functional pressures exert on the relation between scores and phylogenetic distance, while approaches based on sequence alignment and tree-building are typically limited to a small fraction of gene families. We describe an approach based on finding one or more exact matches between a read and a precomputed set of peptide 10-mers. Results At even the largest phylogenetic distances, thousands of 10-mer peptide exact matches can be found between pairs of bacterial genomes. Genes that share one or more peptide 10-mers typically have high reciprocal BLAST scores. Among a set of 403 representative bacterial genomes, some 20 million 10-mer peptides were found to be shared. We assign each of these peptides as a signature of a particular node in a phylogenetic reference tree based on the RNA polymerase genes. We classify the phylogeny of a genomic fragment (e.g., read) at the most specific node on the reference tree that is consistent with the phylogeny of observed signature peptides it contains. Using both synthetic data from four newly-sequenced soil-bacterium genomes and ten real soil metagenomics data sets, we demonstrate a sensitivity and specificity comparable to that of the MEGAN metagenomics analysis package using BLASTX against the NR database. Phylogenetic and functional similarity metrics applied to real metagenomics data indicates a signal-to-noise ratio of approximately 400 for distinguishing among environments. Our method assigns ~6.6 Gbp/hr on a single CPU, compared with 25 kbp/hr for methods based on BLASTX against the NR database. Conclusions Classification by exact matching against a precomputed list of signature peptides provides comparable results to existing techniques for reads longer than about 300 bp and does not degrade severely with shorter reads. Orders of magnitude faster than existing methods, the approach is suitable now for inclusion in analysis pipelines and appears to be extensible in several different directions. PMID:22925230

  18. Cloning, in Vitro expression, and novel phylogenetic classification of a channel catfish estrogen receptor

    USGS Publications Warehouse

    Xia, Z.; Patino, R.; Gale, W.L.; Maule, A.G.; Densmore, L.D.

    1999-01-01

    We obtained two channel catfish estrogen receptor (ccER) cDNA from liver of female fish using RT–PCR. The two fragments were identical in sequence except that the smaller one had an out-of-frame deletion in the E domain, suggesting the existence of ccER splice variants. The larger fragment was used to screen a cDNA library from liver of a prepubescent female. A cDNA was obtained that encoded a 581-amino-acid ER with a deduced molecular weight of 63.8 kDa. Extracts of COS-7 cells transfected with ccER cDNA bound estrogen with high affinity (Kd = 4.7 nM) and specificity. Maximum parsimony and Neighbor Joining analyses were used to generate a phylogenetic classification of ccER on the basis of 18 full-length ER sequences. The tree suggested the existence of two major ER branches. One branch contained two clearly divergent clades which included all piscine ER (except Japanese eel ER) and all tetrapod ERα, respectively. The second major branch contained the eel ER and the mammalian ERβ. The high degree of divergence between the eel ER and mammalian ERβ suggested that they also represent distinct piscine and tetrapod ER. These data suggest that ERα and ERβ are present throughout vertebrates and that these two major ER types evolved by duplication of an ancestral ER gene. Sequence alignments with other members of the nuclear hormone receptor superfamily indicated the presence of 8 amino acids in the E domain that align exclusively among ER. Four of these amino acids have not received prior research attention and their function is unknown. The novel finding of putative ER splice variants in a nonmammalian vertebrate and the novel phylogenetic classification of ER offer new perspectives in understanding the diversification and function of ER.

  19. Accurate cortical tissue classification on MRI by modeling cortical folding patterns.

    PubMed

    Kim, Hosung; Caldairou, Benoit; Hwang, Ji-Wook; Mansi, Tommaso; Hong, Seok-Jun; Bernasconi, Neda; Bernasconi, Andrea

    2015-09-01

    Accurate tissue classification is a crucial prerequisite to MRI morphometry. Automated methods based on intensity histograms constructed from the entire volume are challenged by regional intensity variations due to local radiofrequency artifacts as well as disparities in tissue composition, laminar architecture and folding patterns. Current work proposes a novel anatomy-driven method in which parcels conforming cortical folding were regionally extracted from the brain. Each parcel is subsequently classified using nonparametric mean shift clustering. Evaluation was carried out on manually labeled images from two datasets acquired at 3.0 Tesla (n = 15) and 1.5 Tesla (n = 20). In both datasets, we observed high tissue classification accuracy of the proposed method (Dice index >97.6% at 3.0 Tesla, and >89.2% at 1.5 Tesla). Moreover, our method consistently outperformed state-of-the-art classification routines available in SPM8 and FSL-FAST, as well as a recently proposed local classifier that partitions the brain into cubes. Contour-based analyses localized more accurate white matter-gray matter (GM) interface classification of the proposed framework compared to the other algorithms, particularly in central and occipital cortices that generally display bright GM due to their highly degree of myelination. Excellent accuracy was maintained, even in the absence of correction for intensity inhomogeneity. The presented anatomy-driven local classification algorithm may significantly improve cortical boundary definition, with possible benefits for morphometric inference and biomarker discovery. PMID:26037453

  20. Accurate crop classification using hierarchical genetic fuzzy rule-based systems

    NASA Astrophysics Data System (ADS)

    Topaloglou, Charalampos A.; Mylonas, Stelios K.; Stavrakoudis, Dimitris G.; Mastorocostas, Paris A.; Theocharis, John B.

    2014-10-01

    This paper investigates the effectiveness of an advanced classification system for accurate crop classification using very high resolution (VHR) satellite imagery. Specifically, a recently proposed genetic fuzzy rule-based classification system (GFRBCS) is employed, namely, the Hierarchical Rule-based Linguistic Classifier (HiRLiC). HiRLiC's model comprises a small set of simple IF-THEN fuzzy rules, easily interpretable by humans. One of its most important attributes is that its learning algorithm requires minimum user interaction, since the most important learning parameters affecting the classification accuracy are determined by the learning algorithm automatically. HiRLiC is applied in a challenging crop classification task, using a SPOT5 satellite image over an intensively cultivated area in a lake-wetland ecosystem in northern Greece. A rich set of higher-order spectral and textural features is derived from the initial bands of the (pan-sharpened) image, resulting in an input space comprising 119 features. The experimental analysis proves that HiRLiC compares favorably to other interpretable classifiers of the literature, both in terms of structural complexity and classification accuracy. Its testing accuracy was very close to that obtained by complex state-of-the-art classification systems, such as the support vector machines (SVM) and random forest (RF) classifiers. Nevertheless, visual inspection of the derived classification maps shows that HiRLiC is characterized by higher generalization properties, providing more homogeneous classifications that the competitors. Moreover, the runtime requirements for producing the thematic map was orders of magnitude lower than the respective for the competitors.

  1. Towards a Formal Genealogical Classification of the Lezgian Languages (North Caucasus): Testing Various Phylogenetic Methods on Lexical Data

    PubMed Central

    Kassian, Alexei

    2015-01-01

    A lexicostatistical classification is proposed for 20 languages and dialects of the Lezgian group of the North Caucasian family, based on meticulously compiled 110-item wordlists, published as part of the Global Lexicostatistical Database project. The lexical data have been subsequently analyzed with the aid of the principal phylogenetic methods, both distance-based and character-based: Starling neighbor joining (StarlingNJ), Neighbor joining (NJ), Unweighted pair group method with arithmetic mean (UPGMA), Bayesian Markov chain Monte Carlo (MCMC), Unweighted maximum parsimony (UMP). Cognation indexes within the input matrix were marked by two different algorithms: traditional etymological approach and phonetic similarity, i.e., the automatic method of consonant classes (Levenshtein distances). Due to certain reasons (first of all, high lexicographic quality of the wordlists and a consensus about the Lezgian phylogeny among Caucasologists), the Lezgian database is a perfect testing area for appraisal of phylogenetic methods. For the etymology-based input matrix, all the phylogenetic methods, with the possible exception of UMP, have yielded trees that are sufficiently compatible with each other to generate a consensus phylogenetic tree of the Lezgian lects. The obtained consensus tree agrees with the traditional expert classification as well as some of the previously proposed formal classifications of this linguistic group. Contrary to theoretical expectations, the UMP method has suggested the least plausible tree of all. In the case of the phonetic similarity-based input matrix, the distance-based methods (StarlingNJ, NJ, UPGMA) have produced the trees that are rather close to the consensus etymology-based tree and the traditional expert classification, whereas the character-based methods (Bayesian MCMC, UMP) have yielded less likely topologies. PMID:25719456

  2. Towards a formal genealogical classification of the Lezgian languages (North Caucasus): testing various phylogenetic methods on lexical data.

    PubMed

    Kassian, Alexei

    2015-01-01

    A lexicostatistical classification is proposed for 20 languages and dialects of the Lezgian group of the North Caucasian family, based on meticulously compiled 110-item wordlists, published as part of the Global Lexicostatistical Database project. The lexical data have been subsequently analyzed with the aid of the principal phylogenetic methods, both distance-based and character-based: Starling neighbor joining (StarlingNJ), Neighbor joining (NJ), Unweighted pair group method with arithmetic mean (UPGMA), Bayesian Markov chain Monte Carlo (MCMC), Unweighted maximum parsimony (UMP). Cognation indexes within the input matrix were marked by two different algorithms: traditional etymological approach and phonetic similarity, i.e., the automatic method of consonant classes (Levenshtein distances). Due to certain reasons (first of all, high lexicographic quality of the wordlists and a consensus about the Lezgian phylogeny among Caucasologists), the Lezgian database is a perfect testing area for appraisal of phylogenetic methods. For the etymology-based input matrix, all the phylogenetic methods, with the possible exception of UMP, have yielded trees that are sufficiently compatible with each other to generate a consensus phylogenetic tree of the Lezgian lects. The obtained consensus tree agrees with the traditional expert classification as well as some of the previously proposed formal classifications of this linguistic group. Contrary to theoretical expectations, the UMP method has suggested the least plausible tree of all. In the case of the phonetic similarity-based input matrix, the distance-based methods (StarlingNJ, NJ, UPGMA) have produced the trees that are rather close to the consensus etymology-based tree and the traditional expert classification, whereas the character-based methods (Bayesian MCMC, UMP) have yielded less likely topologies. PMID:25719456

  3. DNA barcode analysis: a comparison of phylogenetic and statistical classification methods

    PubMed Central

    Austerlitz, Frederic; David, Olivier; Schaeffer, Brigitte; Bleakley, Kevin; Olteanu, Madalina; Leblois, Raphael; Veuille, Michel; Laredo, Catherine

    2009-01-01

    Background DNA barcoding aims to assign individuals to given species according to their sequence at a small locus, generally part of the CO1 mitochondrial gene. Amongst other issues, this raises the question of how to deal with within-species genetic variability and potential transpecific polymorphism. In this context, we examine several assignation methods belonging to two main categories: (i) phylogenetic methods (neighbour-joining and PhyML) that attempt to account for the genealogical framework of DNA evolution and (ii) supervised classification methods (k-nearest neighbour, CART, random forest and kernel methods). These methods range from basic to elaborate. We investigated the ability of each method to correctly classify query sequences drawn from samples of related species using both simulated and real data. Simulated data sets were generated using coalescent simulations in which we varied the genealogical history, mutation parameter, sample size and number of species. Results No method was found to be the best in all cases. The simplest method of all, "one nearest neighbour", was found to be the most reliable with respect to changes in the parameters of the data sets. The parameter most influencing the performance of the various methods was molecular diversity of the data. Addition of genetically independent loci - nuclear genes - improved the predictive performance of most methods. Conclusion The study implies that taxonomists can influence the quality of their analyses either by choosing a method best-adapted to the configuration of their sample, or, given a certain method, increasing the sample size or altering the amount of molecular diversity. This can be achieved either by sequencing more mtDNA or by sequencing additional nuclear genes. In the latter case, they may also have to modify their data analysis method. PMID:19900297

  4. Molecular and morphological data supporting phylogenetic reconstruction of the genus Goniothalamus (Annonaceae), including a reassessment of previous infrageneric classifications

    PubMed Central

    Tang, Chin Cheung; Thomas, Daniel C.; Saunders, Richard M.K.

    2015-01-01

    Data is presented in support of a phylogenetic reconstruction of the species-rich early-divergent angiosperm genus Goniothalamus (Annonaceae) (Tang et al., Mol. Phylogenetic Evol., 2015) [1], inferred using chloroplast DNA (cpDNA) sequences. The data includes a list of primers for amplification and sequencing for nine cpDNA regions: atpB-rbcL, matK, ndhF, psbA-trnH, psbM-trnD, rbcL, trnL-F, trnS-G, and ycf1, the voucher information and molecular data (GenBank accession numbers) of 67 ingroup Goniothalamus accessions and 14 outgroup accessions selected from across the tribe Annoneae, and aligned data matrices for each gene region. We also present our Bayesian phylogenetic reconstructions for Goniothalamus, with information on previous infrageneric classifications superimposed to enable an evaluation of monophyly, together with a taxon-character data matrix (with 15 morphological characters scored for 66 Goniothalamus species and seven other species from the tribe Annoneae that are shown to be phylogenetically correlated). PMID:26286044

  5. A classification of the Chloridoideae (Poaceae) based on multi-gene phylogenetic trees.

    PubMed

    Peterson, Paul M; Romaschenko, Konstantin; Johnson, Gabriel

    2010-05-01

    We conducted a molecular phylogenetic study of the subfamily Chloridoideae using six plastid DNA sequences (ndhA intron, ndhF, rps16-trnK, rps16 intron, rps3, and rpl32-trnL) and a single nuclear ITS DNA sequence. Our large original data set includes 246 species (17.3%) representing 95 genera (66%) of the grasses currently placed in the Chloridoideae. The maximum likelihood and Bayesian analysis of DNA sequences provides strong support for the monophyly of the Chloridoideae; followed by, in order of divergence: a Triraphideae clade with Neyraudia sister to Triraphis; an Eragrostideae clade with the Cotteinae (includes Cottea and Enneapogon) sister to the Uniolinae (includes Entoplocamia, Tetrachne, and Uniola), and a terminal Eragrostidinae clade of Ectrosia, Harpachne, and Psammagrostis embedded in a polyphyletic Eragrostis; a Zoysieae clade with Urochondra sister to a Zoysiinae (Zoysia) clade, and a terminal Sporobolinae clade that includes Spartina, Calamovilfa, Pogoneura, and Crypsis embedded in a polyphyletic Sporobolus; and a very large terminal Cynodonteae clade that includes 13 monophyletic subtribes. The Cynodonteae includes, in alphabetical order: Aeluropodinae (Aeluropus); Boutelouinae (Bouteloua); Eleusininae (includes Apochiton, Astrebla with Schoenefeldia embedded, Austrochloris, Brachyachne, Chloris, Cynodon with Brachyachne embedded in part, Eleusine, Enteropogon with Eustachys embedded in part, Eustachys, Chrysochloa, Coelachyrum, Leptochloa with Dinebra embedded, Lepturus, Lintonia, Microchloa, Saugetia, Schoenefeldia, Sclerodactylon, Tetrapogon, and Trichloris); Hilariinae (Hilaria); Monanthochloinae (includes Distichlis, Monanthochloe, and Reederochloa); Muhlenbergiinae (Muhlenbergia with Aegopogon, Bealia, Blepharoneuron, Chaboissaea, Lycurus, Pereilema, Redfieldia, Schaffnerella, and Schedonnardus all embedded); Orcuttiinae (includes Orcuttia and Tuctoria); Pappophorinae (includes Neesiochloa and Pappophorum); Scleropogoninae (includes Blepharidachne, Dasyochloa, Erioneuron, Munroa, Scleropogon, and Swallenia); Traginae (Tragus with Monelytrum, Polevansia, and Willkommia all embedded); Tridentinae (includes Gouinia, Tridens, Triplasis, and Vaseyochloa); Triodiinae (Triodia); and the Tripogoninae (Melanocenchris and Tripogon with Eragrostiella embedded). In our study the Cynodonteae still include 19 genera and the Zoysieae include a single genus that are not yet placed in a subtribe. The tribe Triraphideae and the subtribe Aeluropodinae are newly treated at that rank. We propose a new tribal and subtribal classification for all known genera in the Chloridoideae. The subfamily might have originated in Africa and/or Asia since the basal lineage, the Triraphideae, includes species with African and Asian distribution. PMID:20096795

  6. GPD: a graph pattern diffusion kernel for accurate graph classification with applications in cheminformatics.

    PubMed

    Smalter, Aaron; Huan, Jun Luke; Jia, Yi; Lushington, Gerald

    2010-01-01

    Graph data mining is an active research area. Graphs are general modeling tools to organize information from heterogeneous sources and have been applied in many scientific, engineering, and business fields. With the fast accumulation of graph data, building highly accurate predictive models for graph data emerges as a new challenge that has not been fully explored in the data mining community. In this paper, we demonstrate a novel technique called graph pattern diffusion (GPD) kernel. Our idea is to leverage existing frequent pattern discovery methods and to explore the application of kernel classifier (e.g., support vector machine) in building highly accurate graph classification. In our method, we first identify all frequent patterns from a graph database. We then map subgraphs to graphs in the graph database and use a process we call "pattern diffusion" to label nodes in the graphs. Finally, we designed a graph alignment algorithm to compute the inner product of two graphs. We have tested our algorithm using a number of chemical structure data. The experimental results demonstrate that our method is significantly better than competing methods such as those kernel functions based on paths, cycles, and subgraphs. PMID:20431140

  7. GPD: A Graph Pattern Diffusion Kernel for Accurate Graph Classification with Applications in Cheminformatics

    PubMed Central

    Smalter, Aaron; Huan, Jun (Luke); Jia, Yi; Lushington, Gerald

    2011-01-01

    Graph data mining is an active research area. Graphs are general modeling tools to organize information from heterogeneous sources and have been applied in many scientific, engineering, and business fields. With the fast accumulation of graph data, building highly accurate predictive models for graph data emerges as a new challenge that has not been fully explored in the data mining community. In this paper, we demonstrate a novel technique called graph pattern diffusion (GPD) kernel. Our idea is to leverage existing frequent pattern discovery methods and to explore the application of kernel classifier (e.g., support vector machine) in building highly accurate graph classification. In our method, we first identify all frequent patterns from a graph database. We then map subgraphs to graphs in the graph database and use a process we call “pattern diffusion” to label nodes in the graphs. Finally, we designed a graph alignment algorithm to compute the inner product of two graphs. We have tested our algorithm using a number of chemical structure data. The experimental results demonstrate that our method is significantly better than competing methods such as those kernel functions based on paths, cycles, and subgraphs. PMID:20431140

  8. Phylogenetic systematics and a revised generic classification of anthidiine bees (Hymenoptera: Megachile)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The bee tribe Anthidiini (Hymenoptera: Megachilidae) is a large, cosmopolitan group of solitary bees that exhibit intriguing nesting behavior. We present the first molecular-based phylogenetic analysis of relationships within Anthidiini using model based methods and a large, multi-locus dataset (fiv...

  9. Photometric brown-dwarf classification. I. A method to identify and accurately classify large samples of brown dwarfs without spectroscopy

    NASA Astrophysics Data System (ADS)

    Skrzypek, N.; Warren, S. J.; Faherty, J. K.; Mortlock, D. J.; Burgasser, A. J.; Hewett, P. C.

    2015-02-01

    Aims: We present a method, named photo-type, to identify and accurately classify L and T dwarfs onto the standard spectral classification system using photometry alone. This enables the creation of large and deep homogeneous samples of these objects efficiently, without the need for spectroscopy. Methods: We created a catalogue of point sources with photometry in 8 bands, ranging from 0.75 to 4.6 μm, selected from an area of 3344 deg2, by combining SDSS, UKIDSS LAS, and WISE data. Sources with 13.0 0.8, were then classified by comparison against template colours of quasars, stars, and brown dwarfs. The L and T templates, spectral types L0 to T8, were created by identifying previously known sources with spectroscopic classifications, and fitting polynomial relations between colour and spectral type. Results: Of the 192 known L and T dwarfs with reliable photometry in the surveyed area and magnitude range, 189 are recovered by our selection and classification method. We have quantified the accuracy of the classification method both externally, with spectroscopy, and internally, by creating synthetic catalogues and accounting for the uncertainties. We find that, brighter than J = 17.5, photo-type classifications are accurate to one spectral sub-type, and are therefore competitive with spectroscopic classifications. The resultant catalogue of 1157 L and T dwarfs will be presented in a companion paper.

  10. Concomitant Medical Conditions and Therapies Preclude Accurate Classification of Children With Severe or Severe Complicated Clostridium difficile Infection.

    PubMed

    Kociolek, Larry K; Patel, Sameer J; Shulman, Stanford T; Gerding, Dale N

    2015-12-01

    Severe and severe complicated Clostridium difficile infections (SCDI/SCCDI) were retrospectively assessed in a pediatric cohort. Underlying medical conditions and concomitant medical therapy preclude accurate classification of children with SCDI/SCCDI, using current CDI severity definitions. Revised CDI definitions in children should focus on more objective, age-appropriate, and CDI-specific markers of severity. PMID:26582882

  11. Archaeal-eubacterial mergers in the origin of Eukarya: phylogenetic classification of life

    NASA Technical Reports Server (NTRS)

    Margulis, L.

    1996-01-01

    A symbiosis-based phylogeny leads to a consistent, useful classification system for all life. "Kingdoms" and "Domains" are replaced by biological names for the most inclusive taxa: Prokarya (bacteria) and Eukarya (symbiosis-derived nucleated organisms). The earliest Eukarya, anaerobic mastigotes, hypothetically originated from permanent whole-cell fusion between members of Archaea (e.g., Thermoplasma-like organisms) and of Eubacteria (e.g., Spirochaeta-like organisms). Molecular biology, life-history, and fossil record evidence support the reunification of bacteria as Prokarya while subdividing Eukarya into uniquely defined subtaxa: Protoctista, Animalia, Fungi, and Plantae.

  12. Archaeal-eubacterial mergers in the origin of Eukarya: phylogenetic classification of life.

    PubMed

    Margulis, L

    1996-02-01

    A symbiosis-based phylogeny leads to a consistent, useful classification system for all life. "Kingdoms" and "Domains" are replaced by biological names for the most inclusive taxa: Prokarya (bacteria) and Eukarya (symbiosis-derived nucleated organisms). The earliest Eukarya, anaerobic mastigotes, hypothetically originated from permanent whole-cell fusion between members of Archaea (e.g., Thermoplasma-like organisms) and of Eubacteria (e.g., Spirochaeta-like organisms). Molecular biology, life-history, and fossil record evidence support the reunification of bacteria as Prokarya while subdividing Eukarya into uniquely defined subtaxa: Protoctista, Animalia, Fungi, and Plantae. PMID:8577716

  13. Archaeal-eubacterial mergers in the origin of Eukarya: phylogenetic classification of life.

    PubMed Central

    Margulis, L

    1996-01-01

    A symbiosis-based phylogeny leads to a consistent, useful classification system for all life. "Kingdoms" and "Domains" are replaced by biological names for the most inclusive taxa: Prokarya (bacteria) and Eukarya (symbiosis-derived nucleated organisms). The earliest Eukarya, anaerobic mastigotes, hypothetically originated from permanent whole-cell fusion between members of Archaea (e.g., Thermoplasma-like organisms) and of Eubacteria (e.g., Spirochaeta-like organisms). Molecular biology, life-history, and fossil record evidence support the reunification of bacteria as Prokarya while subdividing Eukarya into uniquely defined subtaxa: Protoctista, Animalia, Fungi, and Plantae. Images Fig. 1 PMID:8577716

  14. Assignment of Calibration Information to Deeper Phylogenetic Nodes is More Effective in Obtaining Precise and Accurate Divergence Time Estimates.

    PubMed

    Mello, Beatriz; Schrago, Carlos G

    2014-01-01

    Divergence time estimation has become an essential tool for understanding macroevolutionary events. Molecular dating aims to obtain reliable inferences, which, within a statistical framework, means jointly increasing the accuracy and precision of estimates. Bayesian dating methods exhibit the propriety of a linear relationship between uncertainty and estimated divergence dates. This relationship occurs even if the number of sites approaches infinity and places a limit on the maximum precision of node ages. However, how the placement of calibration information may affect the precision of divergence time estimates remains an open question. In this study, relying on simulated and empirical data, we investigated how the location of calibration within a phylogeny affects the accuracy and precision of time estimates. We found that calibration priors set at median and deep phylogenetic nodes were associated with higher precision values compared to analyses involving calibration at the shallowest node. The results were independent of the tree symmetry. An empirical mammalian dataset produced results that were consistent with those generated by the simulated sequences. Assigning time information to the deeper nodes of a tree is crucial to guarantee the accuracy and precision of divergence times. This finding highlights the importance of the appropriate choice of outgroups in molecular dating. PMID:24855333

  15. TWARIT: an extremely rapid and efficient approach for phylogenetic classification of metagenomic sequences.

    PubMed

    Reddy, Rachamalla Maheedhar; Mohammed, Monzoorul Haque; Mande, Sharmila S

    2012-09-01

    Phylogenetic assignment of individual sequence reads to their respective taxa, referred to as 'taxonomic binning', constitutes a key step of metagenomic analysis. Existing binning methods have limitations either with respect to time or accuracy/specificity of binning. Given these limitations, development of a method that can bin vast amounts of metagenomic sequence data in a rapid, efficient and computationally inexpensive manner can profoundly influence metagenomic analysis in computational resource poor settings. We introduce TWARIT, a hybrid binning algorithm, that employs a combination of short-read alignment and composition-based signature sorting approaches to achieve rapid binning rates without compromising on binning accuracy and specificity. TWARIT is validated with simulated and real-world metagenomes and the results demonstrate significantly lower overall binning times compared to that of existing methods. Furthermore, the binning accuracy and specificity of TWARIT are observed to be comparable/superior to them. A web server implementing TWARIT algorithm is available at http://metagenomics.atc.tcs.com/Twarit/ PMID:22710135

  16. Deceptive Desmas: Molecular Phylogenetics Suggests a New Classification and Uncovers Convergent Evolution of Lithistid Demosponges

    PubMed Central

    Schuster, Astrid; Erpenbeck, Dirk; Pisera, Andrzej; Hooper, John; Bryce, Monika; Fromont, Jane; Wörheide, Gert

    2015-01-01

    Reconciling the fossil record with molecular phylogenies to enhance the understanding of animal evolution is a challenging task, especially for taxa with a mostly poor fossil record, such as sponges (Porifera). ‘Lithistida’, a polyphyletic group of recent and fossil sponges, are an exception as they provide the richest fossil record among demosponges. Lithistids, currently encompassing 13 families, 41 genera and >300 recent species, are defined by the common possession of peculiar siliceous spicules (desmas) that characteristically form rigid articulated skeletons. Their phylogenetic relationships are to a large extent unresolved and there has been no (taxonomically) comprehensive analysis to formally reallocate lithistid taxa to their closest relatives. This study, based on the most comprehensive molecular and morphological investigation of ‘lithistid’ demosponges to date, corroborates some previous weakly-supported hypotheses, and provides novel insights into the evolutionary relationships of the previous ‘order Lithistida’. Based on molecular data (partial mtDNA CO1 and 28S rDNA sequences), we show that 8 out of 13 ‘Lithistida’ families belong to the order Astrophorida, whereas Scleritodermidae and Siphonidiidae form a separate monophyletic clade within Tetractinellida. Most lithistid astrophorids are dispersed between different clades of the Astrophorida and we propose to formally reallocate them, respectively. Corallistidae, Theonellidae and Phymatellidae are monophyletic, whereas the families Pleromidae and Scleritodermidae are polyphyletic. Family Desmanthidae is polyphyletic and groups within Halichondriidae – we formally propose a reallocation. The sister group relationship of the family Vetulinidae to Spongillida is confirmed and we propose here for the first time to include Vetulina into a new Order Sphaerocladina. Megascleres and microscleres possibly evolved and/or were lost several times independently in different ‘lithistid’ taxa, and microscleres might at least be four times more likely lost than megascleres. Desma spicules occasionally may have undergone secondary losses too. Our study provides a framework for further detailed investigations of this important demosponge group. PMID:25565279

  17. Accurate multi-source forest species mapping using the multiple spectral-spatial classification approach

    NASA Astrophysics Data System (ADS)

    Stavrakoudis, Dimitris; Gitas, Ioannis; Karydas, Christos; Kolokoussis, Polychronis; Karathanassi, Vassilia

    2015-10-01

    This paper proposes an efficient methodology for combining multiple remotely sensed imagery, in order to increase the classification accuracy in complex forest species mapping tasks. The proposed scheme follows a decision fusion approach, whereby each image is first classified separately by means of a pixel-wise Fuzzy-Output Support Vector Machine (FO-SVM) classifier. Subsequently, the multiple results are fused according to the so-called multiple spectral- spatial classifier using the minimum spanning forest (MSSC-MSF) approach, which constitutes an effective post-regularization procedure for enhancing the result of a single pixel-based classification. For this purpose, the original MSSC-MSF has been extended in order to handle multiple classifications. In particular, the fuzzy outputs of the pixel-based classifiers are stacked and used to grow the MSF, whereas the markers are also determined considering both classifications. The proposed methodology has been tested on a challenging forest species mapping task in northern Greece, considering a multispectral (GeoEye) and a hyper-spectral (CASI) image. The pixel-wise classifications resulted in overall accuracies (OA) of 68.71% for the GeoEye and 77.95% for the CASI images, respectively. Both of them are characterized by high levels of speckle noise. Applying the proposed multi-source MSSC-MSF fusion, the OA climbs to 90.86%, which is attributed both to the ability of MSSC-MSF to tackle the salt-and-pepper effect, as well as the fact that the fusion approach exploits the relative advantages of both information sources.

  18. Phylogenetic analysis, genomic diversity and classification of M class gene segments of turkey reoviruses.

    PubMed

    Mor, Sunil K; Marthaler, Douglas; Verma, Harsha; Sharafeldin, Tamer A; Jindal, Naresh; Porter, Robert E; Goyal, Sagar M

    2015-03-23

    From 2011 to 2014, 13 turkey arthritis reoviruses (TARVs) were isolated from cases of swollen hock joints in 2-18-week-old turkeys. In addition, two isolates from similar cases of turkey arthritis were received from another laboratory. Eight turkey enteric reoviruses (TERVs) isolated from fecal samples of turkeys were also used for comparison. The aims of this study were to characterize turkey reovirus (TRV) based on complete M class genome segments and to determine genetic diversity within TARVs in comparison to TERVs and chicken reoviruses (CRVs). Nucleotide (nt) cut off values of 84%, 83% and 85% for the M1, M2 and M3 gene segments were proposed and used for genotype classification, generating 5, 7, and 3 genotypes, respectively. Using these nt cut off values, we propose M class genotype constellations (GCs) for avian reoviruses. Of the seven GCs, GC1 and GC3 were shared between the TARVs and TERVs, indicating possible reassortment between turkey and chicken reoviruses. The TARVs and TERVs were divided into three GCs, and GC2 was unique to TARVs and TERVs. The proposed new GC approach should be useful in identifying reassortant viruses, which may ultimately be used in the design of a universal vaccine against both chicken and turkey reoviruses. PMID:25655814

  19. Highly Accurate Classification of Watson-Crick Basepairs on Termini of Single DNA Molecules

    PubMed Central

    Winters-Hilt, Stephen; Vercoutere, Wenonah; DeGuzman, Veronica S.; Deamer, David; Akeson, Mark; Haussler, David

    2003-01-01

    We introduce a computational method for classification of individual DNA molecules measured by an α-hemolysin channel detector. We show classification with better than 99% accuracy for DNA hairpin molecules that differ only in their terminal Watson-Crick basepairs. Signal classification was done in silico to establish performance metrics (i.e., where train and test data were of known type, via single-species data files). It was then performed in solution to assay real mixtures of DNA hairpins. Hidden Markov Models (HMMs) were used with Expectation/Maximization for denoising and for associating a feature vector with the ionic current blockade of the DNA molecule. Support Vector Machines (SVMs) were used as discriminators, and were the focus of off-line training. A multiclass SVM architecture was designed to place less discriminatory load on weaker discriminators, and novel SVM kernels were used to boost discrimination strength. The tuning on HMMs and SVMs enabled biophysical analysis of the captured molecule states and state transitions; structure revealed in the biophysical analysis was used for better feature selection. PMID:12547778

  20. Classification algorithms with multi-modal data fusion could accurately distinguish neuromyelitis optica from multiple sclerosis

    PubMed Central

    Eshaghi, Arman; Riyahi-Alam, Sadjad; Saeedi, Roghayyeh; Roostaei, Tina; Nazeri, Arash; Aghsaei, Aida; Doosti, Rozita; Ganjgahi, Habib; Bodini, Benedetta; Shakourirad, Ali; Pakravan, Manijeh; Ghana'ati, Hossein; Firouznia, Kavous; Zarei, Mojtaba; Azimi, Amir Reza; Sahraian, Mohammad Ali

    2015-01-01

    Neuromyelitis optica (NMO) exhibits substantial similarities to multiple sclerosis (MS) in clinical manifestations and imaging results and has long been considered a variant of MS. With the advent of a specific biomarker in NMO, known as anti-aquaporin 4, this assumption has changed; however, the differential diagnosis remains challenging and it is still not clear whether a combination of neuroimaging and clinical data could be used to aid clinical decision-making. Computer-aided diagnosis is a rapidly evolving process that holds great promise to facilitate objective differential diagnoses of disorders that show similar presentations. In this study, we aimed to use a powerful method for multi-modal data fusion, known as a multi-kernel learning and performed automatic diagnosis of subjects. We included 30 patients with NMO, 25 patients with MS and 35 healthy volunteers and performed multi-modal imaging with T1-weighted high resolution scans, diffusion tensor imaging (DTI) and resting-state functional MRI (fMRI). In addition, subjects underwent clinical examinations and cognitive assessments. We included 18 a priori predictors from neuroimaging, clinical and cognitive measures in the initial model. We used 10-fold cross-validation to learn the importance of each modality, train and finally test the model performance. The mean accuracy in differentiating between MS and NMO was 88%, where visible white matter lesion load, normal appearing white matter (DTI) and functional connectivity had the most important contributions to the final classification. In a multi-class classification problem we distinguished between all of 3 groups (MS, NMO and healthy controls) with an average accuracy of 84%. In this classification, visible white matter lesion load, functional connectivity, and cognitive scores were the 3 most important modalities. Our work provides preliminary evidence that computational tools can be used to help make an objective differential diagnosis of NMO and MS. PMID:25610795

  1. Classification algorithms with multi-modal data fusion could accurately distinguish neuromyelitis optica from multiple sclerosis.

    PubMed

    Eshaghi, Arman; Riyahi-Alam, Sadjad; Saeedi, Roghayyeh; Roostaei, Tina; Nazeri, Arash; Aghsaei, Aida; Doosti, Rozita; Ganjgahi, Habib; Bodini, Benedetta; Shakourirad, Ali; Pakravan, Manijeh; Ghana'ati, Hossein; Firouznia, Kavous; Zarei, Mojtaba; Azimi, Amir Reza; Sahraian, Mohammad Ali

    2015-01-01

    Neuromyelitis optica (NMO) exhibits substantial similarities to multiple sclerosis (MS) in clinical manifestations and imaging results and has long been considered a variant of MS. With the advent of a specific biomarker in NMO, known as anti-aquaporin 4, this assumption has changed; however, the differential diagnosis remains challenging and it is still not clear whether a combination of neuroimaging and clinical data could be used to aid clinical decision-making. Computer-aided diagnosis is a rapidly evolving process that holds great promise to facilitate objective differential diagnoses of disorders that show similar presentations. In this study, we aimed to use a powerful method for multi-modal data fusion, known as a multi-kernel learning and performed automatic diagnosis of subjects. We included 30 patients with NMO, 25 patients with MS and 35 healthy volunteers and performed multi-modal imaging with T1-weighted high resolution scans, diffusion tensor imaging (DTI) and resting-state functional MRI (fMRI). In addition, subjects underwent clinical examinations and cognitive assessments. We included 18 a priori predictors from neuroimaging, clinical and cognitive measures in the initial model. We used 10-fold cross-validation to learn the importance of each modality, train and finally test the model performance. The mean accuracy in differentiating between MS and NMO was 88%, where visible white matter lesion load, normal appearing white matter (DTI) and functional connectivity had the most important contributions to the final classification. In a multi-class classification problem we distinguished between all of 3 groups (MS, NMO and healthy controls) with an average accuracy of 84%. In this classification, visible white matter lesion load, functional connectivity, and cognitive scores were the 3 most important modalities. Our work provides preliminary evidence that computational tools can be used to help make an objective differential diagnosis of NMO and MS. PMID:25610795

  2. Towards a phylogenetic generic classification of Thelypteridaceae: Additional sampling suggests alterations of neotropical taxa and further study of paleotropical genera.

    PubMed

    Almeida, Thaís Elias; Hennequin, Sabine; Schneider, Harald; Smith, Alan R; Batista, João Aguiar Nogueira; Ramalho, Aline Joseph; Proite, Karina; Salino, Alexandre

    2016-01-01

    Thelypteridaceae is one of the largest fern families, having about 950 species and a cosmopolitan distribution but with most species occurring in tropical and subtropical regions. Its generic classification remains controversial, with different authors recognizing from one up to 32 genera. Phylogenetic relationships within the family have not been exhaustively studied, but previous studies have confirmed the monophyly of the lineage. Thus far, sampling has been inadequate for establishing a robust hypothesis of infrafamilial relationships within the family. In order to understand phylogenetic relationships within Thelypteridaceae and thus to improve generic reclassification, we expand the molecular sampling, including new samples of Old World taxa and, especially, many additional neotropical representatives. We also explore the monophyly of exclusively or mostly neotropical genera Amauropelta, Goniopteris, Meniscium, and Steiropteris. Our sampling includes 68 taxa and 134 newly generated sequences from two plastid genomic regions (rps4-trnS and trnL-trnF), plus 73 rps4 and 72 trnL-trnF sequences from GenBank. These data resulted in a concatenated matrix of 1980 molecular characters for 149 taxa. The combined data set was analyzed using maximum parsimony and bayesian inference of phylogeny. Our results are consistent with the general topological structure found in previous studies, including two main lineages within the family: phegopteroid and thelypteroid. The thelypteroid lineage comprises two clades; one of these included the segregates Metathelypteris, Coryphopteris, and Amauropelta (including part of Parathelypteris), whereas the other comprises all segregates of Cyclosorus s.l., such as Goniopteris, Meniscium, and Steiropteris (including Thelypteris polypodioides, previously incertae sedis). The three mainly neotropical segregates were found to be monophyletic but nested in a broadly defined Cyclosorus. The fourth mainly neotropical segregate, Amauropelta, was found to include species considered to be part of Parathelypteris. In Old World thelypteroids, which correspond to nearly half the diversity in the family, an increase in sampling is still needed to resolve relationships and circumscription of genera, particularly in the christelloid clade (i.e., Amphineuron, Chingia, Christella, Pneumatopteris, Pronephrium, and Sphaerostephanos). Based on currently available knowledge, we propose the recognition of 16 genera in the family. PMID:26400101

  3. Accurate and reliable image classification by using conformal predictors in the TJ-II Thomson scatteringa)

    NASA Astrophysics Data System (ADS)

    Vega, J.; Murari, A.; Pereira, A.; González, S.; Pastor, I.

    2010-10-01

    The charge-coupled device camera of the TJ-II Thomson scattering (TS) can capture five different classes of images. Typically, different data processing is performed depending on the kind of image that is acquired. The procedure can be automated to recognize the type of image. To this end, machine learning methods (MLM) are applied. However, usually, MLM classify without confidence estimates. An image classifier based on conformal predictors has been developed for the TJ-II TS. It provides a couple of indicators (confidence and credibility) for each classification that measures the accuracy and reliability of the prediction. Results achieve success rates of about 97%. The implemented classifier is valid for any kind of images.

  4. Use B-factor related features for accurate classification between protein binding interfaces and crystal packing contacts

    PubMed Central

    2014-01-01

    Background Distinction between true protein interactions and crystal packing contacts is important for structural bioinformatics studies to respond to the need of accurate classification of the rapidly increasing protein structures. There are many unannotated crystal contacts and there also exist false annotations in this rapidly expanding volume of data. Previous tools have been proposed to address this problem. However, challenging issues still remain, such as low performance when the training and test data contain mixed interfaces having diverse sizes of contact areas. Methods and results B factor is a measure to quantify the vibrational motion of an atom, a more relevant feature than interface size to characterize protein binding. We propose to use three features related to B factor for the classification between biological interfaces and crystal packing contacts. The first feature is the sum of the normalized B factors of the interfacial atoms in the contact area, the second is the average of the interfacial B factor per residue in the chain, and the third is the average number of interfacial atoms with a negative normalized B factor per residue in the chain. We investigate the distribution properties of these basic features and a compound feature on four datasets of biological binding and crystal packing, and on a protein binding-only dataset with known binding affinity. We also compare the cross-dataset classification performance of these features with existing methods and with a widely-used and the most effective feature interface area. The results demonstrate that our features outperform the interface area approach and the existing prediction methods remarkably for many tests on all of these datasets. Conclusions The proposed B factor related features are more effective than interface area to distinguish crystal packing from biological binding interfaces. Our computational methods have a potential for large-scale and accurate identification of biological interactions from the experimentally determined structural data stored at PDB which may have diverse interface sizes. PMID:25522196

  5. Two fast and accurate heuristic RBF learning rules for data classification.

    PubMed

    Rouhani, Modjtaba; Javan, Dawood S

    2016-03-01

    This paper presents new Radial Basis Function (RBF) learning methods for classification problems. The proposed methods use some heuristics to determine the spreads, the centers and the number of hidden neurons of network in such a way that the higher efficiency is achieved by fewer numbers of neurons, while the learning algorithm remains fast and simple. To retain network size limited, neurons are added to network recursively until termination condition is met. Each neuron covers some of train data. The termination condition is to cover all training data or to reach the maximum number of neurons. In each step, the center and spread of the new neuron are selected based on maximization of its coverage. Maximization of coverage of the neurons leads to a network with fewer neurons and indeed lower VC dimension and better generalization property. Using power exponential distribution function as the activation function of hidden neurons, and in the light of new learning approaches, it is proved that all data became linearly separable in the space of hidden layer outputs which implies that there exist linear output layer weights with zero training error. The proposed methods are applied to some well-known datasets and the simulation results, compared with SVM and some other leading RBF learning methods, show their satisfactory and comparable performance. PMID:26797472

  6. Protein clustering and RNA phylogenetic reconstruction of the influenza A [corrected] virus NS1 protein allow an update in classification and identification of motif conservation.

    PubMed

    Sevilla-Reyes, Edgar E; Chavaro-Prez, David A; Piten-Isidro, Elvira; Gutirrez-Gonzlez, Luis H; Santos-Mendoza, Teresa

    2013-01-01

    The non-structural protein 1 (NS1) of influenza A virus (IAV), coded by its third most diverse gene, interacts with multiple molecules within infected cells. NS1 is involved in host immune response regulation and is a potential contributor to the virus host range. Early phylogenetic analyses using 50 sequences led to the classification of NS1 gene variants into groups (alleles) A and B. We reanalyzed NS1 diversity using 14,716 complete NS IAV sequences, downloaded from public databases, without host bias. Removal of sequence redundancy and further structured clustering at 96.8% amino acid similarity produced 415 clusters that enhanced our capability to detect distinct subgroups and lineages, which were assigned a numerical nomenclature. Maximum likelihood phylogenetic reconstruction using RNA sequences indicated the previously identified deep branching separating group A from group B, with five distinct subgroups within A as well as two and five lineages within the A4 and A5 subgroups, respectively. Our classification model proposes that sequence patterns in thirteen amino acid positions are sufficient to fit >99.9% of all currently available NS1 sequences into the A subgroups/lineages or the B group. This classification reduces host and virus bias through the prioritization of NS1 RNA phylogenetics over host or virus phenetics. We found significant sequence conservation within the subgroups and lineages with characteristic patterns of functional motifs, such as the differential binding of CPSF30 and crk/crkL or the availability of a C-terminal PDZ-binding motif. To understand selection pressures and evolution acting on NS1, it is necessary to organize the available data. This updated classification may help to clarify and organize the study of NS1 interactions and pathogenic differences and allow the drawing of further functional inferences on sequences in each group, subgroup and lineage rather than on a strain-by-strain basis. PMID:23667580

  7. Protein Clustering and RNA Phylogenetic Reconstruction of the Influenza a Virus NS1 Protein Allow an Update in Classification and Identification of Motif Conservation

    PubMed Central

    Sevilla-Reyes, Edgar E.; Chavaro-Prez, David A.; Piten-Isidro, Elvira; Gutirrez-Gonzlez, Luis H.; Santos-Mendoza, Teresa

    2013-01-01

    The non-structural protein 1 (NS1) of influenza A virus (IAV), coded by its third most diverse gene, interacts with multiple molecules within infected cells. NS1 is involved in host immune response regulation and is a potential contributor to the virus host range. Early phylogenetic analyses using 50 sequences led to the classification of NS1 gene variants into groups (alleles) A and B. We reanalyzed NS1 diversity using 14,716 complete NS IAV sequences, downloaded from public databases, without host bias. Removal of sequence redundancy and further structured clustering at 96.8% amino acid similarity produced 415 clusters that enhanced our capability to detect distinct subgroups and lineages, which were assigned a numerical nomenclature. Maximum likelihood phylogenetic reconstruction using RNA sequences indicated the previously identified deep branching separating group A from group B, with five distinct subgroups within A as well as two and five lineages within the A4 and A5 subgroups, respectively. Our classification model proposes that sequence patterns in thirteen amino acid positions are sufficient to fit >99.9% of all currently available NS1 sequences into the A subgroups/lineages or the B group. This classification reduces host and virus bias through the prioritization of NS1 RNA phylogenetics over host or virus phenetics. We found significant sequence conservation within the subgroups and lineages with characteristic patterns of functional motifs, such as the differential binding of CPSF30 and crk/crkL or the availability of a C-terminal PDZ-binding motif. To understand selection pressures and evolution acting on NS1, it is necessary to organize the available data. This updated classification may help to clarify and organize the study of NS1 interactions and pathogenic differences and allow the drawing of further functional inferences on sequences in each group, subgroup and lineage rather than on a strain-by-strain basis. PMID:23667580

  8. TIPP: taxonomic identification and phylogenetic profiling

    PubMed Central

    Nguyen, Nam-phuong; Mirarab, Siavash; Liu, Bo; Pop, Mihai; Warnow, Tandy

    2014-01-01

    Motivation: Abundance profiling (also called ‘phylogenetic profiling’) is a crucial step in understanding the diversity of a metagenomic sample, and one of the basic techniques used for this is taxonomic identification of the metagenomic reads. Results: We present taxon identification and phylogenetic profiling (TIPP), a new marker-based taxon identification and abundance profiling method. TIPP combines SAT\\'e-enabled phylogenetic placement a phylogenetic placement method, with statistical techniques to control the classification precision and recall, and results in improved abundance profiles. TIPP is highly accurate even in the presence of high indel errors and novel genomes, and matches or improves on previous approaches, including NBC, mOTU, PhymmBL, MetaPhyler and MetaPhlAn. Availability and implementation: Software and supplementary materials are available at http://www.cs.utexas.edu/users/phylo/software/sepp/tipp-submission/. Contact: warnow@illinois.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25359891

  9. Phylogenetic classification of Escherichia coli O26 strains from human, animals, and environmental origins using nucleotide polymorphisms

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Shiga toxin-producing Escherichia coli (STEC) O26 strains are food-borne pathogens that were recently classified as adulterants in certain beef products. Little is known about their genetic diversity, including whether or not phylogenetic subtypes within the serogroup vary in their assoc...

  10. Comprehensive phylogenetic reconstructions of African swine fever virus: proposal for a new classification and molecular dating of the virus.

    PubMed

    Michaud, Vincent; Randriamparany, Tantely; Albina, Emmanuel

    2013-01-01

    African swine fever (ASF) is a highly lethal disease of domestic pigs caused by the only known DNA arbovirus. It was first described in Kenya in 1921 and since then many isolates have been collected worldwide. However, although several phylogenetic studies have been carried out to understand the relationships between the isolates, no molecular dating analyses have been achieved so far. In this paper, comprehensive phylogenetic reconstructions were made using newly generated, publicly available sequences of hundreds of ASFV isolates from the past 70 years. Analyses focused on B646L, CP204L, and E183L genes from 356, 251, and 123 isolates, respectively. Phylogenetic analyses were achieved using maximum likelihood and Bayesian coalescence methods. A new lineage-based nomenclature is proposed to designate 35 different clusters. In addition, dating of ASFV origin was carried out from the molecular data sets. To avoid bias, diversity due to positive selection or recombination events was neutralized. The molecular clock analyses revealed that ASFV strains currently circulating have evolved over 300 years, with a time to the most recent common ancestor (TMRCA) in the early 18(th) century. PMID:23936068

  11. Increasing the data size to accurately reconstruct the phylogenetic relationships between nine subgroups of the Drosophila melanogaster species group (Drosophilidae, Diptera).

    PubMed

    Yang, Yong; Hou, Zhuo-Cheng; Qian, Yuan-Huai; Kang, Han; Zeng, Qing-Tao

    2012-01-01

    Previous phylogenetic analyses of the melanogaster species group have led to conflicting hypotheses concerning their relationship; therefore the addition of new sequence data is necessary to discover the phylogeny of this species group. Here we present new data derived from 17 genes and representing 48 species to reconstruct the phylogeny of the melanogaster group. A variety of statistical tests, as well as maximum likelihood mapping analysis, were performed to estimate data quality, suggesting that all genes had a high degree of contribution to resolve the phylogeny. Individual locus was analyzed using maximum likelihood (ML), and the concatenated dataset (12,988 bp) were analyzed using partitioned maximum likelihood (ML) and Bayesian analyses. Separated analysis produced various phylogenetic relationships, however, phylogenetic topologies from ML and Bayesian analysis based on concatenated dataset, at the subgroup level, were completely identical to each other with high levels of support. Our results recovered three major clades: the ananassae subgroup, followed by the montium subgroup, the melanogaster subgroup and the oriental subgroups form the third monophyletic clade, in which melanogaster (takahashii, suzukii) forms one subclade and ficusphila [eugracilis (elegans, rhopaloa)] forms another. However, more data are necessary to determine the phylogenetic position of Drosophila lucipennis which proved difficult to place. PMID:21985965

  12. Analysis of genetic diversity in banana cultivars (Musa cvs.) from the South of Oman using AFLP markers and classification by phylogenetic, hierarchical clustering and principal component analyses*

    PubMed Central

    Opara, Umezuruike Linus; Jacobson, Dan; Al-Saady, Nadiya Abubakar

    2010-01-01

    Banana is an important crop grown in Oman and there is a dearth of information on its genetic diversity to assist in crop breeding and improvement programs. This study employed amplified fragment length polymorphism (AFLP) to investigate the genetic variation in local banana cultivars from the southern region of Oman. Using 12 primer combinations, a total of 1094 bands were scored, of which 1012 were polymorphic. Eighty-two unique markers were identified, which revealed the distinct separation of the seven cultivars. The results obtained show that AFLP can be used to differentiate the banana cultivars. Further classification by phylogenetic, hierarchical clustering and principal component analyses showed significant differences between the clusters found with molecular markers and those clusters created by previous studies using morphological analysis. Based on the analytical results, a consensus dendrogram of the banana cultivars is presented. PMID:20443211

  13. DEFLATE Compression Algorithm Corrects for Overestimation of Phylogenetic Diversity by Grantham Approach to Single-Nucleotide Polymorphism Classification

    PubMed Central

    Schlosberg, Arran; Lam, Brian Y. H.; Yeo, Giles S. H.; Clifton-Bligh, Roderick J.

    2014-01-01

    Improvements in speed and cost of genome sequencing are resulting in increasing numbers of novel non-synonymous single nucleotide polymorphisms (nsSNPs) in genes known to be associated with disease. The large number of nsSNPs makes laboratory-based classification infeasible and familial co-segregation with disease is not always possible. In-silico methods for classification or triage are thus utilised. A popular tool based on multiple-species sequence alignments (MSAs) and work by Grantham, Align-GVGD, has been shown to underestimate deleterious effects, particularly as sequence numbers increase. We utilised the DEFLATE compression algorithm to account for expected variation across a number of species. With the adjusted Grantham measure we derived a means of quantitatively clustering known neutral and deleterious nsSNPs from the same gene; this was then used to assign novel variants to the most appropriate cluster as a means of binary classification. Scaling of clusters allows for inter-gene comparison of variants through a single pathogenicity score. The approach improves upon the classification accuracy of Align-GVGD while correcting for sensitivity to large MSAs. Open-source code and a web server are made available at https://github.com/aschlosberg/CompressGV. PMID:24828207

  14. When proglottids and scoleces conflict: phylogenetic relationships and a family-level classification of the Lecanicephalidea (Platyhelminthes: Cestoda).

    PubMed

    Jensen, Kirsten; Caira, Janine N; Cielocha, Joanna J; Littlewood, D Timothy J; Waeschenbach, Andrea

    2016-05-01

    This study presents the first comprehensive phylogenetic analysis of the interrelationships of the morphologically diverse elasmobranch-hosted tapeworm order Lecanicephalidea, based on molecular sequence data. With almost half of current generic diversity having been erected or resurrected within the last decade, an apparent conflict between scolex morphology and proglottid anatomy has hampered the assignment of many of these genera to families. Maximum likelihood and Bayesian analyses of two nuclear markers (D1-D3 of lsrDNA and complete ssrDNA) and two mitochondrial markers (partial rrnL and partial cox1) for 61 lecanicephalidean species representing 22 of the 25 valid genera were conducted; new sequence data were generated for 43 species and 11 genera, including three undescribed genera. The monophyly of the order was confirmed in all but the analyses based on cox1 data alone. Sesquipedalapex placed among species of Anteropora and was thus synonymized with the latter genus. Based on analyses of the concatenated dataset, eight major groups emerged which are herein formally recognised at the familial level. Existing family names (i.e., Lecanicephalidae, Polypocephalidae, Tetragonocephalidae, and Cephalobothriidae) are maintained for four of the eight clades, and new families are proposed for the remaining four groups (Aberrapecidae n. fam., Eniochobothriidae n. fam., Paraberrapecidae n. fam., and Zanobatocestidae n. fam.). The four new families and the Tetragonocephalidae are monogeneric, while the Cephalobothriidae, Lecanicephalidae and Polypocephalidae comprise seven, eight and four genera, respectively. As a result of their unusual morphologies, the three genera not included here (i.e., Corrugatocephalum, Healyum and Quadcuspibothrium) are considered incertae sedis within the order until their familial affinities can be examined in more detail. All eight families are newly circumscribed based on morphological features and a key to the families is provided. Aspects of morphological evolution and host associations are discussed in a phylogenetic context for each family and for the order as a whole. Lecanicephalidean genera lacking apical structures were confirmed as the earliest diverging lineages. Proglottid anatomy was determined to be much more conserved and indicative of phylogenetic affinities than scolex morphology. Collectively, the Lecanicephalidea parasitize three of the four orders of Batoidea-their almost exclusive absence from skates (Order Rajiformes) appears to be real; only a few records from sharks exist. At the family level, the breadth of host associations is correlated with taxonomic diversity of the family. The degree to which factors such as intermediate host use or host specificity at any stage in the life-cycle shape these patterns is currently unknown. PMID:26994689

  15. More accurate MK classification using the continuous spectra and average energy distributions for O9-A0 stars

    NASA Astrophysics Data System (ADS)

    Ruban, E. V.

    1985-01-01

    For stars of classes O9-A0, stars that best represent each of the spectral subclasses were chosen so as to obtain the average energy distributions in the spectra on the basis of homogeneous data obtained by the Southern Expedition of the USSR Academy of Sciences from 1971 to 1973. Tabulated information given with the current work includes normal stars with unchanged classifications, normal stars with changed classifications, and stars excluded from the list of normal stars.

  16. Phylogenetic relationships and classification of thiolases and thiolase-like proteins of Mycobacterium tuberculosis and Mycobacterium smegmatis.

    PubMed

    Anbazhagan, Padmanabhan; Harijan, Rajesh K; Kiema, Tiila R; Janardan, Neelanjana; Murthy, M R N; Michels, Paul A M; Juffer, Andr H; Wierenga, Rik K

    2014-07-01

    Thiolases are enzymes involved in lipid metabolism. Thiolases remove the acetyl-CoA moiety from 3-ketoacyl-CoAs in the degradative reaction. They can also catalyze the reverse Claisen condensation reaction, which is the first step of biosynthetic processes such as the biosynthesis of sterols and ketone bodies. In human, six distinct thiolases have been identified. Each of these thiolases is different from the other with respect to sequence, oligomeric state, substrate specificity and subcellular localization. Four sequence fingerprints, identifying catalytic loops of thiolases, have been described. In this study genome searches of two mycobacterial species (Mycobacterium tuberculosis and Mycobacterium smegmatis), were carried out, using the six human thiolase sequences as queries. Eight and thirteen different thiolase sequences were identified in M.tuberculosis and M.smegmatis, respectively. In addition, thiolase-like proteins (one encoded in the Mtb and two in the Msm genome) were found. The purpose of this study is to classify these mostly uncharacterized thiolases and thiolase-like proteins. Several other sequences obtained by searches of genome databases of bacteria, mammals and the parasitic protist family of the Trypanosomatidae were included in the analysis. Thiolase-like proteins were also found in the trypanosomatid genomes, but not in those of mammals. In order to study the phylogenetic relationships at a high confidence level, additional thiolase sequences were included such that a total of 130 thiolases and thiolase-like protein sequences were used for the multiple sequence alignment. The resulting phylogenetic tree identifies 12 classes of sequences, each possessing a characteristic set of sequence fingerprints for the catalytic loops. From this analysis it is now possible to assign the mycobacterial thiolases to corresponding homologues in other kingdoms of life. The results of this bioinformatics analysis also show interesting differences between the distributions of M.tuberculosis and M.smegmatis thiolases over the 12 different classes. PMID:24825023

  17. Classification

    ERIC Educational Resources Information Center

    Clary, Renee; Wandersee, James

    2013-01-01

    In this article, Renee Clary and James Wandersee describe the beginnings of "Classification," which lies at the very heart of science and depends upon pattern recognition. Clary and Wandersee approach patterns by first telling the story of the "Linnaean classification system," introduced by Carl Linnacus (1707-1778), who is…

  18. Bacterial classification of fish-pathogenic Mycobacterium species by multigene phylogenetic analyses and MALDI Biotyper identification system.

    PubMed

    Kurokawa, Satoru; Kabayama, Jun; Fukuyasu, Tsuguaki; Hwang, Seong Don; Park, Chan-Il; Park, Seong-Bin; del Castillo, Carmelo S; Hikima, Jun-ichi; Jung, Tae-Sung; Kondo, Hidehiro; Hirono, Ikuo; Takeyama, Haruko; Aoki, Takashi

    2013-06-01

    Mycobacterium marinum is difficult to distinguish from other species of Mycobacterium isolated from fish using biochemical methods. Here, we used genetic and proteomic analyses to distinguish three Mycobacterium strains: M. marinum strains MB2 and Europe were isolated from tropical and marine fish in Thailand and Europe, and Mycobacterium sp. 012931 strain was isolated from yellowtail in Japan. In phylogenetic trees based on gyrB, rpoB, and Ag85B genes, Mycobacterium sp. 012931 clustered with M. marinum strains MB2 and Europe, but in trees based on 16S rRNA, hsp65, and Ag85A genes Mycobacterium sp. 012931 did not cluster with the other strains. In proteomic analyses using a Bruker matrix-assisted laser desorption ionization Biotyper, the mass profile of Mycobacterium sp. 012931 differed from the mass profiles of the other two fish M. marinum strains. Therefore, Mycobacterium sp. 012931 is similar to M. marinum but is not the same, suggesting that it could be a subspecies of M. marinum. PMID:23229498

  19. Accurate classification and hemagglutinin amino acid signatures for influenza A virus host-origin association and subtyping.

    PubMed

    ElHefnawi, Mahmoud; Sherif, Fayroz F

    2014-01-20

    Host-origin classification and signatures of influenza A viruses were investigated based on the HA protein for tracking of the HA host of origin. Hidden Markov models (HMMs), decision trees and associative classification for each influenza A virus subtype and its major hosts (human, avian, swine) were generated. Features of the HA protein signatures that were host-and subtype-specific were sought. Host-associated signatures that occurred in different subtypes of the virus were identified. Evaluation of the classification models based on ROC curves and support and confidence ratings for the amino acid class-association rules was performed. Host classification based on the HA subtype achieved accuracies between 91.2% and 100% using decision trees after feature selection. Host-specific class association rules for avian-host origins gave better support and confidence ratings, followed by human and finally swine origin. This finding indicated the lower specificity of the swine host, perhaps pointing to its ability to mix different strains. PMID:24418567

  20. Phylogenetic relationships of coprophilous Pleosporales (Dothideomycetes, Ascomycota), and the classification of some bitunicate taxa of unknown position.

    PubMed

    Kruys, Asa; Eriksson, Ove E; Wedin, Mats

    2006-05-01

    The purpose of this study was to investigate the natural relationships within the large bitunicate order Pleosporales, with special focus on the coprophilous families Delitschiaceae, Phaeotrichaceae and Sporormiaceae. Parsimony and Bayesian analyses were performed using nSSU, nLSU and mtSSU rDNA sequence data. We also investigated the placement of a number of taxa with uncertain position. Our results showed that Pleosporales, including Delitschiaceae, Sporormiaceae, Zopfiaceae and Testudinaceae, form a monophyletic group with strong support. Although Delitschiaceae has been considered a synonym of Sporormiaceae, the two families do not form one monophyletic group. Similarly, Zopfiaceae and Testudinaceae should be retained as separate families as they did not group together or with Phaeotrichaceae or Sporormiaceae. Zopfiaceae and Delitchiaceae did group together, but without significant support. Eremodothis angulata (currently in Testudinaceae) is closely related to Westerdykella in Sporormiaceae. Phaeotrichaceae and Venturiaceae formed a group with strong BS support on a branch outside Pleosporales, but an alternative topology including Phaeotrichaceae and Venturiaceae within Pleosporales could not be rejected. All taxa in the present study that were placed with uncertain position in Dothideomycetes/Chaetothyriomycetes in the current classification by Eriksson, grouped within the monophyletic Dothideomycetes. PMID:16769507

  1. A species independent universal bio-detection microarray for pathogen forensics and phylogenetic classification of unknown microorganisms

    PubMed Central

    2011-01-01

    Background The ability to differentiate a bioterrorist attack or an accidental release of a research pathogen from a naturally occurring pandemic or disease event is crucial to the safety and security of this nation by enabling an appropriate and rapid response. It is critical in samples from an infected patient, the environment, or a laboratory to quickly and accurately identify the precise pathogen including natural or engineered variants and to classify new pathogens in relation to those that are known. Current approaches for pathogen detection rely on prior genomic sequence information. Given the enormous spectrum of genetic possibilities, a field deployable, robust technology, such as a universal (any species) microarray has near-term potential to address these needs. Results A new and comprehensive sequence-independent array (Universal Bio-Signature Detection Array) was designed with approximately 373,000 probes. The main feature of this array is that the probes are computationally derived and sequence independent. There is one probe for each possible 9-mer sequence, thus 49 (262,144) probes. Each genome hybridized on this array has a unique pattern of signal intensities corresponding to each of these probes. These signal intensities were used to generate an un-biased cluster analysis of signal intensity hybridization patterns that can easily distinguish species into accepted and known phylogenomic relationships. Within limits, the array is highly sensitive and is able to detect synthetically mixed pathogens. Examples of unique hybridization signal intensity patterns are presented for different Brucella species as well as relevant host species and other pathogens. These results demonstrate the utility of the UBDA array as a diagnostic tool in pathogen forensics. Conclusions This pathogen detection system is fast, accurate and can be applied to any species. Hybridization patterns are unique to a specific genome and these can be used to decipher the identity of a mixed pathogen sample and can separate hosts and pathogens into their respective phylogenomic relationships. This technology can also differentiate between different species and classify genomes into their known clades. The development of this technology will result in the creation of an integrated biomarker-specific bio-signature, multiple select agent specific detection system. PMID:21672191

  2. A non-contact method based on multiple signal classification algorithm to reduce the measurement time for accurately heart rate detection

    NASA Astrophysics Data System (ADS)

    Bechet, P.; Mitran, R.; Munteanu, M.

    2013-08-01

    Non-contact methods for the assessment of vital signs are of great interest for specialists due to the benefits obtained in both medical and special applications, such as those for surveillance, monitoring, and search and rescue. This paper investigates the possibility of implementing a digital processing algorithm based on the MUSIC (Multiple Signal Classification) parametric spectral estimation in order to reduce the observation time needed to accurately measure the heart rate. It demonstrates that, by proper dimensioning the signal subspace, the MUSIC algorithm can be optimized in order to accurately assess the heart rate during an 8-28 s time interval. The validation of the processing algorithm performance was achieved by minimizing the mean error of the heart rate after performing simultaneous comparative measurements on several subjects. In order to calculate the error the reference value of heart rate was measured using a classic measurement system through direct contact.

  3. A non-contact method based on multiple signal classification algorithm to reduce the measurement time for accurately heart rate detection.

    PubMed

    Bechet, P; Mitran, R; Munteanu, M

    2013-08-01

    Non-contact methods for the assessment of vital signs are of great interest for specialists due to the benefits obtained in both medical and special applications, such as those for surveillance, monitoring, and search and rescue. This paper investigates the possibility of implementing a digital processing algorithm based on the MUSIC (Multiple Signal Classification) parametric spectral estimation in order to reduce the observation time needed to accurately measure the heart rate. It demonstrates that, by proper dimensioning the signal subspace, the MUSIC algorithm can be optimized in order to accurately assess the heart rate during an 8-28 s time interval. The validation of the processing algorithm performance was achieved by minimizing the mean error of the heart rate after performing simultaneous comparative measurements on several subjects. In order to calculate the error the reference value of heart rate was measured using a classic measurement system through direct contact. PMID:24007088

  4. Is the SIOP-2001 Classification of Renal Tumors of Childhood accurate with regard to prognosis? A problem revisited

    PubMed Central

    Taran, Katarzyna; Młynarski, Wojciech; Sitkiewicz, Anna

    2012-01-01

    Introduction The goal of this study was to analyze morbidity and mortality of Wilms’ tumor based on the revised SIOP-2001 classification. Material and methods Sixty-four patients with unilateral Wilms’ tumor, 33 girls (51.5%) and 31 boys (48.5%), aged 1 to 144 months (mean: 42.8 months) were treated between 1993 and 2009. All patients underwent multimodal therapy according to the SIOP protocols. The follow-up period ranged from 2 to 18 years (mean: 11.6 years). Results Thirty-three patients (51.6%) had intermediate-risk, 6 (9.4%) low-risk and 25 (39%) high-risk tumors. Stage I disease was diagnosed in 28 (43.7%), stage II in 19 (29.7%), stage III in 8 (12.5%) and stage IV in 9 patients (14.1%). Event-free survival (EFS) in the entire group was 78.1% and OS was 92.2%. The EFS in stage IV (44.4%) was significantly lower than in stage I (82.1%, p = 0.04), stage II (89.5%, p = 0.02) and in the entire group (78.1%, p = 0.04). Sixteen complications were observed in 14 children (21.9%); metastases in 7 cases (10.9%), 8 relapses (12.5%) and 5 deaths (7.8%). Blastemal (20/24 – 83.3%) and anaplastic (3/24 – 12.5%) subtypes were responsible for mortality in high-risk tumors (OS – 87.5%), while poorly differentiated epithelial (7/34 – 20.6%) and regressive (8/34 – 23.5%) subtypes decreased OS (94.1%) in the intermediate-risk tumors. Conclusions The results of our study show that epithelial and regressive subtypes were responsible for mortality in the intermediate-risk Wilms’ tumors. PMID:23056081

  5. Fast, Simple and Accurate Handwritten Digit Classification by Training Shallow Neural Network Classifiers with the 'Extreme Learning Machine' Algorithm.

    PubMed

    McDonnell, Mark D; Tissera, Migel D; Vladusich, Tony; van Schaik, André; Tapson, Jonathan

    2015-01-01

    Recent advances in training deep (multi-layer) architectures have inspired a renaissance in neural network use. For example, deep convolutional networks are becoming the default option for difficult tasks on large datasets, such as image and speech recognition. However, here we show that error rates below 1% on the MNIST handwritten digit benchmark can be replicated with shallow non-convolutional neural networks. This is achieved by training such networks using the 'Extreme Learning Machine' (ELM) approach, which also enables a very rapid training time (∼ 10 minutes). Adding distortions, as is common practise for MNIST, reduces error rates even further. Our methods are also shown to be capable of achieving less than 5.5% error rates on the NORB image database. To achieve these results, we introduce several enhancements to the standard ELM algorithm, which individually and in combination can significantly improve performance. The main innovation is to ensure each hidden-unit operates only on a randomly sized and positioned patch of each image. This form of random 'receptive field' sampling of the input ensures the input weight matrix is sparse, with about 90% of weights equal to zero. Furthermore, combining our methods with a small number of iterations of a single-batch backpropagation method can significantly reduce the number of hidden-units required to achieve a particular performance. Our close to state-of-the-art results for MNIST and NORB suggest that the ease of use and accuracy of the ELM algorithm for designing a single-hidden-layer neural network classifier should cause it to be given greater consideration either as a standalone method for simpler problems, or as the final classification stage in deep neural networks applied to more difficult problems. PMID:26262687

  6. Classification

    NASA Technical Reports Server (NTRS)

    Oza, Nikunj C.

    2011-01-01

    A supervised learning task involves constructing a mapping from input data (normally described by several features) to the appropriate outputs. Within supervised learning, one type of task is a classification learning task, in which each output is one or more classes to which the input belongs. In supervised learning, a set of training examples---examples with known output values---is used by a learning algorithm to generate a model. This model is intended to approximate the mapping between the inputs and outputs. This model can be used to generate predicted outputs for inputs that have not been seen before. For example, we may have data consisting of observations of sunspots. In a classification learning task, our goal may be to learn to classify sunspots into one of several types. Each example may correspond to one candidate sunspot with various measurements or just an image. A learning algorithm would use the supplied examples to generate a model that approximates the mapping between each supplied set of measurements and the type of sunspot. This model can then be used to classify previously unseen sunspots based on the candidate's measurements. This chapter discusses methods to perform machine learning, with examples involving astronomy.

  7. Molecular-genetic analysis is essential for accurate classification of renal carcinoma resembling Xp11.2 translocation carcinoma.

    PubMed

    Hayes, Malcolm; Peckova, Kvetoslava; Martinek, Petr; Hora, Milan; Kalusova, Kristyna; Straka, Lubomir; Daum, Ondrej; Kokoskova, Bohuslava; Rotterova, Pavla; Pivovarčikova, Kristyna; Branzovsky, Jindrich; Dubova, Magdalena; Vesela, Pavla; Michal, Michal; Hes, Ondrej

    2015-03-01

    Xp11.2-translocation renal carcinoma (TRCC) is suspected when a renal carcinoma occurs in young patients, patients with a prior history of exposure to chemotherapy and when the neoplasm has morphological features suggestive of that entity. We retrieved 20 renal tumours (from 17,500 archival cases) of which morphology arose suspicion for TRCC. In nine cases, TFE3 translocation was confirmed by fluorescence in situ hybridisation analysis. In 9 of the remaining 11 TRCC-like cases (7 male, 4 female, aged 22-84 years), material was available for further study. The morphological spectrum was diverse. Six tumours showed a mixture of cells with eosinophilic or clear cytoplasm in tubular, acinar and papillary architecture. One case was high grade with epithelioid, spindle cell and sarcomatoid areas. Another showed tubular, solid, and papillary areas and foci containing spindle cells reminiscent of mucinous tubular and spindle cell carcinoma. The third showed dyscohesive nests of large epithelioid and histiocytoid cells in a background of dense lymphoplasmacytic infiltrate. By immunohistochemistry, keratin AE1/AE3 was diffusely positive in three tumours, while CK7 strongly stained one tumour and another focally and weakly. CD10 and Pax8 were expressed by eight, AMACR and vimentin by seven, CA-IX by four and TFE3 and cathepsin K by two tumours. Of the two TFE3-positive tumours, one showed polysomy of chromosome 7 and the other of 17; they were VHL normal and diagnosed as unclassifiable RCC. Of the seven TFE3-negative tumours, three showed polysomy of 7/17 and VHL abnormality and were diagnosed as combined clear cell RCC/papillary RCC. One TFE3-negative tumour with normal 7/17 but LOH 3p (VHL abnormality) was diagnosed as clear cell RCC. One TFE3-negative tumour with polysomy 7/17 but normal VHL was diagnosed as papillary RCC, and two with normal chromosomes 7/17 and VHL gene were considered unclassifiable. As morphological features and IHC are heterogeneous, TRCC-like renal tumours can only be sub-classified accurately by multi-parameter molecular-genetic analysis. PMID:25544614

  8. Phylogenetic Status of an Unrecorded Species of Curvularia, C. spicifera, Based on Current Classification System of Curvularia and Bipolaris Group Using Multi Loci

    PubMed Central

    Jeon, Sun Jeong; Nguyen, Thi Thuong Thuong

    2015-01-01

    A seed-borne fungus, Curvularia sp. EML-KWD01, was isolated from an indigenous wheat seed by standard blotter method. This fungus was characterized based on the morphological characteristics and molecular phylogenetic analysis. Phylogenetic status of the fungus was determined using sequences of three loci: rDNA internal transcribed spacer, large ribosomal subunit, and glyceraldehyde 3-phosphate dehydrogenase gene. Multi loci sequencing analysis revealed that this fungus was Curvularia spicifera within Curvularia group 2 of family Pleosporaceae. PMID:26539036

  9. Bacterial Phylogenetic Clusters Revealed by Genome Structure

    PubMed Central

    Liu, Shu-Lin; Schryvers, Anthony B.; Sanderson, Kenneth E.; Johnston, Randal N.

    1999-01-01

    Current bacterial taxonomy is mostly based on phenotypic criteria, which may yield misleading interpretations in classification and identification. As a result, bacteria not closely related may be grouped together as a genus or species. For pathogenic bacteria, incorrect classification or misidentification could be disastrous. There is therefore an urgent need for appropriate methodologies to classify bacteria according to phylogeny and corresponding new approaches that permit their rapid and accurate identification. For this purpose, we have devised a strategy enabling us to resolve phylogenetic clusters of bacteria by comparing their genome structures. These structures were revealed by cleaving genomic DNA with the endonuclease I-CeuI, which cuts within the 23S ribosomal DNA (rDNA) sequences, and by mapping the resulting large DNA fragments with pulsed-field gel electrophoresis. We tested this experimental system on two representative bacterial genera: Salmonella and Pasteurella. Among Salmonella spp., I-CeuI mapping revealed virtually indistinguishable genome structures, demonstrating a high degree of structural conservation. Consistent with this, 16S rDNA sequences are also highly conserved among the Salmonella spp. In marked contrast, the Pasteurella strains have very different genome structures among and even within individual species. The divergence of Pasteurella was also reflected in 16S rDNA sequences and far exceeded that seen between Escherichia and Salmonella. Based on this diversity, the Pasteurella haemolytica strains we analyzed could be divided into 14 phylogenetic groups and the Pasteurella multocida strains could be divided into 9 groups. If criteria for defining bacterial species or genera similar to those used for Salmonella and Escherichia coli were applied, the striking phylogenetic diversity would allow bacteria in the currently recognized species of P. multocida and P. haemolytica to be divided into different species, genera, or even higher ranks. On the other hand, strains of Pasteurella ureae and Pasteurella pneumotropica are very similar to those of P. multocida in both genome structure and 16S rDNA sequence and should be regarded as strains within this species. We conclude that large-scale genome structure can be a sensitive indicator of phylogenetic relationships and that, therefore, I-CeuI-based genomic mapping is an efficient tool for probing the phylogenetic status of bacteria. PMID:10542177

  10. An Exploration of Hyperion Hyperspectral Imagery Combined with Different Supervised Classification Approaches Towards Obtaining More Accurate Land Use/Cover Cartography

    NASA Astrophysics Data System (ADS)

    Igityan, Nune

    2014-05-01

    Land use and land cover (LULC) constitutes a key variable of the Earth's system that has in general shown a close correlation with human activities and the physical environment. Describing the pattern and the spatial distribution of LULC is traditionally based on remote sensing data analysis and, evidently, one of the most commonly techniques applied has been image classification. The main objective of the present study has been to evaluate the combined use of Hyperion hyperspectral imagery with a range of supervised classification algorithms widely available today for discriminating LULC classes in a typical Mediterranean setting. Accuracy assessment of the derived thematic maps was based on the analysis of the classification confusion matrix statistics computed for each classification map, using for consistency the same set of validation points. Those were selected on the basis of photo-interpretation of high resolution aerial imagery and of panchromatic imagery available for the studied region at the time of the Hyperion overpass. Results indicated close classification accuracy between the different classifiers with the SVMs outperforming the other classification approaches. The higher classification accuracy by SVMs was attributed principally to the ability of this classifier to identify an optimal separating hyperplane for classes' separation which allows a low generalisation error, thus producing the best possible classes' separation. Although all classifiers produced close results, SVMs generally appeared most useful in describing the spatial distribution and the cover density of each land cover category. All in all, this study demonstrated that, provided that a Hyperion hyperspectral imagery can be made available at regular time intervals over a given region, when combined with SVMs classifiers, can potentially enable a wider approach in land use/cover mapping. This can be of particular importance, especially for regions like in the Mediterranean basin, since it can be related to mapping and monitoring of land degradation and desertification phenomena which are evident in such areas. KEYWORDS: land cover/use mapping, Hyperion, classification, Mediterranean

  11. A preliminary phylogenetic analysis of the New World Helopini (Coleoptera, Tenebrionidae, Tenebrioninae) indicates the need for profound rearrangements of the classification

    PubMed Central

    Cifuentes-Ruiz, Paulina; Zaragoza-Caballero, Santiago; Ochoterena-Booth, Helga; Morón, Miguel Ángel

    2014-01-01

    Abstract Helopini is a diverse tribe in the subfamily Tenebrioninae with a worldwide distribution. The New World helopine species have not been reviewed recently and several doubts emerge regarding their generic assignment as well as the naturalness of the tribe and subordinate taxa. To assess these questions, a preliminary cladistic analysis was conducted with emphasis on sampling the genera distributed in the New World, but including representatives from other regions. The parsimony analysis includes 30 ingroup species from America, Europe and Asia of the subtribes Helopina and Cylindrinotina, plus three outgroups, and 67 morphological characters. Construction of the matrix resulted in the discovery of morphological character states not previously reported for the tribe, particularly from the genitalia of New World species. A consensus of the 12 most parsimonious trees supports the monophyly of the tribe based on a unique combination of characters, including one synapomorphy. None of the subtribes or the genera of the New World represented by more than one species (Helops Fabricius, Nautes Pascoe and Tarpela Bates) were recovered as monophyletic. Helopina was recovered as paraphyletic in relation to Cylindrinotina. One Nearctic species of Helops and one Palearctic species of Tarpela (subtribe Helopina) were more closely related to species of Cylindrinotina. A relatively derived clade, mainly composed by Neotropical species, was found; it includes seven species of Tarpela, seven species of Nautes, and three species of Helops, two Nearctic and one Neotropical. Our results reveal the need to deeply re-evaluate the current classification of the tribe and subordinated taxa, but a broader taxon sampling and further character exploration is needed in order to fully recognize monophyletic groups at different taxonomic levels (from subtribes to genera). PMID:25009428

  12. Extensive phylogenetic analysis of a soil bacterial community illustrates extreme taxon evenness and the effects of amplicon length, degree of coverage, and DNA fractionation on classification and ecological parameters.

    PubMed

    Morales, Sergio E; Cosart, Theodore F; Johnson, Jesse V; Holben, William E

    2009-02-01

    To thoroughly investigate the bacterial community diversity present in a single composite sample from an agricultural soil and to examine potential biases resulting from data acquisition and analytical approaches, we examined the effects of percent G+C DNA fractionation, sequence length, and degree of coverage of bacterial diversity on several commonly used ecological parameters (species estimation, diversity indices, and evenness). We also examined variation in phylogenetic placement based on multiple commonly used approaches (ARB alignments and multiple RDP tools). The results demonstrate that this soil bacterial community is highly diverse, with 1,714 operational taxonomic units demonstrated and 3,555 estimated (based on the Chao1 richness estimation) at 97% sequence similarity using the 16S rRNA gene. The results also demonstrate a fundamental lack of dominance (i.e., a high degree of evenness), with 82% of phylotypes being encountered three times or less. The data also indicate that generally accepted cutoff values for phylum-level taxonomic classification might not be as applicable or as general as previously assumed and that such values likely vary between prokaryotic phyla or groups. PMID:19011079

  13. ICGA-PSO-ELM approach for accurate multiclass cancer classification resulting in reduced gene sets in which genes encoding secreted proteins are highly represented.

    PubMed

    Saraswathi, Saras; Sundaram, Suresh; Sundararajan, Narasimhan; Zimmermann, Michael; Nilsen-Hamilton, Marit

    2011-01-01

    A combination of Integer-Coded Genetic Algorithm (ICGA) and Particle Swarm Optimization (PSO), coupled with the neural-network-based Extreme Learning Machine (ELM), is used for gene selection and cancer classification. ICGA is used with PSO-ELM to select an optimal set of genes, which is then used to build a classifier to develop an algorithm (ICGA_PSO_ELM) that can handle sparse data and sample imbalance. We evaluate the performance of ICGA-PSO-ELM and compare our results with existing methods in the literature. An investigation into the functions of the selected genes, using a systems biology approach, revealed that many of the identified genes are involved in cell signaling and proliferation. An analysis of these gene sets shows a larger representation of genes that encode secreted proteins than found in randomly selected gene sets. Secreted proteins constitute a major means by which cells interact with their surroundings. Mounting biological evidence has identified the tumor microenvironment as a critical factor that determines tumor survival and growth. Thus, the genes identified by this study that encode secreted proteins might provide important insights to the nature of the critical biological features in the microenvironment of each tumor type that allow these cells to thrive and proliferate. PMID:21233525

  14. Accurate age classification of 6 and 12 month-old infants based on resting-state functional connectivity magnetic resonance imaging data

    PubMed Central

    Pruett, John R.; Kandala, Sridhar; Hoertel, Sarah; Snyder, Abraham Z.; Elison, Jed T.; Nishino, Tomoyuki; Feczko, Eric; Dosenbach, Nico U.F.; Nardos, Binyam; Power, Jonathan D.; Adeyemo, Babatunde; Botteron, Kelly N.; McKinstry, Robert C.; Evans, Alan C.; Hazlett, Heather C.; Dager, Stephen R.; Paterson, Sarah; Schultz, Robert T.; Collins, D. Louis; Fonov, Vladimir S.; Styner, Martin; Gerig, Guido; Das, Samir; Kostopoulos, Penelope; Constantino, John N.; Estes, Annette M.; Petersen, Steven E.; Schlaggar, Bradley L.; Piven, Joseph

    2015-01-01

    Human large-scale functional brain networks are hypothesized to undergo significant changes over development. Little is known about these functional architectural changes, particularly during the second half of the first year of life. We used multivariate pattern classification of resting-state functional connectivity magnetic resonance imaging (fcMRI) data obtained in an on-going, multi-site, longitudinal study of brain and behavioral development to explore whether fcMRI data contained information sufficient to classify infant age. Analyses carefully account for the effects of fcMRI motion artifact. Support vector machines (SVMs) classified 6 versus 12 month-old infants (128 datasets) above chance based on fcMRI data alone. Results demonstrate significant changes in measures of brain functional organization that coincide with a special period of dramatic change in infant motor, cognitive, and social development. Explorations of the most different correlations used for SVM lead to two different interpretations about functional connections that support 6 versus 12-month age categorization. PMID:25704288

  15. Accurate age classification of 6 and 12 month-old infants based on resting-state functional connectivity magnetic resonance imaging data.

    PubMed

    Pruett, John R; Kandala, Sridhar; Hoertel, Sarah; Snyder, Abraham Z; Elison, Jed T; Nishino, Tomoyuki; Feczko, Eric; Dosenbach, Nico U F; Nardos, Binyam; Power, Jonathan D; Adeyemo, Babatunde; Botteron, Kelly N; McKinstry, Robert C; Evans, Alan C; Hazlett, Heather C; Dager, Stephen R; Paterson, Sarah; Schultz, Robert T; Collins, D Louis; Fonov, Vladimir S; Styner, Martin; Gerig, Guido; Das, Samir; Kostopoulos, Penelope; Constantino, John N; Estes, Annette M; Petersen, Steven E; Schlaggar, Bradley L; Piven, Joseph

    2015-04-01

    Human large-scale functional brain networks are hypothesized to undergo significant changes over development. Little is known about these functional architectural changes, particularly during the second half of the first year of life. We used multivariate pattern classification of resting-state functional connectivity magnetic resonance imaging (fcMRI) data obtained in an on-going, multi-site, longitudinal study of brain and behavioral development to explore whether fcMRI data contained information sufficient to classify infant age. Analyses carefully account for the effects of fcMRI motion artifact. Support vector machines (SVMs) classified 6 versus 12 month-old infants (128 datasets) above chance based on fcMRI data alone. Results demonstrate significant changes in measures of brain functional organization that coincide with a special period of dramatic change in infant motor, cognitive, and social development. Explorations of the most different correlations used for SVM lead to two different interpretations about functional connections that support 6 versus 12-month age categorization. PMID:25704288

  16. Rapid and accurate taxonomic classification of insect (class Insecta) cytochrome c oxidase subunit 1 (COI) DNA barcode sequences using a naïve Bayesian classifier

    PubMed Central

    Porter, Teresita M; Gibson, Joel F; Shokralla, Shadi; Baird, Donald J; Golding, G Brian; Hajibabaei, Mehrdad

    2014-01-01

    Current methods to identify unknown insect (class Insecta) cytochrome c oxidase (COI barcode) sequences often rely on thresholds of distances that can be difficult to define, sequence similarity cut-offs, or monophyly. Some of the most commonly used metagenomic classification methods do not provide a measure of confidence for the taxonomic assignments they provide. The aim of this study was to use a naïve Bayesian classifier (Wang et al. Applied and Environmental Microbiology, 2007; 73: 5261) to automate taxonomic assignments for large batches of insect COI sequences such as data obtained from high-throughput environmental sequencing. This method provides rank-flexible taxonomic assignments with an associated bootstrap support value, and it is faster than the blast-based methods commonly used in environmental sequence surveys. We have developed and rigorously tested the performance of three different training sets using leave-one-out cross-validation, two field data sets, and targeted testing of Lepidoptera, Diptera and Mantodea sequences obtained from the Barcode of Life Data system. We found that type I error rates, incorrect taxonomic assignments with a high bootstrap support, were already relatively low but could be lowered further by ensuring that all query taxa are actually present in the reference database. Choosing bootstrap support cut-offs according to query length and summarizing taxonomic assignments to more inclusive ranks can also help to reduce error while retaining the maximum number of assignments. Additionally, we highlight gaps in the taxonomic and geographic representation of insects in public sequence databases that will require further work by taxonomists to improve the quality of assignments generated using any method.

  17. Fast, Simple and Accurate Handwritten Digit Classification by Training Shallow Neural Network Classifiers with the ‘Extreme Learning Machine’ Algorithm

    PubMed Central

    McDonnell, Mark D.; Tissera, Migel D.; Vladusich, Tony; van Schaik, André; Tapson, Jonathan

    2015-01-01

    Recent advances in training deep (multi-layer) architectures have inspired a renaissance in neural network use. For example, deep convolutional networks are becoming the default option for difficult tasks on large datasets, such as image and speech recognition. However, here we show that error rates below 1% on the MNIST handwritten digit benchmark can be replicated with shallow non-convolutional neural networks. This is achieved by training such networks using the ‘Extreme Learning Machine’ (ELM) approach, which also enables a very rapid training time (∼ 10 minutes). Adding distortions, as is common practise for MNIST, reduces error rates even further. Our methods are also shown to be capable of achieving less than 5.5% error rates on the NORB image database. To achieve these results, we introduce several enhancements to the standard ELM algorithm, which individually and in combination can significantly improve performance. The main innovation is to ensure each hidden-unit operates only on a randomly sized and positioned patch of each image. This form of random ‘receptive field’ sampling of the input ensures the input weight matrix is sparse, with about 90% of weights equal to zero. Furthermore, combining our methods with a small number of iterations of a single-batch backpropagation method can significantly reduce the number of hidden-units required to achieve a particular performance. Our close to state-of-the-art results for MNIST and NORB suggest that the ease of use and accuracy of the ELM algorithm for designing a single-hidden-layer neural network classifier should cause it to be given greater consideration either as a standalone method for simpler problems, or as the final classification stage in deep neural networks applied to more difficult problems. PMID:26262687

  18. The Need for Improved Identification and Accurate Classification of Stages 3–5 Chronic Kidney Disease in Primary Care: Retrospective Cohort Study

    PubMed Central

    Jain, Poorva; Calvert, Melanie; Cockwell, Paul; McManus, Richard J.

    2014-01-01

    Background Around ten percent of the population have been reported as having Chronic Kidney Disease (CKD), which is associated with increased cardiovascular mortality. Few previous studies have ascertained the chronicity of CKD. In the UK, a payment for performance (P4P) initiative incentivizes CKD (stages 3–5) recognition and management in primary care, but the impact of this has not been assessed. Methods and Findings Using data from 426 primary care practices (population 2,707,130), the age standardised prevalence of stages 3–5 CKD was identified using two consecutive estimated Glomerular Filtration Rates (eGFRs) seven days apart. Additionally the accuracy of practice CKD registers and the relationship between accurate identification of CKD and the achievement of P4P indicators was determined. Between 2005 and 2009, the prevalence of stages 3–5 CKD increased from 0.3% to 3.9%. In 2009, 30,440 patients (1.1% unadjusted) fulfilled biochemical criteria for CKD but were not on a practice CKD register (uncoded CKD) and 60,705 patients (2.2% unadjusted) were included on a practice CKD register but did not fulfil biochemical criteria (miscoded CKD). For patients with confirmed CKD, inclusion in a practice register was associated with increasing age, male sex, diabetes, hypertension, cardiovascular disease and increasing CKD stage (p<0.0001). Uncoded CKD patients compared to miscoded patients were less likely to achieve performance indicators for blood pressure (OR 0.84, 95% CI 0.82–0.86 p<0.001) or recorded albumin-creatinine ratio (OR 0.73, 0.70–0.76, p<0.001). Conclusions The prevalence of stages 3–5 CKD, using two laboratory reported eGFRs, was lower than estimates from previous studies. Clinically significant discrepancies were identified between biochemically defined CKD and appearance on practice registers, with misclassification associated with sub-optimal care for some people with CKD. PMID:25115813

  19. Phylogenetic molecular function annotation

    NASA Astrophysics Data System (ADS)

    Engelhardt, Barbara E.; Jordan, Michael I.; Repo, Susanna T.; Brenner, Steven E.

    2009-07-01

    It is now easier to discover thousands of protein sequences in a new microbial genome than it is to biochemically characterize the specific activity of a single protein of unknown function. The molecular functions of protein sequences have typically been predicted using homology-based computational methods, which rely on the principle that homologous proteins share a similar function. However, some protein families include groups of proteins with different molecular functions. A phylogenetic approach for predicting molecular function (sometimes called "phylogenomics") is an effective means to predict protein molecular function. These methods incorporate functional evidence from all members of a family that have functional characterizations using the evolutionary history of the protein family to make robust predictions for the uncharacterized proteins. However, they are often difficult to apply on a genome-wide scale because of the time-consuming step of reconstructing the phylogenies of each protein to be annotated. Our automated approach for function annotation using phylogeny, the SIFTER (Statistical Inference of Function Through Evolutionary Relationships) methodology, uses a statistical graphical model to compute the probabilities of molecular functions for unannotated proteins. Our benchmark tests showed that SIFTER provides accurate functional predictions on various protein families, outperforming other available methods.

  20. Phylogenetic molecular function annotation

    PubMed Central

    Engelhardt, Barbara E; Jordan, Michael I; Repo, Susanna T; Brenner, Steven E

    2010-01-01

    It is now easier to discover thousands of protein sequences in a new microbial genome than it is to biochemically characterize the specific activity of a single protein of unknown function. The molecular functions of protein sequences have typically been predicted using homology-based computational methods, which rely on the principle that homologous proteins share a similar function. However, some protein families include groups of proteins with different molecular functions. A phylogenetic approach for predicting molecular function (sometimes called phylogenomics) is an effective means to predict protein molecular function. These methods incorporate functional evidence from all members of a family that have functional characterizations using the evolutionary history of the protein family to make robust predictions for the uncharacterized proteins. However, they are often difficult to apply on a genome-wide scale because of the time-consuming step of reconstructing the phylogenies of each protein to be annotated. Our automated approach for function annotation using phylogeny, the SIFTER (Statistical Inference of Function Through Evolutionary Relationships) methodology, uses a statistical graphical model to compute the probabilities of molecular functions for unannotated proteins. Our benchmark tests showed that SIFTER provides accurate functional predictions on various protein families, outperforming other available methods. PMID:20664722

  1. Complete chloroplast genome of the genus Cymbidium: lights into the species identification, phylogenetic implications and population genetic analyses

    PubMed Central

    2013-01-01

    Background Cymbidium orchids, including some 50 species, are the famous flowers, and they possess high commercial value in the floricultural industry. Furthermore, the values of different orchids are great differences. However, species identification is very difficult. To a certain degree, chloroplast DNA sequence data are a versatile tool for species identification and phylogenetic implications in plants. Different chloroplast loci have been utilized for evaluating phylogenetic relationships at each classification level among plant species, including at the interspecies and intraspecies levels. However, there is no evidence that a short sequence can distinguish all plant species from each other in order to infer phylogenetic relationships. Molecular markers derived from the complete chloroplast genome can provide effective tools for species identification and phylogenetic resolution. Results The complete nucleotide sequences of eight individuals from a total of five Cymbidium species’ chloroplast (cp) genomes were determined using Illumina sequencing technology of the total DNA via a combination of de novo and reference-guided assembly. The length of the Cymbidium cp genome is about 155 kb. The cp genomes contain 123 unique genes, and the IR regions contain 24 duplicates. Although the genomes, including genome structure, gene order and orientation, are similar to those of other orchids, they are not evolutionarily conservative. The cp genome of Cymbidium evolved moderately with more than 3% sequence divergence, which could provide enough information for phylogeny. Rapidly evolving chloroplast genome regions were identified and 11 new divergence hotspot regions were disclosed for further phylogenetic study and species identification in Orchidaceae. Conclusions Phylogenomic analyses were conducted using 10 complete chloroplast genomes from seven orchid species. These data accurately identified the individuals and established the phylogenetic relationships between the species. The results reveal that phylogenomics based on organelle genome sequencing lights the species identification—organelle-scale “barcodes”, and is also an effective approach for studying whole populations and phylogenetic characteristics of Cymbidium. PMID:23597078

  2. Phylogenetic Inference From Conserved sites Alignments

    SciTech Connect

    grundy, W.N.; Naylor, G.J.P.

    1999-08-15

    Molecular sequences provide a rich source of data for inferring the phylogenetic relationships among species. However, recent work indicates that even an accurate multiple alignment of a large sequence set may yield an incorrect phylogeny and that the quality of the phylogenetic tree improves when the input consists only of the highly conserved, motif regions of the alignment. This work introduces two methods of producing multiple alignments that include only the conserved regions of the initial alignment. The first method retains conserved motifs, whereas the second retains individual conserved sites in the initial alignment. Using parsimony analysis on a mitochondrial data set containing 19 species among which the phylogenetic relationships are widely accepted, both conserved alignment methods produce better phylogenetic trees than the complete alignment. Unlike any of the 19 inference methods used before to analyze this data, both methods produce trees that are completely consistent with the known phylogeny. The motif-based method employs far fewer alignment sites for comparable error rates. For a larger data set containing mitochondrial sequences from 39 species, the site-based method produces a phylogenetic tree that is largely consistent with known phylogenetic relationships and suggests several novel placements.

  3. Phylogenetic lineages in Entomophthoromycota

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Entomophthoromycota Humber is one of five major phylogenetic lineages among the former phylum Zygomycota. These early terrestrial fungi share evolutionarily ancestral characters such as coenocytic mycelium and gametangiogamy as a sexual process resulting in zygospore formation. Previous molecular st...

  4. Phylogenetic relationships among arecoid palms (Arecaceae: Arecoideae)

    PubMed Central

    Baker, William J.; Norup, Maria V.; Clarkson, James J.; Couvreur, Thomas L. P.; Dowe, John L.; Lewis, Carl E.; Pintaud, Jean-Christophe; Savolainen, Vincent; Wilmot, Tomas; Chase, Mark W.

    2011-01-01

    Background and Aims The Arecoideae is the largest and most diverse of the five subfamilies of palms (Arecaceae/Palmae), containing >50 % of the species in the family. Despite its importance, phylogenetic relationships among Arecoideae are poorly understood. Here the most densely sampled phylogenetic analysis of Arecoideae available to date is presented. The results are used to test the current classification of the subfamily and to identify priority areas for future research. Methods DNA sequence data for the low-copy nuclear genes PRK and RPB2 were collected from 190 palm species, covering 103 (96 %) genera of Arecoideae. The data were analysed using the parsimony ratchet, maximum likelihood, and both likelihood and parsimony bootstrapping. Key Results and Conclusions Despite the recovery of paralogues and pseudogenes in a small number of taxa, PRK and RPB2 were both highly informative, producing well-resolved phylogenetic trees with many nodes well supported by bootstrap analyses. Simultaneous analyses of the combined data sets provided additional resolution and support. Two areas of incongruence between PRK and RPB2 were strongly supported by the bootstrap relating to the placement of tribes Chamaedoreeae, Iriarteeae and Reinhardtieae; the causes of this incongruence remain uncertain. The current classification within Arecoideae was strongly supported by the present data. Of the 14 tribes and 14 sub-tribes in the classification, only five sub-tribes from tribe Areceae (Basseliniinae, Linospadicinae, Oncospermatinae, Rhopalostylidinae and Verschaffeltiinae) failed to receive support. Three major higher level clades were strongly supported: (1) the RRC clade (Roystoneeae, Reinhardtieae and Cocoseae), (2) the POS clade (Podococceae, Oranieae and Sclerospermeae) and (3) the core arecoid clade (Areceae, Euterpeae, Geonomateae, Leopoldinieae, Manicarieae and Pelagodoxeae). However, new data sources are required to elucidate ambiguities that remain in phylogenetic relationships among and within the major groups of Arecoideae, as well as within the Areceae, the largest tribe in the palm family. PMID:21325340

  5. Cyber infrastructure for Fusarium: three integrated platforms supporting strain identification, phylogenetics, comparative genomics and knowledge sharing.

    PubMed

    Park, Bongsoo; Park, Jongsun; Cheong, Kyeong-Chae; Choi, Jaeyoung; Jung, Kyongyong; Kim, Donghan; Lee, Yong-Hwan; Ward, Todd J; O'Donnell, Kerry; Geiser, David M; Kang, Seogchan

    2011-01-01

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate species identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on well-preserved culture collections, have established a robust foundation for Fusarium classification. Genomes of four Fusarium species have been published with more being currently sequenced. The Cyber infrastructure for Fusarium (CiF; http://www.fusariumdb.org/) was built to support archiving and utilization of rapidly increasing data and knowledge and consists of Fusarium-ID, Fusarium Comparative Genomics Platform (FCGP) and Fusarium Community Platform (FCP). The Fusarium-ID archives phylogenetic marker sequences from most known species along with information associated with characterized isolates and supports strain identification and phylogenetic analyses. The FCGP currently archives five genomes from four species. Besides supporting genome browsing and analysis, the FCGP presents computed characteristics of multiple gene families and functional groups. The Cart/Favorite function allows users to collect sequences from Fusarium-ID and the FCGP and analyze them later using multiple tools without requiring repeated copying-and-pasting of sequences. The FCP is designed to serve as an online community forum for sharing and preserving accumulated experience and knowledge to support future research and education. PMID:21087991

  6. Cyber infrastructure for Fusarium: three integrated platforms supporting strain identification, phylogenetics, comparative genomics and knowledge sharing

    PubMed Central

    Park, Bongsoo; Park, Jongsun; Cheong, Kyeong-Chae; Choi, Jaeyoung; Jung, Kyongyong; Kim, Donghan; Lee, Yong-Hwan; Ward, Todd J.; O'Donnell, Kerry; Geiser, David M.; Kang, Seogchan

    2011-01-01

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate species identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on well-preserved culture collections, have established a robust foundation for Fusarium classification. Genomes of four Fusarium species have been published with more being currently sequenced. The Cyber infrastructure for Fusarium (CiF; http://www.fusariumdb.org/) was built to support archiving and utilization of rapidly increasing data and knowledge and consists of Fusarium-ID, Fusarium Comparative Genomics Platform (FCGP) and Fusarium Community Platform (FCP). The Fusarium-ID archives phylogenetic marker sequences from most known species along with information associated with characterized isolates and supports strain identification and phylogenetic analyses. The FCGP currently archives five genomes from four species. Besides supporting genome browsing and analysis, the FCGP presents computed characteristics of multiple gene families and functional groups. The Cart/Favorite function allows users to collect sequences from Fusarium-ID and the FCGP and analyze them later using multiple tools without requiring repeated copying-and-pasting of sequences. The FCP is designed to serve as an online community forum for sharing and preserving accumulated experience and knowledge to support future research and education. PMID:21087991

  7. On Exploring Genome Rearrangement Phylogenetic Patterns

    NASA Astrophysics Data System (ADS)

    Xu, Andrew Wei

    The study of genome rearrangement is much harder than the corresponding problems on DNA and protein sequences, because of the occurrences of numerous combinatorial structures. By explicitly exploring these combinatorial structures, the recently developed adequate subgraph theory shows that a family of these structures, adequate subgraphs, are informative in finding the optimal solutions to the rearrangement median problem. Its extension gives rise to the tree scoring method GASTS, which provides quick and accurate estimation of the number of rearrangement events, for any given topology. With a similar motivation, this paper discusses and provides solid but somewhat initial results, on combinatorial structures that are informative in phylogenetic inference. These structures, called rearrangement phylogenetic patterns, provide more insights than algorithmic approaches, and may provide statistical significance for inferred phylogenies and lead to efficient and robust phylogenetic inference methods on large sets of taxa.

  8. The phylogenetic likelihood library.

    PubMed

    Flouri, T; Izquierdo-Carrasco, F; Darriba, D; Aberer, A J; Nguyen, L-T; Minh, B Q; Von Haeseler, A; Stamatakis, A

    2015-03-01

    We introduce the Phylogenetic Likelihood Library (PLL), a highly optimized application programming interface for developing likelihood-based phylogenetic inference and postanalysis software. The PLL implements appropriate data structures and functions that allow users to quickly implement common, error-prone, and labor-intensive tasks, such as likelihood calculations, model parameter as well as branch length optimization, and tree space exploration. The highly optimized and parallelized implementation of the phylogenetic likelihood function and a thorough documentation provide a framework for rapid development of scalable parallel phylogenetic software. By example of two likelihood-based phylogenetic codes we show that the PLL improves the sequential performance of current software by a factor of 2-10 while requiring only 1 month of programming time for integration. We show that, when numerical scaling for preventing floating point underflow is enabled, the double precision likelihood calculations in the PLL are up to 1.9 times faster than those in BEAGLE. On an empirical DNA dataset with 2000 taxa the AVX version of PLL is 4 times faster than BEAGLE (scaling enabled and required). The PLL is available at http://www.libpll.org under the GNU General Public License (GPL). PMID:25358969

  9. The Phylogenetic Likelihood Library

    PubMed Central

    Flouri, T.; Izquierdo-Carrasco, F.; Darriba, D.; Aberer, A.J.; Nguyen, L.-T.; Minh, B.Q.; Von Haeseler, A.; Stamatakis, A.

    2015-01-01

    We introduce the Phylogenetic Likelihood Library (PLL), a highly optimized application programming interface for developing likelihood-based phylogenetic inference and postanalysis software. The PLL implements appropriate data structures and functions that allow users to quickly implement common, error-prone, and labor-intensive tasks, such as likelihood calculations, model parameter as well as branch length optimization, and tree space exploration. The highly optimized and parallelized implementation of the phylogenetic likelihood function and a thorough documentation provide a framework for rapid development of scalable parallel phylogenetic software. By example of two likelihood-based phylogenetic codes we show that the PLL improves the sequential performance of current software by a factor of 2–10 while requiring only 1 month of programming time for integration. We show that, when numerical scaling for preventing floating point underflow is enabled, the double precision likelihood calculations in the PLL are up to 1.9 times faster than those in BEAGLE. On an empirical DNA dataset with 2000 taxa the AVX version of PLL is 4 times faster than BEAGLE (scaling enabled and required). The PLL is available at http://www.libpll.org under the GNU General Public License (GPL). PMID:25358969

  10. Phylogenetically resolving epidemiologic linkage

    DOE PAGESBeta

    Romero-Severson, Ethan O.; Bulla, Ingo; Leitner, Thomas

    2016-02-22

    The use of phylogenetic trees in epidemiological investigations has become commonplace, but their epidemiological interpretation has not been systematically evaluated. Here, we use an HIV-1 within-host coalescent model to probabilistically evaluate transmission histories of two epidemiologically linked hosts. Previous critique of phylogenetic reconstruction has claimed that direction of transmission is difficult to infer, and that the existence of unsampled intermediary links or common sources can never be excluded. The phylogenetic relationship between the HIV populations of epidemiologically linked hosts can be classified into six types of trees, based on cladistic relationships and whether the reconstruction is consistent with the truemore » transmission history or not. We show that the direction of transmission and whether unsampled intermediary links or common sources existed make very different predictions about expected phylogenetic relationships: (i) Direction of transmission can often be established when paraphyly exists, (ii) intermediary links can be excluded when multiple lineages were transmitted, and (iii) when the sampled individuals’ HIV populations both are monophyletic a common source was likely the origin. Inconsistent results, suggesting the wrong transmission direction, were generally rare. In addition, the expected tree topology also depends on the number of transmitted lineages, the sample size, the time of the sample relative to transmission, and how fast the diversity increases after infection. Typically, 20 or more sequences per subject give robust results. Moreover, we confirm our theoretical evaluations with analyses of real transmission histories and discuss how our findings should aid in interpreting phylogenetic results.« less

  11. Phylogenetically resolving epidemiologic linkage

    PubMed Central

    Romero-Severson, Ethan O.; Bulla, Ingo; Leitner, Thomas

    2016-01-01

    Although the use of phylogenetic trees in epidemiological investigations has become commonplace, their epidemiological interpretation has not been systematically evaluated. Here, we use an HIV-1 within-host coalescent model to probabilistically evaluate transmission histories of two epidemiologically linked hosts. Previous critique of phylogenetic reconstruction has claimed that direction of transmission is difficult to infer, and that the existence of unsampled intermediary links or common sources can never be excluded. The phylogenetic relationship between the HIV populations of epidemiologically linked hosts can be classified into six types of trees, based on cladistic relationships and whether the reconstruction is consistent with the true transmission history or not. We show that the direction of transmission and whether unsampled intermediary links or common sources existed make very different predictions about expected phylogenetic relationships: (i) Direction of transmission can often be established when paraphyly exists, (ii) intermediary links can be excluded when multiple lineages were transmitted, and (iii) when the sampled individuals’ HIV populations both are monophyletic a common source was likely the origin. Inconsistent results, suggesting the wrong transmission direction, were generally rare. In addition, the expected tree topology also depends on the number of transmitted lineages, the sample size, the time of the sample relative to transmission, and how fast the diversity increases after infection. Typically, 20 or more sequences per subject give robust results. We confirm our theoretical evaluations with analyses of real transmission histories and discuss how our findings should aid in interpreting phylogenetic results. PMID:26903617

  12. Phylogenetically resolving epidemiologic linkage.

    PubMed

    Romero-Severson, Ethan O; Bulla, Ingo; Leitner, Thomas

    2016-03-01

    Although the use of phylogenetic trees in epidemiological investigations has become commonplace, their epidemiological interpretation has not been systematically evaluated. Here, we use an HIV-1 within-host coalescent model to probabilistically evaluate transmission histories of two epidemiologically linked hosts. Previous critique of phylogenetic reconstruction has claimed that direction of transmission is difficult to infer, and that the existence of unsampled intermediary links or common sources can never be excluded. The phylogenetic relationship between the HIV populations of epidemiologically linked hosts can be classified into six types of trees, based on cladistic relationships and whether the reconstruction is consistent with the true transmission history or not. We show that the direction of transmission and whether unsampled intermediary links or common sources existed make very different predictions about expected phylogenetic relationships: (i) Direction of transmission can often be established when paraphyly exists, (ii) intermediary links can be excluded when multiple lineages were transmitted, and (iii) when the sampled individuals' HIV populations both are monophyletic a common source was likely the origin. Inconsistent results, suggesting the wrong transmission direction, were generally rare. In addition, the expected tree topology also depends on the number of transmitted lineages, the sample size, the time of the sample relative to transmission, and how fast the diversity increases after infection. Typically, 20 or more sequences per subject give robust results. We confirm our theoretical evaluations with analyses of real transmission histories and discuss how our findings should aid in interpreting phylogenetic results. PMID:26903617

  13. PhyloPhlAn is a new method for improved phylogenetic and taxonomic placement of microbes.

    PubMed

    Segata, Nicola; Börnigen, Daniela; Morgan, Xochitl C; Huttenhower, Curtis

    2013-01-01

    New microbial genomes are constantly being sequenced, and it is crucial to accurately determine their taxonomic identities and evolutionary relationships. Here we report PhyloPhlAn, a new method to assign microbial phylogeny and putative taxonomy using >400 proteins optimized from among 3,737 genomes. This method measures the sequence diversity of all clades, classifies genomes from deep-branching candidate divisions through closely related subspecies and improves consistency between phylogenetic and taxonomic groupings. PhyloPhlAn improved taxonomic accuracy for existing and newly sequenced genomes, detecting 157 erroneous labels, correcting 46 and placing or refining 130 new genomes. We provide examples of accurate classifications from subspecies (Sulfolobus spp.) to phyla, and of preliminary rooting of deep-branching candidate divisions, including consistent statistical support for Caldiserica (formerly candidate division OP5). PhyloPhlAn will thus be useful for both phylogenetic assessment and taxonomic quality control of newly sequenced genomes. The final phylogenies, conserved protein sequences and open-source implementation are available online. PMID:23942190

  14. A Universal Phylogenetic Tree.

    ERIC Educational Resources Information Center

    Offner, Susan

    2001-01-01

    Presents a universal phylogenetic tree suitable for use in high school and college-level biology classrooms. Illustrates the antiquity of life and that all life is related, even if it dates back 3.5 billion years. Reflects important evolutionary relationships and provides an exciting way to learn about the history of life. (SAH)

  15. Host specificity and phylogenetic relationships of chicken and turkey parvoviruses

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Previous reports indicate that the newly discovered chicken parvoviruses (ChPV) and turkey parvoviruses (TuPV) are very similar to each other, yet they represent different species within a new genus of Parvoviridae. Currently, strain classification is based on the phylogenetic analysis of a 561 bas...

  16. Soft-tissue characters in higher primate phylogenetics

    PubMed Central

    Gibbs, Sally; Collard, Mark; Wood, Bernard

    2000-01-01

    Recent research has cast doubt on the reliability of bones and teeth for reconstructing phylogenetic relationships among higher primate species and genera. Herein, we investigate whether this problem is confined to hard tissues by examining the utility of higher primate soft-tissue characters for reconstructing phylogenetic relationships at low taxonomic levels. We use cladistic methods to analyze 197 soft-tissue characters for the extant hominoids and then compare the resulting phylogenetic hypotheses with the group's consensus molecular phylogeny, which is widely considered to be accurate. We show that the soft-tissue characters yield robust phylogenetic hypotheses that are compatible with the molecular phylogeny. Given the strength of the evidence for molecular phylogeny, these results indicate that, unlike craniodental hard-tissue characters, soft tissues are reliable for reconstructing phylogenetic relationships among higher primate species and genera. Thus, in higher primates at least, some types of morphological data are more useful than others for phylogeny reconstruction. PMID:10995466

  17. Charles Darwin, beetles and phylogenetics

    NASA Astrophysics Data System (ADS)

    Beutel, Rolf G.; Friedrich, Frank; Leschen, Richard A. B.

    2009-11-01

    Here, we review Charles Darwin’s relation to beetles and developments in coleopteran systematics in the last two centuries. Darwin was an enthusiastic beetle collector. He used beetles to illustrate different evolutionary phenomena in his major works, and astonishingly, an entire sub-chapter is dedicated to beetles in “The Descent of Man”. During his voyage on the Beagle, Darwin was impressed by the high diversity of beetles in the tropics, and he remarked that, to his surprise, the majority of species were small and inconspicuous. However, despite his obvious interest in the group, he did not get involved in beetle taxonomy, and his theoretical work had little immediate impact on beetle classification. The development of taxonomy and classification in the late nineteenth and earlier twentieth century was mainly characterised by the exploration of new character systems (e.g. larval features and wing venation). In the mid-twentieth century, Hennig’s new methodology to group lineages by derived characters revolutionised systematics of Coleoptera and other organisms. As envisioned by Darwin and Ernst Haeckel, the new Hennigian approach enabled systematists to establish classifications truly reflecting evolution. Roy A. Crowson and Howard E. Hinton, who both made tremendous contributions to coleopterology, had an ambivalent attitude towards the Hennigian ideas. The Mickoleit school combined detailed anatomical work with a classical Hennigian character evaluation, with stepwise tree building, comparatively few characters and a priori polarity assessment without explicit use of the outgroup comparison method. The rise of cladistic methods in the 1970s had a strong impact on beetle systematics. Cladistic computer programs facilitated parsimony analyses of large data matrices, mostly morphological characters not requiring detailed anatomical investigations. Molecular studies on beetle phylogeny started in the 1990s with modest taxon sampling and limited DNA data. This has changed dramatically. With very large data sets and high throughput sampling, phylogenetic questions can be addressed without prior knowledge of morphological characters. Nevertheless, molecular studies have not lead to the great breakthrough in beetle systematics—yet. Especially the phylogeny of the extremely species rich suborder Polyphaga remains incompletely resolved. Coordinated efforts of molecular workers and of morphologists using innovative techniques may lead to more profound insights in the near future. The final aim is to develop a well-founded phylogeny, which truly reflects the evolution of this immensely species rich group of organisms.

  18. Charles Darwin, beetles and phylogenetics.

    PubMed

    Beutel, Rolf G; Friedrich, Frank; Leschen, Richard A B

    2009-11-01

    Here, we review Charles Darwin's relation to beetles and developments in coleopteran systematics in the last two centuries. Darwin was an enthusiastic beetle collector. He used beetles to illustrate different evolutionary phenomena in his major works, and astonishingly, an entire sub-chapter is dedicated to beetles in "The Descent of Man". During his voyage on the Beagle, Darwin was impressed by the high diversity of beetles in the tropics, and he remarked that, to his surprise, the majority of species were small and inconspicuous. However, despite his obvious interest in the group, he did not get involved in beetle taxonomy, and his theoretical work had little immediate impact on beetle classification. The development of taxonomy and classification in the late nineteenth and earlier twentieth century was mainly characterised by the exploration of new character systems (e.g. larval features and wing venation). In the mid-twentieth century, Hennig's new methodology to group lineages by derived characters revolutionised systematics of Coleoptera and other organisms. As envisioned by Darwin and Ernst Haeckel, the new Hennigian approach enabled systematists to establish classifications truly reflecting evolution. Roy A. Crowson and Howard E. Hinton, who both made tremendous contributions to coleopterology, had an ambivalent attitude towards the Hennigian ideas. The Mickoleit school combined detailed anatomical work with a classical Hennigian character evaluation, with stepwise tree building, comparatively few characters and a priori polarity assessment without explicit use of the outgroup comparison method. The rise of cladistic methods in the 1970s had a strong impact on beetle systematics. Cladistic computer programs facilitated parsimony analyses of large data matrices, mostly morphological characters not requiring detailed anatomical investigations. Molecular studies on beetle phylogeny started in the 1990s with modest taxon sampling and limited DNA data. This has changed dramatically. With very large data sets and high throughput sampling, phylogenetic questions can be addressed without prior knowledge of morphological characters. Nevertheless, molecular studies have not lead to the great breakthrough in beetle systematics--yet. Especially the phylogeny of the extremely species rich suborder Polyphaga remains incompletely resolved. Coordinated efforts of molecular workers and of morphologists using innovative techniques may lead to more profound insights in the near future. The final aim is to develop a well-founded phylogeny, which truly reflects the evolution of this immensely species rich group of organisms. PMID:19760277

  19. Refuting phylogenetic relationships

    PubMed Central

    Bucknam, James; Boucher, Yan; Bapteste, Eric

    2006-01-01

    Background Phylogenetic methods are philosophically grounded, and so can be philosophically biased in ways that limit explanatory power. This constitutes an important methodologic dimension not often taken into account. Here we address this dimension in the context of concatenation approaches to phylogeny. Results We discuss some of the limits of a methodology restricted to verificationism, the philosophy on which gene concatenation practices generally rely. As an alternative, we describe a software which identifies and focuses on impossible or refuted relationships, through a simple analysis of bootstrap bipartitions, followed by multivariate statistical analyses. We show how refuting phylogenetic relationships could in principle facilitate systematics. We also apply our method to the study of two complex phylogenies: the phylogeny of the archaea and the phylogeny of the core of genes shared by all life forms. While many groups are rejected, our results left open a possible proximity of N. equitans and the Methanopyrales, of the Archaea and the Cyanobacteria, and as well the possible grouping of the Methanobacteriales/Methanoccocales and Thermosplasmatales, of the Spirochaetes and the Actinobacteria and of the Proteobacteria and firmicutes. Conclusion It is sometimes easier (and preferable) to decide which species do not group together than which ones do. When possible topologies are limited, identifying local relationships that are rejected may be a useful alternative to classical concatenation approaches aiming to find a globally resolved tree on the basis of weak phylogenetic markers. Reviewers This article was reviewed by Mark Ragan, Eugene V Koonin and J Peter Gogarten. PMID:16956399

  20. The revised classification of eukaryotes

    PubMed Central

    Adl, Sina M.; Simpson, Alastair. G.; Lane, Christopher E.; Lukeš, Julius; Bass, David; Bowser, Samuel S.; Brown, Matt; Burki, Fabien; Dunthorn, Micah; Hampl, Vladimir; Heiss, Aaron; Hoppenrath, Mona; Lara, Enrique; leGall, Line; Lynn, Denis H.; McManus, Hilary; Mitchell, Edward A. D.; Mozley-Stanridge, Sharon E.; Parfrey, Laura Wegener; Pawlowski, Jan; Rueckert, Sonja; Shadwick, Lora; Schoch, Conrad; Smirnov, Alexey; Spiegel, Frederick W.

    2012-01-01

    This revision of the classification of eukaryotes, which updates that of Adl et al. (2005), retains an emphasis on the protists and incorporates changes since 2005 that have resolved nodes and branches in phylogenetic trees. Whereas the previous revision was successful in re-introducing name stability to the classification, this revision provides a classification for lineages that were then still unresolved. The supergroups have withstood phylogenetic hypothesis testing with some modifications, but despite some progress, problematic nodes at the base of the eukaryotic tree still remain to be statistically resolved. Looking forward, subsequent transformations to our understanding of the diversity of life will be from the discovery of novel lineages in previously under-sampled areas and from environmental genomic information. PMID:23020233

  1. Phylogenetic systematics of microorganisms inhabiting thermal environments.

    PubMed

    Lebedinsky, A V; Chernyh, N A; Bonch-Osmolovskaya, E A

    2007-12-01

    Thermal habitats harbor specialized communities of thermophilic microorganisms, primarily prokaryotes. This review considers modern systematics of prokaryotes and the place of thermophilic archaea and bacteria in it. Among the existing hierarchical classifications of prokaryotes, the bulk of attention is given to the one accepted in the current second edition of "Bergey's Manual of Systematic Bacteriology", which is primarily based on 16S rRNA phylogeny and phenotypic properties of the organisms. Analysis of the genomics data shows that they on the whole agree with the 16S rRNA-based system, although revealing the significance of the evolutionary role of lateral transfer, duplication, and loss of genes. According to the classification elaborated in the current edition of "Bergey's Manual", the prokaryotes currently culturable under laboratory conditions are distributed among 26 phyla, two of which belong to the domain Archaea and 24 to the domain Bacteria. Six phyla contain exclusively thermophiles, and eleven phyla contain thermophiles along with mesophiles, thermophiles being usually separated phylogenetically and representing high-level taxa (classes, orders). In light of the data on the topology of the 16S rRNA-based phylogenetic tree and some other data, this review discusses the probable hyperthermophilic nature of the universal common ancestor. PMID:18205614

  2. Phylogenetic Comparative Assembly

    NASA Astrophysics Data System (ADS)

    Husemann, Peter; Stoye, Jens

    Recent high throughput sequencing technologies are capable of generating a huge amount of data for bacterial genome sequencing projects. Although current sequence assemblers successfully merge the overlapping reads, often several contigs remain which cannot be assembled any further. It is still costly and time consuming to close all the gaps in order to acquire the whole genomic sequence. Here we propose an algorithm that takes several related genomes and their phylogenetic relationships into account to create a contig adjacency graph. From this a layout graph can be computed which indicates putative adjacencies of the contigs in order to aid biologists in finishing the complete genomic sequence.

  3. ClassyFlu: classification of influenza A viruses with Discriminatively trained profile-HMMs.

    PubMed

    Van der Auwera, Sandra; Bulla, Ingo; Ziller, Mario; Pohlmann, Anne; Harder, Timm; Stanke, Mario

    2014-01-01

    Accurate and rapid characterization of influenza A virus (IAV) hemagglutinin (HA) and neuraminidase (NA) sequences with respect to subtype and clade is at the basis of extended diagnostic services and implicit to molecular epidemiologic studies. ClassyFlu is a new tool and web service for the classification of IAV sequences of the HA and NA gene into subtypes and phylogenetic clades using discriminatively trained profile hidden Markov models (HMMs), one for each subtype or clade. ClassyFlu merely requires as input unaligned, full-length or partial HA or NA DNA sequences. It enables rapid and highly accurate assignment of HA sequences to subtypes H1-H17 but particularly focusses on the finer grained assignment of sequences of highly pathogenic avian influenza viruses of subtype H5N1 according to the cladistics proposed by the H5N1 Evolution Working Group. NA sequences are classified into subtypes N1-N10. ClassyFlu was compared to semiautomatic classification approaches using BLAST and phylogenetics and additionally for H5 sequences to the new "Highly Pathogenic H5N1 Clade Classification Tool" (IRD-CT) proposed by the Influenza Research Database. Our results show that both web tools (ClassyFlu and IRD-CT), although based on different methods, are nearly equivalent in performance and both are more accurate and faster than semiautomatic classification. A retraining of ClassyFlu to altered cladistics as well as an extension of ClassyFlu to other IAV genome segments or fragments thereof is undemanding. This is exemplified by unambiguous assignment to a distinct cluster within subtype H7 of sequences of H7N9 viruses which emerged in China early in 2013 and caused more than 130 human infections. http://bioinf.uni-greifswald.de/ClassyFlu is a free web service. For local execution, the ClassyFlu source code in PERL is freely available. PMID:24404173

  4. Phenotypic and phylogenetic characterization of an abamectin-degrading bacterial strain isolated from a citrus orchard.

    PubMed

    Ali, Shinawar Waseem; Yu, Fang-Bo; Haider, Muhammad Saleem; Yan, Xin; Li, Shun-Peng

    2013-01-01

    Bacterial strain GB-01 was isolated from abamectin-contaminated soils by continuous enrichment culture. The preliminary identification of strain GB-01 as a Burkholderia species was based mainly on simple biochemical and substrate utilization tests; however, these tests alone cannot accurately differentiate all the species within the genus Burkholderia. The strain GB-01 was subjected to taxonomic analysis through a polyphasic approach, in which phenotypic, genotypic, and phylogenetic information was gathered to conclude the classification of this microbe. Phenotypic information comes from basic bacteriological tests and substrate utilization patterns using the Biolog GN2 MicroPlating system and automated miniature biochemical test kits, i.e. API 20 NE, ID 32 GN and API 50 CH, as well as analyzing the whole cell fatty acid profile. Genotypic information was gathered from whole genome DNA base composition (G+C mol%), and DNA-DNA hybridization with its closest species, while phylogenetic information was collected from the comparative analysis of 16S rRNA and recA gene sequences. The results of polyphasic analysis concluded that strain GB-01 is an atypical strain of the Burkholderia diffusa species. PMID:23863292

  5. Phylogenetic trees in bioinformatics

    SciTech Connect

    Burr, Tom L

    2008-01-01

    Genetic data is often used to infer evolutionary relationships among a collection of viruses, bacteria, animal or plant species, or other operational taxonomic units (OTU). A phylogenetic tree depicts such relationships and provides a visual representation of the estimated branching order of the OTUs. Tree estimation is unique for several reasons, including: the types of data used to represent each OTU; the use ofprobabilistic nucleotide substitution models; the inference goals involving both tree topology and branch length, and the huge number of possible trees for a given sample of a very modest number of OTUs, which implies that fmding the best tree(s) to describe the genetic data for each OTU is computationally demanding. Bioinformatics is too large a field to review here. We focus on that aspect of bioinformatics that includes study of similarities in genetic data from multiple OTUs. Although research questions are diverse, a common underlying challenge is to estimate the evolutionary history of the OTUs. Therefore, this paper reviews the role of phylogenetic tree estimation in bioinformatics, available methods and software, and identifies areas for additional research and development.

  6. Entanglement, Invariants, and Phylogenetics

    NASA Astrophysics Data System (ADS)

    Sumner, J. G.

    2007-10-01

    This thesis develops and expands upon known techniques of mathematical physics relevant to the analysis of the popular Markov model of phylogenetic trees required in biology to reconstruct the evolutionary relationships of taxonomic units from biomolecular sequence data. The techniques of mathematical physics are plethora and have been developed for some time. The Markov model of phylogenetics and its analysis is a relatively new technique where most progress to date has been achieved by using discrete mathematics. This thesis takes a group theoretical approach to the problem by beginning with a remarkable mathematical parallel to the process of scattering in particle physics. This is shown to equate to branching events in the evolutionary history of molecular units. The major technical result of this thesis is the derivation of existence proofs and computational techniques for calculating polynomial group invariant functions on a multi-linear space where the group action is that relevant to a Markovian time evolution. The practical results of this thesis are an extended analysis of the use of invariant functions in distance based methods and the presentation of a new reconstruction technique for quartet trees which is consistent with the most general Markov model of sequence evolution.

  7. CREST – Classification Resources for Environmental Sequence Tags

    PubMed Central

    Lanzén, Anders; Jørgensen, Steffen L.; Huson, Daniel H.; Gorfer, Markus; Grindhaug, Svenn Helge; Jonassen, Inge; Øvreås, Lise; Urich, Tim

    2012-01-01

    Sequencing of taxonomic or phylogenetic markers is becoming a fast and efficient method for studying environmental microbial communities. This has resulted in a steadily growing collection of marker sequences, most notably of the small-subunit (SSU) ribosomal RNA gene, and an increased understanding of microbial phylogeny, diversity and community composition patterns. However, to utilize these large datasets together with new sequencing technologies, a reliable and flexible system for taxonomic classification is critical. We developed CREST (Classification Resources for Environmental Sequence Tags), a set of resources and tools for generating and utilizing custom taxonomies and reference datasets for classification of environmental sequences. CREST uses an alignment-based classification method with the lowest common ancestor algorithm. It also uses explicit rank similarity criteria to reduce false positives and identify novel taxa. We implemented this method in a web server, a command line tool and the graphical user interfaced program MEGAN. Further, we provide the SSU rRNA reference database and taxonomy SilvaMod, derived from the publicly available SILVA SSURef, for classification of sequences from bacteria, archaea and eukaryotes. Using cross-validation and environmental datasets, we compared the performance of CREST and SilvaMod to the RDP Classifier. We also utilized Greengenes as a reference database, both with CREST and the RDP Classifier. These analyses indicate that CREST performs better than alignment-free methods with higher recall rate (sensitivity) as well as precision, and with the ability to accurately identify most sequences from novel taxa. Classification using SilvaMod performed better than with Greengenes, particularly when applied to environmental sequences. CREST is freely available under a GNU General Public License (v3) from http://apps.cbu.uib.no/crest and http://lcaclassifier.googlecode.com. PMID:23145153

  8. The Phylogenetic Diversity of Metagenomes

    PubMed Central

    Kembel, Steven W.; Eisen, Jonathan A.; Pollard, Katherine S.; Green, Jessica L.

    2011-01-01

    Phylogenetic diversity—patterns of phylogenetic relatedness among organisms in ecological communities—provides important insights into the mechanisms underlying community assembly. Studies that measure phylogenetic diversity in microbial communities have primarily been limited to a single marker gene approach, using the small subunit of the rRNA gene (SSU-rRNA) to quantify phylogenetic relationships among microbial taxa. In this study, we present an approach for inferring phylogenetic relationships among microorganisms based on the random metagenomic sequencing of DNA fragments. To overcome challenges caused by the fragmentary nature of metagenomic data, we leveraged fully sequenced bacterial genomes as a scaffold to enable inference of phylogenetic relationships among metagenomic sequences from multiple phylogenetic marker gene families. The resulting metagenomic phylogeny can be used to quantify the phylogenetic diversity of microbial communities based on metagenomic data sets. We applied this method to understand patterns of microbial phylogenetic diversity and community assembly along an oceanic depth gradient, and compared our findings to previous studies of this gradient using SSU-rRNA gene and metagenomic analyses. Bacterial phylogenetic diversity was highest at intermediate depths beneath the ocean surface, whereas taxonomic diversity (diversity measured by binning sequences into taxonomically similar groups) showed no relationship with depth. Phylogenetic diversity estimates based on the SSU-rRNA gene and the multi-gene metagenomic phylogeny were broadly concordant, suggesting that our approach will be applicable to other metagenomic data sets for which corresponding SSU-rRNA gene sequences are unavailable. Our approach opens up the possibility of using metagenomic data to study microbial diversity in a phylogenetic context. PMID:21912589

  9. Modeling body size evolution in Felidae under alternative phylogenetic hypotheses

    PubMed Central

    2009-01-01

    The use of phylogenetic comparative methods in ecological research has advanced during the last twenty years, mainly due to accurate phylogenetic reconstructions based on molecular data and computational and statistical advances. We used phylogenetic correlograms and phylogenetic eigenvector regression (PVR) to model body size evolution in 35 worldwide Felidae (Mammalia, Carnivora) species using two alternative phylogenies and published body size data. The purpose was not to contrast the phylogenetic hypotheses but to evaluate how analyses of body size evolution patterns can be affected by the phylogeny used for comparative analyses (CA). Both phylogenies produced a strong phylogenetic pattern, with closely related species having similar body sizes and the similarity decreasing with increasing distances in time. The PVR explained 65% to 67% of body size variation and all Moran's I values for the PVR residuals were non-significant, indicating that both these models explained phylogenetic structures in trait variation. Even though our results did not suggest that any phylogeny can be used for CA with the same power, or that “good” phylogenies are unnecessary for the correct interpretation of the evolutionary dynamics of ecological, biogeographical, physiological or behavioral patterns, it does suggest that developments in CA can, and indeed should, proceed without waiting for perfect and fully resolved phylogenies. PMID:21637664

  10. Insights into the evolution of sorbitol metabolism: phylogenetic analysis of SDR196C family

    PubMed Central

    2012-01-01

    Background Short chain dehydrogenases/reductases (SDR) are NAD(P)(H)-dependent oxidoreductases with a highly conserved 3D structure and of an early origin, which has allowed them to diverge into several families and enzymatic activities. The SDR196C family (http://www.sdr-enzymes.org) groups bacterial sorbitol dehydrogenases (SDH), which are of great industrial interest. In this study, we examine the phylogenetic relationship between the members of this family, and based on the findings and some sequence conserved blocks, a new and a more accurate classification is proposed. Results The distribution of the 66 bacterial SDH species analyzed was limited to Gram-negative bacteria. Six different bacterial families were found, encompassing α-, β- and γ-proteobacteria. This broad distribution in terms of bacteria and niches agrees with that of SDR, which are found in all forms of life. A cluster analysis of sorbitol dehydrogenase revealed different types of gene organization, although with a common pattern in which the SDH gene is surrounded by sugar ABC transporter proteins, another SDR, a kinase, and several gene regulators. According to the obtained trees, six different lineages and three sublineages can be discerned. The phylogenetic analysis also suggested two different origins for SDH in β-proteobacteria and four origins for γ-proteobacteria. Finally, this subdivision was further confirmed by the differences observed in the sequence of the conserved blocks described for SDR and some specific blocks of SDH, and by a functional divergence analysis, which made it possible to establish new consensus sequences and specific fingerprints for the lineages and sub lineages. Conclusion SDH distribution agrees with that observed for SDR, indicating the importance of the polyol metabolism, as an alternative source of carbon and energy. The phylogenetic analysis pointed to six clearly defined lineages and three sub lineages, and great variability in the origin of this gene, despite its well conserved 3D structure. This suggests that SDH are very old and emerged early during the evolution. This study also opens up a new and more accurate classification of SDR196C family, introducing two numbers at the end of the family name, which indicate the lineage and the sublineage of each member, i.e, SDR196C6.3. PMID:22899811

  11. Dengue virus type 3 in Brazil: a phylogenetic perspective.

    PubMed

    Araújo, Josélio Maria Galvão de; Bello, Gonzalo; Schatzmayr, Hermann Gonçalves; Santos, Flávia Barreto dos; Nogueira, Rita Maria Ribeiro

    2009-05-01

    Circulation of a new dengue virus (DENV)-3 genotype was recently described in Brazil and Colombia, but the precise classification of this genotype has been controversial. Here we perform phylogenetic and nucleotide-distance analyses of the envelope gene, which support the subdivision of DENV-3 strains into five distinct genotypes (GI to GV) and confirm the classification of the new South American genotype as GV. The extremely low genetic distances between Brazilian GV strains and the prototype Philippines/L11423 GV strain isolated in 1956 raise important questions regarding the origin of GV in South America. PMID:19547883

  12. A format for phylogenetic placements.

    PubMed

    Matsen, Frederick A; Hoffman, Noah G; Gallagher, Aaron; Stamatakis, Alexandros

    2012-01-01

    We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g., short reads) into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format, which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement. PMID:22383988

  13. A Format for Phylogenetic Placements

    PubMed Central

    Matsen, Frederick A.; Hoffman, Noah G.; Gallagher, Aaron; Stamatakis, Alexandros

    2012-01-01

    We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g., short reads) into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format, which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement. PMID:22383988

  14. A revision of infrageneric classification in Astelia Banks & Sol. ex R.Br. (Asteliaceae)

    PubMed Central

    Birch, Joanne L.

    2015-01-01

    Abstract Systematic investigations and phylogenetic analyses have indicated that Astelia, as currently circumscribed, is paraphyletic, with Collospermum nested within it. Further, Astelia subgenus Astelia is polyphyletic, and Astelia subgenera Asteliopsis and Tricella are paraphyletic, as currently circumscribed. Revision of the subgeneric classification of Astelia is warranted to ensure classification accurately reflects the evolutionary history of these taxa. Collospermum is relegated to synonymy within Astelia. Astelia is dioecious or polygamodioecious, with a superior ovary, anthers dorsi- or basifixed, pistillodes or pistils that have a single short or poorly defined style, a 3 lobed stigma, and fleshy uni- or trilocular fruit with funicular hairs that are poorly to well developed. Astelia subgenus Collospermum (Skottsb.) Birch is described. A key to Astelia sections is provided. Astelia hastata Colenso, Astelia montana Seem., and Astelia microsperma Colenso pro parte are resurrected and the new combination Astelia samoense (Skottsb.) Birch, comb. nov. is made. PMID:26312037

  15. Photometric brown-dwarf classification. II. A homogeneous sample of 1361 L and T dwarfs brighter than J = 17.5 with accurate spectral types

    NASA Astrophysics Data System (ADS)

    Skrzypek, N.; Warren, S. J.; Faherty, J. K.

    2016-04-01

    We present a homogeneous sample of 1361 L and T dwarfs brighter than J = 17.5 (of which 998 are new), from an effective area of 3070 deg2, classified by the photo-type method to an accuracy of one spectral sub-type using izYJHKW1W2 photometry from SDSS+UKIDSS+WISE. Other than a small bias in the early L types, the sample is shown to be effectively complete to the magnitude limit, for all spectral types L0 to T8. The nature of the bias is an incompleteness estimated at 3% because peculiar blue L dwarfs of type L4 and earlier are classified late M. There is a corresponding overcompleteness because peculiar red (likely young) late M dwarfs are classified early L. Contamination of the sample is confirmed to be small: so far spectroscopy has been obtained for 19 sources in the catalogue and all are confirmed to be ultracool dwarfs. We provide coordinates and izYJHKW1W2 photometry of all sources. We identify an apparent discontinuity, Δm ~ 0.4 mag, in the Y - K colour between spectral types L7 and L8. We present near-infrared spectra of nine sources identified by photo-type as peculiar, including a new low-gravity source ULAS J005505.68+013436.0, with spectroscopic classification L2γ. We provide revised izYJHKW1W2 template colours for late M dwarfs, types M7 to M9. The catalogue is only available at the CDS via anonymous ftp to http://cdsarc.u-strasbg.fr (ftp://130.79.128.5) or via http://cdsarc.u-strasbg.fr/viz-bin/qcat?J/A+A/589/A49

  16. Map Classification

    ERIC Educational Resources Information Center

    Larsgaard, Mary

    1973-01-01

    Although map classification may seem a somewhat esoteric subject, at least to those who are not map librarians, the enterprising searcher for map classification schemes can find a goodly number from which to choose. (30 references) (Author)

  17. Classification Options

    ERIC Educational Resources Information Center

    Exceptional Children, 1978

    1978-01-01

    The interview presents opinions of Nicholas Hobbs on the classification of exceptional children, including topics such as ecologically oriented classification systems, the role of parents, and need for revision of teacher preparation programs. (IM)

  18. Phylogenetic lineages in Entomophthoromycota.

    PubMed

    Gryganskyi, A P; Humber, R A; Smith, M E; Hodge, K; Huang, B; Voigt, K; Vilgalys, R

    2013-06-01

    Entomophthoromycota is one of six major phylogenetic lineages among the former phylum Zygomycota. These early terrestrial fungi share evolutionarily ancestral characters such as coenocytic mycelium and gametangiogamy as a sexual process resulting in zygospore formation. Previous molecular studies have shown the monophyly of Entomophthoromycota, thus justifying raising the taxonomic status of these fungi to a phylum. Multi-gene phylogenies have identified five major lineages of Entomophthoromycota. In this review we provide a detailed discussion about the biology and taxonomy of these lineages: I) Basidiobolus (Basidiobolomycetes: Basidiobolaceae; primarily saprobic); II) Conidiobolus (Entomophthoromycetes, Ancylistaceae; several clades of saprobes and invertebrate pathogens), as well as three rapidly evolving entomopathogenic lineages in the family Entomophthoraceae centering around; III) Batkoa; IV) Entomophthora and allied genera; and V) the subfamily Erynioideae which includes Zoophthora and allied genera. Molecular phylogenic analysis has recently determined the relationships of several taxa that were previously unresolved based on morphology alone: Eryniopsis, Macrobiotophthora, Massospora, Strongwellsea and two as yet undescribed genera of Basidiobolaceae. PMID:24027349

  19. Phylogenetic and Biogeographic Analysis of Sphaerexochine Trilobites

    PubMed Central

    Congreve, Curtis R.; Lieberman, Bruce S.

    2011-01-01

    Background Sphaerexochinae is a speciose and widely distributed group of cheirurid trilobites. Their temporal range extends from the earliest Ordovician through the Silurian, and they survived the end Ordovician mass extinction event (the second largest mass extinction in Earth history). Prior to this study, the individual evolutionary relationships within the group had yet to be determined utilizing rigorous phylogenetic methods. Understanding these evolutionary relationships is important for producing a stable classification of the group, and will be useful in elucidating the effects the end Ordovician mass extinction had on the evolutionary and biogeographic history of the group. Methodology/Principal Findings Cladistic parsimony analysis of cheirurid trilobites assigned to the subfamily Sphaerexochinae was conducted to evaluate phylogenetic patterns and produce a hypothesis of relationship for the group. This study utilized the program TNT, and the analysis included thirty-one taxa and thirty-nine characters. The results of this analysis were then used in a Lieberman-modified Brooks Parsimony Analysis to analyze biogeographic patterns during the Ordovician-Silurian. Conclusions/Significance The genus Sphaerexochus was found to be monophyletic, consisting of two smaller clades (one composed entirely of Ordovician species and another composed of Silurian and Ordovician species). By contrast, the genus Kawina was found to be paraphyletic. It is a basal grade that also contains taxa formerly assigned to Cydonocephalus. Phylogenetic patterns suggest Sphaerexochinae is a relatively distinctive trilobite clade because it appears to have been largely unaffected by the end Ordovician mass extinction. Finally, the biogeographic analysis yields two major conclusions about Sphaerexochus biogeography: Bohemia and Avalonia were close enough during the Silurian to exchange taxa; and during the Ordovician there was dispersal between Eastern Laurentia and the Yangtze block (South China) and between Eastern Laurentia and Avalonia. PMID:21738632

  20. High-resolution phylogenetic microbial community profiling

    SciTech Connect

    Singer, Esther; Coleman-Derr, Devin; Bowman, Brett; Schwientek, Patrick; Clum, Alicia; Copeland, Alex; Ciobanu, Doina; Cheng, Jan-Fang; Gies, Esther; Hallam, Steve; Tringe, Susannah; Woyke, Tanja

    2014-03-17

    The representation of bacterial and archaeal genome sequences is strongly biased towards cultivated organisms, which belong to merely four phylogenetic groups. Functional information and inter-phylum level relationships are still largely underexplored for candidate phyla, which are often referred to as microbial dark matter. Furthermore, a large portion of the 16S rRNA gene records in the GenBank database are labeled as environmental samples and unclassified, which is in part due to low read accuracy, potential chimeric sequences produced during PCR amplifications and the low resolution of short amplicons. In order to improve the phylogenetic classification of novel species and advance our knowledge of the ecosystem function of uncultivated microorganisms, high-throughput full length 16S rRNA gene sequencing methodologies with reduced biases are needed. We evaluated the performance of PacBio single-molecule real-time (SMRT) sequencing in high-resolution phylogenetic microbial community profiling. For this purpose, we compared PacBio and Illumina metagenomic shotgun and 16S rRNA gene sequencing of a mock community as well as of an environmental sample from Sakinaw Lake, British Columbia. Sakinaw Lake is known to contain a large age of microbial species from candidate phyla. Sequencing results show that community structure based on PacBio shotgun and 16S rRNA gene sequences is highly similar in both the mock and the environmental communities. Resolution power and community representation accuracy from SMRT sequencing data appeared to be independent of GC content of microbial genomes and was higher when compared to Illumina-based metagenome shotgun and 16S rRNA gene (iTag) sequences, e.g. full-length sequencing resolved all 23 OTUs in the mock community, while iTags did not resolve closely related species. SMRT sequencing hence offers various potential benefits when characterizing uncharted microbial communities.

  1. Classification of papillomaviruses.

    PubMed

    de Villiers, Ethel-Michele; Fauquet, Claude; Broker, Thomas R; Bernard, Hans-Ulrich; zur Hausen, Harald

    2004-06-20

    One hundred eighteen papillomavirus (PV) types have been completely described, and a yet higher number of presumed new types have been detected by preliminary data such as subgenomic amplicons. The classification of this diverse group of viruses, which include important human pathogens, has been debated for three decades. This article describes the higher-order PV taxonomy following the general criteria established by the International Committee on the Taxonomy of Viruses (ICTV), reviews the literature of the lower order taxa, lists all known "PV types", and interprets their phylogenetic relationship. PVs are a taxonomic family of their own, Papillomaviridae, unrelated to the polyomaviruses. Higher-order phylogenetic assemblages of PV types, such as the "genital human PVs", are considered a genus, the latter group, for example, the genus "Alpha-Papillomavirus". Lower-order assemblages of PV types within each genus are treated as species because they are phylogenetically closely related, but while they have distinct genomic sequences, they have identical or very similar biological or pathological properties. The taxonomic status of PV types, subtypes, and variants remains unchanged and is based on the traditional criteria that the sequence of their L1 genes should be at least 10%, 2-10%, and maximally 2% dissimilar from one another. PMID:15183049

  2. Phylogenetics and the Human Microbiome

    PubMed Central

    Matsen, Frederick A.

    2015-01-01

    The human microbiome is the ensemble of genes in the microbes that live inside and on the surface of humans. Because microbial sequencing information is now much easier to come by than phenotypic information, there has been an explosion of sequencing and genetic analysis of microbiome samples. Much of the analytical work for these sequences involves phylogenetics, at least indirectly, but methodology has developed in a somewhat different direction than for other applications of phylogenetics. In this article, I review the field and its methods from the perspective of a phylogeneticist, as well as describing current challenges for phylogenetics coming from this type of work. PMID:25102857

  3. [Foundations of the new phylogenetics].

    PubMed

    Pavlinov, I Ia

    2004-01-01

    Evolutionary idea is the core of the modern biology. Due to this, phylogenetics dealing with historical reconstructions in biology takes a priority position among biological disciplines. The second half of the 20th century witnessed growth of a great interest to phylogenetic reconstructions at macrotaxonomic level which replaced microevolutionary studies dominating during the 30s-60s. This meant shift from population thinking to phylogenetic one but it was not revival of the classical phylogenetics; rather, a new approach emerged that was baptized The New Phylogenetics. It arose as a result of merging of three disciplines which were developing independently during 60s-70s, namely cladistics, numerical phyletics, and molecular phylogenetics (now basically genophyletics). Thus, the new phylogenetics could be defined as a branch of evolutionary biology aimed at elaboration of "parsimonious" cladistic hypotheses by means of numerical methods on the basis of mostly molecular data. Classical phylogenetics, as a historical predecessor of the new one, emerged on the basis of the naturphilosophical worldview which included a superorganismal idea of biota. Accordingly to that view, historical development (the phylogeny) was thought an analogy of individual one (the ontogeny) so its most basical features were progressive parallel developments of "parts" (taxa), supplemented with Darwinian concept of monophyly. Two predominating traditions were diverged within classical phylogenetics according to a particular interpretation of relation between these concepts. One of them (Cope, Severtzow) belittled monophyly and paid most attention to progressive parallel developments of morphological traits. Such an attitude turned this kind of phylogenetics to be rather the semogenetics dealing primarily with evolution of structures and not of taxa. Another tradition (Haeckel) considered both monophyletic and parallel origins of taxa jointly: in the middle of 20th century it was split into phylistics (Rasnitsyn's term; close to Simpsonian evolutionary taxonomy) belonging rather to the classical realm, and Hennigian cladistics that pays attention to origin of monophyletic taxa exclusively. In early of the 20th century, microevolutionary doctrine became predominating in evolutionary studies. Its core is the population thinking accompanied by the phenetic one based on equation of kinship to overall similarity. They were connected to positivist philosophy and hence were characterized by reductionism at both ontological and epistemological levels. It led to fall of classical phylogenetics but created the prerequisites for the new phylogenetics which also appeared to be full of reductionism. The new rise of phylogenetic (rather than tree) thinking during the last third of the 20th century was caused by lost of explanatory power of population one and by development of the new worldview and new epistemological premises. That new worldview is based on the synergetic (Prigoginian) model of development of non-equilibrium systems: evolution of the biota, a part of which is phylogeny, is considered as such a development. At epistemological level, the principal premise appeared to be fall of positivism which was replaced by post-positivism argumentation schemes. Input of cladistics into new phylogenetics is twofold. On the one hand, it reduced phylogeny to cladistic history lacking any adaptivist interpretation and presuming minimal evolution model. From this it followed reduction of kinship relation to sister-group relation lacking any reference to real time scale and to ancestor-descendant relation. On the other hand, cladistics elaborated methodology of phylogenetic reconstructions based on the synapomorphy principle, the outgroup concept became its part. The both inputs served as premises of incorporation of both numerical techniques and molecular data into phylogenetic reconstruction. Numerical phyletics provided the new phylogenetics with easily manipulated algorithms of cladogram construing and thus made phylogenetic reconstructions operational and repetitive. The above phenetic formula "kinship = similarity" appeared to be a keystone for development of the genophyletics. Within numerical phyletics, a lot of computer programs were elaborated which allow to manipulate with evolutionary scenario during phylogenetic reconstructions. They make it possible to reconstruct both clado- and semogeneses based on the same formalized methods. Multiplicity of numerical approaches indicates that, just as in the case of numerical phenetics, choice of adequate method(s) should be based on biologically sound theory. The main input of genophyletics (= molecular phylogenetics) into the new phylogenetics was due to completely new factology which makes it possible to compare directly such far distant taxa as prokaryotes and higher eukaryotes. Genophyletics is based on the theory of neutral evolution borrowed from microevolutionary theory and on the molecular clock hypothesis which is now considered largely inadequate. The future developments of genophyletics will be aimed at clarification of such fundamental (and "classical" by origin) problems as application of character and homology concepts to molecular structures. The new phylogenetics itself is differentiated into several schools caused basically by diversity of various approaches existing within each of its "roots". Cladistics makes new phylogenetics splitted into evolutionary and parsimonious ontological viewpoints. Numerical phyletics divides it into statistical and (again) parsimonious methodologies. Molecular phylogenetics is opposite by its factological basis to morphological one. The new phylogenetics has significance impact onto the "newest" systematics. From one side, it gives ontological status back to macrotaxa they have lost due to "new" systematics based on population thinking. From another side, it rejects some basical principles of classical phylogenetic (originally Linnean) taxonomy such as recognitions of fixed taxonomic ranks designated by respective terms and definition of taxic names not by the diagnostic characters but by reference to the ancestor. The latter makes the PhyloCode overburdened ideologically and the "newest" systematics self-controversial, as concept of ancestor has been acknowledged non-operational from the very beginning of cladistics. Relation between classical and new phylogenetics is twofold. At the one hand, general phylogenetic hypothesis (in its classical sense) can be treated as a combination of cladogenetic and semogenetic reconstructions. Such a consideration is bound to pay close attention to the uncertainty relation principle which, in case of the phylogenetics, means that the general phylogenetic hypothesis cannot be more certain than any of initial cladogenetic or semogenetic hypotheses. From this standpoint, the new phylogenetics makes it possible to reconstruct phylogeny following epistemological principle "from simple to complex". It elaborates a kind of null hypotheses about evolutionary history which are more easy to test as compared to classical hypotheses. Afterward, such hypotheses are possible to be completed toward the classical, more content-wise ones by adding anagenetic information to the cladogenetic one. At another hand, reconstructions elaborated within the new phylogenetics could be considered as specific null hypotheses about both clado- and semogeneses. They are to be tested subsequently by mean of various models, including those borrowed from "classical" morphology. The future development of the new phylogenetics is supposed to be connected with getting out of plethora of reductionism inherited by it from population thinking and specification of object domain of the phylogenetics. As the latter is a part of an evolutionary theory, its future developments will be adjusted with the latter. Lately predominating neodarwinism is now being replaced by the epigenetic evolutionary theory to which phylistics (one of the modern versions of classical phylogenetics) seems to be more correspondent. PMID:15490579

  4. A phylogenetic analysis of the myxobacteria: basis for their classification

    NASA Technical Reports Server (NTRS)

    Shimkets, L.; Woese, C. R.

    1992-01-01

    The primary sequence and secondary structural features of the 16S rRNA were compared for 12 different myxobacteria representing all the known cultivated genera. Analysis of these data show the myxobacteria to form a monophyletic grouping consisting of three distinct families, which lies within the delta subdivision of the purple bacterial phylum. The composition of the families is consistent with differences in cell and spore morphology, cell behavior, and pigment and secondary metabolite production but is not correlated with the morphological complexity of the fruiting bodies. The Nannocystis exedens lineage has evolved at an unusually rapid pace and its rRNA shows numerous primary and secondary structural idiosyncrasies.

  5. Phylogenetic Studies and Modern Classification of the Pyraloidea (Lepidoptera)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pyraloidea, the third largest superfamily of the Lepidoptera, is comprised of two families - Pyralidae and Crambidae. The history of families previously placed in the Pyraloidea is discussed. The group now includes about 16,000 species worldwide. Morphologically, the superfamily is defined by a b...

  6. Community Phylogenetics: Assessing Tree Reconstruction Methods and the Utility of DNA Barcodes

    PubMed Central

    Boyle, Elizabeth E.; Adamowicz, Sarah J.

    2015-01-01

    Studies examining phylogenetic community structure have become increasingly prevalent, yet little attention has been given to the influence of the input phylogeny on metrics that describe phylogenetic patterns of co-occurrence. Here, we examine the influence of branch length, tree reconstruction method, and amount of sequence data on measures of phylogenetic community structure, as well as the phylogenetic signal (Pagel’s λ) in morphological traits, using Trichoptera larval communities from Churchill, Manitoba, Canada. We find that model-based tree reconstruction methods and the use of a backbone family-level phylogeny improve estimations of phylogenetic community structure. In addition, trees built using the barcode region of cytochrome c oxidase subunit I (COI) alone accurately predict metrics of phylogenetic community structure obtained from a multi-gene phylogeny. Input tree did not alter overall conclusions drawn for phylogenetic signal, as significant phylogenetic structure was detected in two body size traits across input trees. As the discipline of community phylogenetics continues to expand, it is important to investigate the best approaches to accurately estimate patterns. Our results suggest that emerging large datasets of DNA barcode sequences provide a vast resource for studying the structure of biological communities. PMID:26110886

  7. A mixed branch length model of heterotachy improves phylogenetic accuracy.

    PubMed

    Kolaczkowski, Bryan; Thornton, Joseph W

    2008-06-01

    Evolutionary relationships are typically inferred from molecular sequence data using a statistical model of the evolutionary process. When the model accurately reflects the underlying process, probabilistic phylogenetic methods recover the correct relationships with high accuracy. There is ample evidence, however, that models commonly used today do not adequately reflect real-world evolutionary dynamics. Virtually all contemporary models assume that relatively fast-evolving sites are fast across the entire tree, whereas slower sites always evolve at relatively slower rates. Many molecular sequences, however, exhibit site-specific changes in evolutionary rates, called "heterotachy." Here we examine the accuracy of 2 phylogenetic methods for incorporating heterotachy, the mixed branch length model--which incorporates site-specific rate changes by summing likelihoods over multiple sets of branch lengths on the same tree--and the covarion model, which uses a hidden Markov process to allow sites to switch between variable and invariable as they evolve. Under a variety of simple heterogeneous simulation conditions, the mixed model was dramatically more accurate than homotachous models, which were subject to topological biases as well as biases in branch length estimates. When data were simulated with strong versions of the types of heterotachy observed in real molecular sequences, the mixed branch length model was more accurate than homotachous techniques. Analyses of empirical data sets confirmed that the mixed branch length model can improve phylogenetic accuracy under conditions that cause homotachous models to fail. In contrast, the covarion model did not improve phylogenetic accuracy compared with homotachous models and was sometimes substantially less accurate. We conclude that a mixed branch length approach, although not the solution to all phylogenetic errors, is a valuable strategy for improving the accuracy of inferred trees. PMID:18319244

  8. Phylogenetic lineages in Pseudocercospora

    PubMed Central

    Crous, P.W.; Braun, U.; Hunter, G.C.; Wingfield, M.J.; Verkley, G.J.M.; Shin, H.-D.; Nakashima, C.; Groenewald, J.Z.

    2013-01-01

    Pseudocercospora is a large cosmopolitan genus of plant pathogenic fungi that are commonly associated with leaf and fruit spots as well as blights on a wide range of plant hosts. They occur in arid as well as wet environments and in a wide range of climates including cool temperate, sub-tropical and tropical regions. Pseudocercospora is now treated as a genus in its own right, although formerly recognised as either an anamorphic state of Mycosphaerella or having mycosphaerella-like teleomorphs. The aim of this study was to sequence the partial 28S nuclear ribosomal RNA gene of a selected set of isolates to resolve phylogenetic generic limits within the Pseudocercospora complex. From these data, 14 clades are recognised, six of which cluster in Mycosphaerellaceae. Pseudocercospora s. str. represents a distinct clade, sister to Passalora eucalypti, and a clade representing the genera Scolecostigmina, Trochophora and Pallidocercospora gen. nov., taxa formerly accommodated in the Mycosphaerella heimii complex and characterised by smooth, pale brown conidia, as well as the formation of red crystals in agar media. Other clades in Mycosphaerellaceae include Sonderhenia, Microcyclosporella, and Paracercospora. Pseudocercosporella resides in a large clade along with Phloeospora, Miuraea, Cercospora and Septoria. Additional clades represent Dissoconiaceae, Teratosphaeriaceae, Cladosporiaceae, and the genera Xenostigmina, Strelitziana, Cyphellophora and Thedgonia. The genus Phaeomycocentrospora is introduced to accommodate Mycocentrospora cantuariensis, primarily distinguished from Pseudocercospora based on its hyaline hyphae, broad conidiogenous loci and hila. Host specificity was considered for 146 species of Pseudocercospora occurring on 115 host genera from 33 countries. Partial nucleotide sequence data for three gene loci, ITS, EF-1α, and ACT suggest that the majority of these species are host specific. Species identified on the basis of host, symptomatology and general morphology, within the same geographic region, frequently differed phylogenetically, indicating that the application of European and American names to Asian taxa, and vice versa, was often not warranted. Taxonomic novelties: New genera - Pallidocercospora Crous, Phaeomycocentrospora Crous, H.D. Shin & U. Braun; New species - Cercospora eucommiae Crous, U. Braun & H.D. Shin, Microcyclospora quercina Crous & Verkley, Pseudocercospora ampelopsis Crous, U. Braun & H.D. Shin, Pseudocercospora cercidicola Crous, U. Braun & C. Nakash., Pseudocercospora crispans G.C. Hunter & Crous, Pseudocercospora crocea Crous, U. Braun, G.C. Hunter & H.D. Shin, Pseudocercospora haiweiensis Crous & X. Zhou, Pseudocercospora humulicola Crous, U. Braun & H.D. Shin, Pseudocercospora marginalis G.C. Hunter, Crous, U. Braun & H.D. Shin, Pseudocercospora ocimi-basilici Crous, M.E. Palm & U. Braun, Pseudocercospora plectranthi G.C. Hunter, Crous, U. Braun & H.D. Shin, Pseudocercospora proteae Crous, Pseudocercospora pseudostigmina-platani Crous, U. Braun & H.D. Shin, Pseudocercospora pyracanthigena Crous, U. Braun & H.D. Shin, Pseudocercospora ravenalicola G.C. Hunter & Crous, Pseudocercospora rhamnellae G.C. Hunter, H.D. Shin, U. Braun & Crous, Pseudocercospora rhododendri-indici Crous, U. Braun & H.D. Shin, Pseudocercospora tibouchinigena Crous & U. Braun, Pseudocercospora xanthocercidis Crous, U. Braun & A. Wood, Pseudocercosporella koreana Crous, U. Braun & H.D. Shin; New combinations - Pallidocercospora acaciigena (Crous & M.J. Wingf.) Crous & M.J. Wingf., Pallidocercospora crystallina (Crous & M.J. Wingf.) Crous & M.J. Wingf., Pallidocercospora heimii (Crous) Crous, Pallidocercospora heimioides (Crous & M.J. Wingf.) Crous & M.J. Wingf., Pallidocercospora holualoana (Crous, Joanne E. Taylor & M.E. Palm) Crous, Pallidocercospora konae (Crous, Joanne E. Taylor & M.E. Palm) Crous, Pallidoocercospora irregulariramosa (Crous & M.J. Wingf.) Crous & M.J. Wingf., Phaeomycocentrospora cantuariensis (E.S. Salmon & Wormald) Crous, H.D. Shin & U. Braun, Pseudocercospora hakeae (U. Braun & Crous) U. Braun & Crous, Pseudocercospora leucadendri (Cooke) U. Braun & Crous, Pseudocercospora snelliana (Reichert) U. Braun, H.D. Shin, C. Nakash. & Crous, Pseudocercosporella chaenomelis (Y. Suto) C. Nakash., Crous, U. Braun & H.D. Shin; Typifications: Epitypifications - Pseudocercospora angolensis (T. Carvalho & O. Mendes) Crous & U. Braun, Pseudocercospora araliae (Henn.) Deighton, Pseudocercospora cercidis-chinensis H.D. Shin & U. Braun, Pseudocercospora corylopsidis (Togashi & Katsuki) C. Nakash. & Tak. Kobay., Pseudocercospora dovyalidis (Chupp & Doidge) Deighton, Pseudocercospora fukuokaensis (Chupp) X.J. Liu & Y.L. Guo, Pseudocercospora humuli (Hori) Y.L. Guo & X.J. Liu, Pseudocercospora kiggelariae (Syd.) Crous & U. Braun, Pseudocercospora lyoniae (Katsuki & Tak. Kobay.) Deighton, Pseudocercospora lythri H.D. Shin & U. Braun, Pseudocercospora sambucigena U. Braun, Crous & K. Schub., Pseudocercospora stephanandrae (Tak. Kobay. & H. Horie) C. Nakash. & Tak. Kobay., Pseudocercospora viburnigena U. Braun & Crous, Pseudocercosporella chaenomelis (Y. Suto) C. Nakash., Crous, U. Braun & H.D. Shin, Xenostigmina zilleri (A. Funk) Crous; Lectotypification - Pseudocercospora ocimicola (Petr. & Cif.) Deighton; Neotypifications - Pseudocercospora kiggelariae (Syd.) Crous & U. Braun, Pseudocercospora lonicericola (W. Yamam.) Deighton, Pseudocercospora zelkovae (Hori) X.J. Liu & Y.L. Guo. PMID:24014898

  9. Uncovering epidemiological dynamics in heterogeneous host populations using phylogenetic methods.

    PubMed

    Stadler, Tanja; Bonhoeffer, Sebastian

    2013-03-19

    Host population structure has a major influence on epidemiological dynamics. However, in particular for sexually transmitted diseases, quantitative data on population contact structure are hard to obtain. Here, we introduce a new method that quantifies host population structure based on phylogenetic trees, which are obtained from pathogen genetic sequence data. Our method is based on a maximum-likelihood framework and uses a multi-type branching process, under which each host is assigned to a type (subpopulation). In a simulation study, we show that our method produces accurate parameter estimates for phylogenetic trees in which each tip is assigned to a type, as well for phylogenetic trees in which the type of the tip is unknown. We apply the method to a Latvian HIV-1 dataset, quantifying the impact of the intravenous drug user epidemic on the heterosexual epidemic (known tip states), and identifying superspreader dynamics within the men-having-sex-with-men epidemic (unknown tip states). PMID:23382421

  10. Phylogenetic Incongruence in E. coli O104: Understanding the Evolutionary Relationships of Emerging Pathogens in the Face of Homologous Recombination

    PubMed Central

    Hao, Weilong; Allen, Vanessa G.; Jamieson, Frances B.; Low, Donald E.; Alexander, David C.

    2012-01-01

    Escherichia coli O104:H4 was identified as an emerging pathogen during the spring and summer of 2011 and was responsible for a widespread outbreak that resulted in the deaths of 50 people and sickened over 4075. Traditional phenotypic and genotypic assays, such as serotyping, pulsed field gel electrophoresis (PFGE), and multilocus sequence typing (MLST), permit identification and classification of bacterial pathogens, but cannot accurately resolve relationships among genotypically similar but pathotypically different isolates. To understand the evolutionary origins of E. coli O104:H4, we sequenced two strains isolated in Ontario, Canada. One was epidemiologically linked to the 2011 outbreak, and the second, unrelated isolate, was obtained in 2010. MLST analysis indicated that both isolates are of the same sequence type (ST678), but whole-genome sequencing revealed differences in chromosomal and plasmid content. Through comprehensive phylogenetic analysis of five O104:H4 ST678 genomes, we identified 167 genes in three gene clusters that have undergone homologous recombination with distantly related E. coli strains. These recombination events have resulted in unexpectedly high sequence diversity within the same sequence type. Failure to recognize or adjust for homologous recombination can result in phylogenetic incongruence. Understanding the extent of homologous recombination among different strains of the same sequence type may explain the pathotypic differences between the ON2010 and ON2011 strains and help shed new light on the emergence of this new pathogen. PMID:22493677

  11. Contextual classification of multispectral image data: Approximate algorithm

    NASA Technical Reports Server (NTRS)

    Tilton, J. C. (Principal Investigator)

    1980-01-01

    An approximation to a classification algorithm incorporating spatial context information in a general, statistical manner is presented which is computationally less intensive. Classifications that are nearly as accurate are produced.

  12. Accurate Transposable Element Annotation Is Vital When Analyzing New Genome Assemblies.

    PubMed

    Platt, Roy N; Blanco-Berdugo, Laura; Ray, David A

    2016-01-01

    Transposable elements (TEs) are mobile genetic elements with the ability to replicate themselves throughout the host genome. In some taxa TEs reach copy numbers in hundreds of thousands and can occupy more than half of the genome. The increasing number of reference genomes from nonmodel species has begun to outpace efforts to identify and annotate TE content and methods that are used vary significantly between projects. Here, we demonstrate variation that arises in TE annotations when less than optimal methods are used. We found that across a variety of taxa, the ability to accurately identify TEs based solely on homology decreased as the phylogenetic distance between the queried genome and a reference increased. Next we annotated repeats using homology alone, as is often the case in new genome analyses, and a combination of homology and de novo methods as well as an additional manual curation step. Reannotation using these methods identified a substantial number of new TE subfamilies in previously characterized genomes, recognized a higher proportion of the genome as repetitive, and decreased the average genetic distance within TE families, implying recent TE accumulation. Finally, these finding-increased recognition of younger TEs-were confirmed via an analysis of the postman butterfly (Heliconius melpomene). These observations imply that complete TE annotation relies on a combination of homology and de novo-based repeat identification, manual curation, and classification and that relying on simple, homology-based methods is insufficient to accurately describe the TE landscape of a newly sequenced genome. PMID:26802115

  13. Accurate Transposable Element Annotation Is Vital When Analyzing New Genome Assemblies

    PubMed Central

    Platt, Roy N.; Blanco-Berdugo, Laura; Ray, David A.

    2016-01-01

    Transposable elements (TEs) are mobile genetic elements with the ability to replicate themselves throughout the host genome. In some taxa TEs reach copy numbers in hundreds of thousands and can occupy more than half of the genome. The increasing number of reference genomes from nonmodel species has begun to outpace efforts to identify and annotate TE content and methods that are used vary significantly between projects. Here, we demonstrate variation that arises in TE annotations when less than optimal methods are used. We found that across a variety of taxa, the ability to accurately identify TEs based solely on homology decreased as the phylogenetic distance between the queried genome and a reference increased. Next we annotated repeats using homology alone, as is often the case in new genome analyses, and a combination of homology and de novo methods as well as an additional manual curation step. Reannotation using these methods identified a substantial number of new TE subfamilies in previously characterized genomes, recognized a higher proportion of the genome as repetitive, and decreased the average genetic distance within TE families, implying recent TE accumulation. Finally, these finding—increased recognition of younger TEs—were confirmed via an analysis of the postman butterfly (Heliconius melpomene). These observations imply that complete TE annotation relies on a combination of homology and de novo–based repeat identification, manual curation, and classification and that relying on simple, homology-based methods is insufficient to accurately describe the TE landscape of a newly sequenced genome. PMID:26802115

  14. Hubble Classification

    NASA Astrophysics Data System (ADS)

    Murdin, P.

    2000-11-01

    A classification scheme for galaxies, devised in its original form in 1925 by Edwin P Hubble (1889-1953), and still widely used today. The Hubble classification recognizes four principal types of galaxy—elliptical, spiral, barred spiral and irregular—and arranges these in a sequence that is called the tuning-fork diagram....

  15. Endodontic classification.

    PubMed

    Morse, D R; Seltzer, S; Sinai, I; Biron, G

    1977-04-01

    Clinical and histopathologic findings are mixed in current endodontic classifications. A new system, based on symptomatology, may be more useful in clincial practice. The classifications are vital asymptomatic, hypersensitive dentin, inflamed-reversible, inflamed/dengenerating without area-irreversible, inflamed/degenerating with area-irreversible, necrotic without area, and necrotic with area. PMID:265327

  16. Phylogenetic reconstruction methods: an overview.

    PubMed

    De Bruyn, Alexandre; Martin, Darren P; Lefeuvre, Pierre

    2014-01-01

    Initially designed to infer evolutionary relationships based on morphological and physiological characters, phylogenetic reconstruction methods have greatly benefited from recent developments in molecular biology and sequencing technologies with a number of powerful methods having been developed specifically to infer phylogenies from macromolecular data. This chapter, while presenting an overview of basic concepts and methods used in phylogenetic reconstruction, is primarily intended as a simplified step-by-step guide to the construction of phylogenetic trees from nucleotide sequences using fairly up-to-date maximum likelihood methods implemented in freely available computer programs. While the analysis of chloroplast sequences from various Vanilla species is used as an illustrative example, the techniques covered here are relevant to the comparative analysis of homologous sequences datasets sampled from any group of organisms. PMID:24415479

  17. Two issues in archaeological phylogenetics: taxon construction and outgroup selection.

    PubMed

    O'Brien, Michael J; Lyman, R Lee; Saab, Youssef; Saab, Elias; Darwent, John; Glover, Daniel S

    2002-03-21

    Cladistics is widely used in biology and paleobiology to construct phylogenetic hypotheses, but rarely has it been applied outside those disciplines. There is, however, no reason to suppose that cladistics is not applicable to anything that evolves by cladogenesis and produces a nested hierarchy of taxa. This includes cultural phenomena such as languages and tools recovered from archaeological contexts. Two methodological issues assume primacy in attempts to extend cladistics to archaeological materials: the construction of analytical taxa and the selection of appropriate outgroups. In biology the species is the primary taxonomic unit used, irrespective of the debates that have arisen in phylogenetic theory over the nature of species. Also in biology the phylogenetic history of a group of taxa usually is well enough known that an appropriate taxon can be selected as an outgroup. No analytical unit parallel to the species exists in archaeology, and thus taxa have to be constructed specifically for phylogenetic analysis. One method of constructing taxa is paradigmatic classification, which defines classes (taxa) on the basis of co-occurring, unweighted character states. Once classes have been created, a form of occurrence seriation-an archaeological method based on the theory of cultural transmission and heritability-offers an objective basis for selecting an outgroup. PMID:12051970

  18. Interpreting the universal phylogenetic tree

    PubMed Central

    Woese, Carl R.

    2000-01-01

    The universal phylogenetic tree not only spans all extant life, but its root and earliest branchings represent stages in the evolutionary process before modern cell types had come into being. The evolution of the cell is an interplay between vertically derived and horizontally acquired variation. Primitive cellular entities were necessarily simpler and more modular in design than are modern cells. Consequently, horizontal gene transfer early on was pervasive, dominating the evolutionary dynamic. The root of the universal phylogenetic tree represents the first stage in cellular evolution when the evolving cell became sufficiently integrated and stable to the erosive effects of horizontal gene transfer that true organismal lineages could exist. PMID:10900003

  19. Interpreting the universal phylogenetic tree

    NASA Technical Reports Server (NTRS)

    Woese, C. R.

    2000-01-01

    The universal phylogenetic tree not only spans all extant life, but its root and earliest branchings represent stages in the evolutionary process before modern cell types had come into being. The evolution of the cell is an interplay between vertically derived and horizontally acquired variation. Primitive cellular entities were necessarily simpler and more modular in design than are modern cells. Consequently, horizontal gene transfer early on was pervasive, dominating the evolutionary dynamic. The root of the universal phylogenetic tree represents the first stage in cellular evolution when the evolving cell became sufficiently integrated and stable to the erosive effects of horizontal gene transfer that true organismal lineages could exist.

  20. Grading More Accurately

    ERIC Educational Resources Information Center

    Rom, Mark Carl

    2011-01-01

    Grades matter. College grading systems, however, are often ad hoc and prone to mistakes. This essay focuses on one factor that contributes to high-quality grading systems: grading accuracy (or "efficiency"). I proceed in several steps. First, I discuss the elements of "efficient" (i.e., accurate) grading. Next, I present analytical results…

  1. Molecular identification of hepatitis B virus genotypes/subgenotypes: Revised classification hurdles and updated resolutions

    PubMed Central

    Pourkarim, Mahmoud Reza; Amini-Bavil-Olyaee, Samad; Kurbanov, Fuat; Van Ranst, Marc; Tacke, Frank

    2014-01-01

    The clinical course of infections with the hepatitis B virus (HBV) substantially varies between individuals, as a consequence of a complex interplay between viral, host, environmental and other factors. Due to the high genetic variability of HBV, the virus can be categorized into different HBV genotypes and subgenotypes, which considerably differ with respect to geographical distribution, transmission routes, disease progression, responses to antiviral therapy or vaccination, and clinical outcome measures such as cirrhosis or hepatocellular carcinoma. However, HBV (sub)genotyping has caused some controversies in the past due to misclassifications and incorrect interpretations of different genotyping methods. Thus, an accurate, holistic and dynamic classification system is essential. In this review article, we aimed at highlighting potential pitfalls in genetic and phylogenetic analyses of HBV and suggest novel terms for HBV classification. Analyzing full-length genome sequences when classifying genotypes and subgenotypes is the foremost prerequisite of this classification system. Careful attention must be paid to all aspects of phylogenetic analysis, such as bootstrapping values and meeting the necessary thresholds for (sub)genotyping. Quasi-subgenotype refers to subgenotypes that were incorrectly suggested to be novel. As many of these strains were misclassified due to genetic differences resulting from recombination, we propose the term “recombino-subgenotype”. Moreover, immigration is an important confounding facet of global HBV distribution and substantially changes the geographic pattern of HBV (sub)genotypes. We therefore suggest the term “immigro-subgenotype” to distinguish exotic (sub)genotypes from native ones. We are strongly convinced that applying these two proposed terms in HBV classification will help harmonize this rapidly progressing field and allow for improved prophylaxis, diagnosis and treatment. PMID:24966586

  2. Phylogenetic Analysis of a Spontaneous Cocoa Bean Fermentation Metagenome Reveals New Insights into Its Bacterial and Fungal Community Diversity

    PubMed Central

    Illeghems, Koen; De Vuyst, Luc; Papalexandratou, Zoi; Weckx, Stefan

    2012-01-01

    This is the first report on the phylogenetic analysis of the community diversity of a single spontaneous cocoa bean box fermentation sample through a metagenomic approach involving 454 pyrosequencing. Several sequence-based and composition-based taxonomic profiling tools were used and evaluated to avoid software-dependent results and their outcome was validated by comparison with previously obtained culture-dependent and culture-independent data. Overall, this approach revealed a wider bacterial (mainly γ-Proteobacteria) and fungal diversity than previously found. Further, the use of a combination of different classification methods, in a software-independent way, helped to understand the actual composition of the microbial ecosystem under study. In addition, bacteriophage-related sequences were found. The bacterial diversity depended partially on the methods used, as composition-based methods predicted a wider diversity than sequence-based methods, and as classification methods based solely on phylogenetic marker genes predicted a more restricted diversity compared with methods that took all reads into account. The metagenomic sequencing analysis identified Hanseniaspora uvarum, Hanseniaspora opuntiae, Saccharomyces cerevisiae, Lactobacillus fermentum, and Acetobacter pasteurianus as the prevailing species. Also, the presence of occasional members of the cocoa bean fermentation process was revealed (such as Erwinia tasmaniensis, Lactobacillus brevis, Lactobacillus casei, Lactobacillus rhamnosus, Lactococcus lactis, Leuconostoc mesenteroides, and Oenococcus oeni). Furthermore, the sequence reads associated with viral communities were of a restricted diversity, dominated by Myoviridae and Siphoviridae, and reflecting Lactobacillus as the dominant host. To conclude, an accurate overview of all members of a cocoa bean fermentation process sample was revealed, indicating the superiority of metagenomic sequencing over previously used techniques. PMID:22666442

  3. Phylogenetic analysis of a spontaneous cocoa bean fermentation metagenome reveals new insights into its bacterial and fungal community diversity.

    PubMed

    Illeghems, Koen; De Vuyst, Luc; Papalexandratou, Zoi; Weckx, Stefan

    2012-01-01

    This is the first report on the phylogenetic analysis of the community diversity of a single spontaneous cocoa bean box fermentation sample through a metagenomic approach involving 454 pyrosequencing. Several sequence-based and composition-based taxonomic profiling tools were used and evaluated to avoid software-dependent results and their outcome was validated by comparison with previously obtained culture-dependent and culture-independent data. Overall, this approach revealed a wider bacterial (mainly γ-Proteobacteria) and fungal diversity than previously found. Further, the use of a combination of different classification methods, in a software-independent way, helped to understand the actual composition of the microbial ecosystem under study. In addition, bacteriophage-related sequences were found. The bacterial diversity depended partially on the methods used, as composition-based methods predicted a wider diversity than sequence-based methods, and as classification methods based solely on phylogenetic marker genes predicted a more restricted diversity compared with methods that took all reads into account. The metagenomic sequencing analysis identified Hanseniaspora uvarum, Hanseniaspora opuntiae, Saccharomyces cerevisiae, Lactobacillus fermentum, and Acetobacter pasteurianus as the prevailing species. Also, the presence of occasional members of the cocoa bean fermentation process was revealed (such as Erwinia tasmaniensis, Lactobacillus brevis, Lactobacillus casei, Lactobacillus rhamnosus, Lactococcus lactis, Leuconostoc mesenteroides, and Oenococcus oeni). Furthermore, the sequence reads associated with viral communities were of a restricted diversity, dominated by Myoviridae and Siphoviridae, and reflecting Lactobacillus as the dominant host. To conclude, an accurate overview of all members of a cocoa bean fermentation process sample was revealed, indicating the superiority of metagenomic sequencing over previously used techniques. PMID:22666442

  4. Absolute Pitch in Boreal Chickadees and Humans: Exceptions that Test a Phylogenetic Rule

    ERIC Educational Resources Information Center

    Weisman, Ronald G.; Balkwill, Laura-Lee; Hoeschele, Marisa; Moscicki, Michele K.; Bloomfield, Laurie L.; Sturdy, Christopher B.

    2010-01-01

    This research examined generality of the phylogenetic rule that birds discriminate frequency ranges more accurately than mammals. Human absolute pitch chroma possessors accurately tracked transitions between frequency ranges. Independent tests showed that they used note naming (pitch chroma) to remap the tones into ranges; neither possessors nor…

  5. Phylogenetic placement of the ectomycorrhizal genus Cenococcum in Gloniaceae (Dothideomycetes).

    PubMed

    Spatafora, Joseph W; Owensby, C Alisha; Douhan, Greg W; Boehm, Eric W A; Schoch, Conrad L

    2012-01-01

    Cenococcum is a genus of ectomycorrhizal Ascomycota that has a broad host range and geographic distribution. It is not known to produce either meiotic or mitotic spores and is known to exist only in the form of hyphae, sclerotia and host-colonized ectomycorrhizal root tips. Due to its lack of sexual and asexual spores and reproductive structures, it has proven difficult to incorporate into traditional classification within Ascomycota. Molecular phylogenetic studies of ribosomal RNA placed Cenococcum in Dothideomycetes, but the definitive identification of closely related taxa remained elusive. Here we report a phylogenetic analysis of five nuclear loci (SSU, LSU, TEF1, RPB1, RPB2) of Dothideomycetes that placed Cenococcum as a close relative of the genus Glonium of Gloniaceae (Pleosporomycetidae incertae sedis) with strong statistical support. Glonium is a genus of saprobic Dothideomycetes that produces darkly pigmented, carbonaceous, hysteriate apothecia and is not known to be biotrophic. Evolution of ectomycorhizae, Cenococcum and Dothideomycetes is discussed. PMID:22453119

  6. On the nature of global classification

    NASA Technical Reports Server (NTRS)

    Wheelis, M. L.; Kandler, O.; Woese, C. R.

    1992-01-01

    Molecular sequencing technology has brought biology into the era of global (universal) classification. Methodologically and philosophically, global classification differs significantly from traditional, local classification. The need for uniformity requires that higher level taxa be defined on the molecular level in terms of universally homologous functions. A global classification should reflect both principal dimensions of the evolutionary process: genealogical relationship and quality and extent of divergence within a group. The ultimate purpose of a global classification is not simply information storage and retrieval; such a system should also function as an heuristic representation of the evolutionary paradigm that exerts a directing influence on the course of biology. The global system envisioned allows paraphyletic taxa. To retain maximal phylogenetic information in these cases, minor notational amendments in existing taxonomic conventions should be adopted.

  7. Multiple Sparse Representations Classification.

    PubMed

    Plenge, Esben; Klein, Stefan; Klein, Stefan S; Niessen, Wiro J; Meijering, Erik

    2015-01-01

    Sparse representations classification (SRC) is a powerful technique for pixelwise classification of images and it is increasingly being used for a wide variety of image analysis tasks. The method uses sparse representation and learned redundant dictionaries to classify image pixels. In this empirical study we propose to further leverage the redundancy of the learned dictionaries to achieve a more accurate classifier. In conventional SRC, each image pixel is associated with a small patch surrounding it. Using these patches, a dictionary is trained for each class in a supervised fashion. Commonly, redundant/overcomplete dictionaries are trained and image patches are sparsely represented by a linear combination of only a few of the dictionary elements. Given a set of trained dictionaries, a new patch is sparse coded using each of them, and subsequently assigned to the class whose dictionary yields the minimum residual energy. We propose a generalization of this scheme. The method, which we call multiple sparse representations classification (mSRC), is based on the observation that an overcomplete, class specific dictionary is capable of generating multiple accurate and independent estimates of a patch belonging to the class. So instead of finding a single sparse representation of a patch for each dictionary, we find multiple, and the corresponding residual energies provides an enhanced statistic which is used to improve classification. We demonstrate the efficacy of mSRC for three example applications: pixelwise classification of texture images, lumen segmentation in carotid artery magnetic resonance imaging (MRI), and bifurcation point detection in carotid artery MRI. We compare our method with conventional SRC, K-nearest neighbor, and support vector machine classifiers. The results show that mSRC outperforms SRC and the other reference methods. In addition, we present an extensive evaluation of the effect of the main mSRC parameters: patch size, dictionary size, and sparsity level. PMID:26177106

  8. Multiple Sparse Representations Classification

    PubMed Central

    Plenge, Esben; Klein, Stefan S.; Niessen, Wiro J.; Meijering, Erik

    2015-01-01

    Sparse representations classification (SRC) is a powerful technique for pixelwise classification of images and it is increasingly being used for a wide variety of image analysis tasks. The method uses sparse representation and learned redundant dictionaries to classify image pixels. In this empirical study we propose to further leverage the redundancy of the learned dictionaries to achieve a more accurate classifier. In conventional SRC, each image pixel is associated with a small patch surrounding it. Using these patches, a dictionary is trained for each class in a supervised fashion. Commonly, redundant/overcomplete dictionaries are trained and image patches are sparsely represented by a linear combination of only a few of the dictionary elements. Given a set of trained dictionaries, a new patch is sparse coded using each of them, and subsequently assigned to the class whose dictionary yields the minimum residual energy. We propose a generalization of this scheme. The method, which we call multiple sparse representations classification (mSRC), is based on the observation that an overcomplete, class specific dictionary is capable of generating multiple accurate and independent estimates of a patch belonging to the class. So instead of finding a single sparse representation of a patch for each dictionary, we find multiple, and the corresponding residual energies provides an enhanced statistic which is used to improve classification. We demonstrate the efficacy of mSRC for three example applications: pixelwise classification of texture images, lumen segmentation in carotid artery magnetic resonance imaging (MRI), and bifurcation point detection in carotid artery MRI. We compare our method with conventional SRC, K-nearest neighbor, and support vector machine classifiers. The results show that mSRC outperforms SRC and the other reference methods. In addition, we present an extensive evaluation of the effect of the main mSRC parameters: patch size, dictionary size, and sparsity level. PMID:26177106

  9. Fast and accurate methods for phylogenomic analyses

    PubMed Central

    2011-01-01

    Background Species phylogenies are not estimated directly, but rather through phylogenetic analyses of different gene datasets. However, true gene trees can differ from the true species tree (and hence from one another) due to biological processes such as horizontal gene transfer, incomplete lineage sorting, and gene duplication and loss, so that no single gene tree is a reliable estimate of the species tree. Several methods have been developed to estimate species trees from estimated gene trees, differing according to the specific algorithmic technique used and the biological model used to explain differences between species and gene trees. Relatively little is known about the relative performance of these methods. Results We report on a study evaluating several different methods for estimating species trees from sequence datasets, simulating sequence evolution under a complex model including indels (insertions and deletions), substitutions, and incomplete lineage sorting. The most important finding of our study is that some fast and simple methods are nearly as accurate as the most accurate methods, which employ sophisticated statistical methods and are computationally quite intensive. We also observe that methods that explicitly consider errors in the estimated gene trees produce more accurate trees than methods that assume the estimated gene trees are correct. Conclusions Our study shows that highly accurate estimations of species trees are achievable, even when gene trees differ from each other and from the species tree, and that these estimations can be obtained using fairly simple and computationally tractable methods. PMID:22152123

  10. Phylogenetic relationships of sugarcane fungi

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The phylogenetic positions of Puccinia spp. infecting sugarcane (a complex hybrid of Saccharum spp.) was determined using 42 newly generated rust sequences and 25 sequences from Genbank. Rust specimens on sugarcane were collected from 161 locations in 25 countries and identified based on light micro...

  11. Phylogenetic Relationships Among Lepidium Papilliferum

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Previous phylogenetic analyses of Lepidium included only a few acessions of L. montanum, L. flavum, and L. fremontii to represent western North Amrican species. Two additional species endemic to southwest Idaho have posed both taxonomic and conservation questions regarding their species status. Le...

  12. Molecular phylogenetics of Gobioidei and phylogenetic placement of European gobies.

    PubMed

    Agorreta, Ainhoa; San Mauro, Diego; Schliewen, Ulrich; Van Tassell, James L; Kovačić, Marcelo; Zardoya, Rafael; Rüber, Lukas

    2013-12-01

    Gobioidei is one of the largest suborders of teleost fishes, with nearly 2000 extant species currently recognized. They have a worldwide distribution and show a spectacular variety in morphology, ecology, and behavior. Despite their importance, phylogenetic relationships among many groups of gobioids (including some of the major lineages) still remain poorly understood. In this study, we analyze sequence data of five molecular markers (two mitochondrial and three nuclear) averaging 6000 bp for 222 species of gobioids. Our study is the first to include both multiple nuclear and mitochondrial genes to reconstruct a comprehensive multilocus phylogeny of gobioids encompassing most major lineages representing the overall diversity of one of the most speciose vertebrate lineages. Two separate datasets are produced and used to specifically address the phylogenetic placement of Rhyacichthyidae and Odontobutidae, and the phylogenetic relationships among gobioid lineages. Our results strongly support that the initial split in the gobioid tree separated a clade containing Rhyacichthyidae+Odontobutidae as the sister group of all other lineages. The family Eleotrididae branches off the gobioid tree after the Rhyacichthyidae+Odontobutidae clade, followed by the Butidae as sister group to the Gobiidae. Additionally, several major monophyletic groups are confidently identified within the two major Gobiidae subclades, the gobiine-like gobiids and the gobionelline-like gobiids. Robustness of the phylogenetic trees inferred here is significantly higher than that of previous studies, hence our results provide the most compelling molecular phylogenetic hypothesis of Gobioidei thus far. For the first time, we provide a comprehensive sampling of European gobies that traditionally have been divided into "transverse" gobies and "sand gobies". We show that the European gobies cluster in three distinct lineages, the Pomatoschistus-, Aphia-, and Gobius-lineages. The former resolved within the gobionelline-like gobiids and the latter two within the gobiine-like gobiids. These findings have significant implications for our understanding of the phylogeographic origin of European gobies in the light of the closure of the Paratethys. A rogue taxon analysis identified Kraemeria as an unstable taxon decreasing support at the base of the gobiine-like gobiids. Removal of this rogue taxon significantly increased phylogenetic resolution in that part of the tree and revealed additional insights into early bursts of cladogenesis of the gobiine-like gobiids. PMID:23911892

  13. Accurate monotone cubic interpolation

    NASA Technical Reports Server (NTRS)

    Huynh, Hung T.

    1991-01-01

    Monotone piecewise cubic interpolants are simple and effective. They are generally third-order accurate, except near strict local extrema where accuracy degenerates to second-order due to the monotonicity constraint. Algorithms for piecewise cubic interpolants, which preserve monotonicity as well as uniform third and fourth-order accuracy are presented. The gain of accuracy is obtained by relaxing the monotonicity constraint in a geometric framework in which the median function plays a crucial role.

  14. Accurate Finite Difference Algorithms

    NASA Technical Reports Server (NTRS)

    Goodrich, John W.

    1996-01-01

    Two families of finite difference algorithms for computational aeroacoustics are presented and compared. All of the algorithms are single step explicit methods, they have the same order of accuracy in both space and time, with examples up to eleventh order, and they have multidimensional extensions. One of the algorithm families has spectral like high resolution. Propagation with high order and high resolution algorithms can produce accurate results after O(10(exp 6)) periods of propagation with eight grid points per wavelength.

  15. Phylogenetic analysis of polyomavirus BK sequences.

    PubMed

    Sharma, Preety M; Gupta, Gaurav; Vats, Abhay; Shapiro, Ron; Randhawa, Parmjeet

    2006-09-01

    Polyomavirus BK (BKV) has emerged as an important pathogen in kidney transplant patients. Existing taxonomic classifications of BKV come from conventional DNA sequence alignments based on limited data derived from the VP1 gene. We have used a phylogenetic whole-genome approach to examine the pattern of diversity and evolutionary relationships between 45 BKV strains isolated from multiple clinical settings. This analysis supports the classification of BKV into six genotypes, of which types V and VI have not been previously recognized. BKV strains hitherto classified as type I are, in fact, quite heterogeneous, and several cluster with our newly defined genotypes V and VI. The sequence information needed for assigning genotypes can be captured by VP1, VP2, VP3, or large T-gene sequencing. The most polymorphic coding region in the viral genome is VP1, but significant variation is also present in the large T-antigen gene, wherein polymorphisms are found in 11.39% of all nucleotide sites, 46.22% of which are cluster specific. Type-specific amino acid changes within the VP1 region are predicted to map to the BC and DE loops. The number of taxonomically informative amino acid changes in the large T antigen exceeds even that of the VP1 region. Viral strains isolated from healthy subjects and from patients with human immunodeficiency virus infection, Wiskott-Aldrich syndrome, and vasculopathy with capillary leak syndrome formed distinct subclusters. However, within the kidney transplant population, BKV strains derived from patients with asymptomatic viruria did not show complete separation from strains associated with allograft nephropathy. PMID:16940499

  16. Functional Basis of Microorganism Classification

    PubMed Central

    Zhu, Chengsheng; Delmont, Tom O.; Vogel, Timothy M.; Bromberg, Yana

    2015-01-01

    Correctly identifying nearest “neighbors” of a given microorganism is important in industrial and clinical applications where close relationships imply similar treatment. Microbial classification based on similarity of physiological and genetic organism traits (polyphasic similarity) is experimentally difficult and, arguably, subjective. Evolutionary relatedness, inferred from phylogenetic markers, facilitates classification but does not guarantee functional identity between members of the same taxon or lack of similarity between different taxa. Using over thirteen hundred sequenced bacterial genomes, we built a novel function-based microorganism classification scheme, functional-repertoire similarity-based organism network (FuSiON; flattened to fusion). Our scheme is phenetic, based on a network of quantitatively defined organism relationships across the known prokaryotic space. It correlates significantly with the current taxonomy, but the observed discrepancies reveal both (1) the inconsistency of functional diversity levels among different taxa and (2) an (unsurprising) bias towards prioritizing, for classification purposes, relatively minor traits of particular interest to humans. Our dynamic network-based organism classification is independent of the arbitrary pairwise organism similarity cut-offs traditionally applied to establish taxonomic identity. Instead, it reveals natural, functionally defined organism groupings and is thus robust in handling organism diversity. Additionally, fusion can use organism meta-data to highlight the specific environmental factors that drive microbial diversification. Our approach provides a complementary view to cladistic assignments and holds important clues for further exploration of microbial lifestyles. Fusion is a more practical fit for biomedical, industrial, and ecological applications, as many of these rely on understanding the functional capabilities of the microbes in their environment and are less concerned with phylogenetic descent. PMID:26317871

  17. Functional Basis of Microorganism Classification.

    PubMed

    Zhu, Chengsheng; Delmont, Tom O; Vogel, Timothy M; Bromberg, Yana

    2015-08-01

    Correctly identifying nearest "neighbors" of a given microorganism is important in industrial and clinical applications where close relationships imply similar treatment. Microbial classification based on similarity of physiological and genetic organism traits (polyphasic similarity) is experimentally difficult and, arguably, subjective. Evolutionary relatedness, inferred from phylogenetic markers, facilitates classification but does not guarantee functional identity between members of the same taxon or lack of similarity between different taxa. Using over thirteen hundred sequenced bacterial genomes, we built a novel function-based microorganism classification scheme, functional-repertoire similarity-based organism network (FuSiON; flattened to fusion). Our scheme is phenetic, based on a network of quantitatively defined organism relationships across the known prokaryotic space. It correlates significantly with the current taxonomy, but the observed discrepancies reveal both (1) the inconsistency of functional diversity levels among different taxa and (2) an (unsurprising) bias towards prioritizing, for classification purposes, relatively minor traits of particular interest to humans. Our dynamic network-based organism classification is independent of the arbitrary pairwise organism similarity cut-offs traditionally applied to establish taxonomic identity. Instead, it reveals natural, functionally defined organism groupings and is thus robust in handling organism diversity. Additionally, fusion can use organism meta-data to highlight the specific environmental factors that drive microbial diversification. Our approach provides a complementary view to cladistic assignments and holds important clues for further exploration of microbial lifestyles. Fusion is a more practical fit for biomedical, industrial, and ecological applications, as many of these rely on understanding the functional capabilities of the microbes in their environment and are less concerned with phylogenetic descent. PMID:26317871

  18. Phylogenetic analysis of Bacillus subtilis and related taxa based on partial gyrA gene sequences.

    PubMed

    Chun, J; Bae, K S

    2000-08-01

    Partial gyrA sequences were determined for twelve strains belonging to Bacillus amyloliquefaciens, B. atrophaeus, B. licheniformis, B. mojavensis, B. subtilis subsp. subtilis, B. subtilis subsp. spizizenii and B. vallismortis. The average nucleotide and translated amino acid similarities for the seven type strains were 83.7 and 95.1%, respectively, whereas the corresponding value for the 16S rRNA sequences was 99.1%. All of the type strains were sharply separated; the closest relationship was found between B. atrophaeus and B. mojavensis which shared a nucleotide similarity of 95.8%. Phylogenetic trees were inferred from gyrA nucleotide sequences using the neighbor-joining, Fitch-Margoliash and maximum parsimony algorithms. The test strains were divided into four groups, which generally reflected results previously reported in restriction digest and DNA-DNA hybridization studies. It is concluded from the comparative sequence analysis that the gyrA sequences provide a firm framework for the rapid and accurate classification and identification of Bacillus subtilis and related taxa. PMID:11204764

  19. Cyber-infrastructure for Fusarium (CiF): Three integrated platforms supporting strain identification, phylogenetics, comparative genomics, and knowledge sharing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on ...

  20. Molecular identification and phylogenetic study of Demodex caprae.

    PubMed

    Zhao, Ya-E; Cheng, Juan; Hu, Li; Ma, Jun-Xian

    2014-10-01

    The DNA barcode has been widely used in species identification and phylogenetic analysis since 2003, but there have been no reports in Demodex. In this study, to obtain an appropriate DNA barcode for Demodex, molecular identification of Demodex caprae based on mitochondrial cox1 was conducted. Firstly, individual adults and eggs of D. caprae were obtained for genomic DNA (gDNA) extraction; Secondly, mitochondrial cox1 fragment was amplified, cloned, and sequenced; Thirdly, cox1 fragments of D. caprae were aligned with those of other Demodex retrieved from GenBank; Finally, the intra- and inter-specific divergences were computed and the phylogenetic trees were reconstructed to analyze phylogenetic relationship in Demodex. Results obtained from seven 429-bp fragments of D. caprae showed that sequence identities were above 99.1% among three adults and four eggs. The intraspecific divergences in D. caprae, Demodex folliculorum, Demodex brevis, and Demodex canis were 0.0-0.9, 0.5-0.9, 0.0-0.2, and 0.0-0.5%, respectively, while the interspecific divergences between D. caprae and D. folliculorum, D. canis, and D. brevis were 20.3-20.9, 21.8-23.0, and 25.0-25.3, respectively. The interspecific divergences were 10 times higher than intraspecific ones, indicating considerable barcoding gap. Furthermore, the phylogenetic trees showed that four Demodex species gathered separately, representing independent species; and Demodex folliculorum gathered with canine Demodex, D. caprae, and D. brevis in sequence. In conclusion, the selected 429-bp mitochondrial cox1 gene is an appropriate DNA barcode for molecular classification, identification, and phylogenetic analysis of Demodex. D. caprae is an independent species and D. folliculorum is closer to D. canis than to D. caprae or D. brevis. PMID:25132566

  1. Classification of perinatal deaths.

    PubMed

    Wigglesworth, J S

    1994-01-01

    Accurate classification of perinatal deaths is instrumental in efforts to improve the perinatal mortality rate (PNMR). Accurate records should be maintained on birth weight, gestational age, a simple classification of cause of death (possibly the "Wigglesworth" or the author's system), a classification system correlated with birth weight/gestation of dead and live births, and, where possible, detailed coding such as the International Classification of Diseases (ICD) or the Systematized Nomenclature of Medicine (SNOMED). The fetal loss rate in Great Britain is reported as a minimum of 0.1% per week for pregnancies from 22 weeks until term. Obstetric or neonatal management can affect the level of mortality. Terminations due to malformations performed before the 24th week are found to account for a 50% decline in the PNMR. PNMR is affected also by birth weight and the occurrence of multiple births. Neonatal care that prolongs the survival of sick or malformed neonates may decrease PNMR or increase infant death rates. Analysis of trends in PNMR must consider demographic changes such as an increase in preterm multiple births due to fertility treatment. Perinatal deaths may be classified by cause with a variety of systems. The ICD system is a single axis classification which uses three digit codes for primary causes and a fourth and fifth digit for detailed subcategories, which can be accessed in a alphabetically-organized index. SNOMED provides six different fields of information, including topography, morphology, aetiology, function, disease, and procedure. Five subordinate fields are identified and linked to topographic body sites. Choice of classification systems is considered to be based on availability of information and the needs for subsequent analysis. Two other systems of classification are available in the UK. The Aberdeen system is based on maternal factors, but the disadvantage is the high number of unexplained deaths. The Hey et al. system codes by fetal and neonatal factors and does not require modern investigative techniques, but the detail is highly variables from case-to-case and hospital-to-hospital. The author's system is based on routine autopsies of external conditions. Four groupings are identified, including macerated normally-formed stillborn infants, congenital anomalies (stillbirth or neonatal death), conditions associated with prematurity, and fresh stillbirths presumed asphyxiated. A fifth category groups infants dying of specific conditions such as toxoplasmosis. PMID:8147106

  2. Phylogenetic diversity of nonmarine picocyanobacteria.

    PubMed

    Callieri, Cristiana; Coci, Manuela; Corno, Gianluca; Macek, Miroslav; Modenutti, Beatriz; Balseiro, Esteban; Bertoni, Roberto

    2013-08-01

    We studied the phylogenetic diversity of nonmarine picocyanobacteria broadening the sequence data set with 43 new sequences of the 16S rRNA gene. The sequences were derived from monoclonal strains isolated from four volcanic high-altitude athalassohaline lakes in Mexico, five glacial ultraoligotrophic North Patagonian lakes and six Italian lakes of glacial, volcanic and morenic origin. The new sequences fall into a number of both novel and previously described clades within the phylogenetic tree of 16S rRNA gene. The new cluster of Lake Nahuel Huapi (North Patagonia) forms a sister clade to the subalpine cluster II and the marine Synechococcus subcluster 5.2. Our finding of the novel clade of 'halotolerants' close to the marine subcluster 5.3 (Synechococcus RCC307) constitutes an important demonstration that euryhaline and marine strains affiliate closely. The intriguing results obtained shed new light on the importance of the nonmarine halotolerants in the phylogenesis of picocyanobacteria. PMID:23528076

  3. Sirevirus LTR retrotransposons: phylogenetic misconceptions in the plant world.

    PubMed

    Bousios, Alexandros; Darzentas, Nikos

    2013-01-01

    Sireviruses are an ancient and plant-specific LTR retrotransposon genus. They possess a unique genome structure that is characterized by a plethora of highly conserved sequence motifs in key domains of the non-coding genome, and often, by the presence of an envelope-like gene. Recently, their crucial role in the organization of the maize genome, where Sireviruses occupy approximately 21% of its nuclear content, was revealed, followed by an analysis of their distribution across the plant kingdom. It is now suggested that Sireviruses have been a major mediator of the evolution of many plant genomes. However, the name 'Sirevirus' has caused confusion in the scientific community in regards to their classification within the LTR retrotransposon order and their relationship with viruses - a situation that is not unique to Sireviruses, but also affects other LTR retrotransposon genera. Here, we clarify the phylogenetic position of Sireviruses as typical LTR retrotransposons of the Copia superfamily and explain that the confusion stems from the discrepancy in the categorization of LTR retrotransposons by the two main classification systems: the International Committee on the Taxonomy of Viruses (ICTV) system and the unified classification system for eukaryotic transposable elements. While the name 'Sirevirus' has been given by ICTV, we show that the transposable element system, which is more suitable for eukaryotic genome studies, lacks an appropriate taxonomic level for describing them. We urge for this inconsistency to be addressed. Finally, we provide data suggesting that of the three ICTV-proposed genera of the Pseudoviridae (that is, Copia) family, only Sireviruses form a monophyletic group, while the phylogenetic distinction between Pseudoviruses and Hemiviruses is unclear. We conclude that because of their ongoing important contribution to the classification of transposable elements, these schemes need to be frequently revisited and revised - as shown by the example of the Sirevirus LTR retrotransposon genus. PMID:23452336

  4. Point estimates in phylogenetic reconstructions

    PubMed Central

    Benner, Philipp; Bačák, Miroslav; Bourguignon, Pierre-Yves

    2014-01-01

    Motivation: The construction of statistics for summarizing posterior samples returned by a Bayesian phylogenetic study has so far been hindered by the poor geometric insights available into the space of phylogenetic trees, and ad hoc methods such as the derivation of a consensus tree makeup for the ill-definition of the usual concepts of posterior mean, while bootstrap methods mitigate the absence of a sound concept of variance. Yielding satisfactory results with sufficiently concentrated posterior distributions, such methods fall short of providing a faithful summary of posterior distributions if the data do not offer compelling evidence for a single topology. Results: Building upon previous work of Billera et al., summary statistics such as sample mean, median and variance are defined as the geometric median, Fréchet mean and variance, respectively. Their computation is enabled by recently published works, and embeds an algorithm for computing shortest paths in the space of trees. Studying the phylogeny of a set of plants, where several tree topologies occur in the posterior sample, the posterior mean balances correctly the contributions from the different topologies, where a consensus tree would be biased. Comparisons of the posterior mean, median and consensus trees with the ground truth using simulated data also reveals the benefits of a sound averaging method when reconstructing phylogenetic trees. Availability and implementation: We provide two independent implementations of the algorithm for computing Fréchet means, geometric medians and variances in the space of phylogenetic trees. TFBayes: https://github.com/pbenner/tfbayes, TrAP: https://github.com/bacak/TrAP. Contact: philipp.benner@mis.mpg.de PMID:25161244

  5. Biometric speaker classification

    NASA Astrophysics Data System (ADS)

    Nelson, Douglas J.; Smith, David C.; Richman, D. J.; Townsend, Goffrey

    2000-10-01

    We address the problem of classification of speakers based on measurements of features obtained from their speech. The process is an adaption of biometric methods used to identify people. The process for speech differs since speech is not stationary. We therefore propose the classification of speakers b y the statistical distributions of parameters which may be accurately estimated by modern signal processing techniques. The intent is to develop a speaker clustering algorithm which is dependent of transmission channel and insensitive to language variations, and which may be re-trained, with minimal data, to include a new speaker. We demonstrate effectiveness on the problem of identification of the speakers gender, and present evidence that the methods may be extended to the general problem of speaker identification.

  6. Phylogenetic analysis of otospiralin protein

    PubMed Central

    Torktaz, Ibrahim; Behjati, Mohaddeseh; Rostami, Amin

    2016-01-01

    Background: Fibrocyte-specific protein, otospiralin, is a small protein, widely expressed in the central nervous system as neuronal cell bodies and glia. The increased expression of otospiralin in reactive astrocytes implicates its role in signaling pathways and reparative mechanisms subsequent to injury. Indeed, otospiralin is considered to be essential for the survival of fibrocytes of the mesenchymal nonsensory regions of the cochlea. It seems that other functions of this protein are not yet completely understood. Materials and Methods: Amino acid sequences of otospiralin from 12 vertebrates were derived from National Center for Biotechnology Information database. Phylogenetic analysis and phylogeny estimation were performed using MEGA 5.0.5 program, and neighbor-joining tree was constructed by this software. Results: In this computational study, the phylogenetic tree of otospiralin has been investigated. Therefore, dendrograms of otospiralin were depicted. Alignment performed in MUSCLE method by UPGMB algorithm. Also, entropy plot determined for a better illustration of amino acid variations in this protein. Conclusion: In the present study, we used otospiralin sequence of 12 different species and by constructing phylogenetic tree, we suggested out group for some related species. PMID:27099854

  7. Constructing and counting phylogenetic invariants.

    PubMed

    Evans, S N; Zhou, X

    1998-01-01

    The method of invariants is an approach to the problem of reconstructing the phylogenetic tree of a collection of m taxa using nucleotide sequence data. Models for the respective probabilities of the 4m possible vectors of bases at a given site will have unknown parameters that describe the random mechanism by which substitution occurs along the branches of a putative phylogenetic tree. An invariant is a polynomial in these probabilities that, for a given phylogeny, is zero for all choices of the substitution mechanism parameters. If the invariant is typically non-zero for another phylogenetic tree, then estimates of the invariant can be used as evidence to support one phylogeny over another. Previous work of Evans and Speed showed that, for certain commonly used substitution models, the problem of finding a minimal generating set for the ideal of invariants can be reduced to the linear algebra problem of finding a basis for a certain lattice (that is, a free Z-module). They also conjectured that the cardinality of such a generating set can be computed using a simple "degrees of freedom" formula. We verify this conjecture. Along the way, we explain in detail how the observations of Evans and Speed lead to a simple, computationally feasible algorithm for constructing a minimal generating set. PMID:10072086

  8. Phylogenetic signal and noise: predicting the power of a data set to resolve phylogeny.

    PubMed

    Townsend, Jeffrey P; Su, Zhuo; Tekle, Yonas I

    2012-10-01

    A principal objective for phylogenetic experimental design is to predict the power of a data set to resolve nodes in a phylogenetic tree. However, proactively assessing the potential for phylogenetic noise compared with signal in a candidate data set has been a formidable challenge. Understanding the impact of collection of additional sequence data to resolve recalcitrant internodes at diverse historical times will facilitate increasingly accurate and cost-effective phylogenetic research. Here, we derive theory based on the fundamental unit of the phylogenetic tree, the quartet, that applies estimates of the state space and the rates of evolution of characters in a data set to predict phylogenetic signal and phylogenetic noise and therefore to predict the power to resolve internodes. We develop and implement a Monte Carlo approach to estimating power to resolve as well as deriving a nearly equivalent faster deterministic calculation. These approaches are applied to describe the distribution of potential signal, polytomy, or noise for two example data sets, one recent (cytochrome c oxidase I and 28S ribosomal rRNA sequences from Diplazontinae parasitoid wasps) and one deep (eight nuclear genes and a phylogenomic sequence for diverse microbial eukaryotes including Stramenopiles, Alveolata, and Rhizaria). The predicted power of resolution for the loci analyzed is consistent with the historic use of the genes in phylogenetics. PMID:22389443

  9. Highly accurate nephelometric titrimetry.

    PubMed

    Zhan, Xiancheng; Li, Chengrong; Li, Zhiyi; Yang, Xiucen; Zhong, Shuguang; Yi, Tao

    2004-02-01

    A method that accurately indicates the end-point of precipitation reactions by the measurement of the relative intensity of the scattered light in the titrate is presented. A new nephelometric titrator with an internal nephelometric sensor has been devised. The work of the titrator including the sensor and change in the turbidity of the titrate and intensity of the scattered light are described. The accuracy of the nephelometric titrimetry is discussed theoretically. The titration of NaCl with AgNO(3) serves as a model. A relative error as well as deviation is within 0.2% under the experimental conditions. The applicability of the titrimetry in pharmaceutical analyses, for example, phenytoin sodium and procaine hydrochloride, is generally illustrated. PMID:14705200

  10. Accurate Inclusion Mass Screening

    PubMed Central

    Jaffe, Jacob D.; Keshishian, Hasmik; Chang, Betty; Addona, Theresa A.; Gillette, Michael A.; Carr, Steven A.

    2008-01-01

    Verification of candidate biomarker proteins in blood is typically done using multiple reaction monitoring (MRM) of peptides by LC-MS/MS on triple quadrupole MS systems. MRM assay development for each protein requires significant time and cost, much of which is likely to be of little value if the candidate biomarker is below the detection limit in blood or a false positive in the original discovery data. Here we present a new technology, accurate inclusion mass screening (AIMS), designed to provide a bridge from unbiased discovery to MS-based targeted assay development. Masses on the software inclusion list are monitored in each scan on the Orbitrap MS system, and MS/MS spectra for sequence confirmation are acquired only when a peptide from the list is detected with both the correct accurate mass and charge state. The AIMS experiment confirms that a given peptide (and thus the protein from which it is derived) is present in the plasma. Throughput of the method is sufficient to qualify up to a hundred proteins/week. The sensitivity of AIMS is similar to MRM on a triple quadrupole MS system using optimized sample preparation methods (low tens of ng/ml in plasma), and MS/MS data from the AIMS experiments on the Orbitrap can be directly used to configure MRM assays. The method was shown to be at least 4-fold more efficient at detecting peptides of interest than undirected LC-MS/MS experiments using the same instrumentation, and relative quantitation information can be obtained by AIMS in case versus control experiments. Detection by AIMS ensures that a quantitative MRM-based assay can be configured for that protein. The method has the potential to qualify large number of biomarker candidates based on their detection in plasma prior to committing to the time- and resource-intensive steps of establishing a quantitative assay. PMID:18534968

  11. Accurate quantum chemical calculations

    NASA Technical Reports Server (NTRS)

    Bauschlicher, Charles W., Jr.; Langhoff, Stephen R.; Taylor, Peter R.

    1989-01-01

    An important goal of quantum chemical calculations is to provide an understanding of chemical bonding and molecular electronic structure. A second goal, the prediction of energy differences to chemical accuracy, has been much harder to attain. First, the computational resources required to achieve such accuracy are very large, and second, it is not straightforward to demonstrate that an apparently accurate result, in terms of agreement with experiment, does not result from a cancellation of errors. Recent advances in electronic structure methodology, coupled with the power of vector supercomputers, have made it possible to solve a number of electronic structure problems exactly using the full configuration interaction (FCI) method within a subspace of the complete Hilbert space. These exact results can be used to benchmark approximate techniques that are applicable to a wider range of chemical and physical problems. The methodology of many-electron quantum chemistry is reviewed. Methods are considered in detail for performing FCI calculations. The application of FCI methods to several three-electron problems in molecular physics are discussed. A number of benchmark applications of FCI wave functions are described. Atomic basis sets and the development of improved methods for handling very large basis sets are discussed: these are then applied to a number of chemical and spectroscopic problems; to transition metals; and to problems involving potential energy surfaces. Although the experiences described give considerable grounds for optimism about the general ability to perform accurate calculations, there are several problems that have proved less tractable, at least with current computer resources, and these and possible solutions are discussed.

  12. A Phylogenetic Re-Analysis of Groupers with Applications for Ciguatera Fish Poisoning

    PubMed Central

    Schoelinck, Charlotte; Hinsinger, Damien D.; Dettaï, Agnès; Cruaud, Corinne; Justine, Jean-Lou

    2014-01-01

    Background Ciguatera fish poisoning (CFP) is a significant public health problem due to dinoflagellates. It is responsible for one of the highest reported incidence of seafood-borne illness and Groupers are commonly reported as a source of CFP due to their position in the food chain. With the role of recent climate change on harmful algal blooms, CFP cases might become more frequent and more geographically widespread. Since there is no appropriate treatment for CFP, the most efficient solution is to regulate fish consumption. Such a strategy can only work if the fish sold are correctly identified, and it has been repeatedly shown that misidentifications and species substitutions occur in fish markets. Methods We provide here both a DNA-barcoding reference for groupers, and a new phylogenetic reconstruction based on five genes and a comprehensive taxonomical sampling. We analyse the correlation between geographic range of species and their susceptibility to ciguatera accumulation, and the co-occurrence of ciguatoxins in closely related species, using both character mapping and statistical methods. Results Misidentifications were encountered in public databases, precluding accurate species identifications. Epinephelinae now includes only twelve genera (vs. 15 previously). Comparisons with the ciguatera incidences show that in some genera most species are ciguateric, but statistical tests display only a moderate correlation with the phylogeny. Atlantic species were rarely contaminated, with ciguatera occurrences being restricted to the South Pacific. Conclusions The recent changes in classification based on the reanalyses of the relationships within Epinephelidae have an impact on the interpretation of the ciguatera distribution in the genera. In this context and to improve the monitoring of fish trade and safety, we need to obtain extensive data on contamination at the species level. Accurate species identifications through DNA barcoding are thus an essential tool in controlling CFP since meal remnants in CFP cases can be easily identified with molecular tools. PMID:25093850

  13. Learning classification trees

    NASA Technical Reports Server (NTRS)

    Buntine, Wray

    1991-01-01

    Algorithms for learning classification trees have had successes in artificial intelligence and statistics over many years. How a tree learning algorithm can be derived from Bayesian decision theory is outlined. This introduces Bayesian techniques for splitting, smoothing, and tree averaging. The splitting rule turns out to be similar to Quinlan's information gain splitting rule, while smoothing and averaging replace pruning. Comparative experiments with reimplementations of a minimum encoding approach, Quinlan's C4 and Breiman et al. Cart show the full Bayesian algorithm is consistently as good, or more accurate than these other approaches though at a computational price.

  14. HIV classification using coalescent theory

    SciTech Connect

    Zhang, Ming; Letiner, Thomas K; Korber, Bette T

    2008-01-01

    Algorithms for subtype classification and breakpoint detection of HIV-I sequences are based on a classification system of HIV-l. Hence, their quality highly depend on this system. Due to the history of creation of the current HIV-I nomenclature, the current one contains inconsistencies like: The phylogenetic distance between the subtype B and D is remarkably small compared with other pairs of subtypes. In fact, it is more like the distance of a pair of subsubtypes Robertson et al. (2000); Subtypes E and I do not exist any more since they were discovered to be composed of recombinants Robertson et al. (2000); It is currently discussed whether -- instead of CRF02 being a recombinant of subtype A and G -- subtype G should be designated as a circulating recombination form (CRF) nd CRF02 as a subtype Abecasis et al. (2007); There are 8 complete and over 400 partial HIV genomes in the LANL-database which belong neither to a subtype nor to a CRF (denoted by U). Moreover, the current classification system is somehow arbitrary like all complex classification systems that were created manually. To this end, it is desirable to deduce the classification system of HIV systematically by an algorithm. Of course, this problem is not restricted to HIV, but applies to all fast mutating and recombining viruses. Our work addresses the simpler subproblem to score classifications of given input sequences of some virus species (classification denotes a partition of the input sequences in several subtypes and CRFs). To this end, we reconstruct ancestral recombination graphs (ARG) of the input sequences under restrictions determined by the given classification. These restritions are imposed in order to ensure that the reconstructed ARGs do not contradict the classification under consideration. Then, we find the ARG with maximal probability by means of Markov Chain Monte Carlo methods. The probability of the most probable ARG is interpreted as a score for the classification. To our knowledge, this particular problem was not addressed up to now. The software package Lamarc Kuhner et al. (2000) allows for sampling ARGs, but it assumes that recombination events only involve one breakpoint. However, in HIV recombinants usually have more than one breakpoint. Moreover, Lamarc does not perform an explicit breakpoint detection, but tries to find them by chance. Although this approach is suitable for most situations, it will not lead to satisfying results in case of highly recombining viruses with multiple breakpoints.

  15. Subject Classification.

    ERIC Educational Resources Information Center

    Thompson, Gayle; And Others

    Three newspaper librarians described how they manage the files of newspaper clippings which are a necessary part of their collections. The development of a new subject classification system for the clippings files was outlined. The new subject headings were based on standard subject heading lists and on local need. It was decided to use a computer…

  16. Toward extracting all phylogenetic information from matrices of evolutionary distances.

    PubMed

    Roch, Sebastien

    2010-03-12

    The matrix of evolutionary distances is a model-based statistic, derived from molecular sequences, summarizing the pairwise phylogenetic relations between a collection of species. Phylogenetic tree reconstruction methods relying on this matrix are relatively fast and thus widely used in molecular systematics. However, because of their intrinsic reliance on summary statistics, distance-matrix methods are assumed to be less accurate than likelihood-based approaches. In this paper, pairwise sequence comparisons are shown to be more powerful than previously hypothesized. A statistical analysis of certain distance-based techniques indicates that their data requirement for large evolutionary trees essentially matches the conjectured performance of maximum likelihood methods--challenging the idea that summary statistics lead to suboptimal analyses. On the basis of a connection between ancestral state reconstruction and distance averaging, the critical role played by the covariances of the distance matrix is identified. PMID:20223986

  17. Phylogenetic structure and host abundance drive disease pressure in communities.

    PubMed

    Parker, Ingrid M; Saunders, Megan; Bontrager, Megan; Weitz, Andrew P; Hendricks, Rebecca; Magarey, Roger; Suiter, Karl; Gilbert, Gregory S

    2015-04-23

    Pathogens play an important part in shaping the structure and dynamics of natural communities, because species are not affected by them equally. A shared goal of ecology and epidemiology is to predict when a species is most vulnerable to disease. A leading hypothesis asserts that the impact of disease should increase with host abundance, producing a 'rare-species advantage'. However, the impact of a pathogen may be decoupled from host abundance, because most pathogens infect more than one species, leading to pathogen spillover onto closely related species. Here we show that the phylogenetic and ecological structure of the surrounding community can be important predictors of disease pressure. We found that the amount of tissue lost to disease increased with the relative abundance of a species across a grassland plant community, and that this rare-species advantage had an additional phylogenetic component: disease pressure was stronger on species with many close relatives. We used a global model of pathogen sharing as a function of relatedness between hosts, which provided a robust predictor of relative disease pressure at the local scale. In our grassland, the total amount of disease was most accurately explained not by the abundance of the focal host alone, but by the abundance of all species in the community weighted by their phylogenetic distance to the host. Furthermore, the model strongly predicted observed disease pressure for 44 novel host species we introduced experimentally to our study site, providing evidence for a mechanism to explain why phylogenetically rare species are more likely to become invasive when introduced. Our results demonstrate how the phylogenetic and ecological structure of communities can have a key role in disease dynamics, with implications for the maintenance of biodiversity, biotic resistance against introduced weeds, and the success of managed plants in agriculture and forestry. PMID:25903634

  18. Phylogenetic placement of the Spirosomaceae

    NASA Technical Reports Server (NTRS)

    Woese, C. R.; Maloy, S.; Mandelco, L.; Raj, H. D.

    1990-01-01

    Comparative analysis of 16S rRNA sequences shows that the family Spirosomaceae belongs within the eubacterial phylum defined by the flavobacteria and bacteriodes. Its constituent genera, Spirosoma, Flectobacillus, and Runella form a monophyletic grouping therein. The phylogenetic assignment is based not only upon evolutionary distance analysis, but also upon sequence signatures and higher order structural synapomorphies in 16S rRNA. Another genus peripherally associated with the Spirosomaceae, Ancylobacter ("Microcyclus"), does not cluster with the flavobacteria and their relatives, but rather belongs to the alpha subdivision of the purple bacteria.

  19. Making Mosquito Taxonomy Useful: A Stable Classification of Tribe Aedini that Balances Utility with Current Knowledge of Evolutionary Relationships

    PubMed Central

    Wilkerson, Richard C.; Linton, Yvonne-Marie; Fonseca, Dina M.; Schultz, Ted R.; Price, Dana C.; Strickman, Daniel A.

    2015-01-01

    The tribe Aedini (Family Culicidae) contains approximately one-quarter of the known species of mosquitoes, including vectors of deadly or debilitating disease agents. This tribe contains the genus Aedes, which is one of the three most familiar genera of mosquitoes. During the past decade, Aedini has been the focus of a series of extensive morphology-based phylogenetic studies published by Reinert, Harbach, and Kitching (RH&K). Those authors created 74 new, elevated or resurrected genera from what had been the single genus Aedes, almost tripling the number of genera in the entire family Culicidae. The proposed classification is based on subjective assessments of the “number and nature of the characters that support the branches” subtending particular monophyletic groups in the results of cladistic analyses of a large set of morphological characters of representative species. To gauge the stability of RH&K’s generic groupings we reanalyzed their data with unweighted parsimony jackknife and maximum-parsimony analyses, with and without ordering 14 of the characters as in RH&K. We found that their phylogeny was largely weakly supported and their taxonomic rankings failed priority and other useful taxon-naming criteria. Consequently, we propose simplified aedine generic designations that 1) restore a classification system that is useful for the operational community; 2) enhance the ability of taxonomists to accurately place new species into genera; 3) maintain the progress toward a natural classification based on monophyletic groups of species; and 4) correct the current classification system that is subject to instability as new species are described and existing species more thoroughly defined. We do not challenge the phylogenetic hypotheses generated by the above-mentioned series of morphological studies. However, we reduce the ranks of the genera and subgenera of RH&K to subgenera or informal species groups, respectively, to preserve stability as new data become available. PMID:26226613

  20. Phylogenetic inference of Indian malaria vectors from multilocus DNA sequences.

    PubMed

    Dixit, Jyotsana; Srivastava, Hemlata; Sharma, Meenu; Das, Manoj K; Singh, O P; Raghavendra, K; Nanda, Nutan; Dash, Aditya P; Saksena, D N; Das, Aparup

    2010-08-01

    Inferences on the taxonomic positions, phylogenetic interrelationships and divergence time among closely related species of medical importance is essential to understand evolutionary patterns among species, and based on which, disease control measures could be devised. To this respect, malaria is one of the important mosquito borne diseases of tropical and sub-tropical parts of the globe. Taxonomic status of malaria vectors has been so far documented based on morphological, cytological and few molecular genetic features. However, utilization of multilocus DNA sequences in phylogenetic inferences are still in dearth. India contains one of the richest resources of mosquito species diversity but little molecular taxonomic information is available in Indian malaria vectors. We herewith utilized the whole genome sequence information of An. gambiae to amplify and sequence three orthologous nuclear genetic regions in six Indian malaria vector species (An. culicifacies, An. minimus, An. sundaicus, An. fluviatilis, An. annularis and An. stephensi). Further, we utilized the previously published DNA sequence information on the COII and ITS2 genes in all the six species, making the total number of loci to five. Multilocus molecular phylogenetic study of Indian anophelines and An. gambiae was conducted at each individual genetic region using Neighbour Joining (NJ), Maximum Likelihood (ML), Maximum Parsimony (MP) and Bayesian approaches. Although tree topologies with COII, and ITS2 genes were similar, for no other three genetic regions similar tree topologies were observed. In general, the reconstructed phylogenetic status of Indian malaria vectors follows the pattern based on morphological and cytological classifications that was reconfirmed with COII and ITS2 genetic regions. Further, divergence times based on COII gene sequences were estimated among the seven Anopheles species which corroborate the earlier hypothesis on the radiation of different species of the Anopheles genus during the late Cretaceous period. PMID:20435167

  1. Phylogenetic Conservatism in Plant Phenology

    NASA Technical Reports Server (NTRS)

    Davies, T. Jonathan; Wolkovich, Elizabeth M.; Kraft, Nathan J. B.; Salamin, Nicolas; Allen, Jenica M.; Ault, Toby R.; Betancourt, Julio L.; Bolmgren, Kjell; Cleland, Elsa E.; Cook, Benjamin I.; Crimmins, Theresa M.; Mazer, Susan J.; McCabe, Gregory J.; Pau, Stephanie; Regetz, Jim; Schwartz, Mark D.; Travers, Steven E.

    2013-01-01

    Phenological events defined points in the life cycle of a plant or animal have been regarded as highly plastic traits, reflecting flexible responses to various environmental cues. The ability of a species to track, via shifts in phenological events, the abiotic environment through time might dictate its vulnerability to future climate change. Understanding the predictors and drivers of phenological change is therefore critical. Here, we evaluated evidence for phylogenetic conservatism the tendency for closely related species to share similar ecological and biological attributes in phenological traits across flowering plants. We aggregated published and unpublished data on timing of first flower and first leaf, encompassing 4000 species at 23 sites across the Northern Hemisphere. We reconstructed the phylogeny for the set of included species, first, using the software program Phylomatic, and second, from DNA data. We then quantified phylogenetic conservatism in plant phenology within and across sites. We show that more closely related species tend to flower and leaf at similar times. By contrasting mean flowering times within and across sites, however, we illustrate that it is not the time of year that is conserved, but rather the phenological responses to a common set of abiotic cues. Our findings suggest that species cannot be treated as statistically independent when modelling phenological responses.Closely related species tend to resemble each other in the timing of their life-history events, a likely product of evolutionarily conserved responses to environmental cues. The search for the underlying drivers of phenology must therefore account for species' shared evolutionary histories.

  2. Multipolar consensus for phylogenetic trees.

    PubMed

    Bonnard, Cécile; Berry, Vincent; Lartillot, Nicolas

    2006-10-01

    Collections of phylogenetic trees are usually summarized using consensus methods. These methods build a single tree, supposed to be representative of the collection. However, in the case of heterogeneous collections of trees, the resulting consensus may be poorly resolved (strict consensus, majority-rule consensus, ...), or may perform arbitrary choices among mutually incompatible clades, or splits (greedy consensus). Here, we propose an alternative method, which we call the multipolar consensus (MPC). Its aim is to display all the splits having a support above a predefined threshold, in a minimum number of consensus trees, or poles. We show that the problem is equivalent to a graph-coloring problem, and propose an implementation of the method. Finally, we apply the MPC to real data sets. Our results indicate that, typically, all the splits down to a weight of 10% can be displayed in no more than 4 trees. In addition, in some cases, biologically relevant secondary signals, which would not have been present in any of the classical consensus trees, are indeed captured by our method, indicating that the MPC provides a convenient exploratory method for phylogenetic analysis. The method was implemented in a package freely available at http://www.lirmm.fr/~cbonnard/MPC.html PMID:17060203

  3. Automated analysis of phylogenetic clusters

    PubMed Central

    2013-01-01

    Background As sequence data sets used for the investigation of pathogen transmission patterns increase in size, automated tools and standardized methods for cluster analysis have become necessary. We have developed an automated Cluster Picker which identifies monophyletic clades meeting user-input criteria for bootstrap support and maximum genetic distance within large phylogenetic trees. A second tool, the Cluster Matcher, automates the process of linking genetic data to epidemiological or clinical data, and matches clusters between runs of the Cluster Picker. Results We explore the effect of different bootstrap and genetic distance thresholds on clusters identified in a data set of publicly available HIV sequences, and compare these results to those of a previously published tool for cluster identification. To demonstrate their utility, we then use the Cluster Picker and Cluster Matcher together to investigate how clusters in the data set changed over time. We find that clusters containing sequences from more than one UK location at the first time point (multiple origin) were significantly more likely to grow than those representing only a single location. Conclusions The Cluster Picker and Cluster Matcher can rapidly process phylogenetic trees containing tens of thousands of sequences. Together these tools will facilitate comparisons of pathogen transmission dynamics between studies and countries. PMID:24191891

  4. Neuromuscular disease classification system

    NASA Astrophysics Data System (ADS)

    Sáez, Aurora; Acha, Begoña; Montero-Sánchez, Adoración; Rivas, Eloy; Escudero, Luis M.; Serrano, Carmen

    2013-06-01

    Diagnosis of neuromuscular diseases is based on subjective visual assessment of biopsies from patients by the pathologist specialist. A system for objective analysis and classification of muscular dystrophies and neurogenic atrophies through muscle biopsy images of fluorescence microscopy is presented. The procedure starts with an accurate segmentation of the muscle fibers using mathematical morphology and a watershed transform. A feature extraction step is carried out in two parts: 24 features that pathologists take into account to diagnose the diseases and 58 structural features that the human eye cannot see, based on the assumption that the biopsy is considered as a graph, where the nodes are represented by each fiber, and two nodes are connected if two fibers are adjacent. A feature selection using sequential forward selection and sequential backward selection methods, a classification using a Fuzzy ARTMAP neural network, and a study of grading the severity are performed on these two sets of features. A database consisting of 91 images was used: 71 images for the training step and 20 as the test. A classification error of 0% was obtained. It is concluded that the addition of features undetectable by the human visual inspection improves the categorization of atrophic patterns.

  5. Neuromuscular disease classification system.

    PubMed

    Sáez, Aurora; Acha, Begoña; Montero-Sánchez, Adoración; Rivas, Eloy; Escudero, Luis M; Serrano, Carmen

    2013-06-01

    Diagnosis of neuromuscular diseases is based on subjective visual assessment of biopsies from patients by the pathologist specialist. A system for objective analysis and classification of muscular dystrophies and neurogenic atrophies through muscle biopsy images of fluorescence microscopy is presented. The procedure starts with an accurate segmentation of the muscle fibers using mathematical morphology and a watershed transform. A feature extraction step is carried out in two parts: 24 features that pathologists take into account to diagnose the diseases and 58 structural features that the human eye cannot see, based on the assumption that the biopsy is considered as a graph, where the nodes are represented by each fiber, and two nodes are connected if two fibers are adjacent. A feature selection using sequential forward selection and sequential backward selection methods, a classification using a Fuzzy ARTMAP neural network, and a study of grading the severity are performed on these two sets of features. A database consisting of 91 images was used: 71 images for the training step and 20 as the test. A classification error of 0% was obtained. It is concluded that the addition of features undetectable by the human visual inspection improves the categorization of atrophic patterns. PMID:23804164

  6. Virus classification in 60-dimensional protein space.

    PubMed

    Li, Yongkun; Tian, Kun; Yin, Changchuan; He, Rong Lucy; Yau, Stephen S-T

    2016-06-01

    Due to vast sequence divergence among different viral groups, sequence alignment is not directly applicable to genome-wide comparative analysis of viruses. More and more attention has been paid to alignment-free methods for whole genome comparison and phylogenetic tree reconstruction. Among alignment-free methods, the recently proposed "Natural Vector (NV) representation" has successfully been used to study the phylogeny of multi-segmented viruses based on a 12-dimensional genome space derived from the nucleotide sequence structure. But the preference of proteomes over genomes for the determination of viral phylogeny was not deeply investigated. As the translated products of genes, proteins directly form the shape of viral structure and are vital for all metabolic pathways. In this study, using the NV representation of a protein sequence along with the Hausdorff distance suitable to compare point sets, we construct a 60-dimensional protein space to analyze the evolutionary relationships of 4021 viruses by whole-proteomes in the current NCBI Reference Sequence Database (RefSeq). We also take advantage of the previously developed natural graphical representation to recover viral phylogeny. Our results demonstrate that the proposed method is efficient and accurate for classifying viruses. The accuracy rates of our predictions such as for Baltimore II viruses are as high as 95.9% for family labels, 95.7% for subfamily labels and 96.5% for genus labels. Finally, we discover that proteomes lead to better viral classification when reliable protein sequences are abundant. In other cases, the accuracy rates using proteomes are still comparable to that of genomes. PMID:26988414

  7. New Phylogenetic Groups of Torque Teno Virus Identified in Eastern Taiwan Indigenes

    PubMed Central

    Hsiao, Kuang-Liang; Wang, Li-Yu; Lin, Chiung-Ling; Liu, Hsin-Fu

    2016-01-01

    Torque teno virus (TTV) is a single-stranded DNA virus highly prevalent in the world. It has been detected in eastern Taiwan indigenes with a low prevalence of 11% by using N22 region of which known to underestimate TTV prevalence excessively. In order to clarify their realistic epidemiology, we re-analyzed TTV prevalence with UTR region. One hundred and forty serum samples from eastern Taiwanese indigenous population were collected and TTV DNA was detected in 133 (95%) samples. Direct sequencing revealed an extensive mix-infection of different TTV strains within the infected individual. Entire TTV open reading frame 1 was amplified and cloned from a TTV positive individual to distinguish mix-infected strains. Phylogenetic analysis showed eleven isolates were clustered into a monophyletic group that is distinct from all known groups. In addition, another our isolate was clustered with recently described Hebei-1 strain and formed an independent clade. Based on the distribution pattern of pairwise distances, both new clusters were placed at phylogenetic group level, designed as the 6th and 7th phylogenetic group. In present study, we showed a very high prevalence of TTV infection in eastern Taiwan indigenes and indentified new phylogenetic groups from the infected individual. Both intra- and inter-phylogenetic group mix-infections can be found from one healthy person. Our study has further broadened the field of human TTVs and proposed a robust criterion for classification of the major TTV phylogenetic groups. PMID:26901643

  8. Reanalysis and Simulation Suggest a Phylogenetic Microarray Does Not Accurately Profile Microbial Communities

    PubMed Central

    Midgley, David J.; Greenfield, Paul; Shaw, Janet M.; Oytam, Yalchin; Li, Dongmei; Kerr, Caroline A.; Hendry, Philip

    2012-01-01

    The second generation (G2) PhyloChip is designed to detect over 8700 bacteria and archaeal and has been used over 50 publications and conference presentations. Many of those publications reveal that the PhyloChip measures of species richness greatly exceed statistical estimates of richness based on other methods. An examination of probes downloaded from Greengenes suggested that the system may have the potential to distort the observed community structure. This may be due to the sharing of probes by taxa; more than 21% of the taxa in that downloaded data have no unique probes. In-silico simulations using these data showed that a population of 64 taxa representing a typical anaerobic subterranean community returned 96 different taxa, including 15 families incorrectly called present and 19 families incorrectly called absent. A study of nasal and oropharyngeal microbial communities by Lemon et al (2010) found some 1325 taxa using the G2 PhyloChip, however, about 950 of these taxa have, in the downloaded data, no unique probes and cannot be definitively called present. Finally, data from Brodie et al (2007), when re-examined, indicate that the abundance of the majority of detected taxa, are highly correlated with one another, suggesting that many probe sets do not act independently. Based on our analyses of downloaded data, we conclude that outputs from the G2 PhyloChip should be treated with some caution, and that the presence of taxa represented solely by non-unique probes be independently verified. PMID:22457798

  9. Multisensor classification of sedimentary rocks

    NASA Technical Reports Server (NTRS)

    Evans, Diane

    1988-01-01

    A comparison is made between linear discriminant analysis and supervised classification results based on signatures from the Landsat TM, the Thermal Infrared Multispectral Scanner (TIMS), and airborne SAR, alone and combined into extended spectral signatures for seven sedimentary rock units exposed on the margin of the Wind River Basin, Wyoming. Results from a linear discriminant analysis showed that training-area classification accuracies based on the multisensor data were improved an average of 15 percent over TM alone, 24 percent over TIMS alone, and 46 percent over SAR alone, with similar improvement resulting when supervised multisensor classification maps were compared to supervised, individual sensor classification maps. When training area signatures were used to map spectrally similar materials in an adjacent area, the average classification accuracy improved 19 percent using the multisensor data over TM alone, 2 percent over TIMS alone, and 11 percent over SAR alone. It is concluded that certain sedimentary lithologies may be accurately mapped using a single sensor, but classification of a variety of rock types can be improved using multisensor data sets that are sensitive to different characteristics such as mineralogy and surface roughness.

  10. Zooflagellate phylogeny and classification.

    PubMed

    Cavalier-Smith, T

    1995-01-01

    Zooflagellates are non-photosynthetic flagellates without plastids or cell walls which feed by phagocytosis or endocytosis. They are the most diverse of all eukaryotes and gave rise directly or indirectly to most, if not all, other groups of eukaryotes. They are here classified into thirteen or fourteen phyla, spread across four of the seven eukaryote kingdoms that I now recognize: (1) the probably primitively amitochondrial and entirely non-photosynthetic Archezoa; (2) the usually aerobic but predominantly non-photosynthetic Protozoa; (3) the always aerobic and usually photosynthetic Cryptista; (4) the always aerobic and predominantly photosynthetic Chromista. Whether the few non-photosynthetic haptophytes also lack plastids and thus are zooflagellates in the present sense is unclear. Six phyla (Archamoebae and Metamonada within the Archezoa; Percolozoa, Parabasala, Opalozoa, and Choanozoa within the Protozoa) consist largely or entirely of zooflagellates. One protozoan phylum (Euglenozoa) consists predominantly of zooflagellate families and genera, with a minority only of phytoflagellate genera: the photosynthetic euglenoids are probably all descended from a non-photosynthetic euglenoid which acquired a photosynthetic endosymblont related to the ancestor of green algae. In the phylum Dinozoa (i.e. dinoflagellates and protalveolates) most classes consist purely of zooflagellates, but the majority of species are photosynthetic. The photosynthetic chlorarachneans are related to the sarcomonad zooflagellates and to the filose amoebae, so that the classes Chlorarachnea and Sarcomonadea are now placed in the phylum Rhizopoda, which is also modified by segregating the lobose amoebae as the phylum Amoebozoa. Although most zooflagellates are primitively without photosynthesis, there is good molecular evidence for the secondary origin of the zooflagellate condition by the loss of plastids in the case of the colourless pedinellids. A classification of 62 orders including zooflagellates grouped into 36 classes consisting primarily of zooflagellates, and four classes containing a few zooflagellates is presented; the ultrastructural and molecular evidence for the phylogenetic ideas underlying the classification is summarized. PMID:8868448

  11. Classification of Bacteria and Archaea: past, present and future.

    PubMed

    Schleifer, Karl Heinz

    2009-12-01

    The late 19th century was the beginning of bacterial taxonomy and bacteria were classified on the basis of phenotypic markers. The distinction of prokaryotes and eukaryotes was introduced in the 1960s. Numerical taxonomy improved phenotypic identification but provided little information on the phylogenetic relationships of prokaryotes. Later on, chemotaxonomic and genotypic methods were widely used for a more satisfactory classification. Archaea were first classified as a separate group of prokaryotes in 1977. The current classification of Bacteria and Archaea is based on an operational-based model, the so-called polyphasic approach, comprised of phenotypic, chemotaxonomic and genotypic data, as well as phylogenetic information. The provisional status Candidatus has been established for describing uncultured prokaryotic cells for which their phylogenetic relationship has been determined and their authenticity revealed by in situ probing. The ultimate goal is to achieve a theory-based classification system based on a phylogenetic/evolutionary concept. However, there are currently two contradictory opinions about the future classification of Bacteria and Archaea. A group of mostly molecular biologists posits that the yet-unclear effect of gene flow, in particular lateral gene transfer, makes the line of descent difficult, if not impossible, to describe. However, even in the face of genomic fluidity it seems that the typical geno- and phenotypic characteristics of a taxon are still maintained, and are sufficient for reliable classification and identification of Bacteria and Archaea. There are many well-defined genotypic clusters that are congruent with known species delineated by polyphasic approaches. Comparative sequence analysis of certain core genes, including rRNA genes, may be useful for the characterization of higher taxa, whereas various character genes may be suitable as phylogenetic markers for the delineation of lower taxa. Nevertheless, there may still be a few organisms which escape a reliable classification. PMID:19819658

  12. Phylogenetic Signal Dissection Identifies the Root of Starfishes

    PubMed Central

    Feuda, Roberto; Smith, Andrew B.

    2015-01-01

    Relationships within the class Asteroidea have remained controversial for almost 100 years and, despite many attempts to resolve this problem using molecular data, no consensus has yet emerged. Using two nuclear genes and a taxon sampling covering the major asteroid clades we show that non-phylogenetic signal created by three factors - Long Branch Attraction, compositional heterogeneity and the use of poorly fitting models of evolution – have confounded accurate estimation of phylogenetic relationships. To overcome the effect of this non-phylogenetic signal we analyse the data using non-homogeneous models, site stripping and the creation of subpartitions aimed to reduce or amplify the systematic error, and calculate Bayes Factor support for a selection of previously suggested topological arrangements of asteroid orders. We show that most of the previous alternative hypotheses are not supported in the most reliable data partitions, including the previously suggested placement of either Forcipulatida or Paxillosida as sister group to the other major branches. The best-supported solution places Velatida as the sister group to other asteroids, and the implications of this finding for the morphological evolution of asteroids are presented. PMID:25955729

  13. Phylogenetic signal dissection identifies the root of starfishes.

    PubMed

    Feuda, Roberto; Smith, Andrew B

    2015-01-01

    Relationships within the class Asteroidea have remained controversial for almost 100 years and, despite many attempts to resolve this problem using molecular data, no consensus has yet emerged. Using two nuclear genes and a taxon sampling covering the major asteroid clades we show that non-phylogenetic signal created by three factors--Long Branch Attraction, compositional heterogeneity and the use of poorly fitting models of evolution--have confounded accurate estimation of phylogenetic relationships. To overcome the effect of this non-phylogenetic signal we analyse the data using non-homogeneous models, site stripping and the creation of subpartitions aimed to reduce or amplify the systematic error, and calculate Bayes Factor support for a selection of previously suggested topological arrangements of asteroid orders. We show that most of the previous alternative hypotheses are not supported in the most reliable data partitions, including the previously suggested placement of either Forcipulatida or Paxillosida as sister group to the other major branches. The best-supported solution places Velatida as the sister group to other asteroids, and the implications of this finding for the morphological evolution of asteroids are presented. PMID:25955729

  14. LABEL: Fast and Accurate Lineage Assignment with Assessment of H5N1 and H9N2 Influenza A Hemagglutinins

    PubMed Central

    Shepard, Samuel S.; Davis, C. Todd; Bahl, Justin; Rivailler, Pierre; York, Ian A.; Donis, Ruben O.

    2014-01-01

    The evolutionary classification of influenza genes into lineages is a first step in understanding their molecular epidemiology and can inform the subsequent implementation of control measures. We introduce a novel approach called Lineage Assignment By Extended Learning (LABEL) to rapidly determine cladistic information for any number of genes without the need for time-consuming sequence alignment, phylogenetic tree construction, or manual annotation. Instead, LABEL relies on hidden Markov model profiles and support vector machine training to hierarchically classify gene sequences by their similarity to pre-defined lineages. We assessed LABEL by analyzing the annotated hemagglutinin genes of highly pathogenic (H5N1) and low pathogenicity (H9N2) avian influenza A viruses. Using the WHO/FAO/OIE H5N1 evolution working group nomenclature, the LABEL pipeline quickly and accurately identified the H5 lineages of uncharacterized sequences. Moreover, we developed an updated clade nomenclature for the H9 hemagglutinin gene and show a similarly fast and reliable phylogenetic assessment with LABEL. While this study was focused on hemagglutinin sequences, LABEL could be applied to the analysis of any gene and shows great potential to guide molecular epidemiology activities, accelerate database annotation, and provide a data sorting tool for other large-scale bioinformatic studies. PMID:24466291

  15. Re-thinking the classification of corticioid fungi.

    PubMed

    Larsson, Karl-Henrik

    2007-09-01

    Corticioid fungi are basidiomycetes with effused basidiomata, a smooth, merulioid or hydnoid hymenophore, and holobasidia. These fungi used to be classified as a single family, Corticiaceae, but molecular phylogenetic analyses have shown that corticioid fungi are distributed among all major clades within Agaricomycetes. There is a relative consensus concerning the higher order classification of basidiomycetes down to order. This paper presents a phylogenetic classification for corticioid fungi at the family level. Fifty putative families were identified from published phylogenies and preliminary analyses of unpublished sequence data. A dataset with 178 terminal taxa was compiled and subjected to phylogenetic analyses using MP and Bayesian inference. From the analyses, 41 strongly supported and three unsupported clades were identified. These clades are treated as families in a Linnean hierarchical classification and each family is briefly described. Three additional families not covered by the phylogenetic analyses are also included in the classification. All accepted corticioid genera are either referred to one of the families or listed as incertae sedis. PMID:17981020

  16. Demonstrating Biological Classification Using a Simulation of Natural Taxa.

    ERIC Educational Resources Information Center

    Vogt, Kenneth D.

    1995-01-01

    A review of introductory college level and high school biology texts reveals that concepts and theories behind classification are usually poorly discussed. Suggests ways in which card games can be used to teach differences between the phenetic and phylogenetic approaches. (LZ)

  17. Propionibacterium acnes Types I and II Represent Phylogenetically Distinct Groups

    PubMed Central

    McDowell, Andrew; Valanne, Susanna; Ramage, Gordon; Tunney, Michael M.; Glenn, Josephine V.; McLorinan, Gregory C.; Bhatia, Ajay; Maisonneuve, Jean-Francois; Lodes, Michael; Persing, David H.; Patrick, Sheila

    2005-01-01

    Although two phenotypes of the opportunistic pathogen Propionibacterium acnes (types I and II) have been described, epidemiological investigations of their roles in different infections have not been widely reported. Using immunofluorescence microscopy with monoclonal antibodies (MAbs) QUBPa1 and QUBPa2, specific for types I and II, respectively, we investigated the prevalences of the two types among 132 P. acnes isolates. Analysis of isolates from failed prosthetic hip implants (n = 40) revealed approximately equal numbers of type I and II organisms. Isolates from failed prosthetic hip-associated bone (n = 6) and tissue (n = 38) samples, as well as isolates from acne (n = 22), dental infections (n = 8), and skin removed during surgical incision (n = 18) were predominately of type I. A total of 11 (8%) isolates showed atypical MAb labeling and could not be conclusively identified. Phylogenetic analysis of P. acnes by nucleotide sequencing revealed the 16S rRNA gene to be highly conserved between types I and II. In contrast, sequence analysis of recA and a putative hemolysin gene (tly) revealed significantly greater type-specific polymorphisms that corresponded to phylogenetically distinct cluster groups. All 11 isolates with atypical MAb labeling were identified as type I by sequencing. Within the recA and tly phylogenetic trees, nine of these isolates formed a cluster distinct from other type I organisms, suggesting a further phylogenetic subdivision within type I. Our study therefore demonstrates that the phenotypic differences between P. acnes types I and II reflect deeper differences in their phylogeny. Furthermore, nucleotide sequencing provides an accurate method for identifying the type status of P. acnes isolates. PMID:15634990

  18. SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations

    PubMed Central

    2013-01-01

    Background Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. Results In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. Conclusion We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets. PMID:24229408

  19. Classification in Australia.

    ERIC Educational Resources Information Center

    McKinlay, John

    Despite some inroads by the Library of Congress Classification and short-lived experimentation with Universal Decimal Classification and Bliss Classification, Dewey Decimal Classification, with its ability in recent editions to be hospitable to local needs, remains the most widely used classification system in Australia. Although supplemented at…

  20. Phylogenetic fields of species: cross-species patterns of phylogenetic structure and geographical coexistence.

    PubMed

    Villalobos, Fabricio; Rangel, Thiago F; Diniz-Filho, José Alexandre F

    2013-04-01

    Differential coexistence among species underlies geographical patterns of biodiversity. Understanding such patterns has relied either on ecological or historical approaches applied separately. Recently, macroecology and community phylogenetics have tried to integrate both ecological and historical approaches. However, macroecology is mostly non-phylogenetic, whereas community phylogenetics is largely focused on local scales. Here, we propose a conceptual framework to link macroecology and community phylogenetics by exploring the evolutionary context of large-scale species coexistence, introducing the phylogenetic field concept. This is defined as the phylogenetic structure of species co-occurrence within a focal species' geographical range. We developed concepts and methods for analysing phylogenetic fields and applied them to study coexistence patterns of the bat family Phyllostomidae. Our analyses showed that phyllostomid bats coexist mostly with closely related species, revealing a north-south gradient from overdispersed to clustered phylogenetic fields. Patterns at different phylogenetic levels (i.e. all species versus close relatives only) presented the same gradient. Results support the tropical niche conservatism hypothesis, potentially mediated by higher speciation rates in the region of origin coupled with shared environmental preferences among species. The phylogenetic field approach enables species-based community phylogenetics, instead of those that are site-based, allowing the description of historical processes at more appropriate macroecological and biogeographic scales. PMID:23390100

  1. Phylogenetic fields of species: cross-species patterns of phylogenetic structure and geographical coexistence

    PubMed Central

    Villalobos, Fabricio; Rangel, Thiago F.; Diniz-Filho, José Alexandre F.

    2013-01-01

    Differential coexistence among species underlies geographical patterns of biodiversity. Understanding such patterns has relied either on ecological or historical approaches applied separately. Recently, macroecology and community phylogenetics have tried to integrate both ecological and historical approaches. However, macroecology is mostly non-phylogenetic, whereas community phylogenetics is largely focused on local scales. Here, we propose a conceptual framework to link macroecology and community phylogenetics by exploring the evolutionary context of large-scale species coexistence, introducing the phylogenetic field concept. This is defined as the phylogenetic structure of species co-occurrence within a focal species' geographical range. We developed concepts and methods for analysing phylogenetic fields and applied them to study coexistence patterns of the bat family Phyllostomidae. Our analyses showed that phyllostomid bats coexist mostly with closely related species, revealing a north–south gradient from overdispersed to clustered phylogenetic fields. Patterns at different phylogenetic levels (i.e. all species versus close relatives only) presented the same gradient. Results support the tropical niche conservatism hypothesis, potentially mediated by higher speciation rates in the region of origin coupled with shared environmental preferences among species. The phylogenetic field approach enables species-based community phylogenetics, instead of those that are site-based, allowing the description of historical processes at more appropriate macroecological and biogeographic scales. PMID:23390100

  2. Use of whole genome sequences to develop a molecular phylogenetic framework for Rhodococcus fascians and the Rhodococcus genus.

    PubMed

    Creason, Allison L; Davis, Edward W; Putnam, Melodie L; Vandeputte, Olivier M; Chang, Jeff H

    2014-01-01

    The accurate diagnosis of diseases caused by pathogenic bacteria requires a stable species classification. Rhodococcus fascians is the only documented member of its ill-defined genus that is capable of causing disease on a wide range of agriculturally important plants. Comparisons of genome sequences generated from isolates of Rhodococcus associated with diseased plants revealed a level of genetic diversity consistent with them representing multiple species. To test this, we generated a tree based on more than 1700 homologous sequences from plant-associated isolates of Rhodococcus, and obtained support from additional approaches that measure and cluster based on genome similarities. Results were consistent in supporting the definition of new Rhodococcus species within clades containing phytopathogenic members. We also used the genome sequences, along with other rhodococcal genome sequences to construct a molecular phylogenetic tree as a framework for resolving the Rhodococcus genus. Results indicated that Rhodococcus has the potential for having 20 species and also confirmed a need to revisit the taxonomic groupings within Rhodococcus. PMID:25237311

  3. Use of whole genome sequences to develop a molecular phylogenetic framework for Rhodococcus fascians and the Rhodococcus genus

    PubMed Central

    Creason, Allison L.; Davis, Edward W.; Putnam, Melodie L.; Vandeputte, Olivier M.; Chang, Jeff H.

    2014-01-01

    The accurate diagnosis of diseases caused by pathogenic bacteria requires a stable species classification. Rhodococcus fascians is the only documented member of its ill-defined genus that is capable of causing disease on a wide range of agriculturally important plants. Comparisons of genome sequences generated from isolates of Rhodococcus associated with diseased plants revealed a level of genetic diversity consistent with them representing multiple species. To test this, we generated a tree based on more than 1700 homologous sequences from plant-associated isolates of Rhodococcus, and obtained support from additional approaches that measure and cluster based on genome similarities. Results were consistent in supporting the definition of new Rhodococcus species within clades containing phytopathogenic members. We also used the genome sequences, along with other rhodococcal genome sequences to construct a molecular phylogenetic tree as a framework for resolving the Rhodococcus genus. Results indicated that Rhodococcus has the potential for having 20 species and also confirmed a need to revisit the taxonomic groupings within Rhodococcus. PMID:25237311

  4. Molecular phylogenetics and character evolution of morphologically diverse groups, Dendrobium section Dendrobium and allies

    PubMed Central

    Takamiya, Tomoko; Wongsawad, Pheravut; Sathapattayanon, Apirada; Tajima, Natsuko; Suzuki, Shunichiro; Kitamura, Saki; Shioda, Nao; Handa, Takashi; Kitanaka, Susumu; Iijima, Hiroshi; Yukawa, Tomohisa

    2014-01-01

    It is always difficult to construct coherent classification systems for plant lineages having diverse morphological characters. The genus Dendrobium, one of the largest genera in the Orchidaceae, includes ∼1100 species, and enormous morphological diversification has hindered the establishment of consistent classification systems covering all major groups of this genus. Given the particular importance of species in Dendrobium section Dendrobium and allied groups as floriculture and crude drug genetic resources, there is an urgent need to establish a stable classification system. To clarify phylogenetic relationships in Dendrobium section Dendrobium and allied groups, we analysed the macromolecular characters of the group. Phylogenetic analyses of 210 taxa of Dendrobium were conducted on DNA sequences of internal transcribed spacer (ITS) regions of 18S–26S nuclear ribosomal DNA and the maturase-coding gene (matK) located in an intron of the plastid gene trnK using maximum parsimony and Bayesian methods. The parsimony and Bayesian analyses revealed 13 distinct clades in the group comprising section Dendrobium and its allied groups. Results also showed paraphyly or polyphyly of sections Amblyanthus, Aporum, Breviflores, Calcarifera, Crumenata, Dendrobium, Densiflora, Distichophyllae, Dolichocentrum, Holochrysa, Oxyglossum and Pedilonum. On the other hand, the monophyly of section Stachyobium was well supported. It was found that many of the morphological characters that have been believed to reflect phylogenetic relationships are, in fact, the result of convergence. As such, many of the sections that have been recognized up to this point were found to not be monophyletic, so recircumscription of sections is required. PMID:25107672

  5. Molecular and morphological analyses reveal phylogenetic relationships of stingrays focusing on the family Dasyatidae (Myliobatiformes).

    PubMed

    Lim, Kean Chong; Lim, Phaik-Eem; Chong, Ving Ching; Loh, Kar-Hoe

    2015-01-01

    Elucidating the phylogenetic relationships of the current but problematic Dasyatidae (Order Myliobatiformes) was the first priority of the current study. Here, we studied three molecular gene markers of 43 species (COI gene), 33 species (ND2 gene) and 34 species (RAG1 gene) of stingrays to draft out the phylogenetic tree of the order. Nine character states were identified and used to confirm the molecularly constructed phylogenetic trees. Eight or more clades (at different hierarchical level) were identified for COI, ND2 and RAG1 genes in the Myliobatiformes including four clades containing members of the present Dasyatidae, thus rendering the latter non-monophyletic. The uncorrected p-distance between these four 'Dasytidae' clades when compared to the distance between formally known families confirmed that these four clades should be elevated to four separate families. We suggest a revision of the present classification, retaining the Dasyatidae (Dasyatis and Taeniurops species) but adding three new families namely, Neotrygonidae (Neotrygon and Taeniura species), Himanturidae (Himantura species) and Pastinachidae (Pastinachus species). Our result indicated the need to further review the classification of Dasyatis microps. By resolving the non-monophyletic problem, the suite of nine character states enables the natural classification of the Myliobatiformes into at least thirteen families based on morphology. PMID:25867639

  6. Molecular and Morphological Analyses Reveal Phylogenetic Relationships of Stingrays Focusing on the Family Dasyatidae (Myliobatiformes)

    PubMed Central

    Lim, Kean Chong; Lim, Phaik-Eem; Chong, Ving Ching; Loh, Kar-Hoe

    2015-01-01

    Elucidating the phylogenetic relationships of the current but problematic Dasyatidae (Order Myliobatiformes) was the first priority of the current study. Here, we studied three molecular gene markers of 43 species (COI gene), 33 species (ND2 gene) and 34 species (RAG1 gene) of stingrays to draft out the phylogenetic tree of the order. Nine character states were identified and used to confirm the molecularly constructed phylogenetic trees. Eight or more clades (at different hierarchical level) were identified for COI, ND2 and RAG1 genes in the Myliobatiformes including four clades containing members of the present Dasyatidae, thus rendering the latter non-monophyletic. The uncorrected p-distance between these four ‘Dasytidae’ clades when compared to the distance between formally known families confirmed that these four clades should be elevated to four separate families. We suggest a revision of the present classification, retaining the Dasyatidae (Dasyatis and Taeniurops species) but adding three new families namely, Neotrygonidae (Neotrygon and Taeniura species), Himanturidae (Himantura species) and Pastinachidae (Pastinachus species). Our result indicated the need to further review the classification of Dasyatis microps. By resolving the non-monophyletic problem, the suite of nine character states enables the natural classification of the Myliobatiformes into at least thirteen families based on morphology. PMID:25867639

  7. Synopsis of Trichosanthes (Cucurbitaceae) based on recent molecular phylogenetic data

    PubMed Central

    de Boer, Hugo J.; Thulin, Mats

    2012-01-01

    Abstract The snake gourd genus, Trichosanthes, is the largest genus in the Cucurbitaceae family, with over 90 species. Recent molecular phylogenetic data have indicated that the genus Gymnopetalum is to be merged with Trichosanthes to maintain monophyly. A revised infrageneric classification of Trichosanthes including Gymnopetalum is proposed with two subgenera, (I) subg. Scotanthus comb. nov. and (II) subg. Trichosanthes, eleven sections, (i) sect. Asterospermae, (ii) sect. Cucumeroides, (iii) sect. Edulis, (iv) sect. Foliobracteola, (v) sect. Gymnopetalum, (vi) sect. Involucraria, (vii) sect. Pseudovariifera sect. nov., (viii) sect. Villosae stat. nov., (ix) sect. Trichosanthes, (x) sect. Tripodanthera, and (xi) sect. Truncata. A synopsis of Trichosanthes with the 91 species recognized here is presented, including four new combinations, Trichosanthes orientalis, Trichosanthes tubiflora, Trichosanthes scabra var. pectinata, Trichosanthes scabra var. penicaudii, and a clarified nomenclature of Trichosanthes costata and Trichosanthes scabra. PMID:22645411

  8. Phylogenetic mapping of bacterial morphology

    NASA Technical Reports Server (NTRS)

    Siefert, J. L.; Fox, G. E.

    1998-01-01

    The availability of a meaningful molecular phylogeny for bacteria provides a context for examining the historical significance of various developments in bacterial evolution. Herein, the classical morphological descriptions of selected members of the domain Bacteria are mapped upon the genealogical ancestry deduced from comparison of small-subunit rRNA sequences. For the species examined in this study, a distinct pattern emerges which indicates that the coccus shape has arisen and accumulated independently multiple times in separate lineages and typically survived as a persistent end-state morphology. At least two other morphologies persist but have evolved only once. This study demonstrates that although bacterial morphology is not useful in defining bacterial phylogeny, it is remarkably consistent with that phylogeny once it is known. An examination of the experimental evidence available for morphogenesis as well as microbial fossil evidence corroborates these findings. It is proposed that the accumulation of persistent morphologies is a result of the biophysical properties of peptidoglycan and their genetic control, and that an evolved body-plan strategy based on peptidoglycan may have been a fate-sealing step in the evolution of Bacteria. More generally, this study illustrates that significant evolutionary insights can be obtained by examining biological and biochemical data in the context of a reliable phylogenetic structure.

  9. Classifying the bacterial gut microbiota of termites and cockroaches: A curated phylogenetic reference database (DictDb).

    PubMed

    Mikaelyan, Aram; Köhler, Tim; Lampert, Niclas; Rohland, Jeffrey; Boga, Hamadi; Meuser, Katja; Brune, Andreas

    2015-10-01

    Recent developments in sequencing technology have given rise to a large number of studies that assess bacterial diversity and community structure in termite and cockroach guts based on large amplicon libraries of 16S rRNA genes. Although these studies have revealed important ecological and evolutionary patterns in the gut microbiota, classification of the short sequence reads is limited by the taxonomic depth and resolution of the reference databases used in the respective studies. Here, we present a curated reference database for accurate taxonomic analysis of the bacterial gut microbiota of dictyopteran insects. The Dictyopteran gut microbiota reference Database (DictDb) is based on the Silva database but was significantly expanded by the addition of clones from 11 mostly unexplored termite and cockroach groups, which increased the inventory of bacterial sequences from dictyopteran guts by 26%. The taxonomic depth and resolution of DictDb was significantly improved by a general revision of the taxonomic guide tree for all important lineages, including a detailed phylogenetic analysis of the Treponema and Alistipes complexes, the Fibrobacteres, and the TG3 phylum. The performance of this first documented version of DictDb (v. 3.0) using the revised taxonomic guide tree in the classification of short-read libraries obtained from termites and cockroaches was highly superior to that of the current Silva and RDP databases. DictDb uses an informative nomenclature that is consistent with the literature also for clades of uncultured bacteria and provides an invaluable tool for anyone exploring the gut community structure of termites and cockroaches. PMID:26283320

  10. Phylogenetic Relationships Matter: Antifungal Susceptibility among Clinically Relevant Yeasts

    PubMed Central

    Schmalreck, A. F.; Becker, K.; Fegeler, W.; Czaika, V.; Ulmer, H.; Lass-Flörl, C.

    2014-01-01

    The objective of this study was 2-fold: to evaluate whether phylogenetically closely related yeasts share common antifungal susceptibility profiles (ASPs) and whether these ASPs can be predicted from phylogeny. To address this question, 9,627 yeast strains were collected and tested for their antifungal susceptibility. Isolates were reidentified by considering recent changes in taxonomy and nomenclature. A phylogenetic (PHYLO) code based on the results of multilocus sequence analyses (large-subunit rRNA, small-subunit rRNA, translation elongation factor 1α, RNA polymerase II subunits 1 and 2) and the classification of the cellular neutral sugar composition of coenzyme Q and 18S ribosomal DNA was created to group related yeasts into PHYLO groups. The ASPs were determined for fluconazole, itraconazole, and voriconazole in each PHYLO group. The majority (95%) of the yeast strains were Ascomycetes. After reclassification, a total of 23 genera and 54 species were identified, resulting in an increase of 64% of genera and a decrease of 5% of species compared with the initial identification. These taxa were assigned to 17 distinct PHYLO groups (Ascomycota, n = 13; Basidiomycota, n = 4). ASPs for azoles were similar among members of the same PHYLO group and different between the various PHYLO groups. Yeast phylogeny may be an additional tool to significantly enhance the assessment of MIC values and to predict antifungal susceptibility, thereby more rapidly initiating appropriate patient management. PMID:24366735

  11. Phylogenetic Relationships of American Willows (Salix L., Salicaceae)

    PubMed Central

    Lauron-Moreau, Aurélien; Pitre, Frédéric E.; Argus, George W.; Labrecque, Michel; Brouillet, Luc

    2015-01-01

    Salix L. is the largest genus in the family Salicaceae (450 species). Several classifications have been published, but taxonomic subdivision has been under continuous revision. Our goal is to establish the phylogenetic structure of the genus using molecular data on all American willows, using three DNA markers. This complete phylogeny of American willows allows us to propose a biogeographic framework for the evolution of the genus. Material was obtained for the 122 native and introduced willow species of America. Sequences were obtained from the ITS (ribosomal nuclear DNA) and two plastid regions, matK and rbcL. Phylogenetic analyses (parsimony, maximum likelihood, Bayesian inference) were performed on the data. Geographic distribution was mapped onto the tree. The species tree provides strong support for a division of the genus into two subgenera, Salix and Vetrix. Subgenus Salix comprises temperate species from the Americas and Asia, and their disjunction may result from Tertiary events. Subgenus Vetrix is composed of boreo-arctic species of the Northern Hemisphere and their radiation may coincide with the Quaternary glaciations. Sixteen species have ambiguous positions; genetic diversity is lower in subg. Vetrix. A molecular phylogeny of all species of American willows has been inferred. It needs to be tested and further resolved using other molecular data. Nonetheless, the genus clearly has two clades that have distinct biogeographic patterns. PMID:25880993

  12. Phylogenetic uncertainty revisited: Implications for ecological analyses.

    PubMed

    Rangel, Thiago F; Colwell, Robert K; Graves, Gary R; Fučíková, Karolina; Rahbek, Carsten; Diniz-Filho, José Alexandre F

    2015-05-01

    Ecologists and biogeographers usually rely on a single phylogenetic tree to study evolutionary processes that affect macroecological patterns. This approach ignores the fact that each phylogenetic tree is a hypothesis about the evolutionary history of a clade, and cannot be directly observed in nature. Also, trees often leave out many extant species, or include missing species as polytomies because of a lack of information on the relationship among taxa. Still, researchers usually do not quantify the effects of phylogenetic uncertainty in ecological analyses. We propose here a novel analytical strategy to maximize the use of incomplete phylogenetic information, while simultaneously accounting for several sources of phylogenetic uncertainty that may distort statistical inferences about evolutionary processes. We illustrate the approach using a clade-wide analysis of the hummingbirds, evaluating how different sources of uncertainty affect several phylogenetic comparative analyses of trait evolution and biogeographic patterns. Although no statistical approximation can fully substitute for a complete and robust phylogeny, the method we describe and illustrate enables researchers to broaden the number of clades for which studies informed by evolutionary relationships are possible, while allowing the estimation and control of statistical error that arises from phylogenetic uncertainty. Software tools to carry out the necessary computations are offered. PMID:25800868

  13. Hazard classification methodology

    SciTech Connect

    Brereton, S.J.

    1996-07-22

    This document outlines the hazard classification methodology used to determine the hazard classification of the NIF LTAB, OAB, and the support facilities on the basis of radionuclides and chemicals. The hazard classification determines the safety analysis requirements for a facility.

  14. Classification and knowledge

    NASA Technical Reports Server (NTRS)

    Kurtz, Michael J.

    1989-01-01

    Automated procedures to classify objects are discussed. The classification problem is reviewed, and the relation of epistemology and classification is considered. The classification of stellar spectra and of resolved images of galaxies is addressed.

  15. Remote Sensing Information Classification

    NASA Technical Reports Server (NTRS)

    Rickman, Douglas L.

    2008-01-01

    This viewgraph presentation reviews the classification of Remote Sensing data in relation to epidemiology. Classification is a way to reduce the dimensionality and precision to something a human can understand. Classification changes SCALAR data into NOMINAL data.

  16. Hyperspectral image classification using functional data analysis.

    PubMed

    Li, Hong; Xiao, Guangrun; Xia, Tian; Tang, Y Y; Li, Luoqing

    2014-09-01

    The large number of spectral bands acquired by hyperspectral imaging sensors allows us to better distinguish many subtle objects and materials. Unlike other classical hyperspectral image classification methods in the multivariate analysis framework, in this paper, a novel method using functional data analysis (FDA) for accurate classification of hyperspectral images has been proposed. The central idea of FDA is to treat multivariate data as continuous functions. From this perspective, the spectral curve of each pixel in the hyperspectral images is naturally viewed as a function. This can be beneficial for making full use of the abundant spectral information. The relevance between adjacent pixel elements in the hyperspectral images can also be utilized reasonably. Functional principal component analysis is applied to solve the classification problem of these functions. Experimental results on three hyperspectral images show that the proposed method can achieve higher classification accuracies in comparison to some state-of-the-art hyperspectral image classification methods. PMID:25137684

  17. Soil classifications systems review. Final report

    SciTech Connect

    1997-11-01

    Systems used to classify soils are discussed and compared. Major types of classification systems that are reviewed include natural systems, technical systems, the FAO/UNESCO world soil map, soil survey map units, and numerical taxonomy. Natural Classification systems discussed in detail are the United States system, Soil Taxonomy, and the Russian and Canadian systems. Included in the section on technical classification systems are reviews on the AASHO and Unified (ASTM) classification systems. The review of soil classification systems was conducted to establish improved availability of accurate ground thermal conductivity and other heat transfer related properties information. These data are intended to help in the design of closed-loop ground heat exchange systems.

  18. Selecting Species Traits for Biomonitoring Applications in light of Phylogenetic Relationships among Lotic Insects

    NASA Astrophysics Data System (ADS)

    Poff, N.; Vieira, N. K.; Simmons, M. P.; Olden, J. D.; Kondratieff, B. C.; Finn, D. S.

    2005-05-01

    The use of species traits as indicators of environmental disturbance is being considered for biomonitoring programs globally. As such, methods to select relevant and informative traits for inclusion in biometrics need to be developed. In this research, we identified 20 traits of aquatic insects within six trait groups: morphology, mobility, life-history strategy, thermal tolerance, feeding guild and ecology (e.g., habitat preference). We constructed phylogenetic trees for 1) all lotic insect species of North America and 2) all Ephemeroptera, Plecoptera and Trichoptera species based on morphology- and molecular-based analyses and classifications. We then measured variability (i.e., plasticity) of the 20 traits and six trait groups across the two phylogenetic trees. Traits with higher degrees of plasticity indicated traits that were less phylogenetically constrained, and were considered informative for biomonitoring purposes. Thermal tolerance, rheophily, body size at maturity and feeding guild showed the highest plasticity across both phylogenetic trees. Two mobility traits, occurrence in drift and adult dispersal distance, showed moderate plasticity. By contrast, adult exiting ability, degree of attachment, adult lifespan and body shape showed low variability and were thus less informative. Plastic species traits that are less phylogenetically constrained may be most useful in detecting community change along environmental gradients.

  19. Molecular systematics of the Amazonian genus Aldina, a phylogenetically enigmatic ectomycorrhizal lineage of papilionoid legumes.

    PubMed

    Ramos, Gustavo; de Lima, Haroldo Cavalcante; Prenner, Gerhard; de Queiroz, Luciano Paganucci; Zartman, Charles E; Cardoso, Domingos

    2016-04-01

    Aldina (Leguminosae) is among the very few ecologically successful ectomycorrhizal lineages in a family largely marked by the evolution of nodulating symbiosis. The genus comprises 20 species predominantly distributed in Amazonia and has been traditionally classified in the tribe Swartzieae because of its radial flowers with an entire calyx and numerous free stamens. The taxonomy of Aldina is complicated due to its poor representation in herbaria and the lack of a robust phylogenetic hypothesis of relationship. Recent phylogenetic analyses of matK and trnL sequences confirmed the placement of Aldina in the 50-kb inversion clade, although the genus remained phylogenetically isolated or unresolved in the context of the evolutionary history of the main early-branching papilionoid lineages. We performed maximum likelihood and Bayesian analyses of combined chloroplast datasets (matK, rbcL, and trnL) and explored the effect of incomplete taxa or missing data in order to shed light on the enigmatic phylogenetic position of Aldina. Unexpectedly, a sister relationship of Aldina with the Andira clade (Andira and Hymenolobium) is revealed. We suggest that a new tribal phylogenetic classification of the papilionoid legumes should place Aldina along with Andira and Hymenolobium. These results highlight yet another example of the independent evolution of radial floral symmetry within the early-branching Papilionoideae, a large collection of florally heterogeneous lineages dominated by papilionate or bilaterally symmetric flower morphology. PMID:26748266

  20. Simple and fast classification of non-LTR retrotransposons based on phylogeny of their RT domain protein sequences

    PubMed Central

    Kapitonov, Vladimir V.; Tempel, Sébastien; Jurka, Jerzy

    2009-01-01

    Rapidly growing number of sequenced genomes requires fast and accurate computational tools for analysis of different transposable elements (TEs). In this paper we focus on rapid and reliable procedure for classification of autonomous non-LTR retrotransposons based on alignment and clustering of their reverse transcriptase (RT) domains. Typically, the RT domain protein sequences encoded by different non-LTR retrotransposons are similar to each other in terms of significant BLASTP E-values. Therefore, they can be easily detected by the routine BLASTP searches of genomic DNA sequences coding for proteins similar to the RT domains of known non-LTR retrotransposons. However, detailed classification of non-LTR retrotransposons, i.e. their assignment to specific clades, is a slow and complex procedure that is not formalized or integrated as a standard set of computational methods and data. Here we describe a tool (RTclass1) designed for the fast and accurate automated assignment of novel non-LTR retrotransposons to known or novel clades using phylogenetic analysis of the RT domain protein sequences. RTclass1 classifies a particular non-LTR retrotransposon based on its RT domain in less than 10 minutes on a standard desktop computer and achieves 99.5% accuracy. RT1class1 works either as a standalone program installed locally or as a web-server that can be accessed distantly by uploading sequence data through the internet (http://www.girinst.org/RTphylogeny/RTclass1). PMID:19651192

  1. Discriminating the effects of phylogenetic hypothesis, tree resolution and clade age estimates on phylogenetic signal measurements.

    PubMed

    Seger, G D S; Duarte, L D S; Debastiani, V J; Kindel, A; Jarenkow, J A

    2013-09-01

    Understanding how species traits evolved over time is the central question to comprehend assembly rules that govern the phylogenetic structure of communities. The measurement of phylogenetic signal (PS) in ecologically relevant traits is a first step to understand phylogenetically structured community patterns. The different methods available to estimate PS make it difficult to choose which is most appropriate. Furthermore, alternative phylogenetic tree hypotheses, node resolution and clade age estimates might influence PS measurements. In this study, we evaluated to what extent these parameters affect different methods of PS analysis, and discuss advantages and disadvantages when selecting which method to use. We measured fruit/seed traits and flowering/fruiting phenology of endozoochoric species occurring in Southern Brazilian Araucaria forests and evaluated their PS using Mantel regressions, phylogenetic eigenvector regressions (PVR) and K statistic. Mantel regressions always gave less significant results compared to PVR and K statistic in all combinations of phylogenetic trees constructed. Moreover, a better phylogenetic resolution affected PS, independently of the method used to estimate it. Morphological seed traits tended to show higher PS than diaspores traits, while PS in flowering/fruiting phenology depended mostly on the method used to estimate it. This study demonstrates that different PS estimates are obtained depending on the chosen method and the phylogenetic tree resolution. This finding has implications for inferences on phylogenetic niche conservatism or ecological processes determining phylogenetic community structure. PMID:23368095

  2. Phylogenetic relationships among subsurface microorganisms

    SciTech Connect

    Nierzwicki-Bauer, S.A.

    1991-01-01

    This report summarizes the progress made from 6/90--3/91 toward completion of our project, Phylogenetic Relationships among subsurface microorganisms. 16S rRNA was sequenced, and based on the sequence the SMCC isolates were assigned to preliminary groups. Microorganisms were obtained at various depths at the Savannah River Site, including the Surface (0 m), Congaree (91 m), and Middendorf (244 m, 259 m, 265 m). Sequence data from four isolates from the Congaree formation indicate these microorganisms can be divided into Pseudomonas spp. or Acinetobacter spp. Three 16S rRNA probes were synthesized based on sequence data. The synthesized probes were tested through in situ hybridization. Optimal conditions for in situ hybridization were determined. Because stability of RNA-DNA hybrids is dependent on hybridization stringency, related organisms can be differentiated using a single probe under different strigencies. The results of these hybridizations agree with results obtained by Balkwill and Reeves using restriction fragment length polymorphism analysis. The RNA content of a cell reflects its metabolic state. Cells which are starved for four days are not detectable with the homologous 16S rRNA probe. However, within 15 minutes of refeeding, detectable rRNA appeared. This suggests that organisms which are undetectable in environmental samples due to starvation may be detectable after addition of nutrients. Stepwise addition of specific nutrients could indicate which nutrients are rate limiting for growth. Preliminary experiments with soil samples from the Hanford Site indicate indigenous microorganisms can be detected by oligionucleotide probes. Further, using multiple probes based on universal sequences increases the number of organisms detected. Double label experiments, using a rhodamine-labelled oligionucleotide probe with free coumarin succinimidyl ester will allow simultaneous detection of total bacteria and specific 16S rRNA containing bacteria. 4 tabs. (MHB)

  3. Phylogenetics of Lophodermium from pine.

    PubMed

    Ortiz-García, Sol; Gernandt, David S; Stone, Jeffrey K; Johnston, Peter R; Chapela, Ignacio H; Salas-Lizana, Rodolfo; Alvarez-Buylla, Elena R

    2003-01-01

    Lophodermium comprises ascomycetous fungi that are both needle-cast pathogens and asymptomatic endophytes on a diversity of plant hosts. It is distinguished from other genera in the family Rhytismataceae by its filiform ascospores and ascocarps that open by a longitudinal slit. Nucleotide sequences of the internal transcribed spacer (ITS) region of nuclear ribosomal DNA were used to infer phylogenetic relationships within Lophodermium. Twenty-nine sequences from approximately 11 species of Lophodermium were analyzed together with eight sequences from isolates thought to represent six other genera of Rhytismataceae: Elytroderma, Lirula, Meloderma, Terriera, Tryblidiopsis and Colpoma. Two putative Meloderma desmazieresii isolates occurred within the Lophodermium clade but separate from one another, one grouped with L. indianum and the other with L. nitens. An isolate of Elytroderma deformans also occurred within the Lophodermium clade but on a solitary branch. The occurrence of these genera within the Lophodermium clade might be due to problems in generic concepts in Rhytismataceae, such as emphasis on spore morphology to delimit genera, to difficulty of isolating Rhytismataceae needle pathogens from material that also is colonized by Lophodermium or to a combination of both factors. We also evaluated the congruence of host distribution and several morphological characters on the ITS phylogeny. Lophodermium species from pine hosts formed a monophyletic sister group to Lophodermium species from more distant hosts from the southern hemisphere, but not to L. piceae from Picea. The ITS topology indicated that Lophodermium does not show strict cospeciation with pines at deeper branches, although several closely related isolates have closely related hosts. Pathogenic species occupy derived positions in the pine clade, suggesting that pathogenicity has evolved from endophytism. A new combination is proposed, Terriera minor (Tehon) P.R. Johnst. PMID:21148992

  4. Sparse and accurate high resolution SAR imaging

    NASA Astrophysics Data System (ADS)

    Vu, Duc; Zhao, Kexin; Rowe, William; Li, Jian

    2012-05-01

    We investigate the usage of an adaptive method, the Iterative Adaptive Approach (IAA), in combination with a maximum a posteriori (MAP) estimate to reconstruct high resolution SAR images that are both sparse and accurate. IAA is a nonparametric weighted least squares algorithm that is robust and user parameter-free. IAA has been shown to reconstruct SAR images with excellent side lobes suppression and high resolution enhancement. We first reconstruct the SAR images using IAA, and then we enforce sparsity by using MAP with a sparsity inducing prior. By coupling these two methods, we can produce a sparse and accurate high resolution image that are conducive for feature extractions and target classification applications. In addition, we show how IAA can be made computationally efficient without sacrificing accuracies, a desirable property for SAR applications where the size of the problems is quite large. We demonstrate the success of our approach using the Air Force Research Lab's "Gotcha Volumetric SAR Data Set Version 1.0" challenge dataset. Via the widely used FFT, individual vehicles contained in the scene are barely recognizable due to the poor resolution and high side lobe nature of FFT. However with our approach clear edges, boundaries, and textures of the vehicles are obtained.

  5. Land classification and ecosystem classification.

    PubMed

    Rowe, J S

    1996-01-01

    Earth, the ecosphere, is a unified functional ecosystem. Ecological land classification (ELC) and regionalization divides and categorizes this unity into similar and dissimilar pieces-sectoral ecosystems - at various scales, in the interests of admiration and understanding. The recognition of land/water ecosystems in a hierarchy of sizes provides a rational base for the many-scaled problems of protection and careful exploitation in the fields of agriculture, forestry, wildlife and recreation. In forested terrain the protection of biodiversity, old growth forests, watersheds and wildlife habitat depends on spatial-temporal planning of forestry operations to maintain a preferred mosaic structure of local ecosystems within each ecological region. Without ecological understanding and a good ELC, this is impossible. Conceiving the world as comprising nested land/water ecosystems that are the source of life, elevates the role of Earth-as-context, an antidote to destructive anthropocentrism. PMID:24197992

  6. Phylogenetic diversity of parabasalian symbionts from termites, including the phylogenetic position of Pseudotrypanosoma and Trichonympha.

    PubMed

    Keeling, P J; Poulsen, N; McFadden, G I

    1998-01-01

    The phylogenetic diversity of parabasalian flagellates from termite hindguts has been examined by small subunit ribosomal RNA (rRNA) amplification and sequencing. Two species of particular interest, the giant trichomonad Pseudotrypanosoma giganteum and the hypermastigote Trichonympha magna, were isolated from the gut of Porotermes adamsoni by micropipetting, and the rRNA genes from these small populations amplified and sequenced. rRNA genes representing Hypermastigida and the Trichomonadida families Devescovinidae and Trichomonadidae, were also recovered by amplification from whole hindguts of three termites, P. adamsoni, Cryptotermes brevis, and Cryptotermes dudleyi. The parabasalian rRNA genes from C. brevis were found to comprise a unique and extremely heterogeneous lineage with no clear affinities to any known parabasalian rRNAs. In addition, one of the sequences isolated from P. adamsoni was found to be similar to another uncharacterised rRNA gene from Reticulitermes flavipes. The phylogeny of all known parabasalian small subunit rRNAs was examined with these new sequences. We find many taxonomic groups to be supported by rRNA, but not all. We have found the root of parabasalia to be very difficult to discern accurately, but have nevertheless identified several possible positions. PMID:9864854

  7. Phylogenetic relationships of Chinese Adiantum based on five plastid markers.

    PubMed

    Lu, Jin-Mei; Wen, Jun; Lutz, Sue; Wang, Yi-Ping; Li, De-Zhu

    2012-03-01

    Adiantum consists of about 150-200 species mostly with a pantropical distribution, yet the classifications of Adiantum have been based primarily on regional studies. Confounding the clarity of reconstructing the evolutionary history of Adiantum is that previous molecular phylogenetic studies suggest that a separate and distinctive clade, the vittarioids, may be derived from within Adiantum. Five plastid markers (atpA, atpB, rbcL, trnL-F and rps4-trnS) are employed to assess the monophyly of Adiantum, and construct the molecular phylogeny of Chinese Adiantum. Our analyses support the monophyly of Adiantum. All temperate Adiantum species form a clade nested within the pantropical grade, suggesting a tropical origin of Adiantum. Six main clades are supported within Chinese Adiantum, which are only partially consistent with Lin's classification of the genus. Series Caudata is polyphyletic with series Gravesiana nested within one subgroup of series Caudata. The prolonged whip-like stolon at the apex of the fronds is the defining character for series Caudata, but it may have evolved multiple times. Adiantum reniforme with the simple fronds is sister to series Venusta, which has a decompound lamina with many flabellate to cuneate segments. Series Veneri-capilliformia is not monophyletic, with A. capillus-veneris sister to series Flabellulata except for A. diaphanum, and A. edentulum sister to series Pedata. Series Flabellulata is biphyletic with A. diaphanum nested within the pantropical grade. The phylogeny suggests that convergent evolution in frond architecture has occurred in Adiantum. PMID:21809178

  8. Phylogenetic diversity measures based on Hill numbers

    PubMed Central

    Chao, Anne; Chiu, Chun-Huo; Jost, Lou

    2010-01-01

    We propose a parametric class of phylogenetic diversity (PD) measures that are sensitive to both species abundance and species taxonomic or phylogenetic distances. This work extends the conventional parametric species-neutral approach (based on ‘effective number of species’ or Hill numbers) to take into account species relatedness, and also generalizes the traditional phylogenetic approach (based on ‘total phylogenetic length’) to incorporate species abundances. The proposed measure quantifies ‘the mean effective number of species’ over any time interval of interest, or the ‘effective number of maximally distinct lineages’ over that time interval. The product of the measure and the interval length quantifies the ‘branch diversity’ of the phylogenetic tree during that interval. The new measures generalize and unify many existing measures and lead to a natural definition of taxonomic diversity as a special case. The replication principle (or doubling property), an important requirement for species-neutral diversity, is generalized to PD. The widely used Rao's quadratic entropy and the phylogenetic entropy do not satisfy this essential property, but a simple transformation converts each to our measures, which do satisfy the property. The proposed approach is applied to forest data for interpreting the effects of thinning. PMID:20980309

  9. Phylogenetic structure in tropical hummingbird communities

    PubMed Central

    Graham, Catherine H.; Parra, Juan L.; Rahbek, Carsten; McGuire, Jimmy A.

    2009-01-01

    How biotic interactions, current and historical environment, and biogeographic barriers determine community structure is a fundamental question in ecology and evolution, especially in diverse tropical regions. To evaluate patterns of local and regional diversity, we quantified the phylogenetic composition of 189 hummingbird communities in Ecuador. We assessed how species and phylogenetic composition changed along environmental gradients and across biogeographic barriers. We show that humid, low-elevation communities are phylogenetically overdispersed (coexistence of distant relatives), a pattern that is consistent with the idea that competition influences the local composition of hummingbirds. At higher elevations communities are phylogenetically clustered (coexistence of close relatives), consistent with the expectation of environmental filtering, which may result from the challenge of sustaining an expensive means of locomotion at high elevations. We found that communities in the lowlands on opposite sides of the Andes tend to be phylogenetically similar despite their large differences in species composition, a pattern implicating the Andes as an important dispersal barrier. In contrast, along the steep environmental gradient between the lowlands and the Andes we found evidence that species turnover is comprised of relatively distantly related species. The integration of local and regional patterns of diversity across environmental gradients and biogeographic barriers provides insight into the potential underlying mechanisms that have shaped community composition and phylogenetic diversity in one of the most species-rich, complex regions of the world. PMID:19805042

  10. How does cognition evolve? Phylogenetic comparative psychology

    PubMed Central

    Matthews, Luke J.; Hare, Brian A.; Nunn, Charles L.; Anderson, Rindy C.; Aureli, Filippo; Brannon, Elizabeth M.; Call, Josep; Drea, Christine M.; Emery, Nathan J.; Haun, Daniel B. M.; Herrmann, Esther; Jacobs, Lucia F.; Platt, Michael L.; Rosati, Alexandra G.; Sandel, Aaron A.; Schroepfer, Kara K.; Seed, Amanda M.; Tan, Jingzhi; van Schaik, Carel P.; Wobber, Victoria

    2014-01-01

    Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution. PMID:21927850

  11. Maximizing the phylogenetic diversity of seed banks.

    PubMed

    Griffiths, Kate E; Balding, Sharon T; Dickie, John B; Lewis, Gwilym P; Pearce, Tim R; Grenyer, Richard

    2015-04-01

    Ex situ conservation efforts such as those of zoos, botanical gardens, and seed banks will form a vital complement to in situ conservation actions over the coming decades. It is therefore necessary to pay the same attention to the biological diversity represented in ex situ conservation facilities as is often paid to protected-area networks. Building the phylogenetic diversity of ex situ collections will strengthen our capacity to respond to biodiversity loss. Since 2000, the Millennium Seed Bank Partnership has banked seed from 14% of the world's plant species. We assessed the taxonomic, geographic, and phylogenetic diversity of the Millennium Seed Bank collection of legumes (Leguminosae). We compared the collection with all known legume genera, their known geographic range (at country and regional levels), and a genus-level phylogeny of the legume family constructed for this study. Over half the phylogenetic diversity of legumes at the genus level was represented in the Millennium Seed Bank. However, pragmatic prioritization of species of economic importance and endangerment has led to the banking of a less-than-optimal phylogenetic diversity and prioritization of range-restricted species risks an underdispersed collection. The current state of the phylogenetic diversity of legumes in the Millennium Seed Bank could be substantially improved through the strategic banking of relatively few additional taxa. Our method draws on tools that are widely applied to in situ conservation planning, and it can be used to evaluate and improve the phylogenetic diversity of ex situ collections. PMID:25196170

  12. Phylogenetic Approaches to Natural Product Structure Prediction

    PubMed Central

    Ziemert, Nadine; Jensen, Paul R.

    2015-01-01

    Phylogenetics is the study of the evolutionary relatedness among groups of organisms. Molecular phylogenetics uses sequence data to infer these relationships for both organisms and the genes they maintain. With the large amount of publicly available sequence data, phylogenetic inference has become increasingly important in all fields of biology. In the case of natural product research, phylogenetic relationships are proving to be highly informative in terms of delineating the architecture and function of the genes involved in secondary metabolite biosynthesis. Polyketide synthases and nonribosomal peptide synthetases provide model examples in which individual domain phylogenies display different predictive capacities, resolving features ranging from substrate specificity to structural motifs associated with the final metabolic product. This chapter provides examples in which phylogeny has proven effective in terms of predicting functional or structural aspects of secondary metabolism. The basics of how to build a reliable phylogenetic tree are explained along with information about programs and tools that can be used for this purpose. Furthermore, it introduces the Natural Product Domain Seeker, a recently developed Web tool that employs phylogenetic logic to classify ketosynthase and condensation domains based on established enzyme architecture and biochemical function. PMID:23084938

  13. Phylogenetic structure in tropical hummingbird communities.

    PubMed

    Graham, Catherine H; Parra, Juan L; Rahbek, Carsten; McGuire, Jimmy A

    2009-11-17

    How biotic interactions, current and historical environment, and biogeographic barriers determine community structure is a fundamental question in ecology and evolution, especially in diverse tropical regions. To evaluate patterns of local and regional diversity, we quantified the phylogenetic composition of 189 hummingbird communities in Ecuador. We assessed how species and phylogenetic composition changed along environmental gradients and across biogeographic barriers. We show that humid, low-elevation communities are phylogenetically overdispersed (coexistence of distant relatives), a pattern that is consistent with the idea that competition influences the local composition of hummingbirds. At higher elevations communities are phylogenetically clustered (coexistence of close relatives), consistent with the expectation of environmental filtering, which may result from the challenge of sustaining an expensive means of locomotion at high elevations. We found that communities in the lowlands on opposite sides of the Andes tend to be phylogenetically similar despite their large differences in species composition, a pattern implicating the Andes as an important dispersal barrier. In contrast, along the steep environmental gradient between the lowlands and the Andes we found evidence that species turnover is comprised of relatively distantly related species. The integration of local and regional patterns of diversity across environmental gradients and biogeographic barriers provides insight into the potential underlying mechanisms that have shaped community composition and phylogenetic diversity in one of the most species-rich, complex regions of the world. PMID:19805042

  14. Accurate Evaluation of Quantum Integrals

    NASA Technical Reports Server (NTRS)

    Galant, D. C.; Goorvitch, D.; Witteborn, Fred C. (Technical Monitor)

    1995-01-01

    Combining an appropriate finite difference method with Richardson's extrapolation results in a simple, highly accurate numerical method for solving a Schrodinger's equation. Important results are that error estimates are provided, and that one can extrapolate expectation values rather than the wavefunctions to obtain highly accurate expectation values. We discuss the eigenvalues, the error growth in repeated Richardson's extrapolation, and show that the expectation values calculated on a crude mesh can be extrapolated to obtain expectation values of high accuracy.

  15. Phylogenetic Distribution of Fungal Sterols

    PubMed Central

    Weete, John D.; Abril, Maritza; Blackwell, Meredith

    2010-01-01

    Background Ergosterol has been considered the “fungal sterol” for almost 125 years; however, additional sterol data superimposed on a recent molecular phylogeny of kingdom Fungi reveals a different and more complex situation. Methodology/Principal Findings The interpretation of sterol distribution data in a modern phylogenetic context indicates that there is a clear trend from cholesterol and other Δ5 sterols in the earliest diverging fungal species to ergosterol in later diverging fungi. There are, however, deviations from this pattern in certain clades. Sterols of the diverse zoosporic and zygosporic forms exhibit structural diversity with cholesterol and 24-ethyl -Δ5 sterols in zoosporic taxa, and 24-methyl sterols in zygosporic fungi. For example, each of the three monophyletic lineages of zygosporic fungi has distinctive major sterols, ergosterol in Mucorales, 22-dihydroergosterol in Dimargaritales, Harpellales, and Kickxellales (DHK clade), and 24-methyl cholesterol in Entomophthorales. Other departures from ergosterol as the dominant sterol include: 24-ethyl cholesterol in Glomeromycota, 24-ethyl cholest-7-enol and 24-ethyl-cholesta-7,24(28)-dienol in rust fungi, brassicasterol in Taphrinales and hypogeous pezizalean species, and cholesterol in Pneumocystis. Conclusions/Significance Five dominant end products of sterol biosynthesis (cholesterol, ergosterol, 24-methyl cholesterol, 24-ethyl cholesterol, brassicasterol), and intermediates in the formation of 24-ethyl cholesterol, are major sterols in 175 species of Fungi. Although most fungi in the most speciose clades have ergosterol as a major sterol, sterols are more varied than currently understood, and their distribution supports certain clades of Fungi in current fungal phylogenies. In addition to the intellectual importance of understanding evolution of sterol synthesis in fungi, there is practical importance because certain antifungal drugs (e.g., azoles) target reactions in the synthesis of ergosterol. These findings also invalidate use of ergosterol as an indicator of biomass of certain fungal taxa (e.g., Glomeromycota). Data from this study are available from the Assembling the Fungal Tree of Life (AFTOL) Structural and Biochemical Database: http://aftol.umn.edu. PMID:20526375

  16. Explaining diversity in metagenomic datasets by phylogenetic-based feature weighting.

    PubMed

    Albanese, Davide; De Filippo, Carlotta; Cavalieri, Duccio; Donati, Claudio

    2015-03-01

    Metagenomics is revolutionizing our understanding of microbial communities, showing that their structure and composition have profound effects on the ecosystem and in a variety of health and disease conditions. Despite the flourishing of new analysis methods, current approaches based on statistical comparisons between high-level taxonomic classes often fail to identify the microbial taxa that are differentially distributed between sets of samples, since in many cases the taxonomic schema do not allow an adequate description of the structure of the microbiota. This constitutes a severe limitation to the use of metagenomic data in therapeutic and diagnostic applications. To provide a more robust statistical framework, we introduce a class of feature-weighting algorithms that discriminate the taxa responsible for the classification of metagenomic samples. The method unambiguously groups the relevant taxa into clades without relying on pre-defined taxonomic categories, thus including in the analysis also those sequences for which a taxonomic classification is difficult. The phylogenetic clades are weighted and ranked according to their abundance measuring their contribution to the differentiation of the classes of samples, and a criterion is provided to define a reduced set of most relevant clades. Applying the method to public datasets, we show that the data-driven definition of relevant phylogenetic clades accomplished by our ranking strategy identifies features in the samples that are lost if phylogenetic relationships are not considered, improving our ability to mine metagenomic datasets. Comparison with supervised classification methods currently used in metagenomic data analysis highlights the advantages of using phylogenetic information. PMID:25815895

  17. Hyperspectral image classification for mapping agricultural tillage practices

    Technology Transfer Automated Retrieval System (TEKTRAN)

    An efficient classification framework for mapping agricultural tillage practice using hyperspectral remote sensing imagery is proposed, which has the potential to be implemented practically to provide rapid, accurate, and objective surveying data for precision agricultural management and appraisal f...

  18. Support Vector Machine Classification Trees.

    PubMed

    de Boves Harrington, Peter

    2015-11-01

    Proteomic and metabolomic studies based on chemical profiling require powerful classifiers to model accurately complex collections of data. Support vector machines (SVMs) are advantageous in that they provide a maximum margin of separation for the classification hyperplane. A new method for constructing classification trees, for which the branches comprise SVMs, has been devised. The novel feature is that the distribution of the data objects is used to determine the SVM encoding. The variance and covariance of the data objects are used for determining the bipolar encoding required for the SVM. The SVM that yields the lowest entropy of classification becomes the branch of the tree. The SVM-tree classifier has the added advantage that nonlinearly separable data may be accurately classified without optimization of the cost parameter C or searching for a correct higher dimensional kernel transform. It compares favorably to a regularized linear discriminant analysis, SVMs in a one against all multiple classifier, and a fuzzy rule-building expert system, a tree classifier with a fuzzy margin of separation. SVMs offer a speed advantage, especially for data sets that have more measurements than objects. PMID:26461495

  19. [Characteristics and phylogenetic analysis of mitochondrial genome in the gobies].

    PubMed

    Jin, Xiao-Xiao; Sun, Yue-Na; Wang, Ri-Xin; Tang, Da; Zhao, Sheng-Long; Xu, Tian-Jun

    2013-12-01

    The vast number of species, small size and high variation of morphology make the morphological identification and classification of gobies very difficult. In this study, the complete mitochondrial genome (mitogenome) of 26 species of gobies was analyzed, aiming at accumulating the molecular information on the identification, classification and molecular evolution of gobies. The results showed that the gene composition and arrangement of mitogenome of gobies are similar to most vertebrates. Due to various degrees of repetitive sequences in the control region, the mitogenome of 26 gobies exhibits a great variation in length. The A+T content of the mitogenome is greater than 50% and the lowest frequency is for G among the four bases. Thirty-seven coding gene sequences were used to calculate the average Kimura 2-parameter genetic distance of 26 species of gobies. Acanthogobius hasta and A. ommaturus, Glossogobius olivaceus and G circumspectus were synonyms, respectively. By comparing the control region sequences of 26 gobies, the terminal associated sequences, central conserved sequence block and conserved sequence block were identified, respectively. Thirty-six coding gene sequences of 26 gobies were used to construct the phylogenetic tree and the results were different from the traditional morphological classification. The five subfamilies of Gobiidae were obviously evolved: Amblyopinae, Oxudercinae and Sicydiinae were clustered into a group and then formed a sister group with Gobionellinae; the fishes of Gobiinae had distant relationship with the four subfamilies and formed a group alone. Molecular clock analysis estimated that gobies probably originated in the late Eocene to Oligocene time and further evolved into modern characteristic gobies in the Miocene. PMID:24645349

  20. Discovery of Regulatory Elements by a Computational Method for Phylogenetic Footprinting

    PubMed Central

    Blanchette, Mathieu; Tompa, Martin

    2002-01-01

    Phylogenetic footprinting is a method for the discovery of regulatory elements in a set of orthologous regulatory regions from multiple species. It does so by identifying the best conserved motifs in those orthologous regions. We describe a computer algorithm designed specifically for this purpose, making use of the phylogenetic relationships among the sequences under study to make more accurate predictions. The program is guaranteed to report all sets of motifs with the lowest parsimony scores, calculated with respect to the phylogenetic tree relating the input species. We report the results of this algorithm on several data sets of interest. A large number of known functional binding sites are identified by our method, but we also find several highly conserved motifs for which no function is yet known. PMID:11997340

  1. Hierarchical classification of glycoside hydrolases.

    PubMed

    Naumoff, D G

    2011-06-01

    This review deals with structural and functional features of glycoside hydrolases, a widespread group of enzymes present in almost all living organisms. Their catalytic domains are grouped into 120 amino acid sequence-based families in the international classification of the carbohydrate-active enzymes (CAZy database). At a higher hierarchical level some of these families are combined in 14 clans. Enzymes of the same clan have common evolutionary origin of their genes and share the most important functional characteristics such as composition of the active center, anomeric configuration of cleaved glycosidic bonds, and molecular mechanism of the catalyzed reaction (either inverting, or retaining). There are now extensive data in the literature concerning the relationship between glycoside hydrolase families belonging to different clans and/or included in none of them, as well as information on phylogenetic protein relationship within particular families. Summarizing these data allows us to propose a multilevel hierarchical classification of glycoside hydrolases and their homologs. It is shown that almost the whole variety of the enzyme catalytic domains can be brought into six main folds, large groups of proteins having the same three-dimensional structure and the supposed common evolutionary origin. PMID:21639842

  2. Classification of Support Needs in a Residential Setting

    ERIC Educational Resources Information Center

    Riches, Vivienne Catherine

    2003-01-01

    A supports classification and assessment system is required in residential settings that enables accurate classification and resource allocation in the context of limited resources. In developing such a system, the American Association on Mental Retardation intensities of supports framework was used and consultation with all stakeholders resulted…

  3. Trends and concepts in fern classification

    PubMed Central

    Christenhusz, Maarten J. M.; Chase, Mark W.

    2014-01-01

    Background and Aims Throughout the history of fern classification, familial and generic concepts have been highly labile. Many classifications and evolutionary schemes have been proposed during the last two centuries, reflecting different interpretations of the available evidence. Knowledge of fern structure and life histories has increased through time, providing more evidence on which to base ideas of possible relationships, and classification has changed accordingly. This paper reviews previous classifications of ferns and presents ideas on how to achieve a more stable consensus. Scope An historical overview is provided from the first to the most recent fern classifications, from which conclusions are drawn on past changes and future trends. The problematic concept of family in ferns is discussed, with a particular focus on how this has changed over time. The history of molecular studies and the most recent findings are also presented. Key Results Fern classification generally shows a trend from highly artificial, based on an interpretation of a few extrinsic characters, via natural classifications derived from a multitude of intrinsic characters, towards more evolutionary circumscriptions of groups that do not in general align well with the distribution of these previously used characters. It also shows a progression from a few broad family concepts to systems that recognized many more narrowly and highly controversially circumscribed families; currently, the number of families recognized is stabilizing somewhere between these extremes. Placement of many genera was uncertain until the arrival of molecular phylogenetics, which has rapidly been improving our understanding of fern relationships. As a collective category, the so-called ‘fern allies’ (e.g. Lycopodiales, Psilotaceae, Equisetaceae) were unsurprisingly found to be polyphyletic, and the term should be abandoned. Lycopodiaceae, Selaginellaceae and Isoëtaceae form a clade (the lycopods) that is sister to all other vascular plants, whereas the whisk ferns (Psilotaceae), often included in the lycopods or believed to be associated with the first vascular plants, are sister to Ophioglossaceae and thus belong to the fern clade. The horsetails (Equisetaceae) are also members of the fern clade (sometimes inappropriately called ‘monilophytes’), but, within that clade, their placement is still uncertain. Leptosporangiate ferns are better understood, although deep relationships within this group are still unresolved. Earlier, almost all leptosporangiate ferns were placed in a single family (Polypodiaceae or Dennstaedtiaceae), but these families have been redefined to narrower more natural entities. Conclusions Concluding this paper, a classification is presented based on our current understanding of relationships of fern and lycopod clades. Major changes in our understanding of these families are highlighted, illustrating issues of classification in relation to convergent evolution and false homologies. Problems with the current classification and groups that still need study are pointed out. A summary phylogenetic tree is also presented. A new classification in which Aspleniaceae, Cyatheaceae, Polypodiaceae and Schizaeaceae are expanded in comparison with the most recent classifications is presented, which is a modification of those proposed by Smith et al. (2006, 2008) and Christenhusz et al. (2011). These classifications are now finding a wider acceptance and use, and even though a few amendments are made based on recently published results from molecular analyses, we have aimed for a stable family and generic classification of ferns. PMID:24532607

  4. Teaching Molecular Phylogenetics through Investigating a Real-World Phylogenetic Problem

    ERIC Educational Resources Information Center

    Zhang, Xiaorong

    2012-01-01

    A phylogenetics exercise is incorporated into the "Introduction to biocomputing" course, a junior-level course at Savannah State University. This exercise is designed to help students learn important concepts and practical skills in molecular phylogenetics through solving a real-world problem. In this application, students are required to identify

  5. Teaching Molecular Phylogenetics through Investigating a Real-World Phylogenetic Problem

    ERIC Educational Resources Information Center

    Zhang, Xiaorong

    2012-01-01

    A phylogenetics exercise is incorporated into the "Introduction to biocomputing" course, a junior-level course at Savannah State University. This exercise is designed to help students learn important concepts and practical skills in molecular phylogenetics through solving a real-world problem. In this application, students are required to identify…

  6. Phylogenetic structure of mammal assemblages at large geographical scales: linking phylogenetic community ecology with macroecology

    PubMed Central

    Cardillo, Marcel

    2011-01-01

    Phylogenetic community ecology seeks to explain the processes involved in the formation of species assemblages by analysing their phylogenetic structure, and to date has focused primarily on local-scale communities. Macroecology, on the other hand, is concerned with the structure of assemblages at large geographical scales, but has remained largely non-phylogenetic. Analysing the phylogenetic structure of large-scale assemblages provides a link between these two research programmes. In this paper, I ask whether we should expect large-scale assemblages to show significant phylogenetic structure, by outlining some of the ecological and macroevolutionary processes that may play a role in assemblage formation. As a case study, I then explore the phylogenetic structure of carnivore assemblages within the terrestrial ecoregions of Africa. Many assemblages at these scales are indeed phylogenetically non-random (either clustered or overdispersed). One interpretation of the observed patterns of phylogenetic structure is that many clades underwent rapid biome-filling radiations, followed by diversification slowdown and competitive sorting as niche space became saturated. PMID:21807735

  7. Phylogenetic relationships within the eared seals (Otariidae: Carnivora): implications for the historical biogeography of the family.

    PubMed

    Wynen, L P; Goldsworthy, S D; Insley, S J; Adams, M; Bickham, J W; Francis, J; Gallo, J P; Hoelzel, A R; Majluf, P; White, R W; Slade, R

    2001-11-01

    Phylogenetic relationships within the family Otariidae were investigated using two regions of the mitochondrial genome. A 360-bp region of the cytochrome b gene was employed for the primary phylogenetic analysis, while a 356-bp segment of the control region was used to enhance resolution of the terminal nodes. Traditional classification of the family into the subfamilies Arctocephalinae (fur seals) and Otariinae (sea lions) is not supported, with the fur seal Callorhinus ursinus having a basal relationship relative to the rest of the family. This is consistent with the fossil record which suggests that this genus diverged from the line leading to the remaining fur seals and sea lions about 6 million years ago (mya). There is also little evidence to support or refute the monophyly of sea lions. Four sea lion clades and five fur seal clades were observed, but relationships among these clades are unclear. Similar genetic divergences between the sea lion clades (D(a) = 0.054-0.078), as well as between the major Arctocephalus fur seal clades (D(a) = 0.040-0.069) suggest that these groups underwent periods of rapid radiation at about the time they diverged from each other. Rapid radiations of this type make the resolution of relationships between the resulting species difficult and indicate the requirement for additional molecular data from both nuclear and mitochondrial genes. The phylogenetic relationships within the family and the genetic distances among some taxa highlight inconsistencies in the current taxonomic classification of the family. PMID:11697921

  8. Worldwide phylogenetic relationship of avian poxviruses

    USGS Publications Warehouse

    Gyuranecz, Miklós; Foster, Jeffrey T.; Dán, Ádám; Ip, Hon S.; Egstad, Kristina F.; Parker, Patricia G.; Higashiguchi, Jenni M.; Skinner, Michael A.; Höfle, Ursula; Kreizinger, Zsuzsa; Dorrestein, Gerry M.; Solt, Szabolcs; Sós, Endre; Kim, Young Jun; Uhart, Marcela; Pereda, Ariel; González-Hein, Gisela; Hidalgo, Hector; Blanco, Juan-Manuel; Erdélyi, Károly

    2013-01-01

    Poxvirus infections have been found in 230 species of wild and domestic birds worldwide in both terrestrial and marine environments. This ubiquity raises the question of how infection has been transmitted and globally dispersed. We present a comprehensive global phylogeny of 111 novel poxvirus isolates in addition to all available sequences from GenBank. Phylogenetic analysis of Avipoxvirus genus has traditionally relied on one gene region (4b core protein). In this study we have expanded the analyses to include a second locus (DNA polymerase gene), allowing for a more robust phylogenetic framework, finer genetic resolution within specific groups and the detection of potential recombination. Our phylogenetic results reveal several major features of avipoxvirus evolution and ecology and propose an updated avipoxvirus taxonomy, including three novel subclades. The characterization of poxviruses from 57 species of birds in this study extends the current knowledge of their host range and provides the first evidence of the phylogenetic effect of genetic recombination of avipoxviruses. The repeated occurrence of avian family or order-specific grouping within certain clades (e.g. starling poxvirus, falcon poxvirus, raptor poxvirus, etc.) indicates a marked role of host adaptation, while the sharing of poxvirus species within prey-predator systems emphasizes the capacity for cross-species infection and limited host adaptation. Our study provides a broad and comprehensive phylogenetic analysis of the Avipoxvirus genus, an ecologically and environmentally important viral group, to formulate a genome sequencing strategy that will clarify avipoxvirus taxonomy.

  9. Prioritizing Populations for Conservation Using Phylogenetic Networks

    PubMed Central

    Volkmann, Logan; Martyn, Iain; Moulton, Vincent; Spillner, Andreas; Mooers, Arne O.

    2014-01-01

    In the face of inevitable future losses to biodiversity, ranking species by conservation priority seems more than prudent. Setting conservation priorities within species (i.e., at the population level) may be critical as species ranges become fragmented and connectivity declines. However, existing approaches to prioritization (e.g., scoring organisms by their expected genetic contribution) are based on phylogenetic trees, which may be poor representations of differentiation below the species level. In this paper we extend evolutionary isolation indices used in conservation planning from phylogenetic trees to phylogenetic networks. Such networks better represent population differentiation, and our extension allows populations to be ranked in order of their expected contribution to the set. We illustrate the approach using data from two imperiled species: the spotted owl Strix occidentalis in North America and the mountain pygmy-possum Burramys parvus in Australia. Using previously published mitochondrial and microsatellite data, we construct phylogenetic networks and score each population by its relative genetic distinctiveness. In both cases, our phylogenetic networks capture the geographic structure of each species: geographically peripheral populations harbor less-redundant genetic information, increasing their conservation rankings. We note that our approach can be used with all conservation-relevant distances (e.g., those based on whole-genome, ecological, or adaptive variation) and suggest it be added to the assortment of tools available to wildlife managers for allocating effort among threatened populations. PMID:24586451

  10. Genomic repeat abundances contain phylogenetic signal.

    PubMed

    Dodsworth, Steven; Chase, Mark W; Kelly, Laura J; Leitch, Ilia J; Macas, Jiří; Novák, Petr; Piednoël, Mathieu; Weiss-Schneeweiss, Hanna; Leitch, Andrew R

    2015-01-01

    A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution. PMID:25261464

  11. Genomic Repeat Abundances Contain Phylogenetic Signal

    PubMed Central

    Dodsworth, Steven; Chase, Mark W.; Kelly, Laura J.; Leitch, Ilia J.; Macas, Jiří; Novák, Petr; Piednoël, Mathieu; Weiss-Schneeweiss, Hanna; Leitch, Andrew R.

    2015-01-01

    A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution. PMID:25261464

  12. Worldwide Phylogenetic Relationship of Avian Poxviruses

    PubMed Central

    Foster, Jeffrey T.; Dán, Ádám; Ip, Hon S.; Egstad, Kristina F.; Parker, Patricia G.; Higashiguchi, Jenni M.; Skinner, Michael A.; Höfle, Ursula; Kreizinger, Zsuzsa; Dorrestein, Gerry M.; Solt, Szabolcs; Sós, Endre; Kim, Young Jun; Uhart, Marcela; Pereda, Ariel; González-Hein, Gisela; Hidalgo, Hector; Blanco, Juan-Manuel; Erdélyi, Károly

    2013-01-01

    Poxvirus infections have been found in 230 species of wild and domestic birds worldwide in both terrestrial and marine environments. This ubiquity raises the question of how infection has been transmitted and globally dispersed. We present a comprehensive global phylogeny of 111 novel poxvirus isolates in addition to all available sequences from GenBank. Phylogenetic analysis of the Avipoxvirus genus has traditionally relied on one gene region (4b core protein). In this study we expanded the analyses to include a second locus (DNA polymerase gene), allowing for a more robust phylogenetic framework, finer genetic resolution within specific groups, and the detection of potential recombination. Our phylogenetic results reveal several major features of avipoxvirus evolution and ecology and propose an updated avipoxvirus taxonomy, including three novel subclades. The characterization of poxviruses from 57 species of birds in this study extends the current knowledge of their host range and provides the first evidence of the phylogenetic effect of genetic recombination of avipoxviruses. The repeated occurrence of avian family or order-specific grouping within certain clades (e.g., starling poxvirus, falcon poxvirus, raptor poxvirus, etc.) indicates a marked role of host adaptation, while the sharing of poxvirus species within prey-predator systems emphasizes the capacity for cross-species infection and limited host adaptation. Our study provides a broad and comprehensive phylogenetic analysis of the Avipoxvirus genus, an ecologically and environmentally important viral group, to formulate a genome sequencing strategy that will clarify avipoxvirus taxonomy. PMID:23408635

  13. Classification challenges in perfectionism.

    PubMed

    Rice, Kenneth G; Richardson, Clarissa M E

    2014-10-01

    High performance expectations are central to perfectionism, but because most participants endorse high standards, it becomes difficult for practitioners and researchers to accurately screen for perfectionists. We addressed problems linked to the measurement and classification of perfectionism by testing various strategies aimed at broadening the range and skew of scores on the Standards subscale from the Almost Perfect Scale-Revised (APS-R; Slaney, Mobley, Trippi, Ashby, & Johnson, 1996). Randomly assigned participants (N = 506) completed the APS-R following standard instructions or 1 of 2 variations, one prompting participants to consider their responses in light of a normal distribution of scores and another in which participants used a visual analog (slider) scale. The visual analog scale produced more differentiated scores, but range restrictions and skewed distributions remained for all 3 variations. Statistical transformations improved skew. Factor mixture modeling was conducted using transformed and nontransformed perfectionism scores along with criterion indicators of emotion regulation (reappraisal or suppression), perceived stress, and depression. Results supported a 3-class model, although more balanced distributions of classes emerged than were previously reported. Perfectionists were differentiated from nonperfectionists by their higher standards scores. Maladaptive perfectionists scored highest among the classes on most self-critical perfectionism indicators, suppression, perceived stress, and depression. Adaptive perfectionists had the lowest levels of perceived stress and depression and scored highest on reappraisal. Both perfectionist classes had generally comparable concerns about mistakes, but criterion indicators suggested those were more problematic for maladaptive perfectionists. Results supported the value of incorporating adaptive and maladaptive criterion indicators in classification models. PMID:25111705

  14. Improving Marginal Likelihood Estimation for Bayesian Phylogenetic Model Selection

    PubMed Central

    Xie, Wangang; Lewis, Paul O.; Fan, Yu; Kuo, Lynn; Chen, Ming-Hui

    2011-01-01

    The marginal likelihood is commonly used for comparing different evolutionary models in Bayesian phylogenetics and is the central quantity used in computing Bayes Factors for comparing model fit. A popular method for estimating marginal likelihoods, the harmonic mean (HM) method, can be easily computed from the output of a Markov chain Monte Carlo analysis but often greatly overestimates the marginal likelihood. The thermodynamic integration (TI) method is much more accurate than the HM method but requires more computation. In this paper, we introduce a new method, steppingstone sampling (SS), which uses importance sampling to estimate each ratio in a series (the “stepping stones”) bridging the posterior and prior distributions. We compare the performance of the SS approach to the TI and HM methods in simulation and using real data. We conclude that the greatly increased accuracy of the SS and TI methods argues for their use instead of the HM method, despite the extra computation needed. PMID:21187451

  15. Characterization of a branch of the phylogenetic tree

    SciTech Connect

    Samuel, Stuart A.; Weng, Gezhi

    2003-04-11

    We use a combination of analytic models and computer simulations to gain insight into the dynamics of evolution. Our results suggest that certain interesting phenomena should eventually emerge from the fossil record. For example, there should be a ''tortoise and hare effect'': Those genera with the smallest species death rate are likely to survive much longer than genera with large species birth and death rates. A complete characterization of the behavior of a branch of the phylogenetic tree corresponding to a genus and accurate mathematical representations of the various stages are obtained. We apply our results to address certain controversial issues that have arisen in paleontology such as the importance of punctuated equilibrium and whether unique Cambrian phyla have survived to the present.

  16. A Bayesian Phylogenetic Method to Estimate Unknown Sequence Ages

    PubMed Central

    Shapiro, Beth; Drummond, Alexei J.; Suchard, Marc A.; Pybus, Oliver G.; Rambaut, Andrew

    2011-01-01

    Heterochronous data sets comprise molecular sequences sampled at different points in time. If the temporal range of the sampled sequences is large relative to the rate of mutation, the sampling times can directly calibrate evolutionary rates to calendar time. Here, we extend this calibration process to provide a full probabilistic method that utilizes temporal information in heterochronous data sets to estimate sampling times (leaf-ages) for sequenced for which this information unavailable. Our method is similar to relaxing the constraints of the molecular clock on specific lineages within a phylogenetic tree. Using a combination of synthetic and empirical data sets, we demonstrate that the method estimates leaf-ages reliably and accurately. Potential applications of our approach include incorporating samples of uncertain or radiocarbon-infinite age into ancient DNA analyses, evaluating the temporal signal in a particular sequence or data set, and exploring the reliability of sequence ages that are somehow contentious. PMID:20889726

  17. Computational Prediction of Phylogenetically Conserved Sequence Motifs for Five Different Candidate Genes in Type II Diabetic Nephropathy

    PubMed Central

    Sindhu, T; Rajamanikandan, S; Srinivasan, P

    2012-01-01

    Background: Computational identification of phylogenetic motifs helps to understand the knowledge about known functional features that includes catalytic site, substrate binding epitopes, and protein-protein interfaces. Furthermore, they are strongly conserved among orthologs, indicating their evolutionary importance. The study aimed to analyze five candidate genes involved in type II diabetic nephropathy and to predict phylogenetic motifs from their corresponding orthologous protein sequences. Methods: AKR1B1, APOE, ENPP1, ELMO1 and IGFBP1 are the genes that have been identified as an important target for type II diabetic nephropathy through experimental studies. Their corresponding protein sequences, structures, orthologous sequences were retrieved from UniprotKB, PDB, and PHOG database respectively. Multiple sequence alignments were constructed using ClustalW and phylogenetic motifs were identified using MINER. The occurrence of amino acids in the obtained phylogenetic motifs was generated using WebLogo and false positive expectations were calculated against phylogenetic similarity. Results: In total, 17 phylogenetic motifs were identified from the five proteins and the residues such as glycine, leucine, tryptophan, aspartic acid were found in appreciable frequency whereas arginine identified in all the predicted PMs. The result implies that these residues can be important to the functional and structural role of the proteins and calculated false positive expectations implies that they were generally conserved in traditional sense. Conclusion: The prediction of phylogenetic motifs is an accurate method for detecting functionally important conserved residues. The conserved motifs can be used as a potential drug target for type II diabetic nephropathy. PMID:23113206

  18. The Transporter Classification Database (TCDB): recent advances.

    PubMed

    Saier, Milton H; Reddy, Vamsee S; Tsu, Brian V; Ahmed, Muhammad Saad; Li, Chun; Moreno-Hagelsieb, Gabriel

    2016-01-01

    The Transporter Classification Database (TCDB; http://www.tcdb.org) is a freely accessible reference database for transport protein research, which provides structural, functional, mechanistic, evolutionary and disease/medical information about transporters from organisms of all types. TCDB is the only transport protein classification database adopted by the International Union of Biochemistry and Molecular Biology (IUBMB). It consists of more than 10 000 non-redundant transport systems with more than 11 000 reference citations, classified into over 1000 transporter families. Transporters in TCDB can be single or multi-component systems, categorized in a functional/phylogenetic hierarchical system of classes, subclasses, families, subfamilies and transport systems. TCDB also includes updated software designed to analyze the distinctive features of transport proteins, extending its usefulness. Here we present a comprehensive update of the database contents and features and summarize recent discoveries recorded in TCDB. PMID:26546518

  19. The Transporter Classification Database (TCDB): recent advances

    PubMed Central

    Saier, Milton H.; Reddy, Vamsee S.; Tsu, Brian V.; Ahmed, Muhammad Saad; Li, Chun; Moreno-Hagelsieb, Gabriel

    2016-01-01

    The Transporter Classification Database (TCDB; http://www.tcdb.org) is a freely accessible reference database for transport protein research, which provides structural, functional, mechanistic, evolutionary and disease/medical information about transporters from organisms of all types. TCDB is the only transport protein classification database adopted by the International Union of Biochemistry and Molecular Biology (IUBMB). It consists of more than 10 000 non-redundant transport systems with more than 11 000 reference citations, classified into over 1000 transporter families. Transporters in TCDB can be single or multi-component systems, categorized in a functional/phylogenetic hierarchical system of classes, subclasses, families, subfamilies and transport systems. TCDB also includes updated software designed to analyze the distinctive features of transport proteins, extending its usefulness. Here we present a comprehensive update of the database contents and features and summarize recent discoveries recorded in TCDB. PMID:26546518

  20. Construction of the Platform for Phylogenetic Analysis

    NASA Astrophysics Data System (ADS)

    Meng, Zhen; Lin, Xiaoguang; He, Xing; Gao, Yanping; Liu, Hongmei; Liu, Yong; Zhou, Yuanchun; Li, Jianhui; Chen, Zhiduan; Zhang, Shouzhou; Li, Yong

    Based on discussing the history of advancement to building the tree of life using genetic and genomic information, effective strategies and methods for the construction of the tree of life, this paper carried out business process analysis and application design. It implements a phylogenetic analysis platform for the land plants based on this analysis. The platform extracts molecular data from the international public databases in batch, which is automated acquisition, cleaning function for users to understand the situation of peer data. The process of phylogenetic reconstruction includes several public modes and tools, such as batch extraction, multiple sequence alignment, cleaning & editing, tree reconstruction, phylogeny evaluation and visualization. All these procedures demand a number of interactive interfaces for phylogenetic tree automatic generation and decision-making aids experiment.

  1. Automated Decision Tree Classification of Corneal Shape

    PubMed Central

    Twa, Michael D.; Parthasarathy, Srinivasan; Roberts, Cynthia; Mahmoud, Ashraf M.; Raasch, Thomas W.; Bullimore, Mark A.

    2011-01-01

    Purpose The volume and complexity of data produced during videokeratography examinations present a challenge of interpretation. As a consequence, results are often analyzed qualitatively by subjective pattern recognition or reduced to comparisons of summary indices. We describe the application of decision tree induction, an automated machine learning classification method, to discriminate between normal and keratoconic corneal shapes in an objective and quantitative way. We then compared this method with other known classification methods. Methods The corneal surface was modeled with a seventh-order Zernike polynomial for 132 normal eyes of 92 subjects and 112 eyes of 71 subjects diagnosed with keratoconus. A decision tree classifier was induced using the C4.5 algorithm, and its classification performance was compared with the modified Rabinowitz–McDonnell index, Schwiegerling’s Z3 index (Z3), Keratoconus Prediction Index (KPI), KISA%, and Cone Location and Magnitude Index using recommended classification thresholds for each method. We also evaluated the area under the receiver operator characteristic (ROC) curve for each classification method. Results Our decision tree classifier performed equal to or better than the other classifiers tested: accuracy was 92% and the area under the ROC curve was 0.97. Our decision tree classifier reduced the information needed to distinguish between normal and keratoconus eyes using four of 36 Zernike polynomial coefficients. The four surface features selected as classification attributes by the decision tree method were inferior elevation, greater sagittal depth, oblique toricity, and trefoil. Conclusions Automated decision tree classification of corneal shape through Zernike polynomials is an accurate quantitative method of classification that is interpretable and can be generated from any instrument platform capable of raw elevation data output. This method of pattern classification is extendable to other classification problems. PMID:16357645

  2. Phylogenetic conservatism of functional traits in microorganisms.

    PubMed

    Martiny, Adam C; Treseder, Kathleen; Pusch, Gordon

    2013-04-01

    A central question in biology is how biodiversity influences ecosystem functioning. Underlying this is the relationship between organismal phylogeny and the presence of specific functional traits. The relationship is complicated by gene loss and convergent evolution, resulting in the polyphyletic distribution of many traits. In microorganisms, lateral gene transfer can further distort the linkage between phylogeny and the presence of specific functional traits. To identify the phylogenetic conservation of specific traits in microorganisms, we developed a new phylogenetic metric-consenTRAIT-to estimate the clade depth where organisms share a trait. We then analyzed the distribution of 89 functional traits across a broad range of Bacteria and Archaea using genotypic and phenotypic data. A total of 93% of the traits were significantly non-randomly distributed, which suggested that vertical inheritance was generally important for the phylogenetic dispersion of functional traits in microorganisms. Further, traits in microbes were associated with a continuum of trait depths (τD), ranging from a few deep to many shallow clades (average τD: 0.101-0.0011 rRNA sequence dissimilarity). Next, we demonstrated that the dispersion and the depth of clades that contain a trait is correlated with the trait's complexity. Specifically, complex traits encoded by many genes like photosynthesis and methanogenesis were found in a few deep clusters, whereas the ability to use simple carbon substrates was highly phylogenetically dispersed. On the basis of these results, we propose a framework for predicting the phylogenetic conservatism of functional traits depending on the complexity of the trait. This framework enables predicting how variation in microbial composition may affect microbially-mediated ecosystem processes as well as linking phylogenetic and trait-based patterns of biogeography. PMID:23235290

  3. TCS: a web server for multiple sequence alignment evaluation and phylogenetic reconstruction

    PubMed Central

    Chang, Jia-Ming; Di Tommaso, Paolo; Lefort, Vincent; Gascuel, Olivier; Notredame, Cedric

    2015-01-01

    This article introduces the Transitive Consistency Score (TCS) web server; a service making it possible to estimate the local reliability of protein multiple sequence alignments (MSAs) using the TCS index. The evaluation can be used to identify the aligned positions most likely to contain structurally analogous residues and also most likely to support an accurate phylogenetic reconstruction. The TCS scoring scheme has been shown to be accurate predictor of structural alignment correctness among commonly used methods. It has also been shown to outperform common filtering schemes like Gblocks or trimAl when doing MSA post-processing prior to phylogenetic tree reconstruction. The web server is available from http://tcoffee.crg.cat/tcs. PMID:25855806

  4. TCS: a web server for multiple sequence alignment evaluation and phylogenetic reconstruction.

    PubMed

    Chang, Jia-Ming; Di Tommaso, Paolo; Lefort, Vincent; Gascuel, Olivier; Notredame, Cedric

    2015-07-01

    This article introduces the Transitive Consistency Score (TCS) web server; a service making it possible to estimate the local reliability of protein multiple sequence alignments (MSAs) using the TCS index. The evaluation can be used to identify the aligned positions most likely to contain structurally analogous residues and also most likely to support an accurate phylogenetic reconstruction. The TCS scoring scheme has been shown to be accurate predictor of structural alignment correctness among commonly used methods. It has also been shown to outperform common filtering schemes like Gblocks or trimAl when doing MSA post-processing prior to phylogenetic tree reconstruction. The web server is available from http://tcoffee.crg.cat/tcs. PMID:25855806

  5. Understanding phylogenetic incongruence: lessons from phyllostomid bats

    PubMed Central

    Dávalos, Liliana M; Cirranello, Andrea L; Geisler, Jonathan H; Simmons, Nancy B

    2012-01-01

    All characters and trait systems in an organism share a common evolutionary history that can be estimated using phylogenetic methods. However, differential rates of change and the evolutionary mechanisms driving those rates result in pervasive phylogenetic conflict. These drivers need to be uncovered because mismatches between evolutionary processes and phylogenetic models can lead to high confidence in incorrect hypotheses. Incongruence between phylogenies derived from morphological versus molecular analyses, and between trees based on different subsets of molecular sequences has become pervasive as datasets have expanded rapidly in both characters and species. For more than a decade, evolutionary relationships among members of the New World bat family Phyllostomidae inferred from morphological and molecular data have been in conflict. Here, we develop and apply methods to minimize systematic biases, uncover the biological mechanisms underlying phylogenetic conflict, and outline data requirements for future phylogenomic and morphological data collection. We introduce new morphological data for phyllostomids and outgroups and expand previous molecular analyses to eliminate methodological sources of phylogenetic conflict such as taxonomic sampling, sparse character sampling, or use of different algorithms to estimate the phylogeny. We also evaluate the impact of biological sources of conflict: saturation in morphological changes and molecular substitutions, and other processes that result in incongruent trees, including convergent morphological and molecular evolution. Methodological sources of incongruence play some role in generating phylogenetic conflict, and are relatively easy to eliminate by matching taxa, collecting more characters, and applying the same algorithms to optimize phylogeny. The evolutionary patterns uncovered are consistent with multiple biological sources of conflict, including saturation in morphological and molecular changes, adaptive morphological convergence among nectar-feeding lineages, and incongruent gene trees. Applying methods to account for nucleotide sequence saturation reduces, but does not completely eliminate, phylogenetic conflict. We ruled out paralogy, lateral gene transfer, and poor taxon sampling and outgroup choices among the processes leading to incongruent gene trees in phyllostomid bats. Uncovering and countering the possible effects of introgression and lineage sorting of ancestral polymorphism on gene trees will require great leaps in genomic and allelic sequencing in this species-rich mammalian family. We also found evidence for adaptive molecular evolution leading to convergence in mitochondrial proteins among nectar-feeding lineages. In conclusion, the biological processes that generate phylogenetic conflict are ubiquitous, and overcoming incongruence requires better models and more data than have been collected even in well-studied organisms such as phyllostomid bats. PMID:22891620

  6. Phylogenetic Analysis of Mitochondrial Outer Membrane β-Barrel Channels

    PubMed Central

    Wojtkowska, Małgorzata; Jąkalski, Marcin; Pieńkowska, Joanna R.; Stobienia, Olgierd; Karachitos, Andonis; Przytycka, Teresa M.; Weiner, January; Kmita, Hanna; Makałowski, Wojciech

    2012-01-01

    Transport of molecules across mitochondrial outer membrane is pivotal for a proper function of mitochondria. The transport pathways across the membrane are formed by ion channels that participate in metabolite exchange between mitochondria and cytoplasm (voltage-dependent anion-selective channel, VDAC) as well as in import of proteins encoded by nuclear genes (Tom40 and Sam50/Tob55). VDAC, Tom40, and Sam50/Tob55 are present in all eukaryotic organisms, encoded in the nuclear genome, and have β-barrel topology. We have compiled data sets of these protein sequences and studied their phylogenetic relationships with a special focus on the position of Amoebozoa. Additionally, we identified these protein-coding genes in Acanthamoeba castellanii and Dictyostelium discoideum to complement our data set and verify the phylogenetic position of these model organisms. Our analysis show that mitochondrial β-barrel channels from Archaeplastida (plants) and Opisthokonta (animals and fungi) experienced many duplication events that resulted in multiple paralogous isoforms and form well-defined monophyletic clades that match the current model of eukaryotic evolution. However, in representatives of Amoebozoa, Chromalveolata, and Excavata (former Protista), they do not form clearly distinguishable clades, although they locate basally to the plant and algae branches. In most cases, they do not posses paralogs and their sequences appear to have evolved quickly or degenerated. Consequently, the obtained phylogenies of mitochondrial outer membrane β-channels do not entirely reflect the recent eukaryotic classification system involving the six supergroups: Chromalveolata, Excavata, Archaeplastida, Rhizaria, Amoebozoa, and Opisthokonta. PMID:22155732

  7. The phylogenetic significance of colour patterns in marine teleost larvae

    PubMed Central

    Baldwin, Carole C

    2013-01-01

    Ichthyologists, natural-history artists, and tropical-fish aquarists have described, illustrated, or photographed colour patterns in adult marine fishes for centuries, but colour patterns in marine fish larvae have largely been neglected. Yet the pelagic larval stages of many marine fishes exhibit subtle to striking, ephemeral patterns of chromatophores that warrant investigation into their potential taxonomic and phylogenetic significance. Colour patterns in larvae of over 200 species of marine teleosts, primarily from the western Caribbean, were examined from digital colour photographs, and their potential utility in elucidating evolutionary relationships at various taxonomic levels was assessed. Larvae of relatively few basal marine teleosts exhibit erythrophores, xanthophores, or iridophores (i.e. nonmelanistic chromatophores), but one or more of those types of chromatophores are visible in larvae of many basal marine neoteleosts and nearly all marine percomorphs. Whether or not the presence of nonmelanistic chromatophores in pelagic marine larvae diagnoses any major teleost taxonomic group cannot be determined based on the preliminary survey conducted, but there is a trend toward increased colour from elopomorphs to percomorphs. Within percomorphs, patterns of nonmelanistic chromatophores may help resolve or contribute evidence to existing hypotheses of relationships at multiple levels of classification. Mugilid and some beloniform larvae share a unique ontogenetic transformation of colour pattern that lends support to the hypothesis of a close relationship between them. Larvae of some tetraodontiforms and lophiiforms are strikingly similar in having the trunk enclosed in an inflated sac covered with xanthophores, a character that may help resolve the relationships of these enigmatic taxa. Colour patterns in percomorph larvae also appear to diagnose certain groups at the interfamilial, familial, intergeneric, and generic levels. Slight differences in generic colour patterns, including whether the pattern comprises xanthophores or erythrophores, often distinguish species. The homology, ontogeny, and possible functional significance of colour patterns in larvae are discussed. Considerably more investigation of larval colour patterns in marine teleosts is needed to assess fully their value in phylogenetic reconstruction. PMID:24039297

  8. Phylogenetic analysis of mitochondrial outer membrane β-barrel channels.

    PubMed

    Wojtkowska, Małgorzata; Jąkalski, Marcin; Pieńkowska, Joanna R; Stobienia, Olgierd; Karachitos, Andonis; Przytycka, Teresa M; Weiner, January; Kmita, Hanna; Makałowski, Wojciech

    2012-01-01

    Transport of molecules across mitochondrial outer membrane is pivotal for a proper function of mitochondria. The transport pathways across the membrane are formed by ion channels that participate in metabolite exchange between mitochondria and cytoplasm (voltage-dependent anion-selective channel, VDAC) as well as in import of proteins encoded by nuclear genes (Tom40 and Sam50/Tob55). VDAC, Tom40, and Sam50/Tob55 are present in all eukaryotic organisms, encoded in the nuclear genome, and have β-barrel topology. We have compiled data sets of these protein sequences and studied their phylogenetic relationships with a special focus on the position of Amoebozoa. Additionally, we identified these protein-coding genes in Acanthamoeba castellanii and Dictyostelium discoideum to complement our data set and verify the phylogenetic position of these model organisms. Our analysis show that mitochondrial β-barrel channels from Archaeplastida (plants) and Opisthokonta (animals and fungi) experienced many duplication events that resulted in multiple paralogous isoforms and form well-defined monophyletic clades that match the current model of eukaryotic evolution. However, in representatives of Amoebozoa, Chromalveolata, and Excavata (former Protista), they do not form clearly distinguishable clades, although they locate basally to the plant and algae branches. In most cases, they do not posses paralogs and their sequences appear to have evolved quickly or degenerated. Consequently, the obtained phylogenies of mitochondrial outer membrane β-channels do not entirely reflect the recent eukaryotic classification system involving the six supergroups: Chromalveolata, Excavata, Archaeplastida, Rhizaria, Amoebozoa, and Opisthokonta. PMID:22155732

  9. Phylogenetic and Structural Analysis of Polyketide Synthases in Aspergilli

    PubMed Central

    Bhetariya, Preetida J.; Prajapati, Madhvi; Bhaduri, Asani; Mandal, Rahul Shubhra; Varma, Anupam; Madan, Taruna; Singh, Yogendra; Sarma, P. Usha

    2016-01-01

    Polyketide synthases (PKSs) of Aspergillus species are multidomain and multifunctional megaenzymes that play an important role in the synthesis of diverse polyketide compounds. Putative PKS protein sequences from Aspergillus species representing medically, agriculturally, and industrially important Aspergillus species were chosen and screened for in silico studies. Six candidate Aspergillus species, Aspergillus fumigatus Af293, Aspergillus flavus NRRL3357, Aspergillus niger CBS 513.88, Aspergillus terreus NIH2624, Aspergillus oryzae RIB40, and Aspergillus clavatus NRRL1, were selected to study the PKS phylogeny. Full-length PKS proteins and only ketosynthase (KS) domain sequence were retrieved for independent phylogenetic analysis from the aforementioned species, and phylogenetic analysis was performed with characterized fungal PKS. This resulted into grouping of Aspergilli PKSs into nonreducing (NR), partially reducing (PR), and highly reducing (HR) PKS enzymes. Eight distinct clades with unique domain arrangements were classified based on homology with functionally characterized PKS enzymes. Conserved motif signatures corresponding to each type of PKS were observed. Three proteins from Protein Data Bank corresponding to NR, PR, and HR type of PKS (XP_002384329.1, XP_753141.2, and XP_001402408.2, respectively) were selected for mapping of conserved motifs on three-dimensional structures of KS domain. Structural variations were found at the active sites on modeled NR, PR, and HR enzymes of Aspergillus. It was observed that the number of iteration cycles was dependent on the size of the cavity in the active site of the PKS enzyme correlating with a type with reducing or NR products, such as pigment, 6MSA, and lovastatin. The current study reports the grouping and classification of PKS proteins of Aspergilli for possible exploration of novel polyketides based on sequence homology; this information can be useful for selection of PKS for polyketide exploration and specific detection of Aspergilli. PMID:27199544

  10. PHYML Online—a web server for fast maximum likelihood-based phylogenetic inference

    PubMed Central

    Guindon, Stéphane; Lethiec, Franck; Duroux, Patrice; Gascuel, Olivier

    2005-01-01

    PHYML Online is a web interface to PHYML, a software that implements a fast and accurate heuristic for estimating maximum likelihood phylogenies from DNA and protein sequences. This tool provides the user with a number of options, e.g. nonparametric bootstrap and estimation of various evolutionary parameters, in order to perform comprehensive phylogenetic analyses on large datasets in reasonable computing time. The server and its documentation are available at . PMID:15980534

  11. Spectral-spatial hyperspectral classification based on multi-center SAM and MRF

    NASA Astrophysics Data System (ADS)

    Tang, Bo; Liu, Zhi; Xiao, Xiaoyan; Nie, Mingyu; Chang, Jun; Jiang, Wei; Li, Xiaomei; Zheng, Chengyun

    2015-12-01

    In this paper, a novel framework for an accurate spectral-spatial classification of hyperspectral images is proposed to address nonlinear classification problems. The algorithm is based on the spectral angle mapper (SAM), which is achieved by introducing the multi-center model and Markov random fields (MRF) into a probabilistic decision framework to obtain an accurate classification. Experimental comparisons between several traditional classification methods and the proposed MSAM-MRF algorithm have demonstrated that the performance of the proposed MSAM-MRF algorithm outperforms the traditional classification algorithms.

  12. The complete mitochondrial genome of Meriones libycus (Rodentia: Cricetidae) and its phylogenetic analysis.

    PubMed

    Luo, Guangjie; Liao, Jicheng

    2016-07-01

    Meriones libycus belongs to the genus Meriones in Gerbillinae, its complete mitochondrial genome is 16,341 bp in length. The heavy strand contains 32.8% A, 13.1% G, 25.3% C, 28.8% T, protein-coding genes approximately accounting for 69.54%. Results of phylogenetic analysis showed that M. libycus and Meriones unguiculatus were clustered together, and it was consistent with that of primary morphological taxonomy. This study verifies the evolutionary status of M. libycus in Meriones at the molecular level. The mitochondrial genome would be a significant supplement for the gene pool of Rodentia and the conclusion of phylogenetic analysis could be an important molecular evidence for the classification of Gerbillinae. PMID:26017047

  13. Phylogenetics of the Phlebotomine Sand Fly Group Verrucarum (Diptera: Psychodidae: Lutzomyia)

    PubMed Central

    Cohnstaedt, Lee W.; Beati, Lorenza; Caceres, Abraham G.; Ferro, Cristina; Munstermann, Leonard E.

    2011-01-01

    Within the sand fly genus Lutzomyia, the Verrucarum species group contains several of the principal vectors of American cutaneous leishmaniasis and human bartonellosis in the Andean region of South America. The group encompasses 40 species for which the taxonomic status, phylogenetic relationships, and role of each species in disease transmission remain unresolved. Mitochondrial cytochrome c oxidase I (COI) phylogenetic analysis of a 667-bp fragment supported the morphological classification of the Verrucarum group into series. Genetic sequences from seven species were grouped in well-supported monophyletic lineages. Four species, however, clustered in two paraphyletic lineages that indicate conspecificity—the Lutzomyia longiflocosa–Lutzomyia sauroida pair and the Lutzomyia quasitownsendi–Lutzomyia torvida pair. COI sequences were also evaluated as a taxonomic tool based on interspecific genetic variability within the Verrucarum group and the intraspecific variability of one of its members, Lutzomyia verrucarum, across its known distribution. PMID:21633028

  14. The complete mitochondrial genome sequence of Acentrogobius sp. (Gobiiformes: Gobiidae) and phylogenetic studies of Gobiidae.

    PubMed

    Yang, Qiu-Hua; Lin, Qi; He, Li-Bin; Huang, Rui-Fang; Lin, Ke-Bing; Ge, Hui; Wu, Jian-Shao; Zhou, Chen

    2016-07-01

    At present, few morphological descriptions are available for Acentrogobius species and there exist some confused issues on the species classification and phylogeny. In this study, we first determined and described the complete mitochondrial genome of Acentrogobius sp. The complete mitogenome sequence is 17 083 bp in length, containing 13 protein-coding genes, two rRNA genes, 22 tRNA genes, a putative control region (CR), and a light-strand replication origin (OL). The overall base composition is 28.9% A, 26.2% T, 28.5% C, and 16.4% G, with a slight AT bias (55.1%). To furthermore validate the new determined sequences, phylogenetic trees involving all the Gobiidae species available in GenBank database were constructed. These results are expected to provide useful molecular data for species identification and further phylogenetic studies of Gobiiformes. PMID:26152347

  15. Comparative genomic analysis and phylogenetic position of Theileria equi

    PubMed Central

    2012-01-01

    Background Transmission of arthropod-borne apicomplexan parasites that cause disease and result in death or persistent infection represents a major challenge to global human and animal health. First described in 1901 as Piroplasma equi, this re-emergent apicomplexan parasite was renamed Babesia equi and subsequently Theileria equi, reflecting an uncertain taxonomy. Understanding mechanisms by which apicomplexan parasites evade immune or chemotherapeutic elimination is required for development of effective vaccines or chemotherapeutics. The continued risk of transmission of T. equi from clinically silent, persistently infected equids impedes the goal of returning the U. S. to non-endemic status. Therefore comparative genomic analysis of T. equi was undertaken to: 1) identify genes contributing to immune evasion and persistence in equid hosts, 2) identify genes involved in PBMC infection biology and 3) define the phylogenetic position of T. equi relative to sequenced apicomplexan parasites. Results The known immunodominant proteins, EMA1, 2 and 3 were discovered to belong to a ten member gene family with a mean amino acid identity, in pairwise comparisons, of 39%. Importantly, the amino acid diversity of EMAs is distributed throughout the length of the proteins. Eight of the EMA genes were simultaneously transcribed. As the agents that cause bovine theileriosis infect and transform host cell PBMCs, we confirmed that T. equi infects equine PBMCs, however, there is no evidence of host cell transformation. Indeed, a number of genes identified as potential manipulators of the host cell phenotype are absent from the T. equi genome. Comparative genomic analysis of T. equi revealed the phylogenetic positioning relative to seven apicomplexan parasites using deduced amino acid sequences from 150 genes placed it as a sister taxon to Theileria spp. Conclusions The EMA family does not fit the paradigm for classical antigenic variation, and we propose a novel model describing the role of the EMA family in persistence. T. equi has lost the putative genes for host cell transformation, or the genes were acquired by T. parva and T. annulata after divergence from T. equi. Our analysis identified 50 genes that will be useful for definitive phylogenetic classification of T. equi and closely related organisms. PMID:23137308

  16. New Classification of Headache

    PubMed Central

    Gawel, Marek J.

    1992-01-01

    The Headache Classification Committee of the International Headache Society has developed a new classification system for headache, cranial neuralgia, and facial pain. The value of the classification for the practising clinician is that it forces him or her to take a more careful history in order to determine the nature of the headache. This article reviews the classification system and gives examples of case histories and subsequent diagnoses. PMID:21221276

  17. Threat Diversity Will Erode Mammalian Phylogenetic Diversity in the Near Future

    PubMed Central

    Jono, Clémentine M. A.; Pavoine, Sandrine

    2012-01-01

    To reduce the accelerating rate of phylogenetic diversity loss, many studies have searched for mechanisms that could explain why certain species are at risk, whereas others are not. In particular, it has been demonstrated that species might be affected by both extrinsic threat factors as well as intrinsic biological traits that could render a species more sensitive to extinction; here, we focus on extrinsic factors. Recently, the International Union for Conservation of Nature developed a new classification of threat types, including climate change, urbanization, pollution, agriculture and aquaculture, and harvesting/hunting. We have used this new classification to analyze two main factors that could explain the expected future loss of mammalian phylogenetic diversity: 1. differences in the type of threats that affect mammals and 2. differences in the number of major threats that accumulate for a single species. Our results showed that Cetartiodactyla, Diprotodontia, Monotremata, Perissodactyla, Primates, and Proboscidea could lose a high proportion of their current phylogenetic diversity in the coming decades. In contrast, Chiroptera, Didelphimorphia, and Rodentia could lose less phylogenetic diversity than expected if extinctions were random. Some mammalian clades, including Marsupiala, Chiroptera, and a subclade of Primates, are affected by particular threat types, most likely due solely to their geographic locations and associations with particular habitats. However, regardless of the geography, habitat, and taxon considered, it is not the threat type, but the threat diversity that determines the extinction risk for species and clades. Thus, some mammals might be randomly located in areas subjected to a large diversity of threats; they might also accumulate detrimental traits that render them sensitive to different threats, which is a characteristic that could be associated with large body size. Any action reducing threat diversity is expected to have a significant impact on future mammalian phylogeny. PMID:23029443

  18. Threat diversity will erode mammalian phylogenetic diversity in the near future.

    PubMed

    Jono, Clémentine M A; Pavoine, Sandrine

    2012-01-01

    To reduce the accelerating rate of phylogenetic diversity loss, many studies have searched for mechanisms that could explain why certain species are at risk, whereas others are not. In particular, it has been demonstrated that species might be affected by both extrinsic threat factors as well as intrinsic biological traits that could render a species more sensitive to extinction; here, we focus on extrinsic factors. Recently, the International Union for Conservation of Nature developed a new classification of threat types, including climate change, urbanization, pollution, agriculture and aquaculture, and harvesting/hunting. We have used this new classification to analyze two main factors that could explain the expected future loss of mammalian phylogenetic diversity: 1. differences in the type of threats that affect mammals and 2. differences in the number of major threats that accumulate for a single species. Our results showed that Cetartiodactyla, Diprotodontia, Monotremata, Perissodactyla, Primates, and Proboscidea could lose a high proportion of their current phylogenetic diversity in the coming decades. In contrast, Chiroptera, Didelphimorphia, and Rodentia could lose less phylogenetic diversity than expected if extinctions were random. Some mammalian clades, including Marsupiala, Chiroptera, and a subclade of Primates, are affected by particular threat types, most likely due solely to their geographic locations and associations with particular habitats. However, regardless of the geography, habitat, and taxon considered, it is not the threat type, but the threat diversity that determines the extinction risk for species and clades. Thus, some mammals might be randomly located in areas subjected to a large diversity of threats; they might also accumulate detrimental traits that render them sensitive to different threats, which is a characteristic that could be associated with large body size. Any action reducing threat diversity is expected to have a significant impact on future mammalian phylogeny. PMID:23029443

  19. Quantifying MCMC Exploration of Phylogenetic Tree Space

    PubMed Central

    Whidden, Chris; Matsen, Frederick A.

    2015-01-01

    In order to gain an understanding of the effectiveness of phylogenetic Markov chain Monte Carlo (MCMC), it is important to understand how quickly the empirical distribution of the MCMC converges to the posterior distribution. In this article, we investigate this problem on phylogenetic tree topologies with a metric that is especially well suited to the task: the subtree prune-and-regraft (SPR) metric. This metric directly corresponds to the minimum number of MCMC rearrangements required to move between trees in common phylogenetic MCMC implementations. We develop a novel graph-based approach to analyze tree posteriors and find that the SPR metric is much more informative than simpler metrics that are unrelated to MCMC moves. In doing so, we show conclusively that topological peaks do occur in Bayesian phylogenetic posteriors from real data sets as sampled with standard MCMC approaches, investigate the efficiency of Metropolis-coupled MCMC (MCMCMC) in traversing the valleys between peaks, and show that conditional clade distribution (CCD) can have systematic problems when there are multiple peaks. PMID:25631175

  20. On the analysis of phylogenetically paired designs

    PubMed Central

    Funk, Jennifer L; Rakovski, Cyril S; Macpherson, J Michael

    2015-01-01

    As phylogenetically controlled experimental designs become increasingly common in ecology, the need arises for a standardized statistical treatment of these datasets. Phylogenetically paired designs circumvent the need for resolved phylogenies and have been used to compare species groups, particularly in the areas of invasion biology and adaptation. Despite the widespread use of this approach, the statistical analysis of paired designs has not been critically evaluated. We propose a mixed model approach that includes random effects for pair and species. These random effects introduce a “two-layer” compound symmetry variance structure that captures both the correlations between observations on related species within a pair as well as the correlations between the repeated measurements within species. We conducted a simulation study to assess the effect of model misspecification on Type I and II error rates. We also provide an illustrative example with data containing taxonomically similar species and several outcome variables of interest. We found that a mixed model with species and pair as random effects performed better in these phylogenetically explicit simulations than two commonly used reference models (no or single random effect) by optimizing Type I error rates and power. The proposed mixed model produces acceptable Type I and II error rates despite the absence of a phylogenetic tree. This design can be generalized to a variety of datasets to analyze repeated measurements in clusters of related subjects/species. PMID:25750719

  1. Molecular Epidemiology of PRRSV: A Phylogenetic Perspective

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Since its first discovery two decades ago, porcine reproductive and respiratory syndrome virus (PRRSV) has been the subject of intensive research due to its huge impact on the worldwide swine industry. Thanks to phylogenetic analyses, much has been learned about the genetic diversity and evolution h...

  2. Classification of articulators.

    PubMed

    Rihani, A

    1980-03-01

    A simple classification in familiar terms with definite, clear characteristics can be adopted. This classification system is based on the number of records used and the adjustments necessary for the articulator to accept these records. The classification divides the articulators into nonadjustable, semiadjustable, and fully adjustable articulators (Table I). PMID:6928204

  3. Aircraft Operations Classification System

    NASA Technical Reports Server (NTRS)

    Harlow, Charles; Zhu, Weihong

    2001-01-01

    Accurate data is important in the aviation planning process. In this project we consider systems for measuring aircraft activity at airports. This would include determining the type of aircraft such as jet, helicopter, single engine, and multiengine propeller. Some of the issues involved in deploying technologies for monitoring aircraft operations are cost, reliability, and accuracy. In addition, the system must be field portable and acceptable at airports. A comparison of technologies was conducted and it was decided that an aircraft monitoring system should be based upon acoustic technology. A multimedia relational database was established for the study. The information contained in the database consists of airport information, runway information, acoustic records, photographic records, a description of the event (takeoff, landing), aircraft type, and environmental information. We extracted features from the time signal and the frequency content of the signal. A multi-layer feed-forward neural network was chosen as the classifier. Training and testing results were obtained. We were able to obtain classification results of over 90 percent for training and testing for takeoff events.

  4. [Phylogenetic thinking in the modern biology].

    PubMed

    Pavlinov, I Ia

    2007-01-01

    Any research activity is conducted within the framework of a cognitive situation which is defined by certain basic assumptions about ontology of the portion of the objective world under investigation. From the standpoint of the non-classical scientific epistemology, a part of that situation is constituted by personal knowledge which is formed by a set of thinking (cognitive) styles. The scholastic thinking existing in taxonomy and phylogenetics is considered as an example showing unavoidability of such styles in the natural history knowledge. It is initially rooted in the antic, mythological by its essence, persuasion of isomorphism between movements of the objective reality and of the mind. The instrumentalism entailed by scholastic thinking is based on the mythologeme according to which the "right method" par excellence can lead to the "right knowledge". That is why any disputes between different numerical methods of phylogenetic reconstructions are vain: their validity could be assessed not formally but within particular cognitive situations formed by particular basic models of the phylogenesis. Phylogenetic thinking is of the key importance in evolutionary biology and has great impact on various fields of biology based on it. It is pretty mythological because of non-observability of the phylogenesis: the latter is rather "thinked-in" in the objective world then is induced from the observed facts. It constitutes a part of the evolutionary thinking considering mainly macroevolutionary trends and stressing the initial causes in the structure of causal relations in the analyses of the diversity of organisms. The "tree thinking" of O'Hara is its rough operational equivalent. Relation between phylogenetic thinking and some other styles are considered, which are population, phenetic, typological, and epigenetic ("developmental" of O'Hara). Phylogenetic thinking makes it obliged inclusion of the initial causes in the explanatory models which underlie adaptive and functional peculiarities of organisms, as well as of the entire structure of the biodiversity. It manifests itself in such kind of models through uncovering the phylogenetic signal. This thinking style has great effect on understanding of the ontology of taxa and acknowledges the objective status of the phylogenetic pattern. It is intrinsically included in the argumentation schemes of constructional morphology, comparative phylogenetics. The central metaphor of the phylogenetics is the Tree of Life. Emagination of its unity and uniquiness is of naturphilosophical nature. From the contemporary epistemological standpoint, it should be considered as a generalization upon partial hypotheses of evolution of particular structures each corresponding to certain consideration aspect of the global phylogenesis. Acknowledging of multi-aspectness of the phylogenesis constitutes one of the important points of modern phylogenetic thinking. As different semogeneses are incompletely congruent, the Tree of Life is less certain than each of the initial hypotheses. Any attempt to make it more resolved would lead to its reduction to any of the particular semogenetic scheme (i.e. to a "gene tree") or to its "decay" into several trees each corresponding to a particular consideration aspect of the global Tree. PMID:17338264

  5. Cirrhosis Classification Based on Texture Classification of Random Features

    PubMed Central

    Shao, Ying; Guo, Dongmei; Zheng, Yuanjie; Zhao, Zuowei; Qiu, Tianshuang

    2014-01-01

    Accurate staging of hepatic cirrhosis is important in investigating the cause and slowing down the effects of cirrhosis. Computer-aided diagnosis (CAD) can provide doctors with an alternative second opinion and assist them to make a specific treatment with accurate cirrhosis stage. MRI has many advantages, including high resolution for soft tissue, no radiation, and multiparameters imaging modalities. So in this paper, multisequences MRIs, including T1-weighted, T2-weighted, arterial, portal venous, and equilibrium phase, are applied. However, CAD does not meet the clinical needs of cirrhosis and few researchers are concerned with it at present. Cirrhosis is characterized by the presence of widespread fibrosis and regenerative nodules in the hepatic, leading to different texture patterns of different stages. So, extracting texture feature is the primary task. Compared with typical gray level cooccurrence matrix (GLCM) features, texture classification from random features provides an effective way, and we adopt it and propose CCTCRF for triple classification (normal, early, and middle and advanced stage). CCTCRF does not need strong assumptions except the sparse character of image, contains sufficient texture information, includes concise and effective process, and makes case decision with high accuracy. Experimental results also illustrate the satisfying performance and they are also compared with typical NN with GLCM. PMID:24707317

  6. Cirrhosis classification based on texture classification of random features.

    PubMed

    Liu, Hui; Shao, Ying; Guo, Dongmei; Zheng, Yuanjie; Zhao, Zuowei; Qiu, Tianshuang

    2014-01-01

    Accurate staging of hepatic cirrhosis is important in investigating the cause and slowing down the effects of cirrhosis. Computer-aided diagnosis (CAD) can provide doctors with an alternative second opinion and assist them to make a specific treatment with accurate cirrhosis stage. MRI has many advantages, including high resolution for soft tissue, no radiation, and multiparameters imaging modalities. So in this paper, multisequences MRIs, including T1-weighted, T2-weighted, arterial, portal venous, and equilibrium phase, are applied. However, CAD does not meet the clinical needs of cirrhosis and few researchers are concerned with it at present. Cirrhosis is characterized by the presence of widespread fibrosis and regenerative nodules in the hepatic, leading to different texture patterns of different stages. So, extracting texture feature is the primary task. Compared with typical gray level cooccurrence matrix (GLCM) features, texture classification from random features provides an effective way, and we adopt it and propose CCTCRF for triple classification (normal, early, and middle and advanced stage). CCTCRF does not need strong assumptions except the sparse character of image, contains sufficient texture information, includes concise and effective process, and makes case decision with high accuracy. Experimental results also illustrate the satisfying performance and they are also compared with typical NN with GLCM. PMID:24707317

  7. More accurate time from the Heathkit most accurate clock

    NASA Astrophysics Data System (ADS)

    Tollefson, M. V.; Bloomer, R. H., Jr.

    1986-05-01

    The addition of an auxiliary circuit to the Heathkit GC-1000 clock is described. The circuit, which consists of two integrated circuits, two resistors, and three capacitors, will supply a more accurate timing pulse to the computer. The circuit contains two input signals and produces one output; the inputs are multiplexed seven-digit displays (two digits for hour, minute, and second, and one for tenths of a second) and the output appears as a string of about 10 low-going pulses about 1.25 ms in duration. Low pass filters (R1 and C1) are utilized to eliminate extraneous pulses. The materials and procedures for attaching the circuit to the clock are examined. The software for the data set ready signal, and the method for accurate timing of data collection are discussed. The accuracy of the clock is evaluated and it is observed that the circuit improves the correct time provided by the clock from + or - 29 ms to + or - 5 ms.

  8. Cnidarian phylogenetic relationships as revealed by mitogenomics

    PubMed Central

    2013-01-01

    Background Cnidaria (corals, sea anemones, hydroids, jellyfish) is a phylum of relatively simple aquatic animals characterized by the presence of the cnidocyst: a cell containing a giant capsular organelle with an eversible tubule (cnida). Species within Cnidaria have life cycles that involve one or both of the two distinct body forms, a typically benthic polyp, which may or may not be colonial, and a typically pelagic mostly solitary medusa. The currently accepted taxonomic scheme subdivides Cnidaria into two main assemblages: Anthozoa (Hexacorallia + Octocorallia) – cnidarians with a reproductive polyp and the absence of a medusa stage – and Medusozoa (Cubozoa, Hydrozoa, Scyphozoa, Staurozoa) – cnidarians that usually possess a reproductive medusa stage. Hypothesized relationships among these taxa greatly impact interpretations of cnidarian character evolution. Results We expanded the sampling of cnidarian mitochondrial genomes, particularly from Medusozoa, to reevaluate phylogenetic relationships within Cnidaria. Our phylogenetic analyses based on a mitochogenomic dataset support many prior hypotheses, including monophyly of Hexacorallia, Octocorallia, Medusozoa, Cubozoa, Staurozoa, Hydrozoa, Carybdeida, Chirodropida, and Hydroidolina, but reject the monophyly of Anthozoa, indicating that the Octocorallia + Medusozoa relationship is not the result of sampling bias, as proposed earlier. Further, our analyses contradict Scyphozoa [Discomedusae + Coronatae], Acraspeda [Cubozoa + Scyphozoa], as well as the hypothesis that Staurozoa is the sister group to all the other medusozoans. Conclusions Cnidarian mitochondrial genomic data contain phylogenetic signal informative for understanding the evolutionary history of this phylum. Mitogenome-based phylogenies, which reject the monophyly of Anthozoa, provide further evidence for the polyp-first hypothesis. By rejecting the traditional Acraspeda and Scyphozoa hypotheses, these analyses suggest that the shared morphological characters in these groups are plesiomorphies, originated in the branch leading to Medusozoa. The expansion of mitogenomic data along with improvements in phylogenetic inference methods and use of additional nuclear markers will further enhance our understanding of the phylogenetic relationships and character evolution within Cnidaria. PMID:23302374

  9. Phyloclimatic modeling: combining phylogenetics and bioclimatic modeling.

    PubMed

    Yesson, C; Culham, A

    2006-10-01

    We investigate the impact of past climates on plant diversification by tracking the "footprint" of climate change on a phylogenetic tree. Diversity within the cosmopolitan carnivorous plant genus Drosera (Droseraceae) is focused within Mediterranean climate regions. We explore whether this diversity is temporally linked to Mediterranean-type climatic shifts of the mid-Miocene and whether climate preferences are conservative over phylogenetic timescales. Phyloclimatic modeling combines environmental niche (bioclimatic) modeling with phylogenetics in order to study evolutionary patterns in relation to climate change. We present the largest and most complete such example to date using Drosera. The bioclimatic models of extant species demonstrate clear phylogenetic patterns; this is particularly evident for the tuberous sundews from southwestern Australia (subgenus Ergaleium). We employ a method for establishing confidence intervals of node ages on a phylogeny using replicates from a Bayesian phylogenetic analysis. This chronogram shows that many clades, including subgenus Ergaleium and section Bryastrum, diversified during the establishment of the Mediterranean-type climate. Ancestral reconstructions of bioclimatic models demonstrate a pattern of preference for this climate type within these groups. Ancestral bioclimatic models are projected into palaeo-climate reconstructions for the time periods indicated by the chronogram. We present two such examples that each generate plausible estimates of ancestral lineage distribution, which are similar to their current distributions. This is the first study to attempt bioclimatic projections on evolutionary time scales. The sundews appear to have diversified in response to local climate development. Some groups are specialized for Mediterranean climates, others show wide-ranging generalism. This demonstrates that Phyloclimatic modeling could be repeated for other plant groups and is fundamental to the understanding of evolutionary responses to climate change. PMID:17060200

  10. High-accurate and noise-tolerant texture descriptor

    NASA Astrophysics Data System (ADS)

    Akoushideh, Alireza; Mazloom-Nezhad Maybodi, Babak

    2015-02-01

    In this paper, we extend pyramid transform domain approach on local binary pattern (PLBP) to make a high-accurate and noise-tolerant texture descriptor. We combine PLBP information of sub-band images, which are attained using wavelet transform, in different resolution and make some new descriptors. Multi-level and -resolution LBP(MPR_LBP), multi-level and -band LBP (MPB_LBP), and multi-level, -band and -resolution LBP (MPBR_LBP) are our proposed descriptors that are applied to unsupervised classification of texture images on Outex, UIUC, and Scene-13 data sets. Experimental results show that the proposed descriptors not only demonstrate acceptable texture classification accuracy with significantly lower feature length, but also they are more noise-robustness to a number of recent state-of-the-art LBP extensions.

  11. A new measure to study phylogenetic relations in the brown algal order Ectocarpales: the "codon impact parameter".

    PubMed

    Das, Smarajit; Chakrabarti, Jayprokas; Ghosh, Zhumur; Sahoo, Satyabrata; Mallick, Bibekanand

    2005-12-01

    We analyse forty-seven chloroplast genes of the large subunit of RuBisCO, from the algal order Ectocarpales, sourced from GenBank. Codon-usage weighted by the nucleotide base-bias defines our score called the codon-impact-parameter. This score is used to obtain phylogenetic relations amongst the 47 Ectocarpales. We compare our classification with the ones done earlier. PMID:16388144

  12. PANTHER: a browsable database of gene products organized by biological function, using curated protein family and subfamily classification.

    PubMed

    Thomas, Paul D; Kejariwal, Anish; Campbell, Michael J; Mi, Huaiyu; Diemer, Karen; Guo, Nan; Ladunga, Istvan; Ulitsky-Lazareva, Betty; Muruganujan, Anushya; Rabkin, Steven; Vandergriff, Jody A; Doremieux, Olivier

    2003-01-01

    The PANTHER database was designed for high-throughput analysis of protein sequences. One of the key features is a simplified ontology of protein function, which allows browsing of the database by biological functions. Biologist curators have associated the ontology terms with groups of protein sequences rather than individual sequences. Statistical models (Hidden Markov Models, or HMMs) are built from each of these groups. The advantage of this approach is that new sequences can be automatically classified as they become available. To ensure accurate functional classification, HMMs are constructed not only for families, but also for functionally distinct subfamilies. Multiple sequence alignments and phylogenetic trees, including curator-assigned information, are available for each family. The current version of the PANTHER database includes training sequences from all organisms in the GenBank non-redundant protein database, and the HMMs have been used to classify gene products across the entire genomes of human, and Drosophila melanogaster. The ontology terms and protein families and subfamilies, as well as Drosophila gene c;assifications, can be browsed and searched for free. Due to outstanding contractual obligations, access to human gene classifications and to protein family trees and multiple sequence alignments will temporarily require a nominal registration fee. PANTHER is publicly available on the web at http://panther.celera.com. PMID:12520017

  13. Analysis of the peroxiredoxin family: using active site structure and sequence information for global classification and residue analysis

    PubMed Central

    Nelson, Kimberly J.; Knutson, Stacy T.; Soito, Laura; Klomsiri, Chananat; Poole, Leslie B.; Fetrow, Jacquelyn S.

    2010-01-01

    Peroxiredoxins (Prxs) are a widespread and highly expressed family of cysteine-based peroxidases that react very rapidly with H2O2, organic peroxides, and peroxynitrite. Correct subfamily classification has been problematic since Prx subfamilies are frequently not correlated with phylogenetic distribution and diverge in their preferred reductant, oligomerization state, and tendency towards overoxidation. We have developed a method that uses the Deacon Active Site Profiler (DASP) tool to extract functional site profiles from structurally characterized proteins, to computationally define subfamilies, and to identify new Prx subfamily members from GenBank(nr). For the 58 literature-defined Prx test proteins, 57 were correctly assigned and none were assigned to the incorrect subfamily. The >3500 putative Prx sequences identified were then used to analyze residue conservation in the active site of each Prx subfamily. Our results indicate that the existence and location of the resolving cysteine varies in some subfamilies (e.g. Prx5) to a greater degree than previously appreciated and that interactions at the A interface (common to Prx5, Tpx and higher order AhpC/Prx1 structures) are important for stabilization of the correct active site geometry. Interestingly, this method also allows us to further divide the AhpC/Prx1 into four groups that are correlated with functional characteristics. The DASP method provides more accurate subfamily classification than PSI-BLAST for members of the Prx family and can now readily be applied to other large protein families. PMID:21287625

  14. Identification and Classification of Rhizobia by Matrix-Assisted Laser Desorption/Ionization Time-Of-Flight Mass Spectrometry

    PubMed Central

    Jia, Rui Zong; Zhang, Rong Juan; Wei, Qing; Chen, Wen Feng; Cho, Il Kyu; Chen, Wen Xin; Li, Qing X

    2015-01-01

    Mass spectrometry (MS) has been widely used for specific, sensitive and rapid analysis of proteins and has shown a high potential for bacterial identification and characterization. Type strains of four species of rhizobia and Escherichia coli DH5α were employed as reference bacteria to optimize various parameters for identification and classification of species of rhizobia by matrix-assisted laser desorption/ionization time-of-flight MS (MALDI TOF MS). The parameters optimized included culture medium states (liquid or solid), bacterial growth phases, colony storage temperature and duration, and protein data processing to enhance the bacterial identification resolution, accuracy and reliability. The medium state had little effects on the mass spectra of protein profiles. A suitable sampling time was between the exponential phase and the stationary phase. Consistent protein mass spectral profiles were observed for E. coli colonies pre-grown for 14 days and rhizobia for 21 days at 4°C or 21°C. A dendrogram of 75 rhizobial strains of 4 genera was constructed based on MALDI TOF mass spectra and the topological patterns agreed well with those in the 16S rDNA phylogenetic tree. The potential of developing a mass spectral database for all rhizobia species was assessed with blind samples. The entire process from sample preparation to accurate identification and classification of species required approximately one hour. PMID:26500417

  15. Draft genome sequences for the obligate bacterial predators Bacteriovorax spp. of four phylogenetic clusters

    PubMed Central

    2015-01-01

    Bacteriovorax is the halophilic genus of the obligate bacterial predators, Bdellovibrio and like organisms. The predators are known for their unique biphasic life style in which they search for and attack their prey in the free living phase; penetrate, grow, multiply and lyse the prey in the intraperiplasmic phase. Bacteriovorax isolates representing four phylogenetic clusters were selected for genomic sequencing. Only one type strain genome has been published so far from the genus Bacteriovorax. We report the genomes from non-type strains isolated from aquatic environments. Here we describe and compare the genomic features of the four strains, together with the classification and annotation. PMID:26203326

  16. A User's Guide to a Data Base of the Diversity of Pseudomonas syringae and Its Application to Classifying Strains in This Phylogenetic Complex

    PubMed Central

    Berge, Odile; Monteil, Caroline L.; Bartoli, Claudia; Chandeysson, Charlotte; Guilbaud, Caroline; Sands, David C.; Morris, Cindy E.

    2014-01-01

    The Pseudomonas syringae complex is composed of numerous genetic lineages of strains from both agricultural and environmental habitats including habitats closely linked to the water cycle. The new insights from the discovery of this bacterial species in habitats outside of agricultural contexts per se have led to the revelation of a wide diversity of strains in this complex beyond what was known from agricultural contexts. Here, through Multi Locus Sequence Typing (MLST) of 216 strains, we identified 23 clades within 13 phylogroups among which the seven previously described P. syringae phylogroups were included. The phylogeny of the core genome of 29 strains representing nine phylogroups was similar to the phylogeny obtained with MLST thereby confirming the robustness of MLST-phylogroups. We show that phenotypic traits rarely provide a satisfactory means for classification of strains even if some combinations are highly probable in some phylogroups. We demonstrate that the citrate synthase (cts) housekeeping gene can accurately predict the phylogenetic affiliation for more than 97% of strains tested. We propose a list of cts sequences to be used as a simple tool for quickly and precisely classifying new strains. Finally, our analysis leads to predictions about the diversity of P. syringae that is yet to be discovered. We present here an expandable framework mainly based on cts genetic analysis into which more diversity can be integrated. PMID:25184292

  17. Hydrometor classification from 2 dimensional videodisdrometer data

    NASA Astrophysics Data System (ADS)

    Grazioli, J.; Tuia, D.; Monhart, S.; Schneebeli, M.; Raupach, T.; Berne, A.

    2014-02-01

    This paper presents a hydrometeor classification technique based on two-dimensional video disdrometer (2DVD) data. The method provides an estimate of the dominant hydrometeor type falling over time intervals of 60 s during precipitation, using as input the statistical behavior of a set of particle descriptors, calculated for each particle image. The employed supervised algorithm is a support vector machine (SVM), trained over precipitation time steps labeled by visual inspection. In this way, 8 dominant hydrometeor classes could be discriminated. The algorithm achieves accurate classification performances, with median overall accuracies (Cohen's K) of 90% (0.88), and with accuracies higher than 84% for each hydrometeor class.

  18. Post-Mortem Evaluation of Amyloid-Dopamine Terminal PET Dementia Classifications

    PubMed Central

    Albin, Roger L.; Fisher-Hubbard, Amanda; Shanmugasundaram, Krithika; Koeppe, Robert A.; Burke, James F.; Camelo-Piragua, Sandra; Lieberman, Andrew P.; Giordani, Bruno; Frey, Kirk A.

    2016-01-01

    Clinical classification of early dementia and mild cognitive impairment is imprecise. We reported previously that molecular imaging classification of early dementia and mild cognitive impairment with dual amyloid and dopamine terminal positron emission tomography differs significantly from expert clinical classification. We now report pathologic diagnoses in a substantial subset of our previously imaged subjects. Among 36 subjects coming to autopsy, imaging classifications and pathologic diagnosis were concordant in 33 cases (κ=0.85). This approach enhanced specificity of Alzheimer disease diagnosis. The strong concordance of imaging based classifications and pathologic diagnoses suggests that this imaging approach will be useful in establishing more accurate and convenient classification biomarkers for dementia research. PMID:26183692

  19. MOLECULAR PHYLOGENETIC RELATIONSHIPS AMONG DIABROTICA SPECIES (ACCESSION NO. AF195197)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Corn rootworms of the genus Diabrotica (Coleoptera: Chrysomelidae) are the most serious pest of corn in midwestern United States. Despite their economic importance, phylogenetic relationships within the genus remain unclear. Phylogenetic analysis of five Diabrotica was undertaken using DNA sequences...

  20. MOLECULAR PHYLOGENETIC RELATIONSHIPS AMONG DIABROTICA SPECIES (ACCESSION NO. AF195202)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Corn rootworms of the genus Diabrotica (Coleoptera: Chrysomelidae) are the most serious pest of corn in midwestern United States. Despite their economic importance, phylogenetic relationships within the genus remain unclear. Phylogenetic analysis of five Diabrotica was undertaken using DNA sequences...

  1. MOLECULAR PHYLOGENETIC RELATIONSHIPS AMONG DIABROTICA SPECIES (ACCESSION NO. AF195199)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Corn rootworms of the genus Diabrotica (Coleoptera: Chrysomelidae) are the most serious pest of corn in midwestern United States. Despite their economic importance, phylogenetic relationships within the genus remain unclear. Phylogenetic analysis of five Diabrotica was undertaken using DNA sequences...

  2. MOLECULAR PHYLOGENETIC RELATIONSHIPS AMONG DIABROTICA SPECIES (ACCESSION NO. AF195195)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Corn rootworms of the genus Diabrotica (Coleoptera: Chrysomelidae) are the most serious pest of corn in midwestern United States. Despite their economic importance, phylogenetic relationships within the genus remain unclear. Phylogenetic analysis of five Diabrotica was undertaken using DNA sequences...

  3. MOLECULAR PHYLOGENETIC RELATIONSHIPS AMONG DIABROTICA SPECIES (ACCESSION NO. AF195198)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Corn rootworms of the genus Diabrotica (Coleoptera: Chrysomelidae) are the most serious pest of corn in midwestern United States. Despite their economic importance, phylogenetic relationships within the genus remain unclear. Phylogenetic analysis of five Diabrotica was undertaken using DNA sequences...

  4. MOLECULAR PHYLOGENETIC RELATIONSHIPS AMONG DIABROTICA SPECIES (ACCESSION NO. AF195196)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Corn rootworms of the genus Diabrotica (Coleoptera: Chrysomelidae) are the most serious pest of corn in midwestern United States. Despite their economic importance, phylogenetic relationships within the genus remain unclear. Phylogenetic analysis of five Diabrotica was undertaken using DNA sequences...

  5. Automatic classification of blank substrate defects

    NASA Astrophysics Data System (ADS)

    Boettiger, Tom; Buck, Peter; Paninjath, Sankaranarayanan; Pereira, Mark; Ronald, Rob; Rost, Dan; Samir, Bhamidipati

    2014-10-01

    Mask preparation stages are crucial in mask manufacturing, since this mask is to later act as a template for considerable number of dies on wafer. Defects on the initial blank substrate, and subsequent cleaned and coated substrates, can have a profound impact on the usability of the finished mask. This emphasizes the need for early and accurate identification of blank substrate defects and the risk they pose to the patterned reticle. While Automatic Defect Classification (ADC) is a well-developed technology for inspection and analysis of defects on patterned wafers and masks in the semiconductors industry, ADC for mask blanks is still in the early stages of adoption and development. Calibre ADC is a powerful analysis tool for fast, accurate, consistent and automatic classification of defects on mask blanks. Accurate, automated classification of mask blanks leads to better usability of blanks by enabling defect avoidance technologies during mask writing. Detailed information on blank defects can help to select appropriate job-decks to be written on the mask by defect avoidance tools [1][4][5]. Smart algorithms separate critical defects from the potentially large number of non-critical defects or false defects detected at various stages during mask blank preparation. Mechanisms used by Calibre ADC to identify and characterize defects include defect location and size, signal polarity (dark, bright) in both transmitted and reflected review images, distinguishing defect signals from background noise in defect images. The Calibre ADC engine then uses a decision tree to translate this information into a defect classification code. Using this automated process improves classification accuracy, repeatability and speed, while avoiding the subjectivity of human judgment compared to the alternative of manual defect classification by trained personnel [2]. This paper focuses on the results from the evaluation of Automatic Defect Classification (ADC) product at MP Mask Technology Center (MPMask). The Calibre ADC tool was qualified on production mask blanks against the manual classification. The classification accuracy of ADC is greater than 95% for critical defects with an overall accuracy of 90%. The sensitivity to weak defect signals and locating the defect in the images is a challenge we are resolving. The performance of the tool has been demonstrated on multiple mask types and is ready for deployment in full volume mask manufacturing production flow. Implementation of Calibre ADC is estimated to reduce the misclassification of critical defects by 60-80%.

  6. Phylogenetic analyses of Vitis (Vitaceae) based on complete chloroplast genome sequences: effects of taxon sampling and phylogenetic methods on resolving relationships among rosids

    PubMed Central

    Jansen, Robert K; Kaittanis, Charalambos; Saski, Christopher; Lee, Seung-Bum; Tomkins, Jeffrey; Alverson, Andrew J; Daniell, Henry

    2006-01-01

    Background The Vitaceae (grape) is an economically important family of angiosperms whose phylogenetic placement is currently unresolved. Recent phylogenetic analyses based on one to several genes have suggested several alternative placements of this family, including sister to Caryophyllales, asterids, Saxifragales, Dilleniaceae or to rest of rosids, though support for these different results has been weak. There has been a recent interest in using complete chloroplast genome sequences for resolving phylogenetic relationships among angiosperms. These studies have clarified relationships among several major lineages but they have also emphasized the importance of taxon sampling and the effects of different phylogenetic methods for obtaining accurate phylogenies. We sequenced the complete chloroplast genome of Vitis vinifera and used these data to assess relationships among 27 angiosperms, including nine taxa of rosids. Results The Vitis vinifera chloroplast genome is 160,928 bp in length, including a pair of inverted repeats of 26,358 bp that are separated by small and large single copy regions of 19,065 bp and 89,147 bp, respectively. The gene content and order of Vitis is identical to many other unrearranged angiosperm chloroplast genomes, including tobacco. Phylogenetic analyses using maximum parsimony and maximum likelihood were performed on DNA sequences of 61 protein-coding genes for two datasets with 28 or 29 taxa, including eight or nine taxa from four of the seven currently recognized major clades of rosids. Parsimony and likelihood phylogenies of both data sets provide strong support for the placement of Vitaceae as sister to the remaining rosids. However, the position of the Myrtales and support for the monophyly of the eurosid I clade differs between the two data sets and the two methods of analysis. In parsimony analyses, the inclusion of Gossypium is necessary to obtain trees that support the monophyly of the eurosid I clade. However, maximum likelihood analyses place Cucumis as sister to the Myrtales and therefore do not support the monophyly of the eurosid I clade. Conclusion Phylogenies based on DNA sequences from complete chloroplast genome sequences provide strong support for the position of the Vitaceae as the earliest diverging lineage of rosids. Our phylogenetic analyses support recent assertions that inadequate taxon sampling and incorrect model specification for concatenated multi-gene data sets can mislead phylogenetic inferences when using whole chloroplast genomes for phylogeny reconstruction. PMID:16603088

  7. Accurate positioning based on acoustic and optical sensors

    NASA Astrophysics Data System (ADS)

    Cai, Kerong; Deng, Jiahao; Guo, Hualing

    2009-11-01

    Unattended laser target designator (ULTD) was designed to partly take the place of conventional LTDs for accurate positioning and laser marking. Analyzed the precision, accuracy and errors of acoustic sensor array, the requirements of laser generator, and the technology of image analysis and tracking, the major system modules were determined. The target's classification, velocity and position can be measured by sensors, and then coded laser beam will be emitted intelligently to mark the excellent position at the excellent time. The conclusion shows that, ULTD can not only avoid security threats, be deployed massively, and accomplish battle damage assessment (BDA), but also be fit for information-based warfare.

  8. [It is normal for classification approaches to be diverse].

    PubMed

    Pavlinov, I Ia

    2003-01-01

    It is asserted that the postmodern concept of science, unlike the classical ideal, presumes necessary existence of various classification approaches (schools) in taxonomy, each corresponding to a particular aspect of consideration of the "taxic reality". They are set up by diversity of initial epistemological and ontological backgrounds which fix in a certain way a) fragments of that reality allowable for investigation, and b) allowable methods of exploration of the fragments being fixed. It makes it possible to define a taxonomic school as a unity of the above backgrounds together with consideration aspect delimited by them. Two extreme positions of these backgrounds could be recognized in recent taxonomic thought. One of them follows the scholastic tradition of elaboration of a formal and, hence, universal classificatory method ("new typology", numerical phenetics, pattern cladistics). Another one asserts dependence of classificatory approach on the judgment of the nature of taxic reality (natural philosophy, evolutionary schools of taxonomy). Some arguments are put forward in favor of significant impact of evolutionary thinking onto the theory of modern taxonomy. This impact is manifested by the correspondence principle which makes classificatory algorithms (and hence resulting classifications) depending onto initial assumptions about causes of taxic diversity. It is asserted that criteria of "quality" of both classifications proper and classificatory methods can be correctly formulated within the framework of a particular consideration aspect only. For any group of organisms, several particular classifications are rightful to exist, each corresponding to a particular consideration aspect. These classifications could not be arranged along the "better-worse" scale, as they reflect different fragments of the taxic reality. Their mutual interpretation depends on degree of compatibility of background assumptions and of the tasks being resolved. Extensionally, classifications are compatible as much as they coincide by context and hierarchical structure of included taxa. Intentionally, typological classifications are compatible if included taxa are comparable by their diagnoses, while phylogenetic classifications are compatible if the included taxa are ascribed monophyletic status. A brief consideration is given to the "new phylogenetics" (= "genophyletics") as to a classificatory approach aimed at elaboration of parsimonious phylogenetic hypotheses based on molecular biology data and employing numerical methods of cladistic analysis. This approach is shown to borrows some phenetic ideas and revives scholastic principle of unified classificatory basis. It is supposed that, in a time, biological classification would get escaping from plethora of positivistic ideas (including those being developed by nowaday cladistics) and would assimilate (revive) more actively holistic worldview. PMID:14524225

  9. An Integrated Perspective on Phylogenetic Workflows.

    PubMed

    Guang, August; Zapata, Felipe; Howison, Mark; Lawrence, Charles E; Dunn, Casey W

    2016-02-01

    Molecular phylogenetics is the study of evolutionary relationships between biological sequences, often to infer the evolutionary relationships of organisms. These studies require many analysis components, including sequence assembly, identification of homologous sequences, gene tree inference, and species tree inference. At present, each component is usually treated as a single step in a linear analysis, where the output of each component is passed as input to the next as a point estimate. Here we outline a generative model that helps clarify assumptions that are implicit to phylogenetic workflows, focusing on the assumption of low relative entropy. This perspective unifies currently disparate advances, and will help investigators evaluate which steps would benefit the most from additional computation and future methods development. PMID:26775796

  10. Choosing and Using Introns in Molecular Phylogenetics

    PubMed Central

    Creer, Simon

    2007-01-01

    Introns are now commonly used in molecular phylogenetics in an attempt to recover gene trees that are concordant with species trees, but there are a range of genomic, logistical and analytical considerations that are infrequently discussed in empirical studies that utilize intron data. This review outlines expedient approaches for locus selection, overcoming paralogy problems, recombination detection methods and the identification and incorporation of LVHs in molecular systematics. A range of parsimony and Bayesian analytical approaches are also described in order to highlight the methods that can currently be employed to align sequences and treat indels in subsequent analyses. By covering the main points associated with the generation and analysis of intron data, this review aims to provide a comprehensive introduction to using introns (or any non-coding nuclear data partition) in contemporary phylogenetics. PMID:19461984

  11. Morphological Phylogenetics in the Genomic Age.

    PubMed

    Lee, Michael S Y; Palci, Alessandro

    2015-10-01

    Evolutionary trees underpin virtually all of biology, and the wealth of new genomic data has enabled us to reconstruct them with increasing detail and confidence. While phenotypic (typically morphological) traits are becoming less important in reconstructing evolutionary trees, they still serve vital and unique roles in phylogenetics, even for living taxa for which vast amounts of genetic information are available. Morphology remains a powerful independent source of evidence for testing molecular clades, and - through fossil phenotypes - the primary means for time-scaling phylogenies. Morphological phylogenetics is therefore vital for transforming undated molecular topologies into dated evolutionary trees. However, if morphology is to be employed to its full potential, biologists need to start scrutinising phenotypes in a more objective fashion, models of phenotypic evolution need to be improved, and approaches for analysing phenotypic traits and fossils together with genomic data need to be refined. PMID:26439355

  12. On numerically accurate finite element

    NASA Technical Reports Server (NTRS)

    Nagtegaal, J. C.; Parks, D. M.; Rice, J. R.

    1974-01-01

    A general criterion for testing a mesh with topologically similar repeat units is given, and the analysis shows that only a few conventional element types and arrangements are, or can be made suitable for computations in the fully plastic range. Further, a new variational principle, which can easily and simply be incorporated into an existing finite element program, is presented. This allows accurate computations to be made even for element designs that would not normally be suitable. Numerical results are given for three plane strain problems, namely pure bending of a beam, a thick-walled tube under pressure, and a deep double edge cracked tensile specimen. The effects of various element designs and of the new variational procedure are illustrated. Elastic-plastic computation at finite strain are discussed.

  13. Long-Branch Attraction Bias and Inconsistency in Bayesian Phylogenetics

    PubMed Central

    Kolaczkowski, Bryan; Thornton, Joseph W.

    2009-01-01

    Bayesian inference (BI) of phylogenetic relationships uses the same probabilistic models of evolution as its precursor maximum likelihood (ML), so BI has generally been assumed to share ML's desirable statistical properties, such as largely unbiased inference of topology given an accurate model and increasingly reliable inferences as the amount of data increases. Here we show that BI, unlike ML, is biased in favor of topologies that group long branches together, even when the true model and prior distributions of evolutionary parameters over a group of phylogenies are known. Using experimental simulation studies and numerical and mathematical analyses, we show that this bias becomes more severe as more data are analyzed, causing BI to infer an incorrect tree as the maximum a posteriori phylogeny with asymptotically high support as sequence length approaches infinity. BI's long branch attraction bias is relatively weak when the true model is simple but becomes pronounced when sequence sites evolve heterogeneously, even when this complexity is incorporated in the model. This bias—which is apparent under both controlled simulation conditions and in analyses of empirical sequence data—also makes BI less efficient and less robust to the use of an incorrect evolutionary model than ML. Surprisingly, BI's bias is caused by one of the method's stated advantages—that it incorporates uncertainty about branch lengths by integrating over a distribution of possible values instead of estimating them from the data, as ML does. Our findings suggest that trees inferred using BI should be interpreted with caution and that ML may be a more reliable framework for modern phylogenetic analysis. PMID:20011052

  14. Accurate thickness measurement of graphene.

    PubMed

    Shearer, Cameron J; Slattery, Ashley D; Stapleton, Andrew J; Shapter, Joseph G; Gibson, Christopher T

    2016-03-29

    Graphene has emerged as a material with a vast variety of applications. The electronic, optical and mechanical properties of graphene are strongly influenced by the number of layers present in a sample. As a result, the dimensional characterization of graphene films is crucial, especially with the continued development of new synthesis methods and applications. A number of techniques exist to determine the thickness of graphene films including optical contrast, Raman scattering and scanning probe microscopy techniques. Atomic force microscopy (AFM), in particular, is used extensively since it provides three-dimensional images that enable the measurement of the lateral dimensions of graphene films as well as the thickness, and by extension the number of layers present. However, in the literature AFM has proven to be inaccurate with a wide range of measured values for single layer graphene thickness reported (between 0.4 and 1.7 nm). This discrepancy has been attributed to tip-surface interactions, image feedback settings and surface chemistry. In this work, we use standard and carbon nanotube modified AFM probes and a relatively new AFM imaging mode known as PeakForce tapping mode to establish a protocol that will allow users to accurately determine the thickness of graphene films. In particular, the error in measuring the first layer is reduced from 0.1-1.3 nm to 0.1-0.3 nm. Furthermore, in the process we establish that the graphene-substrate adsorbate layer and imaging force, in particular the pressure the tip exerts on the surface, are crucial components in the accurate measurement of graphene using AFM. These findings can be applied to other 2D materials. PMID:26894444

  15. Accurate thickness measurement of graphene

    NASA Astrophysics Data System (ADS)

    Shearer, Cameron J.; Slattery, Ashley D.; Stapleton, Andrew J.; Shapter, Joseph G.; Gibson, Christopher T.

    2016-03-01

    Graphene has emerged as a material with a vast variety of applications. The electronic, optical and mechanical properties of graphene are strongly influenced by the number of layers present in a sample. As a result, the dimensional characterization of graphene films is crucial, especially with the continued development of new synthesis methods and applications. A number of techniques exist to determine the thickness of graphene films including optical contrast, Raman scattering and scanning probe microscopy techniques. Atomic force microscopy (AFM), in particular, is used extensively since it provides three-dimensional images that enable the measurement of the lateral dimensions of graphene films as well as the thickness, and by extension the number of layers present. However, in the literature AFM has proven to be inaccurate with a wide range of measured values for single layer graphene thickness reported (between 0.4 and 1.7 nm). This discrepancy has been attributed to tip-surface interactions, image feedback settings and surface chemistry. In this work, we use standard and carbon nanotube modified AFM probes and a relatively new AFM imaging mode known as PeakForce tapping mode to establish a protocol that will allow users to accurately determine the thickness of graphene films. In particular, the error in measuring the first layer is reduced from 0.1-1.3 nm to 0.1-0.3 nm. Furthermore, in the process we establish that the graphene-substrate adsorbate layer and imaging force, in particular the pressure the tip exerts on the surface, are crucial components in the accurate measurement of graphene using AFM. These findings can be applied to other 2D materials.

  16. Recursive heuristic classification

    NASA Technical Reports Server (NTRS)

    Wilkins, David C.

    1994-01-01

    The author will describe a new problem-solving approach called recursive heuristic classification, whereby a subproblem of heuristic classification is itself formulated and solved by heuristic classification. This allows the construction of more knowledge-intensive classification programs in a way that yields a clean organization. Further, standard knowledge acquisition and learning techniques for heuristic classification can be used to create, refine, and maintain the knowledge base associated with the recursively called classification expert system. The method of recursive heuristic classification was used in the Minerva blackboard shell for heuristic classification. Minerva recursively calls itself every problem-solving cycle to solve the important blackboard scheduler task, which involves assigning a desirability rating to alternative problem-solving actions. Knowing these ratings is critical to the use of an expert system as a component of a critiquing or apprenticeship tutoring system. One innovation of this research is a method called dynamic heuristic classification, which allows selection among dynamically generated classification categories instead of requiring them to be prenumerated.

  17. Security classification of information

    SciTech Connect

    Quist, A.S.

    1993-04-01

    This document is the second of a planned four-volume work that comprehensively discusses the security classification of information. The main focus of Volume 2 is on the principles for classification of information. Included herein are descriptions of the two major types of information that governments classify for national security reasons (subjective and objective information), guidance to use when determining whether information under consideration for classification is controlled by the government (a necessary requirement for classification to be effective), information disclosure risks and benefits (the benefits and costs of classification), standards to use when balancing information disclosure risks and benefits, guidance for assigning classification levels (Top Secret, Secret, or Confidential) to classified information, guidance for determining how long information should be classified (classification duration), classification of associations of information, classification of compilations of information, and principles for declassifying and downgrading information. Rules or principles of certain areas of our legal system (e.g., trade secret law) are sometimes mentioned to .provide added support to some of those classification principles.

  18. A phylogenetic analysis of Aquifex pyrophilus

    NASA Technical Reports Server (NTRS)

    Burggraf, S.; Olsen, G. J.; Stetter, K. O.; Woese, C. R.

    1992-01-01

    The 16S rRNA of the bacterion Aquifex pyrophilus, a microaerophilic, oxygen-reducing hyperthermophile, has been sequenced directly from the the PCR amplified gene. Phylogenetic analyses show the Aq. pyrophilus lineage to be probably the deepest (earliest) in the (eu)bacterial tree. The addition of this deep branching to the bacterial tree further supports the argument that the Bacteria are of thermophilic ancestry.

  19. Phylogenetic diversity of termite gut spirochaetes.

    PubMed

    Lilburn, T G; Schmidt, T M; Breznak, J A

    1999-08-01

    A molecular phylogenetic analysis was done of not-yet-cultured spirochaetes inhabiting the gut of the termite, Reticulitermes flavipes (Kollar). Ninety-eight clones of near-full-length spirochaetal 16S rDNA genes were classified by ARDRA pattern and by partial sequencing. All clones grouped within the genus Treponema, and at least 21 new species of Treponema were recognized within R. flavipes alone. Analysis of 190 additional clones from guts of Coptotermes formosanus Shiraki and Zootermopsis angusticollis (Hagen), as well as published data on clones from Cryptotermes domesticus (Haviland), Mastotermes darwiniensis Froggatt, Nasutitermes lujae (Wasmann) and Reticulitermes speratus(Kolbe), revealed a similar level of novel treponemal phylogenetic diversity in these representatives of five of the seven termite families. None of the clones was closely related (i.e. all bore < or = 91% sequence similarity) to any previously recognized treponeme. The data also revealed the existence of two major phylogenetic groups of treponemes: one containing all of the currently known isolates of Treponema and a large number of phylotypes from the human gingival crevice, but only a minority of the termite gut spirochaete clones; another containing the majority of termite spirochaete clones and two Spirochaeta (S. caldaria and S. stenostrepta), which, although free living, group within the genus Treponema on the basis of 16S rRNA sequence. Signature nucleotides that almost perfectly distinguished the latter group, herein referred to as the 'termite cluster', occurred at the following (E. coli numbering) positions: 289-G x C-311; A at 812; and an inserted nucleotide at 1273. The emerging picture is that the long-recognized and striking morphological diversity of termite gut spirochaetes is paralleled by their phylogenetic diversity and may reflect substantial physiological diversity as well. PMID:11207751

  20. The Independent Evolution Method Is Not a Viable Phylogenetic Comparative Method

    PubMed Central

    2015-01-01

    Phylogenetic comparative methods (PCMs) use data on species traits and phylogenetic relationships to shed light on evolutionary questions. Recently, Smaers and Vinicius suggested a new PCM, Independent Evolution (IE), which purportedly employs a novel model of evolution based on Felsenstein’s Adaptive Peak Model. The authors found that IE improves upon previous PCMs by producing more accurate estimates of ancestral states, as well as separate estimates of evolutionary rates for each branch of a phylogenetic tree. Here, we document substantial theoretical and computational issues with IE. When data are simulated under a simple Brownian motion model of evolution, IE produces severely biased estimates of ancestral states and changes along individual branches. We show that these branch-specific changes are essentially ancestor-descendant or “directional” contrasts, and draw parallels between IE and previous PCMs such as “minimum evolution”. Additionally, while comparisons of branch-specific changes between variables have been interpreted as reflecting the relative strength of selection on those traits, we demonstrate through simulations that regressing IE estimated branch-specific changes against one another gives a biased estimate of the scaling relationship between these variables, and provides no advantages or insights beyond established PCMs such as phylogenetically independent contrasts. In light of our findings, we discuss the results of previous papers that employed IE. We conclude that Independent Evolution is not a viable PCM, and should not be used in comparative analyses. PMID:26683838

  1. Phylogenetic signal in amphibian sensitivity to copper sulfate relative to experimental temperature.

    PubMed

    Chiari, Ylenia; Glaberman, Scott; Serén, Nina; Carretero, Miguel A; Capellini, Isabella

    2015-04-01

    The release of large quantities of chemicals into the environment represents a major source of environmental disturbance. In recent years, the focus of ecotoxicology has shifted from describing the effects of chemical contaminants on individual species to developing more integrated approaches for predicting and evaluating long term effects of chemicals across species and ecosystems. Traditional ecotoxicology is typically based on data of sensitivity to a contaminant of a few surrogate species and often considers little variability in chemical sensitivity within and among taxonomic groups. This approach assumes that evolutionary history and phylogenetic relatedness among species have little or no impact on species' sensitivity to chemical compounds. Few studies have tested this assumption. Using phylogenetic comparative methods and published data for amphibians, we show that sensitivity to copper sulfate, a commonly used pesticide, exhibits a strong phylogenetic signal when controlling for experimental temperature. Our results indicate that evolutionary history needs to be accounted for to make accurate predictions of amphibian sensitivity to this contaminant under different temperature scenarios. Since physiological and metabolic traits showing high phylogenetic signal likely underlie variation in species sensitivity to chemical stressors, future studies should evaluate and predict species vulnerability to pollutants using evolutionarily informed approaches. PMID:26214907

  2. Marine turtle mitogenome phylogenetics and evolution.

    PubMed

    Duchene, Sebastián; Frey, Amy; Alfaro-Núñez, Alonzo; Dutton, Peter H; Thomas P Gilbert, M; Morin, Phillip A

    2012-10-01

    The sea turtles are a group of cretaceous origin containing seven recognized living species: leatherback, hawksbill, Kemp's ridley, olive ridley, loggerhead, green, and flatback. The leatherback is the single member of the Dermochelidae family, whereas all other sea turtles belong in Cheloniidae. Analyses of partial mitochondrial sequences and some nuclear markers have revealed phylogenetic inconsistencies within Cheloniidae, especially regarding the placement of the flatback. Population genetic studies based on D-Loop sequences have shown considerable structuring in species with broad geographic distributions, shedding light on complex migration patterns and possible geographic or climatic events as driving forces of sea-turtle distribution. We have sequenced complete mitogenomes for all sea-turtle species, including samples from their geographic range extremes, and performed phylogenetic analyses to assess sea-turtle evolution with a large molecular dataset. We found variation in the length of the ATP8 gene and a highly variable site in ND4 near a proton translocation channel in the resulting protein. Complete mitogenomes show strong support and resolution for phylogenetic relationships among all sea turtles, and reveal phylogeographic patterns within globally-distributed species. Although there was clear concordance between phylogenies and geographic origin of samples in most taxa, we found evidence of more recent dispersal events in the loggerhead and olive ridley turtles, suggesting more recent migrations (<1 Myr) in these species. Overall, our results demonstrate the complexity of sea-turtle diversity, and indicate the need for further research in phylogeography and molecular evolution. PMID:22750111

  3. DNA barcoding and phylogenetic relationships in Anatidae.

    PubMed

    Huang, Zuhao; Yang, Chengzhong; Ke, Dianhua

    2016-01-01

    Mitochondrial cytochrome c oxidase subunit I (COI) has been used as a powerful marker in a variety of phylogenetic studies. According to studies of bird species, the 694-bp sequence of the mitochondrial gene encoding COI is extremely useful for species identification and phylogeny. In the present study, we analyzed the COI barcodes of 79 species from 26 genera belonging to the Anatidae family. Sixty-six species (83.54%) of the species were identified correctly from their DNA barcodes. The remaining 13 species shared barcodes sequences with closely related species. Kimura two-parameter (K2P) distances were calculated between barcodes. The average genetic distance between species was 41 times higher compared to the average genetic distance within species. Neighbor-joining method was used to construct a phylogenetic tree, which grouped all of the genera into three divergent clades. Dendrocygna and Nomonyx + Oxyura were identified as early offshoots of the Anatidae. All the remaining taxa fell into two clades that correspond to the two subfamilies Anserinae and Anatiane. Based on our results, DNA barcoding is an effective molecular tool for Anatidae species identification and phylogenetic inference. PMID:24938090

  4. Consequences of recombination on traditional phylogenetic analysis.

    PubMed Central

    Schierup, M H; Hein, J

    2000-01-01

    We investigate the shape of a phylogenetic tree reconstructed from sequences evolving under the coalescent with recombination. The motivation is that evolutionary inferences are often made from phylogenetic trees reconstructed from population data even though recombination may well occur (mtDNA or viral sequences) or does occur (nuclear sequences). We investigate the size and direction of biases when a single tree is reconstructed ignoring recombination. Standard software (PHYLIP) was used to construct the best phylogenetic tree from sequences simulated under the coalescent with recombination. With recombination present, the length of terminal branches and the total branch length are larger, and the time to the most recent common ancestor smaller, than for a tree reconstructed from sequences evolving with no recombination. The effects are pronounced even for small levels of recombination that may not be immediately detectable in a data set. The phylogenies when recombination is present superficially resemble phylogenies for sequences from an exponentially growing population. However, exponential growth has a different effect on statistics such as Tajima's D. Furthermore, ignoring recombination leads to a large overestimation of the substitution rate heterogeneity and the loss of the molecular clock. These results are discussed in relation to viral and mtDNA data sets. PMID:11014833

  5. The phylogenetic affinities of the extinct glyptodonts.

    PubMed

    Delsuc, Frédéric; Gibb, Gillian C; Kuch, Melanie; Billet, Guillaume; Hautier, Lionel; Southon, John; Rouillard, Jean-Marie; Fernicola, Juan Carlos; Vizcaíno, Sergio F; MacPhee, Ross D E; Poinar, Hendrik N

    2016-02-22

    Among the fossils of hitherto unknown mammals that Darwin collected in South America between 1832 and 1833 during the Beagle expedition [1] were examples of the large, heavily armored herbivores later known as glyptodonts. Ever since, glyptodonts have fascinated evolutionary biologists because of their remarkable skeletal adaptations and seemingly isolated phylogenetic position even within their natural group, the cingulate xenarthrans (armadillos and their allies [2]). In possessing a carapace comprised of fused osteoderms, the glyptodonts were clearly related to other cingulates, but their precise phylogenetic position as suggested by morphology remains unresolved [3,4]. To provide a molecular perspective on this issue, we designed sequence-capture baits using in silico reconstructed ancestral sequences and successfully assembled the complete mitochondrial genome of Doedicurus sp., one of the largest glyptodonts. Our phylogenetic reconstructions establish that glyptodonts are in fact deeply nested within the armadillo crown-group, representing a distinct subfamily (Glyptodontinae) within family Chlamyphoridae [5]. Molecular dating suggests that glyptodonts diverged no earlier than around 35 million years ago, in good agreement with their fossil record. Our results highlight the derived nature of the glyptodont morphotype, one aspect of which is a spectacular increase in body size until their extinction at the end of the last ice age. PMID:26906483

  6. A Consistent Phylogenetic Backbone for the Fungi

    PubMed Central

    Ebersberger, Ingo; de Matos Simoes, Ricardo; Kupczok, Anne; Gube, Matthias; Kothe, Erika; Voigt, Kerstin; von Haeseler, Arndt

    2012-01-01

    The kingdom of fungi provides model organisms for biotechnology, cell biology, genetics, and life sciences in general. Only when their phylogenetic relationships are stably resolved, can individual results from fungal research be integrated into a holistic picture of biology. However, and despite recent progress, many deep relationships within the fungi remain unclear. Here, we present the first phylogenomic study of an entire eukaryotic kingdom that uses a consistency criterion to strengthen phylogenetic conclusions. We reason that branches (splits) recovered with independent data and different tree reconstruction methods are likely to reflect true evolutionary relationships. Two complementary phylogenomic data sets based on 99 fungal genomes and 109 fungal expressed sequence tag (EST) sets analyzed with four different tree reconstruction methods shed light from different angles on the fungal tree of life. Eleven additional data sets address specifically the phylogenetic position of Blastocladiomycota, Ustilaginomycotina, and Dothideomycetes, respectively. The combined evidence from the resulting trees supports the deep-level stability of the fungal groups toward a comprehensive natural system of the fungi. In addition, our analysis reveals methodologically interesting aspects. Enrichment for EST encoded data—a common practice in phylogenomic analyses—introduces a strong bias toward slowly evolving and functionally correlated genes. Consequently, the generalization of phylogenomic data sets as collections of randomly selected genes cannot be taken for granted. A thorough characterization of the data to assess possible influences on the tree reconstruction should therefore become a standard in phylogenomic analyses. PMID:22114356

  7. Phylogenetic stochastic mapping without matrix exponentiation.

    PubMed

    Irvahn, Jan; Minin, Vladimir N

    2014-09-01

    Phylogenetic stochastic mapping is a method for reconstructing the history of trait changes on a phylogenetic tree relating species/organism carrying the trait. State-of-the-art methods assume that the trait evolves according to a continuous-time Markov chain (CTMC) and works well for small state spaces. The computations slow down considerably for larger state spaces (e.g., space of codons), because current methodology relies on exponentiating CTMC infinitesimal rate matrices-an operation whose computational complexity grows as the size of the CTMC state space cubed. In this work, we introduce a new approach, based on a CTMC technique called uniformization, which does not use matrix exponentiation for phylogenetic stochastic mapping. Our method is based on a new Markov chain Monte Carlo (MCMC) algorithm that targets the distribution of trait histories conditional on the trait data observed at the tips of the tree. The computational complexity of our MCMC method grows as the size of the CTMC state space squared. Moreover, in contrast to competing matrix exponentiation methods, if the rate matrix is sparse, we can leverage this sparsity and increase the computational efficiency of our algorithm further. Using simulated data, we illustrate advantages of our MCMC algorithm and investigate how large the state space needs to be for our method to outperform matrix exponentiation approaches. We show that even on the moderately large state space of codons our MCMC method can be significantly faster than currently used matrix exponentiation methods. PMID:24918812

  8. Phylogenetic Stochastic Mapping Without Matrix Exponentiation

    PubMed Central

    Irvahn, Jan; Minin, Vladimir N.

    2014-01-01

    Abstract Phylogenetic stochastic mapping is a method for reconstructing the history of trait changes on a phylogenetic tree relating species/organism carrying the trait. State-of-the-art methods assume that the trait evolves according to a continuous-time Markov chain (CTMC) and works well for small state spaces. The computations slow down considerably for larger state spaces (e.g., space of codons), because current methodology relies on exponentiating CTMC infinitesimal rate matrices—an operation whose computational complexity grows as the size of the CTMC state space cubed. In this work, we introduce a new approach, based on a CTMC technique called uniformization, which does not use matrix exponentiation for phylogenetic stochastic mapping. Our method is based on a new Markov chain Monte Carlo (MCMC) algorithm that targets the distribution of trait histories conditional on the trait data observed at the tips of the tree. The computational complexity of our MCMC method grows as the size of the CTMC state space squared. Moreover, in contrast to competing matrix exponentiation methods, if the rate matrix is sparse, we can leverage this sparsity and increase the computational efficiency of our algorithm further. Using simulated data, we illustrate advantages of our MCMC algorithm and investigate how large the state space needs to be for our method to outperform matrix exponentiation approaches. We show that even on the moderately large state space of codons our MCMC method can be significantly faster than currently used matrix exponentiation methods. PMID:24918812

  9. New substitution models for rooting phylogenetic trees

    PubMed Central

    Williams, Tom A.; Heaps, Sarah E.; Cherlin, Svetlana; Nye, Tom M. W.; Boys, Richard J.; Embley, T. Martin

    2015-01-01

    The root of a phylogenetic tree is fundamental to its biological interpretation, but standard substitution models do not provide any information on its position. Here, we describe two recently developed models that relax the usual assumptions of stationarity and reversibility, thereby facilitating root inference without the need for an outgroup. We compare the performance of these models on a classic test case for phylogenetic methods, before considering two highly topical questions in evolutionary biology: the deep structure of the tree of life and the root of the archaeal radiation. We show that all three alignments contain meaningful rooting information that can be harnessed by these new models, thus complementing and extending previous work based on outgroup rooting. In particular, our analyses exclude the root of the tree of life from the eukaryotes or Archaea, placing it on the bacterial stem or within the Bacteria. They also exclude the root of the archaeal radiation from several major clades, consistent with analyses using other rooting methods. Overall, our results demonstrate the utility of non-reversible and non-stationary models for rooting phylogenetic trees, and identify areas where further progress can be made. PMID:26323766

  10. A consistent phylogenetic backbone for the fungi.

    PubMed

    Ebersberger, Ingo; de Matos Simoes, Ricardo; Kupczok, Anne; Gube, Matthias; Kothe, Erika; Voigt, Kerstin; von Haeseler, Arndt

    2012-05-01

    The kingdom of fungi provides model organisms for biotechnology, cell biology, genetics, and life sciences in general. Only when their phylogenetic relationships are stably resolved, can individual results from fungal research be integrated into a holistic picture of biology. However, and despite recent progress, many deep relationships within the fungi remain unclear. Here, we present the first phylogenomic study of an entire eukaryotic kingdom that uses a consistency criterion to strengthen phylogenetic conclusions. We reason that branches (splits) recovered with independent data and different tree reconstruction methods are likely to reflect true evolutionary relationships. Two complementary phylogenomic data sets based on 99 fungal genomes and 109 fungal expressed sequence tag (EST) sets analyzed with four different tree reconstruction methods shed light from different angles on the fungal tree of life. Eleven additional data sets address specifically the phylogenetic position of Blastocladiomycota, Ustilaginomycotina, and Dothideomycetes, respectively. The combined evidence from the resulting trees supports the deep-level stability of the fungal groups toward a comprehensive natural system of the fungi. In addition, our analysis reveals methodologically interesting aspects. Enrichment for EST encoded data-a common practice in phylogenomic analyses-introduces a strong bias toward slowly evolving and functionally correlated genes. Consequently, the generalization of phylogenomic data sets as collections of randomly selected genes cannot be taken for granted. A thorough characterization of the data to assess possible influences on the tree reconstruction should therefore become a standard in phylogenomic analyses. PMID:22114356

  11. A multigene phylogenetic synthesis for the class Lecanoromycetes (Ascomycota): 1307 fungi representing 1139 infrageneric taxa, 317 genera and 66 families.

    PubMed

    Miadlikowska, Jolanta; Kauff, Frank; Högnabba, Filip; Oliver, Jeffrey C; Molnár, Katalin; Fraker, Emily; Gaya, Ester; Hafellner, Josef; Hofstetter, Valérie; Gueidan, Cécile; Otálora, Mónica A G; Hodkinson, Brendan; Kukwa, Martin; Lücking, Robert; Björk, Curtis; Sipman, Harrie J M; Burgaz, Ana Rosa; Thell, Arne; Passo, Alfredo; Myllys, Leena; Goward, Trevor; Fernández-Brime, Samantha; Hestmark, Geir; Lendemer, James; Lumbsch, H Thorsten; Schmull, Michaela; Schoch, Conrad L; Sérusiaux, Emmanuël; Maddison, David R; Arnold, A Elizabeth; Lutzoni, François; Stenroos, Soili

    2014-10-01

    The Lecanoromycetes is the largest class of lichenized Fungi, and one of the most species-rich classes in the kingdom. Here we provide a multigene phylogenetic synthesis (using three ribosomal RNA-coding and two protein-coding genes) of the Lecanoromycetes based on 642 newly generated and 3329 publicly available sequences representing 1139 taxa, 317 genera, 66 families, 17 orders and five subclasses (four currently recognized: Acarosporomycetidae, Lecanoromycetidae, Ostropomycetidae, Umbilicariomycetidae; and one provisionarily recognized, 'Candelariomycetidae'). Maximum likelihood phylogenetic analyses on four multigene datasets assembled using a cumulative supermatrix approach with a progressively higher number of species and missing data (5-gene, 5+4-gene, 5+4+3-gene and 5+4+3+2-gene datasets) show that the current classification includes non-monophyletic taxa at various ranks, which need to be recircumscribed and require revisionary treatments based on denser taxon sampling and more loci. Two newly circumscribed orders (Arctomiales and Hymeneliales in the Ostropomycetidae) and three families (Ramboldiaceae and Psilolechiaceae in the Lecanorales, and Strangosporaceae in the Lecanoromycetes inc. sed.) are introduced. The potential resurrection of the families Eigleraceae and Lopadiaceae is considered here to alleviate phylogenetic and classification disparities. An overview of the photobionts associated with the main fungal lineages in the Lecanoromycetes based on available published records is provided. A revised schematic classification at the family level in the phylogenetic context of widely accepted and newly revealed relationships across Lecanoromycetes is included. The cumulative addition of taxa with an increasing amount of missing data (i.e., a cumulative supermatrix approach, starting with taxa for which sequences were available for all five targeted genes and ending with the addition of taxa for which only two genes have been sequenced) revealed relatively stable relationships for many families and orders. However, the increasing number of taxa without the addition of more loci also resulted in an expected substantial loss of phylogenetic resolving power and support (especially for deep phylogenetic relationships), potentially including the misplacements of several taxa. Future phylogenetic analyses should include additional single copy protein-coding markers in order to improve the tree of the Lecanoromycetes. As part of this study, a new module ("Hypha") of the freely available Mesquite software was developed to compare and display the internodal support values derived from this cumulative supermatrix approach. PMID:24747130

  12. A multigene phylogenetic synthesis for the class Lecanoromycetes (Ascomycota): 1307 fungi representing 1139 infrageneric taxa, 317 genera and 66 families

    PubMed Central

    Miadlikowska, Jolanta; Kauff, Frank; Högnabba, Filip; Oliver, Jeffrey C.; Molnár, Katalin; Fraker, Emily; Gaya, Ester; Hafellner, Josef; Hofstetter, Valérie; Gueidan, Cécile; Otálora, Mónica A.G.; Hodkinson, Brendan; Kukwa, Martin; Lücking, Robert; Björk, Curtis; Sipman, Harrie J.M.; Burgaz, Ana Rosa; Thell, Arne; Passo, Alfredo; Myllys, Leena; Goward, Trevor; Fernández-Brime, Samantha; Hestmark, Geir; Lendemer, James; Lumbsch, H. Thorsten; Schmull, Michaela; Schoch, Conrad; Sérusiaux, Emmanuël; Maddison, David R.; Arnold, A. Elizabeth; Lutzoni, François; Stenroos, Soili

    2014-01-01

    The Lecanoromycetes is the largest class of lichenized Fungi, and one of the most species-rich classes in the kingdom. Here we provide a multigene phylogenetic synthesis (using three ribosomal RNA-coding and two protein-coding genes) of the Lecanoromycetes based on 642 newly generated and 3329 publicly available sequences representing 1139 taxa, 317 genera, 66 families, 17 orders and five subclasses (four currently recognized: Acarosporomycetidae, Lecanoromycetidae, Ostropomycetidae, Umbilicariomycetidae; and one provisionarily recognized, ‘Candelariomycetidae’). Maximum likelihood phylogenetic analyses on four multigene datasets assembled using a cumulative supermatrix approach with a progressively higher number of species and missing data (5-gene, 5+4-gene, 5+4+3-gene and 5+4+3+2-gene datasets) show that the current classification includes non-monophyletic taxa at various ranks, which need to be recircumscribed and require revisionary treatments based on denser taxon sampling and more loci. Two newly circumscribed orders (Arctomiales and Hymeneliales in the Ostropomycetidae) and three families (Ramboldiaceae and Psilolechiaceae in the Lecanorales, and Strangosporaceae in the Lecanoromycetes inc. sed.) are introduced. The potential resurrection of the families Eigleraceae and Lopadiaceae is considered here to alleviate phylogenetic and classification disparities. An overview of the photobionts associated with the main fungal lineages in the Lecanoromycetes based on available published records is provided. A revised schematic classification at the family level in the phylogenetic context of widely accepted and newly revealed relationships across Lecanoromycetes is included. The cumulative addition of taxa with an increasing amount of missing data (i.e., a cumulative supermatrix approach, starting with taxa for which sequences were available for all five targeted genes and ending with the addition of taxa for which only two genes have been sequenced) revealed relatively stable relationships for many families and orders. However, the increasing number of taxa without the addition of more loci also resulted in an expected substantial loss of phylogenetic resolving power and support (especially for deep phylogenetic relationships), potentially including the misplacements of several taxa. Future phylogenetic analyses should include additional single copy protein-coding markers in order to improve the tree of the Lecanoromycetes. As part of this study, a new module (“Hypha”) of the freely available Mesquite software was developed to compare and display the internodal support values derived from this cumulative supermatrix approach. PMID:24747130

  13. Cloud field classification based on textural features

    NASA Technical Reports Server (NTRS)

    Sengupta, Sailes Kumar

    1989-01-01

    An essential component in global climate research is accurate cloud cover and type determination. Of the two approaches to texture-based classification (statistical and textural), only the former is effective in the classification of natural scenes such as land, ocean, and atmosphere. In the statistical approach that was adopted, parameters characterizing the stochastic properties of the spatial distribution of grey levels in an image are estimated and then used as features for cloud classification. Two types of textural measures were used. One is based on the distribution of the grey level difference vector (GLDV), and the other on a set of textural features derived from the MaxMin cooccurrence matrix (MMCM). The GLDV method looks at the difference D of grey levels at pixels separated by a horizontal distance d and computes several statistics based on this distribution. These are then used as features in subsequent classification. The MaxMin tectural features on the other hand are based on the MMCM, a matrix whose (I,J)th entry give the relative frequency of occurrences of the grey level pair (I,J) that are consecutive and thresholded local extremes separated by a given pixel distance d. Textural measures are then computed based on this matrix in much the same manner as is done in texture computation using the grey level cooccurrence matrix. The database consists of 37 cloud field scenes from LANDSAT imagery using a near IR visible channel. The classification algorithm used is the well known Stepwise Discriminant Analysis. The overall accuracy was estimated by the percentage or correct classifications in each case. It turns out that both types of classifiers, at their best combination of features, and at any given spatial resolution give approximately the same classification accuracy. A neural network based classifier with a feed forward architecture and a back propagation training algorithm is used to increase the classification accuracy, using these two classes of features. Preliminary results based on the GLDV textural features alone look promising.

  14. Comparative evolutionary diversity and phylogenetic structure across multiple forest dynamics plots: a mega-phylogeny approach

    PubMed Central

    Erickson, David L.; Jones, Frank A.; Swenson, Nathan G.; Pei, Nancai; Bourg, Norman A.; Chen, Wenna; Davies, Stuart J.; Ge, Xue-jun; Hao, Zhanqing; Howe, Robert W.; Huang, Chun-Lin; Larson, Andrew J.; Lum, Shawn K. Y.; Lutz, James A.; Ma, Keping; Meegaskumbura, Madhava; Mi, Xiangcheng; Parker, John D.; Fang-Sun, I.; Wright, S. Joseph; Wolf, Amy T.; Ye, W.; Xing, Dingliang; Zimmerman, Jess K.; Kress, W. John

    2014-01-01

    Forest dynamics plots, which now span longitudes, latitudes, and habitat types across the globe, offer unparalleled insights into the ecological and evolutionary processes that determine how species are assembled into communities. Understanding phylogenetic relationships among species in a community has become an important component of assessing assembly processes. However, the application of evolutionary information to questions in community ecology has been limited in large part by the lack of accurate estimates of phylogenetic relationships among individual species found within communities, and is particularly limiting in comparisons between communities. Therefore, streamlining and maximizing the information content of these community phylogenies is a priority. To test the viability and advantage of a multi-community phylogeny, we constructed a multi-plot mega-phylogeny of 1347 species of trees across 15 forest dynamics plots in the ForestGEO network using DNA barcode sequence data (rbcL, matK, and psbA-trnH) and compared community phylogenies for each individual plot with respect to support for topology and branch lengths, which affect evolutionary inference of community processes. The levels of taxonomic differentiation across the phylogeny were examined by quantifying the frequency of resolved nodes throughout. In addition, three phylogenetic distance (PD) metrics that are commonly used to infer assembly processes were estimated for each plot [PD, Mean Phylogenetic Distance (MPD), and Mean Nearest Taxon Distance (MNTD)]. Lastly, we examine the partitioning of phylogenetic diversity among community plots through quantification of inter-community MPD and MNTD. Overall, evolutionary relationships were highly resolved across the DNA barcode-based mega-phylogeny, and phylogenetic resolution for each community plot was improved when estimated within the context of the mega-phylogeny. Likewise, when compared with phylogenies for individual plots, estimates of phylogenetic diversity in the mega-phylogeny were more consistent, thereby removing a potential source of bias at the plot-level, and demonstrating the value of assessing phylogenetic relationships simultaneously within a mega-phylogeny. An unexpected result of the comparisons among plots based on the mega-phylogeny was that the communities in the ForestGEO plots in general appear to be assemblages of more closely related species than expected by chance, and that differentiation among communities is very low, suggesting deep floristic connections among communities and new avenues for future analyses in community ecology. PMID:25414723

  15. Comparative evolutionary diversity and phylogenetic structure across multiple forest dynamics plots: a mega-phylogeny approach.

    PubMed

    Erickson, David L; Jones, Frank A; Swenson, Nathan G; Pei, Nancai; Bourg, Norman A; Chen, Wenna; Davies, Stuart J; Ge, Xue-Jun; Hao, Zhanqing; Howe, Robert W; Huang, Chun-Lin; Larson, Andrew J; Lum, Shawn K Y; Lutz, James A; Ma, Keping; Meegaskumbura, Madhava; Mi, Xiangcheng; Parker, John D; Fang-Sun, I; Wright, S Joseph; Wolf, Amy T; Ye, W; Xing, Dingliang; Zimmerman, Jess K; Kress, W John

    2014-01-01

    Forest dynamics plots, which now span longitudes, latitudes, and habitat types across the globe, offer unparalleled insights into the ecological and evolutionary processes that determine how species are assembled into communities. Understanding phylogenetic relationships among species in a community has become an important component of assessing assembly processes. However, the application of evolutionary information to questions in community ecology has been limited in large part by the lack of accurate estimates of phylogenetic relationships among individual species found within communities, and is particularly limiting in comparisons between communities. Therefore, streamlining and maximizing the information content of these community phylogenies is a priority. To test the viability and advantage of a multi-community phylogeny, we constructed a multi-plot mega-phylogeny of 1347 species of trees across 15 forest dynamics plots in the ForestGEO network using DNA barcode sequence data (rbcL, matK, and psbA-trnH) and compared community phylogenies for each individual plot with respect to support for topology and branch lengths, which affect evolutionary inference of community processes. The levels of taxonomic differentiation across the phylogeny were examined by quantifying the frequency of resolved nodes throughout. In addition, three phylogenetic distance (PD) metrics that are commonly used to infer assembly processes were estimated for each plot [PD, Mean Phylogenetic Distance (MPD), and Mean Nearest Taxon Distance (MNTD)]. Lastly, we examine the partitioning of phylogenetic diversity among community plots through quantification of inter-community MPD and MNTD. Overall, evolutionary relationships were highly resolved across the DNA barcode-based mega-phylogeny, and phylogenetic resolution for each community plot was improved when estimated within the context of the mega-phylogeny. Likewise, when compared with phylogenies for individual plots, estimates of phylogenetic diversity in the mega-phylogeny were more consistent, thereby removing a potential source of bias at the plot-level, and demonstrating the value of assessing phylogenetic relationships simultaneously within a mega-phylogeny. An unexpected result of the comparisons among plots based on the mega-phylogeny was that the communities in the ForestGEO plots in general appear to be assemblages of more closely related species than expected by chance, and that differentiation among communities is very low, suggesting deep floristic connections among communities and new avenues for future analyses in community ecology. PMID:25414723

  16. Evaluating Support for the Current Classification of Eukaryotic Diversity

    PubMed Central

    Parfrey, Laura Wegener; Barbero, Erika; Lasser, Elyse; Dunthorn, Micah; Bhattacharya, Debashish; Patterson, David J; Katz, Laura A

    2006-01-01

    Perspectives on the classification of eukaryotic diversity have changed rapidly in recent years, as the four eukaryotic groups within the five-kingdom classification—plants, animals, fungi, and protists—have been transformed through numerous permutations into the current system of six “supergroups.” The intent of the supergroup classification system is to unite microbial and macroscopic eukaryotes based on phylogenetic inference. This supergroup approach is increasing in popularity in the literature and is appearing in introductory biology textbooks. We evaluate the stability and support for the current six-supergroup classification of eukaryotes based on molecular genealogies. We assess three aspects of each supergroup: (1) the stability of its taxonomy, (2) the support for monophyly (single evolutionary origin) in molecular analyses targeting a supergroup, and (3) the support for monophyly when a supergroup is included as an out-group in phylogenetic studies targeting other taxa. Our analysis demonstrates that supergroup taxonomies are unstable and that support for groups varies tremendously, indicating that the current classification scheme of eukaryotes is likely premature. We highlight several trends contributing to the instability and discuss the requirements for establishing robust clades within the eukaryotic tree of life. PMID:17194223

  17. AGN Zoo and Classifications of Active Galaxies

    NASA Astrophysics Data System (ADS)

    Mickaelian, Areg M.

    2015-07-01

    We review the variety of Active Galactic Nuclei (AGN) classes (so-called "AGN zoo") and classification schemes of galaxies by activity types based on their optical emission-line spectrum, as well as other parameters and other than optical wavelength ranges. A historical overview of discoveries of various types of active galaxies is given, including Seyfert galaxies, radio galaxies, QSOs, BL Lacertae objects, Starbursts, LINERs, etc. Various kinds of AGN diagnostics are discussed. All known AGN types and subtypes are presented and described to have a homogeneous classification scheme based on the optical emission-line spectra and in many cases, also other parameters. Problems connected with accurate classifications and open questions related to AGN and their classes are discussed and summarized.

  18. Sea floor classification from airborne lidar data

    NASA Astrophysics Data System (ADS)

    Tulldahl, H. Michael; Vahlberg, Claes; Axelsson, Andreas; Karlsson, Henrik; Jonsson, Peter

    2007-10-01

    Airborne depth sounding lidar has proven to be a valuable sensor for rapid and accurate sounding of shallow areas. The received lidar pulse echo contains information of the sea floor depth, but also other data can be extracted. We currently perform work on bottom classification and water turbidity estimation based on lidar data. In this paper we present the theoretical background and experimental results on bottom classification. The algorithms are developed from simulations and then tested on experimental data from the operational airborne lidar system Hawk Eye II. We compare the results to field data taken from underwater video recordings. Our results indicate that bottom classification from airborne lidar data can be made with high accuracy.

  19. Classiology and soil classification

    NASA Astrophysics Data System (ADS)

    Rozhkov, V. A.

    2012-03-01

    Classiology can be defined as a science studying the principles and rules of classification of objects of any nature. The development of the theory of classification and the particular methods for classifying objects are the main challenges of classiology; to a certain extent, they are close to the challenges of pattern recognition. The methodology of classiology integrates a wide range of methods and approaches: from expert judgment to formal logic, multivariate statistics, and informatics. Soil classification assumes generalization of available data and practical experience, formalization of our notions about soils, and their representation in the form of an information system. As an information system, soil classification is designed to predict the maximum number of a soil's properties from the position of this soil in the classification space. The existing soil classification systems do not completely satisfy the principles of classiology. The violation of logical basis, poor structuring, low integrity, and inadequate level of formalization make these systems verbal schemes rather than classification systems sensu stricto. The concept of classification as listing (enumeration) of objects makes it possible to introduce the notion of the information base of classification. For soil objects, this is the database of soil indices (properties) that might be applied for generating target-oriented soil classification system. Mathematical methods enlarge the prognostic capacity of classification systems; they can be applied to assess the quality of these systems and to recognize new soil objects to be included in the existing systems. The application of particular principles and rules of classiology for soil classification purposes is discussed in this paper.

  20. Genome-wide identification and phylogenetic analysis of the ERF gene family in cucumbers

    PubMed Central

    Hu, Lifang; Liu, Shiqiang

    2011-01-01

    Members of the ERF transcription-factor family participate in a number of biological processes, viz., responses to hormones, adaptation to biotic and abiotic stress, metabolism regulation, beneficial symbiotic interactions, cell differentiation and developmental processes. So far, no tissue-expression profile of any cucumber ERF protein has been reported in detail. Recent completion of the cucumber full-genome sequence has come to facilitate, not only genome-wide analysis of ERF family members in cucumbers themselves, but also a comparative analysis with those in Arabidopsis and rice. In this study, 103 hypothetical ERF family genes in the cucumber genome were identified, phylogenetic analysis indicating their classification into 10 groups, designated I to X. Motif analysis further indicated that most of the conserved motifs outside the AP2/ERF domain, are selectively distributed among the specific clades in the phylogenetic tree. From chromosomal localization and genome distribution analysis, it appears that tandem-duplication may have contributed to CsERF gene expansion. Intron/exon structure analysis indicated that a few CsERFs still conserved the former intron-position patterns existent in the common ancestor of monocots and eudicots. Expression analysis revealed the widespread distribution of the cucumber ERF gene family within plant tissues, thereby implying the probability of their performing various roles therein. Furthermore, members of some groups presented mutually similar expression patterns that might be related to their phylogenetic groups. PMID:22215967

  1. Molecular phylogenetics unveils the ancient evolutionary origins of the enigmatic fairy armadillos.

    PubMed

    Delsuc, Frédéric; Superina, Mariella; Tilak, Marie-Ka; Douzery, Emmanuel J P; Hassanin, Alexandre

    2012-02-01

    Fairy armadillos or pichiciegos (Xenarthra, Dasypodidae) are among the most elusive mammals. Due to their subterranean and nocturnal lifestyle, their basic biology and evolutionary history remain virtually unknown. Two distinct species with allopatric distributions are recognized: Chlamyphorus truncatus is restricted to central Argentina, while Calyptophractus retusus occurs in the Gran Chaco of Argentina, Paraguay, and Bolivia. To test their monophyly and resolve their phylogenetic affinities within armadillos, we obtained sequence data from modern and museum specimens for two mitochondrial genes (12S RNA [MT-RNR1] and NADH dehydrogenase 1 [MT-ND1]) and two nuclear exons (breast cancer 1 early onset exon 11 [BRCA1] and von Willebrand factor exon 28 [VWF]). Phylogenetic analyses provided a reference phylogeny and timescale for living xenarthran genera. Our results reveal monophyletic pichiciegos as members of a major armadillo subfamily (Chlamyphorinae). Their strictly fossorial lifestyle probably evolved as a response to the Oligocene aridification that occurred in South America after their divergence from Tolypeutinae around 32 million years ago (Mya). The ancient divergence date (∼17Mya) for separation between the two species supports their taxonomic classification into distinct genera. The synchronicity with Middle Miocene marine incursions along the Paraná river basin suggests a vicariant origin for pichiciegos by the disruption of their ancestral range. Their phylogenetic distinctiveness and rarity in the wild argue in favor of high conservation priority. PMID:22122941

  2. Molecular phylogenetics and historical biogeography amid shifting continents in the cockles and giant clams (Bivalvia: Cardiidae).

    PubMed

    Herrera, Nathanael D; Ter Poorten, Jan Johan; Bieler, Rüdiger; Mikkelsen, Paula M; Strong, Ellen E; Jablonski, David; Steppan, Scott J

    2015-12-01

    Reconstructing historical biogeography of the marine realm is complicated by indistinct barriers and, over deeper time scales, a dynamic landscape shaped by plate tectonics. Here we present the most extensive examination of model-based historical biogeography among marine invertebrates to date. We conducted the largest phylogenetic and molecular clock analyses to date for the bivalve family Cardiidae (cockles and giant clams) with three unlinked loci for 110 species representing 37 of the 50 genera. Ancestral ranges were reconstructed using the dispersal-extinction-cladogenesis (DEC) method with a time-stratified paleogeographic model wherein dispersal rates varied with shifting tectonics. Results were compared to previous classifications and the extensive paleontological record. Six of the eight prior subfamily groupings were found to be para- or polyphyletic. Cardiidae originated and subsequently diversified in the tropical Indo-Pacific starting in the Late Triassic. Eastern Atlantic species were mainly derived from the tropical Indo-Mediterranean region via the Tethys Sea. In contrast, the western Atlantic fauna was derived from Indo-Pacific clades. Our phylogenetic results demonstrated greater concordance with geography than did previous phylogenies based on morphology. Time-stratifying the DEC reconstruction improved the fit and was highly consistent with paleo-ocean currents and paleogeography. Lastly, combining molecular phylogenetics with a rich and well-documented fossil record allowed us to test the accuracy and precision of biogeographic range reconstructions. PMID:26234273

  3. DNA sequence support for a close phylogenetic relationship between some storks and New World vultures.

    PubMed Central

    Avise, J C; Nelson, W S; Sibley, C G

    1994-01-01

    Nucleotide sequences from the mitochondrial cytochrome b gene were used to address a controversial suggestion that New World vultures are related more closely to storks than to Old World vultures. Phylogenetic analyses of 1-kb sequences from 18 relevant avian species indicate that the similarities in morphology and behavior between New World and Old World vultures probably manifest convergent adaptations associated with carrion-feeding, rather than propinquity of descent. Direct sequence evidence for a close phylogenetic alliance between at least some New World vultures and storks lends support to conclusions reached previously from DNA.DNA hybridization methods and detailed morphology-based appraisals, and it illustrates how mistaken assumptions of homology for organismal adaptations can compromise biological classifications. However, there was a lack of significant resolution for most other branches in the cytochrome b phylogenetic reconstructions. This irresolution is most likely attributable to a close temporal clustering of nodes, rather than to ceiling effects (mutational saturation) producing an inappropriate window of resolution for the cytochrome b sequences. Images PMID:8197203

  4. Controlled recovery of phylogenetic communities from an evolutionary model using a network approach

    NASA Astrophysics Data System (ADS)

    Sousa, Arthur M. Y. R.; Vieira, André P.; Prado, Carmen P. C.; Andrade, Roberto F. S.

    2016-04-01

    This works reports the use of a complex network approach to produce a phylogenetic classification tree of a simple evolutionary model. This approach has already been used to treat proteomic data of actual extant organisms, but an investigation of its reliability to retrieve a traceable evolutionary history is missing. The used evolutionary model includes key ingredients for the emergence of groups of related organisms by differentiation through random mutations and population growth, but purposefully omits other realistic ingredients that are not strictly necessary to originate an evolutionary history. This choice causes the model to depend only on a small set of parameters, controlling the mutation probability and the population of different species. Our results indicate that for a set of parameter values, the phylogenetic classification produced by the used framework reproduces the actual evolutionary history with a very high average degree of accuracy. This includes parameter values where the species originated by the evolutionary dynamics have modular structures. In the more general context of community identification in complex networks, our model offers a simple setting for evaluating the effects, on the efficiency of community formation and identification, of the underlying dynamics generating the network itself.

  5. Molecular phylogenetic analysis of mudflat snails (Gastropoda: Euthyneura: Amphiboloidea) supports an Australasian centre of origin.

    PubMed

    Golding, Rosemary E

    2012-04-01

    Amphiboloidea is a small but widespread group of snails found exclusively, and often abundantly, in mudflat and associated salt marsh or mangrove habitat. This study uses molecular data from three loci (COI, 16S and 28S) to infer phylogenetic relationships in Amphiboloidea and examine its position in Euthyneura. All but two of the named extant species of Amphiboloidea and additional undescribed taxa from across Southeast Asia and the Arabian Gulf were sampled. In contrast to the current morphology-based classification dividing Amphiboloidea into three families, analysis of molecular data supports revision of the classification to comprise two families. Maningrididae is a monotypic family basal to Amphibolidae, which is revised to comprise three subfamilies: Amphibolinae, Phallomedusinae and Salinatorinae. Sequence divergence between Asian populations of Naranjia is relatively large and possibly indicative of species complexes divergent across the Strait of Malacca. Salinatorrosacea and Salinator burmana do not cluster with other Salinator species, and require generic reassignment. In addition, sequences were obtained from an undescribed species of Lactiforis from the Malay Peninsula. Reconstruction of ancestral distributions indicates a plesiomorphic distribution and centre of origin in Australasia, with two genera subsequently diversifying throughout Asia. Increasing the sampling density of amphiboloid taxa in a phylogenetic analysis of Euthyneura did not resolve the identity of the sister taxon to Amphibolidae, but confirmed its inclusion in Pulmonata/Panpulmonata. PMID:22210412

  6. Phylogenetic inferences reveal a large extent of novel biodiversity in chemically rich tropical marine cyanobacteria.

    PubMed

    Engene, Niclas; Gunasekera, Sarath P; Gerwick, William H; Paul, Valerie J

    2013-03-01

    Benthic marine cyanobacteria are known for their prolific biosynthetic capacities to produce structurally diverse secondary metabolites with biomedical application and their ability to form cyanobacterial harmful algal blooms. In an effort to provide taxonomic clarity to better guide future natural product drug discovery investigations and harmful algal bloom monitoring, this study investigated the taxonomy of tropical and subtropical natural product-producing marine cyanobacteria on the basis of their evolutionary relatedness. Our phylogenetic inferences of marine cyanobacterial strains responsible for over 100 bioactive secondary metabolites revealed an uneven taxonomic distribution, with a few groups being responsible for the vast majority of these molecules. Our data also suggest a high degree of novel biodiversity among natural product-producing strains that was previously overlooked by traditional morphology-based taxonomic approaches. This unrecognized biodiversity is primarily due to a lack of proper classification systems since the taxonomy of tropical and subtropical, benthic marine cyanobacteria has only recently been analyzed by phylogenetic methods. This evolutionary study provides a framework for a more robust classification system to better understand the taxonomy of tropical and subtropical marine cyanobacteria and the distribution of natural products in marine cyanobacteria. PMID:23315747

  7. Phylogenetic relationships among Staphylococcus species and refinement of cluster groups based on multilocus data

    PubMed Central

    2012-01-01

    Background Estimates of relationships among Staphylococcus species have been hampered by poor and inconsistent resolution of phylogenies based largely on single gene analyses incorporating only a limited taxon sample. As such, the evolutionary relationships and hierarchical classification schemes among species have not been confidently established. Here, we address these points through analyses of DNA sequence data from multiple loci (16S rRNA gene, dnaJ, rpoB, and tuf gene fragments) using multiple Bayesian and maximum likelihood phylogenetic approaches that incorporate nearly all recognized Staphylococcus taxa. Results We estimated the phylogeny of fifty-seven Staphylococcus taxa using partitioned-model Bayesian and maximum likelihood analysis, as well as Bayesian gene-tree species-tree methods. Regardless of methodology, we found broad agreement among methods that the current cluster groups require revision, although there was some disagreement among methods in resolution of higher order relationships. Based on our phylogenetic estimates, we propose a refined classification for Staphylococcus with species being classified into 15 cluster groups (based on molecular data) that adhere to six species groups (based on phenotypic properties). Conclusions Our findings are in general agreement with gene tree-based reports of the staphylococcal phylogeny, although we identify multiple previously unreported relationships among species. Our results support the general importance of such multilocus assessments as a standard in microbial studies to more robustly infer relationships among recognized and newly discovered lineages. PMID:22950675

  8. The bacterial species dilemma and the genomic–phylogenetic species concept

    PubMed Central

    Staley, James T

    2006-01-01

    The number of species of Bacteria and Archaea (ca 5000) is surprisingly small considering their early evolution, genetic diversity and residence in all ecosystems. The bacterial species definition accounts in part for the small number of named species. The primary procedures required to identify new species of Bacteria and Archaea are DNA–DNA hybridization and phenotypic characterization. Recently, 16S rRNA gene sequencing and phylogenetic analysis have been applied to bacterial taxonomy. Although 16S phylogeny is arguably excellent for classification of Bacteria and Archaea from the Domain level down to the family or genus, it lacks resolution below that level. Newer approaches, including multilocus sequence analysis, and genome sequence and microarray analyses, promise to provide necessary information to better understand bacterial speciation. Indeed, recent data using these approaches, while meagre, support the view that speciation processes may occur at the subspecies level within ecological niches (ecovars) and owing to biogeography (geovars). A major dilemma for bacterial taxonomists is how to incorporate this new information into the present hierarchical system for classification of Bacteria and Archaea without causing undesirable confusion and contention. This author proposes the genomic–phylogenetic species concept (GPSC) for the taxonomy of prokaryotes. The aim is twofold. First, the GPSC would provide a conceptual and testable framework for bacterial taxonomy. Second, the GPSC would replace the burdensome requirement for DNA hybridization presently needed to describe new species. Furthermore, the GPSC is consistent with the present treatment at higher taxonomic levels. PMID:17062409

  9. Prostate segmentation by sparse representation based classification.

    PubMed

    Gao, Yaozong; Liao, Shu; Shen, Dinggang

    2012-01-01

    Accurate segmentation of prostate in CT images is important in image-guided radiotherapy. However, it is difficult to localize the prostate in CT images due to low image contrast, unpredicted motion and large appearance variations across different treatment days. To address these issues, we propose a sparse representation based classification method to accurately segment the prostate. The main contributions of this paper are: (1) A discriminant dictionary learning technique is proposed to overcome the limitation of the traditional Sparse Representation based classifier (SRC). (2) Context features are incorporated into SRC to refine the prostate boundary in an iterative scheme. (3) A residue-based linear regression model is trained to increase the classification performance of SRC and extend it from hard classification to soft classification. To segment the prostate, the new treatment image is first rigidly aligned to the planning image space based on the pelvic bones. Then two sets of location-adaptive SRCs along two coordinate directions are applied on the aligned treatment image to produce a probability map, based on which all previously segmented images of the same patient are rigidly aligned onto the new treatment image and majority voting strategy is further adopted to finally segment the prostate in the new treatment image. The proposed method has been evaluated on a CT dataset consisting of 15 patients and 230 CT images. Promising results have been achieved. PMID:23286162

  10. Complete mitochondrial genomes elucidate phylogenetic relationships of the deep-sea octocoral families Coralliidae and Paragorgiidae

    NASA Astrophysics Data System (ADS)

    Figueroa, Diego F.; Baco, Amy R.

    2014-01-01

    In the past decade, molecular phylogenetic analyses of octocorals have shown that the current morphological taxonomic classification of these organisms needs to be revised. The latest phylogenetic analyses show that most octocorals can be divided into three main clades. One of these clades contains the families Coralliidae and Paragorgiidae. These families share several taxonomically important characters and it has been suggested that they may not be monophyletic; with the possibility of the Coralliidae being a derived branch of the Paragorgiidae. Uncertainty exists not only in the relationship of these two families, but also in the classification of the two genera that make up the Coralliidae, Corallium and Paracorallium. Molecular analyses suggest that the genus Corallium is paraphyletic, and it can be divided into two main clades, with the Paracorallium as members of one of these clades. In this study we sequenced the whole mitochondrial genome of five species of Paragorgia and of five species of Corallium to use in a phylogenetic analysis to achieve two main objectives; the first to elucidate the phylogenetic relationship between the Paragorgiidae and Coralliidae and the second to determine whether the genera Corallium and Paracorallium are monophyletic. Our results show that other members of the Coralliidae share the two novel mitochondrial gene arrangements found in a previous study in Corallium konojoi and Paracorallium japonicum; and that the Corallium konojoi arrangement is also found in the Paragorgiidae. Our phylogenetic reconstruction based on all the protein coding genes and ribosomal RNAs of the mitochondrial genome suggest that the Coralliidae are not a derived branch of the Paragorgiidae, but rather a monophyletic sister branch to the Paragorgiidae. While our manuscript was in review a study was published using morphological data and several fragments from mitochondrial genes to redefine the taxonomy of the Coralliidae. Paracorallium was subsumed into Corallium and the genus Hemicorallium was resurrected. This left two disjunct clades as Corallium, making that genus paraphyletic. One of the clades includes the type specimens of Corallium, indicating that clade should remain Corallium. For the other clade, we support the resurrection of the genus Pleurocorallium to fix the paraphyly of Corallium. Based on congruent phylogenies in both studies, the genus Pleurocorallium includes the species C. secundum, C. kishinouyei, C. konojoi, C. elatius, and C. niveum.

  11. An updated evolutionary classification of CRISPR-Cas systems.

    PubMed

    Makarova, Kira S; Wolf, Yuri I; Alkhnbashi, Omer S; Costa, Fabrizio; Shah, Shiraz A; Saunders, Sita J; Barrangou, Rodolphe; Brouns, Stan J J; Charpentier, Emmanuelle; Haft, Daniel H; Horvath, Philippe; Moineau, Sylvain; Mojica, Francisco J M; Terns, Rebecca M; Terns, Michael P; White, Malcolm F; Yakunin, Alexander F; Garrett, Roger A; van der Oost, John; Backofen, Rolf; Koonin, Eugene V

    2015-11-01

    The evolution of CRISPR-cas loci, which encode adaptive immune systems in archaea and bacteria, involves rapid changes, in particular numerous rearrangements of the locus architecture and horizontal transfer of complete loci or individual modules. These dynamics complicate straightforward phylogenetic classification, but here we present an approach combining the analysis of signature protein families and features of the architecture of cas loci that unambiguously partitions most CRISPR-cas loci into distinct classes, types and subtypes. The new classification retains the overall structure of the previous version but is expanded to now encompass two classes, five types and 16 subtypes. The relative stability of the classification suggests that the most prevalent variants of CRISPR-Cas systems are already known. However, the existence of rare, currently unclassifiable variants implies that additional types and subtypes remain to be characterized. PMID:26411297

  12. Classification of HIV-1 sequences using profile Hidden Markov Models.

    PubMed

    Dwivedi, Sanjiv K; Sengupta, Supratim

    2012-01-01

    Accurate classification of HIV-1 subtypes is essential for studying the dynamic spatial distribution pattern of HIV-1 subtypes and also for developing effective methods of treatment that can be targeted to attack specific subtypes. We propose a classification method based on profile Hidden Markov Model that can accurately identify an unknown strain. We show that a standard method that relies on the construction of a positive training set only, to capture unique features associated with a particular subtype, can accurately classify sequences belonging to all subtypes except B and D. We point out the drawbacks of the standard method; namely, an arbitrary choice of threshold to distinguish between true positives and true negatives, and the inability to discriminate between closely related subtypes. We then propose an improved classification method based on construction of a positive as well as a negative training set to improve discriminating ability between closely related subtypes like B and D. Finally, we show how the improved method can be used to accurately determine the subtype composition of Common Recombinant Forms of the virus that are made up of two or more subtypes. Our method provides a simple and highly accurate alternative to other classification methods and will be useful in accurately annotating newly sequenced HIV-1 strains. PMID:22623958

  13. Multilocus phylogeny of the New-World mud turtles (Kinosternidae) supports the traditional classification of the group.

    PubMed

    Spinks, Phillip Q; Thomson, Robert C; Gidiş, Müge; Bradley Shaffer, H

    2014-07-01

    A goal of modern taxonomy is to develop classifications that reflect current phylogenetic relationships and are as stable as possible given the inherent uncertainties in much of the tree of life. Here, we provide an in-depth phylogenetic analysis, based on 14 nuclear loci comprising 10,305 base pairs of aligned sequence data from all but two species of the turtle family Kinosternidae, to determine whether recent proposed changes to the group's classification are justified and necessary. We conclude that those proposed changes were based on (1) mtDNA gene tree anomalies, (2) preliminary analyses that do not fully capture the breadth of geographic variation necessary to motivate taxonomic changes, and (3) changes in rank that are not motivated by non-monophyletic groups. Our recommendation, for this and other similar cases, is that taxonomic changes be made only when phylogenetic results that are statistically well-supported and corroborated by multiple independent lines of genetic evidence indicate that non-monophyletic groups are currently recognized and need to be corrected. We hope that other members of the phylogenetics community will join us in proposing taxonomic changes only when the strongest phylogenetic data demand such changes, and in so doing that we can move toward stable, phylogenetically informed classifications of lasting value. PMID:24704303

  14. Accurate gene-tree reconstruction by learning gene- and species-specific substitution rates across multiple complete genomes

    PubMed Central

    Rasmussen, Matthew D.; Kellis, Manolis

    2007-01-01

    Comparative genomics provides a general methodology for discovering functional DNA elements and understanding their evolution. The availability of many related genomes enables more powerful analyses, but requires rigorous phylogenetic methods to resolve orthologous genes and regions. Here, we use 12 recently sequenced Drosophila genomes and nine fungal genomes to address the problem of accurate gene-tree reconstruction across many complete genomes. We show that existing phylogenetic methods that treat each gene tree in isolation show large-scale inaccuracies, largely due to insufficient phylogenetic information in individual genes. However, we find that gene trees exhibit common properties that can be exploited for evolutionary studies and accurate phylogenetic reconstruction. Evolutionary rates can be decoupled into gene-specific and species-specific components, which can be learned across complete genomes. We develop a phylogenetic reconstruction methodology that exploits these properties and achieves significantly higher accuracy, addressing the species-level heterotachy and enabling studies of gene evolution in the context of species evolution. PMID:17989260

  15. Modularized evolution in archaeal methanogens phylogenetic forest.

    PubMed

    Li, Jun; Wong, Chi-Fat; Wong, Mabel Ting; Huang, He; Leung, Frederick C

    2014-12-01

    Methanogens are methane-producing archaea that plays a key role in the global carbon cycle. To date, the evolutionary history of methanogens and closely related nonmethanogen species remains unresolved among studies conducted upon different genetic markers, attributing to horizontal gene transfers (HGTs). With an effort to decipher both congruent and conflicting evolutionary events, reconstruction of coevolved gene clusters and hierarchical structure in the archaeal methanogen phylogenetic forest, comprehensive evolution, and network analyses were performed upon 3,694 gene families from 41 methanogens and 33 closely related archaea. Our results show that 1) greater than 50% of genes are in topological dissonance with others; 2) the prevalent interorder HGTs, even for core genes, in methanogen genomes led to their scrambled phylogenetic relationships; 3) most methanogenesis-related genes have experienced at least one HGT; 4) greater than 20% of the genes in methanogen genomes were transferred horizontally from other archaea, with genes involved in cell-wall synthesis and defense system having been transferred most frequently; 5) the coevolution network contains seven statistically robust modules, wherein the central module has the highest average node strength and comprises a majority of the core genes; 6) different coevolutionary module genes boomed in different time and evolutionary lineage, constructing diversified pan-genome structures; 7) the modularized evolution is also closely related to the vertical evolution signals and the HGT rate of the genes. Overall, this study presented a modularized phylogenetic forest that describes a combination of complicated vertical and nonvertical evolutionary processes for methanogenic archaeal species. PMID:25502908

  16. Linear Classification Functions.

    ERIC Educational Resources Information Center

    Huberty, Carl J.; Smith, Jerry D.

    Linear classification functions (LCFs) arise in a predictive discriminant analysis for the purpose of classifying experimental units into criterion groups. The relative contribution of the response variables to classification accuracy may be based on LCF-variable correlations for each group. It is proved that, if the raw response measures are…

  17. Library Classification 2020

    ERIC Educational Resources Information Center

    Harris, Christopher

    2013-01-01

    In this article the author explores how a new library classification system might be designed using some aspects of the Dewey Decimal Classification (DDC) and ideas from other systems to create something that works for school libraries in the year 2020. By examining what works well with the Dewey Decimal System, what features should be carried…

  18. Maternal Language Classification Scale.

    ERIC Educational Resources Information Center

    Mahoney, Gerald; Petersen, Gail

    This study reports the interrater agreement of the Maternal Language Classification Scale (MLCS), a functional language classification system, developed partly to avoid problems identified with previous scales. The MLCS is a comprehensive system for classifying the functional content of maternal language addressed to children whose mean length of…

  19. The genetic code can cause systematic bias in simple phylogenetic models.

    PubMed

    Whelan, Simon

    2008-12-27

    Phylogenetic analysis depends on inferential methodology estimating accurately the degree of divergence between sequences. Inaccurate estimates can lead to misleading evolutionary inferences, including incorrect tree topology estimates and poor dating of historical species divergence. Protein coding sequences are ubiquitous in phylogenetic inference, but many of the standard methods commonly used to describe their evolution do not explicitly account for the dependencies between sites in a codon induced by the genetic code. This study evaluates the performance of several standard methods on datasets simulated under a simple substitution model, describing codon evolution under a range of different types of selective pressures. This approach also offers insights into the relative performance of different phylogenetic methods when there are dependencies acting between the sites in the data. Methods based on statistical models performed well when there was no or limited purifying selection in the simulated sequences (low degree of dependency between sites in a codon), although more biologically realistic models tended to outperform simpler models. Phylogenetic methods exhibited greater variability in performance for sequences simulated under strong purifying selection (high degree of the dependencies between sites in a codon). Simple models substantially underestimate the degree of divergence between sequences, and underestimation was more pronounced on the internal branches of the tree. This underestimation resulted in some statistical methods performing poorly and exhibiting evidence for systematic bias in tree inference. Amino acid-based and nucleotide models that contained generic descriptions of spatial and temporal heterogeneity, such as mixture and temporal hidden Markov models, coped notably better, producing more accurate estimates of evolutionary divergence and the tree topology. PMID:18852102

  20. Evidence of Statistical Inconsistency of Phylogenetic Methods in the Presence of Multiple Sequence Alignment Uncertainty

    PubMed Central

    Md Mukarram Hossain, A.S.; Blackburne, Benjamin P.; Shah, Abhijeet; Whelan, Simon

    2015-01-01

    Evolutionary studies usually use a two-step process to investigate sequence data. Step one estimates a multiple sequence alignment (MSA) and step two applies phylogenetic methods to ask evolutionary questions of that MSA. Modern phylogenetic methods infer evolutionary parameters using maximum likelihood or Bayesian inference, mediated by a probabilistic substitution model that describes sequence change over a tree. The statistical properties of these methods mean that more data directly translates to an increased confidence in downstream results, providing the substitution model is adequate and the MSA is correct. Many studies have investigated the robustness of phylogenetic methods in the presence of substitution model misspecification, but few have examined the statistical properties of those methods when the MSA is unknown. This simulation study examines the statistical properties of the complete two-step process when inferring sequence divergence and the phylogenetic tree topology. Both nucleotide and amino acid analyses are negatively affected by the alignment step, both through inaccurate guide tree estimates and through overfitting to that guide tree. For many alignment tools these effects become more pronounced when additional sequences are added to the analysis. Nucleotide sequences are particularly susceptible, with MSA errors leading to statistical support for long-branch attraction artifacts, which are usually associated with gross substitution model misspecification. Amino acid MSAs are more robust, but do tend to arbitrarily resolve multifurcations in favor of the guide tree. No inference strategies produce consistently accurate estimates of divergence between sequences, although amino acid MSAs are again more accurate than their nucleotide counterparts. We conclude with some practical suggestions about how to limit the effect of MSA uncertainty on evolutionary inference. PMID:26139831

  1. 2-Stage Classification Modeling

    Energy Science and Technology Software Center (ESTSC)

    1994-11-01

    CIRCUIT2.4 is used to design optimum two-stage classification configurations and operating conditions for energy conservation. It permits simulation of five basic grinding-classification circuits, including one single-stage and four two-stage classification arrangements. Hydrocyclones, spiral classifiers, and sieve band screens can be simulated, and the user may choose the combination of devices for the flowsheet simulation. In addition, the user may select from four classification modeling methods to achieve the goals of a simulation project using themore » most familiar concepts. Circuit performance is modeled based on classification parameters or equipment operating conditions. A modular approach was taken in designing the program, which allows future addition of other models with relatively minor changes.« less

  2. Improving classification of psychoses.

    PubMed

    Lawrie, Stephen M; O'Donovan, Michael C; Saks, Elyn; Burns, Tom; Lieberman, Jeffrey A

    2016-04-01

    Psychosis has been recognised as an abnormal state in need of care throughout history and by diverse cultures. Present classifications of psychotic disorder remain based on the presence of specific psychotic symptoms, relative to affective and other symptoms, and their sequence and duration. Although extant diagnostic classifications have restricted validity, they have proven reliability and most clinicians and some patients find them useful. Moreover, these classifications have yet to be replaced by anything better. We propose that an expansion of the subgrouping approach inherent to classification will provide incremental improvement to present diagnostic constructs-as has worked in the rest of medicine. We also propose that subgroups could be created both within and across present diagnostic classifications, taking into consideration the potential value of continuous measures (eg, duration of psychotic symptoms and intelligence quotient). Health-care workers also need to work with service users and carers to develop and adapt approaches to diagnosis that are seen as helpful. PMID:27063387

  3. Phylogenetic relationships and the evolution of BMP4 in triggerfishes and filefishes (Balistoidea).

    PubMed

    McCord, Charlene L; Westneat, Mark W

    2016-01-01

    The triggerfishes (family Balistidae) and filefishes (family Monacanthidae) comprise a charismatic superfamily (Balistoidea) within the diverse order Tetraodontiformes. This group of largely marine fishes occupies an impressive ecological range across the world's oceans, and is well known for its locomotor and feeding diversity, unusual body shapes, small genome size, and ecological and economic importance. In order to investigate the evolutionary history of these important fish families, we used multiple phylogenetic methods to analyze molecular data from 86 species spanning the extant biodiversity of Balistidae and Monacanthidae. In addition to three gene regions that have been used extensively in phylogenetic analyses, we include sequence data for two mitochondrial regions, two nuclear markers, and the growth factor gene bmp4, which is involved with cranial development. Phylogenetic analyses strongly support the monophyly of the superfamily Balistoidea, the sister-family relationship of Balistidae and Monacanthidae, as well as three triggerfish and four filefish clades that are well resolved. A new classification for the Balistidae is proposed based on phylogenetic groups. Bayesian topology, as well as the timing of major cladogenesis events, is largely congruent with previous hypotheses of balistid phylogeny. However, we present a novel topology for major clades in the filefish family that illustrate the genera Aluterus and Stephanolepis are more closely related than previously posited. Molecular rates suggest a Miocene and Oligocene origin for the families Balistidae and Monacanthidae, respectively, and significant divergence of species in both families within the past 5 million years. A second key finding of this study is that, relative to the other protein-coding gene regions in our DNA supermatrix, bmp4 shows a rapid accumulation of both synonymous and non-synonymous substitutions, especially within the family Monacanthidae. Overall substitution patterns in bmp4 support the hypothesis of stabilizing selection during the evolutionary history of regulatory genes, with a small number of isolated examples of accelerated non-synonymous changes detected in our phylogeny. PMID:26408967

  4. Phylogenetic relationships of Malaysia's long-tailed macaques, Macaca fascicularis, based on cytochrome b sequences.

    PubMed

    Abdul-Latiff, Muhammad Abu Bakar; Ruslin, Farhani; Fui, Vun Vui; Abu, Mohd-Hashim; Rovie-Ryan, Jeffrine Japning; Abdul-Patah, Pazil; Lakim, Maklarin; Roos, Christian; Yaakop, Salmah; Md-Zain, Badrul Munir

    2014-01-01

    Phylogenetic relationships among Malaysia's long-tailed macaques have yet to be established, despite abundant genetic studies of the species worldwide. The aims of this study are to examine the phylogenetic relationships of Macaca fascicularis in Malaysia and to test its classification as a morphological subspecies. A total of 25 genetic samples of M. fascicularis yielding 383 bp of Cytochrome b (Cyt b) sequences were used in phylogenetic analysis along with one sample each of M. nemestrina and M. arctoides used as outgroups. Sequence character analysis reveals that Cyt b locus is a highly conserved region with only 23% parsimony informative character detected among ingroups. Further analysis indicates a clear separation between populations originating from different regions; the Malay Peninsula versus Borneo Insular, the East Coast versus West Coast of the Malay Peninsula, and the island versus mainland Malay Peninsula populations. Phylogenetic trees (NJ, MP and Bayesian) portray a consistent clustering paradigm as Borneo's population was distinguished from Peninsula's population (99% and 100% bootstrap value in NJ and MP respectively and 1.00 posterior probability in Bayesian trees). The East coast population was separated from other Peninsula populations (64% in NJ, 66% in MP and 0.53 posterior probability in Bayesian). West coast populations were divided into 2 clades: the North-South (47%/54% in NJ, 26/26% in MP and 1.00/0.80 posterior probability in Bayesian) and Island-Mainland (93% in NJ, 90% in MP and 1.00 posterior probability in Bayesian). The results confirm the previous morphological assignment of 2 subspecies, M. f. fascicularis and M. f. argentimembris, in the Malay Peninsula. These populations should be treated as separate genetic entities in order to conserve the genetic diversity of Malaysia's M. fascicularis. These findings are crucial in aiding the conservation management and translocation process of M. fascicularis populations in Malaysia. PMID:24899832

  5. Behavior Based Social Dimensions Extraction for Multi-Label Classification.

    PubMed

    Li, Le; Xu, Junyi; Xiao, Weidong; Ge, Bin

    2016-01-01

    Classification based on social dimensions is commonly used to handle the multi-label classification task in heterogeneous networks. However, traditional methods, which mostly rely on the community detection algorithms to extract the latent social dimensions, produce unsatisfactory performance when community detection algorithms fail. In this paper, we propose a novel behavior based social dimensions extraction method to improve the classification performance in multi-label heterogeneous networks. In our method, nodes' behavior features, instead of community memberships, are used to extract social dimensions. By introducing Latent Dirichlet Allocation (LDA) to model the network generation process, nodes' connection behaviors with different communities can be extracted accurately, which are applied as latent social dimensions for classification. Experiments on various public datasets reveal that the proposed method can obtain satisfactory classification results in comparison to other state-of-the-art methods on smaller social dimensions. PMID:27049849

  6. Behavior Based Social Dimensions Extraction for Multi-Label Classification

    PubMed Central

    Li, Le; Xu, Junyi; Xiao, Weidong; Ge, Bin

    2016-01-01

    Classification based on social dimensions is commonly used to handle the multi-label classification task in heterogeneous networks. However, traditional methods, which mostly rely on the community detection algorithms to extract the latent social dimensions, produce unsatisfactory performance when community detection algorithms fail. In this paper, we propose a novel behavior based social dimensions extraction method to improve the classification performance in multi-label heterogeneous networks. In our method, nodes’ behavior features, instead of community memberships, are used to extract social dimensions. By introducing Latent Dirichlet Allocation (LDA) to model the network generation process, nodes’ connection behaviors with different communities can be extracted accurately, which are applied as latent social dimensions for classification. Experiments on various public datasets reveal that the proposed method can obtain satisfactory classification results in comparison to other state-of-the-art methods on smaller social dimensions. PMID:27049849

  7. The origin and diversification of eukaryotes: problems with molecular phylogenetics and molecular clock estimation

    PubMed Central

    Roger, Andrew J; Hug, Laura A

    2006-01-01

    Determining the relationships among and divergence times for the major eukaryotic lineages remains one of the most important and controversial outstanding problems in evolutionary biology. The sequencing and phylogenetic analyses of ribosomal RNA (rRNA) genes led to the first nearly comprehensive phylogenies of eukaryotes in the late 1980s, and supported a view where cellular complexity was acquired during the divergence of extant unicellular eukaryote lineages. More recently, however, refinements in analytical methods coupled with the availability of many additional genes for phylogenetic analysis showed that much of the deep structure of early rRNA trees was artefactual. Recent phylogenetic analyses of a multiple genes and the discovery of important molecular and ultrastructural phylogenetic characters have resolved eukaryotic diversity into six major hypothetical groups. Yet relationships among these groups remain poorly understood because of saturation of sequence changes on the billion-year time-scale, possible rapid radiations of major lineages, phylogenetic artefacts and endosymbiotic or lateral gene transfer among eukaryotes. Estimating the divergence dates between the major eukaryote lineages using molecular analyses is even more difficult than phylogenetic estimation. Error in such analyses comes from a myriad of sources including: (i) calibration fossil dates, (ii) the assumed phylogenetic tree, (iii) the nucleotide or amino acid substitution model, (iv) substitution number (branch length) estimates, (v) the model of how rates of evolution change over the tree, (vi) error inherent in the time estimates for a given model and (vii) how multiple gene data are treated. By reanalysing datasets from recently published molecular clock studies, we show that when errors from these various sources are properly accounted for, the confidence intervals on inferred dates can be very large. Furthermore, estimated dates of divergence vary hugely depending on the methods used and their assumptions. Accurate dating of divergence times among the major eukaryote lineages will require a robust tree of eukaryotes, a much richer Proterozoic fossil record of microbial eukaryotes assignable to extant groups for calibration, more sophisticated relaxed molecular clock methods and many more genes sampled from the full diversity of microbial eukaryotes. PMID:16754613

  8. Strong phylogenetic signals and phylogenetic niche conservatism in ecophysiological traits across divergent lineages of Magnoliaceae.

    PubMed

    Liu, Hui; Xu, Qiuyuan; He, Pengcheng; Santiago, Louis S; Yang, Keming; Ye, Qing

    2015-01-01

    The early diverged Magnoliaceae shows a historical temperate-tropical distribution among lineages indicating divergent evolution, yet which ecophysiological traits are phylogenetically conserved, and whether these traits are involved in correlated evolution remain unclear. Integrating phylogeny and 20 ecophysiological traits of 27 species, from the four largest sections of Magnoliaceae, we tested the phylogenetic signals of these traits and the correlated evolution between trait pairs. Phylogenetic niche conservatism (PNC) in water-conducting and nutrient-use related traits was identified, and correlated evolution of several key functional traits was demonstrated. Among the three evergreen sections of tropical origin, Gwillimia had the lowest hydraulic-photosynthetic capacity and the highest drought tolerance compared with Manglietia and Michelia. Contrastingly, the temperate centred deciduous section, Yulania, showed high rates of hydraulic conductivity and photosynthesis at the cost of drought tolerance. This study elucidated the regulation of hydraulic and photosynthetic processes in the temperate-tropical adaptations for Magnoliaceae species, which led to strong phylogenetic signals and PNC in ecophysiological traits across divergent lineages of Magnoliaceae. PMID:26179320

  9. Strong phylogenetic signals and phylogenetic niche conservatism in ecophysiological traits across divergent lineages of Magnoliaceae

    PubMed Central

    Liu, Hui; Xu, Qiuyuan; He, Pengcheng; Santiago, Louis S.; Yang, Keming; Ye, Qing

    2015-01-01

    The early diverged Magnoliaceae shows a historical temperate-tropical distribution among lineages indicating divergent evolution, yet which ecophysiological traits are phylogenetically conserved, and whether these traits are involved in correlated evolution remain unclear. Integrating phylogeny and 20 ecophysiological traits of 27 species, from the four largest sections of Magnoliaceae, we tested the phylogenetic signals of these traits and the correlated evolution between trait pairs. Phylogenetic niche conservatism (PNC) in water-conducting and nutrient-use related traits was identified, and correlated evolution of several key functional traits was demonstrated. Among the three evergreen sections of tropical origin, Gwillimia had the lowest hydraulic-photosynthetic capacity and the highest drought tolerance compared with Manglietia and Michelia. Contrastingly, the temperate centred deciduous section, Yulania, showed high rates of hydraulic conductivity and photosynthesis at the cost of drought tolerance. This study elucidated the regulation of hydraulic and photosynthetic processes in the temperate-tropical adaptations for Magnoliaceae species, which led to strong phylogenetic signals and PNC in ecophysiological traits across divergent lineages of Magnoliaceae. PMID:26179320

  10. Following phylogenetic tracks of Astragalus cicer microsymbionts.

    PubMed

    Wdowiak-Wróbel, Sylwia; Małek, Wanda

    2010-01-01

    A multilocus phylogenetic approach was applied to elucidate the phylogeny of Astragalus cicer rhizobia derived from Poland, Ukraine, and Canada. The strains selected for the studies represented three main geographically different phenons of these bacteria. Phylogenetic analyses were performed with three chromosomal housekeeping loci (16S rRNA, atpD, glnII) and three symbiotic genes located on a plasmid (nodA, nodC, nifH). The "core" and "auxiliary" gene trees revealed that A. cicer nodule isolates were intermingled with the strains of Mesorhizobium species, which implies that they are descendents of the same ancestor as mesorhizobia and fall into the Mesorhizobium genus. The noted congruence of the housekeeping and symbiotic gene phylogenies of A. cicer microsymbionts indicates that sym loci are transferred to these bacteria through vertical transmission without a significant participation of intergeneric horizontal gene spread. All the three sym gene sequences of the Polish and Ukrainian A. cicer nodule isolates were more closely related to one another than to the corresponding sequences of the Canadian isolates. The phylogeographic patterns of the sym genes of intercontinental strains point to their relatively long, separate, evolutionary history. PMID:19821140

  11. Phylogenetic analysis of honey bee behavioral evolution.

    PubMed

    Raffiudin, Rika; Crozier, Ross H

    2007-05-01

    DNA sequences from three mitochondrial (rrnL, cox2, nad2) and one nuclear gene (itpr) from all 9 known honey bee species (Apis), a 10th possible species, Apis dorsata binghami, and three outgroup species (Bombus terrestris, Melipona bicolor and Trigona fimbriata) were used to infer Apis phylogenetic relationships using Bayesian analysis. The dwarf honey bees were confirmed as basal, and the giant and cavity-nesting species to be monophyletic. All nodes were strongly supported except that grouping Apis cerana with A. nigrocincta. Two thousand post-burnin trees from the phylogenetic analysis were used in a Bayesian comparative analysis to explore the evolution of dance type, nest structure, comb structure and dance sound within Apis. The ancestral honey bee species was inferred with high support to have nested in the open, and to have more likely than not had a silent vertical waggle dance and a single comb. The common ancestor of the giant and cavity-dwelling bees is strongly inferred to have had a buzzing vertical directional dance. All pairwise combinations of characters showed strong association, but the multiple comparisons problem reduces the ability to infer associations between states between characters. Nevertheless, a buzzing dance is significantly associated with cavity-nesting, several vertical combs, and dancing vertically, a horizontal dance is significantly associated with a nest with a single comb wrapped around the support, and open nesting with a single pendant comb and a silent waggle dance. PMID:17123837

  12. Phylogenetic hypotheses for the turtle family Geoemydidae.

    PubMed

    Spinks, Phillip Q; Shaffer, H Bradley; Iverson, John B; McCord, William P

    2004-07-01

    The turtle family Geoemydidae represents the largest, most diverse, and most poorly understood family of turtles. Little is known about this group, including intrafamilial systematics. The only complete phylogenetic hypothesis for this family positions geoemydids as paraphyletic with respect to tortoises, but this arrangement has not been accepted by many workers. We compiled a 79-taxon mitochondrial and nuclear DNA data set to reconstruct phylogenetic relationships for 65 species and subspecies representing all 23 genera of the Geoemydidae. Maximum parsimony (MP) and maximum-likelihood (ML) analyses and Bayesian analysis produced similar, well-resolved trees. Our analyses identified three main clades comprising the tortoises (Testudinidae), the old-world Geoemydidae, and the South American geoemydid genus Rhinoclemmys. Within Geoemydidae, many nodes were strongly supported, particularly based on Bayesian posterior probabilities of the combined three-gene dataset. We found that adding data for a subset of taxa improved resolution of some deeper nodes in the tree. Several strongly supported groupings within the Geoemydidae demonstrate non-monophyly of some genera and possible interspecific hybrids, and we recommend several taxonomic revisions based on available evidence. PMID:15186805

  13. A Distance Measure for Genome Phylogenetic Analysis

    NASA Astrophysics Data System (ADS)

    Cao, Minh Duc; Allison, Lloyd; Dix, Trevor

    Phylogenetic analyses of species based on single genes or parts of the genomes are often inconsistent because of factors such as variable rates of evolution and horizontal gene transfer. The availability of more and more sequenced genomes allows phylogeny construction from complete genomes that is less sensitive to such inconsistency. For such long sequences, construction methods like maximum parsimony and maximum likelihood are often not possible due to their intensive computational requirement. Another class of tree construction methods, namely distance-based methods, require a measure of distances between any two genomes. Some measures such as evolutionary edit distance of gene order and gene content are computational expensive or do not perform well when the gene content of the organisms are similar. This study presents an information theoretic measure of genetic distances between genomes based on the biological compression algorithm expert model. We demonstrate that our distance measure can be applied to reconstruct the consensus phylogenetic tree of a number of Plasmodium parasites from their genomes, the statistical bias of which would mislead conventional analysis methods. Our approach is also used to successfully construct a plausible evolutionary tree for the γ-Proteobacteria group whose genomes are known to contain many horizontally transferred genes.

  14. Phylogenetic circumscription of Arthrographis (Eremomycetaceae, Dothideomycetes).

    PubMed

    Giraldo, A; Gen, J; Sutton, D A; Madrid, H; Cano, J; Crous, P W; Guarro, J

    2014-06-01

    Numerous members of Ascomycota and Basidiomycota produce only poorly differentiated arthroconidial asexual morphs in culture. These arthroconidial fungi are grouped in genera where the asexual-sexual connections and their taxonomic circumscription are poorly known. In the present study we explored the phylogenetic relationships of two of these ascomycetous genera, Arthrographis and Arthropsis. Analysis of D1/D2 sequences of all species of both genera revealed that both are polyphyletic, with species being accommodated in different orders and classes. Because genetic variability was detected among reference strains and fresh isolates resembling the genus Arthrographis, we carried out a detailed phenotypic and phylogenetic analysis based on sequence data of the ITS region, actin and chitin synthase genes. Based on these results, four new species are recognised, namely Arthrographis chlamydospora, A. curvata, A. globosa and A. longispora. Arthrographis chlamydospora is distinguished by its cerebriform colonies, branched conidiophores, cuboid arthroconidia and terminal or intercalary globose to subglobose chlamydospores. Arthrographis curvata produced both sexual and asexual morphs, and is characterised by navicular ascospores and dimorphic conidia, namely cylindrical arthroconidia and curved, cashewnut-shaped conidia formed laterally on vegetative hyphae. Arthrographis globosa produced membranous colonies, but is mainly characterised by doliiform to globose arthroconidia. Arthrographis longispora also produces membranous colonies, but has poorly differentiated conidiophores and long arthroconidia. Morphological variants are described for A. kalrae and our results also revealed that Eremomyces langeronii and A. kalrae, traditionally considered the sexual and asexual morphs of the same species, are not conspecific. PMID:25264385

  15. Face shape differs in phylogenetically related populations.

    PubMed

    Hopman, Saskia M J; Merks, Johannes H M; Suttie, Michael; Hennekam, Raoul C M; Hammond, Peter

    2014-11-01

    3D analysis of facial morphology has delineated facial phenotypes in many medical conditions and detected fine grained differences between typical and atypical patients to inform genotype-phenotype studies. Next-generation sequencing techniques have enabled extremely detailed genotype-phenotype correlative analysis. Such comparisons typically employ control groups matched for age, sex and ethnicity and the distinction between ethnic categories in genotype-phenotype studies has been widely debated. The phylogenetic tree based on genetic polymorphism studies divides the world population into nine subpopulations. Here we show statistically significant face shape differences between two European Caucasian populations of close phylogenetic and geographic proximity from the UK and The Netherlands. The average face shape differences between the Dutch and UK cohorts were visualised in dynamic morphs and signature heat maps, and quantified for their statistical significance using both conventional anthropometry and state of the art dense surface modelling techniques. Our results demonstrate significant differences between Dutch and UK face shape. Other studies have shown that genetic variants influence normal facial variation. Thus, face shape difference between populations could reflect underlying genetic difference. This should be taken into account in genotype-phenotype studies and we recommend that in those studies reference groups be established in the same population as the individuals who form the subject of the study. PMID:24398794

  16. Posterior predictive Bayesian phylogenetic model selection.

    PubMed

    Lewis, Paul O; Xie, Wangang; Chen, Ming-Hui; Fan, Yu; Kuo, Lynn

    2014-05-01

    We present two distinctly different posterior predictive approaches to Bayesian phylogenetic model selection and illustrate these methods using examples from green algal protein-coding cpDNA sequences and flowering plant rDNA sequences. The Gelfand-Ghosh (GG) approach allows dissection of an overall measure of model fit into components due to posterior predictive variance (GGp) and goodness-of-fit (GGg), which distinguishes this method from the posterior predictive P-value approach. The conditional predictive ordinate (CPO) method provides a site-specific measure of model fit useful for exploratory analyses and can be combined over sites yielding the log pseudomarginal likelihood (LPML) which is useful as an overall measure of model fit. CPO provides a useful cross-validation approach that is computationally efficient, requiring only a sample from the posterior distribution (no additional simulation is required). Both GG and CPO add new perspectives to Bayesian phylogenetic model selection based on the predictive abilities of models and complement the perspective provided by the marginal likelihood (including Bayes Factor comparisons) based solely on the fit of competing models to observed data. PMID:24193892

  17. 38 CFR 4.46 - Accurate measurement.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 38 Pensions, Bonuses, and Veterans' Relief 1 2014-07-01 2014-07-01 false Accurate measurement. 4... RATING DISABILITIES Disability Ratings The Musculoskeletal System 4.46 Accurate measurement. Accurate measurement of the length of stumps, excursion of joints, dimensions and location of scars with respect...

  18. 38 CFR 4.46 - Accurate measurement.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 38 Pensions, Bonuses, and Veterans' Relief 1 2013-07-01 2013-07-01 false Accurate measurement. 4... RATING DISABILITIES Disability Ratings The Musculoskeletal System 4.46 Accurate measurement. Accurate measurement of the length of stumps, excursion of joints, dimensions and location of scars with respect...

  19. 38 CFR 4.46 - Accurate measurement.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 38 Pensions, Bonuses, and Veterans' Relief 1 2012-07-01 2012-07-01 false Accurate measurement. 4... RATING DISABILITIES Disability Ratings The Musculoskeletal System 4.46 Accurate measurement. Accurate measurement of the length of stumps, excursion of joints, dimensions and location of scars with respect...

  20. 38 CFR 4.46 - Accurate measurement.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 38 Pensions, Bonuses, and Veterans' Relief 1 2010-07-01 2010-07-01 false Accurate measurement. 4... RATING DISABILITIES Disability Ratings The Musculoskeletal System § 4.46 Accurate measurement. Accurate measurement of the length of stumps, excursion of joints, dimensions and location of scars with respect...

  1. Automated protein subfamily identification and classification.

    PubMed

    Brown, Duncan P; Krishnamurthy, Nandini; Sjölander, Kimmen

    2007-08-01

    Function prediction by homology is widely used to provide preliminary functional annotations for genes for which experimental evidence of function is unavailable or limited. This approach has been shown to be prone to systematic error, including percolation of annotation errors through sequence databases. Phylogenomic analysis avoids these errors in function prediction but has been difficult to automate for high-throughput application. To address this limitation, we present a computationally efficient pipeline for phylogenomic classification of proteins. This pipeline uses the SCI-PHY (Subfamily Classification in Phylogenomics) algorithm for automatic subfamily identification, followed by subfamily hidden Markov model (HMM) construction. A simple and computationally efficient scoring scheme using family and subfamily HMMs enables classification of novel sequences to protein families and subfamilies. Sequences representing entirely novel subfamilies are differentiated from those that can be classified to subfamilies in the input training set using logistic regression. Subfamily HMM parameters are estimated using an information-sharing protocol, enabling subfamilies containing even a single sequence to benefit from conservation patterns defining the family as a whole or in related subfamilies. SCI-PHY subfamilies correspond closely to functional subtypes defined by experts and to conserved clades found by phylogenetic analysis. Extensive comparisons of subfamily and family HMM performances show that subfamily HMMs dramatically improve the separation between homologous and non-homologous proteins in sequence database searches. Subfamily HMMs also provide extremely high specificity of classification and can be used to predict entirely novel subtypes. The SCI-PHY Web server at http://phylogenomics.berkeley.edu/SCI-PHY/ allows users to upload a multiple sequence alignment for subfamily identification and subfamily HMM construction. Biologists wishing to provide their own subfamily definitions can do so. Source code is available on the Web page. The Berkeley Phylogenomics Group PhyloFacts resource contains pre-calculated subfamily predictions and subfamily HMMs for more than 40,000 protein families and domains at http://phylogenomics.berkeley.edu/phylofacts/. PMID:17708678

  2. Automated Protein Subfamily Identification and Classification

    PubMed Central

    Brown, Duncan P; Krishnamurthy, Nandini; Sjlander, Kimmen

    2007-01-01

    Function prediction by homology is widely used to provide preliminary functional annotations for genes for which experimental evidence of function is unavailable or limited. This approach has been shown to be prone to systematic error, including percolation of annotation errors through sequence databases. Phylogenomic analysis avoids these errors in function prediction but has been difficult to automate for high-throughput application. To address this limitation, we present a computationally efficient pipeline for phylogenomic classification of proteins. This pipeline uses the SCI-PHY (Subfamily Classification in Phylogenomics) algorithm for automatic subfamily identification, followed by subfamily hidden Markov model (HMM) construction. A simple and computationally efficient scoring scheme using family and subfamily HMMs enables classification of novel sequences to protein families and subfamilies. Sequences representing entirely novel subfamilies are differentiated from those that can be classified to subfamilies in the input training set using logistic regression. Subfamily HMM parameters are estimated using an information-sharing protocol, enabling subfamilies containing even a single sequence to benefit from conservation patterns defining the family as a whole or in related subfamilies. SCI-PHY subfamilies correspond closely to functional subtypes defined by experts and to conserved clades found by phylogenetic analysis. Extensive comparisons of subfamily and family HMM performances show that subfamily HMMs dramatically improve the separation between homologous and non-homologous proteins in sequence database searches. Subfamily HMMs also provide extremely high specificity of classification and can be used to predict entirely novel subtypes. The SCI-PHY Web server at http://phylogenomics.berkeley.edu/SCI-PHY/ allows users to upload a multiple sequence alignment for subfamily identification and subfamily HMM construction. Biologists wishing to provide their own subfamily definitions can do so. Source code is available on the Web page. The Berkeley Phylogenomics Group PhyloFacts resource contains pre-calculated subfamily predictions and subfamily HMMs for more than 40,000 protein families and domains at http://phylogenomics.berkeley.edu/phylofacts/. PMID:17708678

  3. Assessment of available anatomical characters for linking living mammals to fossil taxa in phylogenetic analyses.

    PubMed

    Guillerme, Thomas; Cooper, Natalie

    2016-05-01

    Analyses of living and fossil taxa are crucial for understanding biodiversity through time. The total evidence method allows living and fossil taxa to be combined in phylogenies, using molecular data for living taxa and morphological data for living and fossil taxa. With this method, substantial overlap of coded anatomical characters among living and fossil taxa is vital for accurately inferring topology. However, although molecular data for living species are widely available, scientists generating morphological data mainly focus on fossils. Therefore, there are fewer coded anatomical characters in living taxa, even in well-studied groups such as mammals. We investigated the number of coded anatomical characters available in phylogenetic matrices for living mammals and how these were phylogenetically distributed across orders. Eleven of 28 mammalian orders have less than 25% species with available characters; this has implications for the accurate placement of fossils, although the issue is less pronounced at higher taxonomic levels. In most orders, species with available characters are randomly distributed across the phylogeny, which may reduce the impact of the problem. We suggest that increased morphological data collection efforts for living taxa are needed to produce accurate total evidence phylogenies. PMID:27146442

  4. Methodology for hyperspectral image classification using novel neural network

    SciTech Connect

    Subramanian, S., Gat, N., Sheffield, M.,; Barhen, J.; Toomarian, N.

    1997-04-01

    A novel feed forward neural network is used to classify hyperspectral data from the AVIRIS sector. The network applies an alternating direction singular value decomposition technique to achieve rapid training times (few seconds per class). Very few samples (10-12) are required for training. 100% accurate classification is obtained using test data sets. The methodology combines this rapid training neural network together with data reduction and maximal feature separation techniques such as principal component analysis and simultaneous diagonalization of covariance matrices, for rapid and accurate classification of large hyperspectral images. The results are compared to those of standard statistical classifiers. 21 refs., 3 figs., 5 tabs.

  5. The Impact of Plant Enemies Shows a Phylogenetic Signal

    PubMed Central

    Gilbert, Gregory S.; Briggs, Heather M.; Magarey, Roger

    2015-01-01

    The host ranges of plant pathogens and herbivores are phylogenetically constrained, so that closely related plant species are more likely to share pests and pathogens. Here we conducted a reanalysis of data from published experimental studies to test whether the severity of host-enemy interactions follows a similar phylogenetic signal. The impact of herbivores and pathogens on their host plants declined steadily with phylogenetic distance from the most severely affected focal hosts. The steepness of this phylogenetic signal was similar to that previously measured for binary-response host ranges. Enemy behavior and development showed similar, but weaker phylogenetic signal, with oviposition and growth rates declining with evolutionary distance from optimal hosts. Phylogenetic distance is an informative surrogate for estimating the likely impacts of a pest or pathogen on potential plant hosts, and may be particularly useful in early assessing risk from emergent plant pests, where critical decisions must be made with incomplete host records. PMID:25893581

  6. Phylogenetic relatedness and the determinants of competitive outcomes.

    PubMed

    Godoy, Oscar; Kraft, Nathan J B; Levine, Jonathan M

    2014-07-01

    Recent hypotheses argue that phylogenetic relatedness should predict both the niche differences that stabilise coexistence and the average fitness differences that drive competitive dominance. These still largely untested predictions complicate Darwin's hypothesis that more closely related species less easily coexist, and challenge the use of community phylogenetic patterns to infer competition. We field parameterised models of competitor dynamics with pairs of 18 California annual plant species, and then related species' niche and fitness differences to their phylogenetic distance. Stabilising niche differences were unrelated to phylogenetic distance, while species' average fitness showed phylogenetic structure. This meant that more distant relatives had greater competitive asymmetry, which should favour the coexistence of close relatives. Nonetheless, coexistence proved unrelated to phylogeny, due in part to increasing variance in fitness differences with phylogenetic distance, a previously overlooked property of such relationships. Together, these findings question the expectation that distant relatives should more readily coexist. PMID:24766326

  7. Phylogenomics of zygomycete fungi: impacts on a phylogenetic classification of Kingdom Fungi

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The zygomycetous fungi (”zygomycetes”) mark the major transition from zoosporic life histories of the common ancestor of Fungi and the earliest diverging chytrid lineages (Chytridiomycota and Blastocladiomycota). Their ecological and economic importance range from the earliest documented symbionts o...

  8. Best Practices for Data Sharing in Phylogenetic Research

    PubMed Central

    Cranston, Karen; Harmon, Luke J.; O'Leary, Maureen A.; Lisle, Curtis

    2014-01-01

    As phylogenetic data becomes increasingly available, along with associated data on species’ genomes, traits, and geographic distributions, the need to ensure data availability and reuse become more and more acute. In this paper, we provide ten “simple rules” that we view as best practices for data sharing in phylogenetic research. These rules will help lead towards a future phylogenetics where data can easily be archived, shared, reused, and repurposed across a wide variety of projects. PMID:24987572

  9. Phylogenetic tree construction based on 2D graphical representation

    NASA Astrophysics Data System (ADS)

    Liao, Bo; Shan, Xinzhou; Zhu, Wen; Li, Renfa

    2006-04-01

    A new approach based on the two-dimensional (2D) graphical representation of the whole genome sequence [Bo Liao, Chem. Phys. Lett., 401(2005) 196.] is proposed to analyze the phylogenetic relationships of genomes. The evolutionary distances are obtained through measuring the differences among the 2D curves. The fuzzy theory is used to construct phylogenetic tree. The phylogenetic relationships of H5N1 avian influenza virus illustrate the utility of our approach.

  10. Acoustic classification of zooplankton

    NASA Astrophysics Data System (ADS)

    Martin Traykovski, Linda V.

    1998-11-01

    Work on the forward problem in zooplankton bioacoustics has resulted in the identification of three categories of acoustic scatterers: elastic-shelled (e.g. pteropods), fluid-like (e.g. euphausiids), and gas-bearing (e.g. siphonophores). The relationship between backscattered energy and animal biomass has been shown to vary by a factor of ~19,000 across these categories, so that to make accurate estimates of zooplankton biomass from acoustic backscatter measurements of the ocean, the acoustic characteristics of the species of interest must be well-understood. This thesis describes the development of both feature based and model based classification techniques to invert broadband acoustic echoes from individual zooplankton for scatterer type, as well as for particular parameters such as animal orientation. The feature based Empirical Orthogonal Function Classifier (EOFC) discriminates scatterer types by identifying characteristic modes of variability in the echo spectra, exploiting only the inherent characteristic structure of the acoustic signatures. The model based Model Parameterisation Classifier (MPC) classifies based on correlation of observed echo spectra with simplified parameterisations of theoretical scattering models for the three classes. The Covariance Mean Variance Classifiers (CMVC) are a set of advanced model based techniques which exploit the full complexity of the theoretical models by searching the entire physical model parameter space without employing simplifying parameterisations. Three different CMVC algorithms were developed: the Integrated Score Classifier (ISC), the Pairwise Score Classifier (PSC) and the Bayesian Probability Classifier (BPC); these classifiers assign observations to a class based on similarities in covariance, mean, and variance, while accounting for model ambiguity and validity. These feature based and model based inversion techniques were successfully applied to several thousand echoes acquired from broadband (~350 kHz-750 kHz) insonifications of live zooplankton collected on Georges Bank and the Gulf of Maine to determine scatterer class. CMVC techniques were also applied to echoes from fluid-like zooplankton (Antarctic krill) to invert for angle of orientation using generic and animal-specific theoretical and empirical models. Application of these inversion techniques in situ will allow correct apportionment of backscattered energy to animal biomass, significantly improving estimates of zooplankton biomass based on acoustic surveys. (Copies available exclusively from MIT Libraries, Rm. 14-0551, Cambridge, MA 02139-4307. Ph. 617-253-5668; Fax 617-253-1690.)

  11. Improved Surgical Site Infection (SSI) rate through accurately assessed surgical wounds

    PubMed Central

    John, Honeymol; Nimeri, Abdelrahman; ELLAHHAM, SAMER

    2015-01-01

    Sheikh Khalifa Medical City's (SKMC) Surgery Institute was identified as a high outlier in Surgical Site Infections (SSI) based on the American College of Surgeons National Surgical Quality Improvement Program (ACS NSQIP) - Semi-Annual Report (SAR) in January 2012. The aim of this project was to improve SSI rates through accurate wound classification. We identified SSI rate reduction as a performance improvement and safety priority at SKMC, a tertiary referral center. We used the American College of Surgeons National Surgical Quality Improvement Program (ACS NSQIP) best practice guidelines as a guide. ACS NSQIP is a clinical registry that provides risk-adjusted clinical outcome reports every six months. The rates of SSI are reported in an observed/expected ratio. The expected ratio is calculated based on the risk factors of the patients which include wound classification. We established a multidisciplinary SSI taskforce. The members of the SSI taskforce included the ACS NSQIP team members, quality, surgeons, nurses, infection control, IT, pharmacy, microbiology, and it was chaired by a colorectal surgeon. The taskforce focused on five areas: pre-op showering and hair removal, skin antisepsis, prophylactic antibiotics, peri-operative maintenance of glycaemia, and normothermia. We planned audits to evaluate our wound classification and our SSI rates based on the SAR. Our expected SSI rates in general surgery and the whole department were 2.52% and 1.70% respectively, while our observed SSI rates were 4.68% and 3.57% respectively, giving us a high outlier status with an odd's ratio of 1.72 and 2.03. Wound classifications were identified as an area of concern. For example, wound classifications were preoperatively selected based on the default wound classification of the booked procedure in the Electronic Medical Record (EMR) which led to under classifying wounds in many occasions. A total of 998 cases were reviewed, our rate of incorrect wound classification assignment was 36%, and the worst rates were in appendectomies (97%). Over time our incorrect wound classification decreased down to 22%, while at the same time our actual SSI wound occurrences per month and our odds ratio of SSI in the department have decreased an average of six to three per month. We followed the best practice guidelines of the ACS NSQIP. Accurate assessment of wound classification is necessary to make sure the expected SSI rates are not falsely high if wounds are under-classified. The present study shows that accurate wound classification in contaminated and dirty wounds can lead to lower odds ratio of SSI. PMID:26734358

  12. Trinets encode tree-child and level-2 phylogenetic networks.

    PubMed

    van Iersel, Leo; Moulton, Vincent

    2014-06-01

    Phylogenetic networks generalize evolutionary trees, and are commonly used to represent evolutionary histories of species that undergo reticulate evolutionary processes such as hybridization, recombination and lateral gene transfer. Recently, there has been great interest in trying to develop methods to construct rooted phylogenetic networks from triplets, that is rooted trees on three species. However, although triplets determine or encode rooted phylogenetic trees, they do not in general encode rooted phylogenetic networks, which is a potential issue for any such method. Motivated by this fact, Huber and Moulton recently introduced trinets as a natural extension of rooted triplets to networks. In particular, they showed that [Formula: see text] phylogenetic networks are encoded by their trinets, and also conjectured that all "recoverable" rooted phylogenetic networks are encoded by their trinets. Here we prove that recoverable binary level-2 networks and binary tree-child networks are also encoded by their trinets. To do this we prove two decomposition theorems based on trinets which hold for all recoverable binary rooted phylogenetic networks. Our results provide some additional evidence in support of the conjecture that trinets encode all recoverable rooted phylogenetic networks, and could also lead to new approaches to construct phylogenetic networks from trinets. PMID:23680992

  13. Phylogenetic Interrelationships of Ginglymodian Fishes (Actinopterygii: Neopterygii)

    PubMed Central

    López-Arbarello, Adriana

    2012-01-01

    The Ginglymodi is one of the most common, though poorly understood groups of neopterygians, which includes gars, macrosemiiforms, and “semionotiforms.” In particular, the phylogenetic relationships between the widely distributed “semionotiforms,” and between them and other ginglymodians have been enigmatic. Here, the phylogenetic relationships between eight of the 11 “semionotiform” genera, five genera of living and fossil gars and three macrosemiid genera, are analysed through cladistic analysis, based on 90 morphological characters and 37 taxa, including 7 out-group taxa. The results of the analysis show that the Ginglymodi includes two main lineages: Lepisosteiformes and †Semionotiformes. The genera †Pliodetes, †Araripelepidotes, †Lepidotes, †Scheenstia, and †Isanichthys are lepisosteiforms, and not semionotiforms, as previously thought, and these taxa extend the stratigraphic range of the lineage leading to gars back up to the Early Jurassic. A monophyletic †Lepidotes is restricted to the Early Jurassic species, whereas the strongly tritoral species previously referred to †Lepidotes are referred to †Scheenstia. Other species previously referred to †Lepidotes represent other genera or new taxa. The macrosemiids are well nested within semionotiforms, together with †Semionotidae, here restricted to †Semionotus, and a new family including †Callipurbeckia n. gen. minor (previously referred to †Lepidotes), †Macrosemimimus, †Tlayuamichin, †Paralepidotus, and †Semiolepis. Due to the numerous taxonomic changes needed according to the phylogenetic analysis, this article also includes formal taxonomic definitions and diagnoses for all generic and higher taxa, which are new or modified. The study of Mesozoic ginglymodians led to confirm Patterson’s observation that these fishes show morphological affinities with both halecomorphs and teleosts. Therefore, the compilation of large data sets including the Mesozoic ginglymodians and the re-evaluation of several hypotheses of homology are essential to test the hypotheses of the Halecostomi vs. the Holostei, which is one of the major topics in the evolution of Mesozoic vertebrates and the origin of modern fish faunas. PMID:22808031

  14. Phylogenetic and Ontogenetic View of Erythroblastic Islands

    PubMed Central

    Giger, Katie M.; Kalfa, Theodosia A.

    2015-01-01

    Erythroblastic islands are a hallmark of mammalian erythropoiesis consisting of a central macrophage surrounded by and interacting closely with the maturing erythroblasts. The macrophages are thought to serve many functions such as supporting erythroblast proliferation, supplying iron for hemoglobin, promoting enucleation, and clearing the nuclear debris; moreover, inhibition of erythroblastic island formation is often detrimental to erythropoiesis. There is still much not understood about the role that macrophages and microenvironment play in erythropoiesis and insights may be gleaned from a comparative analysis with erythropoietic niches in nonmammalian vertebrates which, unlike mammals, have erythrocytes that retain their nucleus. The phylogenetic development of erythroblastic islands in mammals in which the erythrocytes are anucleate underlines the importance of the macrophage in erythroblast enucleation. PMID:26557707

  15. Dating human cultural capacity using phylogenetic principles

    PubMed Central

    Lind, J.; Lindenfors, P.; Ghirlanda, S.; Lidén, K.; Enquist, M.

    2013-01-01

    Humans have genetically based unique abilities making complex culture possible; an assemblage of traits which we term “cultural capacity”. The age of this capacity has for long been subject to controversy. We apply phylogenetic principles to date this capacity, integrating evidence from archaeology, genetics, paleoanthropology, and linguistics. We show that cultural capacity is older than the first split in the modern human lineage, and at least 170,000 years old, based on data on hyoid bone morphology, FOXP2 alleles, agreement between genetic and language trees, fire use, burials, and the early appearance of tools comparable to those of modern hunter-gatherers. We cannot exclude that Neanderthals had cultural capacity some 500,000 years ago. A capacity for complex culture, therefore, must have existed before complex culture itself. It may even originated long before. This seeming paradox is resolved by theoretical models suggesting that cultural evolution is exceedingly slow in its initial stages. PMID:23648831

  16. S1 gene-based phylogeny of infectious bronchitis virus: An attempt to harmonize virus classification.

    PubMed

    Valastro, Viviana; Holmes, Edward C; Britton, Paul; Fusaro, Alice; Jackwood, Mark W; Cattoli, Giovanni; Monne, Isabella

    2016-04-01

    Infectious bronchitis virus (IBV) is the causative agent of a highly contagious disease that results in severe economic losses to the global poultry industry. The virus exists in a wide variety of genetically distinct viral types, and both phylogenetic analysis and measures of pairwise similarity among nucleotide or amino acid sequences have been used to classify IBV strains. However, there is currently no consensus on the method by which IBV sequences should be compared, and heterogeneous genetic group designations that are inconsistent with phylogenetic history have been adopted, leading to the confusing coexistence of multiple genotyping schemes. Herein, we propose a simple and repeatable phylogeny-based classification system combined with an unambiguous and rationale lineage nomenclature for the assignment of IBV strains. By using complete nucleotide sequences of the S1 gene we determined the phylogenetic structure of IBV, which in turn allowed us to define 6 genotypes that together comprise 32 distinct viral lineages and a number of inter-lineage recombinants. Because of extensive rate variation among IBVs, we suggest that the inference of phylogenetic relationships alone represents a more appropriate criterion for sequence classification than pairwise sequence comparisons. The adoption of an internationally accepted viral nomenclature is crucial for future studies of IBV epidemiology and evolution, and the classification scheme presented here can be updated and revised novel S1 sequences should become available. PMID:26883378

  17. Movement disorders: classifications.

    PubMed

    Klein, C

    2005-01-01

    Movement disorders (ataxia, dystonic disorders, gait disorders, Huntington disease, myoclonus, parkinsonism, spasticity, tardive dyskinesia, tics and tremor) are clinically, pathologically and genetically heterogeneous and are characterized by impairment of the planning, control or execution of movement. Current classifications of these disorders have inherent shortcomings due to the complex nature of movement disorders and the lack of diagnostic tests for the majority. Undiscriminating terminology, as well as the clinical, pathological and genetic heterogeneity, further complicate the development of comprehensive categorizations. Modern classification schemes tend to focus on clinical, pathological or genetic/molecular criteria, but more recent attempts have been made to integrate across these levels. From a historical perspective, two 'golden ages' have shaped the current and evolving classification schemes: (1) the definition of clinical pathological entities in the early twentieth century and (2) the application of molecular neurogenetics in the past 10-15 years. However, the classification of movement disorders on clinical grounds (according to age at onset, distribution of symptoms, disease course, provoking factors and therapeutic response) remains one of the most useful modes of categorization. Postmortem criteria have been employed to distinguish between degenerative and nondegenerative disorders, and specific hallmarks may be required to establish or confirm a diagnosis. Genetic features used for classification purposes include mode of inheritance and molecular genetic data, such as linkage to a known gene locus or identification of a specific genetic defect. A final classification scheme is based on alterations in molecular mechanisms (e.g. trinucleotide expansions) or protein function (e.g. channelopathies). Despite recent advances, it may not be possible to develop the 'ultimate' classification of movement disorders, and different patterns of lumping and splitting may be useful for the clinician, the pathologist or the geneticist/molecular biologist. Furthermore, certain individual cases with unique features may not fit into any particular category. Continued research by both clinicians and basic scientists is necessary in order to refine and redefine classification schemes of movement disorders. PMID:15868475

  18. Conservation biology of Malagasy strepsirhines: a phylogenetic approach.

    PubMed

    Lehman, Shawn M

    2006-06-01

    The phylogenetic diversity of extant lemurs represents one of the most important but least studied aspects of the conservation biology of primates. The phylogenetic diversity of a species is inversely proportional to the relative number and closeness of its phylogenetic relatives. Phylogenetic diversity can then be used to determine conservation priorities for specific biogeographic regions. Although Malagasy strepsirhines represent the highest phylogenetic diversity among primates at the global level, there are few phylogenetic data on species-specific and regional conservation plans for lemurs in Madagascar. Therefore, in this paper the following questions are addressed for extant lemurs: 1) how does the measure of taxonomic uniqueness used by Mittermeier et al. (1992 Lemurs of Madagascar; Gland, Switzerland: IUCN) equate with an index of phylogenetic diversity, 2) what are the regional conservation priorities based on analyses of phylogenetic diversity in extant lemurs, and 3) what conservation recommendations can be made based on analyses of phylogenetic diversity in lemurs? Taxonomic endemicity standardized weight (TESW) indices of phylogenetic diversity were used to determine the evolutionary component of biodiversity and to prioritize regions for conserving lemur taxa. TESW refers to the standardization of phylogenetic diversity indices for widespread taxa and endemicity of species. The phylogenetic data came from recent genetic studies of Malagasy strepsirhines at the species level. Lemur species were assigned as being either present or absent in six biogeographic regions. TESW indices were combined with data on lemur complementarity and protected areas to assign conservation priorities at the regional level. Although there were no overall differences between taxonomic ranks and phylogenetic rankings, there were significant differences for the top-ranked taxa. The phylogenetic component of lemur diversity is greatest for Daubentonia madagascariensis, Allocebus trichotis, Lepilemur septentrionalis, Indri indri, and Mirza coquereli. Regional conservation priorities are highest for lemurs that range into northeast humid forests and western dry forests. Expansion of existing protected areas in these regions may provide the most rapid method for preserving lemurs. In the long term, new protected areas must be created because there are lemur species that: 1) are not found in existing protected areas, 2) exist only in one or two protected areas, and 3) are still being discovered outside the current network of protected areas. Data on the population dynamics and feeding ecology of phylogenetically important species are needed to ensure that protected areas adequately conserve lemur populations in Madagascar. PMID:16365858

  19. Progressive Classification Using Support Vector Machines

    NASA Technical Reports Server (NTRS)

    Wagstaff, Kiri; Kocurek, Michael

    2009-01-01

    An algorithm for progressive classification of data, analogous to progressive rendering of images, makes it possible to compromise between speed and accuracy. This algorithm uses support vector machines (SVMs) to classify data. An SVM is a machine learning algorithm that builds a mathematical model of the desired classification concept by identifying the critical data points, called support vectors. Coarse approximations to the concept require only a few support vectors, while precise, highly accurate models require far more support vectors. Once the model has been constructed, the SVM can be applied to new observations. The cost of classifying a new observation is proportional to the number of support vectors in the model. When computational resources are limited, an SVM of the appropriate complexity can be produced. However, if the constraints are not known when the model is constructed, or if they can change over time, a method for adaptively responding to the current resource constraints is required. This capability is particularly relevant for spacecraft (or any other real-time systems) that perform onboard data analysis. The new algorithm enables the fast, interactive application of an SVM classifier to a new set of data. The classification process achieved by this algorithm is characterized as progressive because a coarse approximation to the true classification is generated rapidly and thereafter iteratively refined. The algorithm uses two SVMs: (1) a fast, approximate one and (2) slow, highly accurate one. New data are initially classified by the fast SVM, producing a baseline approximate classification. For each classified data point, the algorithm calculates a confidence index that indicates the likelihood that it was classified correctly in the first pass. Next, the data points are sorted by their confidence indices and progressively reclassified by the slower, more accurate SVM, starting with the items most likely to be incorrectly classified. The user can halt this reclassification process at any point, thereby obtaining the best possible result for a given amount of computation time. Alternatively, the results can be displayed as they are generated, providing the user with real-time feedback about the current accuracy of classification.

  20. Supernova Photometric Lightcurve Classification

    NASA Astrophysics Data System (ADS)

    Zaidi, Tayeb; Narayan, Gautham

    2016-01-01

    This is a preliminary report on photometric supernova classification. We first explore the properties of supernova light curves, and attempt to restructure the unevenly sampled and sparse data from assorted datasets to allow for processing and classification. The data was primarily drawn from the Dark Energy Survey (DES) simulated data, created for the Supernova Photometric Classification Challenge. This poster shows a method for producing a non-parametric representation of the light curve data, and applying a Random Forest classifier algorithm to distinguish between supernovae types. We examine the impact of Principal Component Analysis to reduce the dimensionality of the dataset, for future classification work. The classification code will be used in a stage of the ANTARES pipeline, created for use on the Large Synoptic Survey Telescope alert data and other wide-field surveys. The final figure-of-merit for the DES data in the r band was 60% for binary classification (Type I vs II).Zaidi was supported by the NOAO/KPNO Research Experiences for Undergraduates (REU) Program which is funded by the National Science Foundation Research Experiences for Undergraduates Program (AST-1262829).

  1. Species identification and phylogenetic relationships based on partial HSP60 gene sequences within the genus Staphylococcus.

    PubMed

    Kwok, A Y; Su, S C; Reynolds, R P; Bay, S J; Av-Gay, Y; Dovichi, N J; Chow, A W

    1999-07-01

    The phylogenetic relationships among 36 validly described species or subspecies within the genus Staphylococcus were investigated by cloning and sequencing their 60 kDa heat-shock protein (HSP60) genes using a set of universal degenerate HSP60 PCR primers. The cloned partial HSP60 DNA sequences from nine Staphylococcus aureus strains were highly conserved (97-100% DNA sequence similarity; mean 98%), indicating that the HSP60 gene of multiple isolates within the same species have little microheterogeneity. At the subspecies level, DNA sequence similarity among members of S. aureus, Staphylococcus schleiferi, Staphylococcus cohnii and Staphylococcus capitis ranged from 91 to 98%. At the interspecies level, sequence similarity among 23 distinct species of staphylococci ranged from 74 to 93% (mean 82%). By comparison, the highest sequence similarity of Bacillus subtilis and Escherichia coli with members within the genus Staphylococcus was only 70 and 59%, respectively. Importantly, phylogenetic analysis based on the neighbour-joining distance method revealed remarkable concordance between the tree derived from partial HSP60 gene sequences and that based on genomic DNA-DNA hybridization, while 16S rRNA gene sequences correlated less well. The results demonstrate that DNA sequences from the highly conserved and ubiquitous HSP60 gene offer a convenient and accurate tool for species-specific identification and phylogenetic analysis of staphylococci. PMID:10425778

  2. Multi-locus phylogenetic analysis reveals the pattern and tempo of bony fish evolution

    PubMed Central

    Broughton, Richard E.; Betancur-R., Ricardo; Li, Chenhong; Arratia, Gloria; Ortí, Guillermo

    2013-01-01

    Over half of all vertebrates are “fishes”, which exhibit enormous diversity in morphology, physiology, behavior, reproductive biology, and ecology. Investigation of fundamental areas of vertebrate biology depend critically on a robust phylogeny of fishes, yet evolutionary relationships among the major actinopterygian and sarcopterygian lineages have not been conclusively resolved. Although a consensus phylogeny of teleosts has been emerging recently, it has been based on analyses of various subsets of actinopterygian taxa, but not on a full sample of all bony fishes. Here we conducted a comprehensive phylogenetic study on a broad taxonomic sample of 61 actinopterygian and sarcopterygian lineages (with a chondrichthyan outgroup) using a molecular data set of 21 independent loci. These data yielded a resolved phylogenetic hypothesis for extant Osteichthyes, including 1) reciprocally monophyletic Sarcopterygii and Actinopterygii, as currently understood, with polypteriforms as the first diverging lineage within Actinopterygii; 2) a monophyletic group containing gars and bowfin (= Holostei) as sister group to teleosts; and 3) the earliest diverging lineage among teleosts being Elopomorpha, rather than Osteoglossomorpha. Relaxed-clock dating analysis employing a set of 24 newly applied fossil calibrations reveals divergence times that are more consistent with paleontological estimates than previous studies. Establishing a new phylogenetic pattern with accurate divergence dates for bony fishes illustrates several areas where the fossil record is incomplete and provides critical new insights on diversification of this important vertebrate group. PMID:23788273

  3. The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses

    PubMed Central

    Zhang, Yanjun; Du, Liuwen; Liu, Ao; Chen, Jianjun; Wu, Li; Hu, Weiming; Zhang, Wei; Kim, Kyunghee; Lee, Sang-Choon; Yang, Tae-Jin; Wang, Ying

    2016-01-01

    Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, as well as the difference of simple sequence repeats (SSR) and repeat sequence were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes showed accordance with the updated system of the genus on the whole, but reminded that the evolutionary relationships and the divisions of the genus need further investigation applying more evidences. The availability of these cp genomes provided valuable genetic information for accurately identifying species, taxonomy and phylogenetic resolution and evolution of Epimedium, and assist in exploration and utilization of Epimedium plants. PMID:27014326

  4. morePhyML: improving the phylogenetic tree space exploration with PhyML 3.

    PubMed

    Criscuolo, Alexis

    2011-12-01

    PhyML is a widely used Maximum Likelihood (ML) phylogenetic tree inference software based on a standard hill-climbing method. Starting from an initial tree, the version 3 of PhyML explores the tree space by using "Nearest Neighbor Interchange" (NNI) or "Subtree Pruning and Regrafting" (SPR) tree swapping techniques in order to find the ML phylogenetic tree. NNI-based local searches are fast but can often get trapped in local optima, whereas it is expected that the larger (but slower to cover) SPR-based neighborhoods will lead to trees with higher likelihood. Here, I verify that PhyML infers more likely trees with SPRs than with NNIs in almost all cases. However, I also show that the SPR-based local search of PhyML often does not succeed at locating the ML tree. To improve the tree space exploration, I deliver a script, named morePhyML, which allows escaping from local optima by performing character reweighting. This ML tree search strategy, named ratchet, often leads to higher likelihood estimates. Based on the analysis of a large number of amino acid and nucleotide data, I show that morePhyML allows inferring more accurate phylogenetic trees than several other recently developed ML tree inference softwares in many cases. PMID:21925283

  5. The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses.

    PubMed

    Zhang, Yanjun; Du, Liuwen; Liu, Ao; Chen, Jianjun; Wu, Li; Hu, Weiming; Zhang, Wei; Kim, Kyunghee; Lee, Sang-Choon; Yang, Tae-Jin; Wang, Ying

    2016-01-01

    Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, as well as the difference of simple sequence repeats (SSR) and repeat sequence were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes showed accordance with the updated system of the genus on the whole, but reminded that the evolutionary relationships and the divisions of the genus need further investigation applying more evidences. The availability of these cp genomes provided valuable genetic information for accurately identifying species, taxonomy and phylogenetic resolution and evolution of Epimedium, and assist in exploration and utilization of Epimedium plants. PMID:27014326

  6. TIME-INTEGRATED EXPOSURE MEASURES TO IMPROVE THE PREDICTIVE POWER OF EXPOSURE CLASSIFICATION FOR EPIDEMIOLOGIC STUDIES

    EPA Science Inventory

    Accurate exposure classification tools are required to link exposure with health effects in epidemiological studies. Although long-term integrated exposure measurements are a critical component of exposure assessment, the ability to include these measurements into epidemiologic...

  7. INVENTORY AND CLASSIFICATION OF GREAT LAKES COASTAL WETLANDS FOR MONITORING AND ASSESSMENT AT LARGE SPATIAL SCALES

    EPA Science Inventory

    Monitoring aquatic resources for regional assessments requires an accurate and comprehensive inventory of the resource and useful classification of exosystem similarities. Our research effort to create an electronic database and work with various ways to classify coastal wetlands...

  8. Applying species-tree analyses to deep phylogenetic histories: challenges and potential suggested from a survey of empirical phylogenetic studies.

    PubMed

    Lanier, Hayley C; Knowles, L Lacey

    2015-02-01

    Coalescent-based methods for species-tree estimation are becoming a dominant approach for reconstructing species histories from multi-locus data, with most of the studies examining these methodologies focused on recently diverged species. However, deeper phylogenies, such as the datasets that comprise many Tree of Life (ToL) studies, also exhibit gene-tree discordance. This discord may also arise from the stochastic sorting of gene lineages during the speciation process (i.e., reflecting the random coalescence of gene lineages in ancestral populations). It remains unknown whether guidelines regarding methodologies and numbers of loci established by simulation studies at shallow tree depths translate into accurate species relationships for deeper phylogenetic histories. We address this knowledge gap and specifically identify the challenges and limitations of species-tree methods that account for coalescent variance for deeper phylogenies. Using simulated data with characteristics informed by empirical studies, we evaluate both the accuracy of estimated species trees and the characteristics associated with recalcitrant nodes, with a specific focus on whether coalescent variance is generally responsible for the lack of resolution. By determining the proportion of coalescent genealogies that support a particular node, we demonstrate that (1) species-tree methods account for coalescent variance at deep nodes and (2) mutational variance - not gene-tree discord arising from the coalescent - posed the primary challenge for accurate reconstruction across the tree. For example, many nodes were accurately resolved despite predicted discord from the random coalescence of gene lineages and nodes with poor support were distributed across a range of depths (i.e., they were not restricted to a particular recent divergences). Given their broad taxonomic scope and large sampling of taxa, deep level phylogenies pose several potential methodological complications including difficulties with MCMC convergence and estimation of requisite population genetic parameters for coalescent-based approaches. Despite these difficulties, the findings generally support the utility of species-tree analyses for the estimation of species relationships throughout the ToL. We discuss strategies for successful application of species-tree approaches to deep phylogenies. PMID:25450097

  9. Considering external information to improve the phylogenetic comparison of microbial communities: a new approach based on constrained Double Principal Coordinates Analysis (cDPCoA).

    PubMed

    Dray, S; Pavoine, S; Aguirre de Cárcer, D

    2015-03-01

    The use of next-generation sequencing technologies is revolutionizing microbial ecology by allowing a deep phylogenetic coverage of tens to thousands of samples simultaneously. Double Principal Coordinates Analysis (DPCoA) is a multivariate method, developed in community ecology, able to integrate a distance matrix describing differences among species (e.g. phylogenetic distances) in the analysis of a species abundance matrix. This ordination technique has been used recently to describe microbial communities taking into account phylogenetic relatedness. In this work, we extend DPCoA to integrate the information of external variables measured on communities. The constrained Double Principal Coordinates Analysis (cDPCoA) is able to enforce a priori classifications to retrieve subtle differences and (or) remove the effect of confounding factors. We describe the main principles of this new approach and demonstrate its usefulness by providing application examples based on published 16S rRNA gene data sets. PMID:24974884

  10. Phylogenetic relationships of Mesoamerican spider monkeys (Ateles geoffroyi): Molecular evidence suggests the need for a revised taxonomy.

    PubMed

    Morales-Jimenez, Alba Lucia; Cortés-Ortiz, Liliana; Di Fiore, Anthony

    2015-01-01

    Mesoamerican spider monkeys (Ateles geoffroyi sensu lato) are widely distributed from Mexico to northern Colombia. This group of primates includes many allopatric forms with morphologically distinct pelage color and patterning, but its taxonomy and phylogenetic history are poorly understood. We explored the genetic relationships among the different forms of Mesoamerican spider monkeys using mtDNA sequence data, and we offer a new hypothesis for the evolutionary history of the group. We collected up to ∼800 bp of DNA sequence data from hypervariable region 1 (HV1) of the control region, or D-loop, of the mitochondrion for multiple putative subspecies of Ateles geoffroyi sensu lato. Both maximum likelihood and Bayesian reconstructions, using Ateles paniscus as an outgroup, showed that (1) A. fusciceps and A. geoffroyi form two different monophyletic groups and (2) currently recognized subspecies of A. geoffroyi are not monophyletic. Within A. geoffroyi, our phylogenetic analysis revealed little concordance between any of the classifications proposed for this taxon and their phylogenetic relationships, therefore a new classification is needed for this group. Several possible clades with recent divergence times (1.7-0.8 Ma) were identified within Ateles geoffroyi sensu lato. Some previously recognized taxa were not separated by our data (e.g., A. g. vellerosus and A. g. yucatanensis), while one distinct clade had never been described as a different evolutionary unit based on pelage or geography (Ateles geoffroyi ssp. indet. from El Salvador). Based on well-supported phylogenetic relationships, our results challenge previous taxonomic arrangements for Mesoamerican spider monkeys. We suggest a revised arrangement based on our data and call for a thorough taxonomic revision of this group. PMID:25451800

  11. A Novel Vehicle Classification Using Embedded Strain Gauge Sensors

    PubMed Central

    Zhang, Wenbin; Wang, Qi; Suo, Chunguang

    2008-01-01

    This paper presents a new vehicle classification and develops a traffic monitoring detector to provide reliable vehicle classification to aid traffic management systems. The basic principle of this approach is based on measuring the dynamic strain caused by vehicles across pavement to obtain the corresponding vehicle parameters – wheelbase and number of axles – to then accurately classify the vehicle. A system prototype with five embedded strain sensors was developed to validate the accuracy and effectiveness of the classification method. According to the special arrangement of the sensors and the different time a vehicle arrived at the sensors one can estimate the vehicle's speed accurately, corresponding to the estimated vehicle wheelbase and number of axles. Because of measurement errors and vehicle characteristics, there is a lot of overlap between vehicle wheelbase patterns. Therefore, directly setting up a fixed threshold for vehicle classification often leads to low-accuracy results. Using the machine learning pattern recognition method to deal with this problem is believed as one of the most effective tools. In this study, support vector machines (SVMs) were used to integrate the classification features extracted from the strain sensors to automatically classify vehicles into five types, ranging from small vehicles to combination trucks, along the lines of the Federal Highway Administration vehicle classification guide. Test bench and field experiments will be introduced in this paper. Two support vector machines classification algorithms (one-against-all, one-against-one) are used to classify single sensor data and multiple sensor combination data. Comparison of the two classification method results shows that the classification accuracy is very close using single data or multiple data. Our results indicate that using multiclass SVM-based fusion multiple sensor data significantly improves the results of a single sensor data, which is trained on the whole multisensor data set.

  12. Accurate heart rate estimation from camera recording via MUSIC algorithm.

    PubMed

    Fouladi, Seyyed Hamed; Balasingham, Ilangko; Ramstad, Tor Audun; Kansanen, Kimmo

    2015-08-01

    In this paper, we propose an algorithm to extract heart rate frequency from video camera using the Multiple SIgnal Classification (MUSIC) algorithm. This leads to improved accuracy of the estimated heart rate frequency in cases the performance is limited by the number of samples and frame rate. Monitoring vital signs remotely can be exploited for both non-contact physiological and psychological diagnosis. The color variation recorded by ordinary cameras is used for heart rate monitoring. The orthogonality between signal space and noise space is used to find more accurate heart rate frequency in comparison with traditional methods. It is shown via experimental results that the limitation of previous methods can be overcome by using subspace methods. PMID:26738015

  13. What should an ideal spinal injury classification system consist of? A methodological review and conceptual proposal for future classifications

    PubMed Central

    Audigé, Laurent; Hanson, Beate; Chapman, Jens R.; Hosman, Allard J. F.

    2010-01-01

    Since Böhler published the first categorization of spinal injuries based on plain radiographic examinations in 1929, numerous classifications have been proposed. Despite all these efforts, however, only a few have been tested for reliability and validity. This methodological, conceptual review summarizes that a spinal injury classification system should be clinically relevant, reliable and accurate. The clinical relevance of a classification is directly related to its content validity. The ideal content of a spinal injury classification should only include injury characteristics of the vertebral column, is primarily based on the increasingly routinely performed CT imaging, and is clearly distinctive from severity scales and treatment algorithms. Clearly defined observation and conversion criteria are crucial determinants of classification systems’ reliability and accuracy. Ideally, two principle spinal injury characteristics should be easy to discern on diagnostic images: the specific location and morphology of the injured spinal structure. Given the current evidence and diagnostic imaging technology, descriptions of the mechanisms of injury and ligamentous injury should not be included in a spinal injury classification. The presence of concomitant neurologic deficits can be integrated in a spinal injury severity scale, which in turn can be considered in a spinal injury treatment algorithm. Ideally, a validation pathway of a spinal injury classification system should be completed prior to its clinical and scientific implementation. This review provides a methodological concept which might be considered prior to the synthesis of new or modified spinal injury classifications. PMID:20464432

  14. Phylogenetic diversity (PD) and biodiversity conservation: some bioinformatics challenges

    PubMed Central

    Faith, Daniel P.; Baker, Andrew M.

    2007-01-01

    Biodiversity conservation addresses information challenges through estimations encapsulated in measures of diversity. A quantitative measure of phylogenetic diversity, “PD”, has been defined as the minimum total length of all the phylogenetic branches required to span a given set of taxa on the phylogenetic tree (Faith 1992a). While a recent paper incorrectly characterizes PD as not including information about deeper phylogenetic branches, PD applications over the past decade document the proper incorporation of shared deep branches when assessing the total PD of a set of taxa. Current PD applications to macroinvertebrate taxa in streams of New South Wales, Australia illustrate the practical importance of this definition. Phylogenetic lineages, often corresponding to new, “cryptic”, taxa, are restricted to a small number of stream localities. A recent case of human impact causing loss of taxa in one locality implies a higher PD value for another locality, because it now uniquely represents a deeper branch. This molecular-based phylogenetic pattern supports the use of DNA barcoding programs for biodiversity conservation planning. Here, PD assessments side-step the contentious use of barcoding-based “species” designations. Bio-informatics challenges include combining different phylogenetic evidence, optimization problems for conservation planning, and effective integration of phylogenetic information with environmental and socio-economic data. PMID:19455206

  15. Student Interpretations of Phylogenetic Trees in an Introductory Biology Course

    ERIC Educational Resources Information Center

    Dees, Jonathan; Momsen, Jennifer L.; Niemi, Jarad; Montplaisir, Lisa

    2014-01-01

    Phylogenetic trees are widely used visual representations in the biological sciences and the most important visual representations in evolutionary biology. Therefore, phylogenetic trees have also become an important component of biology education. We sought to characterize reasoning used by introductory biology students in interpreting taxa

  16. Student Interpretations of Phylogenetic Trees in an Introductory Biology Course

    ERIC Educational Resources Information Center

    Dees, Jonathan; Momsen, Jennifer L.; Niemi, Jarad; Montplaisir, Lisa

    2014-01-01

    Phylogenetic trees are widely used visual representations in the biological sciences and the most important visual representations in evolutionary biology. Therefore, phylogenetic trees have also become an important component of biology education. We sought to characterize reasoning used by introductory biology students in interpreting taxa…

  17. Different Relationships between Temporal Phylogenetic Turnover and Phylogenetic Similarity and in Two Forests Were Detected by a New Null Model

    PubMed Central

    Huang, Jian-Xiong; Zhang, Jian; Shen, Yong; Lian, Ju-yu; Cao, Hong-lin; Ye, Wan-hui; Wu, Lin-fang; Bin, Yue

    2014-01-01

    Background Ecologists have been monitoring community dynamics with the purpose of understanding the rates and causes of community change. However, there is a lack of monitoring of community dynamics from the perspective of phylogeny. Methods/Principle Findings We attempted to understand temporal phylogenetic turnover in a 50 ha tropical forest (Barro Colorado Island, BCI) and a 20 ha subtropical forest (Dinghushan in southern China, DHS). To obtain temporal phylogenetic turnover under random conditions, two null models were used. The first shuffled names of species that are widely used in community phylogenetic analyses. The second simulated demographic processes with careful consideration on the variation in dispersal ability among species and the variations in mortality both among species and among size classes. With the two models, we tested the relationships between temporal phylogenetic turnover and phylogenetic similarity at different spatial scales in the two forests. Results were more consistent with previous findings using the second null model suggesting that the second null model is more appropriate for our purposes. With the second null model, a significantly positive relationship was detected between phylogenetic turnover and phylogenetic similarity in BCI at a 10 m×10 m scale, potentially indicating phylogenetic density dependence. This relationship in DHS was significantly negative at three of five spatial scales. This could indicate abiotic filtering processes for community assembly. Using variation partitioning, we found phylogenetic similarity contributed to variation in temporal phylogenetic turnover in the DHS plot but not in BCI plot. Conclusions/Significance The mechanisms for community assembly in BCI and DHS vary from phylogenetic perspective. Only the second null model detected this difference indicating the importance of choosing a proper null model. PMID:24748022

  18. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification.

    PubMed

    Wang, Yin; Li, Rudong; Zhou, Yuhua; Ling, Zongxin; Guo, Xiaokui; Xie, Lu; Liu, Lei

    2016-01-01

    Background. Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Results. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. Conclusions. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data. PMID:27057545

  19. Consistency between molecular phylogeny and morphological classification of the Salix matsudana Koidz. complex (Salicaceae).

    PubMed

    Du, S H; Wang, Z S; Li, Y X; Wang, D S; Zhang, J G

    2015-01-01

    The morphological species concept is based on morpho-logical traits, which are often subject to subjectivity or artifact. Molecular evidence is needed to test the reliability of morphological classification of taxa that are controversial and to provide appropriate taxonomic de-limitation. In this study, we used 15 single-copy nuclear loci and 2 chloroplast fragments to verify the morphological classification of the Salix matsudana Koidz. complex using phylogenetic approaches. Complete sequence alignment showed slight diversification in nuclear sequences and no variety in chloroplast DNA fragments. Phylogenetic trees revealed a monophyletic group consisting of all individuals of S. matsudana and 2 clades within this group, with a 100% bootstrap support value and 1.00 posterior probability. The topology of the phylogenetic trees was highly consistent with the morphological classification of the S. matsudana complex. Verifying the genetic background of these classification units based on remarkable morphological differences will provide a foundation for future studies of Salix and the breeding of new horticultural varieties. PMID:26345798

  20. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification

    PubMed Central

    Wang, Yin; Zhou, Yuhua; Ling, Zongxin; Guo, Xiaokui; Xie, Lu; Liu, Lei

    2016-01-01

    Background. Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Results. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. Conclusions. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data. PMID:27057545

  1. Open Reading Frame Phylogenetic Analysis on the Cloud

    PubMed Central

    2013-01-01

    Phylogenetic analysis has become essential in researching the evolutionary relationships between viruses. These relationships are depicted on phylogenetic trees, in which viruses are grouped based on sequence similarity. Viral evolutionary relationships are identified from open reading frames rather than from complete sequences. Recently, cloud computing has become popular for developing internet-based bioinformatics tools. Biocloud is an efficient, scalable, and robust bioinformatics computing service. In this paper, we propose a cloud-based open reading frame phylogenetic analysis service. The proposed service integrates the Hadoop framework, virtualization technology, and phylogenetic analysis methods to provide a high-availability, large-scale bioservice. In a case study, we analyze the phylogenetic relationships among Norovirus. Evolutionary relationships are elucidated by aligning different open reading frame sequences. The proposed platform correctly identifies the evolutionary relationships between members of Norovirus. PMID:23671843

  2. Open reading frame phylogenetic analysis on the cloud.

    PubMed

    Hung, Che-Lun; Lin, Chun-Yuan

    2013-01-01

    Phylogenetic analysis has become essential in researching the evolutionary relationships between viruses. These relationships are depicted on phylogenetic trees, in which viruses are grouped based on sequence similarity. Viral evolutionary relationships are identified from open reading frames rather than from complete sequences. Recently, cloud computing has become popular for developing internet-based bioinformatics tools. Biocloud is an efficient, scalable, and robust bioinformatics computing service. In this paper, we propose a cloud-based open reading frame phylogenetic analysis service. The proposed service integrates the Hadoop framework, virtualization technology, and phylogenetic analysis methods to provide a high-availability, large-scale bioservice. In a case study, we analyze the phylogenetic relationships among Norovirus. Evolutionary relationships are elucidated by aligning different open reading frame sequences. The proposed platform correctly identifies the evolutionary relationships between members of Norovirus. PMID:23671843

  3. Phylogenetic Framework and Molecular Signatures for the Main Clades of the Phylum Actinobacteria

    PubMed Central

    Gao, Beile

    2012-01-01

    Summary: The phylum Actinobacteria harbors many important human pathogens and also provides one of the richest sources of natural products, including numerous antibiotics and other compounds of biotechnological interest. Thus, a reliable phylogeny of this large phylum and the means to accurately identify its different constituent groups are of much interest. Detailed phylogenetic and comparative analyses of >150 actinobacterial genomes reported here form the basis for achieving these objectives. In phylogenetic trees based upon 35 conserved proteins, most of the main groups of Actinobacteria as well as a number of their superageneric clades are resolved. We also describe large numbers of molecular markers consisting of conserved signature indels in protein sequences and whole proteins that are specific for either all Actinobacteria or their different clades (viz., orders, families, genera, and subgenera) at various taxonomic levels. These signatures independently support the existence of different phylogenetic clades, and based upon them, it is now possible to delimit the phylum Actinobacteria (excluding Coriobacteriia) and most of its major groups in clear molecular terms. The species distribution patterns of these markers also provide important information regarding the interrelationships among different main orders of Actinobacteria. The identified molecular markers, in addition to enabling the development of a stable and reliable phylogenetic framework for this phylum, also provide novel and powerful means for the identification of different groups of Actinobacteria in diverse environments. Genetic and biochemical studies on these Actinobacteria-specific markers should lead to the discovery of novel biochemical and/or other properties that are unique to different groups of Actinobacteria. PMID:22390973

  4. Bayesian models for comparative analysis integrating phylogenetic uncertainty

    PubMed Central

    2012-01-01

    Background Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. Methods We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. Results We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Conclusions Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. We show that BUGS is a useful, flexible general purpose tool for phylogenetic comparative analyses, particularly for modelling in the face of phylogenetic uncertainty and accounting for measurement error or individual variation in explanatory variables. Code for all models is provided in the BUGS model description language. PMID:22741602

  5. Proteomic applications of automated GPCR classification.

    PubMed

    Davies, Matthew N; Gloriam, David E; Secker, Andrew; Freitas, Alex A; Mendao, Miguel; Timmis, Jon; Flower, Darren R

    2007-08-01

    The G-protein coupled receptor (GPCR) superfamily fulfils various metabolic functions and interacts with a diverse range of ligands. There is a lack of sequence similarity between the six classes that comprise the GPCR superfamily. Moreover, most novel GPCRs found have low sequence similarity to other family members which makes it difficult to infer properties from related receptors. Many different approaches have been taken towards developing efficient and accurate methods for GPCR classification, ranging from motif-based systems to machine learning as well as a variety of alignment-free techniques based on the physiochemical properties of their amino acid sequences. This review describes the inherent difficulties in developing a GPCR classification algorithm and includes techniques previously employed in this area. PMID:17639603

  6. Segmentation assisted food classification for dietary assessment

    NASA Astrophysics Data System (ADS)

    Zhu, Fengqing; Bosch, Marc; Schap, TusaRebecca; Khanna, Nitin; Ebert, David S.; Boushey, Carol J.; Delp, Edward J.

    2011-03-01

    Accurate methods and tools to assess food and nutrient intake are essential for the association between diet and health. Preliminary studies have indicated that the use of a mobile device with a built-in camera to obtain images of the food consumed may provide a less burdensome and more accurate method for dietary assessment. We are developing methods to identify food items using a single image acquired from the mobile device. Our goal is to automatically determine the regions in an image where a particular food is located (segmentation) and correctly identify the food type based on its features (classification or food labeling). Images of foods are segmented using Normalized Cuts based on intensity and color. Color and texture features are extracted from each segmented food region. Classification decisions for each segmented region are made using support vector machine methods. The segmentation of each food region is refined based on feedback from the output of classifier to provide more accurate estimation of the quantity of food consumed.

  7. Segmentation Assisted Food Classification for Dietary Assessment

    PubMed Central

    Zhu, Fengqing; Bosch, Marc; Schap, TusaRebecca; Khanna, Nitin; Ebert, David S.; Boushey, Carol J.; Delp, Edward J.

    2011-01-01

    Accurate methods and tools to assess food and nutrient intake are essential for the association between diet and health. Preliminary studies have indicated that the use of a mobile device with a built-in camera to obtain images of the food consumed may provide a less burdensome and more accurate method for dietary assessment. We are developing methods to identify food items using a single image acquired from the mobile device. Our goal is to automatically determine the regions in an image where a particular food is located (segmentation) and correctly identify the food type based on its features (classification or food labeling). Images of foods are segmented using Normalized Cuts based on intensity and color. Color and texture features are extracted from each segmented food region. Classification decisions for each segmented region are made using support vector machine methods. The segmentation of each food region is refined based on feedback from the output of classifier to provide more accurate estimation of the quantity of food consumed. PMID:22128304

  8. Automated Stellar Spectral Classification

    NASA Astrophysics Data System (ADS)

    Bailer-Jones, Coryn; Irwin, Mike; von Hippel, Ted

    1996-05-01

    Stellar classification has long been a useful tool for probing important astrophysical phenomena. Beyond simply categorizing stars it yields fundamental stellar parameters, acts as a probe of galactic abundance distributions and gives a first foothold on the cosmological distance ladder. The MK system in particular has survived on account of its robustness to changes in the calibrations of the physical parameters. Nonetheless, if stellar classification is to continue as a useful tool in stellar surveys, then it must adapt to keep pace with the large amounts of data which will be acquired as magnitude limits are pushed ever deeper. We are working on a project to automate the multi-parameter classification of visual stellar spectra, using artificial neural networks and other techniques. Our techniques have been developed with 10,000 spectra (B <= 11) extracted from the objective prism plates of the Michigan Blue Survey, as well as follow-up CCD objective prism spectra of fainter stars (B <= 14). In addition to classification using the whole spectrum over the MK range, we have investigated the use of Principal Components Analysis as a front-end compression of the data. Our continuing work also looks at the application of synthetic spectra to the direct classification of spectra in terms of the physical parameters of Teff, log g, and [Fe/H].

  9. Application of fuzzy classification in modern primary dental care.

    PubMed

    Veryha, Yauheni; Adamczyk, Katarzyna

    2005-01-01

    This paper describes a framework for implementing fuzzy classifications in primary dental care services. Dental practices aim to provide the highest quality services for their patients. To achieve this, it is important that dentists are able to obtain patients' opinions about their experiences in the dental practice and are able to accurately evaluate this. We propose the use of fuzzy classification to combine various assessment criteria into one general measure to assess patients' satisfaction with primary dental care services. The proposed framework can be used in conventional dental practice information systems and easily integrated with those already used. The benefits of using the proposed fuzzy classification approach include more flexible and accurate analysis of patients' feedback, combining verbal and numeric data. To confirm our theory, a prototype was developed based on the Microsoft SQL Server database management system for two criteria used in dental practices, namely making an appointment with a dentist and waiting time for dental care services. PMID:15949172

  10. 38 CFR 4.46 - Accurate measurement.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 38 Pensions, Bonuses, and Veterans' Relief 1 2011-07-01 2011-07-01 false Accurate measurement. 4.46 Section 4.46 Pensions, Bonuses, and Veterans' Relief DEPARTMENT OF VETERANS AFFAIRS SCHEDULE FOR RATING DISABILITIES Disability Ratings The Musculoskeletal System § 4.46 Accurate measurement....

  11. Finding Single Copy Genes Out of Sequenced Genomes for Multilocus Phylogenetics in Non-Model Fungi

    PubMed Central

    Feau, Nicolas; Decourcelle, Thibaut; Husson, Claude; Desprez-Loustau, Marie-Laure; Dutech, Cyril

    2011-01-01

    Historically, fungal multigene phylogenies have been reconstructed based on a small number of commonly used genes. The availability of complete fungal genomes has given rise to a new wave of model organisms that provide large number of genes potentially useful for building robust gene genealogies. Unfortunately, cross-utilization of these resources to study phylogenetic relationships in the vast majority of non-model fungi (i.e. “orphan” species) remains an unexamined question. To address this problem, we developed a method coupled with a program named “PHYLORPH” (PHYLogenetic markers for ORPHans). The method screens fungal genomic databases (107 fungal genomes fully sequenced) for single copy genes that might be easily transferable and well suited for studies at low taxonomic levels (for example, in species complexes) in non-model fungal species. To maximize the chance to target genes with informative regions, PHYLORPH displays a graphical evaluation system based on the estimation of nucleotide divergence relative to substitution type. The usefulness of this approach was tested by developing markers in four non-model groups of fungal pathogens. For each pathogen considered, 7 to 40% of the 10–15 best candidate genes proposed by PHYLORPH yielded sequencing success. Levels of polymorphism of these genes were compared with those obtained for some genes traditionally used to build fungal phylogenies (e.g. nuclear rDNA, β-tubulin, γ-actin, Elongation factor EF-1α). These genes were ranked among the best-performing ones and resolved accurately taxa relationships in each of the four non-model groups of fungi considered. We envision that PHYLORPH will constitute a useful tool for obtaining new and accurate phylogenetic markers to resolve relationships between closely related non-model fungal species. PMID:21533204

  12. Identification of two phylogenetic lineages of equine hepacivirus and high prevalence in Brazil.

    PubMed

    Figueiredo, Andreza Soriano; Lampe, Elisabeth; do Esprito-Santo, Mrcia Paschoal; Mello, Francisco Campello do Amaral; de Almeida, Fernando Queiroz; de Lemos, Elba Regina Sampaio; Godoi, Tatianne Leme Oliveira Santos; Dimache, Luana Avila Giorgia; Dos Santos, Debora Regina Lopes; Villar, Livia Melo

    2015-12-01

    Non-primate hepacivirus (NPHV), as described in horses, is the virus most genetically related to hepatitis C virus (HCV). Although detected worldwide, limited data on genomic variability and distribution of NPHV are available in Latin America. The aim of this study was to investigate the genetic diversity and prevalence of equine NPHV in Brazil. Thirteen percent of 202 equines from three Brazilian states were positive for NPHV genome by reverse transcriptase PCR. Nucleotide sequences of the partial NS5B genome presented the greatest diversity described to date (25.6%), which is comparable to the upper limit of diversity for HCV subtype classification for the same region. Phylogenetic analysis revealed that Brazilian NPHV sequences along with isolates worldwide form two strongly supported clades (pp?=?1.0) suggesting the existence of two distinct lineages. PMID:26545848

  13. Application of COI Sequences in Studies of Phylogenetic Relationships Among 40 Apionidae Species

    PubMed Central

    Ptaszyńska, Aneta A.; Łętowski, Jacek; Gnat, Sebastian; Małek, Wanda

    2012-01-01

    The systematics of the family Apionidae, as well as the superfamily Curculionoidea, is currently in a state of flux. The comparative analyses of COI sequences from our studies shed some light on the systematics of these weevils. To study the relationship among the organisms of the family Apionidae, we determined the COI sequences of representatives of 23 species and 15 genera, i.e., Apion, Betulapion, Catapion, Ceratapion, Cyanapion, Eutrichapion, Exapion, Hemitrichapion, Holotrichapion, Ischnopterapion, Protapion, Pseudoperapion, Psudoprotapion, Pseudostenapion, and Stenopterapion. Then, they were compared with the COI sequences of 19 species and eight genera from GenBank (Aspidapion, Ceratapion, Exapion, Ischnopterapion, Lepidapion, Omphalapion, Oxystoma, and Protapion). The phylogenetic relationships inferred from molecular data are similar to the classification system developed by Alonso-Zarazaga and Lyal (1999), with some exceptions within the tribe Oxystomatini, and genera Ceratapion and Exapion. PMID:22934614

  14. Using minimum DNA marker loci for accurate population classification in rice (Oryza sativa L.)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Using few DNA markers to classify genetic background of a germplasm pool will help breeders make a quick decision while saving time and resources. WHICHLOCI is a computer program that selects the best combination of loci for population assignment through empiric analysis of molecular marker data. Th...

  15. Spatial Pattern Classification for More Accurate Forecasting of Variable Energy Resources

    NASA Astrophysics Data System (ADS)

    Novakovskaia, E.; Hayes, C.; Collier, C.

    2014-12-01

    The accuracy of solar and wind forecasts is becoming increasingly essential as grid operators continue to integrate additional renewable generation onto the electric grid. Forecast errors affect rate payers, grid operators, wind and solar plant maintenance crews and energy traders through increases in prices, project down time or lost revenue. While extensive and beneficial efforts were undertaken in recent years to improve physical weather models for a broad spectrum of applications these improvements have generally not been sufficient to meet the accuracy demands of system planners. For renewables, these models are often used in conjunction with additional statistical models utilizing both meteorological observations and the power generation data. Forecast accuracy can be dependent on specific weather regimes for a given location. To account for these dependencies it is important that parameterizations used in statistical models change as the regime changes. An automated tool, based on an artificial neural network model, has been developed to identify different weather regimes as they impact power output forecast accuracy at wind or solar farms. In this study, improvements in forecast accuracy were analyzed for varying time horizons for wind farms and utility-scale PV plants located in different geographical regions.

  16. A phylogenetic overview of the Agaricomycotina.

    PubMed

    Hibbett, David S

    2006-01-01

    The Agaricomycotina contains about one-third of the described species of Fungi, including mushrooms, jelly fungi and basidiomycetous yeasts. Recent phylogenetic analyses by P. Matheny and colleagues combining nuclear rRNA genes with the protein-coding genes rpb1, rpb2 and tef1 support the division of Agaricomycotina into Tremellomycetes, Dacrymycetes and Agaricomycetes. There is strong support for the monophyly of the Tremellomycetes, and its position as the sister group of the rest of the Agaricomycotina. Dacrymycetes and Agaricomycetes also are supported strongly, and together they form a clade that is equivalent to the Hymenomycetidae of Swann and Taylor. The deepest nodes in the Agaricomycetes, which are supported only by Bayesian measures of confidence, suggest that the Sebacinales, Cantharellales and Auriculariales are among the most ancient lineages. For the first time, the Polyporales are strongly supported as monophyletic and are placed as the sister group of the Thelephorales. The Agaricales, Boletales and Atheliales are united as the Agaricomycetidae, and the Russulales might be its sister group. There are still some problematical nodes that will require more loci to be resolved. Phylogenomics has promise for reconstructing these difficult backbone nodes, but current genome projects are limited mostly to the Agaricales, Boletales and Polyporales. Genome sequences from other major lineages, especially the early diverging clades, are needed to resolve the most ancient nodes and to assess deep homology in ecological characters in the Agaricomycotina. PMID:17486968

  17. Conjugate Gibbs sampling for Bayesian phylogenetic models.

    PubMed

    Lartillot, Nicolas

    2006-12-01

    We propose a new Markov Chain Monte Carlo (MCMC) sampling mechanism for Bayesian phylogenetic inference. This method, which we call conjugate Gibbs, relies on analytical conjugacy properties, and is based on an alternation between data augmentation and Gibbs sampling. The data augmentation step consists in sampling a detailed substitution history for each site, and across the whole tree, given the current value of the model parameters. Provided convenient priors are used, the parameters of the model can then be directly updated by a Gibbs sampling procedure, conditional on the current substitution history. Alternating between these two sampling steps yields a MCMC device whose equilibrium distribution is the posterior probability density of interest. We show, on real examples, that this conjugate Gibbs method leads to a significant improvement of the mixing behavior of the MCMC. In all cases, the decorrelation times of the resulting chains are smaller than those obtained by standard Metropolis Hastings procedures by at least one order of magnitude. The method is particularly well suited to heterogeneous models, i.e. assuming site-specific random variables. In particular, the conjugate Gibbs formalism allows one to propose efficient implementations of complex models, for instance assuming site-specific substitution processes, that would not be accessible to standard MCMC methods. PMID:17238840

  18. Influence function for robust phylogenetic reconstructions.

    PubMed

    Bar-Hen, Avner; Mariadassou, Mahendra; Poursat, Marie-Anne; Vandenkoornhuyse, Philippe

    2008-05-01

    Based on the computation of the influence function, a tool to measure the impact of each piece of sampled data on the statistical inference of a parameter, we propose to analyze the support of the maximum-likelihood (ML) tree for each site. We provide a new tool for filtering data sets (nucleotides, amino acids, and others) in the context of ML phylogenetic reconstructions. Because different sites support different phylogenic topologies in different ways, outlier sites, that is, sites with a very negative influence value, are important: they can drastically change the topology resulting from the statistical inference. Therefore, these outlier sites must be clearly identified and their effects accounted for before drawing biological conclusions from the inferred tree. A matrix containing 158 fungal terminals all belonging to Chytridiomycota, Zygomycota, and Glomeromycota is analyzed. We show that removing the strongest outlier from the analysis strikingly modifies the ML topology, with a loss of as many as 20% of the internal nodes. As a result, estimating the topology on the filtered data set results in a topology with enhanced bootstrap support. From this analysis, the polyphyletic status of the fungal phyla Chytridiomycota and Zygomycota is reinforced, suggesting the necessity of revisiting the systematics of these fungal groups. We show the ability of influence function to produce new evolution hypotheses. PMID:18263606

  19. A phylogenetic re-evaluation of Arthrinium.

    PubMed

    Crous, Pedro W; Groenewald, Johannes Z

    2013-07-01

    Although the genus Arthrinium (sexual morph Apiospora) is commonly isolated as an endophyte from a range of substrates, and is extremely interesting for the pharmaceutical industry, its molecular phylogeny has never been resolved. Based on morphology and DNA sequence data of the large subunit nuclear ribosomal RNA gene (LSU, 28S) and the internal transcribed spacers (ITS) and 5.8S rRNA gene of the nrDNA operon, the genus Arthrinium is shown to belong to Apiosporaceae in Xylariales. Arthrinium is morphologically and phylogenetically circumscribed, and the sexual genus Apiospora treated as synonym on the basis that Arthinium is older, more commonly encountered, and more frequently used in literature. An epitype is designated for Arthrinium pterospermum, and several well-known species are redefined based on their morphology and sequence data of the translation elongation factor 1-alpha (TEF), beta-tubulin (TUB) and internal transcribed spacer (ITS1, 5.8S, ITS2) gene regions. Newly described are A. hydei on Bambusa tuldoides from Hong Kong, A. kogelbergense on dead culms of Restionaceae from South Africa, A. malaysianum on Macaranga hullettii from Malaysia, A. ovatum on Arundinaria hindsii from Hong Kong, A. phragmites on Phragmites australis from Italy, A. pseudospegazzinii on Macaranga hullettii from Malaysia, A. pseudosinense on bamboo from The Netherlands, and A. xenocordella from soil in Zimbabwe. Furthermore, the genera Pteroconium and Cordella are also reduced to synonymy, rejecting spore shape and the presence of setae as characters of generic significance separating them from Arthrinium. PMID:23898419

  20. Phylogenetic autocorrelation under distinct evolutionary processes.

    PubMed

    Diniz-Filho, J A

    2001-06-01

    I show how phylogenetic correlograms track distinct microevolutionary processes and can be used as empirical descriptors of the relationship between interspecific covariance (V(B)) and time since divergence (t). Data were simulated under models of gradual and speciational change, using increasing levels of stabilizing selection in a stochastic Ornstein-Uhlenbeck (O-U) process, on a phylogeny of 42 species. For each simulated dataset, correlograms were constructed using Moran's I coefficients estimated at five time slices, established at constant intervals. The correlograms generated under different evolutionary models differ significantly according to F-values derived from analysis of variance comparing Moran's I at each time slice and based on Wilks' lambda from multivariate analysis of variance comparing their overall profiles in a two-way design. Under Brownian motion or with small restraining forces in the O-U process, correlograms were better fit by a linear model. However, increasing restraining forces in the O-U process cause a lack of linear fit, and correlograms are better described by exponential models. These patterns are better fit for gradual than for speciational modes of change. Correlograms can be used as a diagnostic method and to describe the V(B)/t relationship before using methods to analyze correlated evolution that assume (or perform statistically better when) this relationship is linear. PMID:11475046

  1. Phylogenetic Distribution of Potential Cellulases in Bacteria

    PubMed Central

    Berlemont, Renaud

    2013-01-01

    Many microorganisms contain cellulases that are important for plant cell wall degradation and overall soil ecosystem functioning. At present, we have extensive biochemical knowledge of cellulases but little is known about the phylogenetic distribution of these enzymes. To address this, we analyzed the distribution of 21,985 genes encoding proteins related to cellulose utilization in 5,123 sequenced bacterial genomes. First, we identified the distribution of glycoside hydrolases involved in cellulose utilization and synthesis at different taxonomic levels, from the phylum to the strain. Cellulose degradation/utilization capabilities appeared in nearly all major groups and resulted in strains displaying various enzyme gene combinations. Potential cellulose degraders, having both cellulases and β-glucosidases, constituted 24% of all genomes whereas potential opportunistic strains, having β-glucosidases only, accounted for 56%. Finally, 20% of the bacteria have no relevant enzymes and do not rely on cellulose utilization. The latter group was primarily connected to specific bacterial lifestyles like autotrophy and parasitism. Cellulose degraders, as well as opportunists, have multiple enzymes with similar functions. However, the potential degraders systematically harbor about twice more β-glucosidases than their potential opportunistic relatives. Although scattered, the distribution of functional types, in bacterial lineages, is not random but mostly follows a Brownian motion evolution model. Degraders form clusters of relatives at the species level, whereas opportunists are clustered at the genus level. This information can form a mechanistic basis for the linking of changes in microbial community composition to soil ecosystem processes. PMID:23263967

  2. Interpretation of bootstrap values in phylogenetic analysis.

    PubMed

    Wiesemüller, Bernhard; Rothe, Hartmut

    2006-06-01

    Bootstrap Analysis is a common tool in cladistics, and consequently many authors tend to believe that it could be close to a test of monophyly. In fact, it is only a procedure to calculate the redundancy of a certain character pattern among taxa. To demonstrate this, we set up a study with questionable data: Four skulls of great apes and humans were digitally photographed, and the pixels' brightness values were simply transformed to a one-zero-matrix, which was then used to calculate a Wagner tree with PHYLIP. As a rule, the higher the resolution of the photos is, the higher are the bootstrap values of supported taxa (and the lower are the bootstrap values of non-supported data). Redundancy of intertaxic information might indeed be an indicator of phylogenetic relationship, but can also be due to other reasons, like functional-adaptive needs in morphology, or semantic needs in a DNA-code. As a result, we tend to believe that high bootstrap values are actually less important than low ones. It is safer, based on a low bootstrap value, to claim that a certain taxon is not well supported by certain data. Therefore, we recommend discussions of low bootstrap values in future publications. PMID:16850767

  3. Dental microwear. Morphological, functional and phylogenetic correlations.

    PubMed

    Villa, G; Giacobini, G

    1998-01-01

    Dental wear, at first considered a pathological condition, is now regarded as a physiological mechanism of teeth adaptation to continuous masticatory stresses. Excessive wear is limited by characteristic structural adaptations of dental hard tissues showing a phylogenetic trend and specialisation. Enamel is the main tissue subjected to wear; however, advanced enamel wear exposes increasingly large areas of dentine. Enamel hardness and anisotropy are the major factors contrasting wear and microfractures. Anisotropy is mainly related to the different orientation of prism bundles (and of hydroxiapatite cristals). Enamel wear development is also related to differences in microhardness, density, mineral composition and protein distribution. Masticatory loads distributed along the enamel-dentine junction uniformly disperse in the underlying dentine. In spite of its structural characteristics, dentine is relatively isotropic by the functional point of view. Even if its lower hardness opposes less efficaciously to wear, its biomechanical characteristics successfully contrast microfractures. The study of microwear (namely the microscopic analysis of worn dental surfaces) can be made both on original surfaces and on high definition silicone-resin replicas. Scanning electron microscope observations allow identification of surface damage (microtraces) produced by different physical and chemical agents. Microwear analysis may provide indications about alimentary and non alimentary habits, masticatory biomechanics and pathological situations (e.g., bruxism). PMID:9766174

  4. Comprehensive Phylogenetic Analysis of Bacterial Reverse Transcriptases

    PubMed Central

    Toro, Nicolás; Nisa-Martínez, Rafael

    2014-01-01

    Much less is known about reverse transcriptases (RTs) in prokaryotes than in eukaryotes, with most prokaryotic enzymes still uncharacterized. Two surveys involving BLAST searches for RT genes in prokaryotic genomes revealed the presence of large numbers of diverse, uncharacterized RTs and RT-like sequences. Here, using consistent annotation across all sequenced bacterial species from GenBank and other sources via RAST, available from the PATRIC (Pathogenic Resource Integration Center) platform, we have compiled the data for currently annotated reverse transcriptases from completely sequenced bacterial genomes. RT sequences are broadly distributed across bacterial phyla, but green sulfur bacteria and cyanobacteria have the highest levels of RT sequence diversity (≤85% identity) per genome. By contrast, phylum Actinobacteria, for which a large number of genomes have been sequenced, was found to have a low RT sequence diversity. Phylogenetic analyses revealed that bacterial RTs could be classified into 17 main groups: group II introns, retrons/retron-like RTs, diversity-generating retroelements (DGRs), Abi-like RTs, CRISPR-Cas-associated RTs, group II-like RTs (G2L), and 11 other groups of RTs of unknown function. Proteobacteria had the highest potential functional diversity, as they possessed most of the RT groups. Group II introns and DGRs were the most widely distributed RTs in bacterial phyla. Our results provide insights into bacterial RT phylogeny and the basis for an update of annotation systems based on sequence/domain homology. PMID:25423096

  5. Phylogenetic biodiversity assessment based on systematic nomenclature

    PubMed Central

    Crozier, Ross H; Dunnett, Lisa J; Agapow, Paul-Michael

    2007-01-01

    Biodiversity assessment demands objective measures, because ultimately conservation decisions must prioritize the use of limited resources for preserving taxa. The most general framework for the objective assessment of conservation worth are those that assess evolutionary distinctiveness, e.g. Genetic (Crozier 1992) and Phylogenetic Diversity (Faith 1992), and Evolutionary History (Nee & May 1997). These measures all attempt to assess the conservation worth of any scheme based on how much of the encompassing phylogeny of organisms is preserved. However, their general applicability is limited by the small proportion of taxa that have been reliably placed in a phylogeny. Given that phylogenizaton of many interesting taxa or important is unlikely to occur soon, we present a framework for using taxonomy as a reasonable surrogate for phylogeny. Combining this framework with exhaustive searches for combinations of sites containing maximal diversity, we provide a proof-of-concept for assessing conservation schemes for systematized but un-phylogenised taxa spread over a series of sites. This is illustrated with data from four studies, on North Queensland flightless insects (Yeates et al. 2002), ants from a Florida Transect (Lubertazzi & Tschinkel 2003), New England bog ants (Gotelli & Ellison 2002) and a simulated distribution of the known New Zealand Lepidosauria (Daugherty et al. 1994). The results support this approach, indicating that species, genus and site numbers predict evolutionary history, to a degree depending on the size of the data set. PMID:19325850

  6. Phylogenetic characterization of archaea in saltpan sediments.

    PubMed

    Ahmad, Nasier; Johri, Sarojini; Sultan, Phalisteen; Abdin, Malik Z; Qazi, Ghulam N

    2011-06-01

    A study was undertaken to investigate the presence of archaeal diversity in saltpan sediments of Goa, India by 16S rDNA-dependent molecular phylogeny. Small subunit rRNA (16S rDNA) from saltpan sediment metagenome were amplified by polymerase chain reaction (PCR) using primers specific to the domain archaea. 10 unique phylotypes were obtained by PCR based RFLP of 16S rRNA genes using endonuclease Msp 1, which was most suitable to score the genetic diversity. These phylotypes spanned a wide range within the domain archaea including both crenarchaeota and euryarcheaota. None of the retrieved crenarchaeota sequences could be grouped with previously cultured crenarchaeota however; two sequences were related with haloarchaea. Most of the sequences determined were closely related to the sequences that had been previously obtained from metagenome of a variety of marine environments. The phylogenetic study of a site investigated for the first time revealed the presence of low archaeal population but showed yet unclassified species, may specially adapted to the salt pan sediment of Goa. PMID:22654153

  7. Postural control in man: the phylogenetic perspective.

    PubMed

    Gramsbergen, Albert

    2005-01-01

    Erect posture in man is a recent affordance from an evolutionary perspective. About eight million years ago, the stock from which modern humans derived split off from the ape family, and from around sixty-thousand years ago, modern man developed. Upright gait and manipulations while standing pose intricate cybernetic problems for postural control. The trunk, having an older evolutionary history than the extremities, is innervated by medially descending motor systems and extremity muscles by the more recent, laterally descending systems. Movements obviously require concerted actions from both systems. Research in rats has demonstrated the interdependencies between postural control and the development of fluent walking. Only 15 days after birth, adult-like fluent locomotion emerges and is critically dependent upon postural development. Vesttibular deprivation induces a retardation in postural development and, consequently, a retarded development of adult-like locomotion. The cerebellum obviously has an important role in mutual adjustments in postural control and extremity movements, or, in coupling the phylogenetic older and newer structures. In the human, the cerebellum develops partly after birth and therefore is vulnerable to adverse perinatal influences. Such vulnerability seems to justify focusing our scientific research efforts onto the development of this structure. PMID:16097476

  8. Commission 45: Spectral Classification

    NASA Astrophysics Data System (ADS)

    Giridhar, Sunetra; Gray, Richard O.; Corbally, Christopher J.; Bailer-Jones, Coryn A. L.; Eyer, Laurent; Irwin, Michael J.; Kirkpatrick, J. Davy; Majewski, Steven; Minniti, Dante; Nordström, Birgitta

    This report gives an update of developments (since the last General Assembly at Prague) in the areas that are of relevance to the commission. In addition to numerous papers, a ne