Sample records for accurate phylogenetic breakpoint

  1. Stepwise detection of recombination breakpoints in sequence alignments.

    PubMed

    Graham, Jinko; McNeney, Brad; Seillier-Moiseiwitsch, Françoise

    2005-03-01

    We propose a stepwise approach to identify recombination breakpoints in a sequence alignment. The approach can be applied to any recombination detection method that uses a permutation test and provides estimates of breakpoints. We illustrate the approach by analyses of a simulated dataset and alignments of real data from HIV-1 and human chromosome 7. The presented simulation results compare the statistical properties of one-step and two-step procedures. More breakpoints are found with a two-step procedure than with a single application of a given method, particularly for higher recombination rates. At higher recombination rates, the additional breakpoints were located at the cost of only a slight increase in the number of falsely declared breakpoints. However, a large proportion of breakpoints still go undetected. A makefile and C source code for phylogenetic profiling and the maximum chi2 method, tested with the gcc compiler on Linux and WindowsXP, are available at http://stat-db.stat.sfu.ca/stepwise/ jgraham@stat.sfu.ca.

  2. DB2: a probabilistic approach for accurate detection of tandem duplication breakpoints using paired-end reads.

    PubMed

    Yavaş, Gökhan; Koyutürk, Mehmet; Gould, Meetha P; McMahon, Sarah; LaFramboise, Thomas

    2014-03-05

    With the advent of paired-end high throughput sequencing, it is now possible to identify various types of structural variation on a genome-wide scale. Although many methods have been proposed for structural variation detection, most do not provide precise boundaries for identified variants. In this paper, we propose a new method, Distribution Based detection of Duplication Boundaries (DB2), for accurate detection of tandem duplication breakpoints, an important class of structural variation, with high precision and recall. Our computational experiments on simulated data show that DB2 outperforms state-of-the-art methods in terms of finding breakpoints of tandem duplications, with a higher positive predictive value (precision) in calling the duplications' presence. In particular, DB2's prediction of tandem duplications is correct 99% of the time even for very noisy data, while narrowing down the space of possible breakpoints within a margin of 15 to 20 bps on the average. Most of the existing methods provide boundaries in ranges that extend to hundreds of bases with lower precision values. Our method is also highly robust to varying properties of the sequencing library and to the sizes of the tandem duplications, as shown by its stable precision, recall and mean boundary mismatch performance. We demonstrate our method's efficacy using both simulated paired-end reads, and those generated from a melanoma sample and two ovarian cancer samples. Newly discovered tandem duplications are validated using PCR and Sanger sequencing. Our method, DB2, uses discordantly aligned reads, taking into account the distribution of fragment length to predict tandem duplications along with their breakpoints on a donor genome. The proposed method fine tunes the breakpoint calls by applying a novel probabilistic framework that incorporates the empirical fragment length distribution to score each feasible breakpoint. DB2 is implemented in Java programming language and is freely available

  3. Accurate Phylogenetic Tree Reconstruction from Quartets: A Heuristic Approach

    PubMed Central

    Reaz, Rezwana; Bayzid, Md. Shamsuzzoha; Rahman, M. Sohel

    2014-01-01

    Supertree methods construct trees on a set of taxa (species) combining many smaller trees on the overlapping subsets of the entire set of taxa. A ‘quartet’ is an unrooted tree over taxa, hence the quartet-based supertree methods combine many -taxon unrooted trees into a single and coherent tree over the complete set of taxa. Quartet-based phylogeny reconstruction methods have been receiving considerable attentions in the recent years. An accurate and efficient quartet-based method might be competitive with the current best phylogenetic tree reconstruction methods (such as maximum likelihood or Bayesian MCMC analyses), without being as computationally intensive. In this paper, we present a novel and highly accurate quartet-based phylogenetic tree reconstruction method. We performed an extensive experimental study to evaluate the accuracy and scalability of our approach on both simulated and biological datasets. PMID:25117474

  4. Accurate phylogenetic classification of DNA fragments based onsequence composition

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McHardy, Alice C.; Garcia Martin, Hector; Tsirigos, Aristotelis

    2006-05-01

    Metagenome studies have retrieved vast amounts of sequenceout of a variety of environments, leading to novel discoveries and greatinsights into the uncultured microbial world. Except for very simplecommunities, diversity makes sequence assembly and analysis a verychallenging problem. To understand the structure a 5 nd function ofmicrobial communities, a taxonomic characterization of the obtainedsequence fragments is highly desirable, yet currently limited mostly tothose sequences that contain phylogenetic marker genes. We show that forclades at the rank of domain down to genus, sequence composition allowsthe very accurate phylogenetic 10 characterization of genomic sequence.We developed a composition-based classifier, PhyloPythia, for de novophylogenetic sequencemore » characterization and have trained it on adata setof 340 genomes. By extensive evaluation experiments we show that themethodis accurate across all taxonomic ranks considered, even forsequences that originate fromnovel organisms and are as short as 1kb.Application to two metagenome datasets 15 obtained from samples ofphosphorus-removing sludge showed that the method allows the accurateclassification at genus level of most sequence fragments from thedominant populations, while at the same time correctly characterizingeven larger parts of the samples at higher taxonomic levels.« less

  5. DNA Probe Pooling for Rapid Delineation of Chromosomal Breakpoints

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lu, Chun-Mei; Kwan, Johnson; Baumgartner, Adolf

    2009-01-30

    Structural chromosome aberrations are hallmarks of many human genetic diseases. The precise mapping of translocation breakpoints in tumors is important for identification of genes with altered levels of expression, prediction of tumor progression, therapy response, or length of disease-free survival as well as the preparation of probes for detection of tumor cells in peripheral blood. Similarly, in vitro fertilization (IVF) and preimplantation genetic diagnosis (PGD) for carriers of balanced, reciprocal translocations benefit from accurate breakpoint maps in the preparation of patient-specific DNA probes followed by a selection of normal or balanced oocytes or embryos. We expedited the process of breakpointmore » mapping and preparation of case-specific probes by utilizing physically mapped bacterial artificial chromosome (BAC) clones. Historically, breakpoint mapping is based on the definition of the smallest interval between proximal and distal probes. Thus, many of the DNA probes prepared for multi-clone and multi-color mapping experiments do not generate additional information. Our pooling protocol described here with examples from thyroid cancer research and PGD accelerates the delineation of translocation breakpoints without sacrificing resolution. The turnaround time from clone selection to mapping results using tumor or IVF patient samples can be as short as three to four days.« less

  6. EUCAST breakpoints for antifungals.

    PubMed

    Rodríguez-Tudela, Juan L; Arendrup, Maiken C; Cuenca-Estrella, Manuel; Donnelly, J Peter; Lass-Flörl, Cornelia

    2010-03-01

    Susceptibility testing of fungi and development of interpretative breakpoints has become increasingly important due to the growing incidence of invasive fungal infections, the number and classes of antifungals, and the emerging reports of acquired resistance. The subcommittee on antifungal susceptibility testing of the European Committee on Antibiotic Susceptibility Testing (EUCAST) has developed standards for susceptibility testing of fermentative yeasts and molds as well as proposing breakpoints for fluconazole and voriconazole against Candida. The aim of this work is to describe the EUCAST process of setting breakpoints for antifungals. Five aspects are evaluated during the process of developing breakpoints: 1) the most common dosage used in each European country, 2) the definition of the wild-type population for each target microorganism at the species level and the determination of epidemiological cutoffs, 3) the drug's pharmacokinetics and 4) pharmacodynamics, including Monte Carlo simulations, and 5) the correlation of MICs with clinical outcome of patients treated with the compound. When insufficient data are available (e.g., due to lack of information on the clinical outcome of infections caused by isolates with an elevated MIC), epidemiological cutoff values, rather than breakpoints, are recommended until the necessary information becomes available. Copyright 2010 Prous Science, S.A.U. or its licensors. All rights reserved.

  7. Distinct retroelement classes define evolutionary breakpoints demarcating sites of evolutionary novelty

    PubMed Central

    Longo, Mark S; Carone, Dawn M; Green, Eric D; O'Neill, Michael J; O'Neill, Rachel J

    2009-01-01

    Background Large-scale genome rearrangements brought about by chromosome breaks underlie numerous inherited diseases, initiate or promote many cancers and are also associated with karyotype diversification during species evolution. Recent research has shown that these breakpoints are nonrandomly distributed throughout the mammalian genome and many, termed "evolutionary breakpoints" (EB), are specific genomic locations that are "reused" during karyotypic evolution. When the phylogenetic trajectory of orthologous chromosome segments is considered, many of these EB are coincident with ancient centromere activity as well as new centromere formation. While EB have been characterized as repeat-rich regions, it has not been determined whether specific sequences have been retained during evolution that would indicate previous centromere activity or a propensity for new centromere formation. Likewise, the conservation of specific sequence motifs or classes at EBs among divergent mammalian taxa has not been determined. Results To define conserved sequence features of EBs associated with centromere evolution, we performed comparative sequence analysis of more than 4.8 Mb within the tammar wallaby, Macropus eugenii, derived from centromeric regions (CEN), euchromatic regions (EU), and an evolutionary breakpoint (EB) that has undergone convergent breakpoint reuse and past centromere activity in marsupials. We found a dramatic enrichment for long interspersed nucleotide elements (LINE1s) and endogenous retroviruses (ERVs) and a depletion of short interspersed nucleotide elements (SINEs) shared between CEN and EBs. We analyzed the orthologous human EB (14q32.33), known to be associated with translocations in many cancers including multiple myelomas and plasma cell leukemias, and found a conserved distribution of similar repetitive elements. Conclusion Our data indicate that EBs tracked within the class Mammalia harbor sequence features retained since the divergence of marsupials

  8. Mapping the breakpoints of an individual with congenital glaucoma and a 6:13 translocation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nishimura, D.Y.; Patil, S.; Alward, W.L.M.

    1994-09-01

    We have identified an individual with a balanced translocation involving the short arm of chromosome 6 (6p25) and the long arm of chromosome 13 (13q22). This person was found to have congenital glaucoma along with other anomalies. The identification of a second individual with both congenital glaucoma and an unbalanced translocation involving the distal short arm of chromosome 6 suggested that this region of chromosome 6 might be involved in at least some cases of congenital glaucoma. Therefore, we wished to characterize the two breakpoints with an emphasis on the 6p25 breakpoint. Somatic cell hybrids were used to isolate themore » two translocated chromosomes away from the normal 6 and 13. Two hybrids were identified which contained the distal region of chromosome 6p, while a third hybrid contained the reciprocal translocated chromosome. Characterization of the hybrids demonstrated that the breakpoint on chromosome 6p lies between the markers D6S344 and D6S477, while the chromosome 13 breakpoint lies between the markers D13S160 and D13S170. The distances between the flanking markers on both of the chromosomes are less than 5 cM. The lack of genetic markers within the 6p25 region made it difficult to accurately map the location of the breakpoint in this region. However, we were able to demonstrate that at least two markers are distal to the 6p25 breakpoint. We are working to improve the quality of both the genetic and physical maps of 6p25 and 13q22 in an attempt to further refine the localization of the breakpoints. Preliminary work on the individual with the unbalanced translocation suggests that the 6p deletion in this person includes the 6p25 breakpoints.« less

  9. Breakpoint Forcing Revisited: Phase Between Forcing and Response

    NASA Astrophysics Data System (ADS)

    Contardo, S.; Symonds, G.; Dufois, F.

    2018-02-01

    Using the breakpoint forcing model, for long wave generation in the surf zone, expressions for the phase difference between the breakpoint-forced long waves and the incident short wave groups are obtained. Contrary to assumptions made in previous studies, the breakpoint-forced long waves and incident wave groups are not in phase and outgoing breakpoint-forced long waves and incident wave groups are not π out of phase. The phase between the breakpoint-forced long wave and the incident wave group is shown to depend on beach geometry and wave group parameters. The breakpoint-forced incoming long wave lags behind the wave group, by a phase smaller than π/2. The phase lag decreases as the beach slope decreases and the group frequency increases, approaching approximately π/16 within reasonable limits of the parameter space. The phase between the breakpoint-forced outgoing long wave and the wave group is between π/2 and π and it increases as the beach slope decreases and the group frequency increases, approaching 15π/16 within reasonable limits of the parameter space. The phase between the standing long wave (composed of the incoming long wave and its reflection) and the incident wave group tends to zero when the wave group is long compared to the surf zone width. These results clarify the phase relationships in the breakpoint forcing model and provide a new base for the identification of breakpoint forcing signal from observations, laboratory experiments and numerical modeling.

  10. Characterization of the breakpoints of a polymorphic inversion complex detects strict and broad breakpoint reuse at the molecular level.

    PubMed

    Puerma, Eva; Orengo, Dorcas J; Salguero, David; Papaceit, Montserrat; Segarra, Carmen; Aguadé, Montserrat

    2014-09-01

    Inversions are an integral part of structural variation within species, and they play a leading role in genome reorganization across species. Work at both the cytological and genome sequence levels has revealed heterogeneity in the distribution of inversion breakpoints, with some regions being recurrently used. Breakpoint reuse at the molecular level has mostly been assessed for fixed inversions through genome sequence comparison, and therefore rather broadly. Here, we have identified and sequenced the breakpoints of two polymorphic inversions-E1 and E2 that share a breakpoint-in the extant Est and E1 + 2 chromosomal arrangements of Drosophila subobscura. The breakpoints are two medium-sized repeated motifs that mediated the inversions by two different mechanisms: E1 via staggered breaks and subsequent repair and E2 via repeat-mediated ectopic recombination. The fine delimitation of the shared breakpoint revealed its strict reuse at the molecular level regardless of which was the intermediate arrangement. The occurrence of other rearrangements in the most proximal and distal extended breakpoint regions reveals the broad reuse of these regions. This differential degree of fragility might be related to their sharing the presence outside the inverted region of snoRNA-encoding genes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  11. Two novel translocation breakpoints upstream of SOX9 define borders of the proximal and distal breakpoint cluster region in campomelic dysplasia.

    PubMed

    Leipoldt, M; Erdel, M; Bien-Willner, G A; Smyk, M; Theurl, M; Yatsenko, S A; Lupski, J R; Lane, A H; Shanske, A L; Stankiewicz, P; Scherer, G

    2007-01-01

    The semilethal skeletal malformation syndrome campomelic dysplasia (CD) with or without XY sex reversal is caused by mutations within the SOX9 gene on 17q24.3 or by chromosomal aberrations (translocations, inversions or deletions) with breakpoints outside the SOX9 coding region. The previously published CD translocation breakpoints upstream of SOX9 fall into two clusters: a proximal cluster with breakpoints between 50-300 kb and a distal cluster with breakpoints between 899-932 kb. Here, we present clinical, cytogenetic and molecular data from two novel CD translocation cases. Case 1 with karyotype 46,XY,t(1;17)(q42.1;q24.3) has characteristic symptoms of CD, including mild tibial bowing, cryptorchidism and hypospadias. By standard fluorescence in situ hybridization (FISH) and by high-resolution fiber FISH, the 17q breakpoint was mapped 375 kb from SOX9, defining the centromeric border of the proximal breakpoint cluster region. Case 2 with karyotype 46,X,t(Y;17)(q11.2;q24.3) has the acampomelic form of CD and complete XY sex reversal. By FISH and somatic cell hybrid analysis, the 17q breakpoint was mapped 789 kb from SOX9, defining the telomeric border of the distal breakpoint cluster region. We discuss the structure of the 1 Mb cis-control region upstream of SOX9 and the correlation between the position of the 14 mapped translocation breakpoints with respect to disease severity and XY sex reversal.

  12. Discovery of Genomic Breakpoints Affecting Breast Cancer Progression and Prognosis

    DTIC Science & Technology

    2009-10-01

    157 genomic breakpoints in MCF-7 cells that could be confirmed by PCR across breakpoint joins as likely somatic mutations . A total of 79 genes are...SUPPLEMENTARY NOTES 14. ABSTRACT 157 genomic breakpoints could be confirmed as likely somatic mutations . We focused on breakpoints predicted to lead to...enrichment for breakpoints containing genes (50.3% vs 77.4%), and for fusion-containing breakpoints (6.4% vs 16.1%). Also, all chimeric mRNA products are

  13. Handling debugger breakpoints in a shared instruction system

    DOEpatents

    Gooding, Thomas Michael; Shok, Richard Michael

    2014-01-21

    A debugger debugs processes that execute shared instructions so that a breakpoint set for one process will not cause a breakpoint to occur in the other processes. A breakpoint is set by recording the original instruction at the desired location and writing a trap instruction to the shared instructions at that location. When a process encounters the breakpoint, the process passes control to the debugger for breakpoint processing if the breakpoint was set at that location for that process. If the trap was not set at that location for that process, the cacheline containing the trap is copied to a small scratchpad memory, and the virtual memory mappings are changed to translate the virtual address of the cacheline to the scratchpad. The original instruction is then written to replace the trap instruction in the scratchpad, so that process can execute the instructions in the scatchpad thereby avoiding the trap instruction.

  14. Reconstruction of phylogenetic trees of prokaryotes using maximal common intervals.

    PubMed

    Heydari, Mahdi; Marashi, Sayed-Amir; Tusserkani, Ruzbeh; Sadeghi, Mehdi

    2014-10-01

    One of the fundamental problems in bioinformatics is phylogenetic tree reconstruction, which can be used for classifying living organisms into different taxonomic clades. The classical approach to this problem is based on a marker such as 16S ribosomal RNA. Since evolutionary events like genomic rearrangements are not included in reconstructions of phylogenetic trees based on single genes, much effort has been made to find other characteristics for phylogenetic reconstruction in recent years. With the increasing availability of completely sequenced genomes, gene order can be considered as a new solution for this problem. In the present work, we applied maximal common intervals (MCIs) in two or more genomes to infer their distance and to reconstruct their evolutionary relationship. Additionally, measures based on uncommon segments (UCS's), i.e., those genomic segments which are not detected as part of any of the MCIs, are also used for phylogenetic tree reconstruction. We applied these two types of measures for reconstructing the phylogenetic tree of 63 prokaryotes with known COG (clusters of orthologous groups) families. Similarity between the MCI-based (resp. UCS-based) reconstructed phylogenetic trees and the phylogenetic tree obtained from NCBI taxonomy browser is as high as 93.1% (resp. 94.9%). We show that in the case of this diverse dataset of prokaryotes, tree reconstruction based on MCI and UCS outperforms most of the currently available methods based on gene orders, including breakpoint distance and DCJ. We additionally tested our new measures on a dataset of 13 closely-related bacteria from the genus Prochlorococcus. In this case, distances like rearrangement distance, breakpoint distance and DCJ proved to be useful, while our new measures are still appropriate for phylogenetic reconstruction. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  15. YAHA: fast and flexible long-read alignment with optimal breakpoint detection.

    PubMed

    Faust, Gregory G; Hall, Ira M

    2012-10-01

    With improved short-read assembly algorithms and the recent development of long-read sequencers, split mapping will soon be the preferred method for structural variant (SV) detection. Yet, current alignment tools are not well suited for this. We present YAHA, a fast and flexible hash-based aligner. YAHA is as fast and accurate as BWA-SW at finding the single best alignment per query and is dramatically faster and more sensitive than both SSAHA2 and MegaBLAST at finding all possible alignments. Unlike other aligners that report all, or one, alignment per query, or that use simple heuristics to select alignments, YAHA uses a directed acyclic graph to find the optimal set of alignments that cover a query using a biologically relevant breakpoint penalty. YAHA can also report multiple mappings per defined segment of the query. We show that YAHA detects more breakpoints in less time than BWA-SW across all SV classes, and especially excels at complex SVs comprising multiple breakpoints. YAHA is currently supported on 64-bit Linux systems. Binaries and sample data are freely available for download from http://faculty.virginia.edu/irahall/YAHA. imh4y@virginia.edu.

  16. SATe-II: very fast and accurate simultaneous estimation of multiple sequence alignments and phylogenetic trees.

    PubMed

    Liu, Kevin; Warnow, Tandy J; Holder, Mark T; Nelesen, Serita M; Yu, Jiaye; Stamatakis, Alexandros P; Linder, C Randal

    2012-01-01

    Highly accurate estimation of phylogenetic trees for large data sets is difficult, in part because multiple sequence alignments must be accurate for phylogeny estimation methods to be accurate. Coestimation of alignments and trees has been attempted but currently only SATé estimates reasonably accurate trees and alignments for large data sets in practical time frames (Liu K., Raghavan S., Nelesen S., Linder C.R., Warnow T. 2009b. Rapid and accurate large-scale coestimation of sequence alignments and phylogenetic trees. Science. 324:1561-1564). Here, we present a modification to the original SATé algorithm that improves upon SATé (which we now call SATé-I) in terms of speed and of phylogenetic and alignment accuracy. SATé-II uses a different divide-and-conquer strategy than SATé-I and so produces smaller more closely related subsets than SATé-I; as a result, SATé-II produces more accurate alignments and trees, can analyze larger data sets, and runs more efficiently than SATé-I. Generally, SATé is a metamethod that takes an existing multiple sequence alignment method as an input parameter and boosts the quality of that alignment method. SATé-II-boosted alignment methods are significantly more accurate than their unboosted versions, and trees based upon these improved alignments are more accurate than trees based upon the original alignments. Because SATé-I used maximum likelihood (ML) methods that treat gaps as missing data to estimate trees and because we found a correlation between the quality of tree/alignment pairs and ML scores, we explored the degree to which SATé's performance depends on using ML with gaps treated as missing data to determine the best tree/alignment pair. We present two lines of evidence that using ML with gaps treated as missing data to optimize the alignment and tree produces very poor results. First, we show that the optimization problem where a set of unaligned DNA sequences is given and the output is the tree and alignment of

  17. Precise detection of chromosomal translocation or inversion breakpoints by whole-genome sequencing.

    PubMed

    Suzuki, Toshifumi; Tsurusaki, Yoshinori; Nakashima, Mitsuko; Miyake, Noriko; Saitsu, Hirotomo; Takeda, Satoru; Matsumoto, Naomichi

    2014-12-01

    Structural variations (SVs), including translocations, inversions, deletions and duplications, are potentially associated with Mendelian diseases and contiguous gene syndromes. Determination of SV-related breakpoints at the nucleotide level is important to reveal the genetic causes for diseases. Whole-genome sequencing (WGS) by next-generation sequencers is expected to determine structural abnormalities more directly and efficiently than conventional methods. In this study, 14 SVs (9 balanced translocations, 1 inversion and 4 microdeletions) in 9 patients were analyzed by WGS with a shallow (5 × ) to moderate read coverage (20 × ). Among 28 breakpoints (as each SV has two breakpoints), 19 SV breakpoints had been determined previously at the nucleotide level by any other methods and 9 were uncharacterized. BreakDancer and Integrative Genomics Viewer determined 20 breakpoints (16 translocation, 2 inversion and 2 deletion breakpoints), but did not detect 8 breakpoints (2 translocation and 6 deletion breakpoints). These data indicate the efficacy of WGS for the precise determination of translocation and inversion breakpoints.

  18. Detection of gene expression changes at chromosomal rearrangement breakpoints in evolution

    PubMed Central

    2012-01-01

    Background We study the relation between genome rearrangements, breakpoints and gene expression. Genome rearrangement research has been concerned with the creation of breakpoints and their position in the chromosome, but the functional consequences of individual breakpoints remain virtually unknown, and there are no direct genome-wide studies of breakpoints from this point of view. A question arises of what the biological consequences of breakpoint creation are, rather than just their structural aspects. The question is whether proximity to the site of a breakpoint event changes the activity of a gene. Results We investigate this by comparing the distribution of distances to the nearest breakpoint of genes that are differentially expressed with the distribution of the same distances for the entire gene complement. We study this in data on whole blood tissue in human versus macaque, and in cerebral cortex tissue in human versus chimpanzee. We find in both data sets that the distribution of distances to the nearest breakpoint of "changed expression genes" differs little from this distance calculated for the rest of the gene complement. In focusing on the changed expression genes closest to the breakpoints, however, we discover that several of these have previously been implicated in the literature as being connected to the evolutionary divergence of humans from other primates. Conclusions We conjecture that chromosomal rearrangements occasionally interrupt the regulatory configurations of genes close to the breakpoint, leading to changes in expression. PMID:22536904

  19. MitoBreak: the mitochondrial DNA breakpoints database.

    PubMed

    Damas, Joana; Carneiro, João; Amorim, António; Pereira, Filipe

    2014-01-01

    Mitochondrial DNA (mtDNA) rearrangements are key events in the development of many diseases. Investigations of mtDNA regions affected by rearrangements (i.e. breakpoints) can lead to important discoveries about rearrangement mechanisms and can offer important clues about the causes of mitochondrial diseases. Here, we present the mitochondrial DNA breakpoints database (MitoBreak; http://mitobreak.portugene.com), a free, web-accessible comprehensive list of breakpoints from three classes of somatic mtDNA rearrangements: circular deleted (deletions), circular partially duplicated (duplications) and linear mtDNAs. Currently, MitoBreak contains >1400 mtDNA rearrangements from seven species (Homo sapiens, Mus musculus, Rattus norvegicus, Macaca mulatta, Drosophila melanogaster, Caenorhabditis elegans and Podospora anserina) and their associated phenotypic information collected from nearly 400 publications. The database allows researchers to perform multiple types of data analyses through user-friendly interfaces with full or partial datasets. It also permits the download of curated data and the submission of new mtDNA rearrangements. For each reported case, MitoBreak also documents the precise breakpoint positions, junction sequences, disease or associated symptoms and links to the related publications, providing a useful resource to study the causes and consequences of mtDNA structural alterations.

  20. A long journey from minimum inhibitory concentration testing to clinically predictive breakpoints: deterministic and probabilistic approaches in deriving breakpoints.

    PubMed

    Dalhoff, A; Ambrose, P G; Mouton, J W

    2009-08-01

    Since the origin of an "'International Collaborative Study on Antibiotic Sensitivity Testing'" in 1971, considerable advancement has been made to standardize clinical susceptibility testing procedures of antimicrobial agents. However, a consensus on the methods to be used and interpretive criteria was not reached, so the results of susceptibility testing were discrepant. Recently, the European Committee on Antimicrobial Susceptibility Testing achieved a harmonization of existing methods for susceptibility testing and now co-ordinates the process for setting breakpoints. Previously, breakpoints were set by adjusting the mean pharmacokinetic parameters derived from healthy volunteers to the susceptibilities of a population of potential pathogens expressed as the mean minimum inhibitory concentration (MIC) or MIC90%. Breakpoints derived by the deterministic approach tend to be too high, since this procedure does not take the variabilities of drug exposure and the susceptibility patterns into account. Therefore, first-step mutants or borderline susceptible bacteria may be considered as fully susceptible. As the drug exposure of such sub-populations is inadequate, resistance development will increase and eradication rates will decrease, resulting in clinical failure. The science of pharmacokinetics/pharmacodynamics integrates all possible drug exposures for standard dosage regimens and all MIC values likely to be found for the clinical isolates into the breakpoint definitions. Ideally, the data sets used originate from patients suffering from the disease to be treated. Probability density functions for both the pharmacokinetic and microbiological variables are determined, and a large number of MIC/drug exposure scenarios are calculated. Therefore, this method is defined as the probabilistic approach. The breakpoints thus derived are lower than the ones defined deterministically, as the entire range of probable drug exposures from low to high is modeled. Therefore, the

  1. Susceptibility Breakpoint for Enrofloxacin against Swine Salmonella spp.

    PubMed Central

    Hao, Haihong; Pan, Huafang; Ahmad, Ijaz; Cheng, Guyue; Wang, Yulian; Dai, Menghong; Tao, Yanfei; Chen, Dongmei; Peng, Dapeng; Liu, Zhenli

    2013-01-01

    Susceptibility breakpoints are crucial for prudent use of antimicrobials. This study has developed the first susceptibility breakpoint (MIC ≤ 0.25 μg/ml) for enrofloxacin against swine Salmonella spp. based on wild-type cutoff (COWT) and pharmacokinetic-pharmacodynamic (PK-PD) cutoff (COPD) values, consequently providing a criterion for susceptibility testing and clinical usage of enrofloxacin. PMID:23784134

  2. Phylogenetic Analysis of Genome Rearrangements among Five Mammalian Orders

    PubMed Central

    Luo, Haiwei; Arndt, William; Zhang, Yiwei; Shi, Guanqun; Alekseyev, Max; Tang, Jijun; Hughes, Austin L.; Friedman, Robert

    2015-01-01

    Evolutionary relationships among placental mammalian orders have been controversial. Whole genome sequencing and new computational methods offer opportunities to resolve the relationships among 10 genomes belonging to the mammalian orders Primates, Rodentia, Carnivora, Perissodactyla and Artiodactyla. By application of the double cut and join distance metric, where gene order is the phylogenetic character, we computed genomic distances among the sampled mammalian genomes. With a marsupial outgroup, the gene order tree supported a topology in which Rodentia fell outside the cluster of Primates, Carnivora, Perissodactyla, and Artiodactyla. Results of breakpoint reuse rate and synteny block length analyses were consistent with the prediction of random breakage model, which provided a diagnostic test to support use of gene order as an appropriate phylogenetic character in this study. We the influence of rate differences among lineages and other factors that may contribute to different resolutions of mammalian ordinal relationships by different methods of phylogenetic reconstruction. PMID:22929217

  3. Selection on Inversion Breakpoints Favors Proximity to Pairing Sensitive Sites in Drosophila melanogaster

    PubMed Central

    Corbett-Detig, Russell B.

    2016-01-01

    Chromosomal inversions are widespread among taxa, and have been implicated in a number of biological processes including adaptation, sex chromosome evolution, and segregation distortion. Consistent with selection favoring linkage between loci, it is well established that length is a selected trait of inversions. However, the factors that affect the distribution of inversion breakpoints remain poorly understood. “Sensitive sites” have been mapped on all euchromatic chromosome arms in Drosophila melanogaster, and may be a source of natural selection on inversion breakpoint positions. Briefly, sensitive sites are genomic regions wherein proximal structural rearrangements result in large reductions in local recombination rates in heterozygotes. Here, I show that breakpoints of common inversions are significantly more likely to lie within a cytological band containing a sensitive site than are breakpoints of rare inversions. Furthermore, common inversions for which neither breakpoint intersects a sensitive site are significantly longer than rare inversions, but common inversions whose breakpoints intersect a sensitive site show no evidence for increased length. I interpret these results to mean that selection favors inversions whose breakpoints disrupt synteny near to sensitive sites, possibly because these inversions suppress recombination in large genomic regions. To my knowledge this is the first evidence consistent with positive selection acting on inversion breakpoint positions. PMID:27343234

  4. Selection on Inversion Breakpoints Favors Proximity to Pairing Sensitive Sites in Drosophila melanogaster.

    PubMed

    Corbett-Detig, Russell B

    2016-09-01

    Chromosomal inversions are widespread among taxa, and have been implicated in a number of biological processes including adaptation, sex chromosome evolution, and segregation distortion. Consistent with selection favoring linkage between loci, it is well established that length is a selected trait of inversions. However, the factors that affect the distribution of inversion breakpoints remain poorly understood. "Sensitive sites" have been mapped on all euchromatic chromosome arms in Drosophila melanogaster, and may be a source of natural selection on inversion breakpoint positions. Briefly, sensitive sites are genomic regions wherein proximal structural rearrangements result in large reductions in local recombination rates in heterozygotes. Here, I show that breakpoints of common inversions are significantly more likely to lie within a cytological band containing a sensitive site than are breakpoints of rare inversions. Furthermore, common inversions for which neither breakpoint intersects a sensitive site are significantly longer than rare inversions, but common inversions whose breakpoints intersect a sensitive site show no evidence for increased length. I interpret these results to mean that selection favors inversions whose breakpoints disrupt synteny near to sensitive sites, possibly because these inversions suppress recombination in large genomic regions. To my knowledge this is the first evidence consistent with positive selection acting on inversion breakpoint positions. Copyright © 2016 by the Genetics Society of America.

  5. Progress towards mapping the constitutional t(11:22) breakpoint

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Barnoski, B.L.; Emanuel, B.S.; Bell, C.J.

    1994-09-01

    The reciprocal t(11;22)(q23;q11) is the most frequent, recurrent, non-Robertsonian, constitutional translocation in humans. Balanced carriers of this rearrangement are phenotypically normal, but are at risk for producing abnormal offspring with the Supernumerary der(22)t(11;22) Syndrome. Further, a recent report of association between t(11;22) balanced translocation carriers and breast cancer, suggests the involvement of genes on 11q and/or 22q in breast cancer tumorigenesis. Studies are in progress to examine the similarity between 11q23 and 22q11 breakpoints in multiple families with the constitutional t(11;22). A 750 kb YAC, which contains markers known to flank the 11q23 breakpoint, was identified in CEPH/Genethon database. FISHmore » with this YAC to two independent t(11;22) cell lines demonstrates signal on both derivative chromosomes. Numerous YACs containing BCRL2, the closest marker proximal to the breakpoint, were identified. Analysis of these YACs to determine which contain the actual breakpoint sequences is complicated by the presence of a duplicated segment of 22q11 which contains a GGTL and a BCRL locus. Sequences homologous to these loci are present at several other locations in 22q11. The BCRL positive YACs were analyzed by Southern hybridization under conditions which distinguish the four members of the BCR/BCRL family. FISH of total yeast DNA plus YAC DNA labeled by nick translation, or biotin-labeled inter-Alu PCR products confirmed the localization of these YACs to 22q11. Additional FISH with these YACS to metaphase spreads prepared from balanced t(11;22) carriers confirm that these clones span the breakpoint, and will allow rapid isolation and definition of the genetic region adjacent to the t(11;22) breakpoint.« less

  6. Localization and characterization of X chromosome inversion breakpoints separating Drosophila mojavensis and Drosophila arizonae.

    PubMed

    Cirulli, Elizabeth T; Noor, Mohamed A F

    2007-01-01

    Ectopic exchange between transposable elements or other repetitive sequences along a chromosome can produce chromosomal inversions. As a result, genome sequence studies typically find sequence similarity between corresponding inversion breakpoint regions. Here, we identify and investigate the breakpoint regions of the X chromosome inversion distinguishing Drosophila mojavensis and Drosophila arizonae. We localize one inversion breakpoint to 13.7 kb and localize the other to a 1-Mb interval. Using this localization and assuming microsynteny between Drosophila melanogaster and D. arizonae, we pinpoint likely positions of the inversion breakpoints to windows of less than 3000 bp. These breakpoints define the size of the inversion to approximately 11 Mb. However, in contrast to many other studies, we fail to find significant sequence similarity between the 2 breakpoint regions. The localization of these inversion breakpoints will facilitate future genetic and molecular evolutionary studies in this species group, an emerging model system for ecological genetics.

  7. Molecular population genetics of inversion breakpoint regions in Drosophila pseudoobscura.

    PubMed

    Wallace, Andre G; Detweiler, Don; Schaeffer, Stephen W

    2013-07-08

    Paracentric inversions in populations can have a profound effect on the pattern and organization of nucleotide variability along a chromosome. Regions near inversion breakpoints are expected to have greater levels of differentiation because of reduced genetic exchange between different gene arrangements whereas central regions in the inverted segments are predicted to have lower levels of nucleotide differentiation due to greater levels of genetic flux among different karyotypes. We used the inversion polymorphism on the third chromosome of Drosophila pseudoobscura to test these predictions with an analysis of nucleotide diversity of 18 genetic markers near and away from inversion breakpoints. We tested hypotheses about how the presence of different chromosomal arrangements affects the pattern and organization of nucleotide variation. Overall, markers in the distal segment of the chromosome had greater levels of nucleotide heterozygosity than markers within the proximal segment of the chromosome. In addition, our results rejected the hypothesis that the breakpoints of derived inversions will have lower levels of nucleotide variability than breakpoints of ancestral inversions, even when strains with gene conversion events were removed. High levels of linkage disequilibrium were observed within all 11 breakpoint regions as well as between the ends of most proximal and distal breakpoints. The central region of the chromosome had the greatest levels of linkage disequilibrium compared with the proximal and distal regions because this is the region that experiences the highest level of recombination suppression. These data do not fully support the idea that genetic exchange is the sole force that influences genetic variation on inverted chromosomes.

  8. Rapid mapping of chromosomal breakpoints: from blood to BAC in 20 days.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lu, Chun-Mei; Kwan, Johnson; Weier, Jingly F.

    2009-02-25

    Structural chromosome aberrations and associated segmental or chromosomal aneusomies are major causes of reproductive failure in humans. Despite the fact that carriers of reciprocal balanced translocation often have no other clinical symptoms or disease, impaired chromosome homologue pairing in meiosis and karyokinesis errors lead to over-representation of translocations carriers in the infertile population and in recurrent pregnancy loss patients. At present, clinicians have no means to select healthy germ cells or balanced zygotes in vivo, but in vitro fertilization (IVF) followed by preimplantation genetic diagnosis (PGD) offers translocation carriers a chance to select balanced or normal embryos for transfer. Althoughmore » a combination of telomeric and centromeric probes can differentiate embryos that are unbalanced from normal or unbalanced ones, a seemingly random position of breakpoints in these IVF-patients poses a serious obstacle to differentiating between normal and balanced embryos, which for most translocation couples, is desirable. Using a carrier with reciprocal translocation t(4;13) as an example, we describe our state-of-the-art approach to the preparation of patient-specific DNA probes that span or 'extent' the breakpoints. With the techniques and resources described here, most breakpoints can be accurately mapped in a matter of days using carrier lymphocytes, and a few extra days are allowed for PGD-probe optimization. The optimized probes will then be suitable for interphase cell analysis, a prerequisite for PGD since blastomeres are biopsied from normally growing day 3 - embryos regardless of their position in the mitotic cell cycle. Furthermore, routine application of these rapid methods should make PGD even more affordable for translocation carriers enrolled in IVF programs.« less

  9. A molecular perspective on a complex polymorphic inversion system with cytological evidence of multiply reused breakpoints.

    PubMed

    Orengo, D J; Puerma, E; Papaceit, M; Segarra, C; Aguadé, M

    2015-06-01

    Genome sequence comparison across the Drosophila genus revealed that some fixed inversion breakpoints had been multiply reused at this long timescale. Cytological studies of Drosophila inversion polymorphism had previously shown that, also at this shorter timescale, some breakpoints had been multiply reused. The paucity of molecularly characterized polymorphic inversion breakpoints has so far precluded contrasting whether cytologically shared breakpoints of these relatively young inversions are actually reused at the molecular level. The E chromosome of Drosophila subobscura stands out because it presents several inversion complexes. This is the case of the E1+2+9+3 arrangement that originated from the ancestral Est arrangement through the sequential accumulation of four inversions (E1, E2, E9 and E3) sharing some breakpoints. We recently identified the breakpoints of inversions E1 and E2, which allowed establishing reuse at the molecular level of the cytologically shared breakpoint of these inversions. Here, we identified and sequenced the breakpoints of inversions E9 and E3, because they share breakpoints at sections 58D and 64C with those of inversions E1 and E2. This has allowed establishing that E9 and E3 originated through the staggered-break mechanism. Most importantly, sequence comparison has revealed the multiple reuse at the molecular level of the proximal breakpoint (section 58D), which would have been used at least by inversions E2, E9 and E3. In contrast, the distal breakpoint (section 64C) might have been only reused once by inversions E1 and E2, because the distal E3 breakpoint is displaced >70 kb from the other breakpoint limits.

  10. A molecular perspective on a complex polymorphic inversion system with cytological evidence of multiply reused breakpoints

    PubMed Central

    Orengo, D J; Puerma, E; Papaceit, M; Segarra, C; Aguadé, M

    2015-01-01

    Genome sequence comparison across the Drosophila genus revealed that some fixed inversion breakpoints had been multiply reused at this long timescale. Cytological studies of Drosophila inversion polymorphism had previously shown that, also at this shorter timescale, some breakpoints had been multiply reused. The paucity of molecularly characterized polymorphic inversion breakpoints has so far precluded contrasting whether cytologically shared breakpoints of these relatively young inversions are actually reused at the molecular level. The E chromosome of Drosophila subobscura stands out because it presents several inversion complexes. This is the case of the E1+2+9+3 arrangement that originated from the ancestral Est arrangement through the sequential accumulation of four inversions (E1, E2, E9 and E3) sharing some breakpoints. We recently identified the breakpoints of inversions E1 and E2, which allowed establishing reuse at the molecular level of the cytologically shared breakpoint of these inversions. Here, we identified and sequenced the breakpoints of inversions E9 and E3, because they share breakpoints at sections 58D and 64C with those of inversions E1 and E2. This has allowed establishing that E9 and E3 originated through the staggered-break mechanism. Most importantly, sequence comparison has revealed the multiple reuse at the molecular level of the proximal breakpoint (section 58D), which would have been used at least by inversions E2, E9 and E3. In contrast, the distal breakpoint (section 64C) might have been only reused once by inversions E1 and E2, because the distal E3 breakpoint is displaced >70 kb from the other breakpoint limits. PMID:25712227

  11. Breakpoint structure of the Anopheles gambiae 2Rb chromosomal inversion.

    PubMed

    Lobo, Neil F; Sangaré, Djibril M; Regier, Allison A; Reidenbach, Kyanne R; Bretz, David A; Sharakhova, Maria V; Emrich, Scott J; Traore, Sekou F; Costantini, Carlo; Besansky, Nora J; Collins, Frank H

    2010-10-25

    Alternative arrangements of chromosome 2 inversions in Anopheles gambiae are important sources of population structure, and are associated with adaptation to environmental heterogeneity. The forces responsible for their origin and maintenance are incompletely understood. Molecular characterization of inversion breakpoints provides insight into how they arose, and provides the basis for development of molecular karyotyping methods useful in future studies. Sequence comparison of regions near the cytological breakpoints of 2Rb allowed the molecular delineation of breakpoint boundaries. Comparisons were made between the standard 2R+b arrangement in the An. gambiae PEST reference genome and the inverted 2Rb arrangements in the An. gambiae M and S genome assemblies. Sequence differences between alternative 2Rb arrangements were exploited in the design of a PCR diagnostic assay, which was evaluated against the known chromosomal banding pattern of laboratory colonies and field-collected samples from Mali and Cameroon. The breakpoints of the 7.55 Mb 2Rb inversion are flanked by extensive runs of the same short (72 bp) tandemly organized sequence, which was likely responsible for chromosomal breakage and rearrangement. Application of the molecular diagnostic assay suggested that 2Rb has a single common origin in An. gambiae and its sibling species, Anopheles arabiensis, and also that the standard arrangement (2R+b) may have arisen twice through breakpoint reuse. The molecular diagnostic was reliable when applied to laboratory colonies, but its accuracy was lower in natural populations. The complex repetitive sequence flanking the 2Rb breakpoint region may be prone to structural and sequence-level instability. The 2Rb molecular diagnostic has immediate application in studies based on laboratory colonies, but its usefulness in natural populations awaits development of complementary molecular tools.

  12. On the Existence of Step-To-Step Breakpoint Transitions in Accelerated Sprinting

    PubMed Central

    McGhie, David; Danielsen, Jørgen; Sandbakk, Øyvind; Haugen, Thomas

    2016-01-01

    Accelerated running is characterised by a continuous change of kinematics from one step to the next. It has been argued that breakpoints in the step-to-step transitions may occur, and that these breakpoints are an essential characteristic of dynamics during accelerated running. We examined this notion by comparing a continuous exponential curve fit (indicating continuity, i.e., smooth transitions) with linear piecewise fitting (indicating breakpoint). We recorded the kinematics of 24 well trained sprinters during a 25 m sprint run with start from competition starting blocks. Kinematic data were collected for 24 anatomical landmarks in 3D, and the location of centre of mass (CoM) was calculated from this data set. The step-to-step development of seven variables (four related to CoM position, and ground contact time, aerial time and step length) were analysed by curve fitting. In most individual sprints (in total, 41 sprints were successfully recorded) no breakpoints were identified for the variables investigated. However, for the mean results (i.e., the mean curve for all athletes) breakpoints were identified for the development of vertical CoM position, angle of acceleration and distance between support surface and CoM. It must be noted that for these variables the exponential fit showed high correlations (r2>0.99). No relationship was found between the occurrences of breakpoints for different variables as investigated using odds ratios (Mantel-Haenszel Chi-square statistic). It is concluded that although breakpoints regularly appear during accelerated running, these are not the rule and thereby unlikely a fundamental characteristic, but more likely an expression of imperfection of performance. PMID:27467387

  13. Structure and population genetics of the breakpoints of a polymorphic inversion in Drosophila subobscura.

    PubMed

    Papaceit, Montserrat; Segarra, Carmen; Aguadé, Montserrat

    2013-01-01

    Drosophila subobscura is a paleartic species of the obscura group with a rich chromosomal polymorphism. To further our understanding on the origin of inversions and on how they regain variation, we have identified and sequenced the two breakpoints of a polymorphic inversion of D. subobscura--inversion 3 of the O chromosome--in a population sample. The breakpoints could be identified as two rather short fragments (∼300 bp and 60 bp long) with no similarity to any known transposable element family or repetitive sequence. The presence of the ∼300-bp fragment at the two breakpoints of inverted chromosomes implies its duplication, an indication of the inversion origin via staggered double-strand breaks. Present results and previous findings support that the mode of origin of inversions is neither related to the inversion age nor species-group specific. The breakpoint regions do not consistently exhibit the lower level of variation within and stronger genetic differentiation between arrangements than more internal regions that would be expected, even in moderately small inversions, if gene conversion were greatly restricted at inversion breakpoints. Comparison of the proximal breakpoint region in species of the obscura group shows that this breakpoint lies in a small high-turnover fragment within a long collinear region (∼300 kb). © 2012 The Author(s). Evolution© 2012 The Society for the Study of Evolution.

  14. Small Deletion Variants Have Stable Breakpoints Commonly Associated with Alu Elements

    PubMed Central

    Coin, Lachlan J. M.; Steinfeld, Israel; Yakhini, Zohar; Sladek, Rob; Froguel, Philippe; Blakemore, Alexandra I. F.

    2008-01-01

    Copy number variants (CNVs) contribute significantly to human genomic variation, with over 5000 loci reported, covering more than 18% of the euchromatic human genome. Little is known, however, about the origin and stability of variants of different size and complexity. We investigated the breakpoints of 20 small, common deletions, representing a subset of those originally identified by array CGH, using Agilent microarrays, in 50 healthy French Caucasian subjects. By sequencing PCR products amplified using primers designed to span the deleted regions, we determined the exact size and genomic position of the deletions in all affected samples. For each deletion studied, all individuals carrying the deletion share identical upstream and downstream breakpoints at the sequence level, suggesting that the deletion event occurred just once and later became common in the population. This is supported by linkage disequilibrium (LD) analysis, which has revealed that most of the deletions studied are in moderate to strong LD with surrounding SNPs, and have conserved long-range haplotypes. Analysis of the sequences flanking the deletion breakpoints revealed an enrichment of microhomology at the breakpoint junctions. More significantly, we found an enrichment of Alu repeat elements, the overwhelming majority of which intersected deletion breakpoints at their poly-A tails. We found no enrichment of LINE elements or segmental duplications, in contrast to other reports. Sequence analysis revealed enrichment of a conserved motif in the sequences surrounding the deletion breakpoints, although whether this motif has any mechanistic role in the formation of some deletions has yet to be determined. Considered together with existing information on more complex inherited variant regions, and reports of de novo variants associated with autism, these data support the presence of different subgroups of CNV in the genome which may have originated through different mechanisms. PMID:18769679

  15. Determination of moxifloxacin anaerobic susceptibility breakpoints according to the Clinical and Laboratory Standards Institute guidelines.

    PubMed

    Ambler, Jane; Rennie, Robert; Poupard, James; Koeth, Laura; Stass, Heino; Endermann, Rainer; Choudhri, Shurjeel

    2008-05-01

    A summary of the key data presented to Clinical and Laboratory Standards Institute (CLSI, formerly National Committee for Clinical and Laboratory Standards) in determination of moxifloxacin anaerobic breakpoints is presented. The breakpoint analysis required review of a variety of data, including bacteriologic and clinical outcomes by MIC of anaerobic isolates from prospective clinical trials in patients with complicated intra-abdominal infections, human and animal pharmacokinetic/pharmacodynamic (PK/PD) information and in vitro models, MIC distributions of indicated organisms, and animal model efficacy data for strains with MIC values around prospective breakpoints. The compilation of the various components of this breakpoint analysis supports the US Food and Drug Administration (FDA) and CLSI moxifloxacin anaerobic breakpoints of < or =2 mg/L (susceptible), 4 mg/L (intermediate), and > or =8 mg/L (resistant), and provides information to European investigators for interpretation of MICs prior to establishment of the European Committee on Antimicrobial Susceptibility Testing breakpoints.

  16. Validation of EUCAST zone diameter breakpoints against reference broth microdilution.

    PubMed

    Bengtsson, S; Bjelkenbrant, C; Kahlmeter, G

    2014-06-01

    The European Committee on Antimicrobial Susceptibility Testing (EUCAST) began harmonizing clinical breakpoints in Europe 2002. In 2009, work to develop a disc diffusion method began and the first disc diffusion breakpoints calibrated to EUCAST clinical MIC breakpoints were published in December 2009. In this study we validated EUCAST clinical zone diameter breakpoints against the International Standard Organization (ISO) reference broth microdilution. A collection of 544 isolates (238 Gram-negative and 306 Gram-positive) were tested against a panel of antimicrobial agents. Antimicrobial susceptibility testing was performed with broth microdilution as described by ISO and disc diffusion in accordance with EUCAST methodology. Inhibition zone diameters and MIC values were interpreted and categorized (S, I and R) according to EUCAST clinical breakpoint table version 2.0. Categorical agreement (CA) as well as minor (mD), major (MD) and very major (VMD) discrepancies were determined. There was in general good correlation between susceptibility test results obtained with disc diffusion and broth microdilution. Overall CA was 97.3% for all combinations of organisms and antimicrobial agents (n = 5231) and the overall discrepancy rates were 110 (2.1%) mD, 24 (0.5%) MD and 7 (0.1%) VMD. The overall CA for Gram-positive and Gram-negative organisms were 98.7% (2346 tests) and 96.2% (2942 tests), respectively. Seven VMD were observed, five for Gram-positive organisms (coagulase negative staphylococci (n = 2) and Staphylococcus aureus (n = 3)) and two for Gram-negative organisms (Pseudomonas aeruginosa). Minor discrepancies were mainly observed in Gram-negatives and were related to different antimicrobial agents and species. © 2013 The Authors Clinical Microbiology and Infection © 2013 European Society of Clinical Microbiology and Infectious Diseases.

  17. Second-line drug susceptibility breakpoints for Mycobacterium tuberculosis using the MODS assay.

    PubMed

    Trollip, A P; Moore, D; Coronel, J; Caviedes, L; Klages, S; Victor, T; Romancenco, E; Crudu, V; Ajbani, K; Vineet, V P; Rodrigues, C; Jackson, R L; Eisenach, K; Garfein, R S; Rodwell, T C; Desmond, E; Groessl, E J; Ganiats, T G; Catanzaro, A

    2014-02-01

    To establish breakpoint concentrations for the fluoroquinolones (moxifloxacin [MFX] and ofloxacin [OFX]) and injectable second-line drugs (amikacin [AMK], kanamycin [KM] and capreomycin [CPM]) using the microscopic observation drug susceptibility (MODS) assay. A multinational study conducted between February 2011 and August 2012 in Peru, India, Moldova and South Africa. In the first phase, breakpoints for the fluoroquinolones and injectable second-line drugs (n = 58) were determined. In the second phase, MODS second-line drug susceptibility testing (DST) as an indirect test was compared to MGIT™ DST (n = 89). In the third (n = 30) and fourth (n = 156) phases, we determined the reproducibility and concordance of MODS second-line DST directly from sputum. Breakpoints for MFX (0.5 μg/ml), OFX (1 μg/ml), AMK (2 μg/ml), KM (5 μg/ml) and CPM (2.5 μg/ml) were determined. In all phases, MODS results were highly concordant with MGIT DST. The few discrepancies suggest that the MODS breakpoint concentrations for some drugs may be too low. MODS second-line DST yielded comparable results to MGIT second-line DST, and is thus a promising alternative. Further studies are needed to confirm the accuracy of the drug breakpoints and the reliability of MODS second-line DST as a direct test.

  18. Breakpoints for antifungal agents: an update from EUCAST focussing on echinocandins against Candida spp. and triazoles against Aspergillus spp.

    PubMed

    Arendrup, Maiken C; Cuenca-Estrella, Manuel; Lass-Flörl, Cornelia; Hope, William W

    2013-12-01

    Candida and Aspergillus infections have emerged as significant pathogens in recent decades. During this same time, broad spectrum triazole and echinocandin antifungal agents have been developed and increasingly used. One consequence of widespread use is leading to the emergence of mutants with acquired resistance mutations. Therefore, accurate susceptibility testing and appropriate clinical breakpoints for the interpretation of susceptibility results have become increasingly important. Here we review the underlying methodology by which breakpoints have been selected by EUCAST (European Committee on Antimicrobial Susceptibility Testing). Five parameters are evaluated: dosing regimens used; EUCAST MIC distributions from multiple laboratories, species and compound specific epidemiological cut off values (upper MIC limits of wild type isolates or ECOFFs), pharmacokinetic/pharmacodynamic relationships and targets associated with outcome and finally clinical data by species and MIC when available. The general principles are reviewed followed by a detailed review of the individual aspects for Candida species and the three echinocandins and for Aspergillus and the three mould-active azoles. This review provides an update of the subcommittee on antifungal susceptibility testing (AFST) of the EUCAST methodology and summarises the current EUCAST breakpoints for Candida and Aspergillus. Recommendations about applicability of antifungal susceptibility testing in the routine setting are also included. Copyright © 2014 Elsevier Ltd. All rights reserved.

  19. Segmental Duplication, Microinversion, and Gene Loss Associated with a Complex Inversion Breakpoint Region in Drosophila

    PubMed Central

    Calvete, Oriol; González, Josefa; Betrán, Esther; Ruiz, Alfredo

    2012-01-01

    Chromosomal inversions are usually portrayed as simple two-breakpoint rearrangements changing gene order but not gene number or structure. However, increasing evidence suggests that inversion breakpoints may often have a complex structure and entail gene duplications with potential functional consequences. Here, we used a combination of different techniques to investigate the breakpoint structure and the functional consequences of a complex rearrangement fixed in Drosophila buzzatii and comprising two tandemly arranged inversions sharing the middle breakpoint: 2m and 2n. By comparing the sequence in the breakpoint regions between D. buzzatii (inverted chromosome) and D. mojavensis (noninverted chromosome), we corroborate the breakpoint reuse at the molecular level and infer that inversion 2m was associated with a duplication of a ∼13 kb segment and likely generated by staggered breaks plus repair by nonhomologous end joining. The duplicated segment contained the gene CG4673, involved in nuclear transport, and its two nested genes CG5071 and CG5079. Interestingly, we found that other than the inversion and the associated duplication, both breakpoints suffered additional rearrangements, that is, the proximal breakpoint experienced a microinversion event associated at both ends with a 121-bp long duplication that contains a promoter. As a consequence of all these different rearrangements, CG5079 has been lost from the genome, CG5071 is now a single copy nonnested gene, and CG4673 has a transcript ∼9 kb shorter and seems to have acquired a more complex gene regulation. Our results illustrate the complex effects of chromosomal rearrangements and highlight the need of complementing genomic approaches with detailed sequence-level and functional analyses of breakpoint regions if we are to fully understand genome structure, function, and evolutionary dynamics. PMID:22328714

  20. Interstitial telomeric sequences in human chromosomes cluster with common fragile sites, mutagen sensitive sites, viral integration sites, cancer breakpoints, proto-oncogenes and breakpoints involved in primate evolution

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Adekunle, S.S.A.; Wyandt, H.; Mark, H.F.L.

    1994-09-01

    Recently we mapped the telomeric repeat sequences to 111 interstitial sites in the human genome and to sites of gaps and breaks induced by aphidicolin and sister chromatid exchange sites detected by BrdU. Many of these sites correspond to conserved fragile sites in man, gorilla and chimpazee, to sites of conserved sister chromatid exchange in the mammalian X chromosome, to mutagenic sensitive sites, mapped locations of proto-oncogenes, breakpoints implicated in primate evolution and to breakpoints indicated as the sole anomaly in neoplasia. This observation prompted us to investigate if the interstitial telomeric sites cluster with these sites. An extensive literaturemore » search was carried out to find all the available published sites mentioned above. For comparison, we also carried out a statistical analysis of the clustering of the sites of the telomeric repeats with the gene locations where only nucleotide mutations have been observed as the only chromosomal abnormality. Our results indicate that the telomeric repeats cluster most with fragile sites, mutagenic sensitive sites and breakpoints implicated in primate evolution and least with cancer breakpoints, mapped locations of proto-oncogenes and other genes with nucleotide mutations.« less

  1. Performance of Vitek 2 for Antimicrobial Susceptibility Testing of Enterobacteriaceae with Vitek 2 (2009 FDA) and 2014 CLSI Breakpoints

    PubMed Central

    Bobenchik, April M.; Deak, Eszter; Hindler, Janet A.; Charlton, Carmen L.

    2014-01-01

    Vitek 2 (bioMérieux Inc., Durham, NC) is a widely used commercial antimicrobial susceptibility test system. We compared the MIC results obtained using the Vitek 2 AST-GN69 and AST-XN06 cards to those obtained by CLSI broth microdilution (BMD) for 255 isolates of Enterobacteriaceae, including 25 isolates of carbapenem-resistant Enterobacteriaceae. In total, 25 antimicrobial agents were examined. For 10 agents, the MIC data were evaluated using two sets of breakpoints: (i) the Vitek 2 breakpoints, which utilized the 2009 FDA breakpoints at the time of the study and are equivalent to the 2009 CLSI M100-S19 breakpoints, and (ii) the 2014 CLSI M100-S24 breakpoints. There was an overall 98.7% essential agreement (EA). The categorical agreement was 95.5% (CA) using the Vitek 2 breakpoints and 95.7% using the CLSI breakpoints. There was 1 very major error (VME) (0.05%) observed using the Vitek 2 breakpoints (cefazolin) and 8 VMEs (0.5%) using the CLSI breakpoints (2 each for aztreonam, cefepime, and ceftriaxone, and 1 for cefazolin and ceftazidime). Fifteen major errors (MEs) (0.4%) were noted using the Vitek 2 breakpoints and 8 (0.5%) using the CLSI breakpoints. Overall, the Vitek 2 performance was comparable to that of BMD for testing a limited number of Enterobacteriaceae commonly isolated by clinical laboratories. Ongoing studies are warranted to assess performance in isolates with emerging resistance. PMID:25540403

  2. Pharmacodynamics of Doxycycline and Tetracycline against Staphylococcus pseudintermedius: Proposal of Canine-Specific Breakpoints for Doxycycline

    PubMed Central

    Papich, Mark G.; Turnidge, John; Guardabassi, Luca

    2013-01-01

    Doxycycline is a tetracycline that has been licensed for veterinary use in some countries, but no clinical breakpoints are available for veterinary pathogens. The objectives of this study were (i) to establish breakpoints for doxycycline and (ii) to evaluate the use of tetracycline as a surrogate to predict the doxycycline susceptibility of Staphylococcus pseudintermedius isolates. MICs and inhibition zone diameters were determined for 168 canine S. pseudintermedius isolates according to Clinical and Laboratory Standards Institute (CLSI) standards. Tetracycline resistance genes were detected by PCR, and time-kill curves were determined for representative strains. In vitro pharmacodynamic and target animal pharmacokinetic data were analyzed by Monte Carlo simulation (MCS) for the development of MIC interpretive criteria. Optimal zone diameter breakpoints were defined using the standard error rate-bounded method. The two drugs displayed bacteriostatic activity and bimodal MIC distributions. Doxycycline was more active than tetracycline in non-wild-type strains. MCS and target attainment analysis indicated a certainty of ≥90% for attaining an area under the curve (AUC)/MIC ratio of >25 with a standard dosage of doxycycline (5 mg/kg of body weight every 12 h) for strains with MICs of ≤0.125 μg/ml. Tetracycline predicted doxycycline susceptibility, but current tetracycline breakpoints were inappropriate for the interpretation of doxycycline susceptibility results. Accordingly, canine-specific doxycycline MIC breakpoints (susceptible, ≤0.125 μg/ml; intermediate, 0.25 μg/ml; resistant, ≥0.5 μg/ml) and zone diameter breakpoints (susceptible, ≥25 mm; intermediate, 21 to 24 mm; resistant, ≤20 mm) and surrogate tetracycline MIC breakpoints (susceptible, ≤0.25 μg/ml; intermediate, 0.5 μg/ml; resistant, ≥1 μg/ml) and zone diameter breakpoints (susceptible, ≥23 mm; intermediate, 18 to 22 mm; resistant, ≤17 mm) were proposed based on the data generated

  3. Breakpoint-forced and bound long waves in the nearshore: A model comparison

    USGS Publications Warehouse

    List, Jeffrey H.; ,

    1993-01-01

    A finite-difference model is used to compare long wave amplitudes arising from two-group forced generation mechanisms in the nearshore: long waves generated at a time-varying breakpoint and the shallow-water extension of the bound long wave. Plane beach results demonstrate that the strong frequency selection in the outgoing wave predicted by the breakpoint-forcing mechanism may not be observable in field data due to this wave's relatively small size and its predicted phase relation with the bound wave. Over a bar/trough nearshore, it is shown that a strong frequency selection in shoreline amplitudes is not a unique result of the time-varying breakpoint model, but a general result of the interaction between topography and any broad-banded forcing of nearshore long waves.

  4. A new approach to assess COPD by identifying lung function break-points

    PubMed Central

    Eriksson, Göran; Jarenbäck, Linnea; Peterson, Stefan; Ankerst, Jaro; Bjermer, Leif; Tufvesson, Ellen

    2015-01-01

    Purpose COPD is a progressive disease, which can take different routes, leading to great heterogeneity. The aim of the post-hoc analysis reported here was to perform continuous analyses of advanced lung function measurements, using linear and nonlinear regressions. Patients and methods Fifty-one COPD patients with mild to very severe disease (Global Initiative for Chronic Obstructive Lung Disease [GOLD] Stages I–IV) and 41 healthy smokers were investigated post-bronchodilation by flow-volume spirometry, body plethysmography, diffusion capacity testing, and impulse oscillometry. The relationship between COPD severity, based on forced expiratory volume in 1 second (FEV1), and different lung function parameters was analyzed by flexible nonparametric method, linear regression, and segmented linear regression with break-points. Results Most lung function parameters were nonlinear in relation to spirometric severity. Parameters related to volume (residual volume, functional residual capacity, total lung capacity, diffusion capacity [diffusion capacity of the lung for carbon monoxide], diffusion capacity of the lung for carbon monoxide/alveolar volume) and reactance (reactance area and reactance at 5Hz) were segmented with break-points at 60%–70% of FEV1. FEV1/forced vital capacity (FVC) and resonance frequency had break-points around 80% of FEV1, while many resistance parameters had break-points below 40%. The slopes in percent predicted differed; resistance at 5 Hz minus resistance at 20 Hz had a linear slope change of −5.3 per unit FEV1, while residual volume had no slope change above and −3.3 change per unit FEV1 below its break-point of 61%. Conclusion Continuous analyses of different lung function parameters over the spirometric COPD severity range gave valuable information additional to categorical analyses. Parameters related to volume, diffusion capacity, and reactance showed break-points around 65% of FEV1, indicating that air trapping starts to dominate

  5. Heterogeneity of chromosome 22 breakpoint in Philadelphia-positive (Ph/sup +/) acute lymphocytic leukemia

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Erikson, J.; Griffin, C.A.; Ar-Rushdi, A.

    1986-03-01

    In chronic myelogenous leukemias (CML) with the t(9;22)(q34;q11) chromosome translocation the breakpoints on chromosome 22 occur within a 5.8-kilobase segment of DNA referred to as breakpoint cluster region (bcr). The same cytogenetically indinstinguishable translocation occurs in approximately 10% of patients with acute lymphocytic leukemias (ALL). In this study the authors have investigated the chromosome breakpoints in several cases of ALL carrying the t(9;22) translocation. In three of five cases of ALL they found that the bcr region was not involved in the chromosome rearrangement and that the 22q11 chromosome breakpoints were proximal (5') to the bcr region at band 22q11.more » In addition, they observed normal size bcr and c-alb transcripts in an ALL cell line carrying the t(9;22) translocation. They conclude, therefore, that if c-alb is inappropriately expressed in ALL cells without bcr rearrangements, the genetic mechanism of activation must be different from that reported for CML.« less

  6. Translocation and deletion breakpoints in cancer genomes are associated with potential non-B DNA-forming sequences.

    PubMed

    Bacolla, Albino; Tainer, John A; Vasquez, Karen M; Cooper, David N

    2016-07-08

    Gross chromosomal rearrangements (including translocations, deletions, insertions and duplications) are a hallmark of cancer genomes and often create oncogenic fusion genes. An obligate step in the generation of such gross rearrangements is the formation of DNA double-strand breaks (DSBs). Since the genomic distribution of rearrangement breakpoints is non-random, intrinsic cellular factors may predispose certain genomic regions to breakage. Notably, certain DNA sequences with the potential to fold into secondary structures [potential non-B DNA structures (PONDS); e.g. triplexes, quadruplexes, hairpin/cruciforms, Z-DNA and single-stranded looped-out structures with implications in DNA replication and transcription] can stimulate the formation of DNA DSBs. Here, we tested the postulate that these DNA sequences might be found at, or in close proximity to, rearrangement breakpoints. By analyzing the distribution of PONDS-forming sequences within ±500 bases of 19 947 translocation and 46 365 sequence-characterized deletion breakpoints in cancer genomes, we find significant association between PONDS-forming repeats and cancer breakpoints. Specifically, (AT)n, (GAA)n and (GAAA)n constitute the most frequent repeats at translocation breakpoints, whereas A-tracts occur preferentially at deletion breakpoints. Translocation breakpoints near PONDS-forming repeats also recur in different individuals and patient tumor samples. Hence, PONDS-forming sequences represent an intrinsic risk factor for genomic rearrangements in cancer genomes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Zoom‐in comparative genomic hybridisation arrays for the characterisation of variable breakpoint contiguous gene syndromes

    PubMed Central

    Johnston, Jennifer J; Walker, Robert L; Davis, Sean; Facio, Flavia; Turner, Joyce T; Bick, David P; Daentl, Donna L; Ellison, Jay W; Meltzer, Paul S; Biesecker, Leslie G

    2007-01-01

    Contiguous gene syndromes cause disorders via haploinsufficiency for adjacent genes. Some contiguous gene syndromes (CGS) have stereotypical breakpoints, but others have variable breakpoints. In CGS that have variable breakpoints, the extent of the deletions may be correlated with severity. The Greig cephalopolysyndactyly contiguous gene syndrome (GCPS‐CGS) is a multiple malformation syndrome caused by haploinsufficiency of GLI3 and adjacent genes. In addition, non‐CGS GCPS can be caused by deletions or duplications in GLI3. Although fluorescence in situ hybridisation (FISH) can identify large deletion mutations in patients with GCPS or GCPS‐CGS, it is not practical for identification of small intragenic deletions or insertions, and it is difficult to accurately characterise the extent of the large deletions using this technique. We have designed a custom comparative genomic hybridisation (CGH) array that allows identification of deletions and duplications at kilobase resolution in the vicinity of GLI3. The array averages one probe every 730 bp for a total of about 14 000 probes over 10 Mb. We have analysed 16 individuals with known or suspected deletions or duplications. In 15 of 16 individuals (14 deletions and 1 duplication), the array confirmed the prior results. In the remaining patient, the normal CGH array result was correct, and the prior assessment was a false positive quantitative polymerase chain reaction result. We conclude that high‐density CGH array analysis is more sensitive than FISH analysis for detecting deletions and provides clinically useful results on the extent of the deletion. We suggest that high‐density CGH array analysis should replace FISH analysis for assessment of deletions and duplications in patients with contiguous gene syndromes caused by variable deletions. PMID:17098889

  8. Molecular definition of breakpoints associated with human Xq isochromosomes: Implications for mechanisms of formation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wolff, D.J.; Miller, A.P.; Schwartz, S.

    1996-01-01

    To test the centromere misdivision model of isochromosome formation, we have defined the breakpoints of cytogenetically monocentric and dicentric Xq isochromosomes (i(Xq)) from Turner syndrome probands, using FISH with cosmids and YACs derived from a contig spanning proximal Xp. Seven different pericentromeric breakpoints were identified, with 10 of 11 of the i(Xq)s containing varying amounts of material from Xp. Only one of the eight cytogenetically monocentric i(Xq)s demonstrated a single alpha-satellite (DXZ1) signal, consistent with classical models involving centromere misdivision. The remaining seven were inconsistent with such a model and had breakpoints that spanned proximal Xp11.21: one was between DXZ1more » and the most proximal marker, ZXDA; one occurred between the duplicated genes, ZXDA and ZXDB; two were {approximately}2 Mb from DXZ1; two were adjacent to ALAS2 located 3.5 Mb from DXZ1; and the largest had a breakpoint just distal to DXS1013E, indicating the inclusion of 8 Mb of Xp DNA between centromeres. The three cytologically dicentric i(Xq)s had breakpoints distal to DXS423E in Xp11.22 and therefore contained {ge}12 Mb of DNA between centromeres. These data demonstrate that the majority of breakpoints resulting in i(Xq) formation are in band Xp11.2 and not in the centromere itself. Therefore, we hypothesize that the predominant mechanism of i(Xq) formation involves sequences in the proximal short arm that are prone to breakage and reunion events between sister chromatids or homologous X chromosomes. 39 refs., 4 figs., 2 tabs.« less

  9. The distribution of MLL breakpoints correlates with outcome in infant acute leukaemia.

    PubMed

    Emerenciano, Mariana; Meyer, Claus; Mansur, Marcela B; Marschalek, Rolf; Pombo-de-Oliveira, Maria S

    2013-04-01

    Acute leukaemia in early childhood - and mainly infant leukaemia (IL) - is characterized by acquired genetic alterations, most commonly by the presence of distinct MLL rearrangements (MLL-r). The aim of this study was to investigate possible correlations between clinical features and molecular analyses of a series of 545 childhood leukaemia (≤24 months of age) cases: 385 acute lymphoblastic leukaemia (ALL) and 160 acute myeloid leukaemia (AML). The location of the genomic breakpoints was determined in a subset of 30 MLL-r cases. The overall survival of the investigated cohort was 60·5%, as determined by the Kaplan-Meier method. Worse outcomes were associated with age at diagnosis ≤6 months (P < 0·001), high white blood cell count (P = 0·001), and MLL-r (P = 0·002) in ALL, while children with AML displayed a poorer outcome (P = 0·009) regardless of their age strata. Moreover, we present first evidence that MLL-r patients with poor outcome preferentially displayed chromosomal breakpoints within MLL intron 11. Based on the literature, most MLL-r IL display a breakpoint localization towards intron 11, which in turn may explain their worse clinical course. In summary, the MLL breakpoint localization is of clinical importance and should be considered as a novel outcome predictor for MLL-r patients. © 2013 Blackwell Publishing Ltd.

  10. Identification of chromosome 7 inversion breakpoints in an autistic family narrows candidate region for autism susceptibility.

    PubMed

    Cukier, Holly N; Skaar, David A; Rayner-Evans, Melissa Y; Konidari, Ioanna; Whitehead, Patrice L; Jaworski, James M; Cuccaro, Michael L; Pericak-Vance, Margaret A; Gilbert, John R

    2009-10-01

    Chromosomal breaks and rearrangements have been observed in conjunction with autism and autistic spectrum disorders. A chromosomal inversion has been previously reported in autistic siblings, spanning the region from approximately 7q22.1 to 7q31. This family is distinguished by having multiple individuals with autism and associated disabilities. The region containing the inversion has been strongly implicated in autism by multiple linkage studies, and has been particularly associated with language defects in autism as well as in other disorders with language components. Mapping of the inversion breakpoints by FISH has localized the inversion to the region spanning approximately 99-108.75 Mb of chromosome 7. The proximal breakpoint has the potential to disrupt either the coding sequence or regulatory regions of a number of cytochrome P450 genes while the distal region falls in a relative gene desert. Copy number variant analysis of the breakpoint regions detected no duplication or deletion that could clearly be associated with disease status. Association analysis in our autism data set using single nucleotide polymorphisms located near the breakpoints showed no significant association with proximal breakpoint markers, but has identified markers near the distal breakpoint ( approximately 108-110 Mb) with significant associations to autism. The chromosomal abnormality in this family strengthens the case for an autism susceptibility gene in the chromosome 7q22-31 region and targets a candidate region for further investigation.

  11. The Newick utilities: high-throughput phylogenetic tree processing in the UNIX shell.

    PubMed

    Junier, Thomas; Zdobnov, Evgeny M

    2010-07-01

    We present a suite of Unix shell programs for processing any number of phylogenetic trees of any size. They perform frequently-used tree operations without requiring user interaction. They also allow tree drawing as scalable vector graphics (SVG), suitable for high-quality presentations and further editing, and as ASCII graphics for command-line inspection. As an example we include an implementation of bootscanning, a procedure for finding recombination breakpoints in viral genomes. C source code, Python bindings and executables for various platforms are available from http://cegg.unige.ch/newick_utils. The distribution includes a manual and example data. The package is distributed under the BSD License. thomas.junier@unige.ch

  12. Remote sensing of the correlation between breakpoint oscillations and infragravity waves in the surf and swash zone

    NASA Astrophysics Data System (ADS)

    Moura, T.; Baldock, T. E.

    2017-04-01

    A novel remote sensing methodology to determine the dominant infragravity mechanism in the inner surf and swash zone in the field is presented. Video observations of the breakpoint motion are correlated with the shoreline motion and inner surf zone water levels to determine the relationship between the time-varying breakpoint oscillations and the shoreline motion. The results of 13 field data sets collected from three different beaches indicate that, inside the surf zone, the dominance of bound wave or breakpoint forcing is strongly dependent on the surf zone width and the type of short wave breaking. Infragravity generation by bound wave release was stronger for conditions with relatively narrow surf zones and plunging waves; breakpoint forcing was dominant for wider surf zones and spilling breaker conditions.

  13. Refining the 22q11.2 deletion breakpoints in DiGeorge syndrome by aCGH

    PubMed Central

    Bittel, D.C.; Yu, S.; Newkirk, H.; Kibiryeva, N.; Holt, S.; Butler, M.G.; Cooley, L.D.

    2009-01-01

    Hemizygous deletions of the chromosome 22q11.2 region result in the 22q11.2 deletion syndrome also referred to as DiGeorge, Velocardiofacial or Shprintzen syndromes. The phenotype is variable but commonly includes conotruncal cardiac defects, palatal abnormalities, learning and behavioral problems, immune deficiency, and facial anomalies. Four distinct highly homologous blocks of low copy number repeat sequences (LCRs) flank the deletion region. Mispairing of LCRs during meiosis with unequal meiotic exchange is assumed to cause the recurrent and consistent deletions. The proximal LCR is reportedly located at 22q11.2 from 17.037 to 17.083 Mb while the distal LCR is located from 19.835 to 19.880 Mb. Although the chromosome breakpoints are thought to localize to the LCRs, the positions of the breakpoints have been investigated in only a few individuals. Therefore, we used high resolution oligonucleotide-based 244K microarray comparative genomic hybridization (aCGH) to resolve the breakpoints in a cohort of 20 subjects with known 22q11.2 deletions. We also investigated copy number variation (CNV) in the rest of the genome. The 22q11.2 breaks occurred on either side of the LCR in our subjects, although more commonly on the distal side of the reported proximal LCR. The proximal breakpoints in our subjects spanned the region from 17.036 to 17.398 Mb. This region includes the genes DGCR6 (DiGeorge syndrome critical region protein 6) and PRODH (proline dehydrogenase 1), along with three open reading frames that may encode proteins of unknown function. The distal breakpoints spanned the region from 19.788 to 20.122 Mb. This region includes the genes GGT2 (gamma-glutamyltransferase-like protein 2), HIC2 (hypermethylated in cancer 2), and multiple transcripts of unknown function. The genes in these two breakpoint regions are variably hemizygous depending on the location of the breakpoints. Our 20 subjects had 254 CNVs throughout the genome, 94 duplications and 160 deletions

  14. Narrowing the DiGeorge Region (DGCR) using DGS-VCFS associated translocation breakpoints

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, M.; Budarf, M.L.; Sellinger, B.

    1994-09-01

    The initial evidence linking 22q11 with DiGeorge syndrome (DGS) came from identification of DGS patients with unbalanced translocations resulting in loss of 22pter{r_arrow}q11. Molecular detection of 22q11.2 deletions in over 85% of our DGS and VCFS patient population confirms the role of 22q11 haploinsufficiency in the etiology of these two disorders. In the present study, DGS/VCFS-associated translocations are used to further refine the DGS minimal critical region. We obtained previously described cell lines: GM5878 [t(10;22)], GM5401 [t(4;22)], GM0980 [t(11;22)], and LGL6012 [t(20;22)]. Lymphoblastoid cell lines were established from two new unbalanced translocations, [t(15;22)(p11;q11)] and [t(12;22)(p13.31;q11.2)] and from a family withmore » balanced and unbalanced forms of a t(X;22)(p22.31;q11). All probands are missing 22pter{r_arrow}q11 and have mild dysmorphia, short stature, frequent infections and developmental delay. Cleft palate was also seen in the two sibs resulting from malsegregation of the t(X;22)mat. These seven breakpoints were positioned by FISH utilizing cosmids from 22q11.2. The cosmids include the loci D22S75 (N25), D22S66 (160b), and D22S259 (R32) which we have previously used to define the DGS/VCFS commonly deleted region. The t(12;22) and t(20;22) breakpoints map distal to R32. Four translocation breakpoints map between N25 and R32: CEN - N25 - t(15;22) - t(11;22) - t(10;22) - 160b - t(4;22) - R32 - TEL. The t(X;22) breakpoint lies between the proximal flanking locus D22S36 (pH11) and N25, suggesting that genes critical to the phenotype may lie between these markers. However, the der(X) is inactivated in both sibs, raising the possibility that spreading of inactivation to the translocated, 22-derived segment may silence gene(s) distal to the breakpoint. Thus, the DGCR has been narrowed to a region between D22S36 and the t(15;22) breakpoint. This enables us to narrow the search for the critical gene(s) deleted in patients with DGS and

  15. Large inverted repeats within Xp11.2 are present at the breakpoints of isodicentric X chromosomes in Turner syndrome.

    PubMed

    Scott, Stuart A; Cohen, Ninette; Brandt, Tracy; Warburton, Peter E; Edelmann, Lisa

    2010-09-01

    Turner syndrome (TS) results from whole or partial monosomy X and is mediated by haploinsufficiency of genes that normally escape X-inactivation. Although a 45,X karyotype is observed in half of all TS cases, the most frequent variant TS karyotype includes the isodicentric X chromosome alone [46,X,idic(X)(p11)] or as a mosaic [46,X,idic(X)(p11)/45,X]. Given the mechanism of idic(X)(p11) rearrangement is poorly understood and breakpoint sequence information is unknown, this study sought to investigate the molecular mechanism of idic(X)(p11) formation by determining their precise breakpoint intervals. Karyotype analysis and fluorescence in situ hybridization mapping of eight idic(X)(p11) cell lines and three unbalanced Xp11.2 translocation lines identified the majority of breakpoints within a 5 Mb region, from approximately 53 to 58 Mb, in Xp11.1-p11.22, clustering into four regions. To further refine the breakpoints, a high-resolution oligonucleotide microarray (average of approximately 350 bp) was designed and array-based comparative genomic hybridization (aCGH) was performed on all 11 idic(X)(p11) and Xp11.2 translocation lines. aCGH analyses identified all breakpoint regions, including an idic(X)(p11) line with two potential breakpoints, one breakpoint shared between two idic(X)(p11) lines and two Xp translocations that shared breakpoints with idic(X)(p11) lines. Four of the breakpoint regions included large inverted repeats composed of repetitive gene clusters and segmental duplications, which corresponded to regions of copy-number variation. These data indicate that the rearrangement sites on Xp11.2 that lead to isodicentric chromosome formation and translocations are probably not random and suggest that the complex repetitive architecture of this region predisposes it to rearrangements, some of which are recurrent.

  16. Effects of Phylogenetic Tree Style on Student Comprehension

    NASA Astrophysics Data System (ADS)

    Dees, Jonathan Andrew

    Phylogenetic trees are powerful tools of evolutionary biology that have become prominent across the life sciences. Consequently, learning to interpret and reason from phylogenetic trees is now an essential component of biology education. However, students often struggle to understand these diagrams, even after explicit instruction. One factor that has been observed to affect student understanding of phylogenetic trees is style (i.e., diagonal or bracket). The goal of this dissertation research was to systematically explore effects of style on student interpretations and construction of phylogenetic trees in the context of an introductory biology course. Before instruction, students were significantly more accurate with bracket phylogenetic trees for a variety of interpretation and construction tasks. Explicit instruction that balanced the use of diagonal and bracket phylogenetic trees mitigated some, but not all, style effects. After instruction, students were significantly more accurate for interpretation tasks involving taxa relatedness and construction exercises when using the bracket style. Based on this dissertation research and prior studies on style effects, I advocate for introductory biology instructors to use only the bracket style. Future research should examine causes of style effects and variables other than style to inform the development of research-based instruction that best supports student understanding of phylogenetic trees.

  17. Concurrent progressive ratio schedules: Effects of reinforcer probability on breakpoint and response allocation.

    PubMed

    Jarmolowicz, David P; Sofis, Michael J; Darden, Alexandria C

    2016-07-01

    Although progressive ratio (PR) schedules have been used to explore effects of a range of reinforcer parameters (e.g., magnitude, delay), effects of reinforcer probability remain underexplored. The present project used independently progressing concurrent PR PR schedules to examine effects of reinforcer probability on PR breakpoint (highest completed ratio prior to a session terminating 300s pause) and response allocation. The probability of reinforcement on one lever remained at 100% across all conditions while the probability of reinforcement on the other lever was systematically manipulated (i.e., 100%, 50%, 25%, 12.5%, and a replication of 25%). Breakpoints systematically decreased with decreasing reinforcer probabilities while breakpoints on the control lever remained unchanged. Patterns of switching between the two levers were well described by a choice-by-choice unit price model that accounted for the hyperbolic discounting of the value of probabilistic reinforcers. Copyright © 2016 Elsevier B.V. All rights reserved.

  18. Molecular characterization of deletion breakpoints in adults with 22q11 deletion syndrome

    PubMed Central

    Stachon, Andrea C.; Squire, Jeremy A.; Moldovan, Laura; Bayani, Jane; Meyn, Stephen; Chow, Eva; Bassett, Anne S.

    2011-01-01

    22q11 Deletion syndrome (22q11DS) is a common microdeletion syndrome with variable expression, including congenital and later onset conditions such as schizophrenia. Most studies indicate that expression does not appear to be related to length of the deletion but there is limited information on the endpoints of even the common deletion breakpoint regions in adults. We used a real-time quantitative PCR (qPCR) approach to fine map 22q11.2 deletions in 44 adults with 22q11DS, 22 with schizophrenia (SZ; 12 M, 10 F; mean age 35.7 SD 8.0 years) and 22 with no history of psychosis (NP; 8 M, 14 F; mean age 27.1 SD 8.6 years). QPCR data were consistent with clinical FISH results using the TUPLE1 or N25 probes. Two subjects (one SZ, one NP) negative for clinical FISH had atypical 22q11.2 deletions confirmed by FISH using the RP11-138C22 probe. Most (n = 34; 18 SZ, 16 NP) subjects shared a common 3 Mb hemizygous 22q11.2 deletion. However, eight subjects showed breakpoint variability: a more telomeric proximal breakpoint (n = 2), or more centromeric (n = 3) or more telomeric distal breakpoint (n = 3). One NP subject had a proximal nested 1.4 Mb deletion. COMT and TBX1 were deleted in all 44 subjects, and PRODH in 40 subjects (19 SZ, 21 NP). The results delineate proximal and distal breakpoint variants in 22q11DS. Neither deletion extent nor PRODH haploinsufficiency appeared to explain the clinical expression of schizophrenia in the present study. Further studies are needed to elucidate the molecular basis of schizophrenia and clinical heterogeneity in 22q11DS. PMID:17028864

  19. Clinical relevance of the breakpoint sites within the M-BCR in 50 patients from Argentina with chronic myeloid leukemia.

    PubMed

    Giere, I A; Larripa, I B

    1996-08-01

    Fifty patients from Argentina with chronic myeloid leukemia (CML) were studied in order to characterize the breakpoint site within the major breakpoint cluster region (M-BCR) and its relationship with the duration of the chronic phase (CP). The DNA digestion with the restriction enzymes: Bgl II, BAM HI and Hind III and hybridization with the 1.2Kb Hind III-Bgl II bcr probe showed that 56% of cases had the breakpoint in 5'M-bcr region and the remaining 44% in 3'M-bcr region. The duration of chronic phase from diagnosis to the onset of the blast crisis (BC) was correlated with the location of the breakpoint within the M-bcr and no statistical differences were observed between the 5' and the 3' groups. These data indicate that the breakpoint site within the bcr gene is not a prognostic indicator of the duration of CP of the disease.

  20. The Newick utilities: high-throughput phylogenetic tree processing in the Unix shell

    PubMed Central

    Junier, Thomas; Zdobnov, Evgeny M.

    2010-01-01

    Summary: We present a suite of Unix shell programs for processing any number of phylogenetic trees of any size. They perform frequently-used tree operations without requiring user interaction. They also allow tree drawing as scalable vector graphics (SVG), suitable for high-quality presentations and further editing, and as ASCII graphics for command-line inspection. As an example we include an implementation of bootscanning, a procedure for finding recombination breakpoints in viral genomes. Availability: C source code, Python bindings and executables for various platforms are available from http://cegg.unige.ch/newick_utils. The distribution includes a manual and example data. The package is distributed under the BSD License. Contact: thomas.junier@unige.ch PMID:20472542

  1. Gene alterations at Drosophila inversion breakpoints provide prima facie evidence for natural selection as an explanation for rapid chromosomal evolution.

    PubMed

    Guillén, Yolanda; Ruiz, Alfredo

    2012-02-01

    Chromosomal inversions have been pervasive during the evolution of the genus Drosophila, but there is significant variation between lineages in the rate of rearrangement fixation. D. mojavensis, an ecological specialist adapted to a cactophilic niche under extreme desert conditions, is a chromosomally derived species with ten fixed inversions, five of them not present in any other species. In order to explore the causes of the rapid chromosomal evolution in D. mojavensis, we identified and characterized all breakpoints of seven inversions fixed in chromosome 2, the most dynamic one. One of the inversions presents unequivocal evidence for its generation by ectopic recombination between transposon copies and another two harbor inverted duplications of non-repetitive DNA at the two breakpoints and were likely generated by staggered single-strand breaks and repair by non-homologous end joining. Four out of 14 breakpoints lay in the intergenic region between preexisting duplicated genes, suggesting an adaptive advantage of separating previously tightly linked duplicates. Four out of 14 breakpoints are associated with transposed genes, suggesting these breakpoints are fragile regions. Finally two inversions contain novel genes at their breakpoints and another three show alterations of genes at breakpoints with potential adaptive significance. D. mojavensis chromosomal inversions were generated by multiple mechanisms, an observation that does not provide support for increased mutation rate as explanation for rapid chromosomal evolution. On the other hand, we have found a number of gene alterations at the breakpoints with putative adaptive consequences that directly point to natural selection as the cause of D. mojavensis rapid chromosomal evolution.

  2. Gene alterations at Drosophila inversion breakpoints provide prima facie evidence for natural selection as an explanation for rapid chromosomal evolution

    PubMed Central

    2012-01-01

    Background Chromosomal inversions have been pervasive during the evolution of the genus Drosophila, but there is significant variation between lineages in the rate of rearrangement fixation. D. mojavensis, an ecological specialist adapted to a cactophilic niche under extreme desert conditions, is a chromosomally derived species with ten fixed inversions, five of them not present in any other species. Results In order to explore the causes of the rapid chromosomal evolution in D. mojavensis, we identified and characterized all breakpoints of seven inversions fixed in chromosome 2, the most dynamic one. One of the inversions presents unequivocal evidence for its generation by ectopic recombination between transposon copies and another two harbor inverted duplications of non-repetitive DNA at the two breakpoints and were likely generated by staggered single-strand breaks and repair by non-homologous end joining. Four out of 14 breakpoints lay in the intergenic region between preexisting duplicated genes, suggesting an adaptive advantage of separating previously tightly linked duplicates. Four out of 14 breakpoints are associated with transposed genes, suggesting these breakpoints are fragile regions. Finally two inversions contain novel genes at their breakpoints and another three show alterations of genes at breakpoints with potential adaptive significance. Conclusions D. mojavensis chromosomal inversions were generated by multiple mechanisms, an observation that does not provide support for increased mutation rate as explanation for rapid chromosomal evolution. On the other hand, we have found a number of gene alterations at the breakpoints with putative adaptive consequences that directly point to natural selection as the cause of D. mojavensis rapid chromosomal evolution. PMID:22296923

  3. Helicobacter pylori resistance to six antibiotics by two breakpoint systems and resistance evolution in Bulgaria.

    PubMed

    Boyanova, Lyudmila; Gergova, Galina; Evstatiev, Ivailo; Spassova, Zoya; Kandilarov, Naiden; Yaneva, Penka; Markovska, Rumyana; Mitov, Ivan

    2016-01-01

    Helicobacter pylori resistance to antibiotics is the main cause for eradication failures. Antibiotic resistance in 299 H. pylori strains from 233 untreated adults, 26 treated adults, and 40 untreated children was assessed by E tests and, for metronidazole, by breakpoint susceptibility testing and two breakpoint systems. Using EUCAST breakpoints (EBPs) and previous breakpoints (PBPs), overall resistance rates were: amoxicillin 4.0 and 0.6%, metronidazole 33.8 and 33.8%, clarithromycin 28.1 and 27.4%, levofloxacin 19.4 and 19.4%, tetracycline 3.7 and 1.5%, respectively, and rifampin 8.3% (EBP). Multidrug resistance was detected in treated and untreated adults and an untreated child and included 17 (EBPs) and 15 strains (PBPs). Differences between susceptibility categories were found for amoxicillin (3.5% of strains), clarithromycin (0.7%), and tetracycline (2.2%). Using PBPs, from 2005-2007 to 2010-2015, overall primary clarithromycin resistance continued to increase (17.9-25.6%) as noted in our previous study. However, in 2010-2015, overall primary metronidazole (24.0-31.5%) and fluoroquinolone (7.6-18.3%) resistance rates also increased. Primary resistance rates in children and adults were comparable. Briefly, differences in resistance rates by the two breakpoint systems affected the results for three antibiotics. National antibiotic consumption was linked to macrolide resistance in adults. Current primary H. pylori resistance to three antibiotics increased in all untreated patients and in the untreated adults, with the sharpest rise for the fluoroquinolones. The presence of fivefold H. pylori resistance to metronidazole, clarithromycin, tetracycline, levofloxacin, and amoxicillin according to EBPs is alarming.

  4. Germ line insertion of mtDNA at the breakpoint junction of a reciprocal constitutional translocation.

    PubMed

    Willett-Brozick, J E; Savul, S A; Richey, L E; Baysal, B E

    2001-08-01

    Constitutional chromosomal translocations are relatively common causes of human morbidity, yet the DNA double-strand break (DSB) repair mechanisms that generate them are incompletely understood. We cloned, sequenced and analyzed the breakpoint junctions of a familial constitutional reciprocal translocation t(9;11)(p24;q23). Within the 10-kb region flanking the breakpoints, chromosome 11 had 25% repeat elements, whereas chromosome 9 had 98% repeats, 95% of which were L1-type LINE elements. The breakpoints occurred within an L1-type repeat element at 9p24 and at the 3'-end of an Alu sequence at 11q23. At the breakpoint junction of derivative chromosome 9, we discovered an unusually large 41-bp insertion, which showed 100% identity to 12S mitochondrial DNA (mtDNA) between nucleotides 896 and 936 of the mtDNA sequence. Analysis of the human genome failed to show the preexistence of the inserted sequence at normal chromosomes 9 and 11 breakpoint junctions or elsewhere in the genome, strongly suggesting that the insertion was derived from human mtDNA and captured into the junction during the DSB repair process. To our knowledge, these findings represent the first observation of spontaneous germ line insertion of modern human mtDNA sequences and suggest that DSB repair may play a role in inter-organellar gene transfer in vivo. Our findings also provide evidence for a previously unrecognized insertional mechanism in human, by which non-mobile extra-chromosomal fragments can be inserted into the genome at DSB repair junctions.

  5. Distribution of Chromosome Breakpoints in Human Epithelial Cells Exposed to Low- and High-LET Radiations

    NASA Technical Reports Server (NTRS)

    Hada, Megumi; Cucinotta, Francis; Wu, Honglu

    2009-01-01

    The advantage of the multicolor banding in situ hybridization (mBAND) technique is not only its ability to identify simultaneously both inter- and intrachromosome exchanges, but also the ability to measure the breakpoint location along the length of the chromosome in a precision that is unmatched with other traditional banding techniques. Breakpoints on specific regions of a chromosome have been known to associate with specific cancers. The breakpoint distribution in cells after low- and high-LET radiation exposures will also provide the data for biophysical modeling of the chromatin structure, as well as the data for the modeling the formation of radiation-induced chromosome aberrations. In a series of experiments, we studied low- and high-LET radiation-induced chromosome aberrations using the mBAND technique with chromosome 3 painted in 23 different colored bands. Human epithelial cells (CH1 84B5F5/M10) were exposed in vitro to Cs- 137 rays at both low and high dose rates, secondary neutrons with a broad energy spectrum at a low dose rate and 600 MeV/u Fe ions at a high dose rate. The data of both inter- and intrachromosome aberrations involving the painted chromosome have been reported previously. Here we present data of the location of the chromosome breaks along the length of chromosome 3 in the cells after exposures to each of the four radiation scenarios. In comparison to the expected breakpoint distribution based on the length of the bands, the observed distribution appeared to be non-random for both the low- and high-LET radiations. In particular, hot spots towards both ends of the chromosome were found after low-LET irradiations of either low or high dose rates. For both high-LET radiation types (Fe ions and neutrons), the breakpoint distributions were similar, and were much smoother than that for low-LET radiation. The dependence of the breakpoint distribution on the radiation quality requires further investigations.

  6. GeneBreak: detection of recurrent DNA copy number aberration-associated chromosomal breakpoints within genes.

    PubMed

    van den Broek, Evert; van Lieshout, Stef; Rausch, Christian; Ylstra, Bauke; van de Wiel, Mark A; Meijer, Gerrit A; Fijneman, Remond J A; Abeln, Sanne

    2016-01-01

    Development of cancer is driven by somatic alterations, including numerical and structural chromosomal aberrations. Currently, several computational methods are available and are widely applied to detect numerical copy number aberrations (CNAs) of chromosomal segments in tumor genomes. However, there is lack of computational methods that systematically detect structural chromosomal aberrations by virtue of the genomic location of CNA-associated chromosomal breaks and identify genes that appear non-randomly affected by chromosomal breakpoints across (large) series of tumor samples. 'GeneBreak' is developed to systematically identify genes recurrently affected by the genomic location of chromosomal CNA-associated breaks by a genome-wide approach, which can be applied to DNA copy number data obtained by array-Comparative Genomic Hybridization (CGH) or by (low-pass) whole genome sequencing (WGS). First, 'GeneBreak' collects the genomic locations of chromosomal CNA-associated breaks that were previously pinpointed by the segmentation algorithm that was applied to obtain CNA profiles. Next, a tailored annotation approach for breakpoint-to-gene mapping is implemented. Finally, dedicated cohort-based statistics is incorporated with correction for covariates that influence the probability to be a breakpoint gene. In addition, multiple testing correction is integrated to reveal recurrent breakpoint events. This easy-to-use algorithm, 'GeneBreak', is implemented in R ( www.cran.r-project.org ) and is available from Bioconductor ( www.bioconductor.org/packages/release/bioc/html/GeneBreak.html ).

  7. Physical mapping of chromosome 12q breakpoints in lipoma, pleomorphic salivary gland adenoma, uterine leiomyoma, and myxoid liposarcoma

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schoenmakers, E.F.P.M.; Kools, P.F.J.; Mols, R.

    1994-03-15

    The authors report here the physical mapping of recurrent chromosome 12q13-q15 breakpoints in cell lines derived from primary myxoid liposarcoma, lipoma, uterine leiomyoma, and pleomorphic adenoma of the salivary glands. In fluorescence in situ hybridization (FISH) experiments, they first mapped the position of the chromosome 12 translocation breakpoint in uterine leiomyoma cell line LM-30.1/SV40 relative to loci COL2A1, D12S4, D12S17, D12S6, D12S19, D12S8, and D12S7. It mapped between linkage probes CRI-C86 (D12S19) and p7G11 (D12S8). They then isolated YAC clones using CRI-C86- and p7G11-derived sequence-tagged sites, constructed corresponding YAC contigs of 310 and 800 kb, respectively, and a mixture ofmore » them was used to routinely study the various tumor cell lines by FISH analysis. The chromosome 12 breakpoints of all tumor cell lines tested mapped between cosmids LLNL12NCO1-98C10 and LLNL12NCO1-113D12. None of the breakpoints appeared to map within any of the isolated YAC clones. Furthermore, FISH analysis using cosmid LLNL12-NCO1-144G3, which maps at the CHOP locus, revealed that the chromosome 12 breakpoints in all cell lines of the three benign solid tumors that were tested were located distal to the chromosome 12 translocation breakpoint with the CHOP gene in myxoid liposarcoma cells with t(12;16). In conclusion, the studies seem to indicate that the chromosome 12 breakpoints of myxoid liposarcoma, lipoma, uterine leiomyoma, and pleomorphic adenoma of the salivary glands are all clustered within the 7-cM interval between D12S19 and D12S8, with those of the benign solid tumors distal to CHOP. Finally, the MYF5 gene mapped telomeric to LLNL12NCO1-113D12, and the MIP gene mapped centromeric to the chromosome 12 translocation breakpoint in myxoid liposarcoma cells. 56 refs., 5 figs., 3 tabs.« less

  8. Identification of a YAC spanning the translocation breakpoints in uterine leiomyomata, pulmonary chondroid hamartoma, and lipoma: Physical mapping of the 12q14-q15 breakpoint region in uterine leiomyomata

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fejzo, M.S.; Yoon, S.J.; Kucherlapati, R.S.

    1995-03-20

    Uterine leiomyomata are the most common tumors in women and can cause abnormal uterine bleeding, pelvic pain, and infertility. Approximately 200,000 hysterectomies are performed annually in the U.S. to relieve patients of the medical sequelae of these benign neoplasms. Our efforts have focused on cloning the t(12;14)(q14-q15;q23-q24) breakpoint in uterine leiomyoma to further our understanding of the biology of these tumors. Thirty-nine YACs and six cosmids mapping to 12q14-q15 have been mapped by fluorescence in situ hybridization to tumor metaphase chromosomes containing a t(12;14). One YAC spanned the translocation breakpoint and was mapped to tumor metaphases from a pulmonary chondroidmore » hamartoma containing a t(12;14)(q14-q15;q23-q24) and a lipoma containing a t(12;15)(q15;q24); this YAC also spanned the breakpoint in these two tumors, suggesting that the same gene on chromosome 12 may be involved in the pathobiology of these distinct benign neoplasms. 41 refs., 2 figs., 1 tab.« less

  9. First report on an X-linked hypohidrotic ectodermal dysplasia family with X chromosome inversion: Breakpoint mapping reveals the pathogenic mechanism and preimplantation genetics diagnosis achieves an unaffected birth.

    PubMed

    Wu, Tonghua; Yin, Biao; Zhu, Yuanchang; Li, Guangui; Ye, Lijun; Liang, Desheng; Zeng, Yong

    2017-12-01

    To investigate the etiology of X-linked hypohidrotic ectodermal dysplasia (XLHED) in a family with an inversion of the X chromosome [inv(X)(p21q13)] and to achieve a healthy birth following preimplantation genetic diagnosis (PGD). Next generation sequencing (NGS) and Sanger sequencing analysis were carried out to define the inversion breakpoint. Multiple displacement amplification, amplification of breakpoint junction fragments, Sanger sequencing of exon 1 of ED1, haplotyping of informative short tandem repeat markers and gender determination were performed for PGD. NGS data of the proband sample revealed that the size of the possible inverted fragment was over 42Mb, spanning from position 26, 814, 206 to position 69, 231, 915 on the X chromosome. The breakpoints were confirmed by Sanger sequencing. A total of 5 blastocyst embryos underwent trophectoderm biopsy. Two embryos were diagnosed as carriers and three were unaffected. Two unaffected blastocysts were transferred and a singleton pregnancy was achieved. Following confirmation by prenatal diagnosis, a healthy baby was delivered. This is the first report of an XLHED family with inv(X). ED1 is disrupted by the X chromosome inversion in this XLHED family and embryos with the X chromosomal abnormality can be accurately identified by means of PGD. Copyright © 2017. Published by Elsevier B.V.

  10. Molecular analysis of DiGeorge Syndrome-related translocation breakpoints in 22q11.2

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chieffo, C.; Barnoski, B.L.; Emanuel, B.S.

    1994-09-01

    22q11 demonstrates a high frequency of disease-specific rearrangements. Several of the rearrangements are associated with developmental abnormalities such as DiGeorge Syndrome (DGS), Velocardiofacial Syndrome (VCFS), Cat Eye Syndrome (CES) and Supernumerary der(22)t(11;22) Syndrome. DGS and VCFS involve deletions of 22q11.2 resulting from unbalanced translocations or microdeletions. In contrast, CES and Supernumerary der(22)t(11;22) Syndrome result from duplications of this region via inter- or intra- chromosomal exchange. Although the molecular mechanism giving rise to these rearrangements has yet to be elucidated, the presence of known 22q11 repetitive elements are likely to be involved. GM5878 is a 46,XY,t(10;22) cell line from a balancedmore » translocation carrier father of an unbalanced DGS patient. GM0980 is a cell line from a patient with features of DGS/VCFS with an unbalanced karyotype. Using FISH cosmids, we have localized these translocation breakpoints near pH160b (D22S66) which maps to the center of the DiGeorge chromosomal region (DGCR). To further localize the breakpoint of GM5878, overlapping cosmids spanning this region were used as probes for FISH. Use of additional overlapping cosmids allowed the sublocalization of the breakpoint to a 10kb region. A 4.8 kb BglII fragment predicted to cross the breakpoint was isolated. When this fragment was used as a probe to normal and GM5878 DNA, novel bands were detected in GM5878 DNA digested with EcoRI and BglII. Similar analysis of the GM0980 breakpoint is being pursued. Full molecular characterization of these translocations is in progress using inverse PCR to clone the junctional fragments for sequencing. Detailed analysis of the region may reveal molecular features which make this a rearrangement prone area of the genome and help elucidate its relationship to human cytogenetic disease.« less

  11. Genomic Analysis Reveals a Common Breakpoint in Amplifications of the Plasmodium vivax Multidrug Resistance 1 Locus in Thailand

    PubMed Central

    Auburn, Sarah; Serre, David; Pearson, Richard D.; Amato, Roberto; Sriprawat, Kanlaya; To, Sheren; Handayuni, Irene; Suwanarusk, Rossarin; Russell, Bruce; Drury, Eleanor; Stalker, Jim; Miotto, Olivo; Kwiatkowski, Dominic P.; Nosten, Francois; Price, Ric N.

    2016-01-01

    In regions of coendemicity for Plasmodium falciparum and Plasmodium vivax where mefloquine is used to treat P. falciparum infection, drug pressure mediated by increased copy numbers of the multidrug resistance 1 gene (pvmdr1) may select for mefloquine-resistant P. vivax. Surveillance is not undertaken routinely owing in part to methodological challenges in detection of gene amplification. Using genomic data on 88 P. vivax samples from western Thailand, we identified pvmdr1 amplification in 17 isolates, all exhibiting tandem copies of a 37.6–kilobase pair region with identical breakpoints. A novel breakpoint-specific polymerase chain reaction assay was designed to detect the amplification. The assay demonstrated high sensitivity, identifying amplifications in 13 additional, polyclonal infections. Application to 132 further samples identified the common breakpoint in all years tested (2003–2015), with a decline in prevalence after 2012 corresponding to local discontinuation of mefloquine regimens. Assessment of the structure of pvmdr1 amplification in other geographic regions will yield information about the population-specificity of the breakpoints and underlying amplification mechanisms. PMID:27456706

  12. Delineation and physical separation of novel translocation breakpoints on chromosome 1p in two genetically closely associated childhood tumors.

    PubMed

    Steenman, M J; Zijlstra, N; Kruitbosch, D L; Wiesmeijer, C; Larizza, L; Voûte, P A; Westerveld, A; Mannens, M M

    2000-01-01

    Sporadic childhood tumors associated with Beckwith-Wiedemann syndrome (BWS) all show abnormalities of the same region on chromosome 11. In addition to chromosome 11, other chromosome regions are affected in some of these tumor types. In this study we analyzed the region on chromosome 1p involved in the etiology of BWS-associated tumors, Wilms tumor, rhabdomyosarcoma, and hepatoblastoma. For this purpose we determined the location of two novel translocation breakpoints in this chromosome region in cells from a Wilms tumor and cells from a rhabdomyosarcoma. We constructed a map of the region and found that both breakpoints are separated by at least 875 kb. We identified a PAC clone which crosses the rhabdomyosarcoma breakpoint and found several exons within this clone. We established that this breakpoint is located proximal to the PAX7 gene and, therefore, identified a new region involved in the etiology of rhabdomyosarcomas. Copyright 2000 S. Karger AG, Basel

  13. Antimicrobial breakpoint estimation accounting for variability in pharmacokinetics.

    PubMed

    Bi, Goue Denis Gohore; Li, Jun; Nekka, Fahima

    2009-06-26

    Pharmacokinetic and pharmacodynamic (PK/PD) indices are increasingly being used in the microbiological field to assess the efficacy of a dosing regimen. In contrast to methods using MIC, PK/PD-based methods reflect in vivo conditions and are more predictive of efficacy. Unfortunately, they entail the use of one PK-derived value such as AUC or Cmax and may thus lead to biased efficiency information when the variability is large. The aim of the present work was to evaluate the efficacy of a treatment by adjusting classical breakpoint estimation methods to the situation of variable PK profiles. We propose a logical generalisation of the usual AUC methods by introducing the concept of "efficiency" for a PK profile, which involves the efficacy function as a weight. We formulated these methods for both classes of concentration- and time-dependent antibiotics. Using drug models and in silico approaches, we provide a theoretical basis for characterizing the efficiency of a PK profile under in vivo conditions. We also used the particular case of variable drug intake to assess the effect of the variable PK profiles generated and to analyse the implications for breakpoint estimation. Compared to traditional methods, our weighted AUC approach gives a more powerful PK/PD link and reveals, through examples, interesting issues about the uniqueness of therapeutic outcome indices and antibiotic resistance problems.

  14. Antimicrobial breakpoint estimation accounting for variability in pharmacokinetics

    PubMed Central

    Bi, Goue Denis Gohore; Li, Jun; Nekka, Fahima

    2009-01-01

    Background Pharmacokinetic and pharmacodynamic (PK/PD) indices are increasingly being used in the microbiological field to assess the efficacy of a dosing regimen. In contrast to methods using MIC, PK/PD-based methods reflect in vivo conditions and are more predictive of efficacy. Unfortunately, they entail the use of one PK-derived value such as AUC or Cmax and may thus lead to biased efficiency information when the variability is large. The aim of the present work was to evaluate the efficacy of a treatment by adjusting classical breakpoint estimation methods to the situation of variable PK profiles. Methods and results We propose a logical generalisation of the usual AUC methods by introducing the concept of "efficiency" for a PK profile, which involves the efficacy function as a weight. We formulated these methods for both classes of concentration- and time-dependent antibiotics. Using drug models and in silico approaches, we provide a theoretical basis for characterizing the efficiency of a PK profile under in vivo conditions. We also used the particular case of variable drug intake to assess the effect of the variable PK profiles generated and to analyse the implications for breakpoint estimation. Conclusion Compared to traditional methods, our weighted AUC approach gives a more powerful PK/PD link and reveals, through examples, interesting issues about the uniqueness of therapeutic outcome indices and antibiotic resistance problems. PMID:19558679

  15. Effects of breakpoint changes on carbapenem susceptibility rates of Enterobacteriaceae: Results from the SENTRY Antimicrobial Surveillance Program, United States, 2008 to 2012

    PubMed Central

    Rennie, Robert P; Jones, Ronald N

    2014-01-01

    In the absence of clinical resistance, breakpoints for many antimicrobial agents are often set high. Clinical failures following use of the agents over time requires re-evaluation of breakpoints. This is based on patient response, pharmacokinetic/pharmacodynamic information and in vitro minimal inhibitory concentration data. Data from the SENTRY Antimicrobial Surveillance Program has shown that Clinical and Laboratory Standards Institute breakpoint changes for carbapenems that occurred between 2008 and 2012 in North America have resulted in decreased levels of susceptibility for some species. In particular, reduced susceptibility to imipenem was observed for Proteus mirabilis (35%) and Morganella morganii (80%). Minor decreases in susceptibility were also noted for Enterobacter species with ertapenem (5%) and imipenem (4.3%), and Serratia species with imipenem (6.4%). No significant decreases in susceptibility were observed for meropenem following the breakpoint changes. There were no earlier breakpoints established for doripenem. Very few of these Enterobacteriaceae produce carbapenamase enzymes; therefore, the clinical significance of these changes has not yet been clearly determined. In conclusion, ongoing surveillance studies with in vitro minimum inhibitory concentration data are essential in predicting the need for breakpoint changes and in identifying the impact of such changes on the percent susceptibility of different species. PMID:25371693

  16. Breakpoint chlorination and free-chlorine contact time: implications for drinking water N-nitrosodimethylamine concentrations.

    PubMed

    Charrois, Jeffrey W A; Hrudey, Steve E

    2007-02-01

    North American drinking water utilities are increasingly incorporating alternative disinfectants, such as chloramines, in order to comply with disinfection by-product (DBP) regulations. N-Nitrosodimethylamine (NDMA) is a non-halogenated DBP, associated with chloramination, having a drinking water unit risk two to three orders of magnitude greater than currently regulated halogenated DBPs. We quantified NDMA from two full-scale chloraminating water treatment plants in Alberta between 2003 and 2005 as well as conducted bench-scale chloramination/breakpoint experiments to assess NDMA formation. Distribution system NDMA concentrations varied and tended to increase with increasing distribution residence time. Bench-scale disinfection experiments resulted in peak NDMA production near the theoretical monochloramine maximum in the sub-breakpoint region of the disinfection curve. Breakpoints for the raw and partially treated waters tested ranged from 1.9:1 to 2.4:1 (Cl(2):total NH(3)-N, M:M). Bench-scale experiments with free-chlorine contact (2h) before chloramination resulted in significant reductions in NDMA formation (up to 93%) compared to no free-chlorine contact time. Risk-tradeoff issues involving alternative disinfection methods and unregulated DBPs, such as NDMA, are emerging as a major water quality and public health information gap.

  17. Phylogenetic relationships and the occurrence of interspecific recombination between beet chlorosis virus (BChV) and Beet mild yellowing virus (BMYV).

    PubMed

    Kozlowska-Makulska, Anna; Hasiow-Jaroszewska, Beata; Szyndel, Marek S; Herrbach, Etienne; Bouzoubaa, Salah; Lemaire, Olivier; Beuve, Monique

    2015-02-01

    Samples containing two viruses belonging to the genus Polerovirus, beet chlorosis virus (BChV) and beet mild yellowing virus (BMYV), were collected from French and Polish sugar beet fields. The molecular properties of 24 isolates of BChV and BMYV were investigated, and their genetic diversity was examined in the coat protein (CP)- and P0-encoding genes. For the first time, we have demonstrated that beet polerovirus populations include recombinants between BChV and BMYV containing breakpoints within the CP gene. Moreover, a partial correlation between geographic origin and phylogenetic clustering was observed for BMYV isolates.

  18. Genomic Analysis Reveals a Common Breakpoint in Amplifications of the Plasmodium vivax Multidrug Resistance 1 Locus in Thailand.

    PubMed

    Auburn, Sarah; Serre, David; Pearson, Richard D; Amato, Roberto; Sriprawat, Kanlaya; To, Sheren; Handayuni, Irene; Suwanarusk, Rossarin; Russell, Bruce; Drury, Eleanor; Stalker, Jim; Miotto, Olivo; Kwiatkowski, Dominic P; Nosten, Francois; Price, Ric N

    2016-10-15

    In regions of coendemicity for Plasmodium falciparum and Plasmodium vivax where mefloquine is used to treat P. falciparum infection, drug pressure mediated by increased copy numbers of the multidrug resistance 1 gene (pvmdr1) may select for mefloquine-resistant P. vivax Surveillance is not undertaken routinely owing in part to methodological challenges in detection of gene amplification. Using genomic data on 88 P. vivax samples from western Thailand, we identified pvmdr1 amplification in 17 isolates, all exhibiting tandem copies of a 37.6-kilobase pair region with identical breakpoints. A novel breakpoint-specific polymerase chain reaction assay was designed to detect the amplification. The assay demonstrated high sensitivity, identifying amplifications in 13 additional, polyclonal infections. Application to 132 further samples identified the common breakpoint in all years tested (2003-2015), with a decline in prevalence after 2012 corresponding to local discontinuation of mefloquine regimens. Assessment of the structure of pvmdr1 amplification in other geographic regions will yield information about the population-specificity of the breakpoints and underlying amplification mechanisms. © The Author 2016. Published by Oxford University Press for the Infectious Diseases Society of America.

  19. How Accurate and Robust Are the Phylogenetic Estimates of Austronesian Language Relationships?

    PubMed Central

    Greenhill, Simon J.; Drummond, Alexei J.; Gray, Russell D.

    2010-01-01

    We recently used computational phylogenetic methods on lexical data to test between two scenarios for the peopling of the Pacific. Our analyses of lexical data supported a pulse-pause scenario of Pacific settlement in which the Austronesian speakers originated in Taiwan around 5,200 years ago and rapidly spread through the Pacific in a series of expansion pulses and settlement pauses. We claimed that there was high congruence between traditional language subgroups and those observed in the language phylogenies, and that the estimated age of the Austronesian expansion at 5,200 years ago was consistent with the archaeological evidence. However, the congruence between the language phylogenies and the evidence from historical linguistics was not quantitatively assessed using tree comparison metrics. The robustness of the divergence time estimates to different calibration points was also not investigated exhaustively. Here we address these limitations by using a systematic tree comparison metric to calculate the similarity between the Bayesian phylogenetic trees and the subgroups proposed by historical linguistics, and by re-estimating the age of the Austronesian expansion using only the most robust calibrations. The results show that the Austronesian language phylogenies are highly congruent with the traditional subgroupings, and the date estimates are robust even when calculated using a restricted set of historical calibrations. PMID:20224774

  20. Further Effects of Phylogenetic Tree Style on Student Comprehension in an Introductory Biology Course.

    PubMed

    Dees, Jonathan; Bussard, Caitlin; Momsen, Jennifer L

    2018-06-01

    Phylogenetic trees have become increasingly important across the life sciences, and as a result, learning to interpret and reason from these diagrams is now an essential component of biology education. Unfortunately, students often struggle to understand phylogenetic trees. Style (i.e., diagonal or bracket) is one factor that has been observed to impact how students interpret phylogenetic trees, and one goal of this research was to investigate these style effects across an introductory biology course. In addition, we investigated the impact of instruction that integrated diagonal and bracket phylogenetic trees equally. Before instruction, students were significantly more accurate with the bracket style for a variety of interpretation and construction tasks. After instruction, however, students were significantly more accurate only for construction tasks and interpretations involving taxa relatedness when using the bracket style. Thus, instruction that used both styles equally mitigated some, but not all, style effects. These results inform the development of research-based instruction that best supports student understanding of phylogenetic trees.

  1. Breakpoint analysis of the pericentric inversion between chimpanzee chromosome 10 and the homologous chromosome 12 in humans.

    PubMed

    Kehrer-Sawatzki, H; Sandig, C A; Goidts, V; Hameister, H

    2005-01-01

    During this study, we analysed the pericentric inversion that distinguishes human chromosome 12 (HSA12) from the homologous chimpanzee chromosome (PTR10). Two large chimpanzee-specific duplications of 86 and 23 kb were observed in the breakpoint regions, which most probably occurred associated with the inversion. The inversion break in PTR10p caused the disruption of the SLCO1B3 gene in exon 11. However, the 86-kb duplication includes the functional SLCO1B3 locus, which is thus retained in the chimpanzee, although inverted to PTR10q. The second duplication spans 23 kb and does not contain expressed sequences. Eleven genes map to a region of about 1 Mb around the breakpoints. Six of these eleven genes are not among the differentially expressed genes as determined previously by comparing the human and chimpanzee transcriptome of fibroblast cell lines, blood leukocytes, liver and brain samples. These findings imply that the inversion did not cause major expression differences of these genes. Comparative FISH analysis with BACs spanning the inversion breakpoints in PTR on metaphase chromosomes of gorilla (GGO) confirmed that the pericentric inversion of the chromosome 12 homologs in GGO and PTR have distinct breakpoints and that humans retain the ancestral arrangement. These findings coincide with the trend observed in hominoid karyotype evolution that humans have a karyotype close to an ancestral one, while African great apes present with more derived chromosome arrangements. Copyright (c) 2005 S. Karger AG, Basel.

  2. In vitro antibacterial activity of doripenem against clinical isolates from French teaching hospitals: proposition of zone diameter breakpoints.

    PubMed

    Lascols, C; Legrand, P; Mérens, A; Leclercq, R; Armand-Lefevre, L; Drugeon, H B; Kitzis, M D; Muller-Serieys, C; Reverdy, M E; Roussel-Delvallez, M; Moubareck, C; Lemire, A; Miara, A; Gjoklaj, M; Soussy, C-J

    2011-04-01

    The aims of the study were to determine the in vitro activity of doripenem, a new carbapenem, against a large number of bacterial pathogens and to propose zone diameter breakpoints for clinical categorization in France according to the European Committee on Antimicrobial Susceptibility Testing (EUCAST) minimum inhibitory concentration (MIC) breakpoints. The MICs of doripenem were determined by the broth microdilution method against 1,547 clinical isolates from eight French hospitals. The disk diffusion test was performed (10-μg discs) according to the Comité de l'Antibiogramme de la Société Française de Microbiologie (CASFM) method. The MIC(50/90) (mg/L) values were as follows: methicillin-susceptible Staphylococcus aureus (MSSA) (0.03/0.25), methicillin-resistant Staphylococcus aureus (MRSA) (1/2), methicillin-susceptible coagulase-negative staphylococci (MSCoNS) (0.03/0.12), methicillin-resistant coagulase-negative staphylococci (MRCoNS) (2/8), Streptococcus pneumoniae (0.016/0.25), viridans group streptococci (0.016/2), β-hemolytic streptococci (≤0.008/≤0.008), Enterococcus faecalis (2/4), Enterococcus faecium (128/>128), Enterobacteriaceae (0.06/0.25), Pseudomonas aeruginosa (0.5/8), Acinetobacter baumannii (0.25/2), Haemophilus influenzae (0.12/0.25), and Moraxella catarrhalis (0.03/0.06). According to the regression curve, the zone diameter breakpoints were 24 and 19 mm for MICs of 1 and 4 mg/L, respectively. This study confirms the potent in vitro activity of doripenem against Pseudomonas aeruginosa, Acinetobacter, Enterobacteriaceae, MSSA, MSCoNS, and respiratory pathogens. According to the EUCAST MIC breakpoints (mg/L) ≤1/>4 for Enterobacteriaceae, Pseudomonas aeruginosa, and Acinetobacter, and ≤1/>1 for streptococci, pneumococci, and Haemophilus, the zone diameter breakpoints could be (mm) ≥24/<19 and ≥24/<24, respectively.

  3. Using in situ hybridization and PFGE Southern hybridization to detect translocation breakpoints in a BOR/TRPS patient cell line

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gu, J.Z.; Sapru, M.; Smith, D.

    Branchio-oto-renal syndrome (BOR) is an autosomal dominant disorder characterized by ear malformations, cervical fistulae, hearing loss and renal abnormalities. We have integrated the Genethon YAC contig maps with additional markers in the chromosome 8q region genetically linked by a unique patient cell line. This cell line is from a patient who has both the branchio-oto-renal syndrome and tricho-rhino-phalangeal syndrome (TRPS). High resolution cytogenetics demonstrated a direct insertion of materials from 8q13.3q21.13 to 8q24.11. TRPS has been previously linked to deletions involving 8q24.11-q24.13. The rearrangement in this patient suggests that TRPS results from loss of gene function due to insertion atmore » the 8q24.11 breakpoint and the possible location for the BOR gene is at either of the two breakpoints of 8q13.3 and 8q21.13. We have constructed cosmid contigs in 8q24.11. In situ hybridization with cosmids mapped to these locations as probes has helped to narrow down the breakpoints. Combinations of cosmids on either side or overlapping the 8q24.11 breakpoint show split signals on one chromosome 8q arm due to insertion of the materials from the proximal region. Cosmids mapped to the TRPS deletion region have been used to hybridize to pulsed field gel genomic blots of DNA from the patient cell line and detected rearranged genomic fragments. Both in situ hybridization and genomic PFGE Southern blot will be used to precisely locate the breakpoints.« less

  4. DendroBLAST: approximate phylogenetic trees in the absence of multiple sequence alignments.

    PubMed

    Kelly, Steven; Maini, Philip K

    2013-01-01

    The rapidly growing availability of genome information has created considerable demand for both fast and accurate phylogenetic inference algorithms. We present a novel method called DendroBLAST for reconstructing phylogenetic dendrograms/trees from protein sequences using BLAST. This method differs from other methods by incorporating a simple model of sequence evolution to test the effect of introducing sequence changes on the reliability of the bipartitions in the inferred tree. Using realistic simulated sequence data we demonstrate that this method produces phylogenetic trees that are more accurate than other commonly-used distance based methods though not as accurate as maximum likelihood methods from good quality multiple sequence alignments. In addition to tests on simulated data, we use DendroBLAST to generate input trees for a supertree reconstruction of the phylogeny of the Archaea. This independent analysis produces an approximate phylogeny of the Archaea that has both high precision and recall when compared to previously published analysis of the same dataset using conventional methods. Taken together these results demonstrate that approximate phylogenetic trees can be produced in the absence of multiple sequence alignments, and we propose that these trees will provide a platform for improving and informing downstream bioinformatic analysis. A web implementation of the DendroBLAST method is freely available for use at http://www.dendroblast.com/.

  5. Translocation and gross deletion breakpoints in human inherited disease and cancer II: Potential involvement of repetitive sequence elements in secondary structure formation between DNA ends.

    PubMed

    Chuzhanova, Nadia; Abeysinghe, Shaun S; Krawczak, Michael; Cooper, David N

    2003-09-01

    Translocations and gross deletions are responsible for a significant proportion of both cancer and inherited disease. Although such gene rearrangements are nonuniformly distributed in the human genome, the underlying mutational mechanisms remain unclear. We have studied the potential involvement of various types of repetitive sequence elements in the formation of secondary structure intermediates between the single-stranded DNA ends that recombine during rearrangements. Complexity analysis was used to assess the potential of these ends to form secondary structures, the maximum decrease in complexity consequent to a gross rearrangement being used as an indicator of the type of repeat and the specific DNA ends involved. A total of 175 pairs of deletion/translocation breakpoint junction sequences available from the Gross Rearrangement Breakpoint Database [GRaBD; www.uwcm.ac.uk/uwcm/mg/grabd/grabd.html] were analyzed. Potential secondary structure was noted between the 5' flanking sequence of the first breakpoint and the 3' flanking sequence of the second breakpoint in 49% of rearrangements and between the 5' flanking sequence of the second breakpoint and the 3' flanking sequence of the first breakpoint in 36% of rearrangements. Inverted repeats, inversions of inverted repeats, and symmetric elements were found in association with gross rearrangements at approximately the same frequency. However, inverted repeats and inversions of inverted repeats accounted for the vast majority (83%) of deletions plus small insertions, symmetric elements for one-half of all antigen receptor-mediated translocations, while direct repeats appear only to be involved in mediating simple deletions. These findings extend our understanding of illegitimate recombination by highlighting the importance of secondary structure formation between single-stranded DNA ends at breakpoint junctions. Copyright 2003 Wiley-Liss, Inc.

  6. Phylogenetic diversity and biodiversity indices on phylogenetic networks.

    PubMed

    Wicke, Kristina; Fischer, Mareike

    2018-04-01

    In biodiversity conservation it is often necessary to prioritize the species to conserve. Existing approaches to prioritization, e.g. the Fair Proportion Index and the Shapley Value, are based on phylogenetic trees and rank species according to their contribution to overall phylogenetic diversity. However, in many cases evolution is not treelike and thus, phylogenetic networks have been developed as a generalization of phylogenetic trees, allowing for the representation of non-treelike evolutionary events, such as hybridization. Here, we extend the concepts of phylogenetic diversity and phylogenetic diversity indices from phylogenetic trees to phylogenetic networks. On the one hand, we consider the treelike content of a phylogenetic network, e.g. the (multi)set of phylogenetic trees displayed by a network and the so-called lowest stable ancestor tree associated with it. On the other hand, we derive the phylogenetic diversity of subsets of taxa and biodiversity indices directly from the internal structure of the network. We consider both approaches that are independent of so-called inheritance probabilities as well as approaches that explicitly incorporate these probabilities. Furthermore, we introduce our software package NetDiversity, which is implemented in Perl and allows for the calculation of all generalized measures of phylogenetic diversity and generalized phylogenetic diversity indices established in this note that are independent of inheritance probabilities. We apply our methods to a phylogenetic network representing the evolutionary relationships among swordtails and platyfishes (Xiphophorus: Poeciliidae), a group of species characterized by widespread hybridization. Copyright © 2018 Elsevier Inc. All rights reserved.

  7. Performance of Vitek 2 for Antimicrobial Susceptibility Testing of Acinetobacter baumannii, Pseudomonas aeruginosa, and Stenotrophomonas maltophilia with Vitek 2 (2009 FDA) and CLSI M100S 26th Edition Breakpoints

    PubMed Central

    Bobenchik, April M.; Deak, Eszter; Hindler, Janet A.; Charlton, Carmen L.

    2016-01-01

    ABSTRACT The performances of Vitek 2 AST-GN69 and AST-XN06 cards were compared to Clinical and Laboratory Standards Institute (CLSI) reference broth microdilution (BMD) for 99 isolates of Pseudomonas aeruginosa, 26 Acinetobacter baumannii isolates, and 11 Stenotrophomonas maltophilia isolates. In total, 15 antimicrobials were evaluated, with 11 for P. aeruginosa, 14 for A. baumannii, and 2 for S. maltophilia. Categorical agreement (CA) was assessed using both Vitek 2 breakpoints and 2016 CLSI M100S 26th edition breakpoints. The essential agreement values for P. aeruginosa, A. baumannii, and S. maltophilia were 99.5%, 99.2%, and 100%, respectively. The CA values for P. aeruginosa, A. baumannii, and S. maltophilia were 94.1%, 92.7%, and 95.5%, respectively, by the Vitek 2 breakpoints, and 93.4%, 92.3%, and 95.5%, respectively, by the CLSI breakpoints. Overall, the Vitek 2 performance was comparable to that of BMD using both Vitek 2 breakpoints and 2016 CLSI M100S 26th edition breakpoints. Improved performance was noted for the reformulated piperacillin-tazobactam and imipenem found on the AST-GN69 card, with no very major or major errors noted when using the CLSI breakpoints. PMID:27881616

  8. Evolutionary lineages of marine snails identified using molecular phylogenetics and geometric morphometric analysis of shells.

    PubMed

    Vaux, Felix; Trewick, Steven A; Crampton, James S; Marshall, Bruce A; Beu, Alan G; Hills, Simon F K; Morgan-Richards, Mary

    2018-06-15

    The relationship between morphology and inheritance is of perennial interest in evolutionary biology and palaeontology. Using three marine snail genera Penion, Antarctoneptunea and Kelletia, we investigate whether systematics based on shell morphology accurately reflect evolutionary lineages indicated by molecular phylogenetics. Members of these gastropod genera have been a taxonomic challenge due to substantial variation in shell morphology, conservative radular and soft tissue morphology, few known ecological differences, and geographical overlap between numerous species. Sampling all sixteen putative taxa identified across the three genera, we infer mitochondrial and nuclear ribosomal DNA phylogenetic relationships within the group, and compare this to variation in adult shell shape and size. Results of phylogenetic analysis indicate that each genus is monophyletic, although the status of some phylogenetically derived and likely more recently evolved taxa within Penion is uncertain. The recently described species P. lineatus is supported by genetic evidence. Morphology, captured using geometric morphometric analysis, distinguishes the genera and matches the molecular phylogeny, although using the same dataset, species and phylogenetic subclades are not identified with high accuracy. Overall, despite abundant variation, we find that shell morphology accurately reflects genus-level classification and the corresponding deep phylogenetic splits identified in this group of marine snails. Copyright © 2018 Elsevier Inc. All rights reserved.

  9. Absolute Pitch in Boreal Chickadees and Humans: Exceptions that Test a Phylogenetic Rule

    ERIC Educational Resources Information Center

    Weisman, Ronald G.; Balkwill, Laura-Lee; Hoeschele, Marisa; Moscicki, Michele K.; Bloomfield, Laurie L.; Sturdy, Christopher B.

    2010-01-01

    This research examined generality of the phylogenetic rule that birds discriminate frequency ranges more accurately than mammals. Human absolute pitch chroma possessors accurately tracked transitions between frequency ranges. Independent tests showed that they used note naming (pitch chroma) to remap the tones into ranges; neither possessors nor…

  10. On the quirks of maximum parsimony and likelihood on phylogenetic networks.

    PubMed

    Bryant, Christopher; Fischer, Mareike; Linz, Simone; Semple, Charles

    2017-03-21

    Maximum parsimony is one of the most frequently-discussed tree reconstruction methods in phylogenetic estimation. However, in recent years it has become more and more apparent that phylogenetic trees are often not sufficient to describe evolution accurately. For instance, processes like hybridization or lateral gene transfer that are commonplace in many groups of organisms and result in mosaic patterns of relationships cannot be represented by a single phylogenetic tree. This is why phylogenetic networks, which can display such events, are becoming of more and more interest in phylogenetic research. It is therefore necessary to extend concepts like maximum parsimony from phylogenetic trees to networks. Several suggestions for possible extensions can be found in recent literature, for instance the softwired and the hardwired parsimony concepts. In this paper, we analyze the so-called big parsimony problem under these two concepts, i.e. we investigate maximum parsimonious networks and analyze their properties. In particular, we show that finding a softwired maximum parsimony network is possible in polynomial time. We also show that the set of maximum parsimony networks for the hardwired definition always contains at least one phylogenetic tree. Lastly, we investigate some parallels of parsimony to different likelihood concepts on phylogenetic networks. Copyright © 2017 Elsevier Ltd. All rights reserved.

  11. Unrealistic phylogenetic trees may improve phylogenetic footprinting.

    PubMed

    Nettling, Martin; Treutler, Hendrik; Cerquides, Jesus; Grosse, Ivo

    2017-06-01

    The computational investigation of DNA binding motifs from binding sites is one of the classic tasks in bioinformatics and a prerequisite for understanding gene regulation as a whole. Due to the development of sequencing technologies and the increasing number of available genomes, approaches based on phylogenetic footprinting become increasingly attractive. Phylogenetic footprinting requires phylogenetic trees with attached substitution probabilities for quantifying the evolution of binding sites, but these trees and substitution probabilities are typically not known and cannot be estimated easily. Here, we investigate the influence of phylogenetic trees with different substitution probabilities on the classification performance of phylogenetic footprinting using synthetic and real data. For synthetic data we find that the classification performance is highest when the substitution probability used for phylogenetic footprinting is similar to that used for data generation. For real data, however, we typically find that the classification performance of phylogenetic footprinting surprisingly increases with increasing substitution probabilities and is often highest for unrealistically high substitution probabilities close to one. This finding suggests that choosing realistic model assumptions might not always yield optimal predictions in general and that choosing unrealistically high substitution probabilities close to one might actually improve the classification performance of phylogenetic footprinting. The proposed PF is implemented in JAVA and can be downloaded from https://github.com/mgledi/PhyFoo. : martin.nettling@informatik.uni-halle.de. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.

  12. Phylogenetic Quantification of Intra-tumour Heterogeneity

    PubMed Central

    Schwarz, Roland F.; Trinh, Anne; Sipos, Botond; Brenton, James D.; Goldman, Nick; Markowetz, Florian

    2014-01-01

    Intra-tumour genetic heterogeneity is the result of ongoing evolutionary change within each cancer. The expansion of genetically distinct sub-clonal populations may explain the emergence of drug resistance, and if so, would have prognostic and predictive utility. However, methods for objectively quantifying tumour heterogeneity have been missing and are particularly difficult to establish in cancers where predominant copy number variation prevents accurate phylogenetic reconstruction owing to horizontal dependencies caused by long and cascading genomic rearrangements. To address these challenges, we present MEDICC, a method for phylogenetic reconstruction and heterogeneity quantification based on a Minimum Event Distance for Intra-tumour Copy-number Comparisons. Using a transducer-based pairwise comparison function, we determine optimal phasing of major and minor alleles, as well as evolutionary distances between samples, and are able to reconstruct ancestral genomes. Rigorous simulations and an extensive clinical study show the power of our method, which outperforms state-of-the-art competitors in reconstruction accuracy, and additionally allows unbiased numerical quantification of tumour heterogeneity. Accurate quantification and evolutionary inference are essential to understand the functional consequences of tumour heterogeneity. The MEDICC algorithms are independent of the experimental techniques used and are applicable to both next-generation sequencing and array CGH data. PMID:24743184

  13. [Comparison of microdilution and disk diffusion methods for the detection of fluconazole and voriconazole susceptibility against clinical Candida glabrata isolates and determination of changing susceptibility with new CLSI breakpoints].

    PubMed

    Hazırolan, Gülşen; Sarıbaş, Zeynep; Arıkan Akdağlı, Sevtap

    2016-07-01

    Candida albicans is the most frequently isolated species as the causative agent of Candida infections. However, in recent years, the isolation rate of non-albicans Candida species have increased. In many centers, Candida glabrata is one of the commonly isolated non-albicans species of C.glabrata infections which are difficult-to-treat due to decreased susceptibility to fluconazole and cross-resistance to other azoles. The aims of this study were to determine the in vitro susceptibility profiles of clinical C.glabrata isolates against fluconazole and voriconazole by microdilution and disk diffusion methods and to evaluate the results with both the previous (CLSI) and current species-specific CLSI (Clinical and Laboratory Standards Institute) clinical breakpoints. A total of 70 C.glabrata strains isolated from clinical samples were included in the study. The identification of the isolates was performed by morphologic examination on cornmeal Tween 80 agar and assimilation profiles obtained by using ID32C (BioMérieux, France). Broth microdilution and disk diffusion methods were performed according to CLSI M27-A3 and CLSI M44-A2 documents, respectively. The results were evaluated according to CLSI M27-A3 and M44-A2 documents and new vs. species-specific CLSI breakpoints. By using both previous and new CLSI breakpoints, broth microdilution test results showed that voriconazole has greater in vitro activity than fluconazole against C.glabrata isolates. For the two drugs tested, very major error was not observed with disk diffusion method when microdilution method was considered as the reference method. Since "susceptible" category no more exists for fluconazole vs. C.glabrata, the isolates that were interpreted as susceptible by previous breakpoints were evaluated as susceptible-dose dependent by current CLSI breakpoints. Since species-specific breakpoints remain yet undetermined for voriconazole, comparative analysis was not possible for this agent. The results obtained

  14. Cloning of the anhidrotic ectodermal dysplasia gene: Identification of cDNAs associated with CpG islands mapped near translocation breakpoint in two female patients

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Srivastava, A.K.; Schlessinger, D.; Kere, J.

    1994-09-01

    The gene for the X chromosomal developmental disorder anhidrotic ectodermal dysplasia (EDA) has been mapped to Xq12-q13 by linkage analysis and is expressed in a few females with chromosomal translocations involving band Xq12-q13. A yeast artificial chromosome (YAC) contig (2.0 Mb) spanning two translocation breakpoints has been assembled by sequence-tagged site (STS)-based chromosomal walking. The two translocation breakpoints (X:autosome translocations from the affected female patients) have been mapped less than 60 kb apart within a YAC contig. Unique probes and intragenic STSs (mapped between the two translocations) have been developed and a somatic cell hybrid carrying the translocated X chromosomemore » from the AK patient has been analyzed by isolating unique probes that span the breakpoint. Several STSs made from intragenic sequences have been found to be conserved in mouse, hamster and monkey, but we have detected no mRNAs in a number of tissues tested. However, a probe and STS developed from the DNA spanning the AK breakpoint is conserved in mouse, hamster and monkey, and we have detected expressed sequences in skin cells and cDNA libraries. In addition, unique sequences have been obtained from two CpG islands in the region that maps proximal to the breakpoints. cDNAs containing these sequences are being studied as candidates for the gene affected in the etiology of EDA.« less

  15. On Computing Breakpoint Distances for Genomes with Duplicate Genes.

    PubMed

    Shao, Mingfu; Moret, Bernard M E

    2017-06-01

    A fundamental problem in comparative genomics is to compute the distance between two genomes in terms of its higher level organization (given by genes or syntenic blocks). For two genomes without duplicate genes, we can easily define (and almost always efficiently compute) a variety of distance measures, but the problem is NP-hard under most models when genomes contain duplicate genes. To tackle duplicate genes, three formulations (exemplar, maximum matching, and any matching) have been proposed, all of which aim to build a matching between homologous genes so as to minimize some distance measure. Of the many distance measures, the breakpoint distance (the number of nonconserved adjacencies) was the first one to be studied and remains of significant interest because of its simplicity and model-free property. The three breakpoint distance problems corresponding to the three formulations have been widely studied. Although we provided last year a solution for the exemplar problem that runs very fast on full genomes, computing optimal solutions for the other two problems has remained challenging. In this article, we describe very fast, exact algorithms for these two problems. Our algorithms rely on a compact integer-linear program that we further simplify by developing an algorithm to remove variables, based on new results on the structure of adjacencies and matchings. Through extensive experiments using both simulations and biological data sets, we show that our algorithms run very fast (in seconds) on mammalian genomes and scale well beyond. We also apply these algorithms (as well as the classic orthology tool MSOAR) to create orthology assignment, then compare their quality in terms of both accuracy and coverage. We find that our algorithm for the "any matching" formulation significantly outperforms other methods in terms of accuracy while achieving nearly maximum coverage.

  16. Gene deregulation and spatial genome reorganization near breakpoints prior to formation of translocations in anaplastic large cell lymphoma.

    PubMed

    Mathas, Stephan; Kreher, Stephan; Meaburn, Karen J; Jöhrens, Korinna; Lamprecht, Björn; Assaf, Chalid; Sterry, Wolfram; Kadin, Marshall E; Daibata, Masanori; Joos, Stefan; Hummel, Michael; Stein, Harald; Janz, Martin; Anagnostopoulos, Ioannis; Schrock, Evelin; Misteli, Tom; Dörken, Bernd

    2009-04-07

    Although the identification and characterization of translocations have rapidly increased, little is known about the mechanisms of how translocations occur in vivo. We used anaplastic large cell lymphoma (ALCL) with and without the characteristic t(2;5)(p23;q35) translocation to study the mechanisms of formation of translocations and of ALCL transformation. We report deregulation of several genes located near the ALCL translocation breakpoint, regardless of whether the tumor contains the t(2;5). The affected genes include the oncogenic transcription factor Fra2 (located on 2p23), the HLH protein Id2 (2p25), and the oncogenic tyrosine kinase CSF1-receptor (5q33.1). Their up-regulation promotes cell survival and repression of T cell-specific gene expression programs that are characteristic for ALCL. The deregulated genes are in spatial proximity within the nuclear space of t(2;5)-negative ALCL cells, facilitating their translocation on induction of double-strand breaks. These data suggest that deregulation of breakpoint-proximal genes occurs before the formation of translocations, and that aberrant transcriptional activity of genomic regions is linked to their propensity to undergo chromosomal translocations. Also, our data demonstrate that deregulation of breakpoint-proximal genes has a key role in ALCL.

  17. Gene deregulation and spatial genome reorganization near breakpoints prior to formation of translocations in anaplastic large cell lymphoma

    PubMed Central

    Mathas, Stephan; Kreher, Stephan; Meaburn, Karen J.; Jöhrens, Korinna; Lamprecht, Björn; Assaf, Chalid; Sterry, Wolfram; Kadin, Marshall E.; Daibata, Masanori; Joos, Stefan; Hummel, Michael; Stein, Harald; Janz, Martin; Anagnostopoulos, Ioannis; Schrock, Evelin; Misteli, Tom; Dörken, Bernd

    2009-01-01

    Although the identification and characterization of translocations have rapidly increased, little is known about the mechanisms of how translocations occur in vivo. We used anaplastic large cell lymphoma (ALCL) with and without the characteristic t(2;5)(p23;q35) translocation to study the mechanisms of formation of translocations and of ALCL transformation. We report deregulation of several genes located near the ALCL translocation breakpoint, regardless of whether the tumor contains the t(2;5). The affected genes include the oncogenic transcription factor Fra2 (located on 2p23), the HLH protein Id2 (2p25), and the oncogenic tyrosine kinase CSF1-receptor (5q33.1). Their up-regulation promotes cell survival and repression of T cell-specific gene expression programs that are characteristic for ALCL. The deregulated genes are in spatial proximity within the nuclear space of t(2;5)-negative ALCL cells, facilitating their translocation on induction of double-strand breaks. These data suggest that deregulation of breakpoint-proximal genes occurs before the formation of translocations, and that aberrant transcriptional activity of genomic regions is linked to their propensity to undergo chromosomal translocations. Also, our data demonstrate that deregulation of breakpoint-proximal genes has a key role in ALCL. PMID:19321746

  18. Cryptic breakpoint identified by whole-genome mate-pair sequencing in a rare paternally inherited complex chromosomal rearrangement.

    PubMed

    Aristidou, Constantia; Theodosiou, Athina; Ketoni, Andria; Bak, Mads; Mehrjouy, Mana M; Tommerup, Niels; Sismani, Carolina

    2018-01-01

    Precise characterization of apparently balanced complex chromosomal rearrangements in non-affected individuals is crucial as they may result in reproductive failure, recurrent miscarriages or affected offspring. We present a family, where the non-affected father and daughter were found, using FISH and karyotyping, to be carriers of a three-way complex chromosomal rearrangement [t(6;7;10)(q16.2;q34;q26.1), de novo in the father]. The family suffered from two stillbirths, one miscarriage, and has a son with severe intellectual disability. In the present study, the family was revisited using whole-genome mate-pair sequencing. Interestingly, whole-genome mate-pair sequencing revealed a cryptic breakpoint on derivative (der) chromosome 6 rendering the rearrangement even more complex. FISH using a chromosome (chr) 6 custom-designed probe and a chr10 control probe confirmed that the interstitial chr6 segment, created by the two chr6 breakpoints, was translocated onto der(10). Breakpoints were successfully validated with Sanger sequencing, and small imbalances as well as microhomology were identified. Finally, the complex chromosomal rearrangement breakpoints disrupted the SIM1 , GRIK2 , CNTNAP2 , and PTPRE genes without causing any phenotype development. In contrast to the majority of maternally transmitted complex chromosomal rearrangement cases, our study investigated a rare case where a complex chromosomal rearrangement, which most probably resulted from a Type IV hexavalent during the pachytene stage of meiosis I, was stably transmitted from a fertile father to his non-affected daughter. Whole-genome mate-pair sequencing proved highly successful in identifying cryptic complexity, which consequently provided further insight into the meiotic segregation of chromosomes and the increased reproductive risk in individuals carrying the specific complex chromosomal rearrangement. We propose that such complex rearrangements should be characterized in detail using a combination

  19. Impact of cleaning and other interventions on the reduction of hospital-acquired Clostridium difficile infections in two hospitals in England assessed using a breakpoint model.

    PubMed

    Hughes, G J; Nickerson, E; Enoch, D A; Ahluwalia, J; Wilkinson, C; Ayers, R; Brown, N M

    2013-07-01

    Clostridium difficile infection remains a major challenge for hospitals. Although targeted infection control initiatives have been shown to be effective in reducing the incidence of hospital-acquired C. difficile infection, there is little evidence available to assess the effectiveness of specific interventions. To use statistical modelling to detect substantial reductions in the incidence of C. difficile from time series data from two hospitals in England, and relate these time points to infection control interventions. A statistical breakpoints model was fitted to likely hospital-acquired C. difficile infection incidence data from a teaching hospital (2002-2009) and a district general hospital (2005-2009) in England. Models with increasing complexity (i.e. increasing the number of breakpoints) were tested for an improved fit to the data. Partitions estimated from breakpoint models were tested for individual stability using statistical process control charts. Major infection control interventions from both hospitals during this time were grouped according to their primary target (antibiotics, cleaning, isolation, other) and mapped to the model-suggested breakpoints. For both hospitals, breakpoints coincided with enhancements to cleaning protocols. Statistical models enabled formal assessment of the impact of different interventions, and showed that enhancements to deep cleaning programmes are the interventions that have most likely led to substantial reductions in hospital-acquired C. difficile infections at the two hospitals studied. Copyright © 2013 The Healthcare Infection Society. Published by Elsevier Ltd. All rights reserved.

  20. Visualizing phylogenetic tree landscapes.

    PubMed

    Wilgenbusch, James C; Huang, Wen; Gallivan, Kyle A

    2017-02-02

    Genomic-scale sequence alignments are increasingly used to infer phylogenies in order to better understand the processes and patterns of evolution. Different partitions within these new alignments (e.g., genes, codon positions, and structural features) often favor hundreds if not thousands of competing phylogenies. Summarizing and comparing phylogenies obtained from multi-source data sets using current consensus tree methods discards valuable information and can disguise potential methodological problems. Discovery of efficient and accurate dimensionality reduction methods used to display at once in 2- or 3- dimensions the relationship among these competing phylogenies will help practitioners diagnose the limits of current evolutionary models and potential problems with phylogenetic reconstruction methods when analyzing large multi-source data sets. We introduce several dimensionality reduction methods to visualize in 2- and 3-dimensions the relationship among competing phylogenies obtained from gene partitions found in three mid- to large-size mitochondrial genome alignments. We test the performance of these dimensionality reduction methods by applying several goodness-of-fit measures. The intrinsic dimensionality of each data set is also estimated to determine whether projections in 2- and 3-dimensions can be expected to reveal meaningful relationships among trees from different data partitions. Several new approaches to aid in the comparison of different phylogenetic landscapes are presented. Curvilinear Components Analysis (CCA) and a stochastic gradient decent (SGD) optimization method give the best representation of the original tree-to-tree distance matrix for each of the three- mitochondrial genome alignments and greatly outperformed the method currently used to visualize tree landscapes. The CCA + SGD method converged at least as fast as previously applied methods for visualizing tree landscapes. We demonstrate for all three mtDNA alignments that 3D

  1. Spatial phylogenetics of the vascular flora of Chile.

    PubMed

    Scherson, Rosa A; Thornhill, Andrew H; Urbina-Casanova, Rafael; Freyman, William A; Pliscoff, Patricio A; Mishler, Brent D

    2017-07-01

    Current geographic patterns of biodiversity are a consequence of the evolutionary history of the lineages that comprise them. This study was aimed at exploring how evolutionary features of the vascular flora of Chile are distributed across the landscape. Using a phylogeny at the genus level for 87% of the Chilean vascular flora, and a geographic database of sample localities, we calculated phylogenetic diversity (PD), phylogenetic endemism (PE), relative PD (RPD), and relative PE (RPE). Categorical Analyses of Neo- and Paleo-Endemism (CANAPE) were also performed, using a spatial randomization to assess statistical significance. A cluster analysis using range-weighted phylogenetic turnover was used to compare among grid cells, and with known Chilean bioclimates. PD patterns were concordant with known centers of high taxon richness and the Chilean biodiversity hotspot. In addition, several other interesting areas of concentration of evolutionary history were revealed as potential conservation targets. The south of the country shows areas of significantly high RPD and a concentration of paleo-endemism, and the north shows areas of significantly low PD and RPD, and a concentration of neo-endemism. Range-weighted phylogenetic turnover shows high congruence with the main macrobioclimates of Chile. Even though the study was done at the genus level, the outcome provides an accurate outline of phylogenetic patterns that can be filled in as more fine-scaled information becomes available. Copyright © 2017 Elsevier Inc. All rights reserved.

  2. Incompletely resolved phylogenetic trees inflate estimates of phylogenetic conservatism.

    PubMed

    Davies, T Jonathan; Kraft, Nathan J B; Salamin, Nicolas; Wolkovich, Elizabeth M

    2012-02-01

    The tendency for more closely related species to share similar traits and ecological strategies can be explained by their longer shared evolutionary histories and represents phylogenetic conservatism. How strongly species traits co-vary with phylogeny can significantly impact how we analyze cross-species data and can influence our interpretation of assembly rules in the rapidly expanding field of community phylogenetics. Phylogenetic conservatism is typically quantified by analyzing the distribution of species values on the phylogenetic tree that connects them. Many phylogenetic approaches, however, assume a completely sampled phylogeny: while we have good estimates of deeper phylogenetic relationships for many species-rich groups, such as birds and flowering plants, we often lack information on more recent interspecific relationships (i.e., within a genus). A common solution has been to represent these relationships as polytomies on trees using taxonomy as a guide. Here we show that such trees can dramatically inflate estimates of phylogenetic conservatism quantified using S. P. Blomberg et al.'s K statistic. Using simulations, we show that even randomly generated traits can appear to be phylogenetically conserved on poorly resolved trees. We provide a simple rarefaction-based solution that can reliably retrieve unbiased estimates of K, and we illustrate our method using data on first flowering times from Thoreau's woods (Concord, Massachusetts, USA).

  3. Accurate and exact CNV identification from targeted high-throughput sequence data.

    PubMed

    Nord, Alex S; Lee, Ming; King, Mary-Claire; Walsh, Tom

    2011-04-12

    Massively parallel sequencing of barcoded DNA samples significantly increases screening efficiency for clinically important genes. Short read aligners are well suited to single nucleotide and indel detection. However, methods for CNV detection from targeted enrichment are lacking. We present a method combining coverage with map information for the identification of deletions and duplications in targeted sequence data. Sequencing data is first scanned for gains and losses using a comparison of normalized coverage data between samples. CNV calls are confirmed by testing for a signature of sequences that span the CNV breakpoint. With our method, CNVs can be identified regardless of whether breakpoints are within regions targeted for sequencing. For CNVs where at least one breakpoint is within targeted sequence, exact CNV breakpoints can be identified. In a test data set of 96 subjects sequenced across ~1 Mb genomic sequence using multiplexing technology, our method detected mutations as small as 31 bp, predicted quantitative copy count, and had a low false-positive rate. Application of this method allows for identification of gains and losses in targeted sequence data, providing comprehensive mutation screening when combined with a short read aligner.

  4. Heterogeneous breakpoints in patients with acute lymphoblastic leukemia and the dic(9;20)(p11~13;q11) show recurrent involvement of genes at 20q11.21

    PubMed Central

    An, Qian; Wright, Sarah L.; Moorman, Anthony V.; Parker, Helen; Griffiths, Mike; Ross, Fiona M.; Davies, Teresa; Harrison, Christine J.; Strefford, Jon C.

    2009-01-01

    The dic(9;20)(p11~13;q11) is a recurrent chromosomal abnormality in patients with acute lymphoblastic leukemia. Although it results in loss of material from 9p and 20q, the molecular targets on both chromosomes have not been fully elucidated. From an initial cohort of 58 with acute lymphoblastic leukemia patients with this translocation, breakpoint mapping with fluorescence in situ hybridization on 26 of them revealed breakpoint heterogeneity of both chromosomes. PAX5 has been proposed to be the target gene on 9p, while for 20q, FISH analysis implicated the involvement of the ASXL1 gene, either by a breakpoint within (n=4) or centromeric (deletion, n=12) of the gene. Molecular copy-number counting, long-distance inverse PCR and direct sequence analysis identified six dic(9;20) breakpoint sequences. In addition to the three previously reported: PAX5-ASXL1, PAX5-C20ORF112 and PAX5-KIF3B; we identified three new ones in this study: sequences 3’ of PAX5 disrupting ASXL1, and ZCCHC7 disrupted by sequences 3’ of FRG1B and LOC1499503. This study provides insight into the breakpoint complexity underlying dicentric chromosomal formation in acute lymphoblastic leukemia and highlights putative target gene loci. PMID:19586940

  5. Heterogeneous breakpoints in patients with acute lymphoblastic leukemia and the dic(9;20)(p11-13;q11) show recurrent involvement of genes at 20q11.21.

    PubMed

    An, Qian; Wright, Sarah L; Moorman, Anthony V; Parker, Helen; Griffiths, Mike; Ross, Fiona M; Davies, Teresa; Harrison, Christine J; Strefford, Jon C

    2009-08-01

    The dic(9;20)(p11-13;q11) is a recurrent chromosomal abnormality in patients with acute lymphoblastic leukemia. Although it results in loss of material from 9p and 20q, the molecular targets on both chromosomes have not been fully elucidated. From an initial cohort of 58 with acute lymphoblastic leukemia patients with this translocation, breakpoint mapping with fluorescence in situ hybridization on 26 of them revealed breakpoint heterogeneity of both chromosomes. PAX5 has been proposed to be the target gene on 9p, while for 20q, FISH analysis implicated the involvement of the ASXL1 gene, either by a breakpoint within (n=4) or centromeric (deletion, n=12) of the gene. Molecular copy-number counting, long-distance inverse PCR and direct sequence analysis identified six dic(9;20) breakpoint sequences. In addition to the three previously reported: PAX5-ASXL1, PAX5-C20ORF112 and PAX5-KIF3B; we identified three new ones in this study: sequences 3' of PAX5 disrupting ASXL1, and ZCCHC7 disrupted by sequences 3' of FRG1B and LOC1499503. This study provides insight into the breakpoint complexity underlying dicentric chromosomal formation in acute lymphoblastic leukemia and highlights putative target gene loci.

  6. Modelling the association of dengue fever cases with temperature and relative humidity in Jeddah, Saudi Arabia-A generalised linear model with break-point analysis.

    PubMed

    Alkhaldy, Ibrahim

    2017-04-01

    The aim of this study was to examine the role of environmental factors in the temporal distribution of dengue fever in Jeddah, Saudi Arabia. The relationship between dengue fever cases and climatic factors such as relative humidity and temperature was investigated during 2006-2009 to determine whether there is any relationship between dengue fever cases and climatic parameters in Jeddah City, Saudi Arabia. A generalised linear model (GLM) with a break-point was used to determine how different levels of temperature and relative humidity affected the distribution of the number of cases of dengue fever. Break-point analysis was performed to modelled the effect before and after a break-point (change point) in the explanatory parameters under various scenarios. Akaike information criterion (AIC) and cross validation (CV) were used to assess the performance of the models. The results showed that maximum temperature and mean relative humidity are most probably the better predictors of the number of dengue fever cases in Jeddah. In this study three scenarios were modelled: no time lag, 1-week lag and 2-weeks lag. Among these scenarios, the 1-week lag model using mean relative humidity as an explanatory variable showed better performance. This study showed a clear relationship between the meteorological variables and the number of dengue fever cases in Jeddah. The results also demonstrated that meteorological variables can be successfully used to estimate the number of dengue fever cases for a given period of time. Break-point analysis provides further insight into the association between meteorological parameters and dengue fever cases by dividing the meteorological parameters into certain break-points. Copyright © 2016 Elsevier B.V. All rights reserved.

  7. MDS/AML del(11)(q14) Share Common Morphological Features Despite Different Chromosomal Breakpoints.

    PubMed

    Dambruoso, Irene; Invernizzi, Rosangela; Boni, Marina; Zappatore, Rita; Giardini, Ilaria; Cavigliano, Maria Paola; Rocca, Barbara; Calvello, Celeste; Bastia, Raffaella; Caresana, Marilena; Pasi, Francesca; Nano, Rosanna; Bernasconi, Paolo

    2017-02-01

    In myelodysplatic syndromes and acute myeloid leukemia (MDS/AML) deletion of the 11q14 region is a rare chromosomal defect (incidence: 0.6-1.0%), included within the intermediate risk criteria by the International Prognostic Scoring System. No fluorescence in situ hybridization (FISH) study has yet been performed to identify a common breakpoint region (CBR). In our study through FISH with bacterial artificial chromosomes and commercial probes, we analyzed seven patients with MDS/AML harboring 11q14 deletion on conventional cytogenetic analysis. FISH revealed deletions in five patients and amplifications in two. Three patients with deletion carried a CBR, two had a deletion involving a more centromeric breakpoint. These five patients exhibited multilineage dysplasia, blast cells with large round nuclei, loose chromatin, small and abundant nucleoli, and vacuolated cytoplasm with very thin Auer bodies. In conclusion, the morphological features which occur independently of the extent of the deletion are of multilineage dysplasia in MDS and leukemic blasts strongly reactive to peroxidase in AML; despite the variable size of the deleted area, some patients harbor a CBR. Copyright© 2017, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved.

  8. Classification of HCV and HIV-1 Sequences with the Branching Index

    PubMed Central

    Hraber, Peter; Kuiken, Carla; Waugh, Mark; Geer, Shaun; Bruno, William J.; Leitner, Thomas

    2009-01-01

    SUMMARY Classification of viral sequences should be fast, objective, accurate, and reproducible. Most methods that classify sequences use either pairwise distances or phylogenetic relations, but cannot discern when a sequence is unclassifiable. The branching index (BI) combines distance and phylogeny methods to compute a ratio that quantifies how closely a query sequence clusters with a subtype clade. In the hypothesis-testing framework of statistical inference, the BI is compared with a threshold to test whether sufficient evidence exists for the query sequence to be classified among known sequences. If above the threshold, the null hypothesis of no support for the subtype relation is rejected and the sequence is taken as belonging to the subtype clade with which it clusters on the tree. This study evaluates statistical properties of the branching index for subtype classification in HCV and HIV-1. Pairs of BI values with known positive and negative test results were computed from 10,000 random fragments of reference alignments. Sampled fragments were of sufficient length to contain phylogenetic signal that groups reference sequences together properly into subtype clades. For HCV, a threshold BI of 0.71 yields 95.1% agreement with reference subtypes, with equal false positive and false negative rates. For HIV-1, a threshold of 0.66 yields 93.5% agreement. Higher thresholds can be used where lower false positive rates are required. In synthetic recombinants, regions without breakpoints are recognized accurately; regions with breakpoints do not uniquely represent any known subtype. Web-based services for viral subtype classification with the branching index are available online. PMID:18753218

  9. Ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses.

    PubMed

    Fouquier, Jennifer; Rideout, Jai Ram; Bolyen, Evan; Chase, John; Shiffer, Arron; McDonald, Daniel; Knight, Rob; Caporaso, J Gregory; Kelley, Scott T

    2016-02-24

    Fungi play critical roles in many ecosystems, cause serious diseases in plants and animals, and pose significant threats to human health and structural integrity problems in built environments. While most fungal diversity remains unknown, the development of PCR primers for the internal transcribed spacer (ITS) combined with next-generation sequencing has substantially improved our ability to profile fungal microbial diversity. Although the high sequence variability in the ITS region facilitates more accurate species identification, it also makes multiple sequence alignment and phylogenetic analysis unreliable across evolutionarily distant fungi because the sequences are hard to align accurately. To address this issue, we created ghost-tree, a bioinformatics tool that integrates sequence data from two genetic markers into a single phylogenetic tree that can be used for diversity analyses. Our approach starts with a "foundation" phylogeny based on one genetic marker whose sequences can be aligned across organisms spanning divergent taxonomic groups (e.g., fungal families). Then, "extension" phylogenies are built for more closely related organisms (e.g., fungal species or strains) using a second more rapidly evolving genetic marker. These smaller phylogenies are then grafted onto the foundation tree by mapping taxonomic names such that each corresponding foundation-tree tip would branch into its new "extension tree" child. We applied ghost-tree to graft fungal extension phylogenies derived from ITS sequences onto a foundation phylogeny derived from fungal 18S sequences. Our analysis of simulated and real fungal ITS data sets found that phylogenetic distances between fungal communities computed using ghost-tree phylogenies explained significantly more variance than non-phylogenetic distances. The phylogenetic metrics also improved our ability to distinguish small differences (effect sizes) between microbial communities, though results were similar to non-phylogenetic

  10. Conformation of phylogenetic relationship of Penaeidae shrimp based on morphometric and molecular investigations.

    PubMed

    Rajakumaran, P; Vaseeharan, B; Jayakumar, R; Chidambara, R

    2014-01-01

    Understanding of accurate phylogenetic relationship among Penaeidae shrimp is important for academic and fisheries industry. The Morphometric and Randomly amplified polymorphic DNA (RAPD) analysis was used to make the phylogenetic relationsip among 13 Penaeidae shrimp. For morphometric analysis forty variables and total lengths of shrimp were measured for each species, and removed the effect of size variation. The size normalized values obtained was subjected to UPGMA (Unweighted Pair-Group Method with Arithmetic Mean) cluster analysis. For RAPD analysis, the four primers showed reliable differentiation between species, and used correlation coefficient between the DNA banding patterns of 13 Penaeidae species to construct UPGMA dendrogram. Phylogenetic relationship from morphometric and molecular analysis for Penaeidae species found to be congruent. We concluded that as the results from morphometry investigations concur with molecular one, phylogenetic relationship obtained for the studied Penaeidae are considered to be reliable.

  11. Breakpoint Features of Genomic Rearrangements in Neuroblastoma with Unbalanced Translocations and Chromothripsis

    PubMed Central

    Daveau, Romain; Combaret, Valérie; Pierre-Eugène, Cécile; Cazes, Alex; Louis-Brennetot, Caroline; Schleiermacher, Gudrun; Ferrand, Sandrine; Pierron, Gaëlle; Lermine, Alban; Frio, Thomas Rio; Raynal, Virginie; Vassal, Gilles; Barillot, Emmanuel; Delattre, Olivier; Janoueix-Lerosey, Isabelle

    2013-01-01

    Neuroblastoma is a pediatric cancer of the peripheral nervous system in which structural chromosome aberrations are emblematic of aggressive tumors. In this study, we performed an in-depth analysis of somatic rearrangements in two neuroblastoma cell lines and two primary tumors using paired-end sequencing of mate-pair libraries and RNA-seq. The cell lines presented with typical genetic alterations of neuroblastoma and the two tumors belong to the group of neuroblastoma exhibiting a profile of chromothripsis. Inter and intra-chromosomal rearrangements were identified in the four samples, allowing in particular characterization of unbalanced translocations at high resolution. Using complementary experiments, we further characterized 51 rearrangements at the base pair resolution that revealed 59 DNA junctions. In a subset of cases, complex rearrangements were observed with templated insertion of fragments of nearby sequences. Although we did not identify known particular motifs in the local environment of the breakpoints, we documented frequent microhomologies at the junctions in both chromothripsis and non-chromothripsis associated breakpoints. RNA-seq experiments confirmed expression of several predicted chimeric genes and genes with disrupted exon structure including ALK, NBAS, FHIT, PTPRD and ODZ4. Our study therefore indicates that both non-homologous end joining-mediated repair and replicative processes may account for genomic rearrangements in neuroblastoma. RNA-seq analysis allows the identification of the subset of abnormal transcripts expressed from genomic rearrangements that may be involved in neuroblastoma oncogenesis. PMID:23991058

  12. Testing chromosomal phylogenies and inversion breakpoint reuse in Drosophila. The martensis cluster revisited.

    PubMed

    Prada, Carlos F; Delprat, Alejandra; Ruiz, Alfredo

    2011-02-01

    The chromosomal relationships of the four martensis cluster species are among the most complex and intricate within the entire Drosophila repleta group, due to the so-called sharing of inversions. Here, we have revised these relationships using comparative mapping of bacterial artificial chromosome (BAC) clones on the salivary gland chromosomes. A physical map of chromosome 2 of Drosophila uniseta (one of the cluster members) was generated by in situ hybridization of 82 BAC clones from the physical map of the Drosophila buzzatii genome (an outgroup that represents the ancestral arrangement). By comparing the marker positions, we determined the number, order, and orientation of conserved chromosomal segments between chromosome 2 of D. buzzatii and D. uniseta. GRIMM software was used to infer that a minimum of five chromosomal inversions are necessary to transform the chromosome 2 of D. buzzatii into that of D. uniseta. Two of these inversions have been overlooked in previous cytological analyses. The five fixed inversions entail two breakpoint reuses because only nine syntenic segments and eight interruptions were observed. We tested for the presence of the five inversions fixed in D. uniseta in the other three species of the martensis cluster by in situ hybridization of eight breakpoint-bearing BAC clones. The results shed light on the chromosomal phylogeny of the martensis cluster, yet leave a number of questions open.

  13. En Route towards European Clinical Breakpoints for Veterinary Antimicrobial Susceptibility Testing: A Position Paper Explaining the VetCAST Approach.

    PubMed

    Toutain, Pierre-Louis; Bousquet-Mélou, Alain; Damborg, Peter; Ferran, Aude A; Mevius, Dik; Pelligand, Ludovic; Veldman, Kees T; Lees, Peter

    2017-01-01

    VetCAST is the EUCAST sub-committee for Veterinary Antimicrobial Susceptibility Testing. Its remit is to define clinical breakpoints (CBPs) for antimicrobial drugs (AMDs) used in veterinary medicine in Europe. This position paper outlines the procedures and reviews scientific options to solve challenges for the determination of specific CBPs for animal species, drug substances and disease conditions. VetCAST will adopt EUCAST approaches: the initial step will be data assessment; then procedures for decisions on the CBP; and finally the release of recommendations for CBP implementation. The principal challenges anticipated by VetCAST are those associated with the differing modalities of AMD administration, including mass medication, specific long-acting product formulations or local administration. Specific challenges comprise mastitis treatment in dairy cattle, the range of species and within species breed considerations and several other variable factors not relevant to human medicine. Each CBP will be based on consideration of: (i) an epidemiological cut-off value (ECOFF) - the highest MIC that defines the upper end of the wild-type MIC distribution; (ii) a PK/PD breakpoint obtained from pre-clinical pharmacokinetic data [this PK/PD break-point is the highest possible MIC for which a given percentage of animals in the target population achieves a critical value for the selected PK/PD index ( f AUC/MIC or f T > MIC)] and (iii) when possible, a clinical cut-off, that is the relationship between MIC and clinical cure. For the latter, VetCAST acknowledges the paucity of such data in veterinary medicine. When a CBP cannot be established, VetCAST will recommend use of ECOFF as surrogate. For decision steps, VetCAST will follow EUCAST procedures involving transparency, consensus and independence. VetCAST will ensure freely available dissemination of information, concerning standards, guidelines, ECOFF, PK/PD breakpoints, CBPs and other relevant information for AST

  14. En Route towards European Clinical Breakpoints for Veterinary Antimicrobial Susceptibility Testing: A Position Paper Explaining the VetCAST Approach

    PubMed Central

    Toutain, Pierre-Louis; Bousquet-Mélou, Alain; Damborg, Peter; Ferran, Aude A.; Mevius, Dik; Pelligand, Ludovic; Veldman, Kees T.; Lees, Peter

    2017-01-01

    VetCAST is the EUCAST sub-committee for Veterinary Antimicrobial Susceptibility Testing. Its remit is to define clinical breakpoints (CBPs) for antimicrobial drugs (AMDs) used in veterinary medicine in Europe. This position paper outlines the procedures and reviews scientific options to solve challenges for the determination of specific CBPs for animal species, drug substances and disease conditions. VetCAST will adopt EUCAST approaches: the initial step will be data assessment; then procedures for decisions on the CBP; and finally the release of recommendations for CBP implementation. The principal challenges anticipated by VetCAST are those associated with the differing modalities of AMD administration, including mass medication, specific long-acting product formulations or local administration. Specific challenges comprise mastitis treatment in dairy cattle, the range of species and within species breed considerations and several other variable factors not relevant to human medicine. Each CBP will be based on consideration of: (i) an epidemiological cut-off value (ECOFF) – the highest MIC that defines the upper end of the wild-type MIC distribution; (ii) a PK/PD breakpoint obtained from pre-clinical pharmacokinetic data [this PK/PD break-point is the highest possible MIC for which a given percentage of animals in the target population achieves a critical value for the selected PK/PD index (fAUC/MIC or fT > MIC)] and (iii) when possible, a clinical cut-off, that is the relationship between MIC and clinical cure. For the latter, VetCAST acknowledges the paucity of such data in veterinary medicine. When a CBP cannot be established, VetCAST will recommend use of ECOFF as surrogate. For decision steps, VetCAST will follow EUCAST procedures involving transparency, consensus and independence. VetCAST will ensure freely available dissemination of information, concerning standards, guidelines, ECOFF, PK/PD breakpoints, CBPs and other relevant information for AST

  15. Phylogenetic comparative methods on phylogenetic networks with reticulations.

    PubMed

    Bastide, Paul; Solís-Lemus, Claudia; Kriebel, Ricardo; Sparks, K William; Ané, Cécile

    2018-04-25

    The goal of Phylogenetic Comparative Methods (PCMs) is to study the distribution of quantitative traits among related species. The observed traits are often seen as the result of a Brownian Motion (BM) along the branches of a phylogenetic tree. Reticulation events such as hybridization, gene flow or horizontal gene transfer, can substantially affect a species' traits, but are not modeled by a tree. Phylogenetic networks have been designed to represent reticulate evolution. As they become available for downstream analyses, new models of trait evolution are needed, applicable to networks. One natural extension of the BM is to use a weighted average model for the trait of a hybrid, at a reticulation point. We develop here an efficient recursive algorithm to compute the phylogenetic variance matrix of a trait on a network, in only one preorder traversal of the network. We then extend the standard PCM tools to this new framework, including phylogenetic regression with covariates (or phylogenetic ANOVA), ancestral trait reconstruction, and Pagel's λ test of phylogenetic signal. The trait of a hybrid is sometimes outside of the range of its two parents, for instance because of hybrid vigor or hybrid depression. These two phenomena are rather commonly observed in present-day hybrids. Transgressive evolution can be modeled as a shift in the trait value following a reticulation point. We develop a general framework to handle such shifts, and take advantage of the phylogenetic regression view of the problem to design statistical tests for ancestral transgressive evolution in the evolutionary history of a group of species. We study the power of these tests in several scenarios, and show that recent events have indeed the strongest impact on the trait distribution of present-day taxa. We apply those methods to a dataset of Xiphophorus fishes, to confirm and complete previous analysis in this group. All the methods developed here are available in the Julia package PhyloNetworks.

  16. Using Sorting by Reversal: Breakpoint Graph for Gene Assembly in Ciliates

    NASA Astrophysics Data System (ADS)

    Brijder, Robert; Jan Hoogeboom, Hendrik

    2007-09-01

    The theory of gene assembly in ciliates has much in common with the theory of sorting by reversal. Both model processes that are based on splicing, and have a fixed begin and end product. The main difference is the type of splicing operations used to obtain the end product from the begin product. In this overview paper we show that the concept of breakpoint graph, known from the theory of sorting by reversal, has many uses in the theory of gene assembly. Our aim is to present the material in an intuitive and informal manner to allow for an efficient introduction into the subject.

  17. Phylogenetic turnover during subtropical forest succession across environmental and phylogenetic scales.

    PubMed

    Purschke, Oliver; Michalski, Stefan G; Bruelheide, Helge; Durka, Walter

    2017-12-01

    Although spatial and temporal patterns of phylogenetic community structure during succession are inherently interlinked and assembly processes vary with environmental and phylogenetic scales, successional studies of community assembly have yet to integrate spatial and temporal components of community structure, while accounting for scaling issues. To gain insight into the processes that generate biodiversity after disturbance, we combine analyses of spatial and temporal phylogenetic turnover across phylogenetic scales, accounting for covariation with environmental differences. We compared phylogenetic turnover, at the species- and individual-level, within and between five successional stages, representing woody plant communities in a subtropical forest chronosequence. We decomposed turnover at different phylogenetic depths and assessed its covariation with between-plot abiotic differences. Phylogenetic turnover between stages was low relative to species turnover and was not explained by abiotic differences. However, within the late-successional stages, there was high presence-/absence-based turnover (clustering) that occurred deep in the phylogeny and covaried with environmental differentiation. Our results support a deterministic model of community assembly where (i) phylogenetic composition is constrained through successional time, but (ii) toward late succession, species sorting into preferred habitats according to niche traits that are conserved deep in phylogeny, becomes increasingly important.

  18. Cloning of the breakpoints of a de novo inversion of chromosome 8, inv (8)(p11.2q23.1) in a patient with Ambras syndrome.

    PubMed

    Tadin-Strapps, M; Warburton, D; Baumeister, F A M; Fischer, S G; Yonan, J; Gilliam, T C; Christiano, A M

    2004-01-01

    Ambras syndrome (AMS) is a unique form of universal congenital hypertrichosis. In patients with this syndrome, the whole body is covered with fine long hair, except for areas where normally no hair grows. There is accompanying facial dysmorphism and teeth abnormalities, including retarded first and second dentition and absence of teeth. In 1993, Baumeister et al. reported an isolated case of Ambras syndrome in association with a pericentric inversion of chromosome 8. Subsequently, another patient with congenital hypertrichosis and rearrangement of chromosome 8 was reported by Balducci et al. (1998). Both of these patients have a breakpoint in 8q22 in common suggesting that this region of chromosome 8 contains a gene involved in regulation of hair growth. In order to precisely determine the nature of the rearrangement in the case of Ambras syndrome, we have used fluorescent in situ hybridization (FISH) analysis. We have cloned the inversion breakpoints in this patient and generated a detailed physical map of the inversion breakpoint interval. Analysis of the transcripts that map in the vicinity of the breakpoints revealed that the inversion does not disrupt a gene, and suggests that the phenotype is caused by a position effect. Copyright 2004 S. Karger AG, Basel

  19. Phylogenetics.

    PubMed

    Sleator, Roy D

    2011-04-01

    The recent rapid expansion in the DNA and protein databases, arising from large-scale genomic and metagenomic sequence projects, has forced significant development in the field of phylogenetics: the study of the evolutionary relatedness of the planet's inhabitants. Advances in phylogenetic analysis have greatly transformed our view of the landscape of evolutionary biology, transcending the view of the tree of life that has shaped evolutionary theory since Darwinian times. Indeed, modern phylogenetic analysis no longer focuses on the restricted Darwinian-Mendelian model of vertical gene transfer, but must also consider the significant degree of lateral gene transfer, which connects and shapes almost all living things. Herein, I review the major tree-building methods, their strengths, weaknesses and future prospects.

  20. Differences in antimicrobial susceptibility breakpoints for Pseudomonas aeruginosa, isolated from blood cultures, set by the Clinical and Laboratory Standards Institute (CLSI) and the Japanese Society of Chemotherapy.

    PubMed

    Nakamura, Tatsuya; Shimizu, Chihiro; Kasahara, Mayumi; Nakata, Chiyo; Munakata, Machiko; Takahashi, Hakuo

    2007-02-01

    A study was made of the antimicrobial susceptibility to and efficacy of various kinds of antimicrobial agents against 179 strains of Pseudomonas aeruginosa that were isolated from blood cultures at Kansai Medical University Hospital from 1990 through 2004. The annual detection rate was highest in 1994, at 22 strains (6.5%). There were 9 multidrug resistant strains of Pseudomonas aeruginosa (5.0%). Among 14 antimicrobial agents tested for measurements, ciprofloxacin (CPFX) showed the best minimum inhibitory concentration (MIC) 50 value, of 0.25 microg/ml, followed by pazufloxacin (PZFX) and biapenem (BIPM), each at 0.5 microg/ml. When the period of 15 years was divided into three stages, the MIC50 value for each antimicrobial agent was highest in the middle stage (1995 to 1999). Assuming that the percentage of sensitive strains according to the breakpoints set by the Clinical and Laboratory Standards Institute (CLSI) represents the antimicrobial susceptibility rate, amikacin (AMK) showed the best value, of 85.5%. According to the sepsis breakpoint set by the Japanese Society of Chemotherapy (JSC), the efficacy of CPFX showed the highest rate (77.1%) of all the antimicrobial agents tested. Among beta-lactams, BIPM showed the highest efficacy rate, of 67.0%. When the efficacy rates were compared with each other, the difference in efficacy rate between the breakpoint set by the CLSI and the sepsis breakpoint set by the JSC was large for beta-lactams. Comparisons made based on the CLSI criteria showed no difference in cross-resistance rates between CPFX, meropenem (MEPM), and BIPM. However, when comparisons were made using the JSC sepsis breakpoint, MEPM showed a cross-resistance rate of 87.8%, while the rate for BIPM was lower, at 56.1%, with the chi2 test showing a significant difference, at P = 0.0014. In accordance with the pharmacokinetics/pharmacodynamics theory that has been advocated, breakpoints which are more suitable for the clinical setting in Japan should

  1. An Improved Binary Differential Evolution Algorithm to Infer Tumor Phylogenetic Trees.

    PubMed

    Liang, Ying; Liao, Bo; Zhu, Wen

    2017-01-01

    Tumourigenesis is a mutation accumulation process, which is likely to start with a mutated founder cell. The evolutionary nature of tumor development makes phylogenetic models suitable for inferring tumor evolution through genetic variation data. Copy number variation (CNV) is the major genetic marker of the genome with more genes, disease loci, and functional elements involved. Fluorescence in situ hybridization (FISH) accurately measures multiple gene copy number of hundreds of single cells. We propose an improved binary differential evolution algorithm, BDEP, to infer tumor phylogenetic tree based on FISH platform. The topology analysis of tumor progression tree shows that the pathway of tumor subcell expansion varies greatly during different stages of tumor formation. And the classification experiment shows that tree-based features are better than data-based features in distinguishing tumor. The constructed phylogenetic trees have great performance in characterizing tumor development process, which outperforms other similar algorithms.

  2. TreeShrink: fast and accurate detection of outlier long branches in collections of phylogenetic trees.

    PubMed

    Mai, Uyen; Mirarab, Siavash

    2018-05-08

    Sequence data used in reconstructing phylogenetic trees may include various sources of error. Typically errors are detected at the sequence level, but when missed, the erroneous sequences often appear as unexpectedly long branches in the inferred phylogeny. We propose an automatic method to detect such errors. We build a phylogeny including all the data then detect sequences that artificially inflate the tree diameter. We formulate an optimization problem, called the k-shrink problem, that seeks to find k leaves that could be removed to maximally reduce the tree diameter. We present an algorithm to find the exact solution for this problem in polynomial time. We then use several statistical tests to find outlier species that have an unexpectedly high impact on the tree diameter. These tests can use a single tree or a set of related gene trees and can also adjust to species-specific patterns of branch length. The resulting method is called TreeShrink. We test our method on six phylogenomic biological datasets and an HIV dataset and show that the method successfully detects and removes long branches. TreeShrink removes sequences more conservatively than rogue taxon removal and often reduces gene tree discordance more than rogue taxon removal once the amount of filtering is controlled. TreeShrink is an effective method for detecting sequences that lead to unrealistically long branch lengths in phylogenetic trees. The tool is publicly available at https://github.com/uym2/TreeShrink .

  3. Genomic structure and paralogous regions of the inversion breakpoint occurring between human chromosome 3p12.3 and orangutan chromosome 2.

    PubMed

    Yue, Y; Grossmann, B; Tsend-Ayush, E; Grützner, F; Ferguson-Smith, M A; Yang, F; Haaf, T

    2005-01-01

    Intrachromosomal duplications play a significant role in human genome pathology and evolution. To better understand the molecular basis of evolutionary chromosome rearrangements, we performed molecular cytogenetic and sequence analyses of the breakpoint region that distinguishes human chromosome 3p12.3 and orangutan chromosome 2. FISH with region-specific BAC clones demonstrated that the breakpoint-flanking sequences are duplicated intrachromosomally on orangutan 2 and human 3q21 as well as at many pericentromeric and subtelomeric sites throughout the genomes. Breakage and rearrangement of the human 3p12.3-homologous region in the orangutan lineage were associated with a partial loss of duplicated sequences in the breakpoint region. Consistent with our FISH mapping results, computational analysis of the human chromosome 3 genomic sequence revealed three 3p12.3-paralogous sequence blocks on human chromosome 3q21 and smaller blocks on the short arm end 3p26-->p25. This is consistent with the view that sequences from an ancestral site at 3q21 were duplicated at 3p12.3 in a common ancestor of orangutan and humans. Our results show that evolutionary chromosome rearrangements are associated with microduplications and microdeletions, contributing to the DNA differences between closely related species. Copyright (c) 2005 S. Karger AG, Basel.

  4. Bayesian phylogenetic estimation of fossil ages.

    PubMed

    Drummond, Alexei J; Stadler, Tanja

    2016-07-19

    Recent advances have allowed for both morphological fossil evidence and molecular sequences to be integrated into a single combined inference of divergence dates under the rule of Bayesian probability. In particular, the fossilized birth-death tree prior and the Lewis-Mk model of discrete morphological evolution allow for the estimation of both divergence times and phylogenetic relationships between fossil and extant taxa. We exploit this statistical framework to investigate the internal consistency of these models by producing phylogenetic estimates of the age of each fossil in turn, within two rich and well-characterized datasets of fossil and extant species (penguins and canids). We find that the estimation accuracy of fossil ages is generally high with credible intervals seldom excluding the true age and median relative error in the two datasets of 5.7% and 13.2%, respectively. The median relative standard error (RSD) was 9.2% and 7.2%, respectively, suggesting good precision, although with some outliers. In fact, in the two datasets we analyse, the phylogenetic estimate of fossil age is on average less than 2 Myr from the mid-point age of the geological strata from which it was excavated. The high level of internal consistency found in our analyses suggests that the Bayesian statistical model employed is an adequate fit for both the geological and morphological data, and provides evidence from real data that the framework used can accurately model the evolution of discrete morphological traits coded from fossil and extant taxa. We anticipate that this approach will have diverse applications beyond divergence time dating, including dating fossils that are temporally unconstrained, testing of the 'morphological clock', and for uncovering potential model misspecification and/or data errors when controversial phylogenetic hypotheses are obtained based on combined divergence dating analyses.This article is part of the themed issue 'Dating species divergences using

  5. Bayesian phylogenetic estimation of fossil ages

    PubMed Central

    Drummond, Alexei J.; Stadler, Tanja

    2016-01-01

    Recent advances have allowed for both morphological fossil evidence and molecular sequences to be integrated into a single combined inference of divergence dates under the rule of Bayesian probability. In particular, the fossilized birth–death tree prior and the Lewis-Mk model of discrete morphological evolution allow for the estimation of both divergence times and phylogenetic relationships between fossil and extant taxa. We exploit this statistical framework to investigate the internal consistency of these models by producing phylogenetic estimates of the age of each fossil in turn, within two rich and well-characterized datasets of fossil and extant species (penguins and canids). We find that the estimation accuracy of fossil ages is generally high with credible intervals seldom excluding the true age and median relative error in the two datasets of 5.7% and 13.2%, respectively. The median relative standard error (RSD) was 9.2% and 7.2%, respectively, suggesting good precision, although with some outliers. In fact, in the two datasets we analyse, the phylogenetic estimate of fossil age is on average less than 2 Myr from the mid-point age of the geological strata from which it was excavated. The high level of internal consistency found in our analyses suggests that the Bayesian statistical model employed is an adequate fit for both the geological and morphological data, and provides evidence from real data that the framework used can accurately model the evolution of discrete morphological traits coded from fossil and extant taxa. We anticipate that this approach will have diverse applications beyond divergence time dating, including dating fossils that are temporally unconstrained, testing of the ‘morphological clock', and for uncovering potential model misspecification and/or data errors when controversial phylogenetic hypotheses are obtained based on combined divergence dating analyses. This article is part of the themed issue ‘Dating species divergences

  6. Molecular breakpoint cloning and gene expression studies of a novel translocation t(4;15)(q27;q11.2) associated with Prader-Willi syndrome

    PubMed Central

    Schüle, Birgitt; Albalwi, Mohammed; Northrop, Emma; Francis, David I; Rowell, Margaret; Slater, Howard R; Gardner, RJ McKinlay; Francke, Uta

    2005-01-01

    Background Prader-Willi syndrome (MIM #176270; PWS) is caused by lack of the paternally-derived copies, or their expression, of multiple genes in a 4 Mb region on chromosome 15q11.2. Known mechanisms include large deletions, maternal uniparental disomy or mutations involving the imprinting center. De novo balanced reciprocal translocations in 5 reported individuals had breakpoints clustering in SNRPN intron 2 or exon 20/intron 20. To further dissect the PWS phenotype and define the minimal critical region for PWS features, we have studied a 22 year old male with a milder PWS phenotype and a de novo translocation t(4;15)(q27;q11.2). Methods We used metaphase FISH to narrow the breakpoint region and molecular analyses to map the breakpoints on both chromosomes at the nucleotide level. The expression of genes on chromosome 15 on both sides of the breakpoint was determined by RT-PCR analyses. Results Pertinent clinical features include neonatal hypotonia with feeding difficulties, hypogonadism, short stature, late-onset obesity, learning difficulties, abnormal social behavior and marked tolerance to pain, as well as sticky saliva and narcolepsy. Relative macrocephaly and facial features are not typical for PWS. The translocation breakpoints were identified within SNRPN intron 17 and intron 10 of a spliced non-coding transcript in band 4q27. LINE and SINE sequences at the exchange points may have contributed to the translocation event. By RT-PCR of lymphoblasts and fibroblasts, we find that upstream SNURF/SNRPN exons and snoRNAs HBII-437 and HBII-13 are expressed, but the downstream snoRNAs PWCR1/HBII-85 and HBII-438A/B snoRNAs are not. Conclusion As part of the PWCR1/HBII-85 snoRNA cluster is highly conserved between human and mice, while no copy of HBII-438 has been found in mouse, we conclude that PWCR1/HBII-85 snoRNAs is likely to play a major role in the PWS- phenotype. PMID:15877813

  7. Molecular breakpoint cloning and gene expression studies of a novel translocation t(4;15)(q27;q11.2) associated with Prader-Willi syndrome.

    PubMed

    Schüle, Birgitt; Albalwi, Mohammed; Northrop, Emma; Francis, David I; Rowell, Margaret; Slater, Howard R; Gardner, R J McKinlay; Francke, Uta

    2005-05-06

    Prader-Willi syndrome (MIM #176270; PWS) is caused by lack of the paternally-derived copies, or their expression, of multiple genes in a 4 Mb region on chromosome 15q11.2. Known mechanisms include large deletions, maternal uniparental disomy or mutations involving the imprinting center. De novo balanced reciprocal translocations in 5 reported individuals had breakpoints clustering in SNRPN intron 2 or exon 20/intron 20. To further dissect the PWS phenotype and define the minimal critical region for PWS features, we have studied a 22 year old male with a milder PWS phenotype and a de novo translocation t(4;15)(q27;q11.2). We used metaphase FISH to narrow the breakpoint region and molecular analyses to map the breakpoints on both chromosomes at the nucleotide level. The expression of genes on chromosome 15 on both sides of the breakpoint was determined by RT-PCR analyses. Pertinent clinical features include neonatal hypotonia with feeding difficulties, hypogonadism, short stature, late-onset obesity, learning difficulties, abnormal social behavior and marked tolerance to pain, as well as sticky saliva and narcolepsy. Relative macrocephaly and facial features are not typical for PWS. The translocation breakpoints were identified within SNRPN intron 17 and intron 10 of a spliced non-coding transcript in band 4q27. LINE and SINE sequences at the exchange points may have contributed to the translocation event. By RT-PCR of lymphoblasts and fibroblasts, we find that upstream SNURF/SNRPN exons and snoRNAs HBII-437 and HBII-13 are expressed, but the downstream snoRNAs PWCR1/HBII-85 and HBII-438A/B snoRNAs are not. As part of the PWCR1/HBII-85 snoRNA cluster is highly conserved between human and mice, while no copy of HBII-438 has been found in mouse, we conclude that PWCR1/HBII-85 snoRNAs is likely to play a major role in the PWS- phenotype.

  8. Phylogenetic inference under varying proportions of indel-induced alignment gaps

    PubMed Central

    Dwivedi, Bhakti; Gadagkar, Sudhindra R

    2009-01-01

    Background The effect of alignment gaps on phylogenetic accuracy has been the subject of numerous studies. In this study, we investigated the relationship between the total number of gapped sites and phylogenetic accuracy, when the gaps were introduced (by means of computer simulation) to reflect indel (insertion/deletion) events during the evolution of DNA sequences. The resulting (true) alignments were subjected to commonly used gap treatment and phylogenetic inference methods. Results (1) In general, there was a strong – almost deterministic – relationship between the amount of gap in the data and the level of phylogenetic accuracy when the alignments were very "gappy", (2) gaps resulting from deletions (as opposed to insertions) contributed more to the inaccuracy of phylogenetic inference, (3) the probabilistic methods (Bayesian, PhyML & "MLε, " a method implemented in DNAML in PHYLIP) performed better at most levels of gap percentage when compared to parsimony (MP) and distance (NJ) methods, with Bayesian analysis being clearly the best, (4) methods that treat gapped sites as missing data yielded less accurate trees when compared to those that attribute phylogenetic signal to the gapped sites (by coding them as binary character data – presence/absence, or as in the MLε method), and (5) in general, the accuracy of phylogenetic inference depended upon the amount of available data when the gaps resulted from mainly deletion events, and the amount of missing data when insertion events were equally likely to have caused the alignment gaps. Conclusion When gaps in an alignment are a consequence of indel events in the evolution of the sequences, the accuracy of phylogenetic analysis is likely to improve if: (1) alignment gaps are categorized as arising from insertion events or deletion events and then treated separately in the analysis, (2) the evolutionary signal provided by indels is harnessed in the phylogenetic analysis, and (3) methods that utilize the

  9. Deletion 2q37 syndrome: Cognitive-behavioral trajectories and autistic features related to breakpoint and deletion size.

    PubMed

    Fisch, Gene S; Falk, Rena E; Carey, John C; Imitola, Jaime; Sederberg, Maria; Caravalho, Karen S; South, Sarah

    2016-09-01

    Subtelomeric deletions have been reported in ∼2.5% of individuals with developmental disabilities. Subtelomeric deletion 2q37 has been detected in many individuals diagnosed with intellectual disabilities (ID) and autism spectrum disorders (ASD). Previously, genotype-phenotype correspondences were examined for their relationship to breakpoints 37.1, 37.2, or 37.3. Our purpose was to ascertain whether there were phenotypic differences at these breakpoints, elucidate the cognitive-behavioral phenotype in del2q37, and examine the genotype-phenotype association in the deletion with respect to cognitive-behavioral profiles and ASD. We administered a comprehensive cognitive-behavioral battery to nine children diagnosed with del 2q37, ages 3.9-17.75 years. ID for five tested with the Stanford-Binet (4th Edition) (SBFE) ranged from severe to mild [IQ Range: 36-59]. Adaptive behavior scores from the Vineland Adaptive Behavior Scale (VABS) were much below adequate levels (DQ Range: floor value ["19"] to 55). Autism scores from the Child Autism Rating Scale (CARS) ranged from 22 [non-autistic] to 56 [extremely autistic]; 5/8 [63%] children received scores on the autism spectrum. Participants with the largest deletions, 10.1 and 9.5 Mb, attained the highest IQ and DQ scores while those with the smallest deletions, 7.9 and 6.6 Mb, made the lowest IQ and DQ scores. No association between deletion breakpoint and phenotype were found. Assessment of the various deleted regions suggested histone deacetylase 4 gene (HDAC4) was a likely candidate gene for ASD in our sample. However, two earlier reports found no association between HDAC4 haploinsufficiency and ASD. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  10. Stratification of co-evolving genomic groups using ranked phylogenetic profiles

    PubMed Central

    Freilich, Shiri; Goldovsky, Leon; Gottlieb, Assaf; Blanc, Eric; Tsoka, Sophia; Ouzounis, Christos A

    2009-01-01

    Background Previous methods of detecting the taxonomic origins of arbitrary sequence collections, with a significant impact to genome analysis and in particular metagenomics, have primarily focused on compositional features of genomes. The evolutionary patterns of phylogenetic distribution of genes or proteins, represented by phylogenetic profiles, provide an alternative approach for the detection of taxonomic origins, but typically suffer from low accuracy. Herein, we present rank-BLAST, a novel approach for the assignment of protein sequences into genomic groups of the same taxonomic origin, based on the ranking order of phylogenetic profiles of target genes or proteins across the reference database. Results The rank-BLAST approach is validated by computing the phylogenetic profiles of all sequences for five distinct microbial species of varying degrees of phylogenetic proximity, against a reference database of 243 fully sequenced genomes. The approach - a combination of sequence searches, statistical estimation and clustering - analyses the degree of sequence divergence between sets of protein sequences and allows the classification of protein sequences according to the species of origin with high accuracy, allowing taxonomic classification of 64% of the proteins studied. In most cases, a main cluster is detected, representing the corresponding species. Secondary, functionally distinct and species-specific clusters exhibit different patterns of phylogenetic distribution, thus flagging gene groups of interest. Detailed analyses of such cases are provided as examples. Conclusion Our results indicate that the rank-BLAST approach can capture the taxonomic origins of sequence collections in an accurate and efficient manner. The approach can be useful both for the analysis of genome evolution and the detection of species groups in metagenomics samples. PMID:19860884

  11. Revised Ciprofloxacin Breakpoints for Salmonella: Is it Time to Write an Obituary?

    PubMed

    Girish, Revathy; Kumar, Anil; Khan, Sadia; Dinesh, Kavitha R; Karim, Shamsul

    2013-11-01

    To determine the minimum inhibitory concentration of ciprofloxacin among 50 blood stream isolates of Salmonella enterica. A total of 50 consecutive isolates of Salmonella enterica were tested for susceptibility to antimicrobials using the Kirby Bauer disk diffusion method. Minimum inhibitory concentrations were determined using Hi-Comb strips. All results were interpreted according to the CLSI guidelines. Of the 50 isolates 70%were Salmonella Typhi, 4% Salmonella paratyphi A, 2% Salmonella paratyphi B and the remaining 10% were identified only as Salmonella species. Using the CLSI 2011 breakpoints for disc diffusion, 86% (43/50) were resistant to nalidixic acid(NA), 22% (11/50) to ciprofloxacin, 12% to azithromycin, 6% to cotrimoxazole, 4% to ampicillin and 1% to chloramphenicol. The MIC50 and MIC90 of ciprofloxacin for S.Typhi were 0.181 μg/mL and 5.06 μg/mL respectively. While the same for S. paratyphi A was 0.212μg/mL and 0.228μg/mL respectively. None of the isolates were multi drug resistant and all were susceptible to ceftriaxone. Using the CLSI 2012 revised ciprofloxacin breakpoints for disc diffusion (>31mm) & MIC (<0.06 μg/mL), 90% (45/50) of these isolates were found to be resistant. MIC's of ciprofloxacin should be reported for all salmonella isolates and should be used to guide treatment. Blindly following western guidelines for a disease which is highly endemic in the subcontinent will spell the death knell of a cheap and effective drug in our armamentarium. Therefore it will be too premature to declare that "the concept of using ciprofloxacin in typhoid fever is dead!"

  12. BIMLR: a method for constructing rooted phylogenetic networks from rooted phylogenetic trees.

    PubMed

    Wang, Juan; Guo, Maozu; Xing, Linlin; Che, Kai; Liu, Xiaoyan; Wang, Chunyu

    2013-09-15

    Rooted phylogenetic trees constructed from different datasets (e.g. from different genes) are often conflicting with one another, i.e. they cannot be integrated into a single phylogenetic tree. Phylogenetic networks have become an important tool in molecular evolution, and rooted phylogenetic networks are able to represent conflicting rooted phylogenetic trees. Hence, the development of appropriate methods to compute rooted phylogenetic networks from rooted phylogenetic trees has attracted considerable research interest of late. The CASS algorithm proposed by van Iersel et al. is able to construct much simpler networks than other available methods, but it is extremely slow, and the networks it constructs are dependent on the order of the input data. Here, we introduce an improved CASS algorithm, BIMLR. We show that BIMLR is faster than CASS and less dependent on the input data order. Moreover, BIMLR is able to construct much simpler networks than almost all other methods. BIMLR is available at http://nclab.hit.edu.cn/wangjuan/BIMLR/. © 2013 Elsevier B.V. All rights reserved.

  13. Characterization of IGH locus breakpoints in multiple myeloma indicates a subset of translocations appear to occur in pregerminal center B cells.

    PubMed

    Walker, Brian A; Wardell, Christopher P; Johnson, David C; Kaiser, Martin F; Begum, Dil B; Dahir, Nasrin B; Ross, Fiona M; Davies, Faith E; Gonzalez, David; Morgan, Gareth J

    2013-04-25

    Translocations in myeloma are thought to occur solely in mature B cells in the germinal center through class switch recombination (CSR). We used a targeted captured technique followed by massively parallel sequencing to determine the exact breakpoints in both the immunoglobulin heavy chain (IGH) locus and the partner chromosome in 61 presentation multiple myeloma samples. The majority of samples (62%) have a breakpoint within the switch regions upstream of the IGH constant genes and are generated through CSR in a mature B cell. However, the proportion of CSR translocations is not consistent between cytogenetic subgroups. We find that 100% of t(4;14) are CSR-mediated; however, 21% of t(11;14) and 25% of t(14;20) are generated through DH-JH recombination activation gene-mediated mechanisms, indicating they occur earlier in B-cell development at the pro-B-cell stage in the bone marrow. These 2 groups also generate translocations through receptor revision, as determined by the breakpoints and mutation status of the segments used in 10% and 50% of t(11;14) and t(14;20) samples, respectively. The study indicates that in a significant number of cases the translocation-based etiological events underlying myeloma may arise at the pro-B-cell hematological progenitor cell level, much earlier in B-cell development than was previously thought.

  14. Use of phylogenetical analysis to predict susceptibility of pathogenic Candida spp. to antifungal drugs.

    PubMed

    Maheux, Andrée F; Sellam, Adnane; Piché, Yves; Boissinot, Maurice; Pelletier, René; Boudreau, Dominique K; Picard, François J; Trépanier, Hélène; Boily, Marie-Josée; Ouellette, Marc; Roy, Paul H; Bergeron, Michel G

    2016-12-01

    Successful treatment of a Candida infection relies on 1) an accurate identification of the pathogenic fungus and 2) on its susceptibility to antifungal drugs. In the present study we investigated the level of correlation between phylogenetical evolution and susceptibility of pathogenic Candida spp. to antifungal drugs. For this, we compared a phylogenetic tree, assembled with the concatenated sequences (2475-bp) of the ATP2, TEF1, and TUF1 genes from 20 representative Candida species, with published minimal inhibitory concentrations (MIC) of the four principal antifungal drug classes commonly used in the treatment of candidiasis: polyenes, triazoles, nucleoside analogues, and echinocandins. The phylogenetic tree revealed three distinct phylogenetic clusters among Candida species. Species within a given phylogenetic cluster have generally similar susceptibility profiles to antifungal drugs and species within Clusters II and III were less sensitive to antifungal drugs than Cluster I species. These results showed that phylogenetical relationship between clusters and susceptibility to several antifungal drugs could be used to guide therapy when only species identification is available prior to information pertaining to its resistance profile. An extended study comprising a large panel of clinical samples should be conducted to confirm the efficiency of this approach in the treatment of candidiasis. Copyright © 2016. Published by Elsevier B.V.

  15. Microhomology-mediated end joining induces hypermutagenesis at breakpoint junctions

    PubMed Central

    Li, Fuyang; Villarreal, Diana; Shim, Jae Hoon; Myung, Kyungjae; Shim, Eun Yong; Lee, Sang Eun

    2017-01-01

    Microhomology (MH) flanking a DNA double-strand break (DSB) drives chromosomal rearrangements but its role in mutagenesis has not yet been analyzed. Here we determined the mutation frequency of a URA3 reporter gene placed at multiple locations distal to a DSB, which is flanked by different sizes (15-, 18-, or 203-bp) of direct repeat sequences for efficient repair in budding yeast. Induction of a DSB accumulates mutations in the reporter gene situated up to 14-kb distal to the 15-bp MH, but more modestly to those carrying 18- and 203-bp or no homology. Increased mutagenesis in MH-mediated end joining (MMEJ) appears coupled to its slower repair kinetics and the extensive resection occurring at flanking DNA. Chromosomal translocations via MMEJ also elevate mutagenesis of the flanking DNA sequences 7.1 kb distal to the breakpoint junction as compared to those without MH. The results suggest that MMEJ could destabilize genomes by triggering structural alterations and increasing mutation burden. PMID:28419093

  16. A method of alignment masking for refining the phylogenetic signal of multiple sequence alignments.

    PubMed

    Rajan, Vaibhav

    2013-03-01

    Inaccurate inference of positional homologies in multiple sequence alignments and systematic errors introduced by alignment heuristics obfuscate phylogenetic inference. Alignment masking, the elimination of phylogenetically uninformative or misleading sites from an alignment before phylogenetic analysis, is a common practice in phylogenetic analysis. Although masking is often done manually, automated methods are necessary to handle the much larger data sets being prepared today. In this study, we introduce the concept of subsplits and demonstrate their use in extracting phylogenetic signal from alignments. We design a clustering approach for alignment masking where each cluster contains similar columns-similarity being defined on the basis of compatible subsplits; our approach then identifies noisy clusters and eliminates them. Trees inferred from the columns in the retained clusters are found to be topologically closer to the reference trees. We test our method on numerous standard benchmarks (both synthetic and biological data sets) and compare its performance with other methods of alignment masking. We find that our method can eliminate sites more accurately than other methods, particularly on divergent data, and can improve the topologies of the inferred trees in likelihood-based analyses. Software available upon request from the author.

  17. Somatic cell hybrid mapping on mouse chromosome 11 (MMU11): Assignment of markers relative to two breakpoints in band D

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Morris, D.J.; Robinson, T.J.; Adler, I.D.

    1993-02-01

    Mouse [times] rat somatic cell hybrids were generated by fusing mouse cell lines that are heterozygous for reciprocal translocations involving the T42H and T9Ad breakpoints on mouse chromosome 11 (MMU11) to a thymidine kinase-negative (Tk[sup [minus

  18. Optimal rates for phylogenetic inference and experimental design in the era of genome-scale datasets.

    PubMed

    Dornburg, Alex; Su, Zhuo; Townsend, Jeffrey P

    2018-06-25

    With the rise of genome- scale datasets there has been a call for increased data scrutiny and careful selection of loci appropriate for attempting the resolution of a phylogenetic problem. Such loci are desired to maximize phylogenetic information content while minimizing the risk of homoplasy. Theory posits the existence of characters that evolve under such an optimum rate, and efforts to determine optimal rates of inference have been a cornerstone of phylogenetic experimental design for over two decades. However, both theoretical and empirical investigations of optimal rates have varied dramatically in their conclusions: spanning no relationship to a tight relationship between the rate of change and phylogenetic utility. Here we synthesize these apparently contradictory views, demonstrating both empirical and theoretical conditions under which each is correct. We find that optimal rates of characters-not genes-are generally robust to most experimental design decisions. Moreover, consideration of site rate heterogeneity within a given locus is critical to accurate predictions of utility. Factors such as taxon sampling or the targeted number of characters providing support for a topology are additionally critical to the predictions of phylogenetic utility based on the rate of character change. Further, optimality of rates and predictions of phylogenetic utility are not equivalent, demonstrating the need for further development of comprehensive theory of phylogenetic experimental design.

  19. Comparative Maps of Human 19p13.3 and Mouse Chromosome 10 Allow Identification of Sequences at Evolutionary Breakpoints

    PubMed Central

    Puttagunta, Radhika; Gordon, Laurie A.; Meyer, Gary E.; Kapfhamer, David; Lamerdin, Jane E.; Kantheti, Prameela; Portman, Kathleen M.; Chung, Wendy K.; Jenne, Dieter E.; Olsen, Anne S.; Burmeister, Margit

    2000-01-01

    A cosmid/bacterial artificial chromosome (BAC) contiguous (contig) map of human chromosome (HSA) 19p13.3 has been constructed, and over 50 genes have been localized to the contig. Genes and anonymous ESTs from ≈4000 kb of human 19p13.3 were placed on the central mouse chromosome 10 map by genetic mapping and pulsed-field gel electrophoresis (PFGE) analysis. A region of ∼2500 kb of HSA 19p13.3 is collinear to mouse chromosome (MMU) 10. In contrast, the adjacent ≈1200 kb are inverted. Two genes are located in a 50-kb region after the inversion on MMU 10, followed by a region of homology to mouse chromosome 17. The synteny breakpoint and one of the inversion breakpoints has been localized to sequenced regions in human <5 kb in size. Both breakpoints are rich in simple tandem repeats, including (TCTG)n, (CT)n, and (GTCTCT)n, suggesting that simple repeat sequences may be involved in chromosome breaks during evolution. The overall size of the region in mouse is smaller, although no large regions are missing. Comparing the physical maps to the genetic maps showed that in contrast to the higher-than-average rate of genetic recombination in gene-rich telomeric region on HSA 19p13.3, the average rate of recombination is lower than expected in the homologous mouse region. This might indicate that a hot spot of recombination may have been lost in mouse or gained in human during evolution, or that the position of sequences along the chromosome (telomeric compared to the middle of a chromosome) is important for recombination rates. PMID:10984455

  20. Cross-validation to select Bayesian hierarchical models in phylogenetics.

    PubMed

    Duchêne, Sebastián; Duchêne, David A; Di Giallonardo, Francesca; Eden, John-Sebastian; Geoghegan, Jemma L; Holt, Kathryn E; Ho, Simon Y W; Holmes, Edward C

    2016-05-26

    Recent developments in Bayesian phylogenetic models have increased the range of inferences that can be drawn from molecular sequence data. Accordingly, model selection has become an important component of phylogenetic analysis. Methods of model selection generally consider the likelihood of the data under the model in question. In the context of Bayesian phylogenetics, the most common approach involves estimating the marginal likelihood, which is typically done by integrating the likelihood across model parameters, weighted by the prior. Although this method is accurate, it is sensitive to the presence of improper priors. We explored an alternative approach based on cross-validation that is widely used in evolutionary analysis. This involves comparing models according to their predictive performance. We analysed simulated data and a range of viral and bacterial data sets using a cross-validation approach to compare a variety of molecular clock and demographic models. Our results show that cross-validation can be effective in distinguishing between strict- and relaxed-clock models and in identifying demographic models that allow growth in population size over time. In most of our empirical data analyses, the model selected using cross-validation was able to match that selected using marginal-likelihood estimation. The accuracy of cross-validation appears to improve with longer sequence data, particularly when distinguishing between relaxed-clock models. Cross-validation is a useful method for Bayesian phylogenetic model selection. This method can be readily implemented even when considering complex models where selecting an appropriate prior for all parameters may be difficult.

  1. The origin, global distribution, and functional impact of the human 8p23 inversion polymorphism.

    PubMed

    Salm, Maximilian P A; Horswell, Stuart D; Hutchison, Claire E; Speedy, Helen E; Yang, Xia; Liang, Liming; Schadt, Eric E; Cookson, William O; Wierzbicki, Anthony S; Naoumova, Rossi P; Shoulders, Carol C

    2012-06-01

    Genomic inversions are an increasingly recognized source of genetic variation. However, a lack of reliable high-throughput genotyping assays for these structures has precluded a full understanding of an inversion's phylogenetic, phenotypic, and population genetic properties. We characterize these properties for one of the largest polymorphic inversions in man (the ∼4.5-Mb 8p23.1 inversion), a structure that encompasses numerous signals of natural selection and disease association. We developed and validated a flexible bioinformatics tool that utilizes SNP data to enable accurate, high-throughput genotyping of the 8p23.1 inversion. This tool was applied retrospectively to diverse genome-wide data sets, revealing significant population stratification that largely follows a clinal "serial founder effect" distribution model. Phylogenetic analyses establish the inversion's ancestral origin within the Homo lineage, indicating that 8p23.1 inversion has occurred independently in the Pan lineage. The human inversion breakpoint was localized to an inverted pair of human endogenous retrovirus elements within the large, flanking low-copy repeats; experimental validation of this breakpoint confirmed these elements as the likely intermediary substrates that sponsored inversion formation. In five data sets, mRNA levels of disease-associated genes were robustly associated with inversion genotype. Moreover, a haplotype associated with systemic lupus erythematosus was restricted to the derived inversion state. We conclude that the 8p23.1 inversion is an evolutionarily dynamic structure that can now be accommodated into the understanding of human genetic and phenotypic diversity.

  2. Assessment of available anatomical characters for linking living mammals to fossil taxa in phylogenetic analyses.

    PubMed

    Guillerme, Thomas; Cooper, Natalie

    2016-05-01

    Analyses of living and fossil taxa are crucial for understanding biodiversity through time. The total evidence method allows living and fossil taxa to be combined in phylogenies, using molecular data for living taxa and morphological data for living and fossil taxa. With this method, substantial overlap of coded anatomical characters among living and fossil taxa is vital for accurately inferring topology. However, although molecular data for living species are widely available, scientists generating morphological data mainly focus on fossils. Therefore, there are fewer coded anatomical characters in living taxa, even in well-studied groups such as mammals. We investigated the number of coded anatomical characters available in phylogenetic matrices for living mammals and how these were phylogenetically distributed across orders. Eleven of 28 mammalian orders have less than 25% species with available characters; this has implications for the accurate placement of fossils, although the issue is less pronounced at higher taxonomic levels. In most orders, species with available characters are randomly distributed across the phylogeny, which may reduce the impact of the problem. We suggest that increased morphological data collection efforts for living taxa are needed to produce accurate total evidence phylogenies. © 2016 The Authors.

  3. Evolutionary history of the third chromosome gene arrangements of Drosophila pseudoobscura inferred from inversion breakpoints.

    PubMed

    Wallace, Andre G; Detweiler, Don; Schaeffer, Stephen W

    2011-08-01

    The third chromosome of Drosophila pseudoobscura is polymorphic for numerous gene arrangements that form classical clines in North America. The polytene salivary chromosomes isolated from natural populations revealed changes in gene order that allowed the different gene arrangements to be linked together by paracentric inversions representing one of the first cases where genetic data were used to construct a phylogeny. Although the inversion phylogeny can be used to determine the relationships among the gene arrangements, the cytogenetic data are unable to infer the ancestral arrangement or the age of the different chromosome types. These are both important properties if one is to infer the evolutionary forces responsible for the spread and maintenance of the chromosomes. Here, we employ the nucleotide sequences of 18 regions distributed across the third chromosome in 80-100 D. pseudoobscura strains to test whether five gene arrangements are of unique or multiple origin, what the ancestral arrangement was, and what are the ages of the different arrangements. Each strain carried one of six commonly found gene arrangements and the sequences were used to infer their evolutionary relationships. Breakpoint regions in the center of the chromosome supported monophyly of the gene arrangements, whereas regions at the ends of the chromosome gave phylogenies that provided less support for monophyly of the chromosomes either because the individual markers did not have enough phylogenetically informative sites or genetic exchange scrambled information among the gene arrangements. A data set where the genetic markers were concatenated strongly supported a unique origin of the different gene arrangements. The inversion polymorphism of D. pseudoobscura is estimated to be about a million years old. We have also shown that the generated phylogeny is consistent with the cytological phylogeny of this species. In addition, the data presented here support hypothetical as the ancestral

  4. Teaching Molecular Phylogenetics through Investigating a Real-World Phylogenetic Problem

    ERIC Educational Resources Information Center

    Zhang, Xiaorong

    2012-01-01

    A phylogenetics exercise is incorporated into the "Introduction to biocomputing" course, a junior-level course at Savannah State University. This exercise is designed to help students learn important concepts and practical skills in molecular phylogenetics through solving a real-world problem. In this application, students are required to identify…

  5. Phylogenomics of species from four genera of New World monkeys by flow sorting and reciprocal chromosome painting

    PubMed Central

    Dumas, Francesca; Stanyon, Roscoe; Sineo, Luca; Stone, Gary; Bigoni, Francesca

    2007-01-01

    Background The taxonomic and phylogenetic relationships of New World monkeys (Platyrrhini) are difficult to distinguish on the basis of morphology and because diagnostic fossils are rare. Recently, molecular data have led to a radical revision of the traditional taxonomy and phylogeny of these primates. Here we examine new hypotheses of platyrrhine evolutionary relationships by reciprocal chromosome painting after chromosome flow sorting of species belonging to four genera of platyrrhines included in the Cebidae family: Callithrix argentata (silvered-marmoset), Cebuella pygmaea (pygmy marmoset), Callimico goeldii (Goeldi's marmoset) and Saimiri sciureus (squirrel monkey). This is the first report of reciprocal painting in marmosets. Results The paints made from chromosome flow sorting of the four platyrrhine monkeys provided from 42 to 45 hybridization signals on human metaphases. The reciprocal painting of monkey probes on human chromosomes revealed that 21 breakpoints are common to all four studied species. There are only three additional breakpoints. A breakpoint on human chromosome 13 was found in Callithrix argentata, Cebuella pygmaea and Callimico goeldii, but not in Saimiri sciureus. There are two additional breakpoints on human chromosome 5: one is specific to squirrel monkeys, and the other to Goeldi's marmoset. Conclusion The reciprocal painting results support the molecular genomic assemblage of Cebidae. We demonstrated that the five chromosome associations previously hypothesized to phylogenetically link tamarins and marmosets are homologous and represent derived chromosome rearrangements. Four of these derived homologous associations tightly nest Callimico goeldii with marmosets. One derived association 2/15 may place squirrel monkeys within the Cebidae assemblage. An apparently common breakpoint on chromosome 5q33 found in both Saimiri and Aotus nancymae could be evidence of a phylogenetic link between these species. Comparison with previous reports

  6. Nonbinary Tree-Based Phylogenetic Networks.

    PubMed

    Jetten, Laura; van Iersel, Leo

    2018-01-01

    Rooted phylogenetic networks are used to describe evolutionary histories that contain non-treelike evolutionary events such as hybridization and horizontal gene transfer. In some cases, such histories can be described by a phylogenetic base-tree with additional linking arcs, which can, for example, represent gene transfer events. Such phylogenetic networks are called tree-based. Here, we consider two possible generalizations of this concept to nonbinary networks, which we call tree-based and strictly-tree-based nonbinary phylogenetic networks. We give simple graph-theoretic characterizations of tree-based and strictly-tree-based nonbinary phylogenetic networks. Moreover, we show for each of these two classes that it can be decided in polynomial time whether a given network is contained in the class. Our approach also provides a new view on tree-based binary phylogenetic networks. Finally, we discuss two examples of nonbinary phylogenetic networks in biology and show how our results can be applied to them.

  7. In vitro antibacterial activity of ceftobiprole against clinical isolates from French teaching hospitals: proposition of zone diameter breakpoints.

    PubMed

    Lascols, C; Legrand, P; Mérens, A; Leclercq, R; Muller-Serieys, C; Drugeon, H B; Kitzis, M D; Reverdy, M E; Roussel-Delvallez, M; Moubareck, C; Brémont, S; Miara, A; Gjoklaj, M; Soussy, C-J

    2011-03-01

    The aims of this study were to determine the in vitro activity profile of ceftobiprole, a pyrrolidinone cephalosporin, against a large number of bacterial pathogens and to propose zone diameter breakpoints for clinical categorisation according to the European Committee on Antimicrobial Susceptibility Testing (EUCAST) minimum inhibitory concentration (MIC) breakpoints. MICs of ceftobiprole were determined by broth microdilution against 1548 clinical isolates collected in eight French hospitals. Disk diffusion testing was performed using 30 μg disks according to the method of the Comité de l'Antibiogramme de la Société Française de Microbiologie (CA-SFM). The in vitro activity of ceftobiprole, expressed by MIC(50/90) (MICs for 50% and 90% of the organisms, respectively) (mg/L), was as follows: meticillin-susceptible Staphylococcus aureus, 0.25/0.5; meticillin-resistant S. aureus (MRSA), 1/2; meticillin-susceptible coagulase-negative staphylococci (CoNS), 0.12/0.5; meticillin-resistant CoNS, 1/2; penicillin-susceptible Streptococcus pneumoniae, ≤ 0.008/0.03; penicillin-resistant S. pneumoniae, 0.12/0.5; viridans group streptococci, 0.03/0.12; β-haemolytic streptococci, ≤ 0.008/0.016; Enterococcus faecalis, 0.25/1; Enterococcus faecium, 64/128; Enterobacteriaceae, 0.06/32; Pseudomonas aeruginosa, 4/16; Acinetobacter baumannii, 0.5/64; Haemophilus influenzae, 0.03/0.12; and Moraxella catarrhalis, 0.25/0.5. According to the regression curve, zone diameter breakpoints could be 28, 26, 24 and 22 mm for MICs of 0.5, 1, 2 and 4 mg/L respectively. In conclusion, this study confirms the potent in vitro activity of ceftobiprole against many Gram-positive bacteria, including MRSA but not E. faecium, whilst maintaining a Gram-negative spectrum similar to the advanced-generation cephalosporins such as cefepime. Thus ceftobiprole appears to be well suited for the empirical treatment of a variety of healthcare-associated infections. Copyright © 2011 Elsevier B.V. and the

  8. Progress, pitfalls and parallel universes: a history of insect phylogenetics

    PubMed Central

    Simon, Chris; Yavorskaya, Margarita; Beutel, Rolf G.

    2016-01-01

    The phylogeny of insects has been both extensively studied and vigorously debated for over a century. A relatively accurate deep phylogeny had been produced by 1904. It was not substantially improved in topology until recently when phylogenomics settled many long-standing controversies. Intervening advances came instead through methodological improvement. Early molecular phylogenetic studies (1985–2005), dominated by a few genes, provided datasets that were too small to resolve controversial phylogenetic problems. Adding to the lack of consensus, this period was characterized by a polarization of philosophies, with individuals belonging to either parsimony or maximum-likelihood camps; each largely ignoring the insights of the other. The result was an unfortunate detour in which the few perceived phylogenetic revolutions published by both sides of the philosophical divide were probably erroneous. The size of datasets has been growing exponentially since the mid-1980s accompanied by a wave of confidence that all relationships will soon be known. However, large datasets create new challenges, and a large number of genes does not guarantee reliable results. If history is a guide, then the quality of conclusions will be determined by an improved understanding of both molecular and morphological evolution, and not simply the number of genes analysed. PMID:27558853

  9. A High-Resolution Comparative Chromosome Map of Cricetus cricetus and Peromyscus eremicus Reveals the Involvement of Constitutive Heterochromatin in Breakpoint Regions.

    PubMed

    Vieira-da-Silva, Ana; Louzada, Sandra; Adega, Filomena; Chaves, Raquel

    2015-01-01

    Compared to humans and other mammals, rodent genomes, specifically Muroidea species, underwent intense chromosome reshuffling in which many complex structural rearrangements occurred. This fact makes them preferential animal models for studying the process of karyotype evolution. Here, we present the first combined chromosome comparative maps between 2 Cricetidae species, Cricetus cricetus and Peromyscus eremicus, and the index species Mus musculus and Rattus norvegicus. Comparative chromosome painting was done using mouse and rat paint probes together with in silico analysis from the Ensembl genome browser database. Hereby, evolutionary events (inter- and intrachromosomal rearrangements) that occurred in C. cricetus and P. eremicus since the putative ancestral Muroidea genome could be inferred, and evolutionary breakpoint regions could be detected. A colocalization of constitutive heterochromatin and evolutionary breakpoint regions in each genome was observed. Our results suggest the involvement of constitutive heterochromatin in karyotype restructuring of these species, despite the different levels of conservation of the C. cricetus (derivative) and P. eremicus (conserved) genomes. © 2015 S. Karger AG, Basel.

  10. Analysis of HIV-1 intersubtype recombination breakpoints suggests region with high pairing probability may be a more fundamental factor than sequence similarity affecting HIV-1 recombination.

    PubMed

    Jia, Lei; Li, Lin; Gui, Tao; Liu, Siyang; Li, Hanping; Han, Jingwan; Guo, Wei; Liu, Yongjian; Li, Jingyun

    2016-09-21

    With increasing data on HIV-1, a more relevant molecular model describing mechanism details of HIV-1 genetic recombination usually requires upgrades. Currently an incomplete structural understanding of the copy choice mechanism along with several other issues in the field that lack elucidation led us to perform an analysis of the correlation between breakpoint distributions and (1) the probability of base pairing, and (2) intersubtype genetic similarity to further explore structural mechanisms. Near full length sequences of URFs from Asia, Europe, and Africa (one sequence/patient), and representative sequences of worldwide CRFs were retrieved from the Los Alamos HIV database. Their recombination patterns were analyzed by jpHMM in detail. Then the relationships between breakpoint distributions and (1) the probability of base pairing, and (2) intersubtype genetic similarities were investigated. Pearson correlation test showed that all URF groups and the CRF group exhibit the same breakpoint distribution pattern. Additionally, the Wilcoxon two-sample test indicated a significant and inexplicable limitation of recombination in regions with high pairing probability. These regions have been found to be strongly conserved across distinct biological states (i.e., strong intersubtype similarity), and genetic similarity has been determined to be a very important factor promoting recombination. Thus, the results revealed an unexpected disagreement between intersubtype similarity and breakpoint distribution, which were further confirmed by genetic similarity analysis. Our analysis reveals a critical conflict between results from natural HIV-1 isolates and those from HIV-1-based assay vectors in which genetic similarity has been shown to be a very critical factor promoting recombination. These results indicate the region with high-pairing probabilities may be a more fundamental factor affecting HIV-1 recombination than sequence similarity in natural HIV-1 infections. Our

  11. The Independent Evolution Method Is Not a Viable Phylogenetic Comparative Method

    PubMed Central

    2015-01-01

    Phylogenetic comparative methods (PCMs) use data on species traits and phylogenetic relationships to shed light on evolutionary questions. Recently, Smaers and Vinicius suggested a new PCM, Independent Evolution (IE), which purportedly employs a novel model of evolution based on Felsenstein’s Adaptive Peak Model. The authors found that IE improves upon previous PCMs by producing more accurate estimates of ancestral states, as well as separate estimates of evolutionary rates for each branch of a phylogenetic tree. Here, we document substantial theoretical and computational issues with IE. When data are simulated under a simple Brownian motion model of evolution, IE produces severely biased estimates of ancestral states and changes along individual branches. We show that these branch-specific changes are essentially ancestor-descendant or “directional” contrasts, and draw parallels between IE and previous PCMs such as “minimum evolution”. Additionally, while comparisons of branch-specific changes between variables have been interpreted as reflecting the relative strength of selection on those traits, we demonstrate through simulations that regressing IE estimated branch-specific changes against one another gives a biased estimate of the scaling relationship between these variables, and provides no advantages or insights beyond established PCMs such as phylogenetically independent contrasts. In light of our findings, we discuss the results of previous papers that employed IE. We conclude that Independent Evolution is not a viable PCM, and should not be used in comparative analyses. PMID:26683838

  12. Cyber-infrastructure for Fusarium (CiF): Three integrated platforms supporting strain identification, phylogenetics, comparative genomics, and knowledge sharing

    USDA-ARS?s Scientific Manuscript database

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on ...

  13. Sequencing and Analyzing the "t" (1;7) Reciprocal Translocation Breakpoints Associated with a Case of Childhood-Onset Schizophrenia/Autistic Disorder

    ERIC Educational Resources Information Center

    Idol, Jacquelyn R.; Addington, Anjene M.; Long, Robert T.; Rapoport, Judith L.; Green, Eric D.

    2008-01-01

    We characterized a "t"(1;7)(p22;q21) reciprocal translocation in a patient with childhood-onset schizophrenia (COS) and autism using genome mapping and sequencing methods. Based on genomic maps of human chromosome 7 and fluorescence in situ hybridization (FISH) studies, we delimited the region of 7q21 harboring the translocation breakpoint to a…

  14. Targeted next-generation sequencing at copy-number breakpoints for personalized analysis of rearranged ends in solid tumors.

    PubMed

    Kim, Hyun-Kyoung; Park, Won Cheol; Lee, Kwang Man; Hwang, Hai-Li; Park, Seong-Yeol; Sorn, Sungbin; Chandra, Vishal; Kim, Kwang Gi; Yoon, Woong-Bae; Bae, Joon Seol; Shin, Hyoung Doo; Shin, Jong-Yeon; Seoh, Ju-Young; Kim, Jong-Il; Hong, Kyeong-Man

    2014-01-01

    The concept of the utilization of rearranged ends for development of personalized biomarkers has attracted much attention owing to its clinical applicability. Although targeted next-generation sequencing (NGS) for recurrent rearrangements has been successful in hematologic malignancies, its application to solid tumors is problematic due to the paucity of recurrent translocations. However, copy-number breakpoints (CNBs), which are abundant in solid tumors, can be utilized for identification of rearranged ends. As a proof of concept, we performed targeted next-generation sequencing at copy-number breakpoints (TNGS-CNB) in nine colon cancer cases including seven primary cancers and two cell lines, COLO205 and SW620. For deduction of CNBs, we developed a novel competitive single-nucleotide polymorphism (cSNP) microarray method entailing CNB-region refinement by competitor DNA. Using TNGS-CNB, 19 specific rearrangements out of 91 CNBs (20.9%) were identified, and two polymerase chain reaction (PCR)-amplifiable rearrangements were obtained in six cases (66.7%). And significantly, TNGS-CNB, with its high positive identification rate (82.6%) of PCR-amplifiable rearrangements at candidate sites (19/23), just from filtering of aligned sequences, requires little effort for validation. Our results indicate that TNGS-CNB, with its utility for identification of rearrangements in solid tumors, can be successfully applied in the clinical laboratory for cancer-relapse and therapy-response monitoring.

  15. Transforming phylogenetic networks: Moving beyond tree space.

    PubMed

    Huber, Katharina T; Moulton, Vincent; Wu, Taoyang

    2016-09-07

    Phylogenetic networks are a generalization of phylogenetic trees that are used to represent reticulate evolution. Unrooted phylogenetic networks form a special class of such networks, which naturally generalize unrooted phylogenetic trees. In this paper we define two operations on unrooted phylogenetic networks, one of which is a generalization of the well-known nearest-neighbor interchange (NNI) operation on phylogenetic trees. We show that any unrooted phylogenetic network can be transformed into any other such network using only these operations. This generalizes the well-known fact that any phylogenetic tree can be transformed into any other such tree using only NNI operations. It also allows us to define a generalization of tree space and to define some new metrics on unrooted phylogenetic networks. To prove our main results, we employ some fascinating new connections between phylogenetic networks and cubic graphs that we have recently discovered. Our results should be useful in developing new strategies to search for optimal phylogenetic networks, a topic that has recently generated some interest in the literature, as well as for providing new ways to compare networks. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. Missing Data and Influential Sites: Choice of Sites for Phylogenetic Analysis Can Be As Important As Taxon Sampling and Model Choice

    PubMed Central

    Shavit Grievink, Liat; Penny, David; Holland, Barbara R.

    2013-01-01

    Phylogenetic studies based on molecular sequence alignments are expected to become more accurate as the number of sites in the alignments increases. With the advent of genomic-scale data, where alignments have very large numbers of sites, bootstrap values close to 100% and posterior probabilities close to 1 are the norm, suggesting that the number of sites is now seldom a limiting factor on phylogenetic accuracy. This provokes the question, should we be fussy about the sites we choose to include in a genomic-scale phylogenetic analysis? If some sites contain missing data, ambiguous character states, or gaps, then why not just throw them away before conducting the phylogenetic analysis? Indeed, this is exactly the approach taken in many phylogenetic studies. Here, we present an example where the decision on how to treat sites with missing data is of equal importance to decisions on taxon sampling and model choice, and we introduce a graphical method for illustrating this. PMID:23471508

  17. Phylogenetic effective sample size.

    PubMed

    Bartoszek, Krzysztof

    2016-10-21

    In this paper I address the question-how large is a phylogenetic sample? I propose a definition of a phylogenetic effective sample size for Brownian motion and Ornstein-Uhlenbeck processes-the regression effective sample size. I discuss how mutual information can be used to define an effective sample size in the non-normal process case and compare these two definitions to an already present concept of effective sample size (the mean effective sample size). Through a simulation study I find that the AICc is robust if one corrects for the number of species or effective number of species. Lastly I discuss how the concept of the phylogenetic effective sample size can be useful for biodiversity quantification, identification of interesting clades and deciding on the importance of phylogenetic correlations. Copyright © 2016 Elsevier Ltd. All rights reserved.

  18. The origin, global distribution, and functional impact of the human 8p23 inversion polymorphism

    PubMed Central

    Salm, Maximilian P.A.; Horswell, Stuart D.; Hutchison, Claire E.; Speedy, Helen E.; Yang, Xia; Liang, Liming; Schadt, Eric E.; Cookson, William O.; Wierzbicki, Anthony S.; Naoumova, Rossi P.; Shoulders, Carol C.

    2012-01-01

    Genomic inversions are an increasingly recognized source of genetic variation. However, a lack of reliable high-throughput genotyping assays for these structures has precluded a full understanding of an inversion's phylogenetic, phenotypic, and population genetic properties. We characterize these properties for one of the largest polymorphic inversions in man (the ∼4.5-Mb 8p23.1 inversion), a structure that encompasses numerous signals of natural selection and disease association. We developed and validated a flexible bioinformatics tool that utilizes SNP data to enable accurate, high-throughput genotyping of the 8p23.1 inversion. This tool was applied retrospectively to diverse genome-wide data sets, revealing significant population stratification that largely follows a clinal “serial founder effect” distribution model. Phylogenetic analyses establish the inversion's ancestral origin within the Homo lineage, indicating that 8p23.1 inversion has occurred independently in the Pan lineage. The human inversion breakpoint was localized to an inverted pair of human endogenous retrovirus elements within the large, flanking low-copy repeats; experimental validation of this breakpoint confirmed these elements as the likely intermediary substrates that sponsored inversion formation. In five data sets, mRNA levels of disease-associated genes were robustly associated with inversion genotype. Moreover, a haplotype associated with systemic lupus erythematosus was restricted to the derived inversion state. We conclude that the 8p23.1 inversion is an evolutionarily dynamic structure that can now be accommodated into the understanding of human genetic and phenotypic diversity. PMID:22399572

  19. Molecular characterization of the breakpoints of a 12-kb deletion in the NF1 gene in a family showing germ-line mosaicism.

    PubMed Central

    Lázaro, C; Gaona, A; Lynch, M; Kruyer, H; Ravella, A; Estivill, X

    1995-01-01

    Neurofibromatosis type 1 (NF1) is caused by deletions, insertions, translocations, and point mutations in the NF1 gene, which spans 350 kb on the long arm of human chromosome 17. Although several point mutations have been described, large molecular abnormalities have rarely been characterized in detail. We describe here the molecular breakpoints of a 12-kb deletion of the NF1 gene, which is responsible for the NF1 phenotype in a kindred with two children affected because of germline mosaicism in the unaffected father, who has the mutation in 10% of his spermatozoa. The mutation spans introns 31-39, removing 12,021 nt and inserting 30 bp, of which 19 bp are a direct repetition of a sequence located in intron 31, just 4 bp before the 5' breakpoint. The 5' and 3' breakpoints contain the sequence TATTTTA, which could be involved in the generation of the deletion. The most plausible explanation for the mechanism involved in the generation of this 12-kb deletion is homologous/nonhomologous recombination. Since sperm of the father does not contain the corresponding insertion of the 12-kb deleted sequence, this deletion could have occurred within the NF1 chromosome through loop formation. RNA from lymphocytes of one of the NF1 patients showed similar levels of the mutated and normal transcripts, suggesting that the NF1-mRNA from mutations causing frame shifts of the reading frame or stop codons in this gene is not degraded during its processing. The mutation was not detected in fresh lymphocytes from the unaffected father by PCR analysis, supporting the case for true germ-line mosaicism. Images Figure 1 Figure 3 PMID:7485153

  20. A Format for Phylogenetic Placements

    PubMed Central

    Matsen, Frederick A.; Hoffman, Noah G.; Gallagher, Aaron; Stamatakis, Alexandros

    2012-01-01

    We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g., short reads) into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format, which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement. PMID:22383988

  1. A format for phylogenetic placements.

    PubMed

    Matsen, Frederick A; Hoffman, Noah G; Gallagher, Aaron; Stamatakis, Alexandros

    2012-01-01

    We have developed a unified format for phylogenetic placements, that is, mappings of environmental sequence data (e.g., short reads) into a phylogenetic tree. We are motivated to do so by the growing number of tools for computing and post-processing phylogenetic placements, and the lack of an established standard for storing them. The format is lightweight, versatile, extensible, and is based on the JSON format, which can be parsed by most modern programming languages. Our format is already implemented in several tools for computing and post-processing parsimony- and likelihood-based phylogenetic placements and has worked well in practice. We believe that establishing a standard format for analyzing read placements at this early stage will lead to a more efficient development of powerful and portable post-analysis tools for the growing applications of phylogenetic placement.

  2. Phylogenetic inertia and Darwin's higher law.

    PubMed

    Shanahan, Timothy

    2011-03-01

    The concept of 'phylogenetic inertia' is routinely deployed in evolutionary biology as an alternative to natural selection for explaining the persistence of characteristics that appear sub-optimal from an adaptationist perspective. However, in many of these contexts the precise meaning of 'phylogenetic inertia' and its relationship to selection are far from clear. After tracing the history of the concept of 'inertia' in evolutionary biology, I argue that treating phylogenetic inertia and natural selection as alternative explanations is mistaken because phylogenetic inertia is, from a Darwinian point of view, simply an expected effect of selection. Although Darwin did not discuss 'phylogenetic inertia,' he did assert the explanatory priority of selection over descent. An analysis of 'phylogenetic inertia' provides a perspective from which to assess Darwin's view. Copyright © 2010 Elsevier Ltd. All rights reserved.

  3. High-resolution phylogenetic microbial community profiling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Singer, Esther; Bushnell, Brian; Coleman-Derr, Devin

    Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structuresmore » at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake's water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential.« less

  4. High-resolution phylogenetic microbial community profiling

    DOE PAGES

    Singer, Esther; Bushnell, Brian; Coleman-Derr, Devin; ...

    2016-02-09

    Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structuresmore » at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake's water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential.« less

  5. Molecular Phylogenetics: Concepts for a Newcomer.

    PubMed

    Ajawatanawong, Pravech

    Molecular phylogenetics is the study of evolutionary relationships among organisms using molecular sequence data. The aim of this review is to introduce the important terminology and general concepts of tree reconstruction to biologists who lack a strong background in the field of molecular evolution. Some modern phylogenetic programs are easy to use because of their user-friendly interfaces, but understanding the phylogenetic algorithms and substitution models, which are based on advanced statistics, is still important for the analysis and interpretation without a guide. Briefly, there are five general steps in carrying out a phylogenetic analysis: (1) sequence data preparation, (2) sequence alignment, (3) choosing a phylogenetic reconstruction method, (4) identification of the best tree, and (5) evaluating the tree. Concepts in this review enable biologists to grasp the basic ideas behind phylogenetic analysis and also help provide a sound basis for discussions with expert phylogeneticists.

  6. Encoding phylogenetic trees in terms of weighted quartets.

    PubMed

    Grünewald, Stefan; Huber, Katharina T; Moulton, Vincent; Semple, Charles

    2008-04-01

    One of the main problems in phylogenetics is to develop systematic methods for constructing evolutionary or phylogenetic trees. For a set of species X, an edge-weighted phylogenetic X-tree or phylogenetic tree is a (graph theoretical) tree with leaf set X and no degree 2 vertices, together with a map assigning a non-negative length to each edge of the tree. Within phylogenetics, several methods have been proposed for constructing such trees that work by trying to piece together quartet trees on X, i.e. phylogenetic trees each having four leaves in X. Hence, it is of interest to characterise when a collection of quartet trees corresponds to a (unique) phylogenetic tree. Recently, Dress and Erdös provided such a characterisation for binary phylogenetic trees, that is, phylogenetic trees all of whose internal vertices have degree 3. Here we provide a new characterisation for arbitrary phylogenetic trees.

  7. Translocation breakpoint at 7q31 associated with tics: further evidence for IMMP2L as a candidate gene for Tourette syndrome.

    PubMed

    Patel, Chirag; Cooper-Charles, Lisa; McMullan, Dominic J; Walker, Judith M; Davison, Val; Morton, Jenny

    2011-06-01

    Gilles de la Tourette syndrome is a complex neuropsychiatric disorder with a strong genetic basis. We identified a male patient with Tourette syndrome-like tics and an apparently balanced de novo translocation [46,XY,t(2;7)(p24.2;q31)]. Further analysis using array comparative genomic hybridisation (CGH) revealed a cryptic deletion at 7q31.1-7q31.2. Breakpoints disrupting this region have been reported in one isolated and one familial case of Tourette syndrome. In our case, IMMP2L, a gene coding for a human homologue of the yeast inner mitochondrial membrane peptidase subunit 2, was disrupted by the breakpoint on 7q31.1, with deletion of exons 1-3 of the gene. The IMMP2L gene has previously been proposed as a candidate gene for Tourette syndrome, and our case provides further evidence of its possible role in the pathogenesis. The deleted region (7q31.1-7q31.2) of 7.2 Mb of genomic DNA also encompasses numerous genes, including FOXP2, associated with verbal dyspraxia, and the CFTR gene.

  8. Breakpoint analysis and relations of nutrient and turbidity stressor variables to macroinvertebrate integrity in streams in the Crawford-Mammoth Cave Uplands Ecoregion, Kentucky, for the development of nutrient criteria

    USGS Publications Warehouse

    Crain, Angela S.; Caskey, Brian J.

    2010-01-01

    To assist Kentucky in refining numeric nutrient criteria in the Pennyroyal Bioregion, the U.S. Geological Survey and the Kentucky Division of Water collected and analyzed water chemistry, turbidity, and biological-community data from 22 streams throughout the Crawford-Mammoth Cave Upland ecoregion (U.S. Environmental Protection Agency Level IV Ecoregion, 71a) within the Pennyroyal Bioregion from September 2007 to May 2008. Statistically significant and ecologically relevant relations among the stressor (total phosphorus, total nitrogen, and turbidity) variables and response (macroinvertebrate-community attributes) variables and the breakpoint values of biological-community attributes and metrics in response to changes in stressor variables were determined. Thirteen of 18 macroinvertebrate attributes were significantly and ecologically correlated (p-value < 0.10) with at least one nutrient measure. Total number of individuals, Ephemeroptera-Plecoptera-Trichoptera richness, and average tolerance value were macroinvertebrate measures that most strongly correlated with the concentrations of nutrients. Comparison of the average macroinvertebrate-breakpoint value for the median concentration of total phosphorus (TP, 0.033 mg/L) and for median concentration of total nitrogen (TN, 1.1 mg/L) to Dodds' trophic classification for TP and TN indicates streams in the Crawford-Mammoth Cave Uplands ecoregion within the Pennyroyal Bioregion would be classified as mesotrophic-eutrophic. The biological breakpoint relations with median concentrations of TP in this study were similar to the U.S. Environmental Protection Agency proposed numeric TP criteria (0.037 mg/L), but were 1.5 times higher than the proposed numeric criteria for concentrations of TN (0.69 mg/L). No sites were impacted adversely using median turbidity values based on a 25 Formazin nephelometric turbidity unit biological threshold. The breakpoints determined in this study, in addition to Dodds' trophic

  9. Tree-Based Unrooted Phylogenetic Networks.

    PubMed

    Francis, A; Huber, K T; Moulton, V

    2018-02-01

    Phylogenetic networks are a generalization of phylogenetic trees that are used to represent non-tree-like evolutionary histories that arise in organisms such as plants and bacteria, or uncertainty in evolutionary histories. An unrooted phylogenetic network on a non-empty, finite set X of taxa, or network, is a connected, simple graph in which every vertex has degree 1 or 3 and whose leaf set is X. It is called a phylogenetic tree if the underlying graph is a tree. In this paper we consider properties of tree-based networks, that is, networks that can be constructed by adding edges into a phylogenetic tree. We show that although they have some properties in common with their rooted analogues which have recently drawn much attention in the literature, they have some striking differences in terms of both their structural and computational properties. We expect that our results could eventually have applications to, for example, detecting horizontal gene transfer or hybridization which are important factors in the evolution of many organisms.

  10. Distribution of chromosome breakpoints in benzene-exposed and unexposed AML patients.

    PubMed

    Kerzic, Patrick J; Irons, Richard D

    2017-10-01

    Results of laboratory studies and investigations of occupationally exposed healthy individuals have been used to develop a mode of action for benzene-induced leukemia that mirrors disease following treatment with chemotherapeutic agents. Recently we have described series of AML and MDS cases with benzene exposure history, and have provided cytogenetic, molecular, and pathologic evidence that these cases differ significantly in many features from therapy-related disease. Here we have extended this work, and describe chromosome breakpoints across 441 identifiable regions, in terms of gains or losses, in 710 AML cases collected during the Shanghai Health Study, which include 75 with a history of benzene exposure. Using FISH and cytogenetic analysis, we developed prevalence information and risk ratios for benzene exposure across all regions with a lesion in at least one exposed and unexposed case. These results indicate that AML following benzene exposure mirrors de novo disease, and supports a mechanism for development of hematopoietic disease that bears no resemblance to therapy-related disease. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.

  11. Nodal distances for rooted phylogenetic trees.

    PubMed

    Cardona, Gabriel; Llabrés, Mercè; Rosselló, Francesc; Valiente, Gabriel

    2010-08-01

    Dissimilarity measures for (possibly weighted) phylogenetic trees based on the comparison of their vectors of path lengths between pairs of taxa, have been present in the systematics literature since the early seventies. For rooted phylogenetic trees, however, these vectors can only separate non-weighted binary trees, and therefore these dissimilarity measures are metrics only on this class of rooted phylogenetic trees. In this paper we overcome this problem, by splitting in a suitable way each path length between two taxa into two lengths. We prove that the resulting splitted path lengths matrices single out arbitrary rooted phylogenetic trees with nested taxa and arcs weighted in the set of positive real numbers. This allows the definition of metrics on this general class of rooted phylogenetic trees by comparing these matrices through metrics in spaces M(n)(R) of real-valued n x n matrices. We conclude this paper by establishing some basic facts about the metrics for non-weighted phylogenetic trees defined in this way using L(p) metrics on M(n)(R), with p [epsilon] R(>0).

  12. Phylogenetically resolving epidemiologic linkage

    PubMed Central

    Romero-Severson, Ethan O.; Bulla, Ingo; Leitner, Thomas

    2016-01-01

    Although the use of phylogenetic trees in epidemiological investigations has become commonplace, their epidemiological interpretation has not been systematically evaluated. Here, we use an HIV-1 within-host coalescent model to probabilistically evaluate transmission histories of two epidemiologically linked hosts. Previous critique of phylogenetic reconstruction has claimed that direction of transmission is difficult to infer, and that the existence of unsampled intermediary links or common sources can never be excluded. The phylogenetic relationship between the HIV populations of epidemiologically linked hosts can be classified into six types of trees, based on cladistic relationships and whether the reconstruction is consistent with the true transmission history or not. We show that the direction of transmission and whether unsampled intermediary links or common sources existed make very different predictions about expected phylogenetic relationships: (i) Direction of transmission can often be established when paraphyly exists, (ii) intermediary links can be excluded when multiple lineages were transmitted, and (iii) when the sampled individuals’ HIV populations both are monophyletic a common source was likely the origin. Inconsistent results, suggesting the wrong transmission direction, were generally rare. In addition, the expected tree topology also depends on the number of transmitted lineages, the sample size, the time of the sample relative to transmission, and how fast the diversity increases after infection. Typically, 20 or more sequences per subject give robust results. We confirm our theoretical evaluations with analyses of real transmission histories and discuss how our findings should aid in interpreting phylogenetic results. PMID:26903617

  13. Species trees for the tree swallows (Genus Tachycineta): an alternative phylogenetic hypothesis to the mitochondrial gene tree.

    PubMed

    Dor, Roi; Carling, Matthew D; Lovette, Irby J; Sheldon, Frederick H; Winkler, David W

    2012-10-01

    The New World swallow genus Tachycineta comprises nine species that collectively have a wide geographic distribution and remarkable variation both within- and among-species in ecologically important traits. Existing phylogenetic hypotheses for Tachycineta are based on mitochondrial DNA sequences, thus they provide estimates of a single gene tree. In this study we sequenced multiple individuals from each species at 16 nuclear intron loci. We used gene concatenated approaches (Bayesian and maximum likelihood) as well as coalescent-based species tree inference to reconstruct phylogenetic relationships of the genus. We examined the concordance and conflict between the nuclear and mitochondrial trees and between concatenated and coalescent-based inferences. Our results provide an alternative phylogenetic hypothesis to the existing mitochondrial DNA estimate of phylogeny. This new hypothesis provides a more accurate framework in which to explore trait evolution and examine the evolution of the mitochondrial genome in this group. Copyright © 2012 Elsevier Inc. All rights reserved.

  14. SUNPLIN: simulation with uncertainty for phylogenetic investigations.

    PubMed

    Martins, Wellington S; Carmo, Welton C; Longo, Humberto J; Rosa, Thierson C; Rangel, Thiago F

    2013-11-15

    Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets.

  15. Language impairment in a case of a complex chromosomal rearrangement with a breakpoint downstream of FOXP2.

    PubMed

    Moralli, Daniela; Nudel, Ron; Chan, May T M; Green, Catherine M; Volpi, Emanuela V; Benítez-Burraco, Antonio; Newbury, Dianne F; García-Bellido, Paloma

    2015-01-01

    We report on a young female, who presents with a severe speech and language disorder and a balanced de novo complex chromosomal rearrangement, likely to have resulted from a chromosome 7 pericentromeric inversion, followed by a chromosome 7 and 11 translocation. Using molecular cytogenetics, we mapped the four breakpoints to 7p21.1-15.3 (chromosome position: 20,954,043-21,001,537, hg19), 7q31 (chromosome position: 114,528,369-114,556,605, hg19), 7q21.3 (chromosome position: 93,884,065-93,933,453, hg19) and 11p12 (chromosome position: 38,601,145-38,621,572, hg19). These regions contain only non-coding transcripts (ENSG00000232790 on 7p21.1 and TCONS_00013886, TCONS_00013887, TCONS_00014353, TCONS_00013888 on 7q21) indicating that no coding sequences are directly disrupted. The breakpoint on 7q31 mapped 200 kb downstream of FOXP2, a well-known language gene. No splice site or non-synonymous coding variants were found in the FOXP2 coding sequence. We were unable to detect any changes in the expression level of FOXP2 in fibroblast cells derived from the proband, although this may be the result of the low expression level of FOXP2 in these cells. We conclude that the phenotype observed in this patient either arises from a subtle change in FOXP2 regulation due to the disruption of a downstream element controlling its expression, or from the direct disruption of non-coding RNAs.

  16. Cloning a balanced t(9;11)(p24;q23.1) chromosomal translocation breakpoint segregating with bipolar affective disorder in a small pedigree

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Duggan, D.J.; Baysal, B.E.; Gollin, S.M.

    A small multigenerational pedigree was previously identified in which a balanced 9;11 chromosomal translocation was cosegregating with bipolar affective disorder. We hypothesize that genes or gene regulatory sequences disrupted by the translocation are contributing to bipolar affective disorder in a dominant fashion. The general strategy involves (1) using somatic cell hybrids containing the derivative 9 or 11 chromosomes to identify the closest chromosome 9 and 11 flanking markers, (2) using the nearest markers as PCR and hybridization probes to isolate both normal DNA (YAC) and patient DNA (cosmid) adjacent to and incorporating the translocation breakpoint, and (3) identifying expressed sequencesmore » in the genomic DNA that may be disrupted by the translocation. From a fusion of the translocation patient cell line and a recipient hamster cell line, somatic cell hybrids were isolated which contain either the human derivative 9 or derivative 11 chromosome. Using PCR-based STS assays with these hybrids, the location of the translocation breakpoint was localized to an estimated 500 kb region at chromosome 11 band q23.1 and a 1 cM region in 9 band p24 (more telomeric than originally reported). From a large set of CEPH and Roswell Park yeast artificial chromosomes (YACs), six chromosome 11 YACs spanning the 11q23.1 breakpoint have now been identified. A combination of pulsed field gel eletrophoresis and YAC mapping has narrowed the chromosome 11 region to less than 430 kb. Current efforts are focused on generating new chromosome 11 probes within the flanking markers, mapping these probes back to the der(9) and der(11) containing hybrids and the chromosome 11 YAC mapping panel. As the region is physically narrowed, we will identify candidate genes whose expression may be altered by this t(9:11) translocation.« less

  17. On Tree-Based Phylogenetic Networks.

    PubMed

    Zhang, Louxin

    2016-07-01

    A large class of phylogenetic networks can be obtained from trees by the addition of horizontal edges between the tree edges. These networks are called tree-based networks. We present a simple necessary and sufficient condition for tree-based networks and prove that a universal tree-based network exists for any number of taxa that contains as its base every phylogenetic tree on the same set of taxa. This answers two problems posted by Francis and Steel recently. A byproduct is a computer program for generating random binary phylogenetic networks under the uniform distribution model.

  18. Long-range restriction map of human chromosome 22q11-22q12 between the lambda immunoglobulin locus and the Ewing sarcoma breakpoint

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McDermid, H.E.; Budarf, M.L.; Emanuel, B.S.

    1993-11-01

    A long-range restriction map of the region between the immunoglobulin lambda locus and the Ewing sarcoma breakpoint has been constructed using the rare-cutting enzymes NotI, NruI, AscI, and BsiWI. The map spans approximately 11,000 kb and represents about one-fifth of the long arm of chromosome 22. Thirty-nine markers, including seven NotI junction clones as well as numerous genes and anonymous sequences, were mapped to the region with a somatic cell hybrid panel. These probes were then used to produce the map. The seven NotI junction clones each identified a possible CpG island. The breakpoints of the RAJ5 hybrid and themore » Ewing sarcoma t(11;22) were also localized in the resulting map. This physical map will be useful in studying chromosomal rearrangements in the region, as well as providing the details to examine the fidelity of the YAC and cosmid contigs currently under construction. Comparisons of this physical map to genetic and radiation hybrid maps are discussed. 52 refs., 7 figs., 3 tabs.« less

  19. Functional & phylogenetic diversity of copepod communities

    NASA Astrophysics Data System (ADS)

    Benedetti, F.; Ayata, S. D.; Blanco-Bercial, L.; Cornils, A.; Guilhaumon, F.

    2016-02-01

    The diversity of natural communities is classically estimated through species identification (taxonomic diversity) but can also be estimated from the ecological functions performed by the species (functional diversity), or from the phylogenetic relationships among them (phylogenetic diversity). Estimating functional diversity requires the definition of specific functional traits, i.e., phenotypic characteristics that impact fitness and are relevant to ecosystem functioning. Estimating phylogenetic diversity requires the description of phylogenetic relationships, for instance by using molecular tools. In the present study, we focused on the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. First, we implemented a specific trait database for the most commonly-sampled and abundant copepod species of the Mediterranean Sea. Our database includes 191 species, described by seven traits encompassing diverse ecological functions: minimal and maximal body length, trophic group, feeding type, spawning strategy, diel vertical migration and vertical habitat. Clustering analysis in the functional trait space revealed that Mediterranean copepods can be gathered into groups that have different ecological roles. Second, we reconstructed a phylogenetic tree using the available sequences of 18S rRNA. Our tree included 154 of the analyzed Mediterranean copepod species. We used these two datasets to describe the functional and phylogenetic diversity of copepod surface communities in the Mediterranean Sea. The replacement component (turn-over) and the species richness difference component (nestedness) of the beta diversity indices were identified. Finally, by comparing various and complementary aspects of plankton diversity (taxonomic, functional, and phylogenetic diversity) we were able to gain a better understanding of the relationships among the zooplankton community, biodiversity, ecosystem function, and environmental forcing.

  20. treespace: Statistical exploration of landscapes of phylogenetic trees.

    PubMed

    Jombart, Thibaut; Kendall, Michelle; Almagro-Garcia, Jacob; Colijn, Caroline

    2017-11-01

    The increasing availability of large genomic data sets as well as the advent of Bayesian phylogenetics facilitates the investigation of phylogenetic incongruence, which can result in the impossibility of representing phylogenetic relationships using a single tree. While sometimes considered as a nuisance, phylogenetic incongruence can also reflect meaningful biological processes as well as relevant statistical uncertainty, both of which can yield valuable insights in evolutionary studies. We introduce a new tool for investigating phylogenetic incongruence through the exploration of phylogenetic tree landscapes. Our approach, implemented in the R package treespace, combines tree metrics and multivariate analysis to provide low-dimensional representations of the topological variability in a set of trees, which can be used for identifying clusters of similar trees and group-specific consensus phylogenies. treespace also provides a user-friendly web interface for interactive data analysis and is integrated alongside existing standards for phylogenetics. It fills a gap in the current phylogenetics toolbox in R and will facilitate the investigation of phylogenetic results. © 2017 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.

  1. SUNPLIN: Simulation with Uncertainty for Phylogenetic Investigations

    PubMed Central

    2013-01-01

    Background Phylogenetic comparative analyses usually rely on a single consensus phylogenetic tree in order to study evolutionary processes. However, most phylogenetic trees are incomplete with regard to species sampling, which may critically compromise analyses. Some approaches have been proposed to integrate non-molecular phylogenetic information into incomplete molecular phylogenies. An expanded tree approach consists of adding missing species to random locations within their clade. The information contained in the topology of the resulting expanded trees can be captured by the pairwise phylogenetic distance between species and stored in a matrix for further statistical analysis. Thus, the random expansion and processing of multiple phylogenetic trees can be used to estimate the phylogenetic uncertainty through a simulation procedure. Because of the computational burden required, unless this procedure is efficiently implemented, the analyses are of limited applicability. Results In this paper, we present efficient algorithms and implementations for randomly expanding and processing phylogenetic trees so that simulations involved in comparative phylogenetic analysis with uncertainty can be conducted in a reasonable time. We propose algorithms for both randomly expanding trees and calculating distance matrices. We made available the source code, which was written in the C++ language. The code may be used as a standalone program or as a shared object in the R system. The software can also be used as a web service through the link: http://purl.oclc.org/NET/sunplin/. Conclusion We compare our implementations to similar solutions and show that significant performance gains can be obtained. Our results open up the possibility of accounting for phylogenetic uncertainty in evolutionary and ecological analyses of large datasets. PMID:24229408

  2. Quadriceps Muscles O2 Extraction and EMG Breakpoints during a Ramp Incremental Test

    PubMed Central

    Iannetta, Danilo; Qahtani, Ahmad; Millet, Guillaume Y.; Murias, Juan M.

    2017-01-01

    Muscle deoxygenated breakpoint ([HHb]BP) has been found to be associated with other indices of exercise tolerance in the vastus lateralis (VL) muscle but not in the vastus medialis (VM) and rectus femoris (RF). Purpose: To investigate whether the [HHb]BP occurs also in the VM and RF muscles and whether or not it is associated with other physiological indices of exercise tolerance, such as the EMG threshold (EMGt) and the respiratory compensation point (RCP). Methods: Twelve young endurance trained participants performed maximal ramp incremental (RI) cycling tests (25–30 W·min−1 increments). Muscle oxygen extraction and activity as well as ventilatory and gas exchange parameters were measured. After accounting for the mean response time, the oxygen uptake (V·O2) corresponding to the RCP, [HHb]BP, and the EMGt was determined. Results: Peak power output (POpeak) was 359 ± 48 W. Maximal oxygen consumption (V·O2max) was 3.87 ± 0.46 L·min−1. The V·O2 at the RCP was 3.39 ± 0.41 L·min−1. The V·O2 (L·min−1) corresponding to the [HHb]BP and EMGt were: 3.49 ± 0.46 and 3.40 ± 0.44; 3.44 ± 0.61 and 3.43 ± 0.49; 3.59 ± 0.52, and 3.48 ± 0.46 for VL, VM, and RF, respectively. Pearson's correlation between these thresholds ranged from 0.90 to 0.97 (P < 0.05). No difference was found for the absolute V·O2 and the normalized PO (%) at which the thresholds occurred in all three muscles investigated (P > 0.05). Although in eight out of 12 participants, the [HHb]BP in the RF led to a steeper increase instead of leading to a plateau-like response as observed in the VL and VM, the V·O2 at the breakpoints still coincided with that at the RCP. Conclusions: This study demonstrated that local indices of exercise tolerance derived from different portions of the quadriceps are not different to the systemic index of the RCP. PMID:28970805

  3. Phylogenetically resolving epidemiologic linkage

    DOE PAGES

    Romero-Severson, Ethan O.; Bulla, Ingo; Leitner, Thomas

    2016-02-22

    The use of phylogenetic trees in epidemiological investigations has become commonplace, but their epidemiological interpretation has not been systematically evaluated. Here, we use an HIV-1 within-host coalescent model to probabilistically evaluate transmission histories of two epidemiologically linked hosts. Previous critique of phylogenetic reconstruction has claimed that direction of transmission is difficult to infer, and that the existence of unsampled intermediary links or common sources can never be excluded. The phylogenetic relationship between the HIV populations of epidemiologically linked hosts can be classified into six types of trees, based on cladistic relationships and whether the reconstruction is consistent with the truemore » transmission history or not. We show that the direction of transmission and whether unsampled intermediary links or common sources existed make very different predictions about expected phylogenetic relationships: (i) Direction of transmission can often be established when paraphyly exists, (ii) intermediary links can be excluded when multiple lineages were transmitted, and (iii) when the sampled individuals’ HIV populations both are monophyletic a common source was likely the origin. Inconsistent results, suggesting the wrong transmission direction, were generally rare. In addition, the expected tree topology also depends on the number of transmitted lineages, the sample size, the time of the sample relative to transmission, and how fast the diversity increases after infection. Typically, 20 or more sequences per subject give robust results. Moreover, we confirm our theoretical evaluations with analyses of real transmission histories and discuss how our findings should aid in interpreting phylogenetic results.« less

  4. Spatial patterns of phylogenetic diversity.

    PubMed

    Morlon, Hélène; Schwilk, Dylan W; Bryant, Jessica A; Marquet, Pablo A; Rebelo, Anthony G; Tauss, Catherine; Bohannan, Brendan J M; Green, Jessica L

    2011-02-01

    Ecologists and conservation biologists have historically used species-area and distance-decay relationships as tools to predict the spatial distribution of biodiversity and the impact of habitat loss on biodiversity. These tools treat each species as evolutionarily equivalent, yet the importance of species' evolutionary history in their ecology and conservation is becoming increasingly evident. Here, we provide theoretical predictions for phylogenetic analogues of the species-area and distance-decay relationships. We use a random model of community assembly and a spatially explicit flora dataset collected in four Mediterranean-type regions to provide theoretical predictions for the increase in phylogenetic diversity - the total phylogenetic branch-length separating a set of species - with increasing area and the decay in phylogenetic similarity with geographic separation. These developments may ultimately provide insights into the evolution and assembly of biological communities, and guide the selection of protected areas. © 2010 Blackwell Publishing Ltd/CNRS.

  5. Phylogenetic overdispersion of plant species in southern Brazilian savannas.

    PubMed

    Silva, I A; Batalha, M A

    2009-08-01

    Ecological communities are the result of not only present ecological processes, such as competition among species and environmental filtering, but also past and continuing evolutionary processes. Based on these assumptions, we may infer mechanisms of contemporary coexistence from the phylogenetic relationships of the species in a community. We studied the phylogenetic structure of plant communities in four cerrado sites, in southeastern Brazil. We calculated two raw phylogenetic distances among the species sampled. We estimated the phylogenetic structure by comparing the observed phylogenetic distances to the distribution of phylogenetic distances in null communities. We obtained null communities by randomizing the phylogenetic relationships of the regional pool of species. We found a phylogenetic overdispersion of the cerrado species. Phylogenetic overdispersion has several explanations, depending on the phylogenetic history of traits and contemporary ecological interactions. However, based on coexistence models between grasses and trees, density-dependent ecological forces, and the evolutionary history of the cerrado flora, we argue that the phylogenetic overdispersion of cerrado species is predominantly due to competitive interactions, herbivores and pathogen attacks, and ecological speciation. Future studies will need to include information on the phylogenetic history of plant traits.

  6. Translocation breakpoint at 7q31 associated with tics: further evidence for IMMP2L as a candidate gene for Tourette syndrome

    PubMed Central

    Patel, Chirag; Cooper-Charles, Lisa; McMullan, Dominic J; Walker, Judith M; Davison, Val; Morton, Jenny

    2011-01-01

    Gilles de la Tourette syndrome is a complex neuropsychiatric disorder with a strong genetic basis. We identified a male patient with Tourette syndrome-like tics and an apparently balanced de novo translocation [46,XY,t(2;7)(p24.2;q31)]. Further analysis using array comparative genomic hybridisation (CGH) revealed a cryptic deletion at 7q31.1–7q31.2. Breakpoints disrupting this region have been reported in one isolated and one familial case of Tourette syndrome. In our case, IMMP2L, a gene coding for a human homologue of the yeast inner mitochondrial membrane peptidase subunit 2, was disrupted by the breakpoint on 7q31.1, with deletion of exons 1–3 of the gene. The IMMP2L gene has previously been proposed as a candidate gene for Tourette syndrome, and our case provides further evidence of its possible role in the pathogenesis. The deleted region (7q31.1–7q31.2) of 7.2 Mb of genomic DNA also encompasses numerous genes, including FOXP2, associated with verbal dyspraxia, and the CFTR gene. PMID:21386874

  7. Phylogenetic Framework and Molecular Signatures for the Main Clades of the Phylum Actinobacteria

    PubMed Central

    Gao, Beile

    2012-01-01

    Summary: The phylum Actinobacteria harbors many important human pathogens and also provides one of the richest sources of natural products, including numerous antibiotics and other compounds of biotechnological interest. Thus, a reliable phylogeny of this large phylum and the means to accurately identify its different constituent groups are of much interest. Detailed phylogenetic and comparative analyses of >150 actinobacterial genomes reported here form the basis for achieving these objectives. In phylogenetic trees based upon 35 conserved proteins, most of the main groups of Actinobacteria as well as a number of their superageneric clades are resolved. We also describe large numbers of molecular markers consisting of conserved signature indels in protein sequences and whole proteins that are specific for either all Actinobacteria or their different clades (viz., orders, families, genera, and subgenera) at various taxonomic levels. These signatures independently support the existence of different phylogenetic clades, and based upon them, it is now possible to delimit the phylum Actinobacteria (excluding Coriobacteriia) and most of its major groups in clear molecular terms. The species distribution patterns of these markers also provide important information regarding the interrelationships among different main orders of Actinobacteria. The identified molecular markers, in addition to enabling the development of a stable and reliable phylogenetic framework for this phylum, also provide novel and powerful means for the identification of different groups of Actinobacteria in diverse environments. Genetic and biochemical studies on these Actinobacteria-specific markers should lead to the discovery of novel biochemical and/or other properties that are unique to different groups of Actinobacteria. PMID:22390973

  8. Different relationships between temporal phylogenetic turnover and phylogenetic similarity and in two forests were detected by a new null model.

    PubMed

    Huang, Jian-Xiong; Zhang, Jian; Shen, Yong; Lian, Ju-yu; Cao, Hong-lin; Ye, Wan-hui; Wu, Lin-fang; Bin, Yue

    2014-01-01

    Ecologists have been monitoring community dynamics with the purpose of understanding the rates and causes of community change. However, there is a lack of monitoring of community dynamics from the perspective of phylogeny. We attempted to understand temporal phylogenetic turnover in a 50 ha tropical forest (Barro Colorado Island, BCI) and a 20 ha subtropical forest (Dinghushan in southern China, DHS). To obtain temporal phylogenetic turnover under random conditions, two null models were used. The first shuffled names of species that are widely used in community phylogenetic analyses. The second simulated demographic processes with careful consideration on the variation in dispersal ability among species and the variations in mortality both among species and among size classes. With the two models, we tested the relationships between temporal phylogenetic turnover and phylogenetic similarity at different spatial scales in the two forests. Results were more consistent with previous findings using the second null model suggesting that the second null model is more appropriate for our purposes. With the second null model, a significantly positive relationship was detected between phylogenetic turnover and phylogenetic similarity in BCI at a 10 m×10 m scale, potentially indicating phylogenetic density dependence. This relationship in DHS was significantly negative at three of five spatial scales. This could indicate abiotic filtering processes for community assembly. Using variation partitioning, we found phylogenetic similarity contributed to variation in temporal phylogenetic turnover in the DHS plot but not in BCI plot. The mechanisms for community assembly in BCI and DHS vary from phylogenetic perspective. Only the second null model detected this difference indicating the importance of choosing a proper null model.

  9. Rooting phylogenetic trees under the coalescent model using site pattern probabilities.

    PubMed

    Tian, Yuan; Kubatko, Laura

    2017-12-19

    Phylogenetic tree inference is a fundamental tool to estimate ancestor-descendant relationships among different species. In phylogenetic studies, identification of the root - the most recent common ancestor of all sampled organisms - is essential for complete understanding of the evolutionary relationships. Rooted trees benefit most downstream application of phylogenies such as species classification or study of adaptation. Often, trees can be rooted by using outgroups, which are species that are known to be more distantly related to the sampled organisms than any other species in the phylogeny. However, outgroups are not always available in evolutionary research. In this study, we develop a new method for rooting species tree under the coalescent model, by developing a series of hypothesis tests for rooting quartet phylogenies using site pattern probabilities. The power of this method is examined by simulation studies and by application to an empirical North American rattlesnake data set. The method shows high accuracy across the simulation conditions considered, and performs well for the rattlesnake data. Thus, it provides a computationally efficient way to accurately root species-level phylogenies that incorporates the coalescent process. The method is robust to variation in substitution model, but is sensitive to the assumption of a molecular clock. Our study establishes a computationally practical method for rooting species trees that is more efficient than traditional methods. The method will benefit numerous evolutionary studies that require rooting a phylogenetic tree without having to specify outgroups.

  10. Undergraduate Students’ Difficulties in Reading and Constructing Phylogenetic Tree

    NASA Astrophysics Data System (ADS)

    Sa'adah, S.; Tapilouw, F. S.; Hidayat, T.

    2017-02-01

    Representation is a very important communication tool to communicate scientific concepts. Biologists produce phylogenetic representation to express their understanding of evolutionary relationships. The phylogenetic tree is visual representation depict a hypothesis about the evolutionary relationship and widely used in the biological sciences. Phylogenetic tree currently growing for many disciplines in biology. Consequently, learning about phylogenetic tree become an important part of biological education and an interesting area for biology education research. However, research showed many students often struggle with interpreting the information that phylogenetic trees depict. The purpose of this study was to investigate undergraduate students’ difficulties in reading and constructing a phylogenetic tree. The method of this study is a descriptive method. In this study, we used questionnaires, interviews, multiple choice and open-ended questions, reflective journals and observations. The findings showed students experiencing difficulties, especially in constructing a phylogenetic tree. The students’ responds indicated that main reasons for difficulties in constructing a phylogenetic tree are difficult to placing taxa in a phylogenetic tree based on the data provided so that the phylogenetic tree constructed does not describe the actual evolutionary relationship (incorrect relatedness). Students also have difficulties in determining the sister group, character synapomorphy, autapomorphy from data provided (character table) and comparing among phylogenetic tree. According to them building the phylogenetic tree is more difficult than reading the phylogenetic tree. Finding this studies provide information to undergraduate instructor and students to overcome learning difficulties of reading and constructing phylogenetic tree.

  11. Constructing phylogenetic trees using interacting pathways.

    PubMed

    Wan, Peng; Che, Dongsheng

    2013-01-01

    Phylogenetic trees are used to represent evolutionary relationships among biological species or organisms. The construction of phylogenetic trees is based on the similarities or differences of their physical or genetic features. Traditional approaches of constructing phylogenetic trees mainly focus on physical features. The recent advancement of high-throughput technologies has led to accumulation of huge amounts of biological data, which in turn changed the way of biological studies in various aspects. In this paper, we report our approach of building phylogenetic trees using the information of interacting pathways. We have applied hierarchical clustering on two domains of organisms-eukaryotes and prokaryotes. Our preliminary results have shown the effectiveness of using the interacting pathways in revealing evolutionary relationships.

  12. The Evolutionary Ecology of Plant Disease: A Phylogenetic Perspective.

    PubMed

    Gilbert, Gregory S; Parker, Ingrid M

    2016-08-04

    An explicit phylogenetic perspective provides useful tools for phytopathology and plant disease ecology because the traits of both plants and microbes are shaped by their evolutionary histories. We present brief primers on phylogenetic signal and the analytical tools of phylogenetic ecology. We review the literature and find abundant evidence of phylogenetic signal in pathogens and plants for most traits involved in disease interactions. Plant nonhost resistance mechanisms and pathogen housekeeping functions are conserved at deeper phylogenetic levels, whereas molecular traits associated with rapid coevolutionary dynamics are more labile at branch tips. Horizontal gene transfer disrupts the phylogenetic signal for some microbial traits. Emergent traits, such as host range and disease severity, show clear phylogenetic signals. Therefore pathogen spread and disease impact are influenced by the phylogenetic structure of host assemblages. Phylogenetically rare species escape disease pressure. Phylogenetic tools could be used to develop predictive tools for phytosanitary risk analysis and reduce disease pressure in multispecies cropping systems.

  13. Inferring Phylogenetic Networks Using PhyloNet.

    PubMed

    Wen, Dingqiao; Yu, Yun; Zhu, Jiafan; Nakhleh, Luay

    2018-07-01

    PhyloNet was released in 2008 as a software package for representing and analyzing phylogenetic networks. At the time of its release, the main functionalities in PhyloNet consisted of measures for comparing network topologies and a single heuristic for reconciling gene trees with a species tree. Since then, PhyloNet has grown significantly. The software package now includes a wide array of methods for inferring phylogenetic networks from data sets of unlinked loci while accounting for both reticulation (e.g., hybridization) and incomplete lineage sorting. In particular, PhyloNet now allows for maximum parsimony, maximum likelihood, and Bayesian inference of phylogenetic networks from gene tree estimates. Furthermore, Bayesian inference directly from sequence data (sequence alignments or biallelic markers) is implemented. Maximum parsimony is based on an extension of the "minimizing deep coalescences" criterion to phylogenetic networks, whereas maximum likelihood and Bayesian inference are based on the multispecies network coalescent. All methods allow for multiple individuals per species. As computing the likelihood of a phylogenetic network is computationally hard, PhyloNet allows for evaluation and inference of networks using a pseudolikelihood measure. PhyloNet summarizes the results of the various analyzes and generates phylogenetic networks in the extended Newick format that is readily viewable by existing visualization software.

  14. Phylogenetic classification of bony fishes.

    PubMed

    Betancur-R, Ricardo; Wiley, Edward O; Arratia, Gloria; Acero, Arturo; Bailly, Nicolas; Miya, Masaki; Lecointre, Guillaume; Ortí, Guillermo

    2017-07-06

    Fish classifications, as those of most other taxonomic groups, are being transformed drastically as new molecular phylogenies provide support for natural groups that were unanticipated by previous studies. A brief review of the main criteria used by ichthyologists to define their classifications during the last 50 years, however, reveals slow progress towards using an explicit phylogenetic framework. Instead, the trend has been to rely, in varying degrees, on deep-rooted anatomical concepts and authority, often mixing taxa with explicit phylogenetic support with arbitrary groupings. Two leading sources in ichthyology frequently used for fish classifications (JS Nelson's volumes of Fishes of the World and W. Eschmeyer's Catalog of Fishes) fail to adopt a global phylogenetic framework despite much recent progress made towards the resolution of the fish Tree of Life. The first explicit phylogenetic classification of bony fishes was published in 2013, based on a comprehensive molecular phylogeny ( www.deepfin.org ). We here update the first version of that classification by incorporating the most recent phylogenetic results. The updated classification presented here is based on phylogenies inferred using molecular and genomic data for nearly 2000 fishes. A total of 72 orders (and 79 suborders) are recognized in this version, compared with 66 orders in version 1. The phylogeny resolves placement of 410 families, or ~80% of the total of 514 families of bony fishes currently recognized. The ordinal status of 30 percomorph families included in this study, however, remains uncertain (incertae sedis in the series Carangaria, Ovalentaria, or Eupercaria). Comments to support taxonomic decisions and comparisons with conflicting taxonomic groups proposed by others are presented. We also highlight cases were morphological support exist for the groups being classified. This version of the phylogenetic classification of bony fishes is substantially improved, providing resolution

  15. The space of ultrametric phylogenetic trees.

    PubMed

    Gavryushkin, Alex; Drummond, Alexei J

    2016-08-21

    The reliability of a phylogenetic inference method from genomic sequence data is ensured by its statistical consistency. Bayesian inference methods produce a sample of phylogenetic trees from the posterior distribution given sequence data. Hence the question of statistical consistency of such methods is equivalent to the consistency of the summary of the sample. More generally, statistical consistency is ensured by the tree space used to analyse the sample. In this paper, we consider two standard parameterisations of phylogenetic time-trees used in evolutionary models: inter-coalescent interval lengths and absolute times of divergence events. For each of these parameterisations we introduce a natural metric space on ultrametric phylogenetic trees. We compare the introduced spaces with existing models of tree space and formulate several formal requirements that a metric space on phylogenetic trees must possess in order to be a satisfactory space for statistical analysis, and justify them. We show that only a few known constructions of the space of phylogenetic trees satisfy these requirements. However, our results suggest that these basic requirements are not enough to distinguish between the two metric spaces we introduce and that the choice between metric spaces requires additional properties to be considered. Particularly, that the summary tree minimising the square distance to the trees from the sample might be different for different parameterisations. This suggests that further fundamental insight is needed into the problem of statistical consistency of phylogenetic inference methods. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  16. Detecting taxonomic and phylogenetic signals in equid cheek teeth: towards new palaeontological and archaeological proxies

    NASA Astrophysics Data System (ADS)

    Cucchi, T.; Mohaseb, A.; Peigné, S.; Debue, K.; Orlando, L.; Mashkour, M.

    2017-04-01

    The Plio-Pleistocene evolution of Equus and the subsequent domestication of horses and donkeys remains poorly understood, due to the lack of phenotypic markers capable of tracing this evolutionary process in the palaeontological/archaeological record. Using images from 345 specimens, encompassing 15 extant taxa of equids, we quantified the occlusal enamel folding pattern in four mandibular cheek teeth with a single geometric morphometric protocol. We initially investigated the protocol accuracy by assigning each tooth to its correct anatomical position and taxonomic group. We then contrasted the phylogenetic signal present in each tooth shape with an exome-wide phylogeny from 10 extant equine species. We estimated the strength of the phylogenetic signal using a Brownian motion model of evolution with multivariate K statistic, and mapped the dental shape along the molecular phylogeny using an approach based on squared-change parsimony. We found clear evidence for the relevance of dental phenotypes to accurately discriminate all modern members of the genus Equus and capture their phylogenetic relationships. These results are valuable for both palaeontologists and zooarchaeologists exploring the spatial and temporal dynamics of the evolutionary history of the horse family, up to the latest domestication trajectories of horses and donkeys.

  17. Detecting taxonomic and phylogenetic signals in equid cheek teeth: towards new palaeontological and archaeological proxies

    PubMed Central

    Mohaseb, A.; Peigné, S.; Debue, K.; Orlando, L.; Mashkour, M.

    2017-01-01

    The Plio–Pleistocene evolution of Equus and the subsequent domestication of horses and donkeys remains poorly understood, due to the lack of phenotypic markers capable of tracing this evolutionary process in the palaeontological/archaeological record. Using images from 345 specimens, encompassing 15 extant taxa of equids, we quantified the occlusal enamel folding pattern in four mandibular cheek teeth with a single geometric morphometric protocol. We initially investigated the protocol accuracy by assigning each tooth to its correct anatomical position and taxonomic group. We then contrasted the phylogenetic signal present in each tooth shape with an exome-wide phylogeny from 10 extant equine species. We estimated the strength of the phylogenetic signal using a Brownian motion model of evolution with multivariate K statistic, and mapped the dental shape along the molecular phylogeny using an approach based on squared-change parsimony. We found clear evidence for the relevance of dental phenotypes to accurately discriminate all modern members of the genus Equus and capture their phylogenetic relationships. These results are valuable for both palaeontologists and zooarchaeologists exploring the spatial and temporal dynamics of the evolutionary history of the horse family, up to the latest domestication trajectories of horses and donkeys. PMID:28484618

  18. PhyloTreePruner: A Phylogenetic Tree-Based Approach for Selection of Orthologous Sequences for Phylogenomics.

    PubMed

    Kocot, Kevin M; Citarella, Mathew R; Moroz, Leonid L; Halanych, Kenneth M

    2013-01-01

    Molecular phylogenetics relies on accurate identification of orthologous sequences among the taxa of interest. Most orthology inference programs available for use in phylogenomics rely on small sets of pre-defined orthologs from model organisms or phenetic approaches such as all-versus-all sequence comparisons followed by Markov graph-based clustering. Such approaches have high sensitivity but may erroneously include paralogous sequences. We developed PhyloTreePruner, a software utility that uses a phylogenetic approach to refine orthology inferences made using phenetic methods. PhyloTreePruner checks single-gene trees for evidence of paralogy and generates a new alignment for each group containing only sequences inferred to be orthologs. Importantly, PhyloTreePruner takes into account support values on the tree and avoids unnecessarily deleting sequences in cases where a weakly supported tree topology incorrectly indicates paralogy. A test of PhyloTreePruner on a dataset generated from 11 completely sequenced arthropod genomes identified 2,027 orthologous groups sampled for all taxa. Phylogenetic analysis of the concatenated supermatrix yielded a generally well-supported topology that was consistent with the current understanding of arthropod phylogeny. PhyloTreePruner is freely available from http://sourceforge.net/projects/phylotreepruner/.

  19. The origin and diversification of eukaryotes: problems with molecular phylogenetics and molecular clock estimation

    PubMed Central

    Roger, Andrew J; Hug, Laura A

    2006-01-01

    Determining the relationships among and divergence times for the major eukaryotic lineages remains one of the most important and controversial outstanding problems in evolutionary biology. The sequencing and phylogenetic analyses of ribosomal RNA (rRNA) genes led to the first nearly comprehensive phylogenies of eukaryotes in the late 1980s, and supported a view where cellular complexity was acquired during the divergence of extant unicellular eukaryote lineages. More recently, however, refinements in analytical methods coupled with the availability of many additional genes for phylogenetic analysis showed that much of the deep structure of early rRNA trees was artefactual. Recent phylogenetic analyses of a multiple genes and the discovery of important molecular and ultrastructural phylogenetic characters have resolved eukaryotic diversity into six major hypothetical groups. Yet relationships among these groups remain poorly understood because of saturation of sequence changes on the billion-year time-scale, possible rapid radiations of major lineages, phylogenetic artefacts and endosymbiotic or lateral gene transfer among eukaryotes. Estimating the divergence dates between the major eukaryote lineages using molecular analyses is even more difficult than phylogenetic estimation. Error in such analyses comes from a myriad of sources including: (i) calibration fossil dates, (ii) the assumed phylogenetic tree, (iii) the nucleotide or amino acid substitution model, (iv) substitution number (branch length) estimates, (v) the model of how rates of evolution change over the tree, (vi) error inherent in the time estimates for a given model and (vii) how multiple gene data are treated. By reanalysing datasets from recently published molecular clock studies, we show that when errors from these various sources are properly accounted for, the confidence intervals on inferred dates can be very large. Furthermore, estimated dates of divergence vary hugely depending on the methods

  20. Undergraduate Students’ Initial Ability in Understanding Phylogenetic Tree

    NASA Astrophysics Data System (ADS)

    Sa'adah, S.; Hidayat, T.; Sudargo, Fransisca

    2017-04-01

    The Phylogenetic tree is a visual representation depicts a hypothesis about the evolutionary relationship among taxa. Evolutionary experts use this representation to evaluate the evidence for evolution. The phylogenetic tree is currently growing for many disciplines in biology. Consequently, learning about the phylogenetic tree has become an important part of biological education and an interesting area of biology education research. Skill to understanding and reasoning of the phylogenetic tree, (called tree thinking) is an important skill for biology students. However, research showed many students have difficulty in interpreting, constructing, and comparing among the phylogenetic tree, as well as experiencing a misconception in the understanding of the phylogenetic tree. Students are often not taught how to reason about evolutionary relationship depicted in the diagram. Students are also not provided with information about the underlying theory and process of phylogenetic. This study aims to investigate the initial ability of undergraduate students in understanding and reasoning of the phylogenetic tree. The research method is the descriptive method. Students are given multiple choice questions and an essay that representative by tree thinking elements. Each correct answer made percentages. Each student is also given questionnaires. The results showed that the undergraduate students’ initial ability in understanding and reasoning phylogenetic tree is low. Many students are not able to answer questions about the phylogenetic tree. Only 19 % undergraduate student who answered correctly on indicator evaluate the evolutionary relationship among taxa, 25% undergraduate student who answered correctly on indicator applying concepts of the clade, 17% undergraduate student who answered correctly on indicator determines the character evolution, and only a few undergraduate student who can construct the phylogenetic tree.

  1. Improved Maximum Parsimony Models for Phylogenetic Networks.

    PubMed

    Van Iersel, Leo; Jones, Mark; Scornavacca, Celine

    2018-05-01

    Phylogenetic networks are well suited to represent evolutionary histories comprising reticulate evolution. Several methods aiming at reconstructing explicit phylogenetic networks have been developed in the last two decades. In this article, we propose a new definition of maximum parsimony for phylogenetic networks that permits to model biological scenarios that cannot be modeled by the definitions currently present in the literature (namely, the "hardwired" and "softwired" parsimony). Building on this new definition, we provide several algorithmic results that lay the foundations for new parsimony-based methods for phylogenetic network reconstruction.

  2. Genomic Repeat Abundances Contain Phylogenetic Signal

    PubMed Central

    Dodsworth, Steven; Chase, Mark W.; Kelly, Laura J.; Leitch, Ilia J.; Macas, Jiří; Novák, Petr; Piednoël, Mathieu; Weiss-Schneeweiss, Hanna; Leitch, Andrew R.

    2015-01-01

    A large proportion of genomic information, particularly repetitive elements, is usually ignored when researchers are using next-generation sequencing. Here we demonstrate the usefulness of this repetitive fraction in phylogenetic analyses, utilizing comparative graph-based clustering of next-generation sequence reads, which results in abundance estimates of different classes of genomic repeats. Phylogenetic trees are then inferred based on the genome-wide abundance of different repeat types treated as continuously varying characters; such repeats are scattered across chromosomes and in angiosperms can constitute a majority of nuclear genomic DNA. In six diverse examples, five angiosperms and one insect, this method provides generally well-supported relationships at interspecific and intergeneric levels that agree with results from more standard phylogenetic analyses of commonly used markers. We propose that this methodology may prove especially useful in groups where there is little genetic differentiation in standard phylogenetic markers. At the same time as providing data for phylogenetic inference, this method additionally yields a wealth of data for comparative studies of genome evolution. PMID:25261464

  3. Phylogenetic Factor Analysis.

    PubMed

    Tolkoff, Max R; Alfaro, Michael E; Baele, Guy; Lemey, Philippe; Suchard, Marc A

    2018-05-01

    Phylogenetic comparative methods explore the relationships between quantitative traits adjusting for shared evolutionary history. This adjustment often occurs through a Brownian diffusion process along the branches of the phylogeny that generates model residuals or the traits themselves. For high-dimensional traits, inferring all pair-wise correlations within the multivariate diffusion is limiting. To circumvent this problem, we propose phylogenetic factor analysis (PFA) that assumes a small unknown number of independent evolutionary factors arise along the phylogeny and these factors generate clusters of dependent traits. Set in a Bayesian framework, PFA provides measures of uncertainty on the factor number and groupings, combines both continuous and discrete traits, integrates over missing measurements and incorporates phylogenetic uncertainty with the help of molecular sequences. We develop Gibbs samplers based on dynamic programming to estimate the PFA posterior distribution, over 3-fold faster than for multivariate diffusion and a further order-of-magnitude more efficiently in the presence of latent traits. We further propose a novel marginal likelihood estimator for previously impractical models with discrete data and find that PFA also provides a better fit than multivariate diffusion in evolutionary questions in columbine flower development, placental reproduction transitions and triggerfish fin morphometry.

  4. An improved model for whole genome phylogenetic analysis by Fourier transform.

    PubMed

    Yin, Changchuan; Yau, Stephen S-T

    2015-10-07

    DNA sequence similarity comparison is one of the major steps in computational phylogenetic studies. The sequence comparison of closely related DNA sequences and genomes is usually performed by multiple sequence alignments (MSA). While the MSA method is accurate for some types of sequences, it may produce incorrect results when DNA sequences undergone rearrangements as in many bacterial and viral genomes. It is also limited by its computational complexity for comparing large volumes of data. Previously, we proposed an alignment-free method that exploits the full information contents of DNA sequences by Discrete Fourier Transform (DFT), but still with some limitations. Here, we present a significantly improved method for the similarity comparison of DNA sequences by DFT. In this method, we map DNA sequences into 2-dimensional (2D) numerical sequences and then apply DFT to transform the 2D numerical sequences into frequency domain. In the 2D mapping, the nucleotide composition of a DNA sequence is a determinant factor and the 2D mapping reduces the nucleotide composition bias in distance measure, and thus improving the similarity measure of DNA sequences. To compare the DFT power spectra of DNA sequences with different lengths, we propose an improved even scaling algorithm to extend shorter DFT power spectra to the longest length of the underlying sequences. After the DFT power spectra are evenly scaled, the spectra are in the same dimensionality of the Fourier frequency space, then the Euclidean distances of full Fourier power spectra of the DNA sequences are used as the dissimilarity metrics. The improved DFT method, with increased computational performance by 2D numerical representation, can be applicable to any DNA sequences of different length ranges. We assess the accuracy of the improved DFT similarity measure in hierarchical clustering of different DNA sequences including simulated and real datasets. The method yields accurate and reliable phylogenetic trees

  5. Probabilistic Graphical Model Representation in Phylogenetics

    PubMed Central

    Höhna, Sebastian; Heath, Tracy A.; Boussau, Bastien; Landis, Michael J.; Ronquist, Fredrik; Huelsenbeck, John P.

    2014-01-01

    Recent years have seen a rapid expansion of the model space explored in statistical phylogenetics, emphasizing the need for new approaches to statistical model representation and software development. Clear communication and representation of the chosen model is crucial for: (i) reproducibility of an analysis, (ii) model development, and (iii) software design. Moreover, a unified, clear and understandable framework for model representation lowers the barrier for beginners and nonspecialists to grasp complex phylogenetic models, including their assumptions and parameter/variable dependencies. Graphical modeling is a unifying framework that has gained in popularity in the statistical literature in recent years. The core idea is to break complex models into conditionally independent distributions. The strength lies in the comprehensibility, flexibility, and adaptability of this formalism, and the large body of computational work based on it. Graphical models are well-suited to teach statistical models, to facilitate communication among phylogeneticists and in the development of generic software for simulation and statistical inference. Here, we provide an introduction to graphical models for phylogeneticists and extend the standard graphical model representation to the realm of phylogenetics. We introduce a new graphical model component, tree plates, to capture the changing structure of the subgraph corresponding to a phylogenetic tree. We describe a range of phylogenetic models using the graphical model framework and introduce modules to simplify the representation of standard components in large and complex models. Phylogenetic model graphs can be readily used in simulation, maximum likelihood inference, and Bayesian inference using, for example, Metropolis–Hastings or Gibbs sampling of the posterior distribution. [Computation; graphical models; inference; modularization; statistical phylogenetics; tree plate.] PMID:24951559

  6. Fast and accurate phylogeny reconstruction using filtered spaced-word matches

    PubMed Central

    Sohrabi-Jahromi, Salma; Morgenstern, Burkhard

    2017-01-01

    Abstract Motivation: Word-based or ‘alignment-free’ algorithms are increasingly used for phylogeny reconstruction and genome comparison, since they are much faster than traditional approaches that are based on full sequence alignments. Existing alignment-free programs, however, are less accurate than alignment-based methods. Results: We propose Filtered Spaced Word Matches (FSWM), a fast alignment-free approach to estimate phylogenetic distances between large genomic sequences. For a pre-defined binary pattern of match and don’t-care positions, FSWM rapidly identifies spaced word-matches between input sequences, i.e. gap-free local alignments with matching nucleotides at the match positions and with mismatches allowed at the don’t-care positions. We then estimate the number of nucleotide substitutions per site by considering the nucleotides aligned at the don’t-care positions of the identified spaced-word matches. To reduce the noise from spurious random matches, we use a filtering procedure where we discard all spaced-word matches for which the overall similarity between the aligned segments is below a threshold. We show that our approach can accurately estimate substitution frequencies even for distantly related sequences that cannot be analyzed with existing alignment-free methods; phylogenetic trees constructed with FSWM distances are of high quality. A program run on a pair of eukaryotic genomes of a few hundred Mb each takes a few minutes. Availability and Implementation: The program source code for FSWM including a documentation, as well as the software that we used to generate artificial genome sequences are freely available at http://fswm.gobics.de/ Contact: chris.leimeister@stud.uni-goettingen.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28073754

  7. Fast and accurate phylogeny reconstruction using filtered spaced-word matches.

    PubMed

    Leimeister, Chris-André; Sohrabi-Jahromi, Salma; Morgenstern, Burkhard

    2017-04-01

    Word-based or 'alignment-free' algorithms are increasingly used for phylogeny reconstruction and genome comparison, since they are much faster than traditional approaches that are based on full sequence alignments. Existing alignment-free programs, however, are less accurate than alignment-based methods. We propose Filtered Spaced Word Matches (FSWM) , a fast alignment-free approach to estimate phylogenetic distances between large genomic sequences. For a pre-defined binary pattern of match and don't-care positions, FSWM rapidly identifies spaced word-matches between input sequences, i.e. gap-free local alignments with matching nucleotides at the match positions and with mismatches allowed at the don't-care positions. We then estimate the number of nucleotide substitutions per site by considering the nucleotides aligned at the don't-care positions of the identified spaced-word matches. To reduce the noise from spurious random matches, we use a filtering procedure where we discard all spaced-word matches for which the overall similarity between the aligned segments is below a threshold. We show that our approach can accurately estimate substitution frequencies even for distantly related sequences that cannot be analyzed with existing alignment-free methods; phylogenetic trees constructed with FSWM distances are of high quality. A program run on a pair of eukaryotic genomes of a few hundred Mb each takes a few minutes. The program source code for FSWM including a documentation, as well as the software that we used to generate artificial genome sequences are freely available at http://fswm.gobics.de/. chris.leimeister@stud.uni-goettingen.de. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.

  8. Evaluation of CLSI M44-A2 Disk Diffusion and Associated Breakpoint Testing of Caspofungin and Micafungin Using a Well-Characterized Panel of Wild-Type and fks Hot Spot Mutant Candida Isolates▿

    PubMed Central

    Arendrup, Maiken Cavling; Park, Steven; Brown, Steven; Pfaller, Michael; Perlin, David S.

    2011-01-01

    Disk diffusion testing has recently been standardized by the CLSI, and susceptibility breakpoints have been established for several antifungal compounds. For caspofungin, 5-μg disks are approved, and for micafungin, 10-μg disks are under evaluation. We evaluated the performances of caspofungin and micafungin disk testing using a panel of Candida isolates with and without known FKS echinocandin resistance mechanisms. Disk diffusion and microdilution assays were performed strictly according to CLSI documents M44-A2 and M27-A3. Eighty-nine clinical Candida isolates were included: Candida albicans (20 isolates/10 mutants), C. glabrata (19 isolates/10 mutants), C. dubliniensis (2 isolates/1 mutant), C. krusei (16 isolates/3 mutants), C. parapsilosis (14 isolates/0 mutants), and C. tropicalis (18 isolates/4 mutants). Quality control strains were C. parapsilosis ATCC 22019 and C. krusei ATCC 6258. The correlations between zone diameters and MIC results were good for both compounds, with identical susceptibility classifications for 93.3% of the isolates by applying the current CLSI breakpoints. However, the numbers of fks hot spot mutant isolates misclassified as being susceptible (S) (very major errors [VMEs]) were high (61% for caspofungin [S, ≥11 mm] and 93% for micafungin [S, ≥14 mm]). Changing the disk diffusion breakpoint to S at ≥22 mm significantly improved the discrimination. For caspofungin, 1 VME was detected (a C. tropicalis isolate with an F76S substitution) (3.5%), and for micafungin, 10 VMEs were detected, the majority of which were for C. glabrata (8/10). The broadest separation between zone diameter ranges for wild-type (WT) and mutant isolates was seen for caspofungin (6 to 12 mm versus −4 to 7 mm). In conclusion, caspofungin disk diffusion testing with a modified breakpoint led to excellent separation between WT and mutant isolates for all Candida species. PMID:21357293

  9. Fast Construction of Near Parsimonious Hybridization Networks for Multiple Phylogenetic Trees.

    PubMed

    Mirzaei, Sajad; Wu, Yufeng

    2016-01-01

    Hybridization networks represent plausible evolutionary histories of species that are affected by reticulate evolutionary processes. An established computational problem on hybridization networks is constructing the most parsimonious hybridization network such that each of the given phylogenetic trees (called gene trees) is "displayed" in the network. There have been several previous approaches, including an exact method and several heuristics, for this NP-hard problem. However, the exact method is only applicable to a limited range of data, and heuristic methods can be less accurate and also slow sometimes. In this paper, we develop a new algorithm for constructing near parsimonious networks for multiple binary gene trees. This method is more efficient for large numbers of gene trees than previous heuristics. This new method also produces more parsimonious results on many simulated datasets as well as a real biological dataset than a previous method. We also show that our method produces topologically more accurate networks for many datasets.

  10. Multi-locus phylogenetic analysis reveals the pattern and tempo of bony fish evolution

    PubMed Central

    Broughton, Richard E.; Betancur-R., Ricardo; Li, Chenhong; Arratia, Gloria; Ortí, Guillermo

    2013-01-01

    Over half of all vertebrates are “fishes”, which exhibit enormous diversity in morphology, physiology, behavior, reproductive biology, and ecology. Investigation of fundamental areas of vertebrate biology depend critically on a robust phylogeny of fishes, yet evolutionary relationships among the major actinopterygian and sarcopterygian lineages have not been conclusively resolved. Although a consensus phylogeny of teleosts has been emerging recently, it has been based on analyses of various subsets of actinopterygian taxa, but not on a full sample of all bony fishes. Here we conducted a comprehensive phylogenetic study on a broad taxonomic sample of 61 actinopterygian and sarcopterygian lineages (with a chondrichthyan outgroup) using a molecular data set of 21 independent loci. These data yielded a resolved phylogenetic hypothesis for extant Osteichthyes, including 1) reciprocally monophyletic Sarcopterygii and Actinopterygii, as currently understood, with polypteriforms as the first diverging lineage within Actinopterygii; 2) a monophyletic group containing gars and bowfin (= Holostei) as sister group to teleosts; and 3) the earliest diverging lineage among teleosts being Elopomorpha, rather than Osteoglossomorpha. Relaxed-clock dating analysis employing a set of 24 newly applied fossil calibrations reveals divergence times that are more consistent with paleontological estimates than previous studies. Establishing a new phylogenetic pattern with accurate divergence dates for bony fishes illustrates several areas where the fossil record is incomplete and provides critical new insights on diversification of this important vertebrate group. PMID:23788273

  11. Rearrangement moves on rooted phylogenetic networks

    PubMed Central

    Gambette, Philippe; van Iersel, Leo; Jones, Mark; Scornavacca, Celine

    2017-01-01

    Phylogenetic tree reconstruction is usually done by local search heuristics that explore the space of the possible tree topologies via simple rearrangements of their structure. Tree rearrangement heuristics have been used in combination with practically all optimization criteria in use, from maximum likelihood and parsimony to distance-based principles, and in a Bayesian context. Their basic components are rearrangement moves that specify all possible ways of generating alternative phylogenies from a given one, and whose fundamental property is to be able to transform, by repeated application, any phylogeny into any other phylogeny. Despite their long tradition in tree-based phylogenetics, very little research has gone into studying similar rearrangement operations for phylogenetic network—that is, phylogenies explicitly representing scenarios that include reticulate events such as hybridization, horizontal gene transfer, population admixture, and recombination. To fill this gap, we propose “horizontal” moves that ensure that every network of a certain complexity can be reached from any other network of the same complexity, and “vertical” moves that ensure reachability between networks of different complexities. When applied to phylogenetic trees, our horizontal moves—named rNNI and rSPR—reduce to the best-known moves on rooted phylogenetic trees, nearest-neighbor interchange and rooted subtree pruning and regrafting. Besides a number of reachability results—separating the contributions of horizontal and vertical moves—we prove that rNNI moves are local versions of rSPR moves, and provide bounds on the sizes of the rNNI neighborhoods. The paper focuses on the most biologically meaningful versions of phylogenetic networks, where edges are oriented and reticulation events clearly identified. Moreover, our rearrangement moves are robust to the fact that networks with higher complexity usually allow a better fit with the data. Our goal is to provide a

  12. Fourier transform inequalities for phylogenetic trees.

    PubMed

    Matsen, Frederick A

    2009-01-01

    Phylogenetic invariants are not the only constraints on site-pattern frequency vectors for phylogenetic trees. A mutation matrix, by its definition, is the exponential of a matrix with non-negative off-diagonal entries; this positivity requirement implies non-trivial constraints on the site-pattern frequency vectors. We call these additional constraints "edge-parameter inequalities". In this paper, we first motivate the edge-parameter inequalities by considering a pathological site-pattern frequency vector corresponding to a quartet tree with a negative internal edge. This site-pattern frequency vector nevertheless satisfies all of the constraints described up to now in the literature. We next describe two complete sets of edge-parameter inequalities for the group-based models; these constraints are square-free monomial inequalities in the Fourier transformed coordinates. These inequalities, along with the phylogenetic invariants, form a complete description of the set of site-pattern frequency vectors corresponding to bona fide trees. Said in mathematical language, this paper explicitly presents two finite lists of inequalities in Fourier coordinates of the form "monomial < or = 1", each list characterizing the phylogenetically relevant semialgebraic subsets of the phylogenetic varieties.

  13. Phylogenetic search through partial tree mixing

    PubMed Central

    2012-01-01

    Background Recent advances in sequencing technology have created large data sets upon which phylogenetic inference can be performed. Current research is limited by the prohibitive time necessary to perform tree search on a reasonable number of individuals. This research develops new phylogenetic algorithms that can operate on tens of thousands of species in a reasonable amount of time through several innovative search techniques. Results When compared to popular phylogenetic search algorithms, better trees are found much more quickly for large data sets. These algorithms are incorporated in the PSODA application available at http://dna.cs.byu.edu/psoda Conclusions The use of Partial Tree Mixing in a partition based tree space allows the algorithm to quickly converge on near optimal tree regions. These regions can then be searched in a methodical way to determine the overall optimal phylogenetic solution. PMID:23320449

  14. How does cognition evolve? Phylogenetic comparative psychology

    PubMed Central

    Matthews, Luke J.; Hare, Brian A.; Nunn, Charles L.; Anderson, Rindy C.; Aureli, Filippo; Brannon, Elizabeth M.; Call, Josep; Drea, Christine M.; Emery, Nathan J.; Haun, Daniel B. M.; Herrmann, Esther; Jacobs, Lucia F.; Platt, Michael L.; Rosati, Alexandra G.; Sandel, Aaron A.; Schroepfer, Kara K.; Seed, Amanda M.; Tan, Jingzhi; van Schaik, Carel P.; Wobber, Victoria

    2014-01-01

    Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution. PMID:21927850

  15. Phylogenetic diversity measures based on Hill numbers.

    PubMed

    Chao, Anne; Chiu, Chun-Huo; Jost, Lou

    2010-11-27

    We propose a parametric class of phylogenetic diversity (PD) measures that are sensitive to both species abundance and species taxonomic or phylogenetic distances. This work extends the conventional parametric species-neutral approach (based on 'effective number of species' or Hill numbers) to take into account species relatedness, and also generalizes the traditional phylogenetic approach (based on 'total phylogenetic length') to incorporate species abundances. The proposed measure quantifies 'the mean effective number of species' over any time interval of interest, or the 'effective number of maximally distinct lineages' over that time interval. The product of the measure and the interval length quantifies the 'branch diversity' of the phylogenetic tree during that interval. The new measures generalize and unify many existing measures and lead to a natural definition of taxonomic diversity as a special case. The replication principle (or doubling property), an important requirement for species-neutral diversity, is generalized to PD. The widely used Rao's quadratic entropy and the phylogenetic entropy do not satisfy this essential property, but a simple transformation converts each to our measures, which do satisfy the property. The proposed approach is applied to forest data for interpreting the effects of thinning.

  16. How does cognition evolve? Phylogenetic comparative psychology.

    PubMed

    MacLean, Evan L; Matthews, Luke J; Hare, Brian A; Nunn, Charles L; Anderson, Rindy C; Aureli, Filippo; Brannon, Elizabeth M; Call, Josep; Drea, Christine M; Emery, Nathan J; Haun, Daniel B M; Herrmann, Esther; Jacobs, Lucia F; Platt, Michael L; Rosati, Alexandra G; Sandel, Aaron A; Schroepfer, Kara K; Seed, Amanda M; Tan, Jingzhi; van Schaik, Carel P; Wobber, Victoria

    2012-03-01

    Now more than ever animal studies have the potential to test hypotheses regarding how cognition evolves. Comparative psychologists have developed new techniques to probe the cognitive mechanisms underlying animal behavior, and they have become increasingly skillful at adapting methodologies to test multiple species. Meanwhile, evolutionary biologists have generated quantitative approaches to investigate the phylogenetic distribution and function of phenotypic traits, including cognition. In particular, phylogenetic methods can quantitatively (1) test whether specific cognitive abilities are correlated with life history (e.g., lifespan), morphology (e.g., brain size), or socio-ecological variables (e.g., social system), (2) measure how strongly phylogenetic relatedness predicts the distribution of cognitive skills across species, and (3) estimate the ancestral state of a given cognitive trait using measures of cognitive performance from extant species. Phylogenetic methods can also be used to guide the selection of species comparisons that offer the strongest tests of a priori predictions of cognitive evolutionary hypotheses (i.e., phylogenetic targeting). Here, we explain how an integration of comparative psychology and evolutionary biology will answer a host of questions regarding the phylogenetic distribution and history of cognitive traits, as well as the evolutionary processes that drove their evolution.

  17. Pharmacokinetics and Pharmacodynamics of Fluconazole for Cryptococcal Meningoencephalitis: Implications for Antifungal Therapy and In Vitro Susceptibility Breakpoints

    PubMed Central

    Sudan, Ajay; Livermore, Joanne; Howard, Susan J.; Al-Nakeeb, Zaid; Sharp, Andrew; Goodwin, Joanne; Gregson, Lea; Warn, Peter A.; Felton, Tim W.; Perfect, John R.; Harrison, Thomas S.

    2013-01-01

    Fluconazole is frequently the only antifungal agent that is available for induction therapy for cryptococcal meningitis. There is relatively little understanding of the pharmacokinetics and pharmacodynamics (PK-PD) of fluconazole in this setting. PK-PD relationships were estimated with 4 clinical isolates of Cryptococcus neoformans. MICs were determined using Clinical and Laboratory Standards Institute (CLSI) methodology. A nonimmunosuppressed murine model of cryptococcal meningitis was used. Mice received two different doses of fluconazole (125 mg/kg of body weight/day and 250 mg/kg of body weight/day) orally for 9 days; a control group of mice was not given fluconazole. Fluconazole concentrations in plasma and in the cerebrum were determined using high-performance liquid chromatography (HPLC). The cryptococcal density in the brain was estimated using quantitative cultures. A mathematical model was fitted to the PK-PD data. The experimental results were extrapolated to humans (bridging study). The PK were linear. A dose-dependent decline in fungal burden was observed, with near-maximal activity evident with dosages of 250 mg/kg/day. The MIC was important for understanding the exposure-response relationships. The mean AUC/MIC ratio associated with stasis was 389. The results of the bridging study suggested that only 66.7% of patients receiving 1,200 mg/kg would achieve or exceed an AUC/MIC ratio of 389. The potential breakpoints for fluconazole against Cryptococcus neoformans follow: susceptible, ≤2 mg/liter; resistant, >2 mg/liter. Fluconazole may be an inferior agent for induction therapy because many patients cannot achieve the pharmacodynamic target. Clinical breakpoints are likely to be significantly lower than epidemiological cutoff values. The MIC may guide the appropriate use of fluconazole. If fluconazole is the only option for induction therapy, then the highest possible dose should be used. PMID:23571544

  18. Comparative evolutionary diversity and phylogenetic structure across multiple forest dynamics plots: a mega-phylogeny approach

    PubMed Central

    Erickson, David L.; Jones, Frank A.; Swenson, Nathan G.; Pei, Nancai; Bourg, Norman A.; Chen, Wenna; Davies, Stuart J.; Ge, Xue-jun; Hao, Zhanqing; Howe, Robert W.; Huang, Chun-Lin; Larson, Andrew J.; Lum, Shawn K. Y.; Lutz, James A.; Ma, Keping; Meegaskumbura, Madhava; Mi, Xiangcheng; Parker, John D.; Fang-Sun, I.; Wright, S. Joseph; Wolf, Amy T.; Ye, W.; Xing, Dingliang; Zimmerman, Jess K.; Kress, W. John

    2014-01-01

    Forest dynamics plots, which now span longitudes, latitudes, and habitat types across the globe, offer unparalleled insights into the ecological and evolutionary processes that determine how species are assembled into communities. Understanding phylogenetic relationships among species in a community has become an important component of assessing assembly processes. However, the application of evolutionary information to questions in community ecology has been limited in large part by the lack of accurate estimates of phylogenetic relationships among individual species found within communities, and is particularly limiting in comparisons between communities. Therefore, streamlining and maximizing the information content of these community phylogenies is a priority. To test the viability and advantage of a multi-community phylogeny, we constructed a multi-plot mega-phylogeny of 1347 species of trees across 15 forest dynamics plots in the ForestGEO network using DNA barcode sequence data (rbcL, matK, and psbA-trnH) and compared community phylogenies for each individual plot with respect to support for topology and branch lengths, which affect evolutionary inference of community processes. The levels of taxonomic differentiation across the phylogeny were examined by quantifying the frequency of resolved nodes throughout. In addition, three phylogenetic distance (PD) metrics that are commonly used to infer assembly processes were estimated for each plot [PD, Mean Phylogenetic Distance (MPD), and Mean Nearest Taxon Distance (MNTD)]. Lastly, we examine the partitioning of phylogenetic diversity among community plots through quantification of inter-community MPD and MNTD. Overall, evolutionary relationships were highly resolved across the DNA barcode-based mega-phylogeny, and phylogenetic resolution for each community plot was improved when estimated within the context of the mega-phylogeny. Likewise, when compared with phylogenies for individual plots, estimates of

  19. Detecting exact breakpoints of deletions with diversity in hepatitis B viral genomic DNA from next-generation sequencing data.

    PubMed

    Cheng, Ji-Hong; Liu, Wen-Chun; Chang, Ting-Tsung; Hsieh, Sun-Yuan; Tseng, Vincent S

    2017-10-01

    Many studies have suggested that deletions of Hepatitis B Viral (HBV) are associated with the development of progressive liver diseases, even ultimately resulting in hepatocellular carcinoma (HCC). Among the methods for detecting deletions from next-generation sequencing (NGS) data, few methods considered the characteristics of virus, such as high evolution rates and high divergence among the different HBV genomes. Sequencing high divergence HBV genome sequences using the NGS technology outputs millions of reads. Thus, detecting exact breakpoints of deletions from these big and complex data incurs very high computational cost. We proposed a novel analytical method named VirDelect (Virus Deletion Detect), which uses split read alignment base to detect exact breakpoint and diversity variable to consider high divergence in single-end reads data, such that the computational cost can be reduced without losing accuracy. We use four simulated reads datasets and two real pair-end reads datasets of HBV genome sequence to verify VirDelect accuracy by score functions. The experimental results show that VirDelect outperforms the state-of-the-art method Pindel in terms of accuracy score for all simulated datasets and VirDelect had only two base errors even in real datasets. VirDelect is also shown to deliver high accuracy in analyzing the single-end read data as well as pair-end data. VirDelect can serve as an effective and efficient bioinformatics tool for physiologists with high accuracy and efficient performance and applicable to further analysis with characteristics similar to HBV on genome length and high divergence. The software program of VirDelect can be downloaded at https://sourceforge.net/projects/virdelect/. Copyright © 2017. Published by Elsevier Inc.

  20. A Comprehensive Strategy for Accurate Mutation Detection of the Highly Homologous PMS2.

    PubMed

    Li, Jianli; Dai, Hongzheng; Feng, Yanming; Tang, Jia; Chen, Stella; Tian, Xia; Gorman, Elizabeth; Schmitt, Eric S; Hansen, Terah A A; Wang, Jing; Plon, Sharon E; Zhang, Victor Wei; Wong, Lee-Jun C

    2015-09-01

    Germline mutations in the DNA mismatch repair gene PMS2 underlie the cancer susceptibility syndrome, Lynch syndrome. However, accurate molecular testing of PMS2 is complicated by a large number of highly homologous sequences. To establish a comprehensive approach for mutation detection of PMS2, we have designed a strategy combining targeted capture next-generation sequencing (NGS), multiplex ligation-dependent probe amplification, and long-range PCR followed by NGS to simultaneously detect point mutations and copy number changes of PMS2. Exonic deletions (E2 to E9, E5 to E9, E8, E10, E14, and E1 to E15), duplications (E11 to E12), and a nonsense mutation, p.S22*, were identified. Traditional multiplex ligation-dependent probe amplification and Sanger sequencing approaches cannot differentiate the origin of the exonic deletions in the 3' region when PMS2 and PMS2CL share identical sequences as a result of gene conversion. Our approach allows unambiguous identification of mutations in the active gene with a straightforward long-range-PCR/NGS method. Breakpoint analysis of multiple samples revealed that recurrent exon 14 deletions are mediated by homologous Alu sequences. Our comprehensive approach provides a reliable tool for accurate molecular analysis of genes containing multiple copies of highly homologous sequences and should improve PMS2 molecular analysis for patients with Lynch syndrome. Copyright © 2015 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

  1. A hardware-oriented algorithm for floating-point function generation

    NASA Technical Reports Server (NTRS)

    O'Grady, E. Pearse; Young, Baek-Kyu

    1991-01-01

    An algorithm is presented for performing accurate, high-speed, floating-point function generation for univariate functions defined at arbitrary breakpoints. Rapid identification of the breakpoint interval, which includes the input argument, is shown to be the key operation in the algorithm. A hardware implementation which makes extensive use of read/write memories is used to illustrate the algorithm.

  2. Identification of a B lymphoid disorder defined by specific morphologic features, a del(11)(q13) and a same breakpoint far from CCND1

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Morel, D.; Callet, E.; Reynaud, S.

    Chromosome rearrangements in 11q13 have been shown to occur in a variety of diffuse small B-cell lymphomas/leukemias, including, beside mantle cell lymphomas (MCL), some cases of CLL/SLL, PLL, and SLVL. If t(11;14)(q13;q32) may be considered as a hallmark of MCL, less is known about deletions involving 11q13. A series of 13 patients with a diffuse small B-lymphoma/leukemia was examined for morphology (cytology and histology), immunology, cytogenetics and FISH for some of them. According to karyotype findings, 2 groups were identified: Group 1: 9 patients (6M,3F), median age 60 yrs. without 11q anomalies. Apart from trisomy 12 (3 cases), diverse anomaliesmore » were identified including chromosomes 1, 2, 7, 8, 12, 17. Cases were classified as CLL (2), SLL/SLP (5), blast-rich immunocytomas (2). Group 2: 4 patients, all males, median age 52 yrs. with breakpoints in 11q13; there were 3 deletions, and a t(11;14) was present in another case. 2 patients presented with refractory disease followed for 23 and 9 months, respectively, without any consistent morphologic change, the chromosomal anomaly being present at diagnosis in 1.3 cases. Cytologically, a nucleolated cell component was the constant and striking feature and FISH study by cos 14, pHS 11, cos 17, cos 105 revealed the same breakpoint located far from CCND1. The fourth case, bearing the t(11;14), was diagnosed as CLL/PLL in cytology, but was histologically consistent with MCL; the breakpoint was located by FISH into the BCL1 locus. Even if this study needs further confirmation, it points at the 11q13 deletion as a genetic event leading to a more aggressive disease, associated with distinct cytologic features differing from MCL and a molecular event probably not involving BCL1/CCND1.« less

  3. Bayesian models for comparative analysis integrating phylogenetic uncertainty.

    PubMed

    de Villemereuil, Pierre; Wells, Jessie A; Edwards, Robert D; Blomberg, Simon P

    2012-06-28

    Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. We show that BUGS is a useful, flexible general purpose tool for phylogenetic comparative analyses

  4. Bayesian models for comparative analysis integrating phylogenetic uncertainty

    PubMed Central

    2012-01-01

    Background Uncertainty in comparative analyses can come from at least two sources: a) phylogenetic uncertainty in the tree topology or branch lengths, and b) uncertainty due to intraspecific variation in trait values, either due to measurement error or natural individual variation. Most phylogenetic comparative methods do not account for such uncertainties. Not accounting for these sources of uncertainty leads to false perceptions of precision (confidence intervals will be too narrow) and inflated significance in hypothesis testing (e.g. p-values will be too small). Although there is some application-specific software for fitting Bayesian models accounting for phylogenetic error, more general and flexible software is desirable. Methods We developed models to directly incorporate phylogenetic uncertainty into a range of analyses that biologists commonly perform, using a Bayesian framework and Markov Chain Monte Carlo analyses. Results We demonstrate applications in linear regression, quantification of phylogenetic signal, and measurement error models. Phylogenetic uncertainty was incorporated by applying a prior distribution for the phylogeny, where this distribution consisted of the posterior tree sets from Bayesian phylogenetic tree estimation programs. The models were analysed using simulated data sets, and applied to a real data set on plant traits, from rainforest plant species in Northern Australia. Analyses were performed using the free and open source software OpenBUGS and JAGS. Conclusions Incorporating phylogenetic uncertainty through an empirical prior distribution of trees leads to more precise estimation of regression model parameters than using a single consensus tree and enables a more realistic estimation of confidence intervals. In addition, models incorporating measurement errors and/or individual variation, in one or both variables, are easily formulated in the Bayesian framework. We show that BUGS is a useful, flexible general purpose tool for

  5. Open Reading Frame Phylogenetic Analysis on the Cloud

    PubMed Central

    2013-01-01

    Phylogenetic analysis has become essential in researching the evolutionary relationships between viruses. These relationships are depicted on phylogenetic trees, in which viruses are grouped based on sequence similarity. Viral evolutionary relationships are identified from open reading frames rather than from complete sequences. Recently, cloud computing has become popular for developing internet-based bioinformatics tools. Biocloud is an efficient, scalable, and robust bioinformatics computing service. In this paper, we propose a cloud-based open reading frame phylogenetic analysis service. The proposed service integrates the Hadoop framework, virtualization technology, and phylogenetic analysis methods to provide a high-availability, large-scale bioservice. In a case study, we analyze the phylogenetic relationships among Norovirus. Evolutionary relationships are elucidated by aligning different open reading frame sequences. The proposed platform correctly identifies the evolutionary relationships between members of Norovirus. PMID:23671843

  6. DNA Sequences Proximal to Human Mitochondrial DNA Deletion Breakpoints Prevalent in Human Disease Form G-quadruplexes, a Class of DNA Structures Inefficiently Unwound by the Mitochondrial Replicative Twinkle Helicase*

    PubMed Central

    Bharti, Sanjay Kumar; Sommers, Joshua A.; Zhou, Jun; Kaplan, Daniel L.; Spelbrink, Johannes N.; Mergny, Jean-Louis; Brosh, Robert M.

    2014-01-01

    Mitochondrial DNA deletions are prominent in human genetic disorders, cancer, and aging. It is thought that stalling of the mitochondrial replication machinery during DNA synthesis is a prominent source of mitochondrial genome instability; however, the precise molecular determinants of defective mitochondrial replication are not well understood. In this work, we performed a computational analysis of the human mitochondrial genome using the “Pattern Finder” G-quadruplex (G4) predictor algorithm to assess whether G4-forming sequences reside in close proximity (within 20 base pairs) to known mitochondrial DNA deletion breakpoints. We then used this information to map G4P sequences with deletions characteristic of representative mitochondrial genetic disorders and also those identified in various cancers and aging. Circular dichroism and UV spectral analysis demonstrated that mitochondrial G-rich sequences near deletion breakpoints prevalent in human disease form G-quadruplex DNA structures. A biochemical analysis of purified recombinant human Twinkle protein (gene product of c10orf2) showed that the mitochondrial replicative helicase inefficiently unwinds well characterized intermolecular and intramolecular G-quadruplex DNA substrates, as well as a unimolecular G4 substrate derived from a mitochondrial sequence that nests a deletion breakpoint described in human renal cell carcinoma. Although G4 has been implicated in the initiation of mitochondrial DNA replication, our current findings suggest that mitochondrial G-quadruplexes are also likely to be a source of instability for the mitochondrial genome by perturbing the normal progression of the mitochondrial replication machinery, including DNA unwinding by Twinkle helicase. PMID:25193669

  7. Monte Carlo estimation of total variation distance of Markov chains on large spaces, with application to phylogenetics.

    PubMed

    Herbei, Radu; Kubatko, Laura

    2013-03-26

    Markov chains are widely used for modeling in many areas of molecular biology and genetics. As the complexity of such models advances, it becomes increasingly important to assess the rate at which a Markov chain converges to its stationary distribution in order to carry out accurate inference. A common measure of convergence to the stationary distribution is the total variation distance, but this measure can be difficult to compute when the state space of the chain is large. We propose a Monte Carlo method to estimate the total variation distance that can be applied in this situation, and we demonstrate how the method can be efficiently implemented by taking advantage of GPU computing techniques. We apply the method to two Markov chains on the space of phylogenetic trees, and discuss the implications of our findings for the development of algorithms for phylogenetic inference.

  8. In Vivo Activities of Amoxicillin and Amoxicillin-Clavulanate against Streptococcus pneumoniae: Application to Breakpoint Determinations

    PubMed Central

    Andes, D.; Craig, W. A.

    1998-01-01

    The in vivo activities of amoxicillin and amoxicillin-clavulanate against 17 strains of Streptococcus pneumoniae with penicillin MICs of 0.12–8.0 mg/liter were assessed in a cyclophosphamide-induced neutropenic murine thigh infection model. Renal impairment was produced by administration of uranyl nitrate to prolong the amoxicillin half-life in the mice from 21 to 65 min, simulating human pharmacokinetics. Two hours after thigh infection with 105 to 106 CFU, groups of mice were treated with 7 mg of amoxicillin per kg of body weight alone or combined with clavulanate (ratio, 4:1) every 8 h for 1 and 4 days. There was an excellent correlation between the MIC of amoxicillin (0.03 to 5.6 mg/liter) and (i) the change in log10 CFU/thigh at 24 h and (ii) survival after 4 days of therapy. Organisms for which MICs were 2 mg/liter or less were killed at 1.4 to 4.2 and 1.6 to 4.1 log10 CFU/thigh at 24 h by amoxicillin and amoxicillin-clavulanate, respectively. The four strains for which MICs were >4 mg/liter grew 0.2 to 2.6 and 0.6 to 2.3 logs at 24 h despite therapy with amoxicillin and amoxicillin-clavulanate, respectively. Infection was uniformly fatal by 72 h in untreated mice. Amoxicillin therapy resulted in no mortality with organisms for which MICs were 1 mg/liter or less, 20 to 40% mortality with organisms for which MICs were 2 mg/liter, and 80 to 100% mortality with organisms for which MICs were 4.0–5.6 mg/liter. Lower and higher doses (0.5, 2, and 20 mg/kg) of amoxicillin were studied against organisms for which MICs were near the breakpoint. These studies demonstrate that a reduction of 1 log10 or greater in CFU/thigh at 24 h is consistently observed when amoxicillin levels exceed the MIC for 25 to 30% of the dosing interval. These studies would support amoxicillin (and amoxicillin-clavulanate) MIC breakpoints of 1 mg/liter for susceptible, 2 mg/liter for intermediate, and 4 mg/liter for resistant strains of S. pneumoniae. PMID:9736566

  9. N-nitrosodimethylamine (NDMA) formation potential of amine-based water treatment polymers: Effects of in situ chloramination, breakpoint chlorination, and pre-oxidation.

    PubMed

    Park, Sang Hyuck; Padhye, Lokesh P; Wang, Pei; Cho, Min; Kim, Jae-Hong; Huang, Ching-Hua

    2015-01-23

    Recent studies show that cationic amine-based water treatment polymers may be important precursors that contribute to formation of the probable human carcinogen N-nitrosodimethylamine (NDMA) during water treatment and disinfection. To better understand how water treatment parameters affect NDMA formation from the polymers, the effects of in situ chloramination, breakpoint chlorination, and pre-oxidation on the NDMA formation from the polymers were investigated. NDMA formation potential (NDMA-FP) as well as dimethylamine (DMA) residual concentration were measured from poly(epichlorohydrin dimethylamine) (polyamine) and poly(diallyldimethylammonium chloride) (polyDADMAC) solutions upon reactions with oxidants including free chlorine, chlorine dioxide, ozone, and monochloramine under different treatment conditions. The results supported that dichloramine (NHCl2) formation was the critical factor affecting NDMA formation from the polymers during in situ chloramination. The highest NDMA formation from the polymers occurred near the breakpoint of chlorination. Polymer chain breakdown and transformation of the released DMA and other intermediates were important factors affecting NDMA formation from the polymers in pre-oxidation followed by post-chloramination. Pre-oxidation generally reduced NDMA-FP of the polymers; however, the treatments involving pre-ozonation increased polyDADMAC's NDMA-FP and DMA release. The strategies for reducing NDMA formation from the polymers may include the avoidance of the conditions favorable to NHCl2 formation and the avoidance of polymer exposure to strong oxidants such as ozone. Copyright © 2014 Elsevier B.V. All rights reserved.

  10. Cyber infrastructure for Fusarium: three integrated platforms supporting strain identification, phylogenetics, comparative genomics and knowledge sharing.

    PubMed

    Park, Bongsoo; Park, Jongsun; Cheong, Kyeong-Chae; Choi, Jaeyoung; Jung, Kyongyong; Kim, Donghan; Lee, Yong-Hwan; Ward, Todd J; O'Donnell, Kerry; Geiser, David M; Kang, Seogchan

    2011-01-01

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate species identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on well-preserved culture collections, have established a robust foundation for Fusarium classification. Genomes of four Fusarium species have been published with more being currently sequenced. The Cyber infrastructure for Fusarium (CiF; http://www.fusariumdb.org/) was built to support archiving and utilization of rapidly increasing data and knowledge and consists of Fusarium-ID, Fusarium Comparative Genomics Platform (FCGP) and Fusarium Community Platform (FCP). The Fusarium-ID archives phylogenetic marker sequences from most known species along with information associated with characterized isolates and supports strain identification and phylogenetic analyses. The FCGP currently archives five genomes from four species. Besides supporting genome browsing and analysis, the FCGP presents computed characteristics of multiple gene families and functional groups. The Cart/Favorite function allows users to collect sequences from Fusarium-ID and the FCGP and analyze them later using multiple tools without requiring repeated copying-and-pasting of sequences. The FCP is designed to serve as an online community forum for sharing and preserving accumulated experience and knowledge to support future research and education.

  11. Cyber infrastructure for Fusarium: three integrated platforms supporting strain identification, phylogenetics, comparative genomics and knowledge sharing

    PubMed Central

    Park, Bongsoo; Park, Jongsun; Cheong, Kyeong-Chae; Choi, Jaeyoung; Jung, Kyongyong; Kim, Donghan; Lee, Yong-Hwan; Ward, Todd J.; O'Donnell, Kerry; Geiser, David M.; Kang, Seogchan

    2011-01-01

    The fungal genus Fusarium includes many plant and/or animal pathogenic species and produces diverse toxins. Although accurate species identification is critical for managing such threats, it is difficult to identify Fusarium morphologically. Fortunately, extensive molecular phylogenetic studies, founded on well-preserved culture collections, have established a robust foundation for Fusarium classification. Genomes of four Fusarium species have been published with more being currently sequenced. The Cyber infrastructure for Fusarium (CiF; http://www.fusariumdb.org/) was built to support archiving and utilization of rapidly increasing data and knowledge and consists of Fusarium-ID, Fusarium Comparative Genomics Platform (FCGP) and Fusarium Community Platform (FCP). The Fusarium-ID archives phylogenetic marker sequences from most known species along with information associated with characterized isolates and supports strain identification and phylogenetic analyses. The FCGP currently archives five genomes from four species. Besides supporting genome browsing and analysis, the FCGP presents computed characteristics of multiple gene families and functional groups. The Cart/Favorite function allows users to collect sequences from Fusarium-ID and the FCGP and analyze them later using multiple tools without requiring repeated copying-and-pasting of sequences. The FCP is designed to serve as an online community forum for sharing and preserving accumulated experience and knowledge to support future research and education. PMID:21087991

  12. Phylogenetic Properties of RNA Viruses

    PubMed Central

    Pompei, Simone; Loreto, Vittorio; Tria, Francesca

    2012-01-01

    A new word, phylodynamics, was coined to emphasize the interconnection between phylogenetic properties, as observed for instance in a phylogenetic tree, and the epidemic dynamics of viruses, where selection, mediated by the host immune response, and transmission play a crucial role. The challenges faced when investigating the evolution of RNA viruses call for a virtuous loop of data collection, data analysis and modeling. This already resulted both in the collection of massive sequences databases and in the formulation of hypotheses on the main mechanisms driving qualitative differences observed in the (reconstructed) evolutionary patterns of different RNA viruses. Qualitatively, it has been observed that selection driven by the host immune response induces an uneven survival ability among co-existing strains. As a consequence, the imbalance level of the phylogenetic tree is manifestly more pronounced if compared to the case when the interaction with the host immune system does not play a central role in the evolutive dynamics. While many imbalance metrics have been introduced, reliable methods to discriminate in a quantitative way different level of imbalance are still lacking. In our work, we reconstruct and analyze the phylogenetic trees of six RNA viruses, with a special emphasis on the human Influenza A virus, due to its relevance for vaccine preparation as well as for the theoretical challenges it poses due to its peculiar evolutionary dynamics. We focus in particular on topological properties. We point out the limitation featured by standard imbalance metrics, and we introduce a new methodology with which we assign the correct imbalance level of the phylogenetic trees, in agreement with the phylodynamics of the viruses. Our thorough quantitative analysis allows for a deeper understanding of the evolutionary dynamics of the considered RNA viruses, which is crucial in order to provide a valuable framework for a quantitative assessment of theoretical

  13. Visualizing Phylogenetic Treespace Using Cartographic Projections

    NASA Astrophysics Data System (ADS)

    Sundberg, Kenneth; Clement, Mark; Snell, Quinn

    Phylogenetic analysis is becoming an increasingly important tool for biological research. Applications include epidemiological studies, drug development, and evolutionary analysis. Phylogenetic search is a known NP-Hard problem. The size of the data sets which can be analyzed is limited by the exponential growth in the number of trees that must be considered as the problem size increases. A better understanding of the problem space could lead to better methods, which in turn could lead to the feasible analysis of more data sets. We present a definition of phylogenetic tree space and a visualization of this space that shows significant exploitable structure. This structure can be used to develop search methods capable of handling much larger datasets.

  14. Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison

    PubMed Central

    2013-01-01

    Background Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifying differences in microbiota composition through the use of 16S rRNA gene hypervariable tag sequencing has profound health implications. Current computational methods for comparing microbial communities are usually based on multiple alignments and phylogenetic inference, making them time consuming and requiring exceptional expertise and computational resources. As sequencing data rapidly grows in size, simpler analysis methods are needed to meet the growing computational burdens of microbiota comparisons. Thus, we have developed a simple, rapid, and accurate method, independent of multiple alignments and phylogenetic inference, to support microbiota comparisons. Results We create a metric, called compression-based distance (CBD) for quantifying the degree of similarity between microbial communities. CBD uses the repetitive nature of hypervariable tag datasets and well-established compression algorithms to approximate the total information shared between two datasets. Three published microbiota datasets were used as test cases for CBD as an applicable tool. Our study revealed that CBD recaptured 100% of the statistically significant conclusions reported in the previous studies, while achieving a decrease in computational time required when compared to similar tools without expert user intervention. Conclusion CBD provides a simple, rapid, and accurate method for assessing distances between gastrointestinal tract microbiota 16S hypervariable tag datasets. PMID:23617892

  15. Compression-based distance (CBD): a simple, rapid, and accurate method for microbiota composition comparison.

    PubMed

    Yang, Fang; Chia, Nicholas; White, Bryan A; Schook, Lawrence B

    2013-04-23

    Perturbations in intestinal microbiota composition have been associated with a variety of gastrointestinal tract-related diseases. The alleviation of symptoms has been achieved using treatments that alter the gastrointestinal tract microbiota toward that of healthy individuals. Identifying differences in microbiota composition through the use of 16S rRNA gene hypervariable tag sequencing has profound health implications. Current computational methods for comparing microbial communities are usually based on multiple alignments and phylogenetic inference, making them time consuming and requiring exceptional expertise and computational resources. As sequencing data rapidly grows in size, simpler analysis methods are needed to meet the growing computational burdens of microbiota comparisons. Thus, we have developed a simple, rapid, and accurate method, independent of multiple alignments and phylogenetic inference, to support microbiota comparisons. We create a metric, called compression-based distance (CBD) for quantifying the degree of similarity between microbial communities. CBD uses the repetitive nature of hypervariable tag datasets and well-established compression algorithms to approximate the total information shared between two datasets. Three published microbiota datasets were used as test cases for CBD as an applicable tool. Our study revealed that CBD recaptured 100% of the statistically significant conclusions reported in the previous studies, while achieving a decrease in computational time required when compared to similar tools without expert user intervention. CBD provides a simple, rapid, and accurate method for assessing distances between gastrointestinal tract microbiota 16S hypervariable tag datasets.

  16. Folding and unfolding phylogenetic trees and networks.

    PubMed

    Huber, Katharina T; Moulton, Vincent; Steel, Mike; Wu, Taoyang

    2016-12-01

    Phylogenetic networks are rooted, labelled directed acyclic graphswhich are commonly used to represent reticulate evolution. There is a close relationship between phylogenetic networks and multi-labelled trees (MUL-trees). Indeed, any phylogenetic network N can be "unfolded" to obtain a MUL-tree U(N) and, conversely, a MUL-tree T can in certain circumstances be "folded" to obtain aphylogenetic network F(T) that exhibits T. In this paper, we study properties of the operations U and F in more detail. In particular, we introduce the class of stable networks, phylogenetic networks N for which F(U(N)) is isomorphic to N, characterise such networks, and show that they are related to the well-known class of tree-sibling networks. We also explore how the concept of displaying a tree in a network N can be related to displaying the tree in the MUL-tree U(N). To do this, we develop aphylogenetic analogue of graph fibrations. This allows us to view U(N) as the analogue of the universal cover of a digraph, and to establish a close connection between displaying trees in U(N) and reconciling phylogenetic trees with networks.

  17. Worldwide Phylogenetic Relationship of Avian Poxviruses

    PubMed Central

    Foster, Jeffrey T.; Dán, Ádám; Ip, Hon S.; Egstad, Kristina F.; Parker, Patricia G.; Higashiguchi, Jenni M.; Skinner, Michael A.; Höfle, Ursula; Kreizinger, Zsuzsa; Dorrestein, Gerry M.; Solt, Szabolcs; Sós, Endre; Kim, Young Jun; Uhart, Marcela; Pereda, Ariel; González-Hein, Gisela; Hidalgo, Hector; Blanco, Juan-Manuel; Erdélyi, Károly

    2013-01-01

    Poxvirus infections have been found in 230 species of wild and domestic birds worldwide in both terrestrial and marine environments. This ubiquity raises the question of how infection has been transmitted and globally dispersed. We present a comprehensive global phylogeny of 111 novel poxvirus isolates in addition to all available sequences from GenBank. Phylogenetic analysis of the Avipoxvirus genus has traditionally relied on one gene region (4b core protein). In this study we expanded the analyses to include a second locus (DNA polymerase gene), allowing for a more robust phylogenetic framework, finer genetic resolution within specific groups, and the detection of potential recombination. Our phylogenetic results reveal several major features of avipoxvirus evolution and ecology and propose an updated avipoxvirus taxonomy, including three novel subclades. The characterization of poxviruses from 57 species of birds in this study extends the current knowledge of their host range and provides the first evidence of the phylogenetic effect of genetic recombination of avipoxviruses. The repeated occurrence of avian family or order-specific grouping within certain clades (e.g., starling poxvirus, falcon poxvirus, raptor poxvirus, etc.) indicates a marked role of host adaptation, while the sharing of poxvirus species within prey-predator systems emphasizes the capacity for cross-species infection and limited host adaptation. Our study provides a broad and comprehensive phylogenetic analysis of the Avipoxvirus genus, an ecologically and environmentally important viral group, to formulate a genome sequencing strategy that will clarify avipoxvirus taxonomy. PMID:23408635

  18. Worldwide phylogenetic relationship of avian poxviruses

    USGS Publications Warehouse

    Gyuranecz, Miklós; Foster, Jeffrey T.; Dán, Ádám; Ip, Hon S.; Egstad, Kristina F.; Parker, Patricia G.; Higashiguchi, Jenni M.; Skinner, Michael A.; Höfle, Ursula; Kreizinger, Zsuzsa; Dorrestein, Gerry M.; Solt, Szabolcs; Sós, Endre; Kim, Young Jun; Uhart, Marcela; Pereda, Ariel; González-Hein, Gisela; Hidalgo, Hector; Blanco, Juan-Manuel; Erdélyi, Károly

    2013-01-01

    Poxvirus infections have been found in 230 species of wild and domestic birds worldwide in both terrestrial and marine environments. This ubiquity raises the question of how infection has been transmitted and globally dispersed. We present a comprehensive global phylogeny of 111 novel poxvirus isolates in addition to all available sequences from GenBank. Phylogenetic analysis of Avipoxvirus genus has traditionally relied on one gene region (4b core protein). In this study we have expanded the analyses to include a second locus (DNA polymerase gene), allowing for a more robust phylogenetic framework, finer genetic resolution within specific groups and the detection of potential recombination. Our phylogenetic results reveal several major features of avipoxvirus evolution and ecology and propose an updated avipoxvirus taxonomy, including three novel subclades. The characterization of poxviruses from 57 species of birds in this study extends the current knowledge of their host range and provides the first evidence of the phylogenetic effect of genetic recombination of avipoxviruses. The repeated occurrence of avian family or order-specific grouping within certain clades (e.g. starling poxvirus, falcon poxvirus, raptor poxvirus, etc.) indicates a marked role of host adaptation, while the sharing of poxvirus species within prey-predator systems emphasizes the capacity for cross-species infection and limited host adaptation. Our study provides a broad and comprehensive phylogenetic analysis of the Avipoxvirus genus, an ecologically and environmentally important viral group, to formulate a genome sequencing strategy that will clarify avipoxvirus taxonomy.

  19. Localization of a translocation breakpoint involved in Smith-Lemli-Opitz syndrome

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Alley, T.L.; Gray, B.A.; Lee, S.

    1994-09-01

    Smith-Lemli-Opitz syndrome (SLOS) is a multiple congenital anomaly/mental retardation syndrome, with features including toe syndactyly, genital anomalies, unusual facies, and occasional organ malformations. The gene(s) for this autosomal recessive disorder has not been mapped. Recent biochemical studies suggest that the defect may involve the penultimate step in cholesterol synthesis, as patients have low serum cholesterol and increased 7-dehydrocholesterol (7-DHC) levels. However, the enzyme putatively involved (7-DHC reductase) has not been isolated. We identified an SLOS patient with a de novo balanced chromosome translocation [t(7;20)(q32.1;q13.2)], and we propose that the translocation interrupts one of the patient`s SLOS alleles. We are pursuingmore » positional cloning to identify the SLOS gene. Using fluorescence in situ hybridization (FISH), we recently identified a chromosome 7 yeast artificial chromosome (YAC) that spans the breakpoint and places it onto physical and genetic maps. We are in the process of narrowing this region via overlapping YACs and YAC subclones, from which we will isolate candidate cDNAs. Any candidate gene disrupted by the translocation and mutated on the other allele will be proven to be the SLOS gene. Functional analysis of an SLOS cDNA may also determine its relationship to cholesterol metabolism and the observed biochemical abnormalities.« less

  20. Mapping Phylogenetic Trees to Reveal Distinct Patterns of Evolution

    PubMed Central

    Kendall, Michelle; Colijn, Caroline

    2016-01-01

    Evolutionary relationships are frequently described by phylogenetic trees, but a central barrier in many fields is the difficulty of interpreting data containing conflicting phylogenetic signals. We present a metric-based method for comparing trees which extracts distinct alternative evolutionary relationships embedded in data. We demonstrate detection and resolution of phylogenetic uncertainty in a recent study of anole lizards, leading to alternate hypotheses about their evolutionary relationships. We use our approach to compare trees derived from different genes of Ebolavirus and find that the VP30 gene has a distinct phylogenetic signature composed of three alternatives that differ in the deep branching structure. Key words: phylogenetics, evolution, tree metrics, genetics, sequencing. PMID:27343287

  1. The Diaporthe sojae species complex: Phylogenetic re-assessment of pathogens associated with soybean, cucurbits and other field crops.

    PubMed

    Udayanga, Dhanushka; Castlebury, Lisa A; Rossman, Amy Y; Chukeatirote, Ekachai; Hyde, Kevin D

    2015-05-01

    Phytopathogenic species of Diaporthe are associated with a number of soybean diseases including seed decay, pod and stem blight and stem canker and lead to considerable crop production losses worldwide. Accurate morphological identification of the species that cause these diseases has been difficult. In this study, we determined the phylogenetic relationships and species boundaries of Diaporthe longicolla, Diaporthe phaseolorum, Diaporthe sojae and closely related taxa. Species boundaries for this complex were determined based on combined phylogenetic analysis of five gene regions: partial sequences of calmodulin (CAL), beta-tubulin (TUB), histone-3 (HIS), translation elongation factor 1-α (EF1-α), and the nuclear ribosomal internal transcribed spacers (ITS). Phylogenetic analyses revealed that this large complex of taxa is comprised of soybean pathogens as well as species associated with herbaceous field crops and weeds. Diaporthe arctii, Diaporthe batatas, D. phaseolorum and D. sojae are epitypified. The seed decay pathogen D. longicolla was determined to be distinct from D. sojae. D. phaseolorum, originally associated with stem and leaf blight of Lima bean, was not found to be associated with soybean. A new species, Diaporthe ueckerae on Cucumis melo, is introduced with description and illustrations. Published by Elsevier Ltd.

  2. Phylogenetic tree construction using trinucleotide usage profile (TUP).

    PubMed

    Chen, Si; Deng, Lih-Yuan; Bowman, Dale; Shiau, Jyh-Jen Horng; Wong, Tit-Yee; Madahian, Behrouz; Lu, Henry Horng-Shing

    2016-10-06

    It has been a challenging task to build a genome-wide phylogenetic tree for a large group of species containing a large number of genes with long nucleotides sequences. The most popular method, called feature frequency profile (FFP-k), finds the frequency distribution for all words of certain length k over the whole genome sequence using (overlapping) windows of the same length. For a satisfactory result, the recommended word length (k) ranges from 6 to 15 and it may not be a multiple of 3 (codon length). The total number of possible words needed for FFP-k can range from 4 6 =4096 to 4 15 . We propose a simple improvement over the popular FFP method using only a typical word length of 3. A new method, called Trinucleotide Usage Profile (TUP), is proposed based only on the (relative) frequency distribution using non-overlapping windows of length 3. The total number of possible words needed for TUP is 4 3 =64, which is much less than the total count for the recommended optimal "resolution" for FFP. To build a phylogenetic tree, we propose first representing each of the species by a TUP vector and then using an appropriate distance measure between pairs of the TUP vectors for the tree construction. In particular, we propose summarizing a DNA sequence by a matrix of three rows corresponding to three reading frames, recording the frequency distribution of the non-overlapping words of length 3 in each of the reading frame. We also provide a numerical measure for comparing trees constructed with various methods. Compared to the FFP method, our empirical study showed that the proposed TUP method is more capable of building phylogenetic trees with a stronger biological support. We further provide some justifications on this from the information theory viewpoint. Unlike the FFP method, the TUP method takes the advantage that the starting of the first reading frame is (usually) known. Without this information, the FFP method could only rely on the frequency distribution of

  3. Predicting rates of interspecific interaction from phylogenetic trees.

    PubMed

    Nuismer, Scott L; Harmon, Luke J

    2015-01-01

    Integrating phylogenetic information can potentially improve our ability to explain species' traits, patterns of community assembly, the network structure of communities, and ecosystem function. In this study, we use mathematical models to explore the ecological and evolutionary factors that modulate the explanatory power of phylogenetic information for communities of species that interact within a single trophic level. We find that phylogenetic relationships among species can influence trait evolution and rates of interaction among species, but only under particular models of species interaction. For example, when interactions within communities are mediated by a mechanism of phenotype matching, phylogenetic trees make specific predictions about trait evolution and rates of interaction. In contrast, if interactions within a community depend on a mechanism of phenotype differences, phylogenetic information has little, if any, predictive power for trait evolution and interaction rate. Together, these results make clear and testable predictions for when and how evolutionary history is expected to influence contemporary rates of species interaction. © 2014 John Wiley & Sons Ltd/CNRS.

  4. Molecular identification and phylogenetic analysis of Wuchereria bancrofti from human blood samples in Egypt.

    PubMed

    Abdel-Shafi, Iman R; Shoieb, Eman Y; Attia, Samar S; Rubio, José M; Ta-Tang, Thuy-Huong; El-Badry, Ayman A

    2017-03-01

    Lymphatic filariasis (LF) is a serious vector-borne health problem, and Wuchereria bancrofti (W.b) is the major cause of LF worldwide and is focally endemic in Egypt. Identification of filarial infection using traditional morphologic and immunological criteria can be difficult and lead to misdiagnosis. The aim of the present study was molecular detection of W.b in residents in endemic areas in Egypt, sequence variance analysis, and phylogenetic analysis of W.b DNA. Collected blood samples from residents in filariasis endemic areas in five governorates were subjected to semi-nested PCR targeting repeated DNA sequence, for detection of W.b DNA. PCR products were sequenced; subsequently, a phylogenetic analysis of the obtained sequences was performed. Out of 300 blood samples, W.b DNA was identified in 48 (16%). Sequencing analysis confirmed PCR results identifying only W.b species. Sequence alignment and phylogenetic analysis indicated genetically distinct clusters of W.b among the study population. Study results demonstrated that the semi-nested PCR proved to be an effective diagnostic tool for accurate and rapid detection of W.b infections in nano-epidemics and is applicable for samples collected in the daytime as well as the night time. PCR products sequencing and phylogenitic analysis revealed three different nucleotide sequences variants. Further genetic studies of W.b in Egypt and other endemic areas are needed to distinguish related strains and the various ecological as well as drug effects exerted on them to support W.b elimination.

  5. Maximizing the phylogenetic diversity of seed banks.

    PubMed

    Griffiths, Kate E; Balding, Sharon T; Dickie, John B; Lewis, Gwilym P; Pearce, Tim R; Grenyer, Richard

    2015-04-01

    Ex situ conservation efforts such as those of zoos, botanical gardens, and seed banks will form a vital complement to in situ conservation actions over the coming decades. It is therefore necessary to pay the same attention to the biological diversity represented in ex situ conservation facilities as is often paid to protected-area networks. Building the phylogenetic diversity of ex situ collections will strengthen our capacity to respond to biodiversity loss. Since 2000, the Millennium Seed Bank Partnership has banked seed from 14% of the world's plant species. We assessed the taxonomic, geographic, and phylogenetic diversity of the Millennium Seed Bank collection of legumes (Leguminosae). We compared the collection with all known legume genera, their known geographic range (at country and regional levels), and a genus-level phylogeny of the legume family constructed for this study. Over half the phylogenetic diversity of legumes at the genus level was represented in the Millennium Seed Bank. However, pragmatic prioritization of species of economic importance and endangerment has led to the banking of a less-than-optimal phylogenetic diversity and prioritization of range-restricted species risks an underdispersed collection. The current state of the phylogenetic diversity of legumes in the Millennium Seed Bank could be substantially improved through the strategic banking of relatively few additional taxa. Our method draws on tools that are widely applied to in situ conservation planning, and it can be used to evaluate and improve the phylogenetic diversity of ex situ collections. © 2014 Society for Conservation Biology.

  6. Rapid and accurate pyrosequencing of angiosperm plastid genomes

    PubMed Central

    Moore, Michael J; Dhingra, Amit; Soltis, Pamela S; Shaw, Regina; Farmerie, William G; Folta, Kevin M; Soltis, Douglas E

    2006-01-01

    Background Plastid genome sequence information is vital to several disciplines in plant biology, including phylogenetics and molecular biology. The past five years have witnessed a dramatic increase in the number of completely sequenced plastid genomes, fuelled largely by advances in conventional Sanger sequencing technology. Here we report a further significant reduction in time and cost for plastid genome sequencing through the successful use of a newly available pyrosequencing platform, the Genome Sequencer 20 (GS 20) System (454 Life Sciences Corporation), to rapidly and accurately sequence the whole plastid genomes of the basal eudicot angiosperms Nandina domestica (Berberidaceae) and Platanus occidentalis (Platanaceae). Results More than 99.75% of each plastid genome was simultaneously obtained during two GS 20 sequence runs, to an average depth of coverage of 24.6× in Nandina and 17.3× in Platanus. The Nandina and Platanus plastid genomes shared essentially identical gene complements and possessed the typical angiosperm plastid structure and gene arrangement. To assess the accuracy of the GS 20 sequence, over 45 kilobases of sequence were generated for each genome using conventional sequencing. Overall error rates of 0.043% and 0.031% were observed in GS 20 sequence for Nandina and Platanus, respectively. More than 97% of all observed errors were associated with homopolymer runs, with ~60% of all errors associated with homopolymer runs of 5 or more nucleotides and ~50% of all errors associated with regions of extensive homopolymer runs. No substitution errors were present in either genome. Error rates were generally higher in the single-copy and noncoding regions of both plastid genomes relative to the inverted repeat and coding regions. Conclusion Highly accurate and essentially complete sequence information was obtained for the Nandina and Platanus plastid genomes using the GS 20 System. More importantly, the high accuracy observed in the GS 20 plastid

  7. Mapping Phylogenetic Trees to Reveal Distinct Patterns of Evolution.

    PubMed

    Kendall, Michelle; Colijn, Caroline

    2016-10-01

    Evolutionary relationships are frequently described by phylogenetic trees, but a central barrier in many fields is the difficulty of interpreting data containing conflicting phylogenetic signals. We present a metric-based method for comparing trees which extracts distinct alternative evolutionary relationships embedded in data. We demonstrate detection and resolution of phylogenetic uncertainty in a recent study of anole lizards, leading to alternate hypotheses about their evolutionary relationships. We use our approach to compare trees derived from different genes of Ebolavirus and find that the VP30 gene has a distinct phylogenetic signature composed of three alternatives that differ in the deep branching structure. phylogenetics, evolution, tree metrics, genetics, sequencing. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  8. Phylogenetic Copy-Number Factorization of Multiple Tumor Samples.

    PubMed

    Zaccaria, Simone; El-Kebir, Mohammed; Klau, Gunnar W; Raphael, Benjamin J

    2018-04-16

    Cancer is an evolutionary process driven by somatic mutations. This process can be represented as a phylogenetic tree. Constructing such a phylogenetic tree from genome sequencing data is a challenging task due to the many types of mutations in cancer and the fact that nearly all cancer sequencing is of a bulk tumor, measuring a superposition of somatic mutations present in different cells. We study the problem of reconstructing tumor phylogenies from copy-number aberrations (CNAs) measured in bulk-sequencing data. We introduce the Copy-Number Tree Mixture Deconvolution (CNTMD) problem, which aims to find the phylogenetic tree with the fewest number of CNAs that explain the copy-number data from multiple samples of a tumor. We design an algorithm for solving the CNTMD problem and apply the algorithm to both simulated and real data. On simulated data, we find that our algorithm outperforms existing approaches that either perform deconvolution/factorization of mixed tumor samples or build phylogenetic trees assuming homogeneous tumor samples. On real data, we analyze multiple samples from a prostate cancer patient, identifying clones within these samples and a phylogenetic tree that relates these clones and their differing proportions across samples. This phylogenetic tree provides a higher resolution view of copy-number evolution of this cancer than published analyses.

  9. Molecular phylogenetics and species delimitation of leaf-toed geckos (Phyllodactylidae: Phyllodactylus) throughout the Mexican tropical dry forest.

    PubMed

    Blair, Christopher; Méndez de la Cruz, Fausto R; Law, Christopher; Murphy, Robert W

    2015-03-01

    Methods and approaches for accurate species delimitation continue to be a highly controversial subject in the systematics community. Inaccurate assessment of species' limits precludes accurate inference of historical evolutionary processes. Recent evidence suggests that multilocus coalescent methods show promise in delimiting species in cryptic clades. We combine multilocus sequence data with coalescence-based phylogenetics in a hypothesis-testing framework to assess species limits and elucidate the timing of diversification in leaf-toed geckos (Phyllodactylus) of Mexico's dry forests. Tropical deciduous forests (TDF) of the Neotropics are among the planet's most diverse ecosystems. However, in comparison to moist tropical forests, little is known about the mode and tempo of biotic evolution throughout this threatened biome. We find increased speciation and substantial, cryptic molecular diversity originating following the formation of Mexican TDF 30-20million years ago due to orogenesis of the Sierra Madre Occidental and Mexican Volcanic Belt. Phylogenetic results suggest that the Mexican Volcanic Belt, the Rio Fuerte, and Isthmus of Tehuantepec may be important biogeographic barriers. Single- and multilocus coalescent analyses suggest that nearly every sampling locality may be a distinct species. These results suggest unprecedented levels of diversity, a complex evolutionary history, and that the formation and expansion of TDF vegetation in the Miocene may have influenced subsequent cladogenesis of leaf-toed geckos throughout western Mexico. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. The phylogenetic roots of human lethal violence.

    PubMed

    Gómez, José María; Verdú, Miguel; González-Megías, Adela; Méndez, Marcos

    2016-10-13

    The psychological, sociological and evolutionary roots of conspecific violence in humans are still debated, despite attracting the attention of intellectuals for over two millennia. Here we propose a conceptual approach towards understanding these roots based on the assumption that aggression in mammals, including humans, has a significant phylogenetic component. By compiling sources of mortality from a comprehensive sample of mammals, we assessed the percentage of deaths due to conspecifics and, using phylogenetic comparative tools, predicted this value for humans. The proportion of human deaths phylogenetically predicted to be caused by interpersonal violence stood at 2%. This value was similar to the one phylogenetically inferred for the evolutionary ancestor of primates and apes, indicating that a certain level of lethal violence arises owing to our position within the phylogeny of mammals. It was also similar to the percentage seen in prehistoric bands and tribes, indicating that we were as lethally violent then as common mammalian evolutionary history would predict. However, the level of lethal violence has changed through human history and can be associated with changes in the socio-political organization of human populations. Our study provides a detailed phylogenetic and historical context against which to compare levels of lethal violence observed throughout our history.

  11. Breakpoint Analysis and Assessment of Selected Stressor Variables on Benthic Macroinvertebrate and Fish Communities in Indiana Streams: Implications for Developing Nutrient Criteria

    USGS Publications Warehouse

    Caskey, Brian J.; Frey, Jeffrey W.; Selvaratnam, Shivi

    2010-01-01

    Water chemistry, periphyton and seston chlorophyll a (CHLa), and biological community data were collected from 321 sites from 2001 through 2005 to (1) determine statistically and ecologically significant relations among the stressor (total nitrogen, total phosphorus, periphyton and seston CHLa, and turbidity) variables and response (biological community) variables; and, (2) determine the breakpoint of biological community attributes and metrics in response to changes in stressor variables. Because of the typically weak relations among the stressor and response variables, methods were developed to reduce the effects of non-nutrient biological stressors that could mask the effect of nutrients. Stressor variable concentrations ranged from 0.30 to 11.0 milligrams per liter (mg/L) for total nitrogen, 0.025 to 1.33 mg/L for total phosphorus, 2.9 to 768 milligrams per square meter (mg/m2) for periphyton CHLa, and 0.37 to 42 micrograms per liter (ug/L) for seston CHLa. Turbidity, another stressor variable, ranged from 0.8 to 65.4 Nephelometric turbidity units (NTUs). When the nutrient and CHLa data were compared to Dodds' trophic classifications, 75.0 percent of the values for total nitrogen, 46.6 percent of the values for total phosphorus, 35.8 percent of the values for periphyton CHLa, and 3.5 percent of the values for seston CHLa, were eutrophic. The invertebrate communities were dominated by families considered highly nutrient tolerant, Chironimidae, (41.7 percent relative abundance), Hydropsychidae, (17.3 percent relative abundance), and Baetidae, (10.2 percent relative abundance). Fish communities were dominated by algivores and nutrient-tolerant species, specifically central stonerollers (13.3 percent relative abundance), creek chubs (9.9 percent relative abundance), and bluntnose minnows (9.3 percent relative abundance). Although not the dominant taxa, white sucker, spotted sucker, green sunfish, and bluegill species were correlated (p ?0.05) with the stressor

  12. Anchoring quartet-based phylogenetic distances and applications to species tree reconstruction.

    PubMed

    Sayyari, Erfan; Mirarab, Siavash

    2016-11-11

    Inferring species trees from gene trees using the coalescent-based summary methods has been the subject of much attention, yet new scalable and accurate methods are needed. We introduce DISTIQUE, a new statistically consistent summary method for inferring species trees from gene trees under the coalescent model. We generalize our results to arbitrary phylogenetic inference problems; we show that two arbitrarily chosen leaves, called anchors, can be used to estimate relative distances between all other pairs of leaves by inferring relevant quartet trees. This results in a family of distance-based tree inference methods, with running times ranging between quadratic to quartic in the number of leaves. We show in simulated studies that DISTIQUE has comparable accuracy to leading coalescent-based summary methods and reduced running times.

  13. Refuting phylogenetic relationships

    PubMed Central

    Bucknam, James; Boucher, Yan; Bapteste, Eric

    2006-01-01

    Background Phylogenetic methods are philosophically grounded, and so can be philosophically biased in ways that limit explanatory power. This constitutes an important methodologic dimension not often taken into account. Here we address this dimension in the context of concatenation approaches to phylogeny. Results We discuss some of the limits of a methodology restricted to verificationism, the philosophy on which gene concatenation practices generally rely. As an alternative, we describe a software which identifies and focuses on impossible or refuted relationships, through a simple analysis of bootstrap bipartitions, followed by multivariate statistical analyses. We show how refuting phylogenetic relationships could in principle facilitate systematics. We also apply our method to the study of two complex phylogenies: the phylogeny of the archaea and the phylogeny of the core of genes shared by all life forms. While many groups are rejected, our results left open a possible proximity of N. equitans and the Methanopyrales, of the Archaea and the Cyanobacteria, and as well the possible grouping of the Methanobacteriales/Methanoccocales and Thermosplasmatales, of the Spirochaetes and the Actinobacteria and of the Proteobacteria and firmicutes. Conclusion It is sometimes easier (and preferable) to decide which species do not group together than which ones do. When possible topologies are limited, identifying local relationships that are rejected may be a useful alternative to classical concatenation approaches aiming to find a globally resolved tree on the basis of weak phylogenetic markers. Reviewers This article was reviewed by Mark Ragan, Eugene V Koonin and J Peter Gogarten. PMID:16956399

  14. Associations of Leaf Spectra with Genetic and Phylogenetic Variation in Oaks: Prospects for Remote Detection of Biodiversity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Cavender-Bares, Jeannine; Meireles, Jose; Couture, John

    Species and phylogenetic lineages have evolved to differ in the way that they acquire and deploy resources, with consequences for their physiological, chemical and structural attributes, many of which can be detected using spectral reflectance form leaves. Recent technological advances for assessing optical properties of plants offer opportunities to detect functional traits of organisms and differentiate levels of biological organization across the tree of life. We connect leaf-level full range spectral data (400–2400 nm) of leaves to the hierarchical organization of plant diversity within the oak genus (Quercus) using field and greenhouse experiments in which environmental factors and plant agemore » are controlled. We show that spectral data significantly differentiate populations within a species and that spectral similarity is significantly associated with phylogenetic similarity among species. Furthermore, we show that hyperspectral information allows more accurate classification of taxa than spectrally-derived traits, which by definition are of lower dimensionality. Finally, model accuracy increases at higher levels in the hierarchical organization of plant diversity, such that we are able to better distinguish clades than species or populations. This pattern supports an evolutionary explanation for the degree of optical differentiation among plants and demonstrates potential for remote detection of genetic and phylogenetic diversity.« less

  15. Associations of Leaf Spectra with Genetic and Phylogenetic Variation in Oaks: Prospects for Remote Detection of Biodiversity

    DOE PAGES

    Cavender-Bares, Jeannine; Meireles, Jose; Couture, John; ...

    2016-03-09

    Species and phylogenetic lineages have evolved to differ in the way that they acquire and deploy resources, with consequences for their physiological, chemical and structural attributes, many of which can be detected using spectral reflectance form leaves. Recent technological advances for assessing optical properties of plants offer opportunities to detect functional traits of organisms and differentiate levels of biological organization across the tree of life. We connect leaf-level full range spectral data (400–2400 nm) of leaves to the hierarchical organization of plant diversity within the oak genus (Quercus) using field and greenhouse experiments in which environmental factors and plant agemore » are controlled. We show that spectral data significantly differentiate populations within a species and that spectral similarity is significantly associated with phylogenetic similarity among species. Furthermore, we show that hyperspectral information allows more accurate classification of taxa than spectrally-derived traits, which by definition are of lower dimensionality. Finally, model accuracy increases at higher levels in the hierarchical organization of plant diversity, such that we are able to better distinguish clades than species or populations. This pattern supports an evolutionary explanation for the degree of optical differentiation among plants and demonstrates potential for remote detection of genetic and phylogenetic diversity.« less

  16. Phylogenetic turnover along local environmental gradients in tropical forest communities.

    PubMed

    Baldeck, C A; Kembel, S W; Harms, K E; Yavitt, J B; John, R; Turner, B L; Madawala, S; Gunatilleke, N; Gunatilleke, S; Bunyavejchewin, S; Kiratiprayoon, S; Yaacob, A; Supardi, M N N; Valencia, R; Navarrete, H; Davies, S J; Chuyong, G B; Kenfack, D; Thomas, D W; Dalling, J W

    2016-10-01

    While the importance of local-scale habitat niches in shaping tree species turnover along environmental gradients in tropical forests is well appreciated, relatively little is known about the influence of phylogenetic signal in species' habitat niches in shaping local community structure. We used detailed maps of the soil resource and topographic variation within eight 24-50 ha tropical forest plots combined with species phylogenies created from the APG III phylogeny to examine how phylogenetic beta diversity (indicating the degree of phylogenetic similarity of two communities) was related to environmental gradients within tropical tree communities. Using distance-based redundancy analysis we found that phylogenetic beta diversity, expressed as either nearest neighbor distance or mean pairwise distance, was significantly related to both soil and topographic variation in all study sites. In general, more phylogenetic beta diversity within a forest plot was explained by environmental variables this was expressed as nearest neighbor distance versus mean pairwise distance (3.0-10.3 % and 0.4-8.8 % of variation explained among plots, respectively), and more variation was explained by soil resource variables than topographic variables using either phylogenetic beta diversity metric. We also found that patterns of phylogenetic beta diversity expressed as nearest neighbor distance were consistent with previously observed patterns of niche similarity among congeneric species pairs in these plots. These results indicate the importance of phylogenetic signal in local habitat niches in shaping the phylogenetic structure of tropical tree communities, especially at the level of close phylogenetic neighbors, where similarity in habitat niches is most strongly preserved.

  17. Spatial phylogenetics of the native California flora.

    PubMed

    Thornhill, Andrew H; Baldwin, Bruce G; Freyman, William A; Nosratinia, Sonia; Kling, Matthew M; Morueta-Holme, Naia; Madsen, Thomas P; Ackerly, David D; Mishler, Brent D

    2017-10-26

    California is a world floristic biodiversity hotspot where the terms neo- and paleo-endemism were first applied. Using spatial phylogenetics, it is now possible to evaluate biodiversity from an evolutionary standpoint, including discovering significant areas of neo- and paleo-endemism, by combining spatial information from museum collections and DNA-based phylogenies. Here we used a distributional dataset of 1.39 million herbarium specimens, a phylogeny of 1083 operational taxonomic units (OTUs) and 9 genes, and a spatial randomization test to identify regions of significant phylogenetic diversity, relative phylogenetic diversity, and phylogenetic endemism (PE), as well as to conduct a categorical analysis of neo- and paleo-endemism (CANAPE). We found (1) extensive phylogenetic clustering in the South Coast Ranges, southern Great Valley, and deserts of California; (2) significant concentrations of short branches in the Mojave and Great Basin Deserts and the South Coast Ranges and long branches in the northern Great Valley, Sierra Nevada foothills, and the northwestern and southwestern parts of the state; (3) significant concentrations of paleo-endemism in Northwestern California, the northern Great Valley, and western Sonoran Desert, and neo-endemism in the White-Inyo Range, northern Mojave Desert, and southern Channel Islands. Multiple analyses were run to observe the effects on significance patterns of using different phylogenetic tree topologies (uncalibrated trees versus time-calibrated ultrametric trees) and using different representations of OTU ranges (herbarium specimen locations versus species distribution models). These analyses showed that examining the geographic distributions of branch lengths in a statistical framework adds a new dimension to California floristics that, in comparison with climatic data, helps to illuminate causes of endemism. In particular, the concentration of significant PE in more arid regions of California extends previous ideas

  18. Towards an eco-phylogenetic framework for infectious disease ecology.

    PubMed

    Fountain-Jones, Nicholas M; Pearse, William D; Escobar, Luis E; Alba-Casals, Ana; Carver, Scott; Davies, T Jonathan; Kraberger, Simona; Papeş, Monica; Vandegrift, Kurt; Worsley-Tonks, Katherine; Craft, Meggan E

    2018-05-01

    Identifying patterns and drivers of infectious disease dynamics across multiple scales is a fundamental challenge for modern science. There is growing awareness that it is necessary to incorporate multi-host and/or multi-parasite interactions to understand and predict current and future disease threats better, and new tools are needed to help address this task. Eco-phylogenetics (phylogenetic community ecology) provides one avenue for exploring multi-host multi-parasite systems, yet the incorporation of eco-phylogenetic concepts and methods into studies of host pathogen dynamics has lagged behind. Eco-phylogenetics is a transformative approach that uses evolutionary history to infer present-day dynamics. Here, we present an eco-phylogenetic framework to reveal insights into parasite communities and infectious disease dynamics across spatial and temporal scales. We illustrate how eco-phylogenetic methods can help untangle the mechanisms of host-parasite dynamics from individual (e.g. co-infection) to landscape scales (e.g. parasite/host community structure). An improved ecological understanding of multi-host and multi-pathogen dynamics across scales will increase our ability to predict disease threats. © 2017 Cambridge Philosophical Society.

  19. Phylogenetic comparative methods complement discriminant function analysis in ecomorphology.

    PubMed

    Barr, W Andrew; Scott, Robert S

    2014-04-01

    In ecomorphology, Discriminant Function Analysis (DFA) has been used as evidence for the presence of functional links between morphometric variables and ecological categories. Here we conduct simulations of characters containing phylogenetic signal to explore the performance of DFA under a variety of conditions. Characters were simulated using a phylogeny of extant antelope species from known habitats. Characters were modeled with no biomechanical relationship to the habitat category; the only sources of variation were body mass, phylogenetic signal, or random "noise." DFA on the discriminability of habitat categories was performed using subsets of the simulated characters, and Phylogenetic Generalized Least Squares (PGLS) was performed for each character. Analyses were repeated with randomized habitat assignments. When simulated characters lacked phylogenetic signal and/or habitat assignments were random, <5.6% of DFAs and <8.26% of PGLS analyses were significant. When characters contained phylogenetic signal and actual habitats were used, 33.27 to 45.07% of DFAs and <13.09% of PGLS analyses were significant. False Discovery Rate (FDR) corrections for multiple PGLS analyses reduced the rate of significance to <4.64%. In all cases using actual habitats and characters with phylogenetic signal, correct classification rates of DFAs exceeded random chance. In simulations involving phylogenetic signal in both predictor variables and predicted categories, PGLS with FDR was rarely significant, while DFA often was. In short, DFA offered no indication that differences between categories might be explained by phylogenetic signal, while PGLS did. As such, PGLS provides a valuable tool for testing the functional hypotheses at the heart of ecomorphology. Copyright © 2013 Wiley Periodicals, Inc.

  20. ["Long-branch Attraction" artifact in phylogenetic reconstruction].

    PubMed

    Li, Yi-Wei; Yu, Li; Zhang, Ya-Ping

    2007-06-01

    Phylogenetic reconstruction among various organisms not only helps understand their evolutionary history but also reveal several fundamental evolutionary questions. Understanding of the evolutionary relationships among organisms establishes the foundation for the investigations of other biological disciplines. However, almost all the widely used phylogenetic methods have limitations which fail to eliminate systematic errors effectively, preventing the reconstruction of true organismal relationships. "Long-branch Attraction" (LBA) artifact is one of the most disturbing factors in phylogenetic reconstruction. In this review, the conception and analytic method as well as the avoidance strategy of LBA were summarized. In addition, several typical examples were provided. The approach to avoid and resolve LBA artifact has been discussed.

  1. Understanding phylogenetic incongruence: lessons from phyllostomid bats

    PubMed Central

    Dávalos, Liliana M; Cirranello, Andrea L; Geisler, Jonathan H; Simmons, Nancy B

    2012-01-01

    All characters and trait systems in an organism share a common evolutionary history that can be estimated using phylogenetic methods. However, differential rates of change and the evolutionary mechanisms driving those rates result in pervasive phylogenetic conflict. These drivers need to be uncovered because mismatches between evolutionary processes and phylogenetic models can lead to high confidence in incorrect hypotheses. Incongruence between phylogenies derived from morphological versus molecular analyses, and between trees based on different subsets of molecular sequences has become pervasive as datasets have expanded rapidly in both characters and species. For more than a decade, evolutionary relationships among members of the New World bat family Phyllostomidae inferred from morphological and molecular data have been in conflict. Here, we develop and apply methods to minimize systematic biases, uncover the biological mechanisms underlying phylogenetic conflict, and outline data requirements for future phylogenomic and morphological data collection. We introduce new morphological data for phyllostomids and outgroups and expand previous molecular analyses to eliminate methodological sources of phylogenetic conflict such as taxonomic sampling, sparse character sampling, or use of different algorithms to estimate the phylogeny. We also evaluate the impact of biological sources of conflict: saturation in morphological changes and molecular substitutions, and other processes that result in incongruent trees, including convergent morphological and molecular evolution. Methodological sources of incongruence play some role in generating phylogenetic conflict, and are relatively easy to eliminate by matching taxa, collecting more characters, and applying the same algorithms to optimize phylogeny. The evolutionary patterns uncovered are consistent with multiple biological sources of conflict, including saturation in morphological and molecular changes, adaptive

  2. Long-Branch Attraction Bias and Inconsistency in Bayesian Phylogenetics

    PubMed Central

    Kolaczkowski, Bryan; Thornton, Joseph W.

    2009-01-01

    Bayesian inference (BI) of phylogenetic relationships uses the same probabilistic models of evolution as its precursor maximum likelihood (ML), so BI has generally been assumed to share ML's desirable statistical properties, such as largely unbiased inference of topology given an accurate model and increasingly reliable inferences as the amount of data increases. Here we show that BI, unlike ML, is biased in favor of topologies that group long branches together, even when the true model and prior distributions of evolutionary parameters over a group of phylogenies are known. Using experimental simulation studies and numerical and mathematical analyses, we show that this bias becomes more severe as more data are analyzed, causing BI to infer an incorrect tree as the maximum a posteriori phylogeny with asymptotically high support as sequence length approaches infinity. BI's long branch attraction bias is relatively weak when the true model is simple but becomes pronounced when sequence sites evolve heterogeneously, even when this complexity is incorporated in the model. This bias—which is apparent under both controlled simulation conditions and in analyses of empirical sequence data—also makes BI less efficient and less robust to the use of an incorrect evolutionary model than ML. Surprisingly, BI's bias is caused by one of the method's stated advantages—that it incorporates uncertainty about branch lengths by integrating over a distribution of possible values instead of estimating them from the data, as ML does. Our findings suggest that trees inferred using BI should be interpreted with caution and that ML may be a more reliable framework for modern phylogenetic analysis. PMID:20011052

  3. Long-branch attraction bias and inconsistency in Bayesian phylogenetics.

    PubMed

    Kolaczkowski, Bryan; Thornton, Joseph W

    2009-12-09

    Bayesian inference (BI) of phylogenetic relationships uses the same probabilistic models of evolution as its precursor maximum likelihood (ML), so BI has generally been assumed to share ML's desirable statistical properties, such as largely unbiased inference of topology given an accurate model and increasingly reliable inferences as the amount of data increases. Here we show that BI, unlike ML, is biased in favor of topologies that group long branches together, even when the true model and prior distributions of evolutionary parameters over a group of phylogenies are known. Using experimental simulation studies and numerical and mathematical analyses, we show that this bias becomes more severe as more data are analyzed, causing BI to infer an incorrect tree as the maximum a posteriori phylogeny with asymptotically high support as sequence length approaches infinity. BI's long branch attraction bias is relatively weak when the true model is simple but becomes pronounced when sequence sites evolve heterogeneously, even when this complexity is incorporated in the model. This bias--which is apparent under both controlled simulation conditions and in analyses of empirical sequence data--also makes BI less efficient and less robust to the use of an incorrect evolutionary model than ML. Surprisingly, BI's bias is caused by one of the method's stated advantages--that it incorporates uncertainty about branch lengths by integrating over a distribution of possible values instead of estimating them from the data, as ML does. Our findings suggest that trees inferred using BI should be interpreted with caution and that ML may be a more reliable framework for modern phylogenetic analysis.

  4. Prioritizing Populations for Conservation Using Phylogenetic Networks

    PubMed Central

    Volkmann, Logan; Martyn, Iain; Moulton, Vincent; Spillner, Andreas; Mooers, Arne O.

    2014-01-01

    In the face of inevitable future losses to biodiversity, ranking species by conservation priority seems more than prudent. Setting conservation priorities within species (i.e., at the population level) may be critical as species ranges become fragmented and connectivity declines. However, existing approaches to prioritization (e.g., scoring organisms by their expected genetic contribution) are based on phylogenetic trees, which may be poor representations of differentiation below the species level. In this paper we extend evolutionary isolation indices used in conservation planning from phylogenetic trees to phylogenetic networks. Such networks better represent population differentiation, and our extension allows populations to be ranked in order of their expected contribution to the set. We illustrate the approach using data from two imperiled species: the spotted owl Strix occidentalis in North America and the mountain pygmy-possum Burramys parvus in Australia. Using previously published mitochondrial and microsatellite data, we construct phylogenetic networks and score each population by its relative genetic distinctiveness. In both cases, our phylogenetic networks capture the geographic structure of each species: geographically peripheral populations harbor less-redundant genetic information, increasing their conservation rankings. We note that our approach can be used with all conservation-relevant distances (e.g., those based on whole-genome, ecological, or adaptive variation) and suggest it be added to the assortment of tools available to wildlife managers for allocating effort among threatened populations. PMID:24586451

  5. Estimating hybridization in the presence of coalescence using phylogenetic intraspecific sampling.

    PubMed

    Gerard, David; Gibbs, H Lisle; Kubatko, Laura

    2011-10-06

    A well-known characteristic of multi-locus data is that each locus has its own phylogenetic history which may differ substantially from the overall phylogenetic history of the species. Although the possibility that this arises through incomplete lineage sorting is often incorporated in models for the species-level phylogeny, it is much less common for hybridization to also be formally included in such models. We have modified the evolutionary model of Meng and Kubatko (2009) to incorporate intraspecific sampling of multiple individuals for estimation of speciation times and times of hybridization events for testing for hybridization in the presence of incomplete lineage sorting. We have also utilized a more efficient algorithm for obtaining our estimates. Using simulations, we demonstrate that our approach performs well under conditions motivated by an empirical data set for Sistrurus rattlesnakes where putative hybridization has occurred. We further demonstrate that the method is able to accurately detect the signature of hybridization in the data, while this signal may be obscured when other species-tree inference methods that ignore hybridization are used. Our approach is shown to be powerful in detecting hybridization when it is present. When applied to the Sistrurus data, we find no evidence of hybridization; instead, it appears that putative hybrid snakes in Missouri are most likely pure S. catenatus tergeminus in origin, which has significant conservation implications.

  6. Phylogenetic structure of soil bacterial communities predicts ecosystem functioning.

    PubMed

    Pérez-Valera, Eduardo; Goberna, Marta; Verdú, Miguel

    2015-05-01

    Quantifying diversity with phylogeny-informed metrics helps understand the effects of diversity on ecosystem functioning (EF). The sign of these effects remains controversial because phylogenetic diversity and taxonomic identity may interactively influence EF. Positive relationships, traditionally attributed to complementarity effects, seem unimportant in natural soil bacterial communities. Negative relationships could be attributed to fitness differences leading to the overrepresentation of few productive clades, a mechanism recently invoked to assemble soil bacteria communities. We tested in two ecosystems contrasting in terms of environmental heterogeneity whether two metrics of phylogenetic community structure, a simpler measure of phylogenetic diversity (NRI) and a more complex metric incorporating taxonomic identity (PCPS), correctly predict microbially mediated EF. We show that the relationship between phylogenetic diversity and EF depends on the taxonomic identity of the main coexisting lineages. Phylogenetic diversity was negatively related to EF in soils where a marked fertility gradient exists and a single and productive clade (Proteobacteria) outcompete other clades in the most fertile plots. However, phylogenetic diversity was unrelated to EF in soils where the fertility gradient is less marked and Proteobacteria coexist with other abundant lineages. Including the taxonomic identity of bacterial lineages in metrics of phylogenetic community structure allows the prediction of EF in both ecosystems. © FEMS 2015. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  7. Molecular characterization and phylogenetic analysis of Sugarcane yellow leaf virus isolates from China.

    PubMed

    Gao, San-Ji; Lin, Yi-Hua; Pan, Yong-Bao; Damaj, Mona B; Wang, Qin-Nan; Mirkov, T Erik; Chen, Ru-Kai

    2012-10-01

    Sugarcane yellow leaf virus (SCYLV) (genus Polerovirus, family Luteoviridae), the causal agent of sugarcane yellow leaf disease (YLD), was first detected in China in 2006. To assess the distribution of SCYLV in the major sugarcane-growing Chinese provinces, leaf samples from 22 sugarcane clones (Saccharum spp. hybrid) showing YLD symptoms were collected and analyzed for infection by the virus using reverse transcription PCR (RT-PCR), quantitative RT-PCR, and immunological assays. A complete genomic sequence (5,879 nt) of the Chinese SCYLV isolate CHN-FJ1 and partial genomic sequences (2,915 nt) of 13 other Chinese SCYLV isolates from this study were amplified, cloned, and sequenced. The genomic sequence of the CHN-FJ1 isolate was found to share a high identity (98.4-99.1 %) with those of the Brazilian (BRA) genotype isolates and a low identity (86.5-86.9 %) with those of the CHN1 and Cuban (CUB) genotype isolates. The genetic diversity of these 14 Chinese SCYLV isolates was assessed along with that of 29 SCYLV isolates of worldwide origin reported in the GenBank database, based on the full or partial genomic sequence. Phylogenetic analysis demonstrated that all the 14 Chinese SCYLV isolates clustered into one large group with the BRA genotype and 12 other reported SCYLV isolates. In addition, five reported Chinese SCYLV isolates were grouped with the Peruvian (PER), CHN1 and CUB genotypes. We therefore speculated that at least four SCYLV genotypes, BRA, PER, CHN1, and CUB, are associated with YLD in China. Interestingly, a 39-nt deletion was detected in the sequence of the CHN-GD3 isolate, in the middle of the ORF1 region adjacent to the overlap between ORF1 and ORF2. This location is known to be one of the recombination breakpoints in the Luteoviridae family.

  8. A program to compute the soft Robinson-Foulds distance between phylogenetic networks.

    PubMed

    Lu, Bingxin; Zhang, Louxin; Leong, Hon Wai

    2017-03-14

    Over the past two decades, phylogenetic networks have been studied to model reticulate evolutionary events. The relationships among phylogenetic networks, phylogenetic trees and clusters serve as the basis for reconstruction and comparison of phylogenetic networks. To understand these relationships, two problems are raised: the tree containment problem, which asks whether a phylogenetic tree is displayed in a phylogenetic network, and the cluster containment problem, which asks whether a cluster is represented at a node in a phylogenetic network. Both the problems are NP-complete. A fast exponential-time algorithm for the cluster containment problem on arbitrary networks is developed and implemented in C. The resulting program is further extended into a computer program for fast computation of the Soft Robinson-Foulds distance between phylogenetic networks. Two computer programs are developed for facilitating reconstruction and validation of phylogenetic network models in evolutionary and comparative genomics. Our simulation tests indicated that they are fast enough for use in practice. Additionally, the distribution of the Soft Robinson-Foulds distance between phylogenetic networks is demonstrated to be unlikely normal by our simulation data.

  9. The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses

    PubMed Central

    Zhang, Yanjun; Du, Liuwen; Liu, Ao; Chen, Jianjun; Wu, Li; Hu, Weiming; Zhang, Wei; Kim, Kyunghee; Lee, Sang-Choon; Yang, Tae-Jin; Wang, Ying

    2016-01-01

    Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, as well as the difference of simple sequence repeats (SSR) and repeat sequence were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes showed accordance with the updated system of the genus on the whole, but reminded that the evolutionary relationships and the divisions of the genus need further investigation applying more evidences. The availability of these cp genomes provided valuable genetic information for accurately identifying species, taxonomy and phylogenetic resolution and evolution of Epimedium, and assist in exploration and utilization of Epimedium plants. PMID:27014326

  10. YBYRÁ facilitates comparison of large phylogenetic trees.

    PubMed

    Machado, Denis Jacob

    2015-07-01

    The number and size of tree topologies that are being compared by phylogenetic systematists is increasing due to technological advancements in high-throughput DNA sequencing. However, we still lack tools to facilitate comparison among phylogenetic trees with a large number of terminals. The "YBYRÁ" project integrates software solutions for data analysis in phylogenetics. It comprises tools for (1) topological distance calculation based on the number of shared splits or clades, (2) sensitivity analysis and automatic generation of sensitivity plots and (3) clade diagnoses based on different categories of synapomorphies. YBYRÁ also provides (4) an original framework to facilitate the search for potential rogue taxa based on how much they affect average matching split distances (using MSdist). YBYRÁ facilitates comparison of large phylogenetic trees and outperforms competing software in terms of usability and time efficiency, specially for large data sets. The programs that comprises this toolkit are written in Python, hence they do not require installation and have minimum dependencies. The entire project is available under an open-source licence at http://www.ib.usp.br/grant/anfibios/researchSoftware.html .

  11. Disentangling the phylogenetic and ecological components of spider phenotypic variation.

    PubMed

    Gonçalves-Souza, Thiago; Diniz-Filho, José Alexandre Felizola; Romero, Gustavo Quevedo

    2014-01-01

    An understanding of how the degree of phylogenetic relatedness influences the ecological similarity among species is crucial to inferring the mechanisms governing the assembly of communities. We evaluated the relative importance of spider phylogenetic relationships and ecological niche (plant morphological variables) to the variation in spider body size and shape by comparing spiders at different scales: (i) between bromeliads and dicot plants (i.e., habitat scale) and (ii) among bromeliads with distinct architectural features (i.e., microhabitat scale). We partitioned the interspecific variation in body size and shape into phylogenetic (that express trait values as expected by phylogenetic relationships among species) and ecological components (that express trait values independent of phylogenetic relationships). At the habitat scale, bromeliad spiders were larger and flatter than spiders associated with the surrounding dicots. At this scale, plant morphology sorted out close related spiders. Our results showed that spider flatness is phylogenetically clustered at the habitat scale, whereas it is phylogenetically overdispersed at the microhabitat scale, although phylogenic signal is present in both scales. Taken together, these results suggest that whereas at the habitat scale selective colonization affect spider body size and shape, at fine scales both selective colonization and adaptive evolution determine spider body shape. By partitioning the phylogenetic and ecological components of phenotypic variation, we were able to disentangle the evolutionary history of distinct spider traits and show that plant architecture plays a role in the evolution of spider body size and shape. We also discussed the relevance in considering multiple scales when studying phylogenetic community structure.

  12. Disentangling the Phylogenetic and Ecological Components of Spider Phenotypic Variation

    PubMed Central

    Gonçalves-Souza, Thiago; Diniz-Filho, José Alexandre Felizola; Romero, Gustavo Quevedo

    2014-01-01

    An understanding of how the degree of phylogenetic relatedness influences the ecological similarity among species is crucial to inferring the mechanisms governing the assembly of communities. We evaluated the relative importance of spider phylogenetic relationships and ecological niche (plant morphological variables) to the variation in spider body size and shape by comparing spiders at different scales: (i) between bromeliads and dicot plants (i.e., habitat scale) and (ii) among bromeliads with distinct architectural features (i.e., microhabitat scale). We partitioned the interspecific variation in body size and shape into phylogenetic (that express trait values as expected by phylogenetic relationships among species) and ecological components (that express trait values independent of phylogenetic relationships). At the habitat scale, bromeliad spiders were larger and flatter than spiders associated with the surrounding dicots. At this scale, plant morphology sorted out close related spiders. Our results showed that spider flatness is phylogenetically clustered at the habitat scale, whereas it is phylogenetically overdispersed at the microhabitat scale, although phylogenic signal is present in both scales. Taken together, these results suggest that whereas at the habitat scale selective colonization affect spider body size and shape, at fine scales both selective colonization and adaptive evolution determine spider body shape. By partitioning the phylogenetic and ecological components of phenotypic variation, we were able to disentangle the evolutionary history of distinct spider traits and show that plant architecture plays a role in the evolution of spider body size and shape. We also discussed the relevance in considering multiple scales when studying phylogenetic community structure. PMID:24651264

  13. Phylogenetic system and zoogeography of the Plecoptera.

    PubMed

    Zwick, P

    2000-01-01

    Information about the phylogenetic relationships of Plecoptera is summarized. The few characters supporting monophyly of the order are outlined. Several characters of possible significance for the search for the closest relatives of the stoneflies are discussed, but the sister-group of the order remains unknown. Numerous characters supporting the presently recognized phylogenetic system of Plecoptera are presented, alternative classifications are discussed, and suggestions for future studies are made. Notes on zoogeography are appended. The order as such is old (Permian fossils), but phylogenetic relationships and global distribution patterns suggest that evolution of the extant suborders started with the breakup of Pangaea. There is evidence of extensive recent speciation in all parts of the world.

  14. A response to Yu et al. "A forward-backward fragment assembling algorithm for the identification of genomic amplification and deletion breakpoints using high-density single nucleotide polymorphism (SNP) array", BMC Bioinformatics 2007, 8: 145.

    PubMed

    Rueda, Oscar M; Diaz-Uriarte, Ramon

    2007-10-16

    Yu et al. (BMC Bioinformatics 2007,8: 145+) have recently compared the performance of several methods for the detection of genomic amplification and deletion breakpoints using data from high-density single nucleotide polymorphism arrays. One of the methods compared is our non-homogenous Hidden Markov Model approach. Our approach uses Markov Chain Monte Carlo for inference, but Yu et al. ran the sampler for a severely insufficient number of iterations for a Markov Chain Monte Carlo-based method. Moreover, they did not use the appropriate reference level for the non-altered state. We rerun the analysis in Yu et al. using appropriate settings for both the Markov Chain Monte Carlo iterations and the reference level. Additionally, to show how easy it is to obtain answers to additional specific questions, we have added a new analysis targeted specifically to the detection of breakpoints. The reanalysis shows that the performance of our method is comparable to that of the other methods analyzed. In addition, we can provide probabilities of a given spot being a breakpoint, something unique among the methods examined. Markov Chain Monte Carlo methods require using a sufficient number of iterations before they can be assumed to yield samples from the distribution of interest. Running our method with too small a number of iterations cannot be representative of its performance. Moreover, our analysis shows how our original approach can be easily adapted to answer specific additional questions (e.g., identify edges).

  15. Cophenetic metrics for phylogenetic trees, after Sokal and Rohlf.

    PubMed

    Cardona, Gabriel; Mir, Arnau; Rosselló, Francesc; Rotger, Lucía; Sánchez, David

    2013-01-16

    Phylogenetic tree comparison metrics are an important tool in the study of evolution, and hence the definition of such metrics is an interesting problem in phylogenetics. In a paper in Taxon fifty years ago, Sokal and Rohlf proposed to measure quantitatively the difference between a pair of phylogenetic trees by first encoding them by means of their half-matrices of cophenetic values, and then comparing these matrices. This idea has been used several times since then to define dissimilarity measures between phylogenetic trees but, to our knowledge, no proper metric on weighted phylogenetic trees with nested taxa based on this idea has been formally defined and studied yet. Actually, the cophenetic values of pairs of different taxa alone are not enough to single out phylogenetic trees with weighted arcs or nested taxa. For every (rooted) phylogenetic tree T, let its cophenetic vectorφ(T) consist of all pairs of cophenetic values between pairs of taxa in T and all depths of taxa in T. It turns out that these cophenetic vectors single out weighted phylogenetic trees with nested taxa. We then define a family of cophenetic metrics dφ,p by comparing these cophenetic vectors by means of Lp norms, and we study, either analytically or numerically, some of their basic properties: neighbors, diameter, distribution, and their rank correlation with each other and with other metrics. The cophenetic metrics can be safely used on weighted phylogenetic trees with nested taxa and no restriction on degrees, and they can be computed in O(n2) time, where n stands for the number of taxa. The metrics dφ,1 and dφ,2 have positive skewed distributions, and they show a low rank correlation with the Robinson-Foulds metric and the nodal metrics, and a very high correlation with each other and with the splitted nodal metrics. The diameter of dφ,p, for p⩾1 , is in O(n(p+2)/p), and thus for low p they are more discriminative, having a wider range of values.

  16. The problem and promise of scale dependency in community phylogenetics.

    PubMed

    Swenson, Nathan G; Enquist, Brian J; Pither, Jason; Thompson, Jill; Zimmerman, Jess K

    2006-10-01

    The problem of scale dependency is widespread in investigations of ecological communities. Null model investigations of community assembly exemplify the challenges involved because they typically include subjectively defined "regional species pools." The burgeoning field of community phylogenetics appears poised to face similar challenges. Our objective is to quantify the scope of the problem of scale dependency by comparing the phylogenetic structure of assemblages across contrasting geographic and taxonomic scales. We conduct phylogenetic analyses on communities within three tropical forests, and perform a sensitivity analysis with respect to two scaleable inputs: taxonomy and species pool size. We show that (1) estimates of phylogenetic overdispersion within local assemblages depend strongly on the taxonomic makeup of the local assemblage and (2) comparing the phylogenetic structure of a local assemblage to a species pool drawn from increasingly larger geographic scales results in an increased signal of phylogenetic clustering. We argue that, rather than posing a problem, "scale sensitivities" are likely to reveal general patterns of diversity that could help identify critical scales at which local or regional influences gain primacy for the structuring of communities. In this way, community phylogenetics promises to fill an important gap in community ecology and biogeography research.

  17. Student Interpretations of Phylogenetic Trees in an Introductory Biology Course

    PubMed Central

    Dees, Jonathan; Niemi, Jarad; Montplaisir, Lisa

    2014-01-01

    Phylogenetic trees are widely used visual representations in the biological sciences and the most important visual representations in evolutionary biology. Therefore, phylogenetic trees have also become an important component of biology education. We sought to characterize reasoning used by introductory biology students in interpreting taxa relatedness on phylogenetic trees, to measure the prevalence of correct taxa-relatedness interpretations, and to determine how student reasoning and correctness change in response to instruction and over time. Counting synapomorphies and nodes between taxa were the most common forms of incorrect reasoning, which presents a pedagogical dilemma concerning labeled synapomorphies on phylogenetic trees. Students also independently generated an alternative form of correct reasoning using monophyletic groups, the use of which decreased in popularity over time. Approximately half of all students were able to correctly interpret taxa relatedness on phylogenetic trees, and many memorized correct reasoning without understanding its application. Broad initial instruction that allowed students to generate inferences on their own contributed very little to phylogenetic tree understanding, while targeted instruction on evolutionary relationships improved understanding to some extent. Phylogenetic trees, which can directly affect student understanding of evolution, appear to offer introductory biology instructors a formidable pedagogical challenge. PMID:25452489

  18. A methodological investigation of hominoid craniodental morphology and phylogenetics.

    PubMed

    Bjarnason, Alexander; Chamberlain, Andrew T; Lockwood, Charles A

    2011-01-01

    The evolutionary relationships of extant great apes and humans have been largely resolved by molecular studies, yet morphology-based phylogenetic analyses continue to provide conflicting results. In order to further investigate this discrepancy we present bootstrap clade support of morphological data based on two quantitative datasets, one dataset consisting of linear measurements of the whole skull from 5 hominoid genera and the second dataset consisting of 3D landmark data from the temporal bone of 5 hominoid genera, including 11 sub-species. Using similar protocols for both datasets, we were able to 1) compare distance-based phylogenetic methods to cladistic parsimony of quantitative data converted into discrete character states, 2) vary outgroup choice to observe its effect on phylogenetic inference, and 3) analyse male and female data separately to observe the effect of sexual dimorphism on phylogenies. Phylogenetic analysis was sensitive to methodological decisions, particularly outgroup selection, where designation of Pongo as an outgroup and removal of Hylobates resulted in greater congruence with the proposed molecular phylogeny. The performance of distance-based methods also justifies their use in phylogenetic analysis of morphological data. It is clear from our analyses that hominoid phylogenetics ought not to be used as an example of conflict between the morphological and molecular, but as an example of how outgroup and methodological choices can affect the outcome of phylogenetic analysis. Copyright © 2010 Elsevier Ltd. All rights reserved.

  19. Phylogenetic Analyses of Armillaria Reveal at Least 15 Phylogenetic Lineages in China, Seven of Which Are Associated with Cultivated Gastrodia elata

    PubMed Central

    Guo, Ting; Wang, Han Chen; Xue, Wan Qiu; Zhao, Jun; Yang, Zhu L.

    2016-01-01

    Fungal species of Armillaria, which can act as plant pathogens and/or symbionts of the Chinese traditional medicinal herb Gastrodia elata (“Tianma”), are ecologically and economically important and have consequently attracted the attention of mycologists. However, their taxonomy has been highly dependent on morphological characterization and mating tests. In this study, we phylogenetically analyzed Chinese Armillaria samples using the sequences of the internal transcribed spacer region, translation elongation factor-1 alpha gene and beta-tubulin gene. Our data revealed at least 15 phylogenetic lineages of Armillaria from China, of which seven were newly discovered and two were recorded from China for the first time. Fourteen Chinese biological species of Armillaria, which were previously defined based on mating tests, could be assigned to the 15 phylogenetic lineages identified herein. Seven of the 15 phylogenetic lineages were found to be disjunctively distributed in different continents of the Northern Hemisphere, while eight were revealed to be endemic to certain continents. In addition, we found that seven phylogenetic lineages of Armillaria were used for the cultivation of Tianma, only two of which had been recorded to be associated with Tianma previously. We also illustrated that G. elata f. glauca (“Brown Tianma”) and G. elata f. elata (“Red Tianma”), two cultivars of Tianma grown in different regions of China, form symbiotic relationships with different phylogenetic lineages of Armillaria. These findings should aid the development of Tianma cultivation in China. PMID:27138686

  20. Coincidence of synteny breakpoints with malignancy-related deletions on human chromosome 3

    PubMed Central

    Kost-Alimova, Maria; Kiss, Hajnalka; Fedorova, Ludmila; Yang, Ying; Dumanski, Jan P.; Klein, George; Imreh, Stefan

    2003-01-01

    We have found previously that during tumor growth intact human chromosome 3 transferred into tumor cells regularly looses certain 3p regions, among them the ≈1.4-Mb common eliminated region 1 (CER1) at 3p21.3. Fluorescence in situ hybridization analysis of 12 mouse orthologous loci revealed that CER1 splits into two segments in mouse and therefore contains a murine/human conservation breakpoint region (CBR). Several breaks occurred in tumors within the region surrounding the CBR, and this sequence has features that characterize unstable chromosomal regions: deletions in yeast artificial chromosome clones, late replication, gene and segment duplications, and pseudogene insertions. Sequence analysis of the entire 3p12-22 revealed that other cancer-associated deletions (regions eliminated from monochromosomal hybrids carrying an intact chromosome 3 during tumor growth and homozygous deletions found in human tumors) colocalized nonrandomly with murine/human CBRs and were characterized by an increased number of local gene duplications and murine/human conservation mismatches (single genes that do not match into the conserved chromosomal segment). The CBR within CER1 contains a simple tandem TATAGA repeat capable of forming a 40-bp-long secondary hairpin-like structure. This repeat is nonrandomly localized within the other tumor-associated deletions and in the vicinity of 3p12-22 CBRs. PMID:12738884

  1. Applying species-tree analyses to deep phylogenetic histories: challenges and potential suggested from a survey of empirical phylogenetic studies.

    PubMed

    Lanier, Hayley C; Knowles, L Lacey

    2015-02-01

    Coalescent-based methods for species-tree estimation are becoming a dominant approach for reconstructing species histories from multi-locus data, with most of the studies examining these methodologies focused on recently diverged species. However, deeper phylogenies, such as the datasets that comprise many Tree of Life (ToL) studies, also exhibit gene-tree discordance. This discord may also arise from the stochastic sorting of gene lineages during the speciation process (i.e., reflecting the random coalescence of gene lineages in ancestral populations). It remains unknown whether guidelines regarding methodologies and numbers of loci established by simulation studies at shallow tree depths translate into accurate species relationships for deeper phylogenetic histories. We address this knowledge gap and specifically identify the challenges and limitations of species-tree methods that account for coalescent variance for deeper phylogenies. Using simulated data with characteristics informed by empirical studies, we evaluate both the accuracy of estimated species trees and the characteristics associated with recalcitrant nodes, with a specific focus on whether coalescent variance is generally responsible for the lack of resolution. By determining the proportion of coalescent genealogies that support a particular node, we demonstrate that (1) species-tree methods account for coalescent variance at deep nodes and (2) mutational variance - not gene-tree discord arising from the coalescent - posed the primary challenge for accurate reconstruction across the tree. For example, many nodes were accurately resolved despite predicted discord from the random coalescence of gene lineages and nodes with poor support were distributed across a range of depths (i.e., they were not restricted to a particular recent divergences). Given their broad taxonomic scope and large sampling of taxa, deep level phylogenies pose several potential methodological complications including

  2. Climate-driven extinctions shape the phylogenetic structure of temperate tree floras.

    PubMed

    Eiserhardt, Wolf L; Borchsenius, Finn; Plum, Christoffer M; Ordonez, Alejandro; Svenning, Jens-Christian

    2015-03-01

    When taxa go extinct, unique evolutionary history is lost. If extinction is selective, and the intrinsic vulnerabilities of taxa show phylogenetic signal, more evolutionary history may be lost than expected under random extinction. Under what conditions this occurs is insufficiently known. We show that late Cenozoic climate change induced phylogenetically selective regional extinction of northern temperate trees because of phylogenetic signal in cold tolerance, leading to significantly and substantially larger than random losses of phylogenetic diversity (PD). The surviving floras in regions that experienced stronger extinction are phylogenetically more clustered, indicating that non-random losses of PD are of increasing concern with increasing extinction severity. Using simulations, we show that a simple threshold model of survival given a physiological trait with phylogenetic signal reproduces our findings. Our results send a strong warning that we may expect future assemblages to be phylogenetically and possibly functionally depauperate if anthropogenic climate change affects taxa similarly. © 2015 John Wiley & Sons Ltd/CNRS.

  3. Increased phylogenetic resolution using target enrichment in Rubus

    USDA-ARS?s Scientific Manuscript database

    Phylogenetic analyses in Rubus L. have been challenging due to polyploidy, hybridization, and apomixis within the genus. Wide morphological diversity occurs within and between species, contributing to challenges at lower and higher systematic levels. Phylogenetic inferences to date have been based o...

  4. Interpreting the universal phylogenetic tree

    NASA Technical Reports Server (NTRS)

    Woese, C. R.

    2000-01-01

    The universal phylogenetic tree not only spans all extant life, but its root and earliest branchings represent stages in the evolutionary process before modern cell types had come into being. The evolution of the cell is an interplay between vertically derived and horizontally acquired variation. Primitive cellular entities were necessarily simpler and more modular in design than are modern cells. Consequently, horizontal gene transfer early on was pervasive, dominating the evolutionary dynamic. The root of the universal phylogenetic tree represents the first stage in cellular evolution when the evolving cell became sufficiently integrated and stable to the erosive effects of horizontal gene transfer that true organismal lineages could exist.

  5. Phylogenetics beyond biology.

    PubMed

    Retzlaff, Nancy; Stadler, Peter F

    2018-06-21

    Evolutionary processes have been described not only in biology but also for a wide range of human cultural activities including languages and law. In contrast to the evolution of DNA or protein sequences, the detailed mechanisms giving rise to the observed evolution-like processes are not or only partially known. The absence of a mechanistic model of evolution implies that it remains unknown how the distances between different taxa have to be quantified. Considering distortions of metric distances, we first show that poor choices of the distance measure can lead to incorrect phylogenetic trees. Based on the well-known fact that phylogenetic inference requires additive metrics, we then show that the correct phylogeny can be computed from a distance matrix [Formula: see text] if there is a monotonic, subadditive function [Formula: see text] such that [Formula: see text] is additive. The required metric-preserving transformation [Formula: see text] can be computed as the solution of an optimization problem. This result shows that the problem of phylogeny reconstruction is well defined even if a detailed mechanistic model of the evolutionary process remains elusive.

  6. Power law tails in phylogenetic systems.

    PubMed

    Qin, Chongli; Colwell, Lucy J

    2018-01-23

    Covariance analysis of protein sequence alignments uses coevolving pairs of sequence positions to predict features of protein structure and function. However, current methods ignore the phylogenetic relationships between sequences, potentially corrupting the identification of covarying positions. Here, we use random matrix theory to demonstrate the existence of a power law tail that distinguishes the spectrum of covariance caused by phylogeny from that caused by structural interactions. The power law is essentially independent of the phylogenetic tree topology, depending on just two parameters-the sequence length and the average branch length. We demonstrate that these power law tails are ubiquitous in the large protein sequence alignments used to predict contacts in 3D structure, as predicted by our theory. This suggests that to decouple phylogenetic effects from the interactions between sequence distal sites that control biological function, it is necessary to remove or down-weight the eigenvectors of the covariance matrix with largest eigenvalues. We confirm that truncating these eigenvectors improves contact prediction.

  7. Phylogenetic tree construction based on 2D graphical representation

    NASA Astrophysics Data System (ADS)

    Liao, Bo; Shan, Xinzhou; Zhu, Wen; Li, Renfa

    2006-04-01

    A new approach based on the two-dimensional (2D) graphical representation of the whole genome sequence [Bo Liao, Chem. Phys. Lett., 401(2005) 196.] is proposed to analyze the phylogenetic relationships of genomes. The evolutionary distances are obtained through measuring the differences among the 2D curves. The fuzzy theory is used to construct phylogenetic tree. The phylogenetic relationships of H5N1 avian influenza virus illustrate the utility of our approach.

  8. Contrasting biodiversity-ecosystem functioning relationships in phylogenetic and functional diversity.

    PubMed

    Steudel, Bastian; Hallmann, Christine; Lorenz, Maike; Abrahamczyk, Stefan; Prinz, Kathleen; Herrfurth, Cornelia; Feussner, Ivo; Martini, Johannes W R; Kessler, Michael

    2016-10-01

    It is well known that ecosystem functioning is positively influenced by biodiversity. Most biodiversity-ecosystem functioning experiments have measured biodiversity based on species richness or phylogenetic relationships. However, theoretical and empirical evidence suggests that ecosystem functioning should be more closely related to functional diversity than to species richness. We applied different metrics of biodiversity in an artificial biodiversity-ecosystem functioning experiment using 64 species of green microalgae in combinations of two to 16 species. We found that phylogenetic and functional diversity were positively correlated with biomass overyield, driven by their strong correlation with species richness. At low species richness, no significant correlation between overyield and functional and phylogenetic diversity was found. However, at high species richness (16 species), we found a positive relationship of overyield with functional diversity and a negative relationship with phylogenetic diversity. We show that negative phylogenetic diversity-ecosystem functioning relationships can result from interspecific growth inhibition. The opposing performances of facilitation (functional diversity) and inhibition (phylogenetic diversity) we observed at the 16 species level suggest that phylogenetic diversity is not always a good proxy for functional diversity and that results from experiments with low species numbers may underestimate negative species interactions. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.

  9. The performance of the Congruence Among Distance Matrices (CADM) test in phylogenetic analysis

    PubMed Central

    2011-01-01

    Background CADM is a statistical test used to estimate the level of Congruence Among Distance Matrices. It has been shown in previous studies to have a correct rate of type I error and good power when applied to dissimilarity matrices and to ultrametric distance matrices. Contrary to most other tests of incongruence used in phylogenetic analysis, the null hypothesis of the CADM test assumes complete incongruence of the phylogenetic trees instead of congruence. In this study, we performed computer simulations to assess the type I error rate and power of the test. It was applied to additive distance matrices representing phylogenies and to genetic distance matrices obtained from nucleotide sequences of different lengths that were simulated on randomly generated trees of varying sizes, and under different evolutionary conditions. Results Our results showed that the test has an accurate type I error rate and good power. As expected, power increased with the number of objects (i.e., taxa), the number of partially or completely congruent matrices and the level of congruence among distance matrices. Conclusions Based on our results, we suggest that CADM is an excellent candidate to test for congruence and, when present, to estimate its level in phylogenomic studies where numerous genes are analysed simultaneously. PMID:21388552

  10. Estimating phylogenetic trees from genome-scale data.

    PubMed

    Liu, Liang; Xi, Zhenxiang; Wu, Shaoyuan; Davis, Charles C; Edwards, Scott V

    2015-12-01

    The heterogeneity of signals in the genomes of diverse organisms poses challenges for traditional phylogenetic analysis. Phylogenetic methods known as "species tree" methods have been proposed to directly address one important source of gene tree heterogeneity, namely the incomplete lineage sorting that occurs when evolving lineages radiate rapidly, resulting in a diversity of gene trees from a single underlying species tree. Here we review theory and empirical examples that help clarify conflicts between species tree and concatenation methods, and misconceptions in the literature about the performance of species tree methods. Considering concatenation as a special case of the multispecies coalescent model helps explain differences in the behavior of the two methods on phylogenomic data sets. Recent work suggests that species tree methods are more robust than concatenation approaches to some of the classic challenges of phylogenetic analysis, including rapidly evolving sites in DNA sequences and long-branch attraction. We show that approaches, such as binning, designed to augment the signal in species tree analyses can distort the distribution of gene trees and are inconsistent. Computationally efficient species tree methods incorporating biological realism are a key to phylogenetic analysis of whole-genome data. © 2015 New York Academy of Sciences.

  11. Quantifying MCMC exploration of phylogenetic tree space.

    PubMed

    Whidden, Chris; Matsen, Frederick A

    2015-05-01

    In order to gain an understanding of the effectiveness of phylogenetic Markov chain Monte Carlo (MCMC), it is important to understand how quickly the empirical distribution of the MCMC converges to the posterior distribution. In this article, we investigate this problem on phylogenetic tree topologies with a metric that is especially well suited to the task: the subtree prune-and-regraft (SPR) metric. This metric directly corresponds to the minimum number of MCMC rearrangements required to move between trees in common phylogenetic MCMC implementations. We develop a novel graph-based approach to analyze tree posteriors and find that the SPR metric is much more informative than simpler metrics that are unrelated to MCMC moves. In doing so, we show conclusively that topological peaks do occur in Bayesian phylogenetic posteriors from real data sets as sampled with standard MCMC approaches, investigate the efficiency of Metropolis-coupled MCMC (MCMCMC) in traversing the valleys between peaks, and show that conditional clade distribution (CCD) can have systematic problems when there are multiple peaks. © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

  12. The phylogenetic structure of plant-pollinator networks increases with habitat size and isolation.

    PubMed

    Aizen, Marcelo A; Gleiser, Gabriela; Sabatino, Malena; Gilarranz, Luis J; Bascompte, Jordi; Verdú, Miguel

    2016-01-01

    Similarity among species in traits related to ecological interactions is frequently associated with common ancestry. Thus, closely related species usually interact with ecologically similar partners, which can be reinforced by diverse co-evolutionary processes. The effect of habitat fragmentation on the phylogenetic signal in interspecific interactions and correspondence between plant and animal phylogenies is, however, unknown. Here, we address to what extent phylogenetic signal and co-phylogenetic congruence of plant-animal interactions depend on habitat size and isolation by analysing the phylogenetic structure of 12 pollination webs from isolated Pampean hills. Phylogenetic signal in interspecific interactions differed among webs, being stronger for flower-visiting insects than plants. Phylogenetic signal and overall co-phylogenetic congruence increased independently with hill size and isolation. We propose that habitat fragmentation would erode the phylogenetic structure of interaction webs. A decrease in phylogenetic signal and co-phylogenetic correspondence in plant-pollinator interactions could be associated with less reliable mutualism and erratic co-evolutionary change. © 2015 John Wiley & Sons Ltd/CNRS.

  13. Phylogenetic lineages in Entomophthoromycota

    USDA-ARS?s Scientific Manuscript database

    Entomophthoromycota Humber is one of five major phylogenetic lineages among the former phylum Zygomycota. These early terrestrial fungi share evolutionarily ancestral characters such as coenocytic mycelium and gametangiogamy as a sexual process resulting in zygospore formation. Previous molecular st...

  14. Phylogenetic context determines the role of competition in adaptive radiation

    PubMed Central

    Tan, Jiaqi; Slattery, Matthew R.; Yang, Xian; Jiang, Lin

    2016-01-01

    Understanding ecological mechanisms regulating the evolution of biodiversity is of much interest to ecologists and evolutionary biologists. Adaptive radiation constitutes an important evolutionary process that generates biodiversity. Competition has long been thought to influence adaptive radiation, but the directionality of its effect and associated mechanisms remain ambiguous. Here, we report a rigorous experimental test of the role of competition on adaptive radiation using the rapidly evolving bacterium Pseudomonas fluorescens SBW25 interacting with multiple bacterial species that differed in their phylogenetic distance to the diversifying bacterium. We showed that the inhibitive effect of competitors on the adaptive radiation of P. fluorescens decreased as their phylogenetic distance increased. To explain this phylogenetic dependency of adaptive radiation, we linked the phylogenetic distance between P. fluorescens and its competitors to their niche and competitive fitness differences. Competitive fitness differences, which showed weak phylogenetic signal, reduced P. fluorescens abundance and thus diversification, whereas phylogenetically conserved niche differences promoted diversification. These results demonstrate the context dependency of competitive effects on adaptive radiation, and highlight the importance of past evolutionary history for ongoing evolutionary processes. PMID:27335414

  15. DendroPy: a Python library for phylogenetic computing.

    PubMed

    Sukumaran, Jeet; Holder, Mark T

    2010-06-15

    DendroPy is a cross-platform library for the Python programming language that provides for object-oriented reading, writing, simulation and manipulation of phylogenetic data, with an emphasis on phylogenetic tree operations. DendroPy uses a splits-hash mapping to perform rapid calculations of tree distances, similarities and shape under various metrics. It contains rich simulation routines to generate trees under a number of different phylogenetic and coalescent models. DendroPy's data simulation and manipulation facilities, in conjunction with its support of a broad range of phylogenetic data formats (NEXUS, Newick, PHYLIP, FASTA, NeXML, etc.), allow it to serve a useful role in various phyloinformatics and phylogeographic pipelines. The stable release of the library is available for download and automated installation through the Python Package Index site (http://pypi.python.org/pypi/DendroPy), while the active development source code repository is available to the public from GitHub (http://github.com/jeetsukumaran/DendroPy).

  16. Phylogenetic and functional diversity in large carnivore assemblages

    PubMed Central

    Dalerum, F.

    2013-01-01

    Large terrestrial carnivores are important ecological components and prominent flagship species, but are often extinction prone owing to a combination of biological traits and high levels of human persecution. This study combines phylogenetic and functional diversity evaluations of global and continental large carnivore assemblages to provide a framework for conservation prioritization both between and within assemblages. Species-rich assemblages of large carnivores simultaneously had high phylogenetic and functional diversity, but species contributions to phylogenetic and functional diversity components were not positively correlated. The results further provide ecological justification for the largest carnivore species as a focus for conservation action, and suggests that range contraction is a likely cause of diminishing carnivore ecosystem function. This study highlights that preserving species-rich carnivore assemblages will capture both high phylogenetic and functional diversity, but that prioritizing species within assemblages will involve trade-offs between optimizing contemporary ecosystem function versus the evolutionary potential for future ecosystem performance. PMID:23576787

  17. On the information content of discrete phylogenetic characters.

    PubMed

    Bordewich, Magnus; Deutschmann, Ina Maria; Fischer, Mareike; Kasbohm, Elisa; Semple, Charles; Steel, Mike

    2017-12-16

    Phylogenetic inference aims to reconstruct the evolutionary relationships of different species based on genetic (or other) data. Discrete characters are a particular type of data, which contain information on how the species should be grouped together. However, it has long been known that some characters contain more information than others. For instance, a character that assigns the same state to each species groups all of them together and so provides no insight into the relationships of the species considered. At the other extreme, a character that assigns a different state to each species also conveys no phylogenetic signal. In this manuscript, we study a natural combinatorial measure of the information content of an individual character and analyse properties of characters that provide the maximum phylogenetic information, particularly, the number of states such a character uses and how the different states have to be distributed among the species or taxa of the phylogenetic tree.

  18. Estimation of rates-across-sites distributions in phylogenetic substitution models.

    PubMed

    Susko, Edward; Field, Chris; Blouin, Christian; Roger, Andrew J

    2003-10-01

    Previous work has shown that it is often essential to account for the variation in rates at different sites in phylogenetic models in order to avoid phylogenetic artifacts such as long branch attraction. In most current models, the gamma distribution is used for the rates-across-sites distributions and is implemented as an equal-probability discrete gamma. In this article, we introduce discrete distribution estimates with large numbers of equally spaced rate categories allowing us to investigate the appropriateness of the gamma model. With large numbers of rate categories, these discrete estimates are flexible enough to approximate the shape of almost any distribution. Likelihood ratio statistical tests and a nonparametric bootstrap confidence-bound estimation procedure based on the discrete estimates are presented that can be used to test the fit of a parametric family. We applied the methodology to several different protein data sets, and found that although the gamma model often provides a good parametric model for this type of data, rate estimates from an equal-probability discrete gamma model with a small number of categories will tend to underestimate the largest rates. In cases when the gamma model assumption is in doubt, rate estimates coming from the discrete rate distribution estimate with a large number of rate categories provide a robust alternative to gamma estimates. An alternative implementation of the gamma distribution is proposed that, for equal numbers of rate categories, is computationally more efficient during optimization than the standard gamma implementation and can provide more accurate estimates of site rates.

  19. Phylogenetic community structure: temporal variation in fish assemblage

    PubMed Central

    Santorelli, Sergio; Magnusson, William; Ferreira, Efrem; Caramaschi, Erica; Zuanon, Jansen; Amadio, Sidnéia

    2014-01-01

    Hypotheses about phylogenetic relationships among species allow inferences about the mechanisms that affect species coexistence. Nevertheless, most studies assume that phylogenetic patterns identified are stable over time. We used data on monthly samples of fish from a single lake over 10 years to show that the structure in phylogenetic assemblages varies over time and conclusions depend heavily on the time scale investigated. The data set was organized in guild structures and temporal scales (grouped at three temporal scales). Phylogenetic distance was measured as the mean pairwise distances (MPD) and as mean nearest-neighbor distance (MNTD). Both distances were based on counts of nodes. We compared the observed values of MPD and MNTD with values that were generated randomly using null model independent swap. A serial runs test was used to assess the temporal independence of indices over time. The phylogenetic pattern in the whole assemblage and the functional groups varied widely over time. Conclusions about phylogenetic clustering or dispersion depended on the temporal scales. Conclusions about the frequency with which biotic processes and environmental filters affect the local assembly do not depend only on taxonomic grouping and spatial scales. While these analyzes allow the assertion that all proposed patterns apply to the fish assemblages in the floodplain, the assessment of the relative importance of these processes, and how they vary depending on the temporal scale and functional group studied, cannot be determined with the effort commonly used. It appears that, at least in the system that we studied, the assemblages are forming and breaking continuously, resulting in various phylogeny-related structures that makes summarizing difficult. PMID:25360256

  20. A phylogenetic Kalman filter for ancestral trait reconstruction using molecular data.

    PubMed

    Lartillot, Nicolas

    2014-02-15

    Correlation between life history or ecological traits and genomic features such as nucleotide or amino acid composition can be used for reconstructing the evolutionary history of the traits of interest along phylogenies. Thus far, however, such ancestral reconstructions have been done using simple linear regression approaches that do not account for phylogenetic inertia. These reconstructions could instead be seen as a genuine comparative regression problem, such as formalized by classical generalized least-square comparative methods, in which the trait of interest and the molecular predictor are represented as correlated Brownian characters coevolving along the phylogeny. Here, a Bayesian sampler is introduced, representing an alternative and more efficient algorithmic solution to this comparative regression problem, compared with currently existing generalized least-square approaches. Technically, ancestral trait reconstruction based on a molecular predictor is shown to be formally equivalent to a phylogenetic Kalman filter problem, for which backward and forward recursions are developed and implemented in the context of a Markov chain Monte Carlo sampler. The comparative regression method results in more accurate reconstructions and a more faithful representation of uncertainty, compared with simple linear regression. Application to the reconstruction of the evolution of optimal growth temperature in Archaea, using GC composition in ribosomal RNA stems and amino acid composition of a sample of protein-coding genes, confirms previous findings, in particular, pointing to a hyperthermophilic ancestor for the kingdom. The program is freely available at www.phylobayes.org.

  1. Use of Whole-Genus Genome Sequence Data To Develop a Multilocus Sequence Typing Tool That Accurately Identifies Yersinia Isolates to the Species and Subspecies Levels

    PubMed Central

    Hall, Miquette; Chattaway, Marie A.; Reuter, Sandra; Savin, Cyril; Strauch, Eckhard; Carniel, Elisabeth; Connor, Thomas; Van Damme, Inge; Rajakaruna, Lakshani; Rajendram, Dunstan; Jenkins, Claire; Thomson, Nicholas R.

    2014-01-01

    The genus Yersinia is a large and diverse bacterial genus consisting of human-pathogenic species, a fish-pathogenic species, and a large number of environmental species. Recently, the phylogenetic and population structure of the entire genus was elucidated through the genome sequence data of 241 strains encompassing every known species in the genus. Here we report the mining of this enormous data set to create a multilocus sequence typing-based scheme that can identify Yersinia strains to the species level to a level of resolution equal to that for whole-genome sequencing. Our assay is designed to be able to accurately subtype the important human-pathogenic species Yersinia enterocolitica to whole-genome resolution levels. We also report the validation of the scheme on 386 strains from reference laboratory collections across Europe. We propose that the scheme is an important molecular typing system to allow accurate and reproducible identification of Yersinia isolates to the species level, a process often inconsistent in nonspecialist laboratories. Additionally, our assay is the most phylogenetically informative typing scheme available for Y. enterocolitica. PMID:25339391

  2. Enumerating all maximal frequent subtrees in collections of phylogenetic trees.

    PubMed

    Deepak, Akshay; Fernández-Baca, David

    2014-01-01

    A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees.

  3. Identification of a t(3;4)(p1.3;q1.5) translocation breakpoint in pigs using somatic cell hybrid mapping and high-resolution mate-pair sequencing

    PubMed Central

    Fève, Katia; Foissac, Sylvain; Pinton, Alain; Mompart, Florence; Esquerré, Diane; Faraut, Thomas; Yerle, Martine

    2017-01-01

    Reciprocal translocations are the most frequently occurring constitutional structural rearrangements in mammalian genomes. In phenotypically normal pigs, an incidence of 1/200 is estimated for such rearrangements. Even if constitutional translocations do not necessarily induce defects and diseases, they are responsible for significant economic losses in domestic animals due to reproduction failures. Over the last 30 years, advances in molecular and cytogenetic technologies have led to major improvements in the resolution of the characterization of translocation events. Characterization of translocation breakpoints helps to decipher the mechanisms that lead to such rearrangements and the functions of the genes that are involved in the translocation. Here, we describe the fine characterization of a reciprocal translocation t(3;4) (p1.3;q1.5) detected in a pig line. The breakpoint was identified at the base-pair level using a positional cloning and chromosome walking strategy in somatic cell hybrids that were generated from an animal that carries this translocation. We show that this translocation occurs within the ADAMTSL4 gene and results in a loss of expression in homozygous carriers. In addition, by taking this translocation as a model, we used a whole-genome next-generation mate-pair sequencing approach on pooled individuals to evaluate this strategy for high-throughput screening of structural rearrangements. PMID:29121641

  4. gyrB as a phylogenetic discriminator for members of the Bacillus anthracis-cereus-thuringiensis group

    NASA Technical Reports Server (NTRS)

    La Duc, Myron T.; Satomi, Masataka; Agata, Norio; Venkateswaran, Kasthuri

    2004-01-01

    Bacillus anthracis, the causative agent of the human disease anthrax, Bacillus cereus, a food-borne pathogen capable of causing human illness, and Bacillus thuringiensis, a well-characterized insecticidal toxin producer, all cluster together within a very tight clade (B. cereus group) phylogenetically and are indistinguishable from one another via 16S rDNA sequence analysis. As new pathogens are continually emerging, it is imperative to devise a system capable of rapidly and accurately differentiating closely related, yet phenotypically distinct species. Although the gyrB gene has proven useful in discriminating closely related species, its sequence analysis has not yet been validated by DNA:DNA hybridization, the taxonomically accepted "gold standard". We phylogenetically characterized the gyrB sequences of various species and serotypes encompassed in the "B. cereus group," including lab strains and environmental isolates. Results were compared to those obtained from analyses of phenotypic characteristics, 16S rDNA sequence, DNA:DNA hybridization, and virulence factors. The gyrB gene proved more highly differential than 16S, while, at the same time, as analytical as costly and laborious DNA:DNA hybridization techniques in differentiating species within the B. cereus group.

  5. Maximum parsimony, substitution model, and probability phylogenetic trees.

    PubMed

    Weng, J F; Thomas, D A; Mareels, I

    2011-01-01

    The problem of inferring phylogenies (phylogenetic trees) is one of the main problems in computational biology. There are three main methods for inferring phylogenies-Maximum Parsimony (MP), Distance Matrix (DM) and Maximum Likelihood (ML), of which the MP method is the most well-studied and popular method. In the MP method the optimization criterion is the number of substitutions of the nucleotides computed by the differences in the investigated nucleotide sequences. However, the MP method is often criticized as it only counts the substitutions observable at the current time and all the unobservable substitutions that really occur in the evolutionary history are omitted. In order to take into account the unobservable substitutions, some substitution models have been established and they are now widely used in the DM and ML methods but these substitution models cannot be used within the classical MP method. Recently the authors proposed a probability representation model for phylogenetic trees and the reconstructed trees in this model are called probability phylogenetic trees. One of the advantages of the probability representation model is that it can include a substitution model to infer phylogenetic trees based on the MP principle. In this paper we explain how to use a substitution model in the reconstruction of probability phylogenetic trees and show the advantage of this approach with examples.

  6. PrePhyloPro: phylogenetic profile-based prediction of whole proteome linkages

    PubMed Central

    Niu, Yulong; Liu, Chengcheng; Moghimyfiroozabad, Shayan; Yang, Yi

    2017-01-01

    Direct and indirect functional links between proteins as well as their interactions as part of larger protein complexes or common signaling pathways may be predicted by analyzing the correlation of their evolutionary patterns. Based on phylogenetic profiling, here we present a highly scalable and time-efficient computational framework for predicting linkages within the whole human proteome. We have validated this method through analysis of 3,697 human pathways and molecular complexes and a comparison of our results with the prediction outcomes of previously published co-occurrency model-based and normalization methods. Here we also introduce PrePhyloPro, a web-based software that uses our method for accurately predicting proteome-wide linkages. We present data on interactions of human mitochondrial proteins, verifying the performance of this software. PrePhyloPro is freely available at http://prephylopro.org/phyloprofile/. PMID:28875072

  7. Beyond trend analysis: How a modified breakpoint analysis enhances knowledge of agricultural production after Zimbabwe's fast track land reform

    NASA Astrophysics Data System (ADS)

    Hentze, Konrad; Thonfeld, Frank; Menz, Gunter

    2017-10-01

    In the discourse on land reform assessments, a significant lack of spatial and time-series data has been identified, especially with respect to Zimbabwe's ;Fast-Track Land Reform Programme; (FTLRP). At the same time, interest persists among land use change scientists to evaluate causes of land use change and therefore to increase the explanatory power of remote sensing products. This study recognizes these demands and aims to provide input on both levels: Evaluating the potential of satellite remote sensing time-series to answer questions which evolved after intensive land redistribution efforts in Zimbabwe; and investigating how time-series analysis of Normalized Difference Vegetation Index (NDVI) can be enhanced to provide information on land reform induced land use change. To achieve this, two time-series methods are applied to MODIS NDVI data: Seasonal Trend Analysis (STA) and Breakpoint Analysis for Additive Season and Trend (BFAST). In our first analysis, a link of agricultural productivity trends to different land tenure regimes shows that regional clustering of trends is more dominant than a relationship between tenure and trend with a slightly negative slope for all regimes. We demonstrate that clusters of strong negative and positive productivity trends are results of changing irrigation patterns. To locate emerging and fallow irrigation schemes in semi-arid Zimbabwe, a new multi-method approach is developed which allows to map changes from bimodal seasonal phenological patterns to unimodal and vice versa. With an enhanced breakpoint analysis through the combination of STA and BFAST, we are able to provide a technique that can be applied on large scale to map status and development of highly productive cropping systems, which are key for food production, national export and local employment. We therefore conclude that the combination of existing and accessible time-series analysis methods: is able to achieve both: overcoming demonstrated limitations of

  8. Fire modifies the phylogenetic structure of soil bacterial co-occurrence networks.

    PubMed

    Pérez-Valera, Eduardo; Goberna, Marta; Faust, Karoline; Raes, Jeroen; García, Carlos; Verdú, Miguel

    2017-01-01

    Fire alters ecosystems by changing the composition and community structure of soil microbes. The phylogenetic structure of a community provides clues about its main assembling mechanisms. While environmental filtering tends to reduce the community phylogenetic diversity by selecting for functionally (and hence phylogenetically) similar species, processes like competitive exclusion by limiting similarity tend to increase it by preventing the coexistence of functionally (and phylogenetically) similar species. We used co-occurrence networks to detect co-presence (bacteria that co-occur) or exclusion (bacteria that do not co-occur) links indicative of the ecological interactions structuring the community. We propose that inspecting the phylogenetic structure of co-presence or exclusion links allows to detect the main processes simultaneously assembling the community. We monitored a soil bacterial community after an experimental fire and found that fire altered its composition, richness and phylogenetic diversity. Both co-presence and exclusion links were more phylogenetically related than expected by chance. We interpret such a phylogenetic clustering in co-presence links as a result of environmental filtering, while that in exclusion links reflects competitive exclusion by limiting similarity. This suggests that environmental filtering and limiting similarity operate simultaneously to assemble soil bacterial communities, widening the traditional view that only environmental filtering structures bacterial communities. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.

  9. Tanglegrams for rooted phylogenetic trees and networks

    PubMed Central

    Scornavacca, Celine; Zickmann, Franziska; Huson, Daniel H.

    2011-01-01

    Motivation: In systematic biology, one is often faced with the task of comparing different phylogenetic trees, in particular in multi-gene analysis or cospeciation studies. One approach is to use a tanglegram in which two rooted phylogenetic trees are drawn opposite each other, using auxiliary lines to connect matching taxa. There is an increasing interest in using rooted phylogenetic networks to represent evolutionary history, so as to explicitly represent reticulate events, such as horizontal gene transfer, hybridization or reassortment. Thus, the question arises how to define and compute a tanglegram for such networks. Results: In this article, we present the first formal definition of a tanglegram for rooted phylogenetic networks and present a heuristic approach for computing one, called the NN-tanglegram method. We compare the performance of our method with existing tree tanglegram algorithms and also show a typical application to real biological datasets. For maximum usability, the algorithm does not require that the trees or networks are bifurcating or bicombining, or that they are on identical taxon sets. Availability: The algorithm is implemented in our program Dendroscope 3, which is freely available from www.dendroscope.org. Contact: scornava@informatik.uni-tuebingen.de; huson@informatik.uni-tuebingen.de PMID:21685078

  10. Species Divergence and Phylogenetic Variation of Ecophysiological Traits in Lianas and Trees

    PubMed Central

    Rios, Rodrigo S.; Salgado-Luarte, Cristian; Gianoli, Ernesto

    2014-01-01

    The climbing habit is an evolutionary key innovation in plants because it is associated with enhanced clade diversification. We tested whether patterns of species divergence and variation of three ecophysiological traits that are fundamental for plant adaptation to light environments (maximum photosynthetic rate [Amax], dark respiration rate [Rd], and specific leaf area [SLA]) are consistent with this key innovation. Using data reported from four tropical forests and three temperate forests, we compared phylogenetic distance among species as well as the evolutionary rate, phylogenetic distance and phylogenetic signal of those traits in lianas and trees. Estimates of evolutionary rates showed that Rd evolved faster in lianas, while SLA evolved faster in trees. The mean phylogenetic distance was 1.2 times greater among liana species than among tree species. Likewise, estimates of phylogenetic distance indicated that lianas were less related than by chance alone (phylogenetic evenness across 63 species), and trees were more related than expected by chance (phylogenetic clustering across 71 species). Lianas showed evenness for Rd, while trees showed phylogenetic clustering for this trait. In contrast, for SLA, lianas exhibited phylogenetic clustering and trees showed phylogenetic evenness. Lianas and trees showed patterns of ecophysiological trait variation among species that were independent of phylogenetic relatedness. We found support for the expected pattern of greater species divergence in lianas, but did not find consistent patterns regarding ecophysiological trait evolution and divergence. Rd followed the species-level pattern, i.e., greater divergence/evolution in lianas compared to trees, while the opposite occurred for SLA and no pattern was detected for Amax. Rd may have driven lianas' divergence across forest environments, and might contribute to diversification in climber clades. PMID:24914958

  11. Identifying nutrient reference sites in nutrient-enriched regions-Using algal, invertebrate, and fish-community measures to identify stressor-breakpoint thresholds in Indiana rivers and streams, 2005-9

    USGS Publications Warehouse

    Caskey, Brian J.; Bunch, Aubrey R.; Shoda, Megan E.; Frey, Jeffrey W.; Selvaratnam, Shivi; Miltner, Robert J.

    2013-01-01

    Excess nutrients in aquatic ecosystems can lead to shifts in species composition, reduced dissolved oxygen concentrations, fish kills, and toxic algal blooms. In this study, nutrients, periphyton chlorophyll a (CHLa), and invertebrate- and fishcommunity data collected during 2005-9 were analyzed from 318 sites on Indiana rivers and streams. The objective of this study was to determine which invertebrate and fish-taxa attributes best reflect the conditions of streams in Indiana along a gradient of nutrient concentrations by (1) determining statistically and ecologically significant relations among the stressor (total nitrogen, total phosphorus, and periphyton CHLa) and response (invertebrate and fish community) variables; and (2) determining the levels at which invertebrate- and fish-community measures change in response to nutrients or periphyton CHLa. For water samples at the headwater sites, total nitrogen (TN) concentrations ranged from 0.343 to 21.6 milligrams per liter (mg/L) (median 2.12 mg/L), total phosphorus (TP) concentrations ranged from 0.050 to 1.44 mg/L (median 0.093 mg/L), and periphyton CHLa ranged from 0.947 to 629 mg/L (median 69.7 mg/L). At the wadable sites, TN concentrations ranged from 0.340 to 10.0 mg/L (median 2.31 mg/L), TP concentrations ranged from 0.050 to 1.24 mg/L (median 0.110 mg/L), and periphyton CHLa ranged from 0.383 to 719 mg/L (median 44.7 mg/L). Recursive partitioning identified statistically significant low and high breakpoint thresholds on invertebrate and fish measures, which demonstrated the ecological response in enriched conditions. The combined community (invertebrate and fish) mean low and high TN breakpoint thresholds were 1.03 and 2.61 mg/L, respectively. The mean low and high breakpoint thresholds for TP were 0.083 and 0.144 mg/L, respectively. The mean low and high breakpoint thresholds for periphyton CHLa were 20.9 and 98.6 milligrams per square meter (mg/m2), respectively. Additive quantile regression analysis

  12. Impact of the revised penicillin susceptibility breakpoints for Streptococcus pneumoniae on antimicrobial resistance rates of meningeal and non-meningeal pneumococcal strains.

    PubMed

    Al-Waili, Badria R; Al-Thawadi, Sahar; Hajjar, Sami Al

    2013-01-01

    In January 2008, the Clinical Laboratory Standard Institute (CLSI) revised the Streptococcus pneumoniae breakpoints for penicillin to define the susceptibility of meningeal and non-meningeal isolates. We studied the impact of these changes. In addition, the pneumococcal resistance rate to other antimicrobial agents was reviewed. Laboratory data on peumococcal isolates collected retrospectively from hospitalized children in tertiary care hospital in Riyadh, Saudi Arabia from January 2006 to March 2012. Only sterile samples were included from cerebrospinal fluids, blood, sterile body fluids and surgical tissue. Other samples such as sputum and non sterile samples were excluded. We included samples from children 14 years old or younger. The minimum inhibitory concentration (MIC) for penicillin, cefuroxime, ceftriaxone and meropenem were determined by using the E-test, while susceptibility to erythromycin, cotrimoxazole and vancomycin were measured using the disc diffusion methods following the guideline of CLSI. Specimens were analyzed in two different periods: from January 2006 to December 2007 and from January 2008 to March 2012. During the two periods there were 208 samples of which 203 were blood samples. Full penicillin resistance was detected in 6.6% in the first period. There was decrease in penicillin nonmeningeal resistance to 1.5% and an increase in resistance in penicillin meningeal 68.2% in the second period (P=.0001). There was an increase in rate of resistance among S pneumoniae isolates over the two periods to parenteral cefuroxime, erythromycin and cotrimoxazole by 34.6%, 35.5% and 51.9%, respectively. Total meropenem resistance found 4.3% and no vancomycin resistance was detected. The current study supports the use of the revised CLSI susceptibility breakpoints that promote using penicillin to treat nonmeningeal pneumococcal disease, and might slow the development of resistance to broader-spectrum antibiotics.

  13. Enumerating all maximal frequent subtrees in collections of phylogenetic trees

    PubMed Central

    2014-01-01

    Background A common problem in phylogenetic analysis is to identify frequent patterns in a collection of phylogenetic trees. The goal is, roughly, to find a subset of the species (taxa) on which all or some significant subset of the trees agree. One popular method to do so is through maximum agreement subtrees (MASTs). MASTs are also used, among other things, as a metric for comparing phylogenetic trees, computing congruence indices and to identify horizontal gene transfer events. Results We give algorithms and experimental results for two approaches to identify common patterns in a collection of phylogenetic trees, one based on agreement subtrees, called maximal agreement subtrees, the other on frequent subtrees, called maximal frequent subtrees. These approaches can return subtrees on larger sets of taxa than MASTs, and can reveal new common phylogenetic relationships not present in either MASTs or the majority rule tree (a popular consensus method). Our current implementation is available on the web at https://code.google.com/p/mfst-miner/. Conclusions Our computational results confirm that maximal agreement subtrees and all maximal frequent subtrees can reveal a more complete phylogenetic picture of the common patterns in collections of phylogenetic trees than maximum agreement subtrees; they are also often more resolved than the majority rule tree. Further, our experiments show that enumerating maximal frequent subtrees is considerably more practical than enumerating ordinary (not necessarily maximal) frequent subtrees. PMID:25061474

  14. Phylogenetic fields through time: temporal dynamics of geographical co-occurrence and phylogenetic structure within species ranges

    PubMed Central

    Carotenuto, Francesco; Diniz-Filho, José Alexandre F.

    2016-01-01

    Species co-occur with different sets of other species across their geographical distribution, which can be either closely or distantly related. Such co-occurrence patterns and their phylogenetic structure within individual species ranges represent what we call the species phylogenetic fields (PFs). These PFs allow investigation of the role of historical processes—speciation, extinction and dispersal—in shaping species co-occurrence patterns, in both extinct and extant species. Here, we investigate PFs of large mammalian species during the last 3 Myr, and how these correlate with trends in diversification rates. Using the fossil record, we evaluate species' distributional and co-occurrence patterns along with their phylogenetic structure. We apply a novel Bayesian framework on fossil occurrences to estimate diversification rates through time. Our findings highlight the effect of evolutionary processes and past climatic changes on species' distributions and co-occurrences. From the Late Pliocene to the Recent, mammal species seem to have responded in an individualistic manner to climate changes and diversification dynamics, co-occurring with different sets of species from different lineages across their geographical ranges. These findings stress the difficulty of forecasting potential effects of future climate changes on biodiversity. PMID:26977061

  15. Phylogenetic fields through time: temporal dynamics of geographical co-occurrence and phylogenetic structure within species ranges.

    PubMed

    Villalobos, Fabricio; Carotenuto, Francesco; Raia, Pasquale; Diniz-Filho, José Alexandre F

    2016-04-05

    Species co-occur with different sets of other species across their geographical distribution, which can be either closely or distantly related. Such co-occurrence patterns and their phylogenetic structure within individual species ranges represent what we call the species phylogenetic fields (PFs). These PFs allow investigation of the role of historical processes--speciation, extinction and dispersal--in shaping species co-occurrence patterns, in both extinct and extant species. Here, we investigate PFs of large mammalian species during the last 3 Myr, and how these correlate with trends in diversification rates. Using the fossil record, we evaluate species' distributional and co-occurrence patterns along with their phylogenetic structure. We apply a novel Bayesian framework on fossil occurrences to estimate diversification rates through time. Our findings highlight the effect of evolutionary processes and past climatic changes on species' distributions and co-occurrences. From the Late Pliocene to the Recent, mammal species seem to have responded in an individualistic manner to climate changes and diversification dynamics, co-occurring with different sets of species from different lineages across their geographical ranges. These findings stress the difficulty of forecasting potential effects of future climate changes on biodiversity. © 2016 The Author(s).

  16. Phylogenetic Analyses: A Toolbox Expanding towards Bayesian Methods

    PubMed Central

    Aris-Brosou, Stéphane; Xia, Xuhua

    2008-01-01

    The reconstruction of phylogenies is becoming an increasingly simple activity. This is mainly due to two reasons: the democratization of computing power and the increased availability of sophisticated yet user-friendly software. This review describes some of the latest additions to the phylogenetic toolbox, along with some of their theoretical and practical limitations. It is shown that Bayesian methods are under heavy development, as they offer the possibility to solve a number of long-standing issues and to integrate several steps of the phylogenetic analyses into a single framework. Specific topics include not only phylogenetic reconstruction, but also the comparison of phylogenies, the detection of adaptive evolution, and the estimation of divergence times between species. PMID:18483574

  17. Applying phylogenetic analysis to viral livestock diseases: moving beyond molecular typing.

    PubMed

    Olvera, Alex; Busquets, Núria; Cortey, Marti; de Deus, Nilsa; Ganges, Llilianne; Núñez, José Ignacio; Peralta, Bibiana; Toskano, Jennifer; Dolz, Roser

    2010-05-01

    Changes in livestock production systems in recent years have altered the presentation of many diseases resulting in the need for more sophisticated control measures. At the same time, new molecular assays have been developed to support the diagnosis of animal viral disease. Nucleotide sequences generated by these diagnostic techniques can be used in phylogenetic analysis to infer phenotypes by sequence homology and to perform molecular epidemiology studies. In this review, some key elements of phylogenetic analysis are highlighted, such as the selection of the appropriate neutral phylogenetic marker, the proper phylogenetic method and different techniques to test the reliability of the resulting tree. Examples are given of current and future applications of phylogenetic reconstructions in viral livestock diseases. Copyright 2009 Elsevier Ltd. All rights reserved.

  18. Phylogenetic analysis of Common Garter Snake (Thamnophis sirtalis) stomach contents detects cryptic range of a secretive salamander (Ensatina eschscholtzii oregonensis) Herpetological Conservation and Biology 5(3):395–402

    Treesearch

    Sean B. Reilly; Andrew D Gottsho; Justin M. Garwood; Bryan Jennings

    2010-01-01

    Given the current global amphibian decline, it is crucial to obtain accurate and current information regarding species distributions. Secretive amphibians such as plethodontid salamanders can be difficult to detect in many cases, especially in remote, high elevation areas. We used molecular phylogenetic analyses to identify three partially digested salamanders palped...

  19. One tree to link them all: a phylogenetic dataset for the European tetrapoda.

    PubMed

    Roquet, Cristina; Lavergne, Sébastien; Thuiller, Wilfried

    2014-08-08

    Since the ever-increasing availability of phylogenetic informative data, the last decade has seen an upsurge of ecological studies incorporating information on evolutionary relationships among species. However, detailed species-level phylogenies are still lacking for many large groups and regions, which are necessary for comprehensive large-scale eco-phylogenetic analyses. Here, we provide a dataset of 100 dated phylogenetic trees for all European tetrapods based on a mixture of supermatrix and supertree approaches. Phylogenetic inference was performed separately for each of the main Tetrapoda groups of Europe except mammals (i.e. amphibians, birds, squamates and turtles) by means of maximum likelihood (ML) analyses of supermatrix applying a tree constraint at the family (amphibians and squamates) or order (birds and turtles) levels based on consensus knowledge. For each group, we inferred 100 ML trees to be able to provide a phylogenetic dataset that accounts for phylogenetic uncertainty, and assessed node support with bootstrap analyses. Each tree was dated using penalized-likelihood and fossil calibration. The trees obtained were well-supported by existing knowledge and previous phylogenetic studies. For mammals, we modified the most complete supertree dataset available on the literature to include a recent update of the Carnivora clade. As a final step, we merged the phylogenetic trees of all groups to obtain a set of 100 phylogenetic trees for all European Tetrapoda species for which data was available (91%). We provide this phylogenetic dataset (100 chronograms) for the purpose of comparative analyses, macro-ecological or community ecology studies aiming to incorporate phylogenetic information while accounting for phylogenetic uncertainty.

  20. Assessing the relationships between phylogenetic and functional singularities in sharks (Chondrichthyes).

    PubMed

    Cachera, Marie; Le Loc'h, François

    2017-08-01

    The relationships between diversity and ecosystem functioning have become a major focus of science. A crucial issue is to estimate functional diversity, as it is intended to impact ecosystem dynamics and stability. However, depending on the ecosystem, it may be challenging or even impossible to directly measure ecological functions and thus functional diversity. Phylogenetic diversity was recently under consideration as a proxy for functional diversity. Phylogenetic diversity is indeed supposed to match functional diversity if functions are conservative traits along evolution. However, in case of adaptive radiation and/or evolutive convergence, a mismatch may appear between species phylogenetic and functional singularities. Using highly threatened taxa, sharks, this study aimed to explore the relationships between phylogenetic and functional diversities and singularities. Different statistical computations were used in order to test both methodological issue (phylogenetic reconstruction) and overall a theoretical questioning: the predictive power of phylogeny for function diversity. Despite these several methodological approaches, a mismatch between phylogeny and function was highlighted. This mismatch revealed that (i) functions are apparently nonconservative in shark species, and (ii) phylogenetic singularity is not a proxy for functional singularity. Functions appeared to be not conservative along the evolution of sharks, raising the conservational challenge to identify and protect both phylogenetic and functional singular species. Facing the current rate of species loss, it is indeed of major importance to target phylogenetically singular species to protect genetic diversity and also functionally singular species in order to maintain particular functions within ecosystem.

  1. Species divergence and phylogenetic variation of ecophysiological traits in lianas and trees.

    PubMed

    Rios, Rodrigo S; Salgado-Luarte, Cristian; Gianoli, Ernesto

    2014-01-01

    The climbing habit is an evolutionary key innovation in plants because it is associated with enhanced clade diversification. We tested whether patterns of species divergence and variation of three ecophysiological traits that are fundamental for plant adaptation to light environments (maximum photosynthetic rate [A(max)], dark respiration rate [R(d)], and specific leaf area [SLA]) are consistent with this key innovation. Using data reported from four tropical forests and three temperate forests, we compared phylogenetic distance among species as well as the evolutionary rate, phylogenetic distance and phylogenetic signal of those traits in lianas and trees. Estimates of evolutionary rates showed that R(d) evolved faster in lianas, while SLA evolved faster in trees. The mean phylogenetic distance was 1.2 times greater among liana species than among tree species. Likewise, estimates of phylogenetic distance indicated that lianas were less related than by chance alone (phylogenetic evenness across 63 species), and trees were more related than expected by chance (phylogenetic clustering across 71 species). Lianas showed evenness for R(d), while trees showed phylogenetic clustering for this trait. In contrast, for SLA, lianas exhibited phylogenetic clustering and trees showed phylogenetic evenness. Lianas and trees showed patterns of ecophysiological trait variation among species that were independent of phylogenetic relatedness. We found support for the expected pattern of greater species divergence in lianas, but did not find consistent patterns regarding ecophysiological trait evolution and divergence. R(d) followed the species-level pattern, i.e., greater divergence/evolution in lianas compared to trees, while the opposite occurred for SLA and no pattern was detected for A(max). R(d) may have driven lianas' divergence across forest environments, and might contribute to diversification in climber clades.

  2. Cnidarian phylogenetic relationships as revealed by mitogenomics.

    PubMed

    Kayal, Ehsan; Roure, Béatrice; Philippe, Hervé; Collins, Allen G; Lavrov, Dennis V

    2013-01-09

    Cnidaria (corals, sea anemones, hydroids, jellyfish) is a phylum of relatively simple aquatic animals characterized by the presence of the cnidocyst: a cell containing a giant capsular organelle with an eversible tubule (cnida). Species within Cnidaria have life cycles that involve one or both of the two distinct body forms, a typically benthic polyp, which may or may not be colonial, and a typically pelagic mostly solitary medusa. The currently accepted taxonomic scheme subdivides Cnidaria into two main assemblages: Anthozoa (Hexacorallia + Octocorallia) - cnidarians with a reproductive polyp and the absence of a medusa stage - and Medusozoa (Cubozoa, Hydrozoa, Scyphozoa, Staurozoa) - cnidarians that usually possess a reproductive medusa stage. Hypothesized relationships among these taxa greatly impact interpretations of cnidarian character evolution. We expanded the sampling of cnidarian mitochondrial genomes, particularly from Medusozoa, to reevaluate phylogenetic relationships within Cnidaria. Our phylogenetic analyses based on a mitochogenomic dataset support many prior hypotheses, including monophyly of Hexacorallia, Octocorallia, Medusozoa, Cubozoa, Staurozoa, Hydrozoa, Carybdeida, Chirodropida, and Hydroidolina, but reject the monophyly of Anthozoa, indicating that the Octocorallia + Medusozoa relationship is not the result of sampling bias, as proposed earlier. Further, our analyses contradict Scyphozoa [Discomedusae + Coronatae], Acraspeda [Cubozoa + Scyphozoa], as well as the hypothesis that Staurozoa is the sister group to all the other medusozoans. Cnidarian mitochondrial genomic data contain phylogenetic signal informative for understanding the evolutionary history of this phylum. Mitogenome-based phylogenies, which reject the monophyly of Anthozoa, provide further evidence for the polyp-first hypothesis. By rejecting the traditional Acraspeda and Scyphozoa hypotheses, these analyses suggest that the shared morphological characters in

  3. A review of criticisms of phylogenetic nomenclature: is taxonomic freedom the fundamental issue?

    PubMed

    Bryant, Harold N; Cantino, Philip D

    2002-02-01

    The proposal to implement a phylogenetic nomenclatural system governed by the PhyloCode), in which taxon names are defined by explicit reference to common descent, has met with strong criticism from some proponents of phylogenetic taxonomy (taxonomy based on the principle of common descent in which only clades and species are recognized). We examine these criticisms and find that some of the perceived problems with phylogenetic nomenclature are based on misconceptions, some are equally true of the current rank-based nomenclatural system, and some will be eliminated by implementation of the PhyloCode. Most of the criticisms are related to an overriding concern that, because the meanings of names are associated with phylogenetic pattern which is subject to change, the adoption of phylogenetic nomenclature will lead to increased instability in the content of taxa. This concern is associated with the fact that, despite the widespread adoption of the view that taxa are historical entities that are conceptualized based on ancestry, many taxonomists also conceptualize taxa based on their content. As a result, critics of phylogenetic nomenclature have argued that taxonomists should be free to emend the content of taxa without constraints imposed by nomenclatural decisions. However, in phylogenetic nomenclature the contents of taxa are determined, not by the taxonomist, but by the combination of the phylogenetic definition of the name and a phylogenetic hypothesis. Because the contents of taxa, once their names are defined, can no longer be freely modified by taxonomists, phylogenetic nomenclature is perceived as limiting taxonomic freedom. We argue that the form of taxonomic freedom inherent to phylogenetic nomenclature is appropriate to phylogenetic taxonomy in which taxa are considered historical entities that are discovered through phylogenetic analysis and are not human constructs.

  4. MICCA: a complete and accurate software for taxonomic profiling of metagenomic data.

    PubMed

    Albanese, Davide; Fontana, Paolo; De Filippo, Carlotta; Cavalieri, Duccio; Donati, Claudio

    2015-05-19

    The introduction of high throughput sequencing technologies has triggered an increase of the number of studies in which the microbiota of environmental and human samples is characterized through the sequencing of selected marker genes. While experimental protocols have undergone a process of standardization that makes them accessible to a large community of scientist, standard and robust data analysis pipelines are still lacking. Here we introduce MICCA, a software pipeline for the processing of amplicon metagenomic datasets that efficiently combines quality filtering, clustering of Operational Taxonomic Units (OTUs), taxonomy assignment and phylogenetic tree inference. MICCA provides accurate results reaching a good compromise among modularity and usability. Moreover, we introduce a de-novo clustering algorithm specifically designed for the inference of Operational Taxonomic Units (OTUs). Tests on real and synthetic datasets shows that thanks to the optimized reads filtering process and to the new clustering algorithm, MICCA provides estimates of the number of OTUs and of other common ecological indices that are more accurate and robust than currently available pipelines. Analysis of public metagenomic datasets shows that the higher consistency of results improves our understanding of the structure of environmental and human associated microbial communities. MICCA is an open source project.

  5. Student interpretations of phylogenetic trees in an introductory biology course.

    PubMed

    Dees, Jonathan; Momsen, Jennifer L; Niemi, Jarad; Montplaisir, Lisa

    2014-01-01

    Phylogenetic trees are widely used visual representations in the biological sciences and the most important visual representations in evolutionary biology. Therefore, phylogenetic trees have also become an important component of biology education. We sought to characterize reasoning used by introductory biology students in interpreting taxa relatedness on phylogenetic trees, to measure the prevalence of correct taxa-relatedness interpretations, and to determine how student reasoning and correctness change in response to instruction and over time. Counting synapomorphies and nodes between taxa were the most common forms of incorrect reasoning, which presents a pedagogical dilemma concerning labeled synapomorphies on phylogenetic trees. Students also independently generated an alternative form of correct reasoning using monophyletic groups, the use of which decreased in popularity over time. Approximately half of all students were able to correctly interpret taxa relatedness on phylogenetic trees, and many memorized correct reasoning without understanding its application. Broad initial instruction that allowed students to generate inferences on their own contributed very little to phylogenetic tree understanding, while targeted instruction on evolutionary relationships improved understanding to some extent. Phylogenetic trees, which can directly affect student understanding of evolution, appear to offer introductory biology instructors a formidable pedagogical challenge. © 2014 J. Dees et al. CBE—Life Sciences Education © 2014 The American Society for Cell Biology. This article is distributed by The American Society for Cell Biology under license from the author(s). It is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).

  6. Using Genotype Abundance to Improve Phylogenetic Inference

    PubMed Central

    Mesin, Luka; Victora, Gabriel D; Minin, Vladimir N; Matsen, Frederick A

    2018-01-01

    Abstract Modern biological techniques enable very dense genetic sampling of unfolding evolutionary histories, and thus frequently sample some genotypes multiple times. This motivates strategies to incorporate genotype abundance information in phylogenetic inference. In this article, we synthesize a stochastic process model with standard sequence-based phylogenetic optimality, and show that tree estimation is substantially improved by doing so. Our method is validated with extensive simulations and an experimental single-cell lineage tracing study of germinal center B cell receptor affinity maturation. PMID:29474671

  7. Posterior Predictive Bayesian Phylogenetic Model Selection

    PubMed Central

    Lewis, Paul O.; Xie, Wangang; Chen, Ming-Hui; Fan, Yu; Kuo, Lynn

    2014-01-01

    We present two distinctly different posterior predictive approaches to Bayesian phylogenetic model selection and illustrate these methods using examples from green algal protein-coding cpDNA sequences and flowering plant rDNA sequences. The Gelfand–Ghosh (GG) approach allows dissection of an overall measure of model fit into components due to posterior predictive variance (GGp) and goodness-of-fit (GGg), which distinguishes this method from the posterior predictive P-value approach. The conditional predictive ordinate (CPO) method provides a site-specific measure of model fit useful for exploratory analyses and can be combined over sites yielding the log pseudomarginal likelihood (LPML) which is useful as an overall measure of model fit. CPO provides a useful cross-validation approach that is computationally efficient, requiring only a sample from the posterior distribution (no additional simulation is required). Both GG and CPO add new perspectives to Bayesian phylogenetic model selection based on the predictive abilities of models and complement the perspective provided by the marginal likelihood (including Bayes Factor comparisons) based solely on the fit of competing models to observed data. [Bayesian; conditional predictive ordinate; CPO; L-measure; LPML; model selection; phylogenetics; posterior predictive.] PMID:24193892

  8. Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction.

    PubMed

    Mai, Uyen; Sayyari, Erfan; Mirarab, Siavash

    2017-01-01

    Phylogenetic trees inferred using commonly-used models of sequence evolution are unrooted, but the root position matters both for interpretation and downstream applications. This issue has been long recognized; however, whether the potential for discordance between the species tree and gene trees impacts methods of rooting a phylogenetic tree has not been extensively studied. In this paper, we introduce a new method of rooting a tree based on its branch length distribution; our method, which minimizes the variance of root to tip distances, is inspired by the traditional midpoint rerooting and is justified when deviations from the strict molecular clock are random. Like midpoint rerooting, the method can be implemented in a linear time algorithm. In extensive simulations that consider discordance between gene trees and the species tree, we show that the new method is more accurate than midpoint rerooting, but its relative accuracy compared to using outgroups to root gene trees depends on the size of the dataset and levels of deviations from the strict clock. We show high levels of error for all methods of rooting estimated gene trees due to factors that include effects of gene tree discordance, deviations from the clock, and gene tree estimation error. Our simulations, however, did not reveal significant differences between two equivalent methods for species tree estimation that use rooted and unrooted input, namely, STAR and NJst. Nevertheless, our results point to limitations of existing scalable rooting methods.

  9. Minimum variance rooting of phylogenetic trees and implications for species tree reconstruction

    PubMed Central

    Sayyari, Erfan; Mirarab, Siavash

    2017-01-01

    Phylogenetic trees inferred using commonly-used models of sequence evolution are unrooted, but the root position matters both for interpretation and downstream applications. This issue has been long recognized; however, whether the potential for discordance between the species tree and gene trees impacts methods of rooting a phylogenetic tree has not been extensively studied. In this paper, we introduce a new method of rooting a tree based on its branch length distribution; our method, which minimizes the variance of root to tip distances, is inspired by the traditional midpoint rerooting and is justified when deviations from the strict molecular clock are random. Like midpoint rerooting, the method can be implemented in a linear time algorithm. In extensive simulations that consider discordance between gene trees and the species tree, we show that the new method is more accurate than midpoint rerooting, but its relative accuracy compared to using outgroups to root gene trees depends on the size of the dataset and levels of deviations from the strict clock. We show high levels of error for all methods of rooting estimated gene trees due to factors that include effects of gene tree discordance, deviations from the clock, and gene tree estimation error. Our simulations, however, did not reveal significant differences between two equivalent methods for species tree estimation that use rooted and unrooted input, namely, STAR and NJst. Nevertheless, our results point to limitations of existing scalable rooting methods. PMID:28800608

  10. Phylogenetic approaches reveal biodiversity threats under climate change

    NASA Astrophysics Data System (ADS)

    González-Orozco, Carlos E.; Pollock, Laura J.; Thornhill, Andrew H.; Mishler, Brent D.; Knerr, Nunzio; Laffan, Shawn W.; Miller, Joseph T.; Rosauer, Dan F.; Faith, Daniel P.; Nipperess, David A.; Kujala, Heini; Linke, Simon; Butt, Nathalie; Külheim, Carsten; Crisp, Michael D.; Gruber, Bernd

    2016-12-01

    Predicting the consequences of climate change for biodiversity is critical to conservation efforts. Extensive range losses have been predicted for thousands of individual species, but less is known about how climate change might impact whole clades and landscape-scale patterns of biodiversity. Here, we show that climate change scenarios imply significant changes in phylogenetic diversity and phylogenetic endemism at a continental scale in Australia using the hyper-diverse clade of eucalypts. We predict that within the next 60 years the vast majority of species distributions (91%) across Australia will shrink in size (on average by 51%) and shift south on the basis of projected suitable climatic space. Geographic areas currently with high phylogenetic diversity and endemism are predicted to change substantially in future climate scenarios. Approximately 90% of the current areas with concentrations of palaeo-endemism (that is, places with old evolutionary diversity) are predicted to disappear or shift their location. These findings show that climate change threatens whole clades of the phylogenetic tree, and that the outlined approach can be used to forecast areas of biodiversity losses and continental-scale impacts of climate change.

  11. Distance-Based Phylogenetic Methods Around a Polytomy.

    PubMed

    Davidson, Ruth; Sullivant, Seth

    2014-01-01

    Distance-based phylogenetic algorithms attempt to solve the NP-hard least-squares phylogeny problem by mapping an arbitrary dissimilarity map representing biological data to a tree metric. The set of all dissimilarity maps is a Euclidean space properly containing the space of all tree metrics as a polyhedral fan. Outputs of distance-based tree reconstruction algorithms such as UPGMA and neighbor-joining are points in the maximal cones in the fan. Tree metrics with polytomies lie at the intersections of maximal cones. A phylogenetic algorithm divides the space of all dissimilarity maps into regions based upon which combinatorial tree is reconstructed by the algorithm. Comparison of phylogenetic methods can be done by comparing the geometry of these regions. We use polyhedral geometry to compare the local nature of the subdivisions induced by least-squares phylogeny, UPGMA, and neighbor-joining when the true tree has a single polytomy with exactly four neighbors. Our results suggest that in some circumstances, UPGMA and neighbor-joining poorly match least-squares phylogeny.

  12. The power and pitfalls of HIV phylogenetics in public health.

    PubMed

    Brooks, James I; Sandstrom, Paul A

    2013-07-25

    Phylogenetics is the application of comparative studies of genetic sequences in order to infer evolutionary relationships among organisms. This tool can be used as a form of molecular epidemiology to enhance traditional population-level communicable disease surveillance. Phylogenetic study has resulted in new paradigms being created in the field of communicable diseases and this commentary aims to provide the reader with an explanation of how phylogenetics can be used in tracking infectious diseases. Special emphasis will be placed upon the application of phylogenetics as a tool to help elucidate HIV transmission patterns and the limitations to these methods when applied to forensic analysis. Understanding infectious disease epidemiology in order to prevent new transmissions is the sine qua non of public health. However, with increasing epidemiological resolution, there may be an associated potential loss of privacy to the individual. It is within this context that we aim to promote the discussion on how to use phylogenetics to achieve important public health goals, while at the same time protecting the rights of the individual.

  13. Phylogenetic Pattern, Evolutionary Processes and Species Delimitation in the Genus Echinococcus.

    PubMed

    Lymbery, A J

    2017-01-01

    An accurate and stable alpha taxonomy requires a clear conception of what constitutes a species and agreed criteria for delimiting different species. An evolutionary or general lineage concept defines a species as a single lineage of organisms with a common evolutionary trajectory, distinguishable from other such lineages. Delimiting evolutionary species is a two-step process. In the first step, phylogenetic reconstruction identifies putative species as groups of organisms that are monophyletic (share a common ancestor) and exclusive (more closely related to each other than to organisms outside the group). The second step is to assess whether members of the group possess genetic exchangeability (where cohesion is maintained by gene flow among populations) or ecological exchangeability (where cohesion is maintained because populations occupy the same ecological niche). Recent taxonomic reviews have recognized nine species within the genus Echinococcus. Phylogenetic reconstructions of the relationships between these putative species using mtDNA and nuclear gene sequences show that for the most part these nine species are monophyletic, although there are important incongruences that need to be resolved. Applying the criteria of genetic and ecological exchangeability suggests that seven of the currently recognized species represent evolutionarily distinct lineages. The species status of Echinococcus canadensis and Echinococcus ortleppi could not be confirmed. Coalescent-based analyses represent a promising approach to species delimitation in these closely related taxa. It seems likely, from a comparison of sister species groups, that speciation in the genus has been driven by geographic isolation, but biogeographic scenarios are largely speculative and require further testing. Copyright © 2017 Elsevier Ltd. All rights reserved.

  14. [Determination of in vitro susceptibility of Candida species to amphotericin B by E-test and previously proposed MIC breakpoints on two different media].

    PubMed

    Alp, Sehnaz; Sancak, Banu; Arikan, Sevtap

    2008-04-01

    Although much work has concentrated on defining a reliable and reproducible method for determining in vitro susceptibility of Candida species to amphotericin B, there still has been limitations of the proposed techniques. In this study, amphotericin B minimal inhibitory concentrations (MIC) and susceptibility categories of 212 Candida strains (57 C. glabrata, 53 C. lusitaniae, 51 C. krusei and 51 C. tropicalis) were determined by E-test on RPMI agar (RPG) and antibiotic medium 3 agar (AM3) both supplemented with 2% glucose. The results were interpreted according to the proposed MIC breakpoints (> or = 0.38 microg/ml on RPG, >1 microg/ml on AM3) and discrepancies between susceptibility categories were investigated. While all Candida strains included in the study were determined to be susceptible on AM3 by amphotericin B E-test at 48h, 36.3% of the isolates were classified as resistant on RPG at 48 hours. On RPG, C. krusei strains showed the highest resistance rate (94.1% at 48 h), followed by C. tropicalis (35.3% at 48 h) and C. glabrata (17.5% at 48h). At 48h of incubation, 98.1% of C. lusitaniae isolates were found to be susceptible on RPG. The categorical agreement rates between the results obtained on two media and for C. lusitaniae and C. glabrata were 98.1% and 82.5% at 48 hours. For C. tropicalis and C. krusei, the rates of agreement were 64.7% and 5.9% at 48 hours. Conclusively, according to the previously proposed MIC breakpoints for amphotericin B E-test on RPG and AM3, discrepancies between susceptibility categories of Candida species were of remarkable significance.

  15. A Universal Phylogenetic Tree.

    ERIC Educational Resources Information Center

    Offner, Susan

    2001-01-01

    Presents a universal phylogenetic tree suitable for use in high school and college-level biology classrooms. Illustrates the antiquity of life and that all life is related, even if it dates back 3.5 billion years. Reflects important evolutionary relationships and provides an exciting way to learn about the history of life. (SAH)

  16. Phylogenetics of modern birds in the era of genomics

    PubMed Central

    Edwards, Scott V; Bryan Jennings, W; Shedlock, Andrew M

    2005-01-01

    In the 14 years since the first higher-level bird phylogenies based on DNA sequence data, avian phylogenetics has witnessed the advent and maturation of the genomics era, the completion of the chicken genome and a suite of technologies that promise to add considerably to the agenda of avian phylogenetics. In this review, we summarize current approaches and data characteristics of recent higher-level bird studies and suggest a number of as yet untested molecular and analytical approaches for the unfolding tree of life for birds. A variety of comparative genomics strategies, including adoption of objective quality scores for sequence data, analysis of contiguous DNA sequences provided by large-insert genomic libraries, and the systematic use of retroposon insertions and other rare genomic changes all promise an integrated phylogenetics that is solidly grounded in genome evolution. The avian genome is an excellent testing ground for such approaches because of the more balanced representation of single-copy and repetitive DNA regions than in mammals. Although comparative genomics has a number of obvious uses in avian phylogenetics, its application to large numbers of taxa poses a number of methodological and infrastructural challenges, and can be greatly facilitated by a ‘community genomics’ approach in which the modest sequencing throughputs of single PI laboratories are pooled to produce larger, complementary datasets. Although the polymerase chain reaction era of avian phylogenetics is far from complete, the comparative genomics era—with its ability to vastly increase the number and type of molecular characters and to provide a genomic context for these characters—will usher in a host of new perspectives and opportunities for integrating genome evolution and avian phylogenetics. PMID:16024355

  17. Pharmacokinetic-pharmacodynamic (PK-PD) modeling and the rational selection of dosage regimes for the prudent use of antimicrobial drugs.

    PubMed

    Papich, Mark G

    2014-07-16

    One of the strategies to decrease inappropriate antimicrobial use in veterinary medicine is to apply pharmacokinetic-pharmacodynamic (PK-PD) principles to dosing regimens. If antimicrobials are used appropriately by applying these principles to attain targets for area-under-the-curve to MIC ratio (AUC/MIC), peak concentration to MIC ratio (CMAX/MIC), and time above MIC (T>MIC), more effective antibiotic therapy is possible, thus avoiding ineffective administration. Another mechanism whereby inappropriate antibiotic administration can be avoided is to use accurate Interpretive Criteria established by the Clinical Laboratory Standards Institute (CLSI) for breakpoint selection. Inaccurate breakpoints will encourage antibiotic administration that is likely to be ineffective. For newly approved antimicrobials, three criteria are used for determining breakpoints: PK-PD criteria, MIC distributions, and clinical response. For older (often generic drugs) evaluated by the CLSI, recent clinical data may not be available and breakpoints are derived from PK-PD principles, wild-type distributions, and Monte Carlo simulations. It is the goal of the CLSI subcommittee that these revised breakpoints will encourage more effective antimicrobial use and avoid unnecessary antimicrobial administration. Copyright © 2014 Elsevier B.V. All rights reserved.

  18. Antimicrobial susceptibility of gram-negative pathogens isolated from patients with complicated intra-abdominal infections in South African hospitals (SMART Study 2004-2009): impact of the new carbapenem breakpoints.

    PubMed

    Brink, Adrian J; Botha, Roelof F; Poswa, Xoliswa; Senekal, Marthinus; Badal, Robert E; Grolman, David C; Richards, Guy A; Feldman, Charles; Boffard, Kenneth D; Veller, Martin; Joubert, Ivan; Pretorius, Jan

    2012-02-01

    The Study for Monitoring Antimicrobial Resistance Trends (SMART) follows trends in resistance among aerobic and facultative anaerobic gram-negative bacilli (GNB) isolated from complicated intra-abdominal infections (cIAIs) in patients around the world. During 2004-2009, three centralized clinical microbiology laboratories serving 59 private hospitals in three large South African cities collected 1,218 GNB from complicated intra-abdominal infections (cIAIs) and tested them for susceptibility to 12 antibiotics according to the 2011 Clinical Laboratory Standards Institute (CLSI) guidelines. Enterobacteriaceae comprised 83.7% of the isolates. Escherichia coli was the species isolated most commonly (46.4%), and 7.6% of these were extended-spectrum β-lactamase (ESBL)-positive. The highest ESBL rate was documented for Klebsiella pneumoniae (41.2%). Overall, ertapenem was the antibiotic most active against susceptible species for which it has breakpoints (94.6%) followed by amikacin (91.9%), piperacillin-tazobactam (89.3%), and imipenem-cilastatin (87.1%), whereas rates of resistance to ceftriaxone, cefotaxime, ciprofloxacin, and levofloxacin were documented to be 29.7%, 28.7%, 22.5%, and 21.1%, respectively. Multi-drug resistance (MDR), defined as resistance to three or more antibiotic classes, was significantly more common in K. pneumoniae (27.9%) than in E. coli (4.9%; p<0.0001) or Proteus mirabilis (4.1%; p<0.05). Applying the new CLSI breakpoints for carbapenems, susceptibility to ertapenem was reduced significantly in ESBL-positive E. coli compared with ESBL-negative isolates (91% vs. 98%; p<0.05), but this did not apply to imipenem-cilastatin (95% vs. 99%; p=0.0928). A large disparity between imipenem-cilastatin and ertapenem susceptibility in P. mirabilis and Morganella morganii was documented (24% vs. 96% and 15% vs. 92%, respectively), as most isolates of these two species had imipenem-cilastatin minimum inhibitory concentrations in the 2-4 mcg/mL range, which

  19. Phylogenetic congruence between subtropical trees and their associated fungi.

    PubMed

    Liu, Xubing; Liang, Minxia; Etienne, Rampal S; Gilbert, Gregory S; Yu, Shixiao

    2016-12-01

    Recent studies have detected phylogenetic signals in pathogen-host networks for both soil-borne and leaf-infecting fungi, suggesting that pathogenic fungi may track or coevolve with their preferred hosts. However, a phylogenetically concordant relationship between multiple hosts and multiple fungi in has rarely been investigated. Using next-generation high-throughput DNA sequencing techniques, we analyzed fungal taxa associated with diseased leaves, rotten seeds, and infected seedlings of subtropical trees. We compared the topologies of the phylogenetic trees of the soil and foliar fungi based on the internal transcribed spacer (ITS) region with the phylogeny of host tree species based on matK , rbcL , atpB, and 5.8S genes. We identified 37 foliar and 103 soil pathogenic fungi belonging to the Ascomycota and Basidiomycota phyla and detected significantly nonrandom host-fungus combinations, which clustered on both the fungus phylogeny and the host phylogeny. The explicit evidence of congruent phylogenies between tree hosts and their potential fungal pathogens suggests either diffuse coevolution among the plant-fungal interaction networks or that the distribution of fungal species tracked spatially associated hosts with phylogenetically conserved traits and habitat preferences. Phylogenetic conservatism in plant-fungal interactions within a local community promotes host and parasite specificity, which is integral to the important role of fungi in promoting species coexistence and maintaining biodiversity of forest communities.

  20. Turnover of plant lineages shapes herbivore phylogenetic beta diversity along ecological gradients.

    PubMed

    Pellissier, Loïc; Ndiribe, Charlotte; Dubuis, Anne; Pradervand, Jean-Nicolas; Salamin, Nicolas; Guisan, Antoine; Rasmann, Sergio

    2013-05-01

    Understanding drivers of biodiversity patterns is of prime importance in this era of severe environmental crisis. More diverse plant communities have been postulated to represent a larger functional trait-space, more likely to sustain a diverse assembly of herbivore species. Here, we expand this hypothesis to integrate environmental, functional and phylogenetic variation of plant communities as factors explaining the diversity of lepidopteran assemblages along elevation gradients in the Swiss Western Alps. According to expectations, we found that the association between butterflies and their host plants is highly phylogenetically structured. Multiple regression analyses showed the combined effect of climate, functional traits and phylogenetic diversity in structuring butterfly communities. Furthermore, we provide the first evidence that plant phylogenetic beta diversity is the major driver explaining butterfly phylogenetic beta diversity. Along ecological gradients, the bottom up control of herbivore diversity is thus driven by phylogenetically structured turnover of plant traits as well as environmental variables. © 2013 Blackwell Publishing Ltd/CNRS.

  1. Isolation and molecular analysis of inv dup(15) and construction of a physical map of a common breakpoint in order to elucidate their mechanism of formation.

    PubMed

    Wandstrat, A E; Schwartz, S

    2000-11-01

    An inverted duplication of chromosome 15 [inv dup(15)] is the most common supernumerary marker chromosome, comprising approximately 50% of all chromosomes in this class. Structurally, the inv dup(15) is a mirror image with the central axis defining a distal break within either the heterochromatic alpha-satellite array or along the euchromatin in the long (q) arm of the chromosome. There are several types of inv dup(15), classified by the amount of euchromatic material present. Generally, they are bisatellited, pseudodicentric and have a breakpoint in 15q11-q14. A suggested mechanism of formation of inv dup(15) involves illegitimate recombination between homologous chromosomes followed by nondisjunction and centromere inactivation. The proximal portion of chromosome 15 contains several low-copy repeat sequence families and it has been hypothesized that errors in pairing among these repeats may result in structural rearrangements of this chromosome including the inv dup(15). To test this hypothesis and to determine the mechanism of formation, the inv dup(15) from four cases was isolated in somatic cell hybrids and polymerase chain reaction microsatellite markers were used to determine the origin of exchange. Two appeared to result from interchromosomal and two from intrachromosomal exchange, one of which occurred post-recombination. In addition, a detailed physical map of the breakpoint region in the largest inv dup(15) was constructed placing eight new sequence-tagged sites and ten new bacterial artificial chromosome markers in the region.

  2. Cenozoic imprints on the phylogenetic structure of palm species assemblages worldwide.

    PubMed

    Kissling, W Daniel; Eiserhardt, Wolf L; Baker, William J; Borchsenius, Finn; Couvreur, Thomas L P; Balslev, Henrik; Svenning, Jens-Christian

    2012-05-08

    Despite long-standing interest in the origin and maintenance of species diversity, little is known about historical drivers of species assemblage structure at large spatiotemporal scales. Here, we use global species distribution data, a dated genus-level phylogeny, and paleo-reconstructions of biomes and climate to examine Cenozoic imprints on the phylogenetic structure of regional species assemblages of palms (Arecaceae), a species-rich plant family characteristic of tropical ecosystems. We find a strong imprint on phylogenetic clustering due to geographic isolation and in situ diversification, especially in the Neotropics and on islands with spectacular palm radiations (e.g., Madagascar, Hawaii, and Cuba). Phylogenetic overdispersion on mainlands and islands corresponds to biotic interchange areas. Differences in the degree of phylogenetic clustering among biogeographic realms are related to differential losses of tropical rainforests during the Cenozoic, but not to the cumulative area of tropical rainforest over geological time. A largely random phylogenetic assemblage structure in Africa coincides with severe losses of rainforest area, especially after the Miocene. More recent events also appear to be influential: phylogenetic clustering increases with increasing intensity of Quaternary glacial-interglacial climatic oscillations in South America and, to a lesser extent, Africa, indicating that specific clades perform better in climatically unstable regions. Our results suggest that continental isolation (in combination with limited long-distance dispersal) and changing climate and habitat loss throughout the Cenozoic have had strong impacts on the phylogenetic structure of regional species assemblages in the tropics.

  3. Threatened species and the potential loss of phylogenetic diversity: conservation scenarios based on estimated extinction probabilities and phylogenetic risk analysis.

    PubMed

    Faith, Daniel P

    2008-12-01

    New species conservation strategies, including the EDGE of Existence (EDGE) program, have expanded threatened species assessments by integrating information about species' phylogenetic distinctiveness. Distinctiveness has been measured through simple scores that assign shared credit among species for evolutionary heritage represented by the deeper phylogenetic branches. A species with a high score combined with a high extinction probability receives high priority for conservation efforts. Simple hypothetical scenarios for phylogenetic trees and extinction probabilities demonstrate how such scoring approaches can provide inefficient priorities for conservation. An existing probabilistic framework derived from the phylogenetic diversity measure (PD) properly captures the idea of shared responsibility for the persistence of evolutionary history. It avoids static scores, takes into account the status of close relatives through their extinction probabilities, and allows for the necessary updating of priorities in light of changes in species threat status. A hypothetical phylogenetic tree illustrates how changes in extinction probabilities of one or more species translate into changes in expected PD. The probabilistic PD framework provided a range of strategies that moved beyond expected PD to better consider worst-case PD losses. In another example, risk aversion gave higher priority to a conservation program that provided a smaller, but less risky, gain in expected PD. The EDGE program could continue to promote a list of top species conservation priorities through application of probabilistic PD and simple estimates of current extinction probability. The list might be a dynamic one, with all the priority scores updated as extinction probabilities change. Results of recent studies suggest that estimation of extinction probabilities derived from the red list criteria linked to changes in species range sizes may provide estimated probabilities for many different species

  4. Cnidarian phylogenetic relationships as revealed by mitogenomics

    PubMed Central

    2013-01-01

    Background Cnidaria (corals, sea anemones, hydroids, jellyfish) is a phylum of relatively simple aquatic animals characterized by the presence of the cnidocyst: a cell containing a giant capsular organelle with an eversible tubule (cnida). Species within Cnidaria have life cycles that involve one or both of the two distinct body forms, a typically benthic polyp, which may or may not be colonial, and a typically pelagic mostly solitary medusa. The currently accepted taxonomic scheme subdivides Cnidaria into two main assemblages: Anthozoa (Hexacorallia + Octocorallia) – cnidarians with a reproductive polyp and the absence of a medusa stage – and Medusozoa (Cubozoa, Hydrozoa, Scyphozoa, Staurozoa) – cnidarians that usually possess a reproductive medusa stage. Hypothesized relationships among these taxa greatly impact interpretations of cnidarian character evolution. Results We expanded the sampling of cnidarian mitochondrial genomes, particularly from Medusozoa, to reevaluate phylogenetic relationships within Cnidaria. Our phylogenetic analyses based on a mitochogenomic dataset support many prior hypotheses, including monophyly of Hexacorallia, Octocorallia, Medusozoa, Cubozoa, Staurozoa, Hydrozoa, Carybdeida, Chirodropida, and Hydroidolina, but reject the monophyly of Anthozoa, indicating that the Octocorallia + Medusozoa relationship is not the result of sampling bias, as proposed earlier. Further, our analyses contradict Scyphozoa [Discomedusae + Coronatae], Acraspeda [Cubozoa + Scyphozoa], as well as the hypothesis that Staurozoa is the sister group to all the other medusozoans. Conclusions Cnidarian mitochondrial genomic data contain phylogenetic signal informative for understanding the evolutionary history of this phylum. Mitogenome-based phylogenies, which reject the monophyly of Anthozoa, provide further evidence for the polyp-first hypothesis. By rejecting the traditional Acraspeda and Scyphozoa hypotheses, these analyses suggest that

  5. Efficient FPT Algorithms for (Strict) Compatibility of Unrooted Phylogenetic Trees.

    PubMed

    Baste, Julien; Paul, Christophe; Sau, Ignasi; Scornavacca, Celine

    2017-04-01

    In phylogenetics, a central problem is to infer the evolutionary relationships between a set of species X; these relationships are often depicted via a phylogenetic tree-a tree having its leaves labeled bijectively by elements of X and without degree-2 nodes-called the "species tree." One common approach for reconstructing a species tree consists in first constructing several phylogenetic trees from primary data (e.g., DNA sequences originating from some species in X), and then constructing a single phylogenetic tree maximizing the "concordance" with the input trees. The obtained tree is our estimation of the species tree and, when the input trees are defined on overlapping-but not identical-sets of labels, is called "supertree." In this paper, we focus on two problems that are central when combining phylogenetic trees into a supertree: the compatibility and the strict compatibility problems for unrooted phylogenetic trees. These problems are strongly related, respectively, to the notions of "containing as a minor" and "containing as a topological minor" in the graph community. Both problems are known to be fixed parameter tractable in the number of input trees k, by using their expressibility in monadic second-order logic and a reduction to graphs of bounded treewidth. Motivated by the fact that the dependency on k of these algorithms is prohibitively large, we give the first explicit dynamic programming algorithms for solving these problems, both running in time [Formula: see text], where n is the total size of the input.

  6. A Deliberate Practice Approach to Teaching Phylogenetic Analysis

    PubMed Central

    Hobbs, F. Collin; Johnson, Daniel J.; Kearns, Katherine D.

    2013-01-01

    One goal of postsecondary education is to assist students in developing expert-level understanding. Previous attempts to encourage expert-level understanding of phylogenetic analysis in college science classrooms have largely focused on isolated, or “one-shot,” in-class activities. Using a deliberate practice instructional approach, we designed a set of five assignments for a 300-level plant systematics course that incrementally introduces the concepts and skills used in phylogenetic analysis. In our assignments, students learned the process of constructing phylogenetic trees through a series of increasingly difficult tasks; thus, skill development served as a framework for building content knowledge. We present results from 5 yr of final exam scores, pre- and postconcept assessments, and student surveys to assess the impact of our new pedagogical materials on student performance related to constructing and interpreting phylogenetic trees. Students improved in their ability to interpret relationships within trees and improved in several aspects related to between-tree comparisons and tree construction skills. Student feedback indicated that most students believed our approach prepared them to engage in tree construction and gave them confidence in their abilities. Overall, our data confirm that instructional approaches implementing deliberate practice address student misconceptions, improve student experiences, and foster deeper understanding of difficult scientific concepts. PMID:24297294

  7. Detection, breakpoint identification and detailed characterisation of a CNV at the FRA16D site using SNP assays.

    PubMed

    Winchester, L; Newbury, D F; Monaco, A P; Ragoussis, J

    2008-01-01

    Copy Number Variants (CNV) and other submicroscopic structural changes are now recognised to be widespread across the human genome. We show that SNP data generated for association study can be utilised for the identification of deletion CNVs. During analysis of data for an SNP association study for Specific Language Impairment (SLI) a deletion was identified. SLI adversely affects the language development of children in the absence of any obvious cause. Previous studies have found linkage to a region on chromosome 16. The deletion was located in a known fragile site FRA16D in intron 5-6 of the WWOX gene (also known as FOR). Changes in the FRA16D site have been previously linked to cancer and are often characterised in cell lines. A long-range PCR assay was used to confirm the existence of the deletion. We also show the breakpoint identification and large-scale characterisation of this CNV in a normal human sample set. Copyright 2009 S. Karger AG, Basel.

  8. Mammalian phylogenetic diversity-area relationships at a continental scale

    PubMed Central

    Mazel, Florent; Renaud, Julien; Guilhaumon, François; Mouillot, David; Gravel, Dominique; Thuiller, Wilfried

    2015-01-01

    In analogy to the species-area relationship (SAR), one of the few laws in Ecology, the phylogenetic diversity-area relationship (PDAR) describes the tendency of phylogenetic diversity (PD) to increase with area. Although investigating PDAR has the potential to unravel the underlying processes shaping assemblages across spatial scales and to predict PD loss through habitat reduction, it has been little investigated so far. Focusing on PD has noticeable advantages compared to species richness (SR) since PD also gives insights on processes such as speciation/extinction, assembly rules and ecosystem functioning. Here we investigate the universality and pervasiveness of the PDAR at continental scale using terrestrial mammals as study case. We define the relative robustness of PD (compared to SR) to habitat loss as the area between the standardized PDAR and standardized SAR (i.e. standardized by the diversity of the largest spatial window) divided by the area under the standardized SAR only. This metric quantifies the relative increase of PD robustness compared to SR robustness. We show that PD robustness is higher than SR robustness but that it varies among continents. We further use a null model approach to disentangle the relative effect of phylogenetic tree shape and non random spatial distribution of evolutionary history on the PDAR. We find that for most spatial scales and for all continents except Eurasia, PDARs are not different from expected by a model using only the observed SAR and the shape of the phylogenetic tree at continental scale. Interestingly, we detect a strong phylogenetic structure of the Eurasian PDAR that can be predicted by a model that specifically account for a finer biogeographical delineation of this continent. In conclusion, the relative robustness of PD to habitat loss compared to species richness is determined by the phylogenetic tree shape but also depends on the spatial structure of PD. PMID:26649401

  9. Industrial applications of high-performance computing for phylogeny reconstruction

    NASA Astrophysics Data System (ADS)

    Bader, David A.; Moret, Bernard M.; Vawter, Lisa

    2001-07-01

    Phylogenies (that is, tree-of-life relationships) derived from gene order data may prove crucial in answering some fundamental open questions in biomolecular evolution. Real-world interest is strong in determining these relationships. For example, pharmaceutical companies may use phylogeny reconstruction in drug discovery for discovering synthetic pathways unique to organisms that they wish to target. Health organizations study the phylogenies of organisms such as HIV in order to understand their epidemiologies and to aid in predicting the behaviors of future outbreaks. And governments are interested in aiding the production of such foodstuffs as rice, wheat and potatoes via genetics through understanding of the phylogenetic distribution of genetic variation in wild populations. Yet few techniques are available for difficult phylogenetic reconstruction problems. Appropriate tools for analysis of such data may aid in resolving some of the phylogenetic problems that have been analyzed without much resolution for decades. With the rapid accumulation of whole genome sequences for a wide diversity of taxa, especially microbial taxa, phylogenetic reconstruction based on changes in gene order and gene content is showing promise, particularly for resolving deep (i.e., ancient) branch splits. However, reconstruction from gene-order data is even more computationally expensive than reconstruction from sequence data, particularly in groups with large numbers of genes and highly-rearranged genomes. We have developed a software suite, GRAPPA, that extends the breakpoint analysis (BPAnalysis) method of Sankoff and Blanchette while running much faster: in a recent analysis of chloroplast genome data for species of Campanulaceae on a 512-processor Linux supercluster with Myrinet, we achieved a one-million-fold speedup over BPAnalysis. GRAPPA can use either breakpoint or inversion distance (computed exactly) for its computation and runs on single-processor machines as well as

  10. Your place or mine? A phylogenetic comparative analysis of marital residence in Indo-European and Austronesian societies

    PubMed Central

    Fortunato, Laura; Jordan, Fiona

    2010-01-01

    Accurate reconstruction of prehistoric social organization is important if we are to put together satisfactory multidisciplinary scenarios about, for example, the dispersal of human groups. Such considerations apply in the case of Indo-European and Austronesian, two large-scale language families that are thought to represent Neolithic expansions. Ancestral kinship patterns have mostly been inferred through reconstruction of kin terminologies in ancestral proto-languages using the linguistic comparative method, and through geographical or distributional arguments based on the comparative patterns of kin terms and ethnographic kinship ‘facts’. While these approaches are detailed and valuable, the processes through which conclusions have been drawn from the data fail to provide explicit criteria for systematic testing of alternative hypotheses. Here, we use language trees derived using phylogenetic tree-building techniques on Indo-European and Austronesian vocabulary data. With these trees, ethnographic data and Bayesian phylogenetic comparative methods, we statistically reconstruct past marital residence and infer rates of cultural change between different residence forms, showing Proto-Indo-European to be virilocal and Proto-Malayo-Polynesian uxorilocal. The instability of uxorilocality and the rare loss of virilocality once gained emerge as common features of both families. PMID:21041215

  11. Fast algorithms for computing phylogenetic divergence time.

    PubMed

    Crosby, Ralph W; Williams, Tiffani L

    2017-12-06

    The inference of species divergence time is a key step in most phylogenetic studies. Methods have been available for the last ten years to perform the inference, but the performance of the methods does not yet scale well to studies with hundreds of taxa and thousands of DNA base pairs. For example a study of 349 primate taxa was estimated to require over 9 months of processing time. In this work, we present a new algorithm, AncestralAge, that significantly improves the performance of the divergence time process. As part of AncestralAge, we demonstrate a new method for the computation of phylogenetic likelihood and our experiments show a 90% improvement in likelihood computation time on the aforementioned dataset of 349 primates taxa with over 60,000 DNA base pairs. Additionally, we show that our new method for the computation of the Bayesian prior on node ages reduces the running time for this computation on the 349 taxa dataset by 99%. Through the use of these new algorithms we open up the ability to perform divergence time inference on large phylogenetic studies.

  12. Phylogenetic relationships among Maloideae species

    USDA-ARS?s Scientific Manuscript database

    The Maloideae is a highly diverse sub-family of the Rosaceae containing several agronomically important species (Malus sp. and Pyrus sp.) and their wild relatives. Previous phylogenetic work within the group has revealed extensive intergeneric hybridization and polyploidization. In order to develop...

  13. Verdant: automated annotation, alignment and phylogenetic analysis of whole chloroplast genomes.

    PubMed

    McKain, Michael R; Hartsock, Ryan H; Wohl, Molly M; Kellogg, Elizabeth A

    2017-01-01

    Chloroplast genomes are now produced in the hundreds for angiosperm phylogenetics projects, but current methods for annotation, alignment and tree estimation still require some manual intervention reducing throughput and increasing analysis time for large chloroplast systematics projects. Verdant is a web-based software suite and database built to take advantage a novel annotation program, annoBTD. Using annoBTD, Verdant provides accurate annotation of chloroplast genomes without manual intervention. Subsequent alignment and tree estimation can incorporate newly annotated and publically available plastomes and can accommodate a large number of taxa. Verdant sharply reduces the time required for analysis of assembled chloroplast genomes and removes the need for pipelines and software on personal hardware. Verdant is available at: http://verdant.iplantcollaborative.org/plastidDB/ It is implemented in PHP, Perl, MySQL, Javascript, HTML and CSS with all major browsers supported. mrmckain@gmail.comSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  14. Keeping All the PIECES: Phylogenetically Informed Ex Situ Conservation of Endangered Species.

    PubMed

    Larkin, Daniel J; Jacobi, Sarah K; Hipp, Andrew L; Kramer, Andrea T

    2016-01-01

    Ex situ conservation in germplasm and living collections is a major focus of global plant conservation strategies. Prioritizing species for ex situ collection is a necessary component of this effort for which sound strategies are needed. Phylogenetic considerations can play an important role in prioritization. Collections that are more phylogenetically diverse are likely to encompass more ecological and trait variation, and thus provide stronger conservation insurance and richer resources for future restoration efforts. However, phylogenetic criteria need to be weighed against other, potentially competing objectives. We used ex situ collection and threat rank data for North American angiosperms to investigate gaps in ex situ coverage and phylogenetic diversity of collections and to develop a flexible framework for prioritizing species across multiple objectives. We found that ex situ coverage of 18,766 North American angiosperm taxa was low with respect to the most vulnerable taxa: just 43% of vulnerable to critically imperiled taxa were in ex situ collections, far short of a year-2020 goal of 75%. In addition, species held in ex situ collections were phylogenetically clustered (P < 0.001), i.e., collections comprised less phylogenetic diversity than would be expected had species been drawn at random. These patterns support incorporating phylogenetic considerations into ex situ prioritization in a manner balanced with other criteria, such as vulnerability. To meet this need, we present the 'PIECES' index (Phylogenetically Informed Ex situ Conservation of Endangered Species). PIECES integrates phylogenetic considerations into a flexible framework for prioritizing species across competing objectives using multi-criteria decision analysis. Applying PIECES to prioritizing ex situ conservation of North American angiosperms, we show strong return on investment across multiple objectives, some of which are negatively correlated with each other. A spreadsheet

  15. Keeping All the PIECES: Phylogenetically Informed Ex Situ Conservation of Endangered Species

    PubMed Central

    Larkin, Daniel J.; Jacobi, Sarah K.; Hipp, Andrew L.; Kramer, Andrea T.

    2016-01-01

    Ex situ conservation in germplasm and living collections is a major focus of global plant conservation strategies. Prioritizing species for ex situ collection is a necessary component of this effort for which sound strategies are needed. Phylogenetic considerations can play an important role in prioritization. Collections that are more phylogenetically diverse are likely to encompass more ecological and trait variation, and thus provide stronger conservation insurance and richer resources for future restoration efforts. However, phylogenetic criteria need to be weighed against other, potentially competing objectives. We used ex situ collection and threat rank data for North American angiosperms to investigate gaps in ex situ coverage and phylogenetic diversity of collections and to develop a flexible framework for prioritizing species across multiple objectives. We found that ex situ coverage of 18,766 North American angiosperm taxa was low with respect to the most vulnerable taxa: just 43% of vulnerable to critically imperiled taxa were in ex situ collections, far short of a year-2020 goal of 75%. In addition, species held in ex situ collections were phylogenetically clustered (P < 0.001), i.e., collections comprised less phylogenetic diversity than would be expected had species been drawn at random. These patterns support incorporating phylogenetic considerations into ex situ prioritization in a manner balanced with other criteria, such as vulnerability. To meet this need, we present the ‘PIECES’ index (Phylogenetically Informed Ex situ Conservation of Endangered Species). PIECES integrates phylogenetic considerations into a flexible framework for prioritizing species across competing objectives using multi-criteria decision analysis. Applying PIECES to prioritizing ex situ conservation of North American angiosperms, we show strong return on investment across multiple objectives, some of which are negatively correlated with each other. A spreadsheet

  16. Phylogenetic versus functional signals in the evolution of form-function relationships in terrestrial vision.

    PubMed

    Motani, Ryosuke; Schmitz, Lars

    2011-08-01

    Phylogeny is deeply pertinent to evolutionary studies. Traits that perform a body function are expected to be strongly influenced by physical "requirements" of the function. We investigated if such traits exhibit phylogenetic signals, and, if so, how phylogenetic noises bias quantification of form-function relationships. A form-function system that is strongly influenced by physics, namely the relationship between eye morphology and visual optics in amniotes, was used. We quantified the correlation between form (i.e., eye morphology) and function (i.e., ocular optics) while varying the level of phylogenetic bias removal through adjusting Pagel's λ. Ocular soft-tissue dimensions exhibited the highest correlation with ocular optics when 1% of phylogenetic bias expected from Brownian motion was removed (i.e., λ= 0.01); the value for hard-tissue data were 8%. A small degree of phylogenetic bias therefore exists in morphology despite of the stringent functional constraints. We also devised a phylogenetically informed discriminant analysis and recorded the effects of phylogenetic bias on this method using the same data. Use of proper λ values during phylogenetic bias removal improved misidentification rates in resulting classifications when prior probabilities were assumed to be equal. Even a small degree of phylogenetic bias affected the classification resulting from phylogenetically informed discriminant analysis. © 2011 The Author(s). Evolution© 2011 The Society for the Study of Evolution.

  17. SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data.

    PubMed

    Lee, Tae-Ho; Guo, Hui; Wang, Xiyin; Kim, Changsoo; Paterson, Andrew H

    2014-02-26

    Phylogenetic trees are widely used for genetic and evolutionary studies in various organisms. Advanced sequencing technology has dramatically enriched data available for constructing phylogenetic trees based on single nucleotide polymorphisms (SNPs). However, massive SNP data makes it difficult to perform reliable analysis, and there has been no ready-to-use pipeline to generate phylogenetic trees from these data. We developed a new pipeline, SNPhylo, to construct phylogenetic trees based on large SNP datasets. The pipeline may enable users to construct a phylogenetic tree from three representative SNP data file formats. In addition, in order to increase reliability of a tree, the pipeline has steps such as removing low quality data and considering linkage disequilibrium. A maximum likelihood method for the inference of phylogeny is also adopted in generation of a tree in our pipeline. Using SNPhylo, users can easily produce a reliable phylogenetic tree from a large SNP data file. Thus, this pipeline can help a researcher focus more on interpretation of the results of analysis of voluminous data sets, rather than manipulations necessary to accomplish the analysis.

  18. The Probability of a Gene Tree Topology within a Phylogenetic Network with Applications to Hybridization Detection

    PubMed Central

    Yu, Yun; Degnan, James H.; Nakhleh, Luay

    2012-01-01

    Gene tree topologies have proven a powerful data source for various tasks, including species tree inference and species delimitation. Consequently, methods for computing probabilities of gene trees within species trees have been developed and widely used in probabilistic inference frameworks. All these methods assume an underlying multispecies coalescent model. However, when reticulate evolutionary events such as hybridization occur, these methods are inadequate, as they do not account for such events. Methods that account for both hybridization and deep coalescence in computing the probability of a gene tree topology currently exist for very limited cases. However, no such methods exist for general cases, owing primarily to the fact that it is currently unknown how to compute the probability of a gene tree topology within the branches of a phylogenetic network. Here we present a novel method for computing the probability of gene tree topologies on phylogenetic networks and demonstrate its application to the inference of hybridization in the presence of incomplete lineage sorting. We reanalyze a Saccharomyces species data set for which multiple analyses had converged on a species tree candidate. Using our method, though, we show that an evolutionary hypothesis involving hybridization in this group has better support than one of strict divergence. A similar reanalysis on a group of three Drosophila species shows that the data is consistent with hybridization. Further, using extensive simulation studies, we demonstrate the power of gene tree topologies at obtaining accurate estimates of branch lengths and hybridization probabilities of a given phylogenetic network. Finally, we discuss identifiability issues with detecting hybridization, particularly in cases that involve extinction or incomplete sampling of taxa. PMID:22536161

  19. Multivariate Phylogenetic Comparative Methods: Evaluations, Comparisons, and Recommendations.

    PubMed

    Adams, Dean C; Collyer, Michael L

    2018-01-01

    Recent years have seen increased interest in phylogenetic comparative analyses of multivariate data sets, but to date the varied proposed approaches have not been extensively examined. Here we review the mathematical properties required of any multivariate method, and specifically evaluate existing multivariate phylogenetic comparative methods in this context. Phylogenetic comparative methods based on the full multivariate likelihood are robust to levels of covariation among trait dimensions and are insensitive to the orientation of the data set, but display increasing model misspecification as the number of trait dimensions increases. This is because the expected evolutionary covariance matrix (V) used in the likelihood calculations becomes more ill-conditioned as trait dimensionality increases, and as evolutionary models become more complex. Thus, these approaches are only appropriate for data sets with few traits and many species. Methods that summarize patterns across trait dimensions treated separately (e.g., SURFACE) incorrectly assume independence among trait dimensions, resulting in nearly a 100% model misspecification rate. Methods using pairwise composite likelihood are highly sensitive to levels of trait covariation, the orientation of the data set, and the number of trait dimensions. The consequences of these debilitating deficiencies are that a user can arrive at differing statistical conclusions, and therefore biological inferences, simply from a dataspace rotation, like principal component analysis. By contrast, algebraic generalizations of the standard phylogenetic comparative toolkit that use the trace of covariance matrices are insensitive to levels of trait covariation, the number of trait dimensions, and the orientation of the data set. Further, when appropriate permutation tests are used, these approaches display acceptable Type I error and statistical power. We conclude that methods summarizing information across trait dimensions, as well as

  20. Comparative Analysis of Begonia Plastid Genomes and Their Utility for Species-Level Phylogenetics

    PubMed Central

    Harrison, Nicola; Harrison, Richard J.

    2016-01-01

    Recent, rapid radiations make species-level phylogenetics difficult to resolve. We used a multiplexed, high-throughput sequencing approach to identify informative genomic regions to resolve phylogenetic relationships at low taxonomic levels in Begonia from a survey of sixteen species. A long-range PCR method was used to generate draft plastid genomes to provide a strong phylogenetic backbone, identify fast evolving regions and provide informative molecular markers for species-level phylogenetic studies in Begonia. PMID:27058864

  1. MICCA: a complete and accurate software for taxonomic profiling of metagenomic data

    PubMed Central

    Albanese, Davide; Fontana, Paolo; De Filippo, Carlotta; Cavalieri, Duccio; Donati, Claudio

    2015-01-01

    The introduction of high throughput sequencing technologies has triggered an increase of the number of studies in which the microbiota of environmental and human samples is characterized through the sequencing of selected marker genes. While experimental protocols have undergone a process of standardization that makes them accessible to a large community of scientist, standard and robust data analysis pipelines are still lacking. Here we introduce MICCA, a software pipeline for the processing of amplicon metagenomic datasets that efficiently combines quality filtering, clustering of Operational Taxonomic Units (OTUs), taxonomy assignment and phylogenetic tree inference. MICCA provides accurate results reaching a good compromise among modularity and usability. Moreover, we introduce a de-novo clustering algorithm specifically designed for the inference of Operational Taxonomic Units (OTUs). Tests on real and synthetic datasets shows that thanks to the optimized reads filtering process and to the new clustering algorithm, MICCA provides estimates of the number of OTUs and of other common ecological indices that are more accurate and robust than currently available pipelines. Analysis of public metagenomic datasets shows that the higher consistency of results improves our understanding of the structure of environmental and human associated microbial communities. MICCA is an open source project. PMID:25988396

  2. Phylogenetic rooting using minimal ancestor deviation.

    PubMed

    Tria, Fernando Domingues Kümmel; Landan, Giddy; Dagan, Tal

    2017-06-19

    Ancestor-descendent relations play a cardinal role in evolutionary theory. Those relations are determined by rooting phylogenetic trees. Existing rooting methods are hampered by evolutionary rate heterogeneity or the unavailability of auxiliary phylogenetic information. Here we present a rooting approach, the minimal ancestor deviation (MAD) method, which accommodates heterotachy by using all pairwise topological and metric information in unrooted trees. We demonstrate the performance of the method, in comparison to existing rooting methods, by the analysis of phylogenies from eukaryotes and prokaryotes. MAD correctly recovers the known root of eukaryotes and uncovers evidence for the origin of cyanobacteria in the ocean. MAD is more robust and consistent than existing methods, provides measures of the root inference quality and is applicable to any tree with branch lengths.

  3. Molecular identification and phylogenetic study of Demodex caprae.

    PubMed

    Zhao, Ya-E; Cheng, Juan; Hu, Li; Ma, Jun-Xian

    2014-10-01

    The DNA barcode has been widely used in species identification and phylogenetic analysis since 2003, but there have been no reports in Demodex. In this study, to obtain an appropriate DNA barcode for Demodex, molecular identification of Demodex caprae based on mitochondrial cox1 was conducted. Firstly, individual adults and eggs of D. caprae were obtained for genomic DNA (gDNA) extraction; Secondly, mitochondrial cox1 fragment was amplified, cloned, and sequenced; Thirdly, cox1 fragments of D. caprae were aligned with those of other Demodex retrieved from GenBank; Finally, the intra- and inter-specific divergences were computed and the phylogenetic trees were reconstructed to analyze phylogenetic relationship in Demodex. Results obtained from seven 429-bp fragments of D. caprae showed that sequence identities were above 99.1% among three adults and four eggs. The intraspecific divergences in D. caprae, Demodex folliculorum, Demodex brevis, and Demodex canis were 0.0-0.9, 0.5-0.9, 0.0-0.2, and 0.0-0.5%, respectively, while the interspecific divergences between D. caprae and D. folliculorum, D. canis, and D. brevis were 20.3-20.9, 21.8-23.0, and 25.0-25.3, respectively. The interspecific divergences were 10 times higher than intraspecific ones, indicating considerable barcoding gap. Furthermore, the phylogenetic trees showed that four Demodex species gathered separately, representing independent species; and Demodex folliculorum gathered with canine Demodex, D. caprae, and D. brevis in sequence. In conclusion, the selected 429-bp mitochondrial cox1 gene is an appropriate DNA barcode for molecular classification, identification, and phylogenetic analysis of Demodex. D. caprae is an independent species and D. folliculorum is closer to D. canis than to D. caprae or D. brevis.

  4. The dawn of open access to phylogenetic data.

    PubMed

    Magee, Andrew F; May, Michael R; Moore, Brian R

    2014-01-01

    The scientific enterprise depends critically on the preservation of and open access to published data. This basic tenet applies acutely to phylogenies (estimates of evolutionary relationships among species). Increasingly, phylogenies are estimated from increasingly large, genome-scale datasets using increasingly complex statistical methods that require increasing levels of expertise and computational investment. Moreover, the resulting phylogenetic data provide an explicit historical perspective that critically informs research in a vast and growing number of scientific disciplines. One such use is the study of changes in rates of lineage diversification (speciation--extinction) through time. As part of a meta-analysis in this area, we sought to collect phylogenetic data (comprising nucleotide sequence alignment and tree files) from 217 studies published in 46 journals over a 13-year period. We document our attempts to procure those data (from online archives and by direct request to corresponding authors), and report results of analyses (using Bayesian logistic regression) to assess the impact of various factors on the success of our efforts. Overall, complete phylogenetic data for [Formula: see text] of these studies are effectively lost to science. Our study indicates that phylogenetic data are more likely to be deposited in online archives and/or shared upon request when: (1) the publishing journal has a strong data-sharing policy; (2) the publishing journal has a higher impact factor, and; (3) the data are requested from faculty rather than students. Importantly, our survey spans recent policy initiatives and infrastructural changes; our analyses indicate that the positive impact of these community initiatives has been both dramatic and immediate. Although the results of our study indicate that the situation is dire, our findings also reveal tremendous recent progress in the sharing and preservation of phylogenetic data.

  5. Student Interpretations of Phylogenetic Trees in an Introductory Biology Course

    ERIC Educational Resources Information Center

    Dees, Jonathan; Momsen, Jennifer L.; Niemi, Jarad; Montplaisir, Lisa

    2014-01-01

    Phylogenetic trees are widely used visual representations in the biological sciences and the most important visual representations in evolutionary biology. Therefore, phylogenetic trees have also become an important component of biology education. We sought to characterize reasoning used by introductory biology students in interpreting taxa…

  6. A taxonomic and phylogenetic re-appraisal of the genus Curvularia

    USDA-ARS?s Scientific Manuscript database

    Species of Curvularia are important plant and human pathogens worldwide. In this study, the genus Curvularia is re-assessed based on molecular phylogenetic analysis and morphological observations of available isolates and specimens. A multi-gene phylogenetic tree inferred from ITS, TEF and GPDH gene...

  7. A new algorithm to construct phylogenetic networks from trees.

    PubMed

    Wang, J

    2014-03-06

    Developing appropriate methods for constructing phylogenetic networks from tree sets is an important problem, and much research is currently being undertaken in this area. BIMLR is an algorithm that constructs phylogenetic networks from tree sets. The algorithm can construct a much simpler network than other available methods. Here, we introduce an improved version of the BIMLR algorithm, QuickCass. QuickCass changes the selection strategy of the labels of leaves below the reticulate nodes, i.e., the nodes with an indegree of at least 2 in BIMLR. We show that QuickCass can construct simpler phylogenetic networks than BIMLR. Furthermore, we show that QuickCass is a polynomial-time algorithm when the output network that is constructed by QuickCass is binary.

  8. Structure-Based Phylogenetic Analysis of the Lipocalin Superfamily.

    PubMed

    Lakshmi, Balasubramanian; Mishra, Madhulika; Srinivasan, Narayanaswamy; Archunan, Govindaraju

    2015-01-01

    Lipocalins constitute a superfamily of extracellular proteins that are found in all three kingdoms of life. Although very divergent in their sequences and functions, they show remarkable similarity in 3-D structures. Lipocalins bind and transport small hydrophobic molecules. Earlier sequence-based phylogenetic studies of lipocalins highlighted that they have a long evolutionary history. However the molecular and structural basis of their functional diversity is not completely understood. The main objective of the present study is to understand functional diversity of the lipocalins using a structure-based phylogenetic approach. The present study with 39 protein domains from the lipocalin superfamily suggests that the clusters of lipocalins obtained by structure-based phylogeny correspond well with the functional diversity. The detailed analysis on each of the clusters and sub-clusters reveals that the 39 lipocalin domains cluster based on their mode of ligand binding though the clustering was performed on the basis of gross domain structure. The outliers in the phylogenetic tree are often from single member families. Also structure-based phylogenetic approach has provided pointers to assign putative function for the domains of unknown function in lipocalin family. The approach employed in the present study can be used in the future for the functional identification of new lipocalin proteins and may be extended to other protein families where members show poor sequence similarity but high structural similarity.

  9. Phylogenetic trees in bioinformatics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Burr, Tom L

    2008-01-01

    Genetic data is often used to infer evolutionary relationships among a collection of viruses, bacteria, animal or plant species, or other operational taxonomic units (OTU). A phylogenetic tree depicts such relationships and provides a visual representation of the estimated branching order of the OTUs. Tree estimation is unique for several reasons, including: the types of data used to represent each OTU; the use ofprobabilistic nucleotide substitution models; the inference goals involving both tree topology and branch length, and the huge number of possible trees for a given sample of a very modest number of OTUs, which implies that fmding themore » best tree(s) to describe the genetic data for each OTU is computationally demanding. Bioinformatics is too large a field to review here. We focus on that aspect of bioinformatics that includes study of similarities in genetic data from multiple OTUs. Although research questions are diverse, a common underlying challenge is to estimate the evolutionary history of the OTUs. Therefore, this paper reviews the role of phylogenetic tree estimation in bioinformatics, available methods and software, and identifies areas for additional research and development.« less

  10. Hal: an automated pipeline for phylogenetic analyses of genomic data.

    PubMed

    Robbertse, Barbara; Yoder, Ryan J; Boyd, Alex; Reeves, John; Spatafora, Joseph W

    2011-02-07

    The rapid increase in genomic and genome-scale data is resulting in unprecedented levels of discrete sequence data available for phylogenetic analyses. Major analytical impasses exist, however, prior to analyzing these data with existing phylogenetic software. Obstacles include the management of large data sets without standardized naming conventions, identification and filtering of orthologous clusters of proteins or genes, and the assembly of alignments of orthologous sequence data into individual and concatenated super alignments. Here we report the production of an automated pipeline, Hal that produces multiple alignments and trees from genomic data. These alignments can be produced by a choice of four alignment programs and analyzed by a variety of phylogenetic programs. In short, the Hal pipeline connects the programs BLASTP, MCL, user specified alignment programs, GBlocks, ProtTest and user specified phylogenetic programs to produce species trees. The script is available at sourceforge (http://sourceforge.net/projects/bio-hal/). The results from an example analysis of Kingdom Fungi are briefly discussed.

  11. Phyx: phylogenetic tools for unix.

    PubMed

    Brown, Joseph W; Walker, Joseph F; Smith, Stephen A

    2017-06-15

    The ease with which phylogenomic data can be generated has drastically escalated the computational burden for even routine phylogenetic investigations. To address this, we present phyx : a collection of programs written in C ++ to explore, manipulate, analyze and simulate phylogenetic objects (alignments, trees and MCMC logs). Modelled after Unix/GNU/Linux command line tools, individual programs perform a single task and operate on standard I/O streams that can be piped to quickly and easily form complex analytical pipelines. Because of the stream-centric paradigm, memory requirements are minimized (often only a single tree or sequence in memory at any instance), and hence phyx is capable of efficiently processing very large datasets. phyx runs on POSIX-compliant operating systems. Source code, installation instructions, documentation and example files are freely available under the GNU General Public License at https://github.com/FePhyFoFum/phyx. eebsmith@umich.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.

  12. Phyx: phylogenetic tools for unix

    PubMed Central

    Brown, Joseph W.; Walker, Joseph F.; Smith, Stephen A.

    2017-01-01

    Abstract Summary: The ease with which phylogenomic data can be generated has drastically escalated the computational burden for even routine phylogenetic investigations. To address this, we present phyx: a collection of programs written in C ++ to explore, manipulate, analyze and simulate phylogenetic objects (alignments, trees and MCMC logs). Modelled after Unix/GNU/Linux command line tools, individual programs perform a single task and operate on standard I/O streams that can be piped to quickly and easily form complex analytical pipelines. Because of the stream-centric paradigm, memory requirements are minimized (often only a single tree or sequence in memory at any instance), and hence phyx is capable of efficiently processing very large datasets. Availability and Implementation: phyx runs on POSIX-compliant operating systems. Source code, installation instructions, documentation and example files are freely available under the GNU General Public License at https://github.com/FePhyFoFum/phyx Contact: eebsmith@umich.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28174903

  13. Testing for Polytomies in Phylogenetic Species Trees Using Quartet Frequencies.

    PubMed

    Sayyari, Erfan; Mirarab, Siavash

    2018-02-28

    Phylogenetic species trees typically represent the speciation history as a bifurcating tree. Speciation events that simultaneously create more than two descendants, thereby creating polytomies in the phylogeny, are possible. Moreover, the inability to resolve relationships is often shown as a (soft) polytomy. Both types of polytomies have been traditionally studied in the context of gene tree reconstruction from sequence data. However, polytomies in the species tree cannot be detected or ruled out without considering gene tree discordance. In this paper, we describe a statistical test based on properties of the multi-species coalescent model to test the null hypothesis that a branch in an estimated species tree should be replaced by a polytomy. On both simulated and biological datasets, we show that the null hypothesis is rejected for all but the shortest branches, and in most cases, it is retained for true polytomies. The test, available as part of the Accurate Species TRee ALgorithm (ASTRAL) package, can help systematists decide whether their datasets are sufficient to resolve specific relationships of interest.

  14. Testing for Polytomies in Phylogenetic Species Trees Using Quartet Frequencies

    PubMed Central

    Sayyari, Erfan

    2018-01-01

    Phylogenetic species trees typically represent the speciation history as a bifurcating tree. Speciation events that simultaneously create more than two descendants, thereby creating polytomies in the phylogeny, are possible. Moreover, the inability to resolve relationships is often shown as a (soft) polytomy. Both types of polytomies have been traditionally studied in the context of gene tree reconstruction from sequence data. However, polytomies in the species tree cannot be detected or ruled out without considering gene tree discordance. In this paper, we describe a statistical test based on properties of the multi-species coalescent model to test the null hypothesis that a branch in an estimated species tree should be replaced by a polytomy. On both simulated and biological datasets, we show that the null hypothesis is rejected for all but the shortest branches, and in most cases, it is retained for true polytomies. The test, available as part of the Accurate Species TRee ALgorithm (ASTRAL) package, can help systematists decide whether their datasets are sufficient to resolve specific relationships of interest. PMID:29495636

  15. Coalescent methods for estimating phylogenetic trees.

    PubMed

    Liu, Liang; Yu, Lili; Kubatko, Laura; Pearl, Dennis K; Edwards, Scott V

    2009-10-01

    We review recent models to estimate phylogenetic trees under the multispecies coalescent. Although the distinction between gene trees and species trees has come to the fore of phylogenetics, only recently have methods been developed that explicitly estimate species trees. Of the several factors that can cause gene tree heterogeneity and discordance with the species tree, deep coalescence due to random genetic drift in branches of the species tree has been modeled most thoroughly. Bayesian approaches to estimating species trees utilizes two likelihood functions, one of which has been widely used in traditional phylogenetics and involves the model of nucleotide substitution, and the second of which is less familiar to phylogeneticists and involves the probability distribution of gene trees given a species tree. Other recent parametric and nonparametric methods for estimating species trees involve parsimony criteria, summary statistics, supertree and consensus methods. Species tree approaches are an appropriate goal for systematics, appear to work well in some cases where concatenation can be misleading, and suggest that sampling many independent loci will be paramount. Such methods can also be challenging to implement because of the complexity of the models and computational time. In addition, further elaboration of the simplest of coalescent models will be required to incorporate commonly known issues such as deviation from the molecular clock, gene flow and other genetic forces.

  16. Phylogenetic impoverishment of Amazonian tree communities in an experimentally fragmented forest landscape.

    PubMed

    Santos, Bráulio A; Tabarelli, Marcelo; Melo, Felipe P L; Camargo, José L C; Andrade, Ana; Laurance, Susan G; Laurance, William F

    2014-01-01

    Amazonian rainforests sustain some of the richest tree communities on Earth, but their ecological and evolutionary responses to human threats remain poorly known. We used one of the largest experimental datasets currently available on tree dynamics in fragmented tropical forests and a recent phylogeny of angiosperms to test whether tree communities have lost phylogenetic diversity since their isolation about two decades previously. Our findings revealed an overall trend toward phylogenetic impoverishment across the experimentally fragmented landscape, irrespective of whether tree communities were in 1-ha, 10-ha, or 100-ha forest fragments, near forest edges, or in continuous forest. The magnitude of the phylogenetic diversity loss was low (<2% relative to before-fragmentation values) but widespread throughout the study landscape, occurring in 32 of 40 1-ha plots. Consistent with this loss in phylogenetic diversity, we observed a significant decrease of 50% in phylogenetic dispersion since forest isolation, irrespective of plot location. Analyses based on tree genera that have significantly increased (28 genera) or declined (31 genera) in abundance and basal area in the landscape revealed that increasing genera are more phylogenetically related than decreasing ones. Also, the loss of phylogenetic diversity was greater in tree communities where increasing genera proliferated and decreasing genera reduced their importance values, suggesting that this taxonomic replacement is partially underlying the phylogenetic impoverishment at the landscape scale. This finding has clear implications for the current debate about the role human-modified landscapes play in sustaining biodiversity persistence and key ecosystem services, such as carbon storage. Although the generalization of our findings to other fragmented tropical forests is uncertain, it could negatively affect ecosystem productivity and stability and have broader impacts on coevolved organisms.

  17. Effectiveness of protected areas for vertebrates based on taxonomic and phylogenetic diversity.

    PubMed

    Quan, Qing; Che, Xianli; Wu, Yongjie; Wu, Yuchun; Zhang, Qiang; Zhang, Min; Zou, Fasheng

    2018-04-01

    Establishing protected areas is the primary goal and tool for preventing irreversible biodiversity loss. However, the effectiveness of protected areas that target specific species has been questioned for some time because targeting key species for conservation may impair the integral regional pool of species diversity and phylogenetic and functional diversity are seldom considered. We assessed the efficacy of protected areas in China for the conservation of phylogenetic diversity based on the ranges and phylogenies of 2279 terrestrial vertebrates. Phylogenetic and taxonomic diversity were strongly and positively correlated, and only 12.1-43.8% of priority conservation areas are currently protected. However, the patterns and coverage of phylogenetic diversity were affected when weighted by species richness. These results indicated that in China, protected areas targeting high species richness protected phylogenetic diversity well overall but failed to do so in some regions with more unique or threatened communities (e.g., coastal areas of eastern China, where severely threatened avian communities were less protected). Our results suggest that the current distribution of protected areas could be improved, although most protected areas protect both taxonomic and phylogenetic diversity. © 2017 Society for Conservation Biology.

  18. Phylogenetic stratigraphy in the Guerrero Negro hypersaline microbial mat.

    PubMed

    Harris, J Kirk; Caporaso, J Gregory; Walker, Jeffrey J; Spear, John R; Gold, Nicholas J; Robertson, Charles E; Hugenholtz, Philip; Goodrich, Julia; McDonald, Daniel; Knights, Dan; Marshall, Paul; Tufo, Henry; Knight, Rob; Pace, Norman R

    2013-01-01

    The microbial mats of Guerrero Negro (GN), Baja California Sur, Mexico historically were considered a simple environment, dominated by cyanobacteria and sulfate-reducing bacteria. Culture-independent rRNA community profiling instead revealed these microbial mats as among the most phylogenetically diverse environments known. A preliminary molecular survey of the GN mat based on only ∼1500 small subunit rRNA gene sequences discovered several new phylum-level groups in the bacterial phylogenetic domain and many previously undetected lower-level taxa. We determined an additional ∼119,000 nearly full-length sequences and 28,000 >200 nucleotide 454 reads from a 10-layer depth profile of the GN mat. With this unprecedented coverage of long sequences from one environment, we confirm the mat is phylogenetically stratified, presumably corresponding to light and geochemical gradients throughout the depth of the mat. Previous shotgun metagenomic data from the same depth profile show the same stratified pattern and suggest that metagenome properties may be predictable from rRNA gene sequences. We verify previously identified novel lineages and identify new phylogenetic diversity at lower taxonomic levels, for example, thousands of operational taxonomic units at the family-genus levels differ considerably from known sequences. The new sequences populate parts of the bacterial phylogenetic tree that previously were poorly described, but indicate that any comprehensive survey of GN diversity has only begun. Finally, we show that taxonomic conclusions are generally congruent between Sanger and 454 sequencing technologies, with the taxonomic resolution achieved dependent on the abundance of reference sequences in the relevant region of the rRNA tree of life.

  19. Dual phylogenetic origins of Nigerian lions (Panthera leo).

    PubMed

    Tende, Talatu; Bensch, Staffan; Ottosson, Ulf; Hansson, Bengt

    2014-07-01

    Lion fecal DNA extracts from four individuals each from Yankari Game Reserve and Kainji-Lake National Park (central northeast and west Nigeria, respectively) were Sanger-sequenced for the mitochondrial cytochrome b gene. The sequences were aligned against 61 lion reference sequences from other parts of Africa and India. The sequence data were analyzed further for the construction of phylogenetic trees using the maximum-likelihood approach to depict phylogenetic patterns of distribution among sequences. Our results show that Nigerian lions grouped together with lions from West and Central Africa. At the smaller geographical scale, lions from Kainji-Lake National Park in western Nigeria grouped with lions from Benin (located west of Nigeria), whereas lions from Yankari Game Reserve in central northeastern Nigeria grouped with the lion populations in Cameroon (located east of Nigeria). The finding that the two remaining lion populations in Nigeria have different phylogenetic origins is an important aspect to consider in future decisions regarding management and conservation of rapidly shrinking lion populations in West Africa.

  20. Treetrimmer: a method for phylogenetic dataset size reduction.

    PubMed

    Maruyama, Shinichiro; Eveleigh, Robert J M; Archibald, John M

    2013-04-12

    With rapid advances in genome sequencing and bioinformatics, it is now possible to generate phylogenetic trees containing thousands of operational taxonomic units (OTUs) from a wide range of organisms. However, use of rigorous tree-building methods on such large datasets is prohibitive and manual 'pruning' of sequence alignments is time consuming and raises concerns over reproducibility. There is a need for bioinformatic tools with which to objectively carry out such pruning procedures. Here we present 'TreeTrimmer', a bioinformatics procedure that removes unnecessary redundancy in large phylogenetic datasets, alleviating the size effect on more rigorous downstream analyses. The method identifies and removes user-defined 'redundant' sequences, e.g., orthologous sequences from closely related organisms and 'recently' evolved lineage-specific paralogs. Representative OTUs are retained for more rigorous re-analysis. TreeTrimmer reduces the OTU density of phylogenetic trees without sacrificing taxonomic diversity while retaining the original tree topology, thereby speeding up downstream computer-intensive analyses, e.g., Bayesian and maximum likelihood tree reconstructions, in a reproducible fashion.

  1. Phylogenetic Diversity in the Macromolecular Composition of Microalgae

    PubMed Central

    Finkel, Zoe V.; Follows, Mick J.; Liefer, Justin D.; Brown, Chris M.; Benner, Ina; Irwin, Andrew J.

    2016-01-01

    The elemental stoichiometry of microalgae reflects their underlying macromolecular composition and influences competitive interactions among species and their role in the food web and biogeochemistry. Here we provide a new estimate of the macromolecular composition of microalgae using a hierarchical Bayesian analysis of data compiled from the literature. The median macromolecular composition of nutrient-sufficient exponentially growing microalgae is 32.2% protein, 17.3% lipid, 15.0% carbohydrate, 17.3% ash, 5.7% RNA, 1.1% chlorophyll-a and 1.0% DNA as percent dry weight. Our analysis identifies significant phylogenetic differences in macromolecular composition undetected by previous studies due to small sample sizes and the large inherent variability in macromolecular pools. The phylogenetic differences in macromolecular composition lead to variations in carbon-to-nitrogen ratios that are consistent with independent observations. These phylogenetic differences in macromolecular and elemental composition reflect adaptations in cellular architecture and biochemistry; specifically in the cell wall, the light harvesting apparatus, and storage pools. PMID:27228080

  2. Edge-related loss of tree phylogenetic diversity in the severely fragmented Brazilian Atlantic forest.

    PubMed

    Santos, Bráulio A; Arroyo-Rodríguez, Víctor; Moreno, Claudia E; Tabarelli, Marcelo

    2010-09-08

    Deforestation and forest fragmentation are known major causes of nonrandom extinction, but there is no information about their impact on the phylogenetic diversity of the remaining species assemblages. Using a large vegetation dataset from an old hyper-fragmented landscape in the Brazilian Atlantic rainforest we assess whether the local extirpation of tree species and functional impoverishment of tree assemblages reduce the phylogenetic diversity of the remaining tree assemblages. We detected a significant loss of tree phylogenetic diversity in forest edges, but not in core areas of small (<80 ha) forest fragments. This was attributed to a reduction of 11% in the average phylogenetic distance between any two randomly chosen individuals from forest edges; an increase of 17% in the average phylogenetic distance to closest non-conspecific relative for each individual in forest edges; and to the potential manifestation of late edge effects in the core areas of small forest remnants. We found no evidence supporting fragmentation-induced phylogenetic clustering or evenness. This could be explained by the low phylogenetic conservatism of key life-history traits corresponding to vulnerable species. Edge effects must be reduced to effectively protect tree phylogenetic diversity in the severely fragmented Brazilian Atlantic forest.

  3. New substitution models for rooting phylogenetic trees.

    PubMed

    Williams, Tom A; Heaps, Sarah E; Cherlin, Svetlana; Nye, Tom M W; Boys, Richard J; Embley, T Martin

    2015-09-26

    The root of a phylogenetic tree is fundamental to its biological interpretation, but standard substitution models do not provide any information on its position. Here, we describe two recently developed models that relax the usual assumptions of stationarity and reversibility, thereby facilitating root inference without the need for an outgroup. We compare the performance of these models on a classic test case for phylogenetic methods, before considering two highly topical questions in evolutionary biology: the deep structure of the tree of life and the root of the archaeal radiation. We show that all three alignments contain meaningful rooting information that can be harnessed by these new models, thus complementing and extending previous work based on outgroup rooting. In particular, our analyses exclude the root of the tree of life from the eukaryotes or Archaea, placing it on the bacterial stem or within the Bacteria. They also exclude the root of the archaeal radiation from several major clades, consistent with analyses using other rooting methods. Overall, our results demonstrate the utility of non-reversible and non-stationary models for rooting phylogenetic trees, and identify areas where further progress can be made. © 2015 The Authors.

  4. Phylogenetic Invariants for Metazoan Mitochondrial Genome Evolution.

    PubMed

    Sankoff; Blanchette

    1998-01-01

    The method of phylogenetic invariants was developed to apply to aligned sequence data generated, according to a stochastic substitution model, for N species related through an unknown phylogenetic tree. The invariants are functions of the probabilities of the observable N-tuples, which are identically zero, over all choices of branch length, for some trees. Evaluating the invariants associated with all possible trees, using observed N-tuple frequencies over all sequence positions, enables us to rapidly infer the generating tree. An aspect of evolution at the genomic level much studied recently is the rearrangements of gene order along the chromosome from one species to another. Instead of the substitutions responsible for sequence evolution, we examine the non-local processes responsible for genome rearrangements such as inversion of arbitrarily long segments of chromosomes. By treating the potential adjacency of each possible pair of genes as a position", an appropriate substitution" model can be recognized as governing the rearrangement process, and a probabilistically principled phylogenetic inference can be set up. We calculate the invariants for this process for N=5, and apply them to mitochondrial genome data from coelomate metazoans, showing how they resolve key aspects of branching order.

  5. Phylogenetic relationship of Ornithobacterium rhinotracheale strains.

    PubMed

    DE Oca-Jimenez, Roberto Montes; Vega-Sanchez, Vicente; Morales-Erasto, Vladimir; Salgado-Miranda, Celene; Blackall, Patrick J; Soriano-Vargas, Edgardo

    2018-04-10

    The bacterium Ornithobacterium rhinotracheale is associated with respiratory disease in wild birds and poultry. In this study, the phylogenetic analysis of nine reference strains of O. rhinotracheale belonging to serovars A to I, and eight Mexican isolates belonging to serovar A, was performed. The analysis was extended to include available sequences from another 23 strains available in the public domain. The analysis showed that the 40 sequences formed six clusters, I to VI. All eight Mexican field isolates were placed in cluster I. One of the reference strains appears to present genetic diversity not previously recognized and was placed in a new genetic cluster. In conclusion, the phylogenetic analysis of O. rhinotracheale strains, based on the 16S rRNA gene, is a suitable tool for epidemiologic studies.

  6. Inferring phylogenetic trees from the knowledge of rare evolutionary events.

    PubMed

    Hellmuth, Marc; Hernandez-Rosales, Maribel; Long, Yangjing; Stadler, Peter F

    2018-06-01

    Rare events have played an increasing role in molecular phylogenetics as potentially homoplasy-poor characters. In this contribution we analyze the phylogenetic information content from a combinatorial point of view by considering the binary relation on the set of taxa defined by the existence of a single event separating two taxa. We show that the graph-representation of this relation must be a tree. Moreover, we characterize completely the relationship between the tree of such relations and the underlying phylogenetic tree. With directed operations such as tandem-duplication-random-loss events in mind we demonstrate how non-symmetric information constrains the position of the root in the partially reconstructed phylogeny.

  7. Diversity and Phylogenetic Distribution of Extracellular Microbial Peptidases

    NASA Astrophysics Data System (ADS)

    Nguyen, Trang; Mueller, Ryan; Myrold, David

    2017-04-01

    Depolymerization of proteinaceous compounds by extracellular proteolytic enzymes is a bottleneck in the nitrogen cycle, limiting the rate of the nitrogen turnover in soils. Protein degradation is accomplished by a diverse range of extracellular (secreted) peptidases. Our objective was to better understand the evolution of these enzymes and how their functional diversity corresponds to known phylogenetic diversity. Peptidase subfamilies from 110 archaeal, 1,860 bacterial, and 97 fungal genomes were extracted from the MEROPS database along with corresponding SSU sequences for each genome from the SILVA database, resulting in 43,177 secreted peptidases belonging to 34 microbial phyla and 149 peptidase subfamilies. We compared the distribution of each peptidase subfamily across all taxa to the phylogenetic relationships of these organisms based on their SSU gene sequences. The occurrence and abundance of genes coding for secreted peptidases varied across microbial taxa, distinguishing the peptidase complement of the three microbial kingdoms. Bacteria had the highest frequency of secreted peptidase coding genes per 1,000 genes and contributed from 1% to 6% of the gene content. Fungi only had a slightly higher number of secreted peptidase gene content than archaea, standardized by the total genes. The relative abundance profiles of secreted peptidases in each microbial kingdom also varied, in which aspartic family was found to be the greatest in fungi (25%), whereas it was only 12% in archaea and 4% in bacteria. Serine, metallo, and cysteine families consistently contributed widely up to 75% of the secreted peptidase abundance across the three kingdoms. Overall, bacteria had a much wider collection of secreted peptidases, whereas fungi and archaea shared most of their secreted peptidase families. Principle coordinate analysis of the peptidase subfamily-based dissimilarities showed distinguishable clusters for different groups of microorganisms. The distribution of

  8. Facilitation can increase the phylogenetic diversity of plant communities.

    PubMed

    Valiente-Banuet, Alfonso; Verdú, Miguel

    2007-11-01

    With the advent of molecular phylogenies the assessment of community assembly processes has become a central topic in community ecology. These processes have focused almost exclusively on habitat filtering and competitive exclusion. Recent evidence, however, indicates that facilitation has been important in preserving biodiversity over evolutionary time, with recent lineages conserving the regeneration niches of older, distant lineages. Here we test whether, if facilitation among distant-related species has preserved the regeneration niche of plant lineages, this has increased the phylogenetic diversity of communities. By analyzing a large worldwide database of species, we showed that the regeneration niches were strongly conserved across evolutionary history. Likewise, a phylogenetic supertree of all species of three communities driven by facilitation showed that nurse species facilitated distantly related species and increased phylogenetic diversity.

  9. A fully resolved consensus between fully resolved phylogenetic trees.

    PubMed

    Quitzau, José Augusto Amgarten; Meidanis, João

    2006-03-31

    Nowadays, there are many phylogeny reconstruction methods, each with advantages and disadvantages. We explored the advantages of each method, putting together the common parts of trees constructed by several methods, by means of a consensus computation. A number of phylogenetic consensus methods are already known. Unfortunately, there is also a taboo concerning consensus methods, because most biologists see them mainly as comparators and not as phylogenetic tree constructors. We challenged this taboo by defining a consensus method that builds a fully resolved phylogenetic tree based on the most common parts of fully resolved trees in a given collection. We also generated results showing that this consensus is in a way a kind of "median" of the input trees; as such it can be closer to the correct tree in many situations.

  10. Morphometric study of phylogenetic and ecologic signals in procyonid (mammalia: carnivora) endocasts.

    PubMed

    Ahrens, Heather E

    2014-12-01

    Endocasts provide a proxy for brain morphology but are rarely incorporated in phylogenetic analyses despite the potential for new suites of characters. The phylogeny of Procyonidae, a carnivoran family with relatively limited taxonomic diversity, is not well resolved because morphological and molecular data yield conflicting topologies. The presence of phylogenetic and ecologic signals in the endocasts of procyonids will be determined using three-dimensional geometric morphometrics. Endocasts of seven ingroup species and four outgroup species were digitally rendered and 21 landmarks were collected from the endocast surface. Two phylogenetic hypotheses of Procyonidae will be examined using methods testing for phylogenetic signal in morphometric data. In analyses of all taxa, there is significant phylogenetic signal in brain shape for both the morphological and molecular topologies. However, the analyses of ingroup taxa recover a significant phylogenetic signal for the morphological topology only. These results indicate support for the molecular outgroup topology, but not the ingroup topology given the brain shape data. Further examination of brain shape using principal components analysis and wireframe comparisons suggests procyonids possess more developed areas of the brain associated with motor control, spatial perception, and balance relative to the basal musteloid condition. Within Procyonidae, similar patterns of variation are present, and may be associated with increased arboreality in certain taxa. Thus, brain shape derived from endocasts may be used to test for phylogenetic signal and preliminary analyses suggest an association with behavior and ecology. © 2014 Wiley Periodicals, Inc.

  11. A RAD-based phylogenetics for Orestias fishes from Lake Titicaca.

    PubMed

    Takahashi, Tetsumi; Moreno, Edmundo

    2015-12-01

    The fish genus Orestias is endemic to the Andes highlands, and Lake Titicaca is the centre of the species diversity of the genus. Previous phylogenetic studies based on a single locus of mitochondrial and nuclear DNA strongly support the monophyly of a group composed of many of species endemic to the Lake Titicaca basin (the Lake Titicaca radiation), but the relationships among the species in the radiation remain unclear. Recently, restriction site-associated DNA (RAD) sequencing, which can produce a vast number of short sequences from various loci of nuclear DNA, has emerged as a useful way to resolve complex phylogenetic problems. To propose a new phylogenetic hypothesis of Orestias fishes of the Lake Titicaca radiation, we conducted a cluster analysis based on morphological similarities among fish samples and a molecular phylogenetic analysis based on RAD sequencing. From a morphological cluster analysis, we recognised four species groups in the radiation, and three of the four groups were resolved as monophyletic groups in maximum-likelihood trees based on RAD sequencing data. The other morphology-based group was not resolved as a monophyletic group in molecular phylogenies, and some members of the group were diverged from its sister group close to the root of the Lake Titicaca radiation. The evolution of these fishes is discussed from the phylogenetic relationships. Copyright © 2015 Elsevier Inc. All rights reserved.

  12. Iteratively Refined Guide Trees Help Improving Alignment and Phylogenetic Inference in the Mushroom Family Bolbitiaceae

    PubMed Central

    Tóth, Annamária; Hausknecht, Anton; Krisai-Greilhuber, Irmgard; Papp, Tamás; Vágvölgyi, Csaba; Nagy, László G.

    2013-01-01

    Reconciling traditional classifications, morphology, and the phylogenetic relationships of brown-spored agaric mushrooms has proven difficult in many groups, due to extensive convergence in morphological features. Here, we address the monophyly of the Bolbitiaceae, a family with over 700 described species and examine the higher-level relationships within the family using a newly constructed multilocus dataset (ITS, nrLSU rDNA and EF1-alpha). We tested whether the fast-evolving Internal Transcribed Spacer (ITS) sequences can be accurately aligned across the family, by comparing the outcome of two iterative alignment refining approaches (an automated and a manual) and various indel-treatment strategies. We used PRANK to align sequences in both cases. Our results suggest that – although PRANK successfully evades overmatching of gapped sites, referred previously to as alignment overmatching – it infers an unrealistically high number of indel events with natively generated guide-trees. This 'alignment undermatching' could be avoided by using more rigorous (e.g. ML) guide trees. The trees inferred in this study support the monophyly of the core Bolbitiaceae, with the exclusion of Panaeolus, Agrocybe, and some of the genera formerly placed in the family. Bolbitius and Conocybe were found monophyletic, however, Pholiotina and Galerella require redefinition. The phylogeny revealed that stipe coverage type is a poor predictor of phylogenetic relationships, indicating the need for a revision of the intrageneric relationships within Conocybe. PMID:23418526

  13. Plant Biodiversity Drivers in Brazilian Campos Rupestres: Insights from Phylogenetic Structure

    PubMed Central

    Zappi, Daniela C.; Moro, Marcelo F.; Meagher, Thomas R.; Nic Lughadha, Eimear

    2017-01-01

    Old, climate-buffered infertile landscapes (Ocbils) have attracted increasing levels of interest in recent years because of their exceptionally diverse plant communities. Brazil’s campos rupestres (rupestrian grasslands) are home to almost 15% of Brazil’s native flora in less than 0.8% of Brazil’s territory: an ideal study system for exploring variation in floristic diversity and phylogenetic structure in sites differing in geology and phytophysiognomy. We found significant differences in floristic diversity and phylogenetic structure across a range of study sites encompassing open vegetation and forest on quartzite (FQ) and on ironstone substrates, commonly termed canga. Substrate and physiognomy were key in structuring floristic diversity in the Espinhaço and physiognomy was more important than substrate in structuring phylogenetic diversity, with neither substrate nor its interaction with physiognomy accounting for significant variation in phylogenetic structure. Phylogenetic clustering was significant in open vegetation on both canga and quartzite, reflecting the potential role of environmental filtering in these exposed montane communities adapted to multiple environmental stressors. In forest communities, phylogenetic clustering was significant only at relatively deep nodes of the phylogeny in FQ while no significant phylogenetic clustering was detected across forest on canga (FC), which may be attributable to proximity to the megadiverse Atlantic forest biome and/or comparatively benign environmental conditions in FC with relatively deep, nutrient-rich soils and access to edaphic water reliable in comparison to those for open vegetation on canga and open or forest communities on quartzite. Clades representing relatively old lineages are significantly over-represented in campos rupestres on quartzite, consistent with the Gondwanan Heritage Hypothesis of Ocbil theory. In contrast, forested sites on canga are recognized as Yodfels. To be effective

  14. Plant Biodiversity Drivers in Brazilian Campos Rupestres: Insights from Phylogenetic Structure.

    PubMed

    Zappi, Daniela C; Moro, Marcelo F; Meagher, Thomas R; Nic Lughadha, Eimear

    2017-01-01

    Old, climate-buffered infertile landscapes (Ocbils) have attracted increasing levels of interest in recent years because of their exceptionally diverse plant communities. Brazil's campos rupestres (rupestrian grasslands) are home to almost 15% of Brazil's native flora in less than 0.8% of Brazil's territory: an ideal study system for exploring variation in floristic diversity and phylogenetic structure in sites differing in geology and phytophysiognomy. We found significant differences in floristic diversity and phylogenetic structure across a range of study sites encompassing open vegetation and forest on quartzite (FQ) and on ironstone substrates, commonly termed canga . Substrate and physiognomy were key in structuring floristic diversity in the Espinhaço and physiognomy was more important than substrate in structuring phylogenetic diversity, with neither substrate nor its interaction with physiognomy accounting for significant variation in phylogenetic structure. Phylogenetic clustering was significant in open vegetation on both canga and quartzite, reflecting the potential role of environmental filtering in these exposed montane communities adapted to multiple environmental stressors. In forest communities, phylogenetic clustering was significant only at relatively deep nodes of the phylogeny in FQ while no significant phylogenetic clustering was detected across forest on canga (FC), which may be attributable to proximity to the megadiverse Atlantic forest biome and/or comparatively benign environmental conditions in FC with relatively deep, nutrient-rich soils and access to edaphic water reliable in comparison to those for open vegetation on canga and open or forest communities on quartzite. Clades representing relatively old lineages are significantly over-represented in campos rupestres on quartzite, consistent with the Gondwanan Heritage Hypothesis of Ocbil theory. In contrast, forested sites on canga are recognized as Yodfels. To be effective

  15. Phylogenetic systematics of the genus Echinococcus (Cestoda: Taeniidae).

    PubMed

    Nakao, Minoru; Lavikainen, Antti; Yanagida, Tetsuya; Ito, Akira

    2013-11-01

    Echinococcosis is a serious helminthic zoonosis in humans, livestock and wildlife. The pathogenic organisms are members of the genus Echinococcus (Cestoda: Taeniidae). Life cycles of Echinococcus spp. are consistently dependent on predator-prey association between two obligate mammalian hosts. Carnivores (canids and felids) serve as definitive hosts for adult tapeworms and their herbivore prey (ungulates, rodents and lagomorphs) as intermediate hosts for metacestode larvae. Humans are involved as an accidental host for metacestode infections. The metacestodes develop in various internal organs, particularly in liver and lungs. Each metacestode of Echinococcus spp. has an organotropism and a characteristic form known as an unilocular (cystic), alveolar or polycystic hydatid. Recent molecular phylogenetic studies have demonstrated that the type species, Echinococcus granulosus, causing cystic echinococcosis is a cryptic species complex. Therefore, the orthodox taxonomy of Echinococcus established from morphological criteria has been revised from the standpoint of phylogenetic systematics. Nine valid species including newly resurrected taxa are recognised as a result of the revision. This review summarises the recent advances in the phylogenetic systematics of Echinococcus, together with the historical backgrounds and molecular epidemiological aspects of each species. A new phylogenetic tree inferred from the mitochondrial genomes of all valid Echinococcus spp. is also presented. The taxonomic nomenclature for Echinococcus oligarthrus is shown to be incorrect and this name should be replaced with Echinococcus oligarthra. Copyright © 2013 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.

  16. Land-Sparing Agriculture Best Protects Avian Phylogenetic Diversity.

    PubMed

    Edwards, David P; Gilroy, James J; Thomas, Gavin H; Uribe, Claudia A Medina; Haugaasen, Torbjørn

    2015-09-21

    The conversion of natural habitats to farmland is a major driver of the global extinction crisis. Two strategies are promoted to mitigate the impacts of agricultural expansion on biodiversity: land sharing integrates wildlife-friendly habitats within farmland landscapes, and land sparing intensifies farming to allow the offset of natural reserves. A key question is which strategy would protect the most phylogenetic diversity--the total evolutionary history shared across all species within a community. Conserving phylogenetic diversity decreases the chance of losing unique phenotypic and ecological traits and provides benefits for ecosystem function and stability. Focusing on birds in the threatened Chocó-Andes hotspot of endemism, we tested the relative benefits of each strategy for retaining phylogenetic diversity in tropical cloud forest landscapes threatened by cattle pastures. Using landscape simulations, we find that land sharing would protect lower community-level phylogenetic diversity than land sparing and that with increasing distance from forest (from 500 to >1,500 m), land sharing is increasingly inferior to land sparing. Isolation from forest also leads to the loss of more evolutionarily distinct species from communities within land-sharing landscapes, which can be avoided with effective land sparing. Land-sharing policies that promote the integration of small-scale wildlife-friendly habitats might be of limited benefit without the simultaneous protection of larger blocks of natural habitat, which is most likely to be achieved via land-sparing measures. Copyright © 2015 Elsevier Ltd. All rights reserved.

  17. PhyLIS: a simple GNU/Linux distribution for phylogenetics and phyloinformatics.

    PubMed

    Thomson, Robert C

    2009-07-30

    PhyLIS is a free GNU/Linux distribution that is designed to provide a simple, standardized platform for phylogenetic and phyloinformatic analysis. The operating system incorporates most commonly used phylogenetic software, which has been pre-compiled and pre-configured, allowing for straightforward application of phylogenetic methods and development of phyloinformatic pipelines in a stable Linux environment. The software is distributed as a live CD and can be installed directly or run from the CD without making changes to the computer. PhyLIS is available for free at http://www.eve.ucdavis.edu/rcthomson/phylis/.

  18. CDAO-Store: Ontology-driven Data Integration for Phylogenetic Analysis

    PubMed Central

    2011-01-01

    Background The Comparative Data Analysis Ontology (CDAO) is an ontology developed, as part of the EvoInfo and EvoIO groups supported by the National Evolutionary Synthesis Center, to provide semantic descriptions of data and transformations commonly found in the domain of phylogenetic analysis. The core concepts of the ontology enable the description of phylogenetic trees and associated character data matrices. Results Using CDAO as the semantic back-end, we developed a triple-store, named CDAO-Store. CDAO-Store is a RDF-based store of phylogenetic data, including a complete import of TreeBASE. CDAO-Store provides a programmatic interface, in the form of web services, and a web-based front-end, to perform both user-defined as well as domain-specific queries; domain-specific queries include search for nearest common ancestors, minimum spanning clades, filter multiple trees in the store by size, author, taxa, tree identifier, algorithm or method. In addition, CDAO-Store provides a visualization front-end, called CDAO-Explorer, which can be used to view both character data matrices and trees extracted from the CDAO-Store. CDAO-Store provides import capabilities, enabling the addition of new data to the triple-store; files in PHYLIP, MEGA, nexml, and NEXUS formats can be imported and their CDAO representations added to the triple-store. Conclusions CDAO-Store is made up of a versatile and integrated set of tools to support phylogenetic analysis. To the best of our knowledge, CDAO-Store is the first semantically-aware repository of phylogenetic data with domain-specific querying capabilities. The portal to CDAO-Store is available at http://www.cs.nmsu.edu/~cdaostore. PMID:21496247

  19. CDAO-store: ontology-driven data integration for phylogenetic analysis.

    PubMed

    Chisham, Brandon; Wright, Ben; Le, Trung; Son, Tran Cao; Pontelli, Enrico

    2011-04-15

    The Comparative Data Analysis Ontology (CDAO) is an ontology developed, as part of the EvoInfo and EvoIO groups supported by the National Evolutionary Synthesis Center, to provide semantic descriptions of data and transformations commonly found in the domain of phylogenetic analysis. The core concepts of the ontology enable the description of phylogenetic trees and associated character data matrices. Using CDAO as the semantic back-end, we developed a triple-store, named CDAO-Store. CDAO-Store is a RDF-based store of phylogenetic data, including a complete import of TreeBASE. CDAO-Store provides a programmatic interface, in the form of web services, and a web-based front-end, to perform both user-defined as well as domain-specific queries; domain-specific queries include search for nearest common ancestors, minimum spanning clades, filter multiple trees in the store by size, author, taxa, tree identifier, algorithm or method. In addition, CDAO-Store provides a visualization front-end, called CDAO-Explorer, which can be used to view both character data matrices and trees extracted from the CDAO-Store. CDAO-Store provides import capabilities, enabling the addition of new data to the triple-store; files in PHYLIP, MEGA, nexml, and NEXUS formats can be imported and their CDAO representations added to the triple-store. CDAO-Store is made up of a versatile and integrated set of tools to support phylogenetic analysis. To the best of our knowledge, CDAO-Store is the first semantically-aware repository of phylogenetic data with domain-specific querying capabilities. The portal to CDAO-Store is available at http://www.cs.nmsu.edu/~cdaostore.

  20. The ethnobotany of psychoactive plant use: a phylogenetic perspective

    PubMed Central

    2016-01-01

    Psychoactive plants contain chemicals that presumably evolved as allelochemicals but target certain neuronal receptors when consumed by humans, altering perception, emotion and cognition. These plants have been used since ancient times as medicines and in the context of religious rituals for their various psychoactive effects (e.g., as hallucinogens, stimulants, sedatives). The ubiquity of psychoactive plants in various cultures motivates investigation of the commonalities among these plants, in which a phylogenetic framework may be insightful. A phylogeny of culturally diverse psychoactive plant taxa was constructed with their psychotropic effects and affected neurotransmitter systems mapped on the phylogeny. The phylogenetic distribution shows multiple evolutionary origins of psychoactive families. The plant families Myristicaceae (e.g., nutmeg), Papaveraceae (opium poppy), Cactaceae (peyote), Convolvulaceae (morning glory), Solanaceae (tobacco), Lamiaceae (mints), Apocynaceae (dogbane) have a disproportionate number of psychoactive genera with various indigenous groups using geographically disparate members of these plant families for the same psychoactive effect, an example of cultural convergence. Pharmacological traits related to hallucinogenic and sedative potential are phylogenetically conserved within families. Unrelated families that exert similar psychoactive effects also modulate similar neurotransmitter systems (i.e., mechanistic convergence). However, pharmacological mechanisms for stimulant effects were varied even within families suggesting that stimulant chemicals may be more evolutionarily labile than those associated with hallucinogenic and sedative effects. Chemically similar psychoactive chemicals may also exist in phylogenetically unrelated lineages, suggesting convergent evolution or differential gene regulation of a common metabolic pathway. Our study has shown that phylogenetic analysis of traditionally used psychoactive plants suggests

  1. The ethnobotany of psychoactive plant use: a phylogenetic perspective.

    PubMed

    Alrashedy, Nashmiah Aid; Molina, Jeanmaire

    2016-01-01

    Psychoactive plants contain chemicals that presumably evolved as allelochemicals but target certain neuronal receptors when consumed by humans, altering perception, emotion and cognition. These plants have been used since ancient times as medicines and in the context of religious rituals for their various psychoactive effects (e.g., as hallucinogens, stimulants, sedatives). The ubiquity of psychoactive plants in various cultures motivates investigation of the commonalities among these plants, in which a phylogenetic framework may be insightful. A phylogeny of culturally diverse psychoactive plant taxa was constructed with their psychotropic effects and affected neurotransmitter systems mapped on the phylogeny. The phylogenetic distribution shows multiple evolutionary origins of psychoactive families. The plant families Myristicaceae (e.g., nutmeg), Papaveraceae (opium poppy), Cactaceae (peyote), Convolvulaceae (morning glory), Solanaceae (tobacco), Lamiaceae (mints), Apocynaceae (dogbane) have a disproportionate number of psychoactive genera with various indigenous groups using geographically disparate members of these plant families for the same psychoactive effect, an example of cultural convergence. Pharmacological traits related to hallucinogenic and sedative potential are phylogenetically conserved within families. Unrelated families that exert similar psychoactive effects also modulate similar neurotransmitter systems (i.e., mechanistic convergence). However, pharmacological mechanisms for stimulant effects were varied even within families suggesting that stimulant chemicals may be more evolutionarily labile than those associated with hallucinogenic and sedative effects. Chemically similar psychoactive chemicals may also exist in phylogenetically unrelated lineages, suggesting convergent evolution or differential gene regulation of a common metabolic pathway. Our study has shown that phylogenetic analysis of traditionally used psychoactive plants suggests

  2. Phylogenetic constrains on mycorrhizal specificity in eight Dendrobium (Orchidaceae) species.

    PubMed

    Xing, Xiaoke; Ma, Xueting; Men, Jinxin; Chen, Yanhong; Guo, Shunxing

    2017-05-01

    Plant phylogeny constrains orchid mycorrhizal (OrM) fungal community composition in some orchids. Here, we investigated the structures of the OrM fungal communities of eight Dendrobium species in one niche to determine whether similarities in the OrM fungal communities correlated with the phylogeny of the host plants and whether the Dendrobium-OrM fungal interactions are phylogenetically conserved. A phylogeny based on DNA data was constructed for the eight coexisting Dendrobium species, and the OrM fungal communities were characterized by their roots. There were 31 different fungal lineages associated with the eight Dendrobium species. In total, 82.98% of the identified associations belonging to Tulasnellaceae, and a smaller proportion involved members of the unknown Basidiomycota (9.67%). Community analyses revealed that phylogenetically related Dendrobium tended to interact with a similar set of Tulasnellaceae fungi. The interactions between Dendrobium and Tulasnellaceae fungi were significantly influenced by the phylogenetic relationships among the Dendrobium species. Our results provide evidence that the mycorrhizal specificity in the eight coexisting Dendrobium species was phylogenetically conserved.

  3. Phylogenetic Network for European mtDNA

    PubMed Central

    Finnilä, Saara; Lehtonen, Mervi S.; Majamaa, Kari

    2001-01-01

    The sequence in the first hypervariable segment (HVS-I) of the control region has been used as a source of evolutionary information in most phylogenetic analyses of mtDNA. Population genetic inference would benefit from a better understanding of the variation in the mtDNA coding region, but, thus far, complete mtDNA sequences have been rare. We determined the nucleotide sequence in the coding region of mtDNA from 121 Finns, by conformation-sensitive gel electrophoresis and subsequent sequencing and by direct sequencing of the D loop. Furthermore, 71 sequences from our previous reports were included, so that the samples represented all the mtDNA haplogroups present in the Finnish population. We found a total of 297 variable sites in the coding region, which allowed the compilation of unambiguous phylogenetic networks. The D loop harbored 104 variable sites, and, in most cases, these could be localized within the coding-region networks, without discrepancies. Interestingly, many homoplasies were detected in the coding region. Nucleotide variation in the rRNA and tRNA genes was 6%, and that in the third nucleotide positions of structural genes amounted to 22% of that in the HVS-I. The complete networks enabled the relationships between the mtDNA haplogroups to be analyzed. Phylogenetic networks based on the entire coding-region sequence in mtDNA provide a rich source for further population genetic studies, and complete sequences make it easier to differentiate between disease-causing mutations and rare polymorphisms. PMID:11349229

  4. Marine turtle mitogenome phylogenetics and evolution.

    PubMed

    Duchene, Sebastián; Frey, Amy; Alfaro-Núñez, Alonzo; Dutton, Peter H; Thomas P Gilbert, M; Morin, Phillip A

    2012-10-01

    The sea turtles are a group of cretaceous origin containing seven recognized living species: leatherback, hawksbill, Kemp's ridley, olive ridley, loggerhead, green, and flatback. The leatherback is the single member of the Dermochelidae family, whereas all other sea turtles belong in Cheloniidae. Analyses of partial mitochondrial sequences and some nuclear markers have revealed phylogenetic inconsistencies within Cheloniidae, especially regarding the placement of the flatback. Population genetic studies based on D-Loop sequences have shown considerable structuring in species with broad geographic distributions, shedding light on complex migration patterns and possible geographic or climatic events as driving forces of sea-turtle distribution. We have sequenced complete mitogenomes for all sea-turtle species, including samples from their geographic range extremes, and performed phylogenetic analyses to assess sea-turtle evolution with a large molecular dataset. We found variation in the length of the ATP8 gene and a highly variable site in ND4 near a proton translocation channel in the resulting protein. Complete mitogenomes show strong support and resolution for phylogenetic relationships among all sea turtles, and reveal phylogeographic patterns within globally-distributed species. Although there was clear concordance between phylogenies and geographic origin of samples in most taxa, we found evidence of more recent dispersal events in the loggerhead and olive ridley turtles, suggesting more recent migrations (<1 Myr) in these species. Overall, our results demonstrate the complexity of sea-turtle diversity, and indicate the need for further research in phylogeography and molecular evolution. Published by Elsevier Inc.

  5. Phylogenetic relationships among arecoid palms (Arecaceae: Arecoideae)

    PubMed Central

    Baker, William J.; Norup, Maria V.; Clarkson, James J.; Couvreur, Thomas L. P.; Dowe, John L.; Lewis, Carl E.; Pintaud, Jean-Christophe; Savolainen, Vincent; Wilmot, Tomas; Chase, Mark W.

    2011-01-01

    Background and Aims The Arecoideae is the largest and most diverse of the five subfamilies of palms (Arecaceae/Palmae), containing >50 % of the species in the family. Despite its importance, phylogenetic relationships among Arecoideae are poorly understood. Here the most densely sampled phylogenetic analysis of Arecoideae available to date is presented. The results are used to test the current classification of the subfamily and to identify priority areas for future research. Methods DNA sequence data for the low-copy nuclear genes PRK and RPB2 were collected from 190 palm species, covering 103 (96 %) genera of Arecoideae. The data were analysed using the parsimony ratchet, maximum likelihood, and both likelihood and parsimony bootstrapping. Key Results and Conclusions Despite the recovery of paralogues and pseudogenes in a small number of taxa, PRK and RPB2 were both highly informative, producing well-resolved phylogenetic trees with many nodes well supported by bootstrap analyses. Simultaneous analyses of the combined data sets provided additional resolution and support. Two areas of incongruence between PRK and RPB2 were strongly supported by the bootstrap relating to the placement of tribes Chamaedoreeae, Iriarteeae and Reinhardtieae; the causes of this incongruence remain uncertain. The current classification within Arecoideae was strongly supported by the present data. Of the 14 tribes and 14 sub-tribes in the classification, only five sub-tribes from tribe Areceae (Basseliniinae, Linospadicinae, Oncospermatinae, Rhopalostylidinae and Verschaffeltiinae) failed to receive support. Three major higher level clades were strongly supported: (1) the RRC clade (Roystoneeae, Reinhardtieae and Cocoseae), (2) the POS clade (Podococceae, Oranieae and Sclerospermeae) and (3) the core arecoid clade (Areceae, Euterpeae, Geonomateae, Leopoldinieae, Manicarieae and Pelagodoxeae). However, new data sources are required to elucidate ambiguities that remain in phylogenetic

  6. Dimensional Reduction for the General Markov Model on Phylogenetic Trees.

    PubMed

    Sumner, Jeremy G

    2017-03-01

    We present a method of dimensional reduction for the general Markov model of sequence evolution on a phylogenetic tree. We show that taking certain linear combinations of the associated random variables (site pattern counts) reduces the dimensionality of the model from exponential in the number of extant taxa, to quadratic in the number of taxa, while retaining the ability to statistically identify phylogenetic divergence events. A key feature is the identification of an invariant subspace which depends only bilinearly on the model parameters, in contrast to the usual multi-linear dependence in the full space. We discuss potential applications including the computation of split (edge) weights on phylogenetic trees from observed sequence data.

  7. Elongation Factor-1α Accurately Reconstructs Relationships Amongst Psyllid Families (Hemiptera: Psylloidea), with Possible Diagnostic Implications.

    PubMed

    Martoni, Francesco; Bulman, Simon R; Pitman, Andrew; Armstrong, Karen F

    2017-12-05

    The superfamily Psylloidea (Hemiptera: Sternorrhyncha) lacks a robust multigene phylogeny. This impedes our understanding of the evolution of this group of insects and, consequently, an accurate identification of individuals, of their plant host associations, and their roles as vectors of economically important plant pathogens. The conserved nuclear gene elongation factor-1 alpha (EF-1α) has been valuable as a higher-level phylogenetic marker in insects and it has also been widely used to investigate the evolution of intron/exon structure. To explore evolutionary relationships among Psylloidea, polymerase chain reaction amplification and nucleotide sequencing of a 250-bp EF-1α gene fragment was applied to psyllids belonging to five different families. Introns were detected in three individuals belonging to two families. The nine genera belonging to the family Aphalaridae all lacked introns, highlighting the possibility of using intron presence/absence as a diagnostic tool at a family level. When paired with cytochrome oxidase I gene sequences, the 250 bp EF-1α sequence appeared to be a very promising higher-level phylogenetic marker for psyllids. © The Author(s) 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  8. Diversity and Phylogenetic Structure of Two Complex Marine Microbial Communities

    DTIC Science & Technology

    2004-09-01

    Science 190 and Engineering DOCTORAL DISSERTATION Diversity and Phylogenetic Structure of Two Complex Marine Microbial Communities by Vanja Klepac-Ceraj...Two Complex Marine Microbial Communities by Vanja Klepac-Ceraj Massachusetts Institute of Technology Cambridge, Massachusetts 02139 and Woods Hole...Phylogenetic Structure of Two Complex Marine Microbial Communities. Ph.D. Thesis. MIT/WHOI, 2004-11. Approved for publication; distribution unlimited

  9. Phylogenetic evidence for a case of misleading rather than mislabeling in caviar in the United Kingdom.

    PubMed

    Johnson, Tania Aspasia; Iyengar, Arati

    2015-01-01

    Sturgeons and paddlefish are freshwater fish which are highly valued for their caviar. Despite the fact that every single species of sturgeon and paddlefish is listed under CITES, there are reports of illegal trade in caviar where products are deliberately mislabeled. Three samples of caviar purchased in the United Kingdom were investigated for accurate CITES labeling using COI and cyt b sequencing. Initial species identification was carried out using BLAST followed by phylogenetic analyses using both maximum parsimony and maximum likelihood methods. Results showed no evidence for mislabeling with respect to CITES labels in any of the three samples, but we observed clear evidence for a case of misleading the customer in one sample. © 2014 American Academy of Forensic Sciences.

  10. Proximity within interphase chromosome contributes to the breakpoint distribution in radiation-induced intrachromosomal exchanges

    NASA Astrophysics Data System (ADS)

    Zhang, Ye; Uhlemeyer, Jimmy; Hada, Megumi; Asaithamby, A.; Chen, David J.; Wu, Honglu

    2014-07-01

    Previously, we reported that breaks involved in chromosome aberrations were clustered in several regions of chromosome 3 in human mammary epithelial cells after exposures to either low- or high-LET radiation. In particular, breaks in certain regions of the chromosome tended to rejoin with each other to form an intrachromosome exchange event. This study tests the hypothesis that proximity within a single chromosome in interphase cell nuclei contributes to the distribution of radiation-induced chromosome breaks. Chromosome 3 in G1 human mammary epithelial cells was hybridized with the multicolor banding in situ hybridization (mBAND) probes that distinguish the chromosome in six differently colored regions, and the location of these regions was measured with a laser confocal microscope. Results of the study indicated that, on a multi-mega base pair scale of the DNA, the arrangement of chromatin was non-random. Both telomere regions tended to be located towards the exterior of the chromosome domain, whereas the centromere region towards the interior. In addition, the interior of the chromosome domain was preferentially occupied by the p-arm of the chromatin, which is consistent with our previous finding of intrachromosome exchanges involving breaks on the p-arm and in the centromere region of chromosome 3. Other factors, such as the fragile sites in the 3p21 band and gene regulation, may also contribute to the breakpoint distribution in radiation-induced chromosome aberrations.

  11. Phylogenetic classification of Aureobasidium pullulans strains for production of feruloyl esterase

    USDA-ARS?s Scientific Manuscript database

    The objective was to phylogenetically classify diverse strains of A. pullulans and determine their production of feruloyl esterase. Seventeen strains from the A. pullulans literature were phylogenetically classified. Phenotypic traits of color variation and endo-ß-1,4-xylanase overproduction were as...

  12. The complete mitochondrial genome and phylogenetic analysis of the giant panda (Ailuropoda melanoleuca).

    PubMed

    Peng, Rui; Zeng, Bo; Meng, Xiuxiang; Yue, Bisong; Zhang, Zhihe; Zou, Fangdong

    2007-08-01

    The complete mitochondrial genome sequence of the giant panda, Ailuropoda melanoleuca, was determined by the long and accurate polymerase chain reaction (LA-PCR) with conserved primers and primer walking sequence methods. The complete mitochondrial DNA is 16,805 nucleotides in length and contains two ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and one control region. The total length of the 13 protein-coding genes is longer than the American black bear, brown bear and polar bear by 3 amino acids at the end of ND5 gene. The codon usage also followed the typical vertebrate pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 5 (ND5) gene. The molecular phylogenetic analysis was performed on the sequences of 12 concatenated heavy-strand encoded protein-coding genes, and suggested that the giant panda is most closely related to bears.

  13. Adaptive MCMC in Bayesian phylogenetics: an application to analyzing partitioned data in BEAST.

    PubMed

    Baele, Guy; Lemey, Philippe; Rambaut, Andrew; Suchard, Marc A

    2017-06-15

    Advances in sequencing technology continue to deliver increasingly large molecular sequence datasets that are often heavily partitioned in order to accurately model the underlying evolutionary processes. In phylogenetic analyses, partitioning strategies involve estimating conditionally independent models of molecular evolution for different genes and different positions within those genes, requiring a large number of evolutionary parameters that have to be estimated, leading to an increased computational burden for such analyses. The past two decades have also seen the rise of multi-core processors, both in the central processing unit (CPU) and Graphics processing unit processor markets, enabling massively parallel computations that are not yet fully exploited by many software packages for multipartite analyses. We here propose a Markov chain Monte Carlo (MCMC) approach using an adaptive multivariate transition kernel to estimate in parallel a large number of parameters, split across partitioned data, by exploiting multi-core processing. Across several real-world examples, we demonstrate that our approach enables the estimation of these multipartite parameters more efficiently than standard approaches that typically use a mixture of univariate transition kernels. In one case, when estimating the relative rate parameter of the non-coding partition in a heterochronous dataset, MCMC integration efficiency improves by > 14-fold. Our implementation is part of the BEAST code base, a widely used open source software package to perform Bayesian phylogenetic inference. guy.baele@kuleuven.be. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  14. Reliability, Validity, and Sensitivity of a Novel Smartphone-Based Eccentric Hamstring Strength Test in Professional Football Players.

    PubMed

    Lee, Justin W Y; Cai, Ming-Jing; Yung, Patrick S H; Chan, Kai-Ming

    2018-05-01

    To evaluate the test-retest reliability, sensitivity, and concurrent validity of a smartphone-based method for assessing eccentric hamstring strength among male professional football players. A total of 25 healthy male professional football players performed the Chinese University of Hong Kong (CUHK) Nordic break-point test, hamstring fatigue protocol, and isokinetic hamstring strength test. The CUHK Nordic break-point test is based on a Nordic hamstring exercise. The Nordic break-point angle was defined as the maximum point where the participant could no longer support the weight of his body against gravity. The criterion for the sensitivity test was the presprinting and postsprinting difference of the Nordic break-point angle with a hamstring fatigue protocol. The hamstring fatigue protocol consists of 12 repetitions of the 30-m sprint with 30-s recoveries between sprints. Hamstring peak torque of the isokinetic hamstring strength test was used as the criterion for validity. A high test-retest reliability (intraclass correlation coefficient = .94; 95% confidence interval, .82-.98) was found in the Nordic break-point angle measurements. The Nordic break-point angle significantly correlated with isokinetic hamstring peak torques at eccentric action of 30°/s (r = .88, r 2  = .77, P < .001). The minimal detectable difference was 8.03°. The sensitivity of the measure was good enough that a significance difference (effect size = 0.70, P < .001) was found between presprinting and postsprinting values. The CUHK Nordic break-point test is a simple, portable, quick smartphone-based method to provide reliable and accurate eccentric hamstring strength measures among male professional football players.

  15. On the Shapley Value of Unrooted Phylogenetic Trees.

    PubMed

    Wicke, Kristina; Fischer, Mareike

    2018-01-17

    The Shapley value, a solution concept from cooperative game theory, has recently been considered for both unrooted and rooted phylogenetic trees. Here, we focus on the Shapley value of unrooted trees and first revisit the so-called split counts of a phylogenetic tree and the Shapley transformation matrix that allows for the calculation of the Shapley value from the edge lengths of a tree. We show that non-isomorphic trees may have permutation-equivalent Shapley transformation matrices and permutation-equivalent null spaces. This implies that estimating the split counts associated with a tree or the Shapley values of its leaves does not suffice to reconstruct the correct tree topology. We then turn to the use of the Shapley value as a prioritization criterion in biodiversity conservation and compare it to a greedy solution concept. Here, we show that for certain phylogenetic trees, the Shapley value may fail as a prioritization criterion, meaning that the diversity spanned by the top k species (ranked by their Shapley values) cannot approximate the total diversity of all n species.

  16. Dual phylogenetic origins of Nigerian lions (Panthera leo)

    PubMed Central

    Tende, Talatu; Bensch, Staffan; Ottosson, Ulf; Hansson, Bengt

    2014-01-01

    Lion fecal DNA extracts from four individuals each from Yankari Game Reserve and Kainji-Lake National Park (central northeast and west Nigeria, respectively) were Sanger-sequenced for the mitochondrial cytochrome b gene. The sequences were aligned against 61 lion reference sequences from other parts of Africa and India. The sequence data were analyzed further for the construction of phylogenetic trees using the maximum-likelihood approach to depict phylogenetic patterns of distribution among sequences. Our results show that Nigerian lions grouped together with lions from West and Central Africa. At the smaller geographical scale, lions from Kainji-Lake National Park in western Nigeria grouped with lions from Benin (located west of Nigeria), whereas lions from Yankari Game Reserve in central northeastern Nigeria grouped with the lion populations in Cameroon (located east of Nigeria). The finding that the two remaining lion populations in Nigeria have different phylogenetic origins is an important aspect to consider in future decisions regarding management and conservation of rapidly shrinking lion populations in West Africa. PMID:25077018

  17. The Phylogenetics and Ecology of the Orthopoxviruses Endemic to North America

    PubMed Central

    Emerson, Ginny L.; Li, Yu; Frace, Michael A.; Olsen-Rasmussen, Melissa A.; Khristova, Marina L.; Govil, Dhwani; Sammons, Scott A.; Regnery, Russell L.; Karem, Kevin L.; Damon, Inger K.; Carroll, Darin S.

    2009-01-01

    The data presented herein support the North American orthopoxviruses (NA OPXV) in a sister relationship to all other currently described Orthopoxvirus (OPXV) species. This phylogenetic analysis reaffirms the identification of the NA OPXV as close relatives of “Old World” (Eurasian and African) OPXV and presents high support for deeper nodes within the Chordopoxvirinae family. The natural reservoir host(s) for many of the described OPXV species remains unknown although a clear virus-host association exists between the genus OPXV and several mammalian taxa. The hypothesized host associations and the deep divergence of the OPXV/NA OPXV clades depicted in this study may reflect the divergence patterns of the mammalian faunas of the Old and New World and reflect a more ancient presence of OPXV on what are now the American continents. Genes from the central region of the poxvirus genome are generally more conserved than genes from either end of the linear genome due to functional constraints imposed on viral replication abilities. The relatively slower evolution of these genes may more accurately reflect the deeper history among the poxvirus group, allowing for robust placement of the NA OPXV within Chordopoxvirinae. Sequence data for nine genes were compiled from three NA OPXV strains plus an additional 50 genomes collected from Genbank. The current, gene sequence based phylogenetic analysis reaffirms the identification of the NA OPXV as the nearest relatives of “Old World” OPXV and presents high support for deeper nodes within the Chordopoxvirinae family. Additionally, the substantial genetic distances that separate the currently described NA OPXV species indicate that it is likely that many more undescribed OPXV/NA OPXV species may be circulating among wild animals in North America. PMID:19865479

  18. Evaluating Fast Maximum Likelihood-Based Phylogenetic Programs Using Empirical Phylogenomic Data Sets

    PubMed Central

    Zhou, Xiaofan; Shen, Xing-Xing; Hittinger, Chris Todd

    2018-01-01

    Abstract The sizes of the data matrices assembled to resolve branches of the tree of life have increased dramatically, motivating the development of programs for fast, yet accurate, inference. For example, several different fast programs have been developed in the very popular maximum likelihood framework, including RAxML/ExaML, PhyML, IQ-TREE, and FastTree. Although these programs are widely used, a systematic evaluation and comparison of their performance using empirical genome-scale data matrices has so far been lacking. To address this question, we evaluated these four programs on 19 empirical phylogenomic data sets with hundreds to thousands of genes and up to 200 taxa with respect to likelihood maximization, tree topology, and computational speed. For single-gene tree inference, we found that the more exhaustive and slower strategies (ten searches per alignment) outperformed faster strategies (one tree search per alignment) using RAxML, PhyML, or IQ-TREE. Interestingly, single-gene trees inferred by the three programs yielded comparable coalescent-based species tree estimations. For concatenation-based species tree inference, IQ-TREE consistently achieved the best-observed likelihoods for all data sets, and RAxML/ExaML was a close second. In contrast, PhyML often failed to complete concatenation-based analyses, whereas FastTree was the fastest but generated lower likelihood values and more dissimilar tree topologies in both types of analyses. Finally, data matrix properties, such as the number of taxa and the strength of phylogenetic signal, sometimes substantially influenced the programs’ relative performance. Our results provide real-world gene and species tree phylogenetic inference benchmarks to inform the design and execution of large-scale phylogenomic data analyses. PMID:29177474

  19. PhyLIS: A Simple GNU/Linux Distribution for Phylogenetics and Phyloinformatics

    PubMed Central

    Thomson, Robert C.

    2009-01-01

    PhyLIS is a free GNU/Linux distribution that is designed to provide a simple, standardized platform for phylogenetic and phyloinformatic analysis. The operating system incorporates most commonly used phylogenetic software, which has been pre-compiled and pre-configured, allowing for straightforward application of phylogenetic methods and development of phyloinformatic pipelines in a stable Linux environment. The software is distributed as a live CD and can be installed directly or run from the CD without making changes to the computer. PhyLIS is available for free at http://www.eve.ucdavis.edu/rcthomson/phylis/. PMID:19812729

  20. Improving phylogenetic analyses by incorporating additional information from genetic sequence databases.

    PubMed

    Liang, Li-Jung; Weiss, Robert E; Redelings, Benjamin; Suchard, Marc A

    2009-10-01

    Statistical analyses of phylogenetic data culminate in uncertain estimates of underlying model parameters. Lack of additional data hinders the ability to reduce this uncertainty, as the original phylogenetic dataset is often complete, containing the entire gene or genome information available for the given set of taxa. Informative priors in a Bayesian analysis can reduce posterior uncertainty; however, publicly available phylogenetic software specifies vague priors for model parameters by default. We build objective and informative priors using hierarchical random effect models that combine additional datasets whose parameters are not of direct interest but are similar to the analysis of interest. We propose principled statistical methods that permit more precise parameter estimates in phylogenetic analyses by creating informative priors for parameters of interest. Using additional sequence datasets from our lab or public databases, we construct a fully Bayesian semiparametric hierarchical model to combine datasets. A dynamic iteratively reweighted Markov chain Monte Carlo algorithm conveniently recycles posterior samples from the individual analyses. We demonstrate the value of our approach by examining the insertion-deletion (indel) process in the enolase gene across the Tree of Life using the phylogenetic software BALI-PHY; we incorporate prior information about indels from 82 curated alignments downloaded from the BAliBASE database.

  1. Plant traits determine the phylogenetic structure of arbuscular mycorrhizal fungal communities.

    PubMed

    López-García, Álvaro; Varela-Cervero, Sara; Vasar, Martti; Öpik, Maarja; Barea, José M; Azcón-Aguilar, Concepción

    2017-12-01

    Functional diversity in ecosystems has traditionally been studied using aboveground plant traits. Despite the known effect of plant traits on the microbial community composition, their effects on the microbial functional diversity are only starting to be assessed. In this study, the phylogenetic structure of arbuscular mycorrhizal (AM) fungal communities associated with plant species differing in life cycle and growth form, that is, plant life forms, was determined to unravel the effect of plant traits on the functional diversity of this fungal group. The results of the 454 pyrosequencing showed that the AM fungal community composition differed across plant life forms and this effect was dependent on the soil collection date. Plants with ruderal characteristics tended to associate with phylogenetically clustered AM fungal communities. By contrast, plants with resource-conservative traits associated with phylogenetically overdispersed AM fungal communities. Additionally, the soil collected in different seasons yielded AM fungal communities with different phylogenetic dispersion. In summary, we found that the phylogenetic structure, and hence the functional diversity, of AM fungal communities is dependent on plant traits. This finding adds value to the use of plant traits for the evaluation of belowground ecosystem diversity, functions and processes. © 2017 John Wiley & Sons Ltd.

  2. GENOME-WIDE COMPARATIVE ANALYSIS OF PHYLOGENETIC TREES: THE PROKARYOTIC FOREST OF LIFE

    PubMed Central

    Puigbò, Pere; Wolf, Yuri I.; Koonin, Eugene V.

    2013-01-01

    Genome-wide comparison of phylogenetic trees is becoming an increasingly common approach in evolutionary genomics, and a variety of approaches for such comparison have been developed. In this article we present several methods for comparative analysis of large numbers of phylogenetic trees. To compare phylogenetic trees taking into account the bootstrap support for each internal branch, the Boot-Split Distance (BSD) method is introduced as an extension of the previously developed Split Distance (SD) method for tree comparison. The BSD method implements the straightforward idea that comparison of phylogenetic trees can be made more robust by treating tree splits differentially depending on the bootstrap support. Approaches are also introduced for detecting tree-like and net-like evolutionary trends in the phylogenetic Forest of Life (FOL), i.e., the entirety of the phylogenetic trees for conserved genes of prokaryotes. The principal method employed for this purpose includes mapping quartets of species onto trees to calculate the support of each quartet topology and so to quantify the tree and net contributions to the distances between species. We describe the applications methods used to analyze the FOL and the results obtained with these methods. These results support the concept of the Tree of Life (TOL) as a central evolutionary trend in the FOL as opposed to the traditional view of the TOL as a ‘species tree’. PMID:22399455

  3. Genome-wide comparative analysis of phylogenetic trees: the prokaryotic forest of life.

    PubMed

    Puigbò, Pere; Wolf, Yuri I; Koonin, Eugene V

    2012-01-01

    Genome-wide comparison of phylogenetic trees is becoming an increasingly common approach in evolutionary genomics, and a variety of approaches for such comparison have been developed. In this article, we present several methods for comparative analysis of large numbers of phylogenetic trees. To compare phylogenetic trees taking into account the bootstrap support for each internal branch, the Boot-Split Distance (BSD) method is introduced as an extension of the previously developed Split Distance method for tree comparison. The BSD method implements the straightforward idea that comparison of phylogenetic trees can be made more robust by treating tree splits differentially depending on the bootstrap support. Approaches are also introduced for detecting tree-like and net-like evolutionary trends in the phylogenetic Forest of Life (FOL), i.e., the entirety of the phylogenetic trees for conserved genes of prokaryotes. The principal method employed for this purpose includes mapping quartets of species onto trees to calculate the support of each quartet topology and so to quantify the tree and net contributions to the distances between species. We describe the application of these methods to analyze the FOL and the results obtained with these methods. These results support the concept of the Tree of Life (TOL) as a central evolutionary trend in the FOL as opposed to the traditional view of the TOL as a "species tree."

  4. Phylogenetic diversity anomaly in angiosperms between eastern Asia and eastern North America.

    PubMed

    Qian, Hong; Jin, Yi; Ricklefs, Robert E

    2017-10-24

    Although eastern Asia (EAS) and eastern North America (ENA) have similar climates, plant species richness in EAS greatly exceeds that in ENA. The degree to which this diversity difference reflects the ages of the floras or their rates of evolutionary diversification has not been quantified. Measures of species diversity that do not incorporate the ages of lineages disregard the evolutionary distinctiveness of species. In contrast, phylogenetic diversity integrates both the number of species and their history of evolutionary diversification. Here we compared species diversity and phylogenetic diversity in a large number of flowering plant (angiosperm) floras distributed across EAS and ENA, two regions with similar contemporary environments and broadly shared floristic history. After accounting for climate and sample area, we found both species diversity and phylogenetic diversity to be significantly higher in EAS than in ENA. When we controlled the number of species statistically, we found that phylogenetic diversity remained substantially higher in EAS than in ENA, although it tended to converge at high latitude. This pattern held independently for herbs, shrubs, and trees. The anomaly in species and phylogenetic diversity likely resulted from differences in regional processes, related in part to high climatic and topographic heterogeneity, and a strong monsoon climate, in EAS. The broad connection between tropical and temperate floras in southern Asia also might have played a role in creating the phylogenetic diversity anomaly.

  5. Urbanisation and the loss of phylogenetic diversity in birds.

    PubMed

    Sol, Daniel; Bartomeus, Ignasi; González-Lagos, César; Pavoine, Sandrine

    2017-06-01

    Despite the recognised conservation value of phylogenetic diversity, little is known about how it is affected by the urbanisation process. Combining a complete avian phylogeny with surveys along urbanisation gradients from five continents, we show that highly urbanised environments supported on average 450 million fewer years of evolutionary history than the surrounding natural environments. This loss was primarily caused by species loss and could have been higher had not been partially compensated by the addition of urban exploiters and some exotic species. Highly urbanised environments also supported fewer evolutionary distinctive species, implying a disproportionate loss of evolutionary history. Compared with highly urbanised environments, changes in phylogenetic richness and evolutionary distinctiveness were less substantial in moderately urbanised environments. Protecting pristine environments is therefore essential for maintaining phylogenetic diversity, but moderate levels of urbanisation still preserve much of the original diversity. © 2017 John Wiley & Sons Ltd/CNRS.

  6. An efficient and extensible approach for compressing phylogenetic trees.

    PubMed

    Matthews, Suzanne J; Williams, Tiffani L

    2011-10-18

    Biologists require new algorithms to efficiently compress and store their large collections of phylogenetic trees. Our previous work showed that TreeZip is a promising approach for compressing phylogenetic trees. In this paper, we extend our TreeZip algorithm by handling trees with weighted branches. Furthermore, by using the compressed TreeZip file as input, we have designed an extensible decompressor that can extract subcollections of trees, compute majority and strict consensus trees, and merge tree collections using set operations such as union, intersection, and set difference. On unweighted phylogenetic trees, TreeZip is able to compress Newick files in excess of 98%. On weighted phylogenetic trees, TreeZip is able to compress a Newick file by at least 73%. TreeZip can be combined with 7zip with little overhead, allowing space savings in excess of 99% (unweighted) and 92%(weighted). Unlike TreeZip, 7zip is not immune to branch rotations, and performs worse as the level of variability in the Newick string representation increases. Finally, since the TreeZip compressed text (TRZ) file contains all the semantic information in a collection of trees, we can easily filter and decompress a subset of trees of interest (such as the set of unique trees), or build the resulting consensus tree in a matter of seconds. We also show the ease of which set operations can be performed on TRZ files, at speeds quicker than those performed on Newick or 7zip compressed Newick files, and without loss of space savings. TreeZip is an efficient approach for compressing large collections of phylogenetic trees. The semantic and compact nature of the TRZ file allow it to be operated upon directly and quickly, without a need to decompress the original Newick file. We believe that TreeZip will be vital for compressing and archiving trees in the biological community.

  7. A Phylogenetic Perspective on Biogeographical Divergence of the Flora in Yunnan, Southwestern China.

    PubMed

    Liu, Shuiyin; Zhu, Hua; Yang, Jie

    2017-02-21

    In recent years, an increasing number of studies incorporated biogeography with phylogenetic analyses to reveal the origin and evolutionary history of specific floras. In this study, we constructed the mega-phylogeny of the floras of three representative regions across Yunnan, southwestern China. We analyzed the phylogenetic structure and beta diversity based on the presence/absence of species (genus or family) data to investigate the phylogenetic patterns of regional floras. We found conspicuous divergence at the genus and species level in the pattern of phylogenetic structures, which most likely related to historical biogeography. The flora of southern Yunnan was shaped by the strike-slip extrusion of Indochina and the regional climatic stability, while the flora of northwestern Yunnan was shaped by the uplift of the Himalaya-Tibetan Plateau and the oscillations of the glacial-interglacial periods. The flora of central Yunnan had nearly equal proportions of the northern and southern floras that may be derived from a common Tertiary tropical or subtropical flora. Geological events fit well with the floristic and phylogenetic patterns across Yunnan. This study highlighted the importance of linking phylogenetic analyses to biogeographic interpretations to improve our understanding of the origin, evolution and divergence of regional floras.

  8. BigFoot: Bayesian alignment and phylogenetic footprinting with MCMC

    PubMed Central

    Satija, Rahul; Novák, Ádám; Miklós, István; Lyngsø, Rune; Hein, Jotun

    2009-01-01

    Background We have previously combined statistical alignment and phylogenetic footprinting to detect conserved functional elements without assuming a fixed alignment. Considering a probability-weighted distribution of alignments removes sensitivity to alignment errors, properly accommodates regions of alignment uncertainty, and increases the accuracy of functional element prediction. Our method utilized standard dynamic programming hidden markov model algorithms to analyze up to four sequences. Results We present a novel approach, implemented in the software package BigFoot, for performing phylogenetic footprinting on greater numbers of sequences. We have developed a Markov chain Monte Carlo (MCMC) approach which samples both sequence alignments and locations of slowly evolving regions. We implement our method as an extension of the existing StatAlign software package and test it on well-annotated regions controlling the expression of the even-skipped gene in Drosophila and the α-globin gene in vertebrates. The results exhibit how adding additional sequences to the analysis has the potential to improve the accuracy of functional predictions, and demonstrate how BigFoot outperforms existing alignment-based phylogenetic footprinting techniques. Conclusion BigFoot extends a combined alignment and phylogenetic footprinting approach to analyze larger amounts of sequence data using MCMC. Our approach is robust to alignment error and uncertainty and can be applied to a variety of biological datasets. The source code and documentation are publicly available for download from PMID:19715598

  9. BigFoot: Bayesian alignment and phylogenetic footprinting with MCMC.

    PubMed

    Satija, Rahul; Novák, Adám; Miklós, István; Lyngsø, Rune; Hein, Jotun

    2009-08-28

    We have previously combined statistical alignment and phylogenetic footprinting to detect conserved functional elements without assuming a fixed alignment. Considering a probability-weighted distribution of alignments removes sensitivity to alignment errors, properly accommodates regions of alignment uncertainty, and increases the accuracy of functional element prediction. Our method utilized standard dynamic programming hidden markov model algorithms to analyze up to four sequences. We present a novel approach, implemented in the software package BigFoot, for performing phylogenetic footprinting on greater numbers of sequences. We have developed a Markov chain Monte Carlo (MCMC) approach which samples both sequence alignments and locations of slowly evolving regions. We implement our method as an extension of the existing StatAlign software package and test it on well-annotated regions controlling the expression of the even-skipped gene in Drosophila and the alpha-globin gene in vertebrates. The results exhibit how adding additional sequences to the analysis has the potential to improve the accuracy of functional predictions, and demonstrate how BigFoot outperforms existing alignment-based phylogenetic footprinting techniques. BigFoot extends a combined alignment and phylogenetic footprinting approach to analyze larger amounts of sequence data using MCMC. Our approach is robust to alignment error and uncertainty and can be applied to a variety of biological datasets. The source code and documentation are publicly available for download from http://www.stats.ox.ac.uk/~satija/BigFoot/

  10. Phylogenetic mixtures and linear invariants for equal input models.

    PubMed

    Casanellas, Marta; Steel, Mike

    2017-04-01

    The reconstruction of phylogenetic trees from molecular sequence data relies on modelling site substitutions by a Markov process, or a mixture of such processes. In general, allowing mixed processes can result in different tree topologies becoming indistinguishable from the data, even for infinitely long sequences. However, when the underlying Markov process supports linear phylogenetic invariants, then provided these are sufficiently informative, the identifiability of the tree topology can be restored. In this paper, we investigate a class of processes that support linear invariants once the stationary distribution is fixed, the 'equal input model'. This model generalizes the 'Felsenstein 1981' model (and thereby the Jukes-Cantor model) from four states to an arbitrary number of states (finite or infinite), and it can also be described by a 'random cluster' process. We describe the structure and dimension of the vector spaces of phylogenetic mixtures and of linear invariants for any fixed phylogenetic tree (and for all trees-the so called 'model invariants'), on any number n of leaves. We also provide a precise description of the space of mixtures and linear invariants for the special case of [Formula: see text] leaves. By combining techniques from discrete random processes and (multi-) linear algebra, our results build on a classic result that was first established by James Lake (Mol Biol Evol 4:167-191, 1987).

  11. Inferring 'weak spots' in phylogenetic trees: application to mosasauroid nomenclature.

    PubMed

    Madzia, Daniel; Cau, Andrea

    2017-01-01

    Mosasauroid squamates represented the apex predators within the Late Cretaceous marine and occasionally also freshwater ecosystems. Proper understanding of the origin of their ecological adaptations or paleobiogeographic dispersals requires adequate knowledge of their phylogeny. The studies assessing the position of mosasauroids on the squamate evolutionary tree and their origins have long given conflicting results. The phylogenetic relationships within Mosasauroidea, however, have experienced only little changes throughout the last decades. Considering the substantial improvements in the development of phylogenetic methodology that have undergone in recent years, resulting, among others, in numerous alterations in the phylogenetic hypotheses of other fossil amniotes, we test the robustness in our understanding of mosasauroid beginnings and their evolutionary history. We re-examined a data set that results from modifications assembled in the course of the last 20 years and performed multiple parsimony analyses and Bayesian tip-dating analysis. Following the inferred topologies and the 'weak spots' in the phylogeny of mosasauroids, we revise the nomenclature of the 'traditionally' recognized mosasauroid clades, to acknowledge the overall weakness among branches and the alternative topologies suggested previously, and discuss several factors that might have an impact on the differing phylogenetic hypotheses and their statistical support.

  12. Molecular phylogenetics of mastodon and Tyrannosaurus rex.

    PubMed

    Organ, Chris L; Schweitzer, Mary H; Zheng, Wenxia; Freimark, Lisa M; Cantley, Lewis C; Asara, John M

    2008-04-25

    We report a molecular phylogeny for a nonavian dinosaur, extending our knowledge of trait evolution within nonavian dinosaurs into the macromolecular level of biological organization. Fragments of collagen alpha1(I) and alpha2(I) proteins extracted from fossil bones of Tyrannosaurus rex and Mammut americanum (mastodon) were analyzed with a variety of phylogenetic methods. Despite missing sequence data, the mastodon groups with elephant and the T. rex groups with birds, consistent with predictions based on genetic and morphological data for mastodon and on morphological data for T. rex. Our findings suggest that molecular data from long-extinct organisms may have the potential for resolving relationships at critical areas in the vertebrate evolutionary tree that have, so far, been phylogenetically intractable.

  13. A guide to phylogenetic metrics for conservation, community ecology and macroecology

    PubMed Central

    Cadotte, Marc W.; Carvalho, Silvia B.; Davies, T. Jonathan; Ferrier, Simon; Fritz, Susanne A.; Grenyer, Rich; Helmus, Matthew R.; Jin, Lanna S.; Mooers, Arne O.; Pavoine, Sandrine; Purschke, Oliver; Redding, David W.; Rosauer, Dan F.; Winter, Marten; Mazel, Florent

    2016-01-01

    ABSTRACT The use of phylogenies in ecology is increasingly common and has broadened our understanding of biological diversity. Ecological sub‐disciplines, particularly conservation, community ecology and macroecology, all recognize the value of evolutionary relationships but the resulting development of phylogenetic approaches has led to a proliferation of phylogenetic diversity metrics. The use of many metrics across the sub‐disciplines hampers potential meta‐analyses, syntheses, and generalizations of existing results. Further, there is no guide for selecting the appropriate metric for a given question, and different metrics are frequently used to address similar questions. To improve the choice, application, and interpretation of phylo‐diversity metrics, we organize existing metrics by expanding on a unifying framework for phylogenetic information. Generally, questions about phylogenetic relationships within or between assemblages tend to ask three types of question: how much; how different; or how regular? We show that these questions reflect three dimensions of a phylogenetic tree: richness, divergence, and regularity. We classify 70 existing phylo‐diversity metrics based on their mathematical form within these three dimensions and identify ‘anchor’ representatives: for α‐diversity metrics these are PD (Faith's phylogenetic diversity), MPD (mean pairwise distance), and VPD (variation of pairwise distances). By analysing mathematical formulae and using simulations, we use this framework to identify metrics that mix dimensions, and we provide a guide to choosing and using the most appropriate metrics. We show that metric choice requires connecting the research question with the correct dimension of the framework and that there are logical approaches to selecting and interpreting metrics. The guide outlined herein will help researchers navigate the current jungle of indices. PMID:26785932

  14. Monte Carlo algorithms for Brownian phylogenetic models.

    PubMed

    Horvilleur, Benjamin; Lartillot, Nicolas

    2014-11-01

    Brownian models have been introduced in phylogenetics for describing variation in substitution rates through time, with applications to molecular dating or to the comparative analysis of variation in substitution patterns among lineages. Thus far, however, the Monte Carlo implementations of these models have relied on crude approximations, in which the Brownian process is sampled only at the internal nodes of the phylogeny or at the midpoints along each branch, and the unknown trajectory between these sampled points is summarized by simple branchwise average substitution rates. A more accurate Monte Carlo approach is introduced, explicitly sampling a fine-grained discretization of the trajectory of the (potentially multivariate) Brownian process along the phylogeny. Generic Monte Carlo resampling algorithms are proposed for updating the Brownian paths along and across branches. Specific computational strategies are developed for efficient integration of the finite-time substitution probabilities across branches induced by the Brownian trajectory. The mixing properties and the computational complexity of the resulting Markov chain Monte Carlo sampler scale reasonably with the discretization level, allowing practical applications with up to a few hundred discretization points along the entire depth of the tree. The method can be generalized to other Markovian stochastic processes, making it possible to implement a wide range of time-dependent substitution models with well-controlled computational precision. The program is freely available at www.phylobayes.org. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  15. Phylogenetics and Taxonomy of the Fungal Vascular Wilt Pathogen Verticillium, with the Descriptions of Five New Species

    PubMed Central

    Inderbitzin, Patrik; Bostock, Richard M.; Davis, R. Michael; Usami, Toshiyuki; Platt, Harold W.; Subbarao, Krishna V.

    2011-01-01

    Knowledge of pathogen biology and genetic diversity is a cornerstone of effective disease management, and accurate identification of the pathogen is a foundation of pathogen biology. Species names provide an ideal framework for storage and retrieval of relevant information, a system that is contingent on a clear understanding of species boundaries and consistent species identification. Verticillium, a genus of ascomycete fungi, contains important plant pathogens whose species boundaries have been ill defined. Using phylogenetic analyses, morphological investigations and comparisons to herbarium material and the literature, we established a taxonomic framework for Verticillium comprising ten species, five of which are new to science. We used a collection of 74 isolates representing much of the diversity of Verticillium, and phylogenetic analyses based on the ribosomal internal transcribed spacer region (ITS), partial sequences of the protein coding genes actin (ACT), elongation factor 1-alpha (EF), glyceraldehyde-3-phosphate dehydrogenase (GPD) and tryptophan synthase (TS). Combined analyses of the ACT, EF, GPD and TS datasets recognized two major groups within Verticillium, Clade Flavexudans and Clade Flavnonexudans, reflecting the respective production and absence of yellow hyphal pigments. Clade Flavexudans comprised V. albo-atrum and V. tricorpus as well as the new species V. zaregamsianum, V. isaacii and V. klebahnii, of which the latter two were morphologically indistinguishable from V. tricorpus but may differ in pathogenicity. Clade Flavnonexudans comprised V. nubilum, V. dahliae and V. longisporum, as well as the two new species V. alfalfae and V. nonalfalfae, which resembled the distantly related V. albo-atrum in morphology. Apart from the diploid hybrid V. longisporum, each of the ten species corresponded to a single clade in the phylogenetic tree comprising just one ex-type strain, thereby establishing a direct link to a name tied to a herbarium specimen

  16. Disentangling environmental and spatial effects on phylogenetic structure of angiosperm tree communities in China.

    PubMed

    Qian, Hong; Chen, Shengbin; Zhang, Jin-Long

    2017-07-17

    Niche-based and neutrality-based theories are two major classes of theories explaining the assembly mechanisms of local communities. Both theories have been frequently used to explain species diversity and composition in local communities but their relative importance remains unclear. Here, we analyzed 57 assemblages of angiosperm trees in 0.1-ha forest plots across China to examine the effects of environmental heterogeneity (relevant to niche-based processes) and spatial contingency (relevant to neutrality-based processes) on phylogenetic structure of angiosperm tree assemblages distributed across a wide range of environment and space. Phylogenetic structure was quantified with six phylogenetic metrics (i.e., phylogenetic diversity, mean pairwise distance, mean nearest taxon distance, and the standardized effect sizes of these three metrics), which emphasize on different depths of evolutionary histories and account for different degrees of species richness effects. Our results showed that the variation in phylogenetic metrics explained independently by environmental variables was on average much greater than that explained independently by spatial structure, and the vast majority of the variation in phylogenetic metrics was explained by spatially structured environmental variables. We conclude that niche-based processes have played a more important role than neutrality-based processes in driving phylogenetic structure of angiosperm tree species in forest communities in China.

  17. The phylogenetic relationships of known mosquito (Diptera: Culicidae) mitogenomes.

    PubMed

    Chu, Hongliang; Li, Chunxiao; Guo, Xiaoxia; Zhang, Hengduan; Luo, Peng; Wu, Zhonghua; Wang, Gang; Zhao, Tongyan

    2018-01-01

    The known mosquito mitogenomes, containing a total of 34 species, which belong to five genera, were collected from GenBank, and the practicality and effectiveness of the variation in the complete mitochondrial DNA genome and portions of mitochondrial COI gene were assessed to reconstruct the phylogeny of mosquitoes. Phylogenetic trees were reconstructed on the basis of parsimony, maximum likelihood, and Bayesian (BI) methods. It is concluded that: (1) Both mitogenomes and COI gene support the monophly of following taxa: Subgenus Nyssorhynchus, Subgenus Cellia, Anopheles albitarsis complex, Anopheles gambiae complex, and Anopheles punctulatus group; (2) Genus Aedes is not monophyletic relative to Ochlerotatus vigilax; (3) The mitogenome results indicate a close relationship between Anopheles epiroticus and Anopheles gambiae complex, Anopheles dirus complex and Anopheles punctulatus group, respectively; (4) The Bayesian posterior probability (BPP) within phylogenetic tree reconstructed by mitogenomes is higher than COI tree. The results show that phylogenetic relationships reconstructed using the mitogenomes were more similar to those based on morphological data.

  18. SIFTER search: a web server for accurate phylogeny-based protein function prediction

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sahraeian, Sayed M.; Luo, Kevin R.; Brenner, Steven E.

    We are awash in proteins discovered through high-throughput sequencing projects. As only a minuscule fraction of these have been experimentally characterized, computational methods are widely used for automated annotation. Here, we introduce a user-friendly web interface for accurate protein function prediction using the SIFTER algorithm. SIFTER is a state-of-the-art sequence-based gene molecular function prediction algorithm that uses a statistical model of function evolution to incorporate annotations throughout the phylogenetic tree. Due to the resources needed by the SIFTER algorithm, running SIFTER locally is not trivial for most users, especially for large-scale problems. The SIFTER web server thus provides access tomore » precomputed predictions on 16 863 537 proteins from 232 403 species. Users can explore SIFTER predictions with queries for proteins, species, functions, and homologs of sequences not in the precomputed prediction set. Lastly, the SIFTER web server is accessible at http://sifter.berkeley.edu/ and the source code can be downloaded.« less

  19. SIFTER search: a web server for accurate phylogeny-based protein function prediction

    DOE PAGES

    Sahraeian, Sayed M.; Luo, Kevin R.; Brenner, Steven E.

    2015-05-15

    We are awash in proteins discovered through high-throughput sequencing projects. As only a minuscule fraction of these have been experimentally characterized, computational methods are widely used for automated annotation. Here, we introduce a user-friendly web interface for accurate protein function prediction using the SIFTER algorithm. SIFTER is a state-of-the-art sequence-based gene molecular function prediction algorithm that uses a statistical model of function evolution to incorporate annotations throughout the phylogenetic tree. Due to the resources needed by the SIFTER algorithm, running SIFTER locally is not trivial for most users, especially for large-scale problems. The SIFTER web server thus provides access tomore » precomputed predictions on 16 863 537 proteins from 232 403 species. Users can explore SIFTER predictions with queries for proteins, species, functions, and homologs of sequences not in the precomputed prediction set. Lastly, the SIFTER web server is accessible at http://sifter.berkeley.edu/ and the source code can be downloaded.« less

  20. Plunging hands into the mushroom jar: a phylogenetic framework for Lyophyllaceae (Agaricales, Basidiomycota).

    PubMed

    Bellanger, J-M; Moreau, P-A; Corriol, G; Bidaud, A; Chalange, R; Dudova, Z; Richard, F

    2015-04-01

    During the last two decades, the unprecedented development of molecular phylogenetic tools has propelled an opportunity to revisit the fungal kingdom under an evolutionary perspective. Mycology has been profoundly changed but a sustained effort to elucidate large sections of the astonishing fungal diversity is still needed. Here we fill this gap in the case of Lyophyllaceae, a species-rich and ecologically diversified family of mushrooms. Assembly and genealogical concordance multigene phylogenetic analysis of a large dataset that includes original, vouchered material from expert field mycologists reveal the phylogenetic topology of the family, from higher (generic) to lower (species) levels. A comparative analysis of the most widely used phylogenetic markers in Fungi indicates that the nuc rDNA region encompassing the internal transcribed spacers 1 and 2, along with the 5.8S rDNA (ITS) and portions of the genes for RNA polymerase II second largest subunit (RPB2) is the most performing combination to resolve the broadest range of taxa within Lyophyllaceae. Eleven distinct evolutionary lineages are identified, that display partial overlap with traditional genera as well as with the phylogenetic framework previously proposed for the family. Eighty phylogenetic species are delineated, which shed light on a large number of morphological concepts, including rare and poorly documented ones. Probing these novel phylogenetic species to the barcoding method of species limit delineation, indicates that the latter method fully resolves Lyophyllaceae species, except in one clade. This case study provides the first comprehensive phylogenetic overview of Lyophyllaceae, a necessary step towards a taxonomical, ecological and nomenclatural revision of this family of mushrooms. It also proposes a set of methodological guidelines that may be of relevance for future taxonomic works in other groups of Fungi.

  1. Analyzing Phylogenetic Trees with Timed and Probabilistic Model Checking: The Lactose Persistence Case Study.

    PubMed

    Requeno, José Ignacio; Colom, José Manuel

    2014-12-01

    Model checking is a generic verification technique that allows the phylogeneticist to focus on models and specifications instead of on implementation issues. Phylogenetic trees are considered as transition systems over which we interrogate phylogenetic questions written as formulas of temporal logic. Nonetheless, standard logics become insufficient for certain practices of phylogenetic analysis since they do not allow the inclusion of explicit time and probabilities. The aim of this paper is to extend the application of model checking techniques beyond qualitative phylogenetic properties and adapt the existing logical extensions and tools to the field of phylogeny. The introduction of time and probabilities in phylogenetic specifications is motivated by the study of a real example: the analysis of the ratio of lactose intolerance in some populations and the date of appearance of this phenotype.

  2. Analyzing phylogenetic trees with timed and probabilistic model checking: the lactose persistence case study.

    PubMed

    Requeno, José Ignacio; Colom, José Manuel

    2014-10-23

    Model checking is a generic verification technique that allows the phylogeneticist to focus on models and specifications instead of on implementation issues. Phylogenetic trees are considered as transition systems over which we interrogate phylogenetic questions written as formulas of temporal logic. Nonetheless, standard logics become insufficient for certain practices of phylogenetic analysis since they do not allow the inclusion of explicit time and probabilities. The aim of this paper is to extend the application of model checking techniques beyond qualitative phylogenetic properties and adapt the existing logical extensions and tools to the field of phylogeny. The introduction of time and probabilities in phylogenetic specifications is motivated by the study of a real example: the analysis of the ratio of lactose intolerance in some populations and the date of appearance of this phenotype.

  3. treeman: an R package for efficient and intuitive manipulation of phylogenetic trees.

    PubMed

    Bennett, Dominic J; Sutton, Mark D; Turvey, Samuel T

    2017-01-07

    Phylogenetic trees are hierarchical structures used for representing the inter-relationships between biological entities. They are the most common tool for representing evolution and are essential to a range of fields across the life sciences. The manipulation of phylogenetic trees-in terms of adding or removing tips-is often performed by researchers not just for reasons of management but also for performing simulations in order to understand the processes of evolution. Despite this, the most common programming language among biologists, R, has few class structures well suited to these tasks. We present an R package that contains a new class, called TreeMan, for representing the phylogenetic tree. This class has a list structure allowing phylogenetic trees to be manipulated more efficiently. Computational running times are reduced because of the ready ability to vectorise and parallelise methods. Development is also improved due to fewer lines of code being required for performing manipulation processes. We present three use cases-pinning missing taxa to a supertree, simulating evolution with a tree-growth model and detecting significant phylogenetic turnover-that demonstrate the new package's speed and simplicity.

  4. Phylogenetic congruence and ecological coherence in terrestrial Thaumarchaeota.

    PubMed

    Oton, Eduard Vico; Quince, Christopher; Nicol, Graeme W; Prosser, James I; Gubry-Rangin, Cécile

    2016-01-01

    Thaumarchaeota form a ubiquitously distributed archaeal phylum, comprising both the ammonia-oxidising archaea (AOA) and other archaeal groups in which ammonia oxidation has not been demonstrated (including Group 1.1c and Group 1.3). The ecology of AOA in terrestrial environments has been extensively studied using either a functional gene, encoding ammonia monooxygenase subunit A (amoA) or 16S ribosomal RNA (rRNA) genes, which show phylogenetic coherence with respect to soil pH. To test phylogenetic congruence between these two markers and to determine ecological coherence in all Thaumarchaeota, we performed high-throughput sequencing of 16S rRNA and amoA genes in 46 UK soils presenting 29 available contextual soil characteristics. Adaptation to pH and organic matter content reflected strong ecological coherence at various levels of taxonomic resolution for Thaumarchaeota (AOA and non-AOA), whereas nitrogen, total mineralisable nitrogen and zinc concentration were also important factors associated with AOA thaumarchaeotal community distribution. Other significant associations with environmental factors were also detected for amoA and 16S rRNA genes, reflecting different diversity characteristics between these two markers. Nonetheless, there was significant statistical congruence between the markers at fine phylogenetic resolution, supporting the hypothesis of low horizontal gene transfer between Thaumarchaeota. Group 1.1c Thaumarchaeota were also widely distributed, with two clusters predominating, particularly in environments with higher moisture content and organic matter, whereas a similar ecological pattern was observed for Group 1.3 Thaumarchaeota. The ecological and phylogenetic congruence identified is fundamental to understand better the life strategies, evolutionary history and ecosystem function of the Thaumarchaeota.

  5. Phylogenetic congruence and ecological coherence in terrestrial Thaumarchaeota

    PubMed Central

    Oton, Eduard Vico; Quince, Christopher; Nicol, Graeme W; Prosser, James I; Gubry-Rangin, Cécile

    2016-01-01

    Thaumarchaeota form a ubiquitously distributed archaeal phylum, comprising both the ammonia-oxidising archaea (AOA) and other archaeal groups in which ammonia oxidation has not been demonstrated (including Group 1.1c and Group 1.3). The ecology of AOA in terrestrial environments has been extensively studied using either a functional gene, encoding ammonia monooxygenase subunit A (amoA) or 16S ribosomal RNA (rRNA) genes, which show phylogenetic coherence with respect to soil pH. To test phylogenetic congruence between these two markers and to determine ecological coherence in all Thaumarchaeota, we performed high-throughput sequencing of 16S rRNA and amoA genes in 46 UK soils presenting 29 available contextual soil characteristics. Adaptation to pH and organic matter content reflected strong ecological coherence at various levels of taxonomic resolution for Thaumarchaeota (AOA and non-AOA), whereas nitrogen, total mineralisable nitrogen and zinc concentration were also important factors associated with AOA thaumarchaeotal community distribution. Other significant associations with environmental factors were also detected for amoA and 16S rRNA genes, reflecting different diversity characteristics between these two markers. Nonetheless, there was significant statistical congruence between the markers at fine phylogenetic resolution, supporting the hypothesis of low horizontal gene transfer between Thaumarchaeota. Group 1.1c Thaumarchaeota were also widely distributed, with two clusters predominating, particularly in environments with higher moisture content and organic matter, whereas a similar ecological pattern was observed for Group 1.3 Thaumarchaeota. The ecological and phylogenetic congruence identified is fundamental to understand better the life strategies, evolutionary history and ecosystem function of the Thaumarchaeota. PMID:26140533

  6. Simultaneously estimating evolutionary history and repeated traits phylogenetic signal: applications to viral and host phenotypic evolution

    PubMed Central

    Vrancken, Bram; Lemey, Philippe; Rambaut, Andrew; Bedford, Trevor; Longdon, Ben; Günthard, Huldrych F.; Suchard, Marc A.

    2014-01-01

    Phylogenetic signal quantifies the degree to which resemblance in continuously-valued traits reflects phylogenetic relatedness. Measures of phylogenetic signal are widely used in ecological and evolutionary research, and are recently gaining traction in viral evolutionary studies. Standard estimators of phylogenetic signal frequently condition on data summary statistics of the repeated trait observations and fixed phylogenetics trees, resulting in information loss and potential bias. To incorporate the observation process and phylogenetic uncertainty in a model-based approach, we develop a novel Bayesian inference method to simultaneously estimate the evolutionary history and phylogenetic signal from molecular sequence data and repeated multivariate traits. Our approach builds upon a phylogenetic diffusion framework that model continuous trait evolution as a Brownian motion process and incorporates Pagel’s λ transformation parameter to estimate dependence among traits. We provide a computationally efficient inference implementation in the BEAST software package. We evaluate the synthetic performance of the Bayesian estimator of phylogenetic signal against standard estimators, and demonstrate the use of our coherent framework to address several virus-host evolutionary questions, including virulence heritability for HIV, antigenic evolution in influenza and HIV, and Drosophila sensitivity to sigma virus infection. Finally, we discuss model extensions that will make useful contributions to our flexible framework for simultaneously studying sequence and trait evolution. PMID:25780554

  7. Phylogenetically-informed priorities for amphibian conservation.

    PubMed

    Isaac, Nick J B; Redding, David W; Meredith, Helen M; Safi, Kamran

    2012-01-01

    The amphibian decline and extinction crisis demands urgent action to prevent further large numbers of species extinctions. Lists of priority species for conservation, based on a combination of species' threat status and unique contribution to phylogenetic diversity, are one tool for the direction and catalyzation of conservation action. We describe the construction of a near-complete species-level phylogeny of 5713 amphibian species, which we use to create a list of evolutionarily distinct and globally endangered species (EDGE list) for the entire class Amphibia. We present sensitivity analyses to test the robustness of our priority list to uncertainty in species' phylogenetic position and threat status. We find that both sources of uncertainty have only minor impacts on our 'top 100' list of priority species, indicating the robustness of the approach. By contrast, our analyses suggest that a large number of Data Deficient species are likely to be high priorities for conservation action from the perspective of their contribution to the evolutionary history.

  8. Descriptive Statistics of the Genome: Phylogenetic Classification of Viruses.

    PubMed

    Hernandez, Troy; Yang, Jie

    2016-10-01

    The typical process for classifying and submitting a newly sequenced virus to the NCBI database involves two steps. First, a BLAST search is performed to determine likely family candidates. That is followed by checking the candidate families with the pairwise sequence alignment tool for similar species. The submitter's judgment is then used to determine the most likely species classification. The aim of this article is to show that this process can be automated into a fast, accurate, one-step process using the proposed alignment-free method and properly implemented machine learning techniques. We present a new family of alignment-free vectorizations of the genome, the generalized vector, that maintains the speed of existing alignment-free methods while outperforming all available methods. This new alignment-free vectorization uses the frequency of genomic words (k-mers), as is done in the composition vector, and incorporates descriptive statistics of those k-mers' positional information, as inspired by the natural vector. We analyze five different characterizations of genome similarity using k-nearest neighbor classification and evaluate these on two collections of viruses totaling over 10,000 viruses. We show that our proposed method performs better than, or as well as, other methods at every level of the phylogenetic hierarchy. The data and R code is available upon request.

  9. Estimating Bayesian Phylogenetic Information Content

    PubMed Central

    Lewis, Paul O.; Chen, Ming-Hui; Kuo, Lynn; Lewis, Louise A.; Fučíková, Karolina; Neupane, Suman; Wang, Yu-Bo; Shi, Daoyuan

    2016-01-01

    Measuring the phylogenetic information content of data has a long history in systematics. Here we explore a Bayesian approach to information content estimation. The entropy of the posterior distribution compared with the entropy of the prior distribution provides a natural way to measure information content. If the data have no information relevant to ranking tree topologies beyond the information supplied by the prior, the posterior and prior will be identical. Information in data discourages consideration of some hypotheses allowed by the prior, resulting in a posterior distribution that is more concentrated (has lower entropy) than the prior. We focus on measuring information about tree topology using marginal posterior distributions of tree topologies. We show that both the accuracy and the computational efficiency of topological information content estimation improve with use of the conditional clade distribution, which also allows topological information content to be partitioned by clade. We explore two important applications of our method: providing a compelling definition of saturation and detecting conflict among data partitions that can negatively affect analyses of concatenated data. [Bayesian; concatenation; conditional clade distribution; entropy; information; phylogenetics; saturation.] PMID:27155008

  10. A guide to phylogenetic metrics for conservation, community ecology and macroecology.

    PubMed

    Tucker, Caroline M; Cadotte, Marc W; Carvalho, Silvia B; Davies, T Jonathan; Ferrier, Simon; Fritz, Susanne A; Grenyer, Rich; Helmus, Matthew R; Jin, Lanna S; Mooers, Arne O; Pavoine, Sandrine; Purschke, Oliver; Redding, David W; Rosauer, Dan F; Winter, Marten; Mazel, Florent

    2017-05-01

    The use of phylogenies in ecology is increasingly common and has broadened our understanding of biological diversity. Ecological sub-disciplines, particularly conservation, community ecology and macroecology, all recognize the value of evolutionary relationships but the resulting development of phylogenetic approaches has led to a proliferation of phylogenetic diversity metrics. The use of many metrics across the sub-disciplines hampers potential meta-analyses, syntheses, and generalizations of existing results. Further, there is no guide for selecting the appropriate metric for a given question, and different metrics are frequently used to address similar questions. To improve the choice, application, and interpretation of phylo-diversity metrics, we organize existing metrics by expanding on a unifying framework for phylogenetic information. Generally, questions about phylogenetic relationships within or between assemblages tend to ask three types of question: how much; how different; or how regular? We show that these questions reflect three dimensions of a phylogenetic tree: richness, divergence, and regularity. We classify 70 existing phylo-diversity metrics based on their mathematical form within these three dimensions and identify 'anchor' representatives: for α-diversity metrics these are PD (Faith's phylogenetic diversity), MPD (mean pairwise distance), and VPD (variation of pairwise distances). By analysing mathematical formulae and using simulations, we use this framework to identify metrics that mix dimensions, and we provide a guide to choosing and using the most appropriate metrics. We show that metric choice requires connecting the research question with the correct dimension of the framework and that there are logical approaches to selecting and interpreting metrics. The guide outlined herein will help researchers navigate the current jungle of indices. © 2016 The Authors. Biological Reviews published by John Wiley © Sons Ltd on behalf of

  11. Seasonal cycles, phylogenetic assembly, and functional diversity of orchid bee communities.

    PubMed

    Ramírez, Santiago R; Hernández, Carlos; Link, Andres; López-Uribe, Margarita M

    2015-05-01

    Neotropical rainforests sustain some of the most diverse terrestrial communities on Earth. Euglossine (or orchid) bees are a diverse lineage of insect pollinators distributed throughout the American tropics, where they provide pollination services to a staggering diversity of flowering plant taxa. Elucidating the seasonal patterns of phylogenetic assembly and functional trait diversity of bee communities can shed new light into the mechanisms that govern the assembly of bee pollinator communities and the potential effects of declining bee populations. Male euglossine bees collect, store, and accumulate odoriferous compounds (perfumes) to subsequently use during courtship display. Thus, synthetic chemical baits can be used to attract and monitor euglossine bee populations. We conducted monthly censuses of orchid bees in three sites in the Magdalena valley of Colombia - a region where Central and South American biotas converge - to investigate the structure, diversity, and assembly of euglossine bee communities through time in relation to seasonal climatic cycles. In particular, we tested the hypothesis that phylogenetic community structure and functional trait diversity changed in response to seasonal rainfall fluctuations. All communities exhibited strong to moderate phylogenetic clustering throughout the year, with few pronounced bursts of phylogenetic overdispersion that coincided with the transition from wet-to-dry seasons. Despite the heterogeneous distribution of functional traits (e.g., body size, body mass, and proboscis length) and the observed seasonal fluctuations in phylogenetic diversity, we found that functional trait diversity, evenness, and divergence remained constant during all seasons in all communities. However, similar to the pattern observed with phylogenetic diversity, functional trait richness fluctuated markedly with rainfall in all sites. These results emphasize the importance of considering seasonal fluctuations in community assembly and

  12. Proximity Within Interphase Chromosome Contributes to the Breakpoint Distribution in Radiation-Induced Intrachromosomal Exchanges

    NASA Technical Reports Server (NTRS)

    Zhang, Ye; Uhlemeyer, Jimmy; Hada, Megumi; Asaithamby, A.; Chen, David J.; Wu, Honglu

    2015-01-01

    Previously, we reported that breaks involved in chromosome aberrations were clustered in several regions of chromosome3 in human mammary epithelial cells after exposures to either low-or high-LET radiation. In particular, breaks in certain regions of the chromosome tended to rejoin with each other to form an intrachromosome exchange event. This study tests the hypothesis that proximity within a single chromosome in interphase cell nuclei contributes to the distribution of radiation-induced chromosome breaks. Chromosome 3 in G1 human mammary epithelial cells was hybridized with the multicolor banding in situ hybridization (mBAND) probes that distinguish the chromosome in six differently colored regions, and the location of these regions was measured with a laser confocal microscope. Results of the study indicated that, on a multi-mega base pair scale of the DNA, the arrangement of chromatin was non-random. Both telomere regions tended to be located towards the exterior of the chromosome domain, whereas the centromere region towards the interior. In addition, the interior of the chromosome domain was preferentially occupied by the p-arm of the chromatin, which is consistent with our previous finding of intrachromosome exchanges involving breaks on the p-arm and in the centromere region of chromosome3. Other factors, such as the fragile sites in the 3p21 band and gene regulation, may also contribute to the breakpoint distribution in radiation-induced chromosome aberrations. Further investigations suggest that the 3D chromosome folding is cell type and culture condition dependent.

  13. Choosing and Using Introns in Molecular Phylogenetics

    PubMed Central

    Creer, Simon

    2007-01-01

    Introns are now commonly used in molecular phylogenetics in an attempt to recover gene trees that are concordant with species trees, but there are a range of genomic, logistical and analytical considerations that are infrequently discussed in empirical studies that utilize intron data. This review outlines expedient approaches for locus selection, overcoming paralogy problems, recombination detection methods and the identification and incorporation of LVHs in molecular systematics. A range of parsimony and Bayesian analytical approaches are also described in order to highlight the methods that can currently be employed to align sequences and treat indels in subsequent analyses. By covering the main points associated with the generation and analysis of intron data, this review aims to provide a comprehensive introduction to using introns (or any non-coding nuclear data partition) in contemporary phylogenetics. PMID:19461984

  14. IcyTree: rapid browser-based visualization for phylogenetic trees and networks

    PubMed Central

    2017-01-01

    Abstract Summary: IcyTree is an easy-to-use application which can be used to visualize a wide variety of phylogenetic trees and networks. While numerous phylogenetic tree viewers exist already, IcyTree distinguishes itself by being a purely online tool, having a responsive user interface, supporting phylogenetic networks (ancestral recombination graphs in particular), and efficiently drawing trees that include information such as ancestral locations or trait values. IcyTree also provides intuitive panning and zooming utilities that make exploring large phylogenetic trees of many thousands of taxa feasible. Availability and Implementation: IcyTree is a web application and can be accessed directly at http://tgvaughan.github.com/icytree. Currently supported web browsers include Mozilla Firefox and Google Chrome. IcyTree is written entirely in client-side JavaScript (no plugin required) and, once loaded, does not require network access to run. IcyTree is free software, and the source code is made available at http://github.com/tgvaughan/icytree under version 3 of the GNU General Public License. Contact: tgvaughan@gmail.com PMID:28407035

  15. Applying a multiobjective metaheuristic inspired by honey bees to phylogenetic inference.

    PubMed

    Santander-Jiménez, Sergio; Vega-Rodríguez, Miguel A

    2013-10-01

    The development of increasingly popular multiobjective metaheuristics has allowed bioinformaticians to deal with optimization problems in computational biology where multiple objective functions must be taken into account. One of the most relevant research topics that can benefit from these techniques is phylogenetic inference. Throughout the years, different researchers have proposed their own view about the reconstruction of ancestral evolutionary relationships among species. As a result, biologists often report different phylogenetic trees from a same dataset when considering distinct optimality principles. In this work, we detail a multiobjective swarm intelligence approach based on the novel Artificial Bee Colony algorithm for inferring phylogenies. The aim of this paper is to propose a complementary view of phylogenetics according to the maximum parsimony and maximum likelihood criteria, in order to generate a set of phylogenetic trees that represent a compromise between these principles. Experimental results on a variety of nucleotide data sets and statistical studies highlight the relevance of the proposal with regard to other multiobjective algorithms and state-of-the-art biological methods. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  16. IcyTree: rapid browser-based visualization for phylogenetic trees and networks.

    PubMed

    Vaughan, Timothy G

    2017-08-01

    IcyTree is an easy-to-use application which can be used to visualize a wide variety of phylogenetic trees and networks. While numerous phylogenetic tree viewers exist already, IcyTree distinguishes itself by being a purely online tool, having a responsive user interface, supporting phylogenetic networks (ancestral recombination graphs in particular), and efficiently drawing trees that include information such as ancestral locations or trait values. IcyTree also provides intuitive panning and zooming utilities that make exploring large phylogenetic trees of many thousands of taxa feasible. IcyTree is a web application and can be accessed directly at http://tgvaughan.github.com/icytree . Currently supported web browsers include Mozilla Firefox and Google Chrome. IcyTree is written entirely in client-side JavaScript (no plugin required) and, once loaded, does not require network access to run. IcyTree is free software, and the source code is made available at http://github.com/tgvaughan/icytree under version 3 of the GNU General Public License. tgvaughan@gmail.com. © The Author(s) 2017. Published by Oxford University Press.

  17. Autumn Algorithm-Computation of Hybridization Networks for Realistic Phylogenetic Trees.

    PubMed

    Huson, Daniel H; Linz, Simone

    2018-01-01

    A minimum hybridization network is a rooted phylogenetic network that displays two given rooted phylogenetic trees using a minimum number of reticulations. Previous mathematical work on their calculation has usually assumed the input trees to be bifurcating, correctly rooted, or that they both contain the same taxa. These assumptions do not hold in biological studies and "realistic" trees have multifurcations, are difficult to root, and rarely contain the same taxa. We present a new algorithm for computing minimum hybridization networks for a given pair of "realistic" rooted phylogenetic trees. We also describe how the algorithm might be used to improve the rooting of the input trees. We introduce the concept of "autumn trees", a nice framework for the formulation of algorithms based on the mathematics of "maximum acyclic agreement forests". While the main computational problem is hard, the run-time depends mainly on how different the given input trees are. In biological studies, where the trees are reasonably similar, our parallel implementation performs well in practice. The algorithm is available in our open source program Dendroscope 3, providing a platform for biologists to explore rooted phylogenetic networks. We demonstrate the utility of the algorithm using several previously studied data sets.

  18. Phylogenetic signal, feeding behaviour and brain volume in Neotropical bats.

    PubMed

    Rojas, D; Mancina, C A; Flores-Martínez, J J; Navarro, L

    2013-09-01

    Comparative correlational studies of brain size and ecological traits (e.g. feeding habits and habitat complexity) have increased our knowledge about the selective pressures on brain evolution. Studies conducted in bats as a model system assume that shared evolutionary history has a maximum effect on the traits. However, this effect has not been quantified. In addition, the effect of levels of diet specialization on brain size remains unclear. We examined the role of diet on the evolution of brain size in Mormoopidae and Phyllostomidae using two comparative methods. Body mass explained 89% of the variance in brain volume. The effect of feeding behaviour (either characterized as feeding habits, as levels of specialization on a type of item or as handling behaviour) on brain volume was also significant albeit not consistent after controlling for body mass and the strength of the phylogenetic signal (λ). Although the strength of the phylogenetic signal of brain volume and body mass was high when tested individually, λ values in phylogenetic generalized least squares models were significantly different from 1. This suggests that phylogenetic independent contrasts models are not always the best approach for the study of ecological correlates of brain size in New World bats. © 2013 The Authors. Journal of Evolutionary Biology © 2013 European Society For Evolutionary Biology.

  19. Phylesystem: a git-based data store for community-curated phylogenetic estimates.

    PubMed

    McTavish, Emily Jane; Hinchliff, Cody E; Allman, James F; Brown, Joseph W; Cranston, Karen A; Holder, Mark T; Rees, Jonathan A; Smith, Stephen A

    2015-09-01

    Phylogenetic estimates from published studies can be archived using general platforms like Dryad (Vision, 2010) or TreeBASE (Sanderson et al., 1994). Such services fulfill a crucial role in ensuring transparency and reproducibility in phylogenetic research. However, digital tree data files often require some editing (e.g. rerooting) to improve the accuracy and reusability of the phylogenetic statements. Furthermore, establishing the mapping between tip labels used in a tree and taxa in a single common taxonomy dramatically improves the ability of other researchers to reuse phylogenetic estimates. As the process of curating a published phylogenetic estimate is not error-free, retaining a full record of the provenance of edits to a tree is crucial for openness, allowing editors to receive credit for their work and making errors introduced during curation easier to correct. Here, we report the development of software infrastructure to support the open curation of phylogenetic data by the community of biologists. The backend of the system provides an interface for the standard database operations of creating, reading, updating and deleting records by making commits to a git repository. The record of the history of edits to a tree is preserved by git's version control features. Hosting this data store on GitHub (http://github.com/) provides open access to the data store using tools familiar to many developers. We have deployed a server running the 'phylesystem-api', which wraps the interactions with git and GitHub. The Open Tree of Life project has also developed and deployed a JavaScript application that uses the phylesystem-api and other web services to enable input and curation of published phylogenetic statements. Source code for the web service layer is available at https://github.com/OpenTreeOfLife/phylesystem-api. The data store can be cloned from: https://github.com/OpenTreeOfLife/phylesystem. A web application that uses the phylesystem web services is deployed

  20. Aquatic insect ecophysiological traits reveal phylogenetically based differences in dissolved cadmium susceptibility.

    PubMed

    Buchwalter, David B; Cain, Daniel J; Martin, Caitrin A; Xie, Lingtian; Luoma, Samuel N; Garland, Theodore

    2008-06-17

    We used a phylogenetically based comparative approach to evaluate the potential for physiological studies to reveal patterns of diversity in traits related to susceptibility to an environmental stressor, the trace metal cadmium (Cd). Physiological traits related to Cd bioaccumulation, compartmentalization, and ultimately susceptibility were measured in 21 aquatic insect species representing the orders Ephemeroptera, Plecoptera, and Trichoptera. We mapped these experimentally derived physiological traits onto a phylogeny and quantified the tendency for related species to be similar (phylogenetic signal). All traits related to Cd bioaccumulation and susceptibility exhibited statistically significant phylogenetic signal, although the signal strength varied among traits. Conventional and phylogenetically based regression models were compared, revealing great variability within orders but consistent, strong differences among insect families. Uptake and elimination rate constants were positively correlated among species, but only when effects of body size and phylogeny were incorporated in the analysis. Together, uptake and elimination rates predicted dramatic Cd bioaccumulation differences among species that agreed with field-based measurements. We discovered a potential tradeoff between the ability to eliminate Cd and the ability to detoxify it across species, particularly mayflies. The best-fit regression models were driven by phylogenetic parameters (especially differences among families) rather than functional traits, suggesting that it may eventually be possible to predict a taxon's physiological performance based on its phylogenetic position, provided adequate physiological information is available for close relatives. There appears to be great potential for evolutionary physiological approaches to augment our understanding of insect responses to environmental stressors in nature.

  1. Aquatic insect ecophysiological traits reveal phylogenetically based differences in dissolved cadmium susceptibility

    USGS Publications Warehouse

    Buchwalter, D.B.; Cain, D.J.; Martin, C.A.; Xie, Lingtian; Luoma, S.N.; Garland, T.

    2008-01-01

    We used a phylogenetically based comparative approach to evaluate the potential for physiological studies to reveal patterns of diversity in traits related to susceptibility to an environmental stressor, the trace metal cadmium (Cd). Physiological traits related to Cd bioaccumulation, compartmentalization, and ultimately susceptibility were measured in 21 aquatic insect species representing the orders Ephemeroptera, Plecoptera, and Trichoptera. We mapped these experimentally derived physiological traits onto a phylogeny and quantified the tendency for related species to be similar (phylogenetic signal). All traits related to Cd bioaccumulation and susceptibility exhibited statistically significant phylogenetic signal, although the signal strength varied among traits. Conventional and phylogenetically based regression models were compared, revealing great variability within orders but consistent, strong differences among insect families. Uptake and elimination rate constants were positively correlated among species, but only when effects of body size and phylogeny were incorporated in the analysis. Together, uptake and elimination rates predicted dramatic Cd bioaccumulation differences among species that agreed with field-based measurements. We discovered a potential tradeoff between the ability to eliminate Cd and the ability to detoxify it across species, particularly mayflies. The best-fit regression models were driven by phylogenetic parameters (especially differences among families) rather than functional traits, suggesting that it may eventually be possible to predict a taxon's physiological performance based on its phylogenetic position, provided adequate physiological information is available for close relatives. There appears to be great potential for evolutionary physiological approaches to augment our understanding of insect responses to environmental stressors in nature. ?? 2008 by The National Academy of Sciences of the USA.

  2. Aquatic insect ecophysiological traits reveal phylogenetically based differences in dissolved cadmium susceptibility

    PubMed Central

    Buchwalter, David B.; Cain, Daniel J.; Martin, Caitrin A.; Xie, Lingtian; Luoma, Samuel N.; Garland, Theodore

    2008-01-01

    We used a phylogenetically based comparative approach to evaluate the potential for physiological studies to reveal patterns of diversity in traits related to susceptibility to an environmental stressor, the trace metal cadmium (Cd). Physiological traits related to Cd bioaccumulation, compartmentalization, and ultimately susceptibility were measured in 21 aquatic insect species representing the orders Ephemeroptera, Plecoptera, and Trichoptera. We mapped these experimentally derived physiological traits onto a phylogeny and quantified the tendency for related species to be similar (phylogenetic signal). All traits related to Cd bioaccumulation and susceptibility exhibited statistically significant phylogenetic signal, although the signal strength varied among traits. Conventional and phylogenetically based regression models were compared, revealing great variability within orders but consistent, strong differences among insect families. Uptake and elimination rate constants were positively correlated among species, but only when effects of body size and phylogeny were incorporated in the analysis. Together, uptake and elimination rates predicted dramatic Cd bioaccumulation differences among species that agreed with field-based measurements. We discovered a potential tradeoff between the ability to eliminate Cd and the ability to detoxify it across species, particularly mayflies. The best-fit regression models were driven by phylogenetic parameters (especially differences among families) rather than functional traits, suggesting that it may eventually be possible to predict a taxon's physiological performance based on its phylogenetic position, provided adequate physiological information is available for close relatives. There appears to be great potential for evolutionary physiological approaches to augment our understanding of insect responses to environmental stressors in nature. PMID:18559853

  3. Measures of phylogenetic differentiation provide robust and complementary insights into microbial communities.

    PubMed

    Parks, Donovan H; Beiko, Robert G

    2013-01-01

    High-throughput sequencing techniques have made large-scale spatial and temporal surveys of microbial communities routine. Gaining insight into microbial diversity requires methods for effectively analyzing and visualizing these extensive data sets. Phylogenetic β-diversity measures address this challenge by allowing the relationship between large numbers of environmental samples to be explored using standard multivariate analysis techniques. Despite the success and widespread use of phylogenetic β-diversity measures, an extensive comparative analysis of these measures has not been performed. Here, we compare 39 measures of phylogenetic β diversity in order to establish the relative similarity of these measures along with key properties and performance characteristics. While many measures are highly correlated, those commonly used within microbial ecology were found to be distinct from those popular within classical ecology, and from the recently recommended Gower and Canberra measures. Many of the measures are surprisingly robust to different rootings of the gene tree, the choice of similarity threshold used to define operational taxonomic units, and the presence of outlying basal lineages. Measures differ considerably in their sensitivity to rare organisms, and the effectiveness of measures can vary substantially under alternative models of differentiation. Consequently, the depth of sequencing required to reveal underlying patterns of relationships between environmental samples depends on the selected measure. Our results demonstrate that using complementary measures of phylogenetic β diversity can further our understanding of how communities are phylogenetically differentiated. Open-source software implementing the phylogenetic β-diversity measures evaluated in this manuscript is available at http://kiwi.cs.dal.ca/Software/ExpressBetaDiversity.

  4. Computational Tools for Parsimony Phylogenetic Analysis of Omics Data

    PubMed Central

    Salazar, Jose; Amri, Hakima; Noursi, David

    2015-01-01

    Abstract High-throughput assays from genomics, proteomics, metabolomics, and next generation sequencing produce massive omics datasets that are challenging to analyze in biological or clinical contexts. Thus far, there is no publicly available program for converting quantitative omics data into input formats to be used in off-the-shelf robust phylogenetic programs. To the best of our knowledge, this is the first report on creation of two Windows-based programs, OmicsTract and SynpExtractor, to address this gap. We note, as a way of introduction and development of these programs, that one particularly useful bioinformatics inferential modeling is the phylogenetic cladogram. Cladograms are multidimensional tools that show the relatedness between subgroups of healthy and diseased individuals and the latter's shared aberrations; they also reveal some characteristics of a disease that would not otherwise be apparent by other analytical methods. The OmicsTract and SynpExtractor were written for the respective tasks of (1) accommodating advanced phylogenetic parsimony analysis (through standard programs of MIX [from PHYLIP] and TNT), and (2) extracting shared aberrations at the cladogram nodes. OmicsTract converts comma-delimited data tables through assigning each data point into a binary value (“0” for normal states and “1” for abnormal states) then outputs the converted data tables into the proper input file formats for MIX or with embedded commands for TNT. SynapExtractor uses outfiles from MIX and TNT to extract the shared aberrations of each node of the cladogram, matching them with identifying labels from the dataset and exporting them into a comma-delimited file. Labels may be gene identifiers in gene-expression datasets or m/z values in mass spectrometry datasets. By automating these steps, OmicsTract and SynpExtractor offer a veritable opportunity for rapid and standardized phylogenetic analyses of omics data; their model can also be extended to next

  5. An efficient and extensible approach for compressing phylogenetic trees

    PubMed Central

    2011-01-01

    Background Biologists require new algorithms to efficiently compress and store their large collections of phylogenetic trees. Our previous work showed that TreeZip is a promising approach for compressing phylogenetic trees. In this paper, we extend our TreeZip algorithm by handling trees with weighted branches. Furthermore, by using the compressed TreeZip file as input, we have designed an extensible decompressor that can extract subcollections of trees, compute majority and strict consensus trees, and merge tree collections using set operations such as union, intersection, and set difference. Results On unweighted phylogenetic trees, TreeZip is able to compress Newick files in excess of 98%. On weighted phylogenetic trees, TreeZip is able to compress a Newick file by at least 73%. TreeZip can be combined with 7zip with little overhead, allowing space savings in excess of 99% (unweighted) and 92%(weighted). Unlike TreeZip, 7zip is not immune to branch rotations, and performs worse as the level of variability in the Newick string representation increases. Finally, since the TreeZip compressed text (TRZ) file contains all the semantic information in a collection of trees, we can easily filter and decompress a subset of trees of interest (such as the set of unique trees), or build the resulting consensus tree in a matter of seconds. We also show the ease of which set operations can be performed on TRZ files, at speeds quicker than those performed on Newick or 7zip compressed Newick files, and without loss of space savings. Conclusions TreeZip is an efficient approach for compressing large collections of phylogenetic trees. The semantic and compact nature of the TRZ file allow it to be operated upon directly and quickly, without a need to decompress the original Newick file. We believe that TreeZip will be vital for compressing and archiving trees in the biological community. PMID:22165819

  6. Shigella species epidemiology and antimicrobial susceptibility: the implications of emerging azithromycin resistance for guiding treatment, guidelines and breakpoints.

    PubMed

    Brown, Jeremy D; Willcox, Simon J; Franklin, Neil; Hazelton, Briony; Howard, Peter; Reinten, Tracie; Sheppeard, Vicky; O'Sullivan, Matthew

    2017-11-01

    To examine antimicrobial susceptibility patterns and predictors of resistance among Shigella isolates in New South Wales (NSW), Australia during 2013-14 with emphasis on azithromycin. Cross-sectional analysis of all shigellosis cases (160) notified to public health authorities in NSW, Australia was performed. Among 160 Shigella isolates tested, 139 (86.9%) were susceptible to azithromycin, 104 (65.0%) to ciprofloxacin and 38 (23.7%) to co-trimoxazole. Ciprofloxacin resistance was 1.9 times more common in infections acquired in Australia compared with those acquired overseas, while azithromycin resistance was 8.5 times more common in males. We recommend ongoing reconsideration of guidelines for the treatment of shigellosis based on emerging resistance patterns. First-line therapy may need to be reconsidered based on local resistance rates due to common resistance to co-trimoxazole and ciprofloxacin. We recommend culture and susceptibility testing for suspected and proven shigellosis. Azithromycin susceptibility breakpoints for Shigella species may need to be species specific. © The Author 2017. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  7. Conservation threats and the phylogenetic utility of IUCN Red List rankings in Incilius toads.

    PubMed

    Schachat, Sandra R; Mulcahy, Daniel G; Mendelson, Joseph R

    2016-02-01

    Phylogenetic analysis of extinction threat is an emerging tool in the field of conservation. However, there are problems with the methods and data as commonly used. Phylogenetic sampling usually extends to the level of family or genus, but International Union for Conservation of Nature (IUCN) rankings are available only for individual species, and, although different species within a taxonomic group may have the same IUCN rank, the species may have been ranked as such for different reasons. Therefore, IUCN rank may not reflect evolutionary history and thus may not be appropriate for use in a phylogenetic context. To be used appropriately, threat-risk data should reflect the cause of extinction threat rather than the IUCN threat ranking. In a case study of the toad genus Incilius, with phylogenetic sampling at the species level (so that the resolution of the phylogeny matches character data from the IUCN Red List), we analyzed causes of decline and IUCN threat rankings by calculating metrics of phylogenetic signal (such as Fritz and Purvis' D). We also analyzed the extent to which cause of decline and threat ranking overlap by calculating phylogenetic correlation between these 2 types of character data. Incilius species varied greatly in both threat ranking and cause of decline; this variability would be lost at a coarser taxonomic resolution. We found far more phylogenetic signal, likely correlated with evolutionary history, for causes of decline than for IUCN threat ranking. Individual causes of decline and IUCN threat rankings were largely uncorrelated on the phylogeny. Our results demonstrate the importance of character selection and taxonomic resolution when extinction threat is analyzed in a phylogenetic context. © 2015 Society for Conservation Biology.

  8. Human-mediated loss of phylogenetic and functional diversity in coral reef fishes.

    PubMed

    D'agata, Stéphanie; Mouillot, David; Kulbicki, Michel; Andréfouët, Serge; Bellwood, David R; Cinner, Joshua E; Cowman, Peter F; Kronen, Mecki; Pinca, Silvia; Vigliola, Laurent

    2014-03-03

    Beyond the loss of species richness, human activities may also deplete the breadth of evolutionary history (phylogenetic diversity) and the diversity of roles (functional diversity) carried out by species within communities, two overlooked components of biodiversity. Both are, however, essential to sustain ecosystem functioning and the associated provision of ecosystem services, particularly under fluctuating environmental conditions. We quantified the effect of human activities on the taxonomic, phylogenetic, and functional diversity of fish communities in coral reefs, while teasing apart the influence of biogeography and habitat along a gradient of human pressure across the Pacific Ocean. We detected nonlinear relationships with significant breaking points in the impact of human population density on phylogenetic and functional diversity of parrotfishes, at 25 and 15 inhabitants/km(2), respectively, while parrotfish species richness decreased linearly along the same population gradient. Over the whole range, species richness decreased by 11.7%, while phylogenetic and functional diversity dropped by 35.8% and 46.6%, respectively. Our results call for caution when using species richness as a benchmark for measuring the status of ecosystems since it appears to be less responsive to variation in human population densities than its phylogenetic and functional counterparts, potentially imperiling the functioning of coral reef ecosystems. Copyright © 2014 Elsevier Ltd. All rights reserved.

  9. Phylogenetic relationships of the dwarf boas and a comparison of Bayesian and bootstrap measures of phylogenetic support.

    PubMed

    Wilcox, Thomas P; Zwickl, Derrick J; Heath, Tracy A; Hillis, David M

    2002-11-01

    Four New World genera of dwarf boas (Exiliboa, Trachyboa, Tropidophis, and Ungaliophis) have been placed by many systematists in a single group (traditionally called Tropidophiidae). However, the monophyly of this group has been questioned in several studies. Moreover, the overall relationships among basal snake lineages, including the placement of the dwarf boas, are poorly understood. We obtained mtDNA sequence data for 12S, 16S, and intervening tRNA-val genes from 23 species of snakes representing most major snake lineages, including all four genera of New World dwarf boas. We then examined the phylogenetic position of these species by estimating the phylogeny of the basal snakes. Our phylogenetic analysis suggests that New World dwarf boas are not monophyletic. Instead, we find Exiliboa and Ungaliophis to be most closely related to sand boas (Erycinae), boas (Boinae), and advanced snakes (Caenophidea), whereas Tropidophis and Trachyboa form an independent clade that separated relatively early in snake radiation. Our estimate of snake phylogeny differs significantly in other ways from some previous estimates of snake phylogeny. For instance, pythons do not cluster with boas and sand boas, but instead show a strong relationship with Loxocemus and Xenopeltis. Additionally, uropeltids cluster strongly with Cylindrophis, and together are embedded in what has previously been considered the macrostomatan radiation. These relationships are supported by both bootstrapping (parametric and nonparametric approaches) and Bayesian analysis, although Bayesian support values are consistently higher than those obtained from nonparametric bootstrapping. Simulations show that Bayesian support values represent much better estimates of phylogenetic accuracy than do nonparametric bootstrap support values, at least under the conditions of our study. Copyright 2002 Elsevier Science (USA)

  10. Molecular phylogenetics of porcini mushrooms (Boletus section Boletus).

    PubMed

    Dentinger, Bryn T M; Ammirati, Joseph F; Both, Ernst E; Desjardin, Dennis E; Halling, Roy E; Henkel, Terry W; Moreau, Pierre-Arthur; Nagasawa, Eiji; Soytong, Kasem; Taylor, Andy F; Watling, Roy; Moncalvo, Jean-Marc; McLaughlin, David J

    2010-12-01

    Porcini (Boletus section Boletus: Boletaceae: Boletineae: Boletales) are a conspicuous group of wild, edible mushrooms characterized by fleshy fruiting bodies with a poroid hymenophore that is "stuffed" with white hyphae when young. Their reported distribution is with ectomycorrhizal plants throughout the Northern Hemisphere. Little progress has been made on the systematics of this group using modern molecular phylogenetic tools because sampling has been limited primarily to European species and the genes employed were insufficient to resolve the phylogeny. We examined the evolutionary history of porcini by using a global geographic sampling of most known species, new discoveries from little explored areas, and multiple genes. We used 78 sequences from the fast-evolving nuclear internal transcribed spacers and are able to recognize 18 reciprocally monophyletic species. To address whether or not porcini form a monophyletic group, we compiled a broadly sampled dataset of 41 taxa, including other members of the Boletineae, and used separate and combined phylogenetic analysis of sequences from the nuclear large subunit ribosomal DNA, the largest subunit of RNA polymerase II, and the mitochondrial ATPase subunit six gene. Contrary to previous studies, our separate and combined phylogenetic analyses support the monophyly of porcini. We also report the discovery of two taxa that expand the known distribution of porcini to Australia and Thailand and have ancient phylogenetic connections to the rest of the group. A relaxed molecular clock analysis with these new taxa dates the origin of porcini to between 42 and 54 million years ago, coinciding with the initial diversification of angiosperms, during the Eocene epoch when the climate was warm and humid. These results reveal an unexpected diversity, distribution, and ancient origin of a group of commercially valuable mushrooms that may provide an economic incentive for conservation and support the hypothesis of a tropical

  11. Local-scale Partitioning of Functional and Phylogenetic Beta Diversity in a Tropical Tree Assemblage.

    PubMed

    Yang, Jie; Swenson, Nathan G; Zhang, Guocheng; Ci, Xiuqin; Cao, Min; Sha, Liqing; Li, Jie; Ferry Slik, J W; Lin, Luxiang

    2015-08-03

    The relative degree to which stochastic and deterministic processes underpin community assembly is a central problem in ecology. Quantifying local-scale phylogenetic and functional beta diversity may shed new light on this problem. We used species distribution, soil, trait and phylogenetic data to quantify whether environmental distance, geographic distance or their combination are the strongest predictors of phylogenetic and functional beta diversity on local scales in a 20-ha tropical seasonal rainforest dynamics plot in southwest China. The patterns of phylogenetic and functional beta diversity were generally consistent. The phylogenetic and functional dissimilarity between subplots (10 × 10 m, 20 × 20 m, 50 × 50 m and 100 × 100 m) was often higher than that expected by chance. The turnover of lineages and species function within habitats was generally slower than that across habitats. Partitioning the variation in phylogenetic and functional beta diversity showed that environmental distance was generally a better predictor of beta diversity than geographic distance thereby lending relatively more support for deterministic environmental filtering over stochastic processes. Overall, our results highlight that deterministic processes play a stronger role than stochastic processes in structuring community composition in this diverse assemblage of tropical trees.

  12. Evidence of two distinct phylogenetic lineages of dog rabies virus circulating in Cambodia.

    PubMed

    Mey, Channa; Metlin, Artem; Duong, Veasna; Ong, Sivuth; In, Sotheary; Horwood, Paul F; Reynes, Jean-Marc; Bourhy, Hervé; Tarantola, Arnaud; Buchy, Philippe

    2016-03-01

    This first extensive retrospective study of the molecular epidemiology of dog rabies in Cambodia included 149 rabies virus (RABV) entire nucleoprotein sequences obtained from 1998-2011. The sequences were analyzed in conjunction with RABVs from other Asian countries. Phylogenetic reconstruction confirmed the South-East Asian phylogenetic clade comprising viruses from Cambodia, Vietnam, Thailand, Laos and Myanmar. The present study represents the first attempt to classify the phylogenetic lineages inside this clade, resulting in the confirmation that all the Cambodian viruses belonged to the South-East Asian (SEA) clade. Three distinct phylogenetic lineages in the region were established with the majority of viruses from Cambodia closely related to viruses from Thailand, Laos and Vietnam, forming the geographically widespread phylogenetic lineage SEA1. A South-East Asian lineage SEA2 comprised two viruses from Cambodia was identified, which shared a common ancestor with RABVs originating from Laos. Viruses from Myanmar formed separate phylogenetic lineages within the major SEA clade. Bayesian molecular clock analysis suggested that the time to most recent common ancestor (TMRCA) of all Cambodian RABVs dated to around 1950. The TMRCA of the Cambodian SEA1 lineage was around 1964 and that of the SEA2 lineage was around 1953. The results identified three phylogenetically distinct and geographically separated lineages inside the earlier identified major SEA clade, covering at least five countries in the region. A greater understanding of the molecular epidemiology of rabies in South-East Asia is an important step to monitor progress on the efforts to control canine rabies in the region. Copyright © 2015 Elsevier B.V. All rights reserved.

  13. Phylogenetic position of the genus Perkinsus (Protista, Apicomplexa) based on small subunit ribosomal RNA.

    PubMed

    Goggin, C L; Barker, S C

    1993-07-01

    Parasites of the genus Perkinsus destroy marine molluscs worldwide. Their phylogenetic position within the kingdom Protista is controversial. Nucleotide sequence data (1792 bp) from the small subunit rRNA gene of Perkinsus sp. from Anadara trapezia (Mollusca: Bivalvia) from Moreton Bay, Queensland, was used to examine the phylogenetic affinities of this enigmatic genus. These data were aligned with nucleotide sequences from 6 apicomplexans, 3 ciliates, 3 flagellates, a dinoflagellate, 3 fungi, maize and human. Phylogenetic trees were constructed after analysis with maximum parsimony and distance matrix methods. Our analyses indicate that Perkinsus is phylogenetically closer to dinoflagellates and to coccidean and piroplasm apicomplexans than to fungi or flagellates.

  14. Opposing assembly mechanisms in a neotropical dry forest: implications for phylogenetic and functional community ecology.

    PubMed

    Swenson, Nathan G; Enquist, Brian J

    2009-08-01

    Species diversity is promoted and maintained by ecological and evolutionary processes operating on species attributes through space and time. The degree to which variability in species function regulates distribution and promotes coexistence of species has been debated. Previous work has attempted to quantify the relative importance of species function by using phylogenetic relatedness as a proxy for functional similarity. The key assumption of this approach is that function is phylogenetically conserved. If this assumption is supported, then the phylogenetic dispersion in a community should mirror the functional dispersion. Here we quantify functional trait dispersion along several key axes of tree life-history variation and on multiple spatial scales in a Neotropical dry-forest community. We next compare these results to previously reported patterns of phylogenetic dispersion in this same forest. We find that, at small spatial scales, coexisting species are typically more functionally clustered than expected, but traits related to adult and regeneration niches are overdispersed. This outcome was repeated when the analyses were stratified by size class. Some of the trait dispersion results stand in contrast to the previously reported phylogenetic dispersion results. In order to address this inconsistency we examined the strength of phylogenetic signal in traits at different depths in the phylogeny. We argue that: (1) while phylogenetic relatedness may be a good general multivariate proxy for ecological similarity, it may have a reduced capacity to depict the functional mechanisms behind species coexistence when coexisting species simultaneously converge and diverge in function; and (2) the previously used metric of phylogenetic signal provided erroneous inferences about trait dispersion when married with patterns of phylogenetic dispersion.

  15. Metabolic Pathway Assignment of Plant Genes based on Phylogenetic Profiling–A Feasibility Study

    PubMed Central

    Weißenborn, Sandra; Walther, Dirk

    2017-01-01

    Despite many developed experimental and computational approaches, functional gene annotation remains challenging. With the rapidly growing number of sequenced genomes, the concept of phylogenetic profiling, which predicts functional links between genes that share a common co-occurrence pattern across different genomes, has gained renewed attention as it promises to annotate gene functions based on presence/absence calls alone. We applied phylogenetic profiling to the problem of metabolic pathway assignments of plant genes with a particular focus on secondary metabolism pathways. We determined phylogenetic profiles for 40,960 metabolic pathway enzyme genes with assigned EC numbers from 24 plant species based on sequence and pathway annotation data from KEGG and Ensembl Plants. For gene sequence family assignments, needed to determine the presence or absence of particular gene functions in the given plant species, we included data of all 39 species available at the Ensembl Plants database and established gene families based on pairwise sequence identities and annotation information. Aside from performing profiling comparisons, we used machine learning approaches to predict pathway associations from phylogenetic profiles alone. Selected metabolic pathways were indeed found to be composed of gene families of greater than expected phylogenetic profile similarity. This was particularly evident for primary metabolism pathways, whereas for secondary pathways, both the available annotation in different species as well as the abstraction of functional association via distinct pathways proved limiting. While phylogenetic profile similarity was generally not found to correlate with gene co-expression, direct physical interactions of proteins were reflected by a significantly increased profile similarity suggesting an application of phylogenetic profiling methods as a filtering step in the identification of protein-protein interactions. This feasibility study highlights the

  16. A Consistent Phylogenetic Backbone for the Fungi

    PubMed Central

    Ebersberger, Ingo; de Matos Simoes, Ricardo; Kupczok, Anne; Gube, Matthias; Kothe, Erika; Voigt, Kerstin; von Haeseler, Arndt

    2012-01-01

    The kingdom of fungi provides model organisms for biotechnology, cell biology, genetics, and life sciences in general. Only when their phylogenetic relationships are stably resolved, can individual results from fungal research be integrated into a holistic picture of biology. However, and despite recent progress, many deep relationships within the fungi remain unclear. Here, we present the first phylogenomic study of an entire eukaryotic kingdom that uses a consistency criterion to strengthen phylogenetic conclusions. We reason that branches (splits) recovered with independent data and different tree reconstruction methods are likely to reflect true evolutionary relationships. Two complementary phylogenomic data sets based on 99 fungal genomes and 109 fungal expressed sequence tag (EST) sets analyzed with four different tree reconstruction methods shed light from different angles on the fungal tree of life. Eleven additional data sets address specifically the phylogenetic position of Blastocladiomycota, Ustilaginomycotina, and Dothideomycetes, respectively. The combined evidence from the resulting trees supports the deep-level stability of the fungal groups toward a comprehensive natural system of the fungi. In addition, our analysis reveals methodologically interesting aspects. Enrichment for EST encoded data—a common practice in phylogenomic analyses—introduces a strong bias toward slowly evolving and functionally correlated genes. Consequently, the generalization of phylogenomic data sets as collections of randomly selected genes cannot be taken for granted. A thorough characterization of the data to assess possible influences on the tree reconstruction should therefore become a standard in phylogenomic analyses. PMID:22114356

  17. Non-linear absorption pharmacokinetics of amoxicillin: consequences for dosing regimens and clinical breakpoints.

    PubMed

    de Velde, Femke; de Winter, Brenda C M; Koch, Birgit C P; van Gelder, Teun; Mouton, Johan W

    2016-10-01

    To describe the population pharmacokinetics of oral amoxicillin and to compare the PTA of current dosing regimens. Two groups, each with 14 healthy male volunteers, received oral amoxicillin/clavulanic acid tablets on two separate days 1 week apart. One group received 875/125 mg twice daily and 500/125 mg three times daily and the other group 500/125 mg twice daily and 250/125 mg three times daily. A total of 1428 amoxicillin blood samples were collected before and after administration. We analysed the concentration-time profiles using a non-compartmental pharmacokinetic method (PKSolver) and a population pharmacokinetic method (NONMEM). The PTA was computed using Monte Carlo simulations for several dosing regimens. AUC0-24 and Cmax increased non-linearly with dose. The final model included the following components: Savic's transit compartment model, Michaelis-Menten absorption, two distribution compartments and first-order elimination. The mean central volume of distribution was 27.7 L and mean clearance was 21.3 L/h. We included variability for the central volume of distribution (34.4%), clearance (25.8%), transit compartment model parameters and Michaelis-Menten absorption parameters. For 40% fT>MIC and >97.5% PTA, the breakpoints were 0.125 mg/L (500 mg twice daily), 0.25 mg/L (250 mg three times daily and 875 mg twice daily), 0.5 mg/L (500 mg three times daily) and 1 mg/L (750, 875 or 1000 mg three times daily and 500 mg four times daily). The amoxicillin absorption rate appears to be saturable. The PTAs of high-dose as well as twice-daily regimens are less favourable than regimens with lower doses and higher frequency. © The Author 2016. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  18. Plant-Pollinator Coextinctions and the Loss of Plant Functional and Phylogenetic Diversity

    PubMed Central

    Vieira, Marcos Costa; Cianciaruso, Marcus Vinicius; Almeida-Neto, Mário

    2013-01-01

    Plant-pollinator coextinctions are likely to become more frequent as habitat alteration and climate change continue to threaten pollinators. The consequences of the resulting collapse of plant communities will depend partly on how quickly plant functional and phylogenetic diversity decline following pollinator extinctions. We investigated the functional and phylogenetic consequences of pollinator extinctions by simulating coextinctions in seven plant-pollinator networks coupled with independent data on plant phylogeny and functional traits. Declines in plant functional diversity were slower than expected under a scenario of random extinctions, while phylogenetic diversity often decreased faster than expected by chance. Our results show that plant functional diversity was relatively robust to plant-pollinator coextinctions, despite the underlying rapid loss of evolutionary history. Thus, our study suggests the possibility of uncoupled responses of functional and phylogenetic diversity to species coextinctions, highlighting the importance of considering both dimensions of biodiversity explicitly in ecological studies and when planning for the conservation of species and interactions. PMID:24312281

  19. Morphological, molecular and phylogenetic analyses of Diplotriaena bargusinica Skrjabin, 1917 (Nematoda: Diplotriaenidae).

    PubMed

    Dutra Vieira, Thainá; Pegoraro de Macedo, Marcia Raquel; Fedatto Bernardon, Fabiana; Müller, Gertrud

    2017-10-01

    The nematode Diplotriaena bargusinica is a bird air sac parasite, and its taxonomy is based mainly on morphological and morphometric characteristics. Increasing knowledge of genetic information variability has spurred the use of DNA markers in conjunction with morphological data for inferring phylogenetic relationships in different taxa. Considering the potential of molecular biology in taxonomy, this study presents the morphological and molecular characterization of D. bargusinica, and establishes the phylogenetic position of the nematode in Spirurina. Twenty partial sequences of the 18S region of D. bargusinica rDNA were generated. Phylogenetic trees were obtained through the Maximum Likelihood and Bayesian Inference methods where both had similar topology. The group Diplotriaenoidea is monophyletic and the topologies generated corroborate the phylogenetic studies based on traditional and previously performed molecular taxonomy. This study is the first to generate molecular data associated with the morphology of the species. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. Phylogenetic relatedness and leaf functional traits, not introduced status, influence community assembly.

    PubMed

    Lemoine, Nathan P; Shue, Jessica; Verrico, Brittany; Erickson, David; Kress, W John; Parker, John D

    2015-10-01

    Considerable debate focuses on whether invasive species establish and become abundant by being functionally and phylogenetically distinct from native species, leading to a host of invasion-specific hypotheses of community assembly. Few studies, however, have quantitatively assessed whether similar patterns of phylogenetic and functional similarity explain local abundance of both native and introduced species, which would suggest similar assembly mechanisms regardless of origin. Using a chronosequence of invaded temperate forest stands, we tested whether the occurrence and abundance of both introduced and native species were predicted by phylogenetic relatedness, functional overlap, and key environmental characteristics including forest age. Environmental filtering against functionally and phylogenetically distinct species strongly dictated the occurrence and abundance of both introduced and native species, with slight modifications of these patterns according to forest age. Thus, once functional and evolutionary novelty were quantified, introduced status provided little information about species' presence or abundance, indicating largely similar sorting mechanisms for both native and introduced species.

  1. Phylogenetic Analysis of Nuclear-Encoded RNA Maturases

    PubMed Central

    Malik, Sunita; Upadhyaya, KC; Khurana, SM Paul

    2017-01-01

    Posttranscriptional processes, such as splicing, play a crucial role in gene expression and are prevalent not only in nuclear genes but also in plant mitochondria where splicing of group II introns is catalyzed by a class of proteins termed maturases. In plant mitochondria, there are 22 mitochondrial group II introns. matR, nMAT1, nMAT2, nMAT3, and nMAT4 proteins have been shown to be required for efficient splicing of several group II introns in Arabidopsis thaliana. Nuclear maturases (nMATs) are necessary for splicing of mitochondrial genes, leading to normal oxidative phosphorylation. Sequence analysis through phylogenetic tree (including bootstrapping) revealed high homology with maturase sequences of A thaliana and other plants. This study shows the phylogenetic relationship of nMAT proteins between A thaliana and other nonredundant plant species taken from BLASTP analysis. PMID:28607538

  2. Octocoral Mitochondrial Genomes Provide Insights into the Phylogenetic History of Gene Order Rearrangements, Order Reversals, and Cnidarian Phylogenetics

    PubMed Central

    Figueroa, Diego F.; Baco, Amy R.

    2015-01-01

    We use full mitochondrial genomes to test the robustness of the phylogeny of the Octocorallia, to determine the evolutionary pathway for the five known mitochondrial gene rearrangements in octocorals, and to test the suitability of using mitochondrial genomes for higher taxonomic-level phylogenetic reconstructions. Our phylogeny supports three major divisions within the Octocorallia and show that Paragorgiidae is paraphyletic, with Sibogagorgia forming a sister branch to the Coralliidae. Furthermore, Sibogagorgia cauliflora has what is presumed to be the ancestral gene order in octocorals, but the presence of a pair of inverted repeat sequences suggest that this gene order was not conserved but rather evolved back to this apparent ancestral state. Based on this we recommend the resurrection of the family Sibogagorgiidae to fix the paraphyly of the Paragorgiidae. This is the first study to show that in the Octocorallia, mitochondrial gene orders have evolved back to an ancestral state after going through a gene rearrangement, with at least one of the gene orders evolving independently in different lineages. A number of studies have used gene boundaries to determine the type of mitochondrial gene arrangement present. However, our findings suggest that this method known as gene junction screening may miss evolutionary reversals. Additionally, substitution saturation analysis demonstrates that while whole mitochondrial genomes can be used effectively for phylogenetic analyses within Octocorallia, their utility at higher taxonomic levels within Cnidaria is inadequate. Therefore for phylogenetic reconstruction at taxonomic levels higher than subclass within the Cnidaria, nuclear genes will be required, even when whole mitochondrial genomes are available. PMID:25539723

  3. Developing a statistically powerful measure for quartet tree inference using phylogenetic identities and Markov invariants.

    PubMed

    Sumner, Jeremy G; Taylor, Amelia; Holland, Barbara R; Jarvis, Peter D

    2017-12-01

    Recently there has been renewed interest in phylogenetic inference methods based on phylogenetic invariants, alongside the related Markov invariants. Broadly speaking, both these approaches give rise to polynomial functions of sequence site patterns that, in expectation value, either vanish for particular evolutionary trees (in the case of phylogenetic invariants) or have well understood transformation properties (in the case of Markov invariants). While both approaches have been valued for their intrinsic mathematical interest, it is not clear how they relate to each other, and to what extent they can be used as practical tools for inference of phylogenetic trees. In this paper, by focusing on the special case of binary sequence data and quartets of taxa, we are able to view these two different polynomial-based approaches within a common framework. To motivate the discussion, we present three desirable statistical properties that we argue any invariant-based phylogenetic method should satisfy: (1) sensible behaviour under reordering of input sequences; (2) stability as the taxa evolve independently according to a Markov process; and (3) explicit dependence on the assumption of a continuous-time process. Motivated by these statistical properties, we develop and explore several new phylogenetic inference methods. In particular, we develop a statistically bias-corrected version of the Markov invariants approach which satisfies all three properties. We also extend previous work by showing that the phylogenetic invariants can be implemented in such a way as to satisfy property (3). A simulation study shows that, in comparison to other methods, our new proposed approach based on bias-corrected Markov invariants is extremely powerful for phylogenetic inference. The binary case is of particular theoretical interest as-in this case only-the Markov invariants can be expressed as linear combinations of the phylogenetic invariants. A wider implication of this is that, for

  4. A Visual Interface for Querying Heterogeneous Phylogenetic Databases.

    PubMed

    Jamil, Hasan M

    2017-01-01

    Despite the recent growth in the number of phylogenetic databases, access to these wealth of resources remain largely tool or form-based interface driven. It is our thesis that the flexibility afforded by declarative query languages may offer the opportunity to access these repositories in a better way, and to use such a language to pose truly powerful queries in unprecedented ways. In this paper, we propose a substantially enhanced closed visual query language, called PhyQL, that can be used to query phylogenetic databases represented in a canonical form. The canonical representation presented helps capture most phylogenetic tree formats in a convenient way, and is used as the storage model for our PhyloBase database for which PhyQL serves as the query language. We have implemented a visual interface for the end users to pose PhyQL queries using visual icons, and drag and drop operations defined over them. Once a query is posed, the interface translates the visual query into a Datalog query for execution over the canonical database. Responses are returned as hyperlinks to phylogenies that can be viewed in several formats using the tree viewers supported by PhyloBase. Results cached in PhyQL buffer allows secondary querying on the computed results making it a truly powerful querying architecture.

  5. [PK/PD breakpoints and clinical/bacteriological effects of cefcapene pivoxil fine granules for children at free drug concentrations in pediatric patients with respiratory infection].

    PubMed

    Toyonaga, Yoshikiyo; Iwai, Naoichi; Motohiro, Takashi; Sunakawa, Keisuke; Fujii, Ryochi

    2008-06-01

    A post-marketing clinical study was previously conducted in pediatric patients with respiratory infection to evaluate the pharmacokinetics, efficacy and safety of cefcapene pivoxil (CFPN-PI) fine granules for children. Based on the results from this study, we evaluated PK/PD breakpoints and clinical/bacteriological effects of CFPN-PI at free drug concentrations in pediatric patients with respiratory infection to determine an effective and safe dosage regimen of CFPN-PI. The following results were obtained from 61 pediatric patients evaluated in our research. 1) The response rate of pediatric respiratory infection to CFPN-PI was 100% for laryngopharyngitis, 84.6% for acute bronchitis, 100% for tonsillitis, 100% for pneumonia and 95.8% for all. 2) The bacteriological response (eradication rate of Haemophilus influenzae, Streptococcus pyogenes, Moraxella catarrhalis, Streptococcus pneumoniae, etc.) of pediatric respiratory infection to CFPN-PI was 87.5% for laryngopharyngitis, 66.7% for acute bronchitis, 75.0% for tonsillitis, 63.6% for pneumonia and 73.8% for all. 3) The blood concentration simulation demonstrated that the PK/PD breakpoint exceeding the time above MIC (TAM) of 40% after administration of CFPN-PI 3 mg/kg three times daily was 0.27 microg/mL. 4) The pediatric patients with respiratory infection were stratified by the TAM (%) of CFPN-PI into 40% to 100% (TAM > or = 40% group) and 0% to 40% (TAM < 40% group) to compare the clinical and bacteriological effects of CFPN-PI. The clinical and bacteriological response rates, respectively, were 97.4% and 77.8% in the TAM > or = 40% group, and 88.9% and 62.5% in the TAM < 40% group. There was no difference in the clinical effect between the two TAM-stratified groups. On the other hand, the bacteriological effect, i.e., eradication rate, tended to be higher in the TAM > or = 40% group than in the TAM < 40% group, although the between-group difference was not statistically significant.

  6. Fish species introductions provide novel insights into the patterns and drivers of phylogenetic structure in freshwaters

    PubMed Central

    Strecker, Angela L.; Olden, Julian D.

    2014-01-01

    Despite long-standing interest of terrestrial ecologists, freshwater ecosystems are a fertile, yet unappreciated, testing ground for applying community phylogenetics to uncover mechanisms of species assembly. We quantify phylogenetic clustering and overdispersion of native and non-native fishes of a large river basin in the American Southwest to test for the mechanisms (environmental filtering versus competitive exclusion) and spatial scales influencing community structure. Contrary to expectations, non-native species were phylogenetically clustered and related to natural environmental conditions, whereas native species were not phylogenetically structured, likely reflecting human-related changes to the basin. The species that are most invasive (in terms of ecological impacts) tended to be the most phylogenetically divergent from natives across watersheds, but not within watersheds, supporting the hypothesis that Darwin's naturalization conundrum is driven by the spatial scale. Phylogenetic distinctiveness may facilitate non-native establishment at regional scales, but environmental filtering restricts local membership to closely related species with physiological tolerances for current environments. By contrast, native species may have been phylogenetically clustered in historical times, but species loss from contemporary populations by anthropogenic activities has likely shaped the phylogenetic signal. Our study implies that fundamental mechanisms of community assembly have changed, with fundamental consequences for the biogeography of both native and non-native species. PMID:24452027

  7. Phylogenetic perspective and the search for life on earth and elsewhere

    NASA Technical Reports Server (NTRS)

    Pace, Norman R.

    1989-01-01

    Any search for microbial life on Mars cannot rely upon cultivation of indigenous organisms. Only a minority of even terrestrial organisms that are observed in mixed, naturally-occurring microbial populations can be cultivated in the laboratory. Consequently, methods are being developed for analyzing the phylogenetic affiliations of the constituents of natural microbial populations without the need for their cultivation. This is more than an exercise in taxonomy, for the extent of phylogenetic relatedness between unknown and known organisms is some measure of the extent of their biochemical commonalities. In one approach, total DNA is isolated from natural microbial populations and 16S rRNA genes are shotgun cloned for rapid sequence determinations and phylogenetic analyses. A second approach employs oligodeoxynucleotide hybridization probes that bind to phylogenetic group-specific sequences in 16S rRNA. Since each actively growing cell contains about 104 ribosomes, the binding of the diagnostic probes to single cells can be visualized by radioactivity or fluorescence. The application of these methods and the use of in situ cultivation techniques is illustrated using submarine hydrothermal vent communities. Recommendations are made regarding planning toward future Mars missions.

  8. Phylogenetic Analysis and Molecular Characterization of Xanthium sibiricum Using DNA Barcoding, PCR-RFLP, and Specific Primers.

    PubMed

    Tomasello, Salvatore; Heubl, Günther

    2017-07-01

    The fruits of Xanthium sibiricum have been widely used in traditional Chinese medicine for the treatment of nasal sinusitis and headaches. The genus Xanthium (cocklebur) is a taxonomically complex genus. Different taxonomic concepts have been proposed, some including several species, others lumping the different taxa in a few extremely polymorphic species. Due to the morphological similarities between species, the correct authentication of X. sibiricum is very difficult. Therefore, we established a polymerase chain reaction-restriction fragment length polymorphism method and diagnostic PCR based on nuclear internal transcribed spacer and chloroplast trnQ-rps16 barcodes to differentiate X. sibirium from related species.Results from the phylogenetic analyses based on sequence information from four marker regions (plastidal psbA-trnH and trnQ-rps16 and nuclear ITS and D35 ) support those taxonomic concepts accepting a reduced number of species, as four to five major clades are revealed in the phylogenetic reconstructions. X. sibiricum , together with some accessions from closely related taxa, is always supported as monophyletic, constituting a well-defined genetic entity. Allele-specific primer pairs for ITS and trnQ-rps16 were designed to amplify diagnostic products from the genomic DNA of X. sibiricum . Specific PCR in combination with digestion using the restriction enzyme Mse I allowed for the identification of X. sibiricum by producing specific restriction patterns. The results demonstrate that the applied techniques provide effective and accurate authentication of X. sibiricum . Georg Thieme Verlag KG Stuttgart · New York.

  9. Complete mitogenome of Asiatic lion resolves phylogenetic status within Panthera.

    PubMed

    Bagatharia, Snehal B; Joshi, Madhvi N; Pandya, Rohan V; Pandit, Aanal S; Patel, Riddhi P; Desai, Shivangi M; Sharma, Anu; Panchal, Omkar; Jasmani, Falguni P; Saxena, Akshay K

    2013-08-23

    The origin, evolution and speciation of the lion, has been subject of interest, debate and study. The present surviving lions of the genus Panthera comprise of eight sub-species inclusive of Asiatic lion Panthera leo persica of India's Gir forest. Except for the Asiatic lion, the other seven subspecies are found in different parts of Africa. There have been different opinions regarding the phylogenetic status of Panthera leo, as well as classifying lions of different geographic regions into subspecies and races. In the present study, mitogenome sequence of P. leo persica deduced, using Ion Torrent PGM to assess phylogeny and evolution which may play an increasingly important role in conservation biology. The mtDNA sequence of P. leo persica is 17,057 bp in length with 40.8% GC content. Annotation of mitogenome revealed total 37 genes, including 13 protein coding, 2 rRNA and 22 tRNA. Phylogenetic analysis based on whole mitogenome, suggests Panthera pardus as a neighbouring species to P. leo with species divergence at ~2.96 mya. This work presents first report on complete mitogenome of Panthera leo persica. It sheds light on the phylogenetic and evolutionary status within and across Felidae members. The result compared and evaluated with earlier reports of Felidae shows alteration of phylogenetic status and species evolution. This study may provide information on genetic diversity and population stability.

  10. Complete mitogenome of asiatic lion resolves phylogenetic status within Panthera

    PubMed Central

    2013-01-01

    Background The origin, evolution and speciation of the lion, has been subject of interest, debate and study. The present surviving lions of the genus Panthera comprise of eight sub-species inclusive of Asiatic lion Panthera leo persica of India's Gir forest. Except for the Asiatic lion, the other seven subspecies are found in different parts of Africa. There have been different opinions regarding the phylogenetic status of Panthera leo, as well as classifying lions of different geographic regions into subspecies and races. In the present study, mitogenome sequence of P. leo persica deduced, using Ion Torrent PGM to assess phylogeny and evolution which may play an increasingly important role in conservation biology. Results The mtDNA sequence of P. leo persica is 17,057 bp in length with 40.8% GC content. Annotation of mitogenome revealed total 37 genes, including 13 protein coding, 2 rRNA and 22 tRNA. Phylogenetic analysis based on whole mitogenome, suggests Panthera pardus as a neighbouring species to P. leo with species divergence at ~2.96 mya. Conclusion This work presents first report on complete mitogenome of Panthera leo persica. It sheds light on the phylogenetic and evolutionary status within and across Felidae members. The result compared and evaluated with earlier reports of Felidae shows alteration of phylogenetic status and species evolution. This study may provide information on genetic diversity and population stability. PMID:23968279

  11. High-resolution phylogenetic microbial community profiling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Singer, Esther; Coleman-Derr, Devin; Bowman, Brett

    2014-03-17

    The representation of bacterial and archaeal genome sequences is strongly biased towards cultivated organisms, which belong to merely four phylogenetic groups. Functional information and inter-phylum level relationships are still largely underexplored for candidate phyla, which are often referred to as microbial dark matter. Furthermore, a large portion of the 16S rRNA gene records in the GenBank database are labeled as environmental samples and unclassified, which is in part due to low read accuracy, potential chimeric sequences produced during PCR amplifications and the low resolution of short amplicons. In order to improve the phylogenetic classification of novel species and advance ourmore » knowledge of the ecosystem function of uncultivated microorganisms, high-throughput full length 16S rRNA gene sequencing methodologies with reduced biases are needed. We evaluated the performance of PacBio single-molecule real-time (SMRT) sequencing in high-resolution phylogenetic microbial community profiling. For this purpose, we compared PacBio and Illumina metagenomic shotgun and 16S rRNA gene sequencing of a mock community as well as of an environmental sample from Sakinaw Lake, British Columbia. Sakinaw Lake is known to contain a large age of microbial species from candidate phyla. Sequencing results show that community structure based on PacBio shotgun and 16S rRNA gene sequences is highly similar in both the mock and the environmental communities. Resolution power and community representation accuracy from SMRT sequencing data appeared to be independent of GC content of microbial genomes and was higher when compared to Illumina-based metagenome shotgun and 16S rRNA gene (iTag) sequences, e.g. full-length sequencing resolved all 23 OTUs in the mock community, while iTags did not resolve closely related species. SMRT sequencing hence offers various potential benefits when characterizing uncharted microbial communities.« less

  12. Molecular phylogenetic study in genus Hydra.

    PubMed

    Kawaida, Hitomi; Shimizu, Hiroshi; Fujisawa, Toshitaka; Tachida, Hidenori; Kobayakawa, Yoshitaka

    2010-11-15

    Among 8000-9000 species of Cnidaria, only several dozens of species of Hydrozoa have been found in the fresh water. Hydra is such a fresh water polyp and has been used as a good material for research in developmental biology, regeneration and pattern formation. Although the genus Hydra has only a few ten species, its distribution is cosmopolitan. The phylogenetic relationship between hydra species is fascinating from the aspect of evolutionary biology and biogeography. However, only a few molecular phylogenetic studies have been reported on hydra. Therefore, we conducted a molecular phylogenetic study of the genus Hydra based on mitochondrial and nuclear nucleotide sequences using a hydra collection that has been kept in the National Institute of Genetics (NIG) of Japan. The results support the idea that four species groups comprise the genus Hydra. Within the viridissima group (green hydra) and braueri group, genetic distances between strains were relatively large. In contrast, genetic distances between strains among the vulgaris and oligactis groups were small irrespective of their geographic distribution. The vulgaris group strains were classified at least (as far as our investigated samples) into three sub-groups, vulgaris sub-group, carnea sub-group, and H. sp. (K5 and K6) sub-group. All of the vulgaris sub-group and H. sp. (K5 and K6) sub-group strains were collected in Eurasia. The carnea sub-group strains in NIG collection were all collected in North America. A few newly collected samples in Japan, however, suggested belonging to the carnea sub-group according to the molecular phylogenic analysis. This suggests a trans-Pacific distribution of the carnea sub-group hydra. Copyright © 2010 Elsevier B.V. All rights reserved.

  13. Evolutionary history of tall fescue morphotypes inferred from molecular phylogenetics of the Lolium-Festuca species complex

    PubMed Central

    2010-01-01

    Background The agriculturally important pasture grass tall fescue (Festuca arundinacea Schreb. syn. Lolium arundinaceum (Schreb.) Darbysh.) is an outbreeding allohexaploid, that may be more accurately described as a species complex consisting of three major (Continental, Mediterranean and rhizomatous) morphotypes. Observation of hybrid infertility in some crossing combinations between morphotypes suggests the possibility of independent origins from different diploid progenitors. This study aims to clarify the evolutionary relationships between each tall fescue morphotype through phylogenetic analysis using two low-copy nuclear genes (encoding plastid acetyl-CoA carboxylase [Acc1] and centroradialis [CEN]), the nuclear ribosomal DNA internal transcribed spacer (rDNA ITS) and the chloroplast DNA (cpDNA) genome-located matK gene. Other taxa within the closely related Lolium-Festuca species complex were also included in the study, to increase understanding of evolutionary processes in a taxonomic group characterised by multiple inter-specific hybridisation events. Results Putative homoeologous sequences from both nuclear genes were obtained from each polyploid species and compared to counterparts from 15 diploid taxa. Phylogenetic reconstruction confirmed F. pratensis and F. arundinacea var. glaucescens as probable progenitors to Continental tall fescue, and these species are also likely to be ancestral to the rhizomatous morphotype. However, these two morphotypes are sufficiently distinct to be located in separate clades based on the ITS-derived data set. All four of the generated data sets suggest independent evolution of the Mediterranean and Continental morphotypes, with minimal affinity between cognate sequence haplotypes. No obvious candidate progenitor species for Mediterranean tall fescues were identified, and only two putative sub-genome-specific haplotypes were identified for this morphotype. Conclusions This study describes the first phylogenetic analysis of

  14. Molecular phylogenetic reconstruction of the endemic Asian salamander family Hynobiidae (Amphibia, Caudata).

    PubMed

    Weisrock, David W; Macey, J Robert; Matsui, Masafumi; Mulcahy, Daniel G; Papenfuss, Theodore J

    2013-01-01

    The salamander family Hynobiidae contains over 50 species and has been the subject of a number of molecular phylogenetic investigations aimed at reconstructing branches across the entire family. In general, studies using the greatest amount of sequence data have used reduced taxon sampling, while the study with the greatest taxon sampling has used a limited sequence data set. Here, we provide insights into the phylogenetic history of the Hynobiidae using both dense taxon sampling and a large mitochondrial DNA sequence data set. We report exclusive new mitochondrial DNA data of 2566 aligned bases (with 151 excluded sites, of included sites 1157 are variable with 957 parsimony informative). This is sampled from two genic regions encoding a 12S-16S region (the 3' end of 12S rRNA, tRNA(VAI), and the 5' end of 16S rRNA), and a ND2-COI region (ND2, tRNA(Trp), tRNA(Ala), tRNA(Asn), the origin for light strand replication--O(L), tRNA(Cys), tRNAT(Tyr), and the 5' end of COI). Analyses using parsimony, Bayesian, and maximum likelihood optimality criteria produce similar phylogenetic trees, with discordant branches generally receiving low levels of branch support. Monophyly of the Hynobiidae is strongly supported across all analyses, as is the sister relationship and deep divergence between the genus Onychodactylus with all remaining hynobiids. Within this latter grouping our phylogenetic results identify six clades that are relatively divergent from one another, but for which there is minimal support for their phylogenetic placement. This includes the genus Batrachuperus, the genus Hynobius, the genus Pachyhynobius, the genus Salamandrella, a clade containing the genera Ranodon and Paradactylodon, and a clade containing the genera Liua and Pseudohynobius. This latter clade receives low bootstrap support in the parsimony analysis, but is consistent across all three analytical methods. Our results also clarify a number of well-supported relationships within the larger

  15. Classification of Phylogenetic Profiles for Protein Function Prediction: An SVM Approach

    NASA Astrophysics Data System (ADS)

    Kotaru, Appala Raju; Joshi, Ramesh C.

    Predicting the function of an uncharacterized protein is a major challenge in post-genomic era due to problems complexity and scale. Having knowledge of protein function is a crucial link in the development of new drugs, better crops, and even the development of biochemicals such as biofuels. Recently numerous high-throughput experimental procedures have been invented to investigate the mechanisms leading to the accomplishment of a protein’s function and Phylogenetic profile is one of them. Phylogenetic profile is a way of representing a protein which encodes evolutionary history of proteins. In this paper we proposed a method for classification of phylogenetic profiles using supervised machine learning method, support vector machine classification along with radial basis function as kernel for identifying functionally linked proteins. We experimentally evaluated the performance of the classifier with the linear kernel, polynomial kernel and compared the results with the existing tree kernel. In our study we have used proteins of the budding yeast saccharomyces cerevisiae genome. We generated the phylogenetic profiles of 2465 yeast genes and for our study we used the functional annotations that are available in the MIPS database. Our experiments show that the performance of the radial basis kernel is similar to polynomial kernel is some functional classes together are better than linear, tree kernel and over all radial basis kernel outperformed the polynomial kernel, linear kernel and tree kernel. In analyzing these results we show that it will be feasible to make use of SVM classifier with radial basis function as kernel to predict the gene functionality using phylogenetic profiles.

  16. Is multiple-sequence alignment required for accurate inference of phylogeny?

    PubMed

    Höhl, Michael; Ragan, Mark A

    2007-04-01

    The process of inferring phylogenetic trees from molecular sequences almost always starts with a multiple alignment of these sequences but can also be based on methods that do not involve multiple sequence alignment. Very little is known about the accuracy with which such alignment-free methods recover the correct phylogeny or about the potential for increasing their accuracy. We conducted a large-scale comparison of ten alignment-free methods, among them one new approach that does not calculate distances and a faster variant of our pattern-based approach; all distance-based alignment-free methods are freely available from http://www.bioinformatics.org.au (as Python package decaf+py). We show that most methods exhibit a higher overall reconstruction accuracy in the presence of high among-site rate variation. Under all conditions that we considered, variants of the pattern-based approach were significantly better than the other alignment-free methods. The new pattern-based variant achieved a speed-up of an order of magnitude in the distance calculation step, accompanied by a small loss of tree reconstruction accuracy. A method of Bayesian inference from k-mers did not improve on classical alignment-free (and distance-based) methods but may still offer other advantages due to its Bayesian nature. We found the optimal word length k of word-based methods to be stable across various data sets, and we provide parameter ranges for two different alphabets. The influence of these alphabets was analyzed to reveal a trade-off in reconstruction accuracy between long and short branches. We have mapped the phylogenetic accuracy for many alignment-free methods, among them several recently introduced ones, and increased our understanding of their behavior in response to biologically important parameters. In all experiments, the pattern-based approach emerged as superior, at the expense of higher resource consumption. Nonetheless, no alignment-free method that we examined recovers

  17. Phylogenetic impoverishment of plant communities following chronic human disturbances in the Brazilian Caatinga.

    PubMed

    Ribeiro, Elâine M S; Santos, Bráulio A; Arroyo-Rodríguez, Víctor; Tabarelli, Marcelo; Souza, Gustavo; Leal, Inara R

    2016-06-01

    Chronic disturbances, such as selective logging, firewood extraction and extensive grazing, may lead to the taxonomic and phylogenetic impoverishment of remaining old-growth forest communities worldwide; however, the empirical evidence on this topic is limited. We tested this hypothesis in the Caatinga vegetation--a seasonally dry tropical forest restricted to northeast Brazil. We sampled 11,653 individuals (adults, saplings, and seedlings) from 51 species in 29 plots distributed along a gradient of chronic disturbance. The gradient was assessed using a chronic disturbance index (CDI) based on five recognized indicators of chronic disturbances: proximity to urban center, houses and roads and the density of both people and livestock. We used linear models to test if mean effective number of lineages, mean phylogenetic distance and phylogenetic dispersion decreased with CDI and if such relationships differed among ontogenetic stages. As expected, the mean effective number of lineages and the mean phylogenetic distance were negatively related to CDI, and such diversity losses occurred irrespective of ontogeny. Yet the increase in phylogenetic clustering in more disturbed plots was only evident in seedlings and saplings, mostly because clades with more descendent taxa than expected by chance (e.g., Euphorbiaceae) thrived in more disturbed plots. This novel study indicates that chronic human disturbances are promoting the phylogenetic impoverishment of the irreplaceable woody flora of the Brazilian Caatinga forest. The highest impoverishment was observed in seedlings and saplings, indicating that if current chronic disturbances remain, they will result in increasingly poorer phylogenetically forests. This loss of evolutionary history will potentially limit the capacity of this ecosystem to respond to human disturbances (i.e., lower ecological resilience) and particularly their ability to adapt to rapid climatic changes in the region.

  18. Phylogenetic patterns of climatic, habitat and trophic niches in a European avian assemblage

    PubMed Central

    Pearman, Peter B; Lavergne, Sébastien; Roquet, Cristina; Wüest, Rafael; Zimmermann, Niklaus E; Thuiller, Wilfried

    2014-01-01

    Aim The origins of ecological diversity in continental species assemblages have long intrigued biogeographers. We apply phylogenetic comparative analyses to disentangle the evolutionary patterns of ecological niches in an assemblage of European birds. We compare phylogenetic patterns in trophic, habitat and climatic niche components. Location Europe. Methods From polygon range maps and handbook data we inferred the realized climatic, habitat and trophic niches of 405 species of breeding birds in Europe. We fitted Pagel's lambda and kappa statistics, and conducted analyses of disparity through time to compare temporal patterns of ecological diversification on all niche axes together. All observed patterns were compared with expectations based on neutral (Brownian) models of niche divergence. Results In this assemblage, patterns of phylogenetic signal (lambda) suggest that related species resemble each other less in regard to their climatic and habitat niches than they do in their trophic niche. Kappa estimates show that ecological divergence does not gradually increase with divergence time, and that this punctualism is stronger in climatic niches than in habitat and trophic niches. Observed niche disparity markedly exceeds levels expected from a Brownian model of ecological diversification, thus providing no evidence for past phylogenetic niche conservatism in these multivariate niches. Levels of multivariate disparity are greatest for the climatic niche, followed by disparity of the habitat and the trophic niches. Main conclusions Phylogenetic patterns in the three niche components differ within this avian assemblage. Variation in evolutionary rates (degree of gradualism, constancy through the tree) and/or non-random macroecological sampling probably lead here to differences in the phylogenetic structure of niche components. Testing hypotheses on the origin of these patterns requires more complete phylogenetic trees of the birds, and extended ecological data on

  19. Phylogenetic diversity of plants alters the effect of species richness on invertebrate herbivory

    PubMed Central

    2013-01-01

    Long-standing ecological theory proposes that diverse communities of plants should experience a decrease in herbivory. Yet previous empirical examinations of this hypothesis have revealed that plant species richness increases herbivory in just as many systems as it decreases it. In this study, I ask whether more insight into the role of plant diversity in promoting or suppressing herbivory can be gained by incorporating information about the evolutionary history of species in a community. In an old field system in southern Ontario, I surveyed communities of plants and measured levels of leaf damage on 27 species in 38 plots. I calculated a measure of phylogenetic diversity (PSE) that encapsulates information about the amount of evolutionary history represented in each of the plots and looked for a relationship between levels of herbivory and both species richness and phylogenetic diversity using a generalized linear mixed model (GLMM) that could account for variation in herbivory levels between species. I found that species richness was positively associated with herbivore damage at the plot-level, in keeping with the results from several other recent studies on this question. On the other hand, phylogenetic diversity was associated with decreased herbivory. Importantly, there was also an interaction between species richness and phylogenetic diversity, such that plots with the highest levels of herbivory were plots which had many species but only if those species tended to be closely related to one another. I propose that these results are the consequence of interactions with herbivores whose diets are phylogenetically specialized (for which I introduce the term cladophage), and how phylogenetic diversity may alter their realized host ranges. These results suggest that incorporating a phylogenetic perspective can add valuable additional insight into the role of plant diversity in explaining or predicting levels of herbivory at a whole-community scale. PMID:23825795

  20. Identifiability of tree-child phylogenetic networks under a probabilistic recombination-mutation model of evolution.

    PubMed

    Francis, Andrew; Moulton, Vincent

    2018-06-07

    Phylogenetic networks are an extension of phylogenetic trees which are used to represent evolutionary histories in which reticulation events (such as recombination and hybridization) have occurred. A central question for such networks is that of identifiability, which essentially asks under what circumstances can we reliably identify the phylogenetic network that gave rise to the observed data? Recently, identifiability results have appeared for networks relative to a model of sequence evolution that generalizes the standard Markov models used for phylogenetic trees. However, these results are quite limited in terms of the complexity of the networks that are considered. In this paper, by introducing an alternative probabilistic model for evolution along a network that is based on some ground-breaking work by Thatte for pedigrees, we are able to obtain an identifiability result for a much larger class of phylogenetic networks (essentially the class of so-called tree-child networks). To prove our main theorem, we derive some new results for identifying tree-child networks combinatorially, and then adapt some techniques developed by Thatte for pedigrees to show that our combinatorial results imply identifiability in the probabilistic setting. We hope that the introduction of our new model for networks could lead to new approaches to reliably construct phylogenetic networks. Copyright © 2018 Elsevier Ltd. All rights reserved.