Brassac, Jonathan; Blattner, Frank R
2015-09-01
Polyploidization is an important speciation mechanism in the barley genus Hordeum. To analyze evolutionary changes after allopolyploidization, knowledge of parental relationships is essential. One chloroplast and 12 nuclear single-copy loci were amplified by polymerase chain reaction (PCR) in all Hordeum plus six out-group species. Amplicons from each of 96 individuals were pooled, sheared, labeled with individual-specific barcodes and sequenced in a single run on a 454 platform. Reference sequences were obtained by cloning and Sanger sequencing of all loci for nine supplementary individuals. The 454 reads were assembled into contigs representing the 13 loci and, for polyploids, also homoeologues. Phylogenetic analyses were conducted for all loci separately and for a concatenated data matrix of all loci. For diploid taxa, a Bayesian concordance analysis was performed and a coalescent-based dated species tree was inferred from all gene trees. Chloroplast matK was used to determine the maternal parent in allopolyploid taxa. The relative performance of different multilocus analyses in the presence of incomplete lineage sorting and hybridization was also assessed. The resulting multilocus phylogeny reveals for the first time the species phylogeny and progenitor-derivative relationships of all di- and polyploid Hordeum taxa within a single analysis. Our study proves that it is possible to obtain a multilocus species-level phylogeny for di- and polyploid taxa by combining PCR with next-generation sequencing, without cloning and without creating a heavy load of sequence data. © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
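The barcoding step in the abstract above (pooled amplicons tagged per individual, then sequenced in one run) implies that reads must be sorted back to individuals before assembly. The following is a minimal sketch of that idea, not the authors' pipeline. It assumes that each read starts with its barcode, that all barcodes have the same length, and that matching is exact; the function name `demultiplex` and the example barcodes are hypothetical.

```python
def demultiplex(reads, barcodes):
    """Sort reads into per-individual bins by an exact barcode prefix.

    reads    -- iterable of read sequences (strings)
    barcodes -- dict mapping barcode sequence -> individual identifier
    Assumption: all barcodes share one length and sit at the read start.
    Returns (bins, unassigned), where bins maps individual -> trimmed reads.
    """
    lengths = {len(b) for b in barcodes}
    if len(lengths) != 1:
        raise ValueError("this sketch expects barcodes of a single length")
    n = lengths.pop()

    bins = {individual: [] for individual in barcodes.values()}
    unassigned = []
    for read in reads:
        individual = barcodes.get(read[:n])
        if individual is None:
            unassigned.append(read)              # no exact barcode match
        else:
            bins[individual].append(read[n:])    # strip the barcode
    return bins, unassigned


# Hypothetical barcodes and reads, for illustration only.
example_barcodes = {"ACGT": "ind1", "TGCA": "ind2"}
example_reads = ["ACGTGGGG", "TGCAAAAA", "CCCCTTTT"]
example_bins, example_unassigned = demultiplex(example_reads, example_barcodes)
```

Production demultiplexers usually tolerate sequencing errors in the barcode; exact matching is used here only to keep the sketch short.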
Naushad, Sohail; Barkema, Herman W.; Luby, Christopher; Condas, Larissa A. Z.; Nobrega, Diego B.; Carson, Domonique A.; De Buck, Jeroen
2016-01-01
Non-aureus staphylococci (NAS), a heterogeneous group of a large number of species and subspecies, are the most frequently isolated pathogens from intramammary infections in dairy cattle. Phylogenetic relationships among bovine NAS species are controversial and have mostly been determined based on single-gene trees. Herein, we analyzed phylogeny of bovine NAS species using whole-genome sequencing (WGS) of 441 distinct isolates. In addition, evolutionary relationships among bovine NAS were estimated from multilocus data of 16S rRNA, hsp60, rpoB, sodA, and tuf genes and sequences from these and numerous other single genes/proteins. All phylogenies were created with FastTree, Maximum-Likelihood, Maximum-Parsimony, and Neighbor-Joining methods. Regardless of methodology, WGS-trees clearly separated bovine NAS species into five monophyletic coherent clades. Furthermore, there were consistent interspecies relationships within clades in all WGS phylogenetic reconstructions. Except for the Maximum-Parsimony tree, multilocus data analysis similarly produced five clades. There were large variations in determining clades and interspecies relationships in single gene/protein trees, under different methods of tree constructions, highlighting limitations of using single genes for determining bovine NAS phylogeny. However, based on WGS data, we established a robust phylogeny of bovine NAS species, unaffected by method or model of evolutionary reconstructions. Therefore, it is now possible to determine associations between phylogeny and many biological traits, such as virulence, antimicrobial resistance, environmental niche, geographical distribution, and host specificity. PMID:28066335
Diamant, Eran; Palti, Yniv; Gur-Arie, Riva; Cohen, Helit; Hallerman, Eric M; Kashi, Yechezkel
2004-04-01
Multilocus sequencing of housekeeping genes has been used previously for bacterial strain typing and for inferring evolutionary relationships among strains of Escherichia coli. In this study, we used shorter intergenic sequences that contained simple sequence repeats (SSRs) of repeating mononucleotide motifs (mononucleotide repeats [MNRs]) to infer the phylogeny of pathogenic and commensal E. coli strains. Seven noncoding loci (four MNRs and three non-SSRs) were sequenced in 27 strains, including enterohemorrhagic (six isolates of O157:H7), enteropathogenic, enterotoxigenic, B, and K-12 strains. The four MNRs were also sequenced in 20 representative strains of the E. coli reference (ECOR) collection. Sequence polymorphism was significantly higher at the MNR loci, including the flanking sequences, indicating a higher mutation rate in the sequences flanking the MNR tracts. The four MNR loci were amplifiable by PCR in the standard ECOR A, B1, and D groups, but only one (yaiN) in the B2 group was amplified, which is consistent with previous studies that suggested that B2 is the most ancient group. High sequence compatibility was found between the four MNR loci, indicating that they are in the same clonal frame. The phylogenetic trees that were constructed from the sequence data were in good agreement with those of previous studies that used multilocus enzyme electrophoresis. The results demonstrate that MNR loci are useful for inferring phylogenetic relationships and provide much higher sequence variation than housekeeping genes. Therefore, the use of MNR loci for multilocus sequence typing should prove efficient for clinical diagnostics, epidemiology, and evolutionary study of bacteria. PMID:15066845
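The mononucleotide repeats (MNRs) described above are runs of a single base. A short sketch of how such tracts could be located in a sequence follows. It is an illustration only: the minimum tract length of 6 is an assumed threshold that the abstract does not state, and the function name is hypothetical.

```python
import re


def find_mononucleotide_repeats(seq, min_len=6):
    """Return (start, base, length) for every run of one base >= min_len.

    min_len is an assumed threshold, not a value taken from the study.
    Positions are 0-based.
    """
    if min_len < 2:
        raise ValueError("min_len must be at least 2")
    # One captured base followed by at least (min_len - 1) copies of it.
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_len - 1))
    return [(m.start(), m.group(1), len(m.group(0)))
            for m in pattern.finditer(seq.upper())]


# Hypothetical sequence: an 8-base T tract and a 6-base A tract.
example_seq = "ACG" + "T" * 8 + "ACG" + "A" * 6 + "C"
example_tracts = find_mononucleotide_repeats(example_seq)
```

Comparing tract lengths and flanking sequences across strains at the same locus would then give the kind of polymorphism data the abstract refers to.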
Aspergillus section Versicolores: nine new species and multilocus DNA sequence based phylogeny
USDA-ARS's Scientific Manuscript database
β-tubulin, calmodulin, internal transcribed spacer and partial lsu-rDNA, RNA polymerase, DNA replication licensing factor Mcm7, and pre-rRNA processing protein Tsr1 were amplified and sequenced from 62 A. versicolor clade isolates and analyzed phylogenetically using the concordance model to establis...
Optimization of Multilocus Sequence Analysis for Identification of Species in the Genus Vibrio
Gabriel, Michael W.; Matsui, George Y.; Friedman, Robert
2014-01-01
Multilocus sequence analysis (MLSA) is an important method for identification of taxa that are not well differentiated by 16S rRNA gene sequences alone. In this procedure, concatenated sequences of selected genes are constructed and then analyzed. The effects that the number and the order of genes used in MLSA have on reconstruction of phylogenetic relationships were examined. The recA, rpoA, gapA, 16S rRNA gene, gyrB, and ftsZ sequences from 56 species of the genus Vibrio were used to construct molecular phylogenies, and these were evaluated individually and using various gene combinations. Phylogenies from two-gene sequences employing recA and rpoA in both possible gene orders were different. The addition of the gapA gene sequence, producing all six possible concatenated sequences, reduced the differences in phylogenies to degrees of statistical (bootstrap) support for some nodes. The overall statistical support for the phylogenetic tree, assayed on the basis of a reliability score (calculated from the number of nodes having bootstrap values of ≥80 divided by the total number of nodes) increased with increasing numbers of genes used, up to a maximum of four. No further improvement was observed from addition of the fifth gene sequence (ftsZ), and addition of the sixth gene (gyrB) resulted in lower proportions of strongly supported nodes. Reductions in the numbers of strongly supported nodes were also observed when maximum parsimony was employed for tree construction. Use of a small number of gene sequences in MLSA resulted in accurate identification of Vibrio species. PMID:24951781
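The reliability score defined in the abstract above (nodes with bootstrap values of at least 80, divided by the total number of nodes) is simple to compute once the bootstrap values have been read from a tree. A minimal sketch, assuming the bootstrap values are already available as a list of percentages; the function name and the example values are hypothetical.

```python
def reliability_score(bootstrap_values, threshold=80):
    """Fraction of nodes whose bootstrap support is >= threshold.

    bootstrap_values -- one support value (in percent) per internal node
    threshold        -- 80, as stated in the abstract
    """
    values = list(bootstrap_values)
    if not values:
        raise ValueError("at least one node is required")
    strong = sum(1 for value in values if value >= threshold)
    return strong / len(values)


# Hypothetical support values for a five-node tree: three nodes reach 80.
example_score = reliability_score([100, 95, 80, 79, 50])
```

Computing this score for trees built from one, two, three and more concatenated genes would reproduce the kind of comparison the abstract reports.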
Resolving the Mortierellaceae phylogeny through Multi-Locus Sequence Typing (MLST) and phylogenomics
USDA-ARS's Scientific Manuscript database
The Mortierellaceae (Mortierellomycotina) are a diverse family of fungi that are of evolutionary and ecological relevance. They are the closest lineage to the arbuscular mycorrhizae (Glomeromycotina) and include some of the first species to evolve fruiting body production. The Mortierellaceae are es...
Boité, Mariana C.; Mauricio, Isabel L.; Miles, Michael A.; Cupolillo, Elisa
2012-01-01
The Leishmania genus comprises up to 35 species, some with status still under discussion. The multilocus sequence typing (MLST)—extensively used for bacteria—has been proposed for pathogenic trypanosomatids. For Leishmania, however, a detailed analysis and revision on the taxonomy is still required. We have partially sequenced four housekeeping genes—glucose-6-phosphate dehydrogenase (G6PD), 6-phosphogluconate dehydrogenase (6PGD), mannose phosphate isomerase (MPI) and isocitrate dehydrogenase (ICD)—from 96 Leishmania (Viannia) strains and assessed their discriminatory typing capacity. The fragments had different degrees of diversity, and are thus suitable to be used in combination for intra- and inter-specific inferences. Species-specific single nucleotide polymorphisms were detected, but not for all species; ambiguous sites indicating heterozygosis were observed, as well as the putative homozygous donor. A large number of haplotypes were detected for each marker; for 6PGD a possible ancestral allele for L. (Viannia) was found. Maximum parsimony-based haplotype networks were built. Strains of different species, as identified by multilocus enzyme electrophoresis (MLEE), formed separated clusters in each network, with exceptions. NeighborNet of concatenated sequences confirmed species-specific clusters, suggesting recombination occurring in L. braziliensis and L. guyanensis. Phylogenetic analysis indicates L. lainsoni and L. naiffi as the most divergent species and does not support L. shawi as a distinct species, placing it in the L. guyanensis cluster. BURST analysis resulted in six clonal complexes (CC), corresponding to distinct species. The L. braziliensis strains evaluated correspond to one widely geographically distributed CC and another restricted to one endemic area. 
This study demonstrates the value of systematic multilocus sequence analysis (MLSA) for determining intra- and inter-species relationships and presents an approach to validate the species status of some entities. Furthermore, it contributes to the phylogeny of L. (Viannia) and might be helpful for epidemiological and population genetics analysis based on haplotype/diplotype determinations and inferences. PMID:23133690
Chassain, Benoît; Lemée, Ludovic; Didi, Jennifer; Thiberge, Jean-Michel; Brisse, Sylvain; Pons, Jean-Louis
2012-01-01
Staphylococcus lugdunensis is recognized as one of the major pathogenic species within the genus Staphylococcus, even though it belongs to the coagulase-negative group. A multilocus sequence typing (MLST) scheme was developed to study the genetic relationships and population structure of 87 S. lugdunensis isolates from various clinical and geographic sources by DNA sequence analysis of seven housekeeping genes (aroE, dat, ddl, gmk, ldh, recA, and yqiL). The number of alleles ranged from four (gmk and ldh) to nine (yqiL). Allelic profiles allowed the definition of 20 different sequence types (STs) and five clonal complexes. The 20 STs lacked correlation with geographic source. Isolates recovered from hematogenic infections (blood or osteoarticular isolates) or from skin and soft tissue infections did not cluster in separate lineages. Penicillin-resistant isolates clustered mainly in one clonal complex, unlike glycopeptide-tolerant isolates, which did not constitute a distinct subpopulation within S. lugdunensis. Phylogenies from the sequences of the seven individual housekeeping genes were congruent, indicating a predominantly mutational evolution of these genes. Quantitative analysis of the linkages between alleles from the seven loci revealed a significant linkage disequilibrium, thus confirming a clonal population structure for S. lugdunensis. This first MLST scheme for S. lugdunensis provides a new tool for investigating the macroepidemiology and phylogeny of this unusually virulent coagulase-negative Staphylococcus. PMID:22785196
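In an MLST scheme such as the one above, each isolate is reduced to an allelic profile (one allele number per locus), and each distinct profile defines a sequence type (ST). The sketch below illustrates that mapping with the seven loci named in the abstract. The ST numbering by order of first appearance and the example allele numbers are assumptions made for illustration; real schemes take ST numbers from a curated database.

```python
LOCI = ("aroE", "dat", "ddl", "gmk", "ldh", "recA", "yqiL")


def assign_sequence_types(profiles):
    """Map each isolate to a sequence type based on its allelic profile.

    profiles -- dict: isolate name -> dict of locus -> allele number
    Isolates with identical profiles receive the same ST. STs are numbered
    in order of first appearance (an assumption of this sketch).
    """
    st_of_profile = {}
    st_of_isolate = {}
    for isolate, alleles in profiles.items():
        key = tuple(alleles[locus] for locus in LOCI)
        if key not in st_of_profile:
            st_of_profile[key] = len(st_of_profile) + 1
        st_of_isolate[isolate] = st_of_profile[key]
    return st_of_isolate


# Hypothetical allele numbers: iso1 and iso3 share a profile.
example_profiles = {
    "iso1": dict(zip(LOCI, (1, 1, 1, 1, 1, 1, 1))),
    "iso2": dict(zip(LOCI, (1, 1, 1, 2, 1, 1, 3))),
    "iso3": dict(zip(LOCI, (1, 1, 1, 1, 1, 1, 1))),
}
example_sts = assign_sequence_types(example_profiles)
```

Grouping STs that differ at only one or two loci is the usual next step toward clonal complexes, which this sketch does not attempt.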
Xia, Rong; Durand, Jean-Dominique; Fu, Cuizhang
2016-03-01
The interrelationships among mugilids (Mugiliformes: Mugilidae) remain highly debated. Using a mitochondrial gene-based phylogeny as criterion, a revised classification with 25 genera in the Mugilidae has recently been proposed. However, phylogenetic relationships of major mitochondrial lineages remain unresolved and to gain a general acceptance the classification requires confirmation based on multilocus evidence and diagnostic morphological characters. Here, we construct a species-tree using twelve nuclear and three mitochondrial loci and infer the evolution of 71 morphological characters. Our multilocus phylogeny does not agree with previous morphology-based hypotheses for the relationships within Mugilidae, confirms the revised classification with 25 genera and further resolves their phylogenetic relationships. Using the well-resolved multilocus phylogeny as the criterion, we reclassify Mugilidae genera into three new subfamilies (Myxinae, Rhinomugilinae, and Cheloninae) and one new, recombined, subfamily (Mugilinae). The Rhinomugilinae subfamily is further divided into four tribes. The revised classification of Mugilidae is supported by morpho-anatomical synapomorphies or a combination of characters. These characters are used to erect a key to the subfamilies and genera. Copyright © 2015 Elsevier Inc. All rights reserved.
Vibrio chromosomes share common history.
Kirkup, Benjamin C; Chang, LeeAnn; Chang, Sarah; Gevers, Dirk; Polz, Martin F
2010-05-10
While most gamma proteobacteria have a single circular chromosome, Vibrionales have two circular chromosomes. Horizontal gene transfer is common among Vibrios, and in light of this genetic mobility, it is an open question to what extent the two chromosomes themselves share a common history since their formation. Single copy genes from each chromosome (142 genes from chromosome I and 42 genes from chromosome II) were identified from 19 sequenced Vibrionales genomes and their phylogenetic comparison suggests consistent phylogenies for each chromosome. Additionally, study of the gene organization and phylogeny of the respective origins of replication confirmed the shared history. Thus, while elements within the chromosomes may have experienced significant genetic mobility, the backbones share a common history. This allows conclusions based on multilocus sequence analysis (MLSA) for one chromosome to be applied equally to both chromosomes.
Population genetics, taxonomy, phylogeny and evolution of Borrelia burgdorferi sensu lato
Margos, Gabriele; Vollmer, Stephanie A.; Ogden, Nicholas H.; Fish, Durland
2011-01-01
In order to understand the population structure and dynamics of bacterial microorganisms, typing systems that accurately reflect the phylogenetic and evolutionary relationship of the agents are required. Over the past 15 years multilocus sequence typing schemes have replaced single locus approaches, giving novel insights into phylogenetic and evolutionary relationships of many bacterial species and facilitating taxonomy. Since 2004, several schemes using multiple loci have been developed to better understand the taxonomy, phylogeny and evolution of Lyme borreliosis spirochetes and in this paper we have reviewed and summarized the progress that has been made for this important group of vector-borne zoonotic bacteria. PMID:21843658
Salvi, Daniele; Macali, Armando; Mariottini, Paolo
2014-01-01
The bivalve family Ostreidae has a worldwide distribution and includes species of high economic importance. Phylogenetics and systematics of oysters based on morphology have proved difficult because of their high phenotypic plasticity. In this study we explore the phylogenetic information of the DNA sequence and secondary structure of the nuclear, fast-evolving, ITS2 rRNA and the mitochondrial 16S rRNA genes from the Ostreidae and we implemented a multi-locus framework based on four loci for oyster phylogenetics and systematics. Sequence-structure rRNA models aided sequence alignment and improved accuracy and nodal support of phylogenetic trees. In agreement with previous molecular studies, our phylogenetic results indicate that none of the currently recognized subfamilies, Crassostreinae, Ostreinae, and Lophinae, is monophyletic. Single gene trees based on Maximum likelihood (ML) and Bayesian (BA) methods and on sequence-structure ML were congruent with multilocus trees based on concatenated (ML and BA) and coalescent-based (BA) approaches and consistently supported three main clades: (i) Crassostrea, (ii) Saccostrea, and (iii) an Ostreinae-Lophinae lineage. Therefore, the subfamilies Crassostreinae (including Crassostrea), Saccostreinae subfam. nov. (including Saccostrea and tentatively Striostrea) and Ostreinae (including Ostreinae and Lophinae taxa) are recognized. Based on phylogenetic and biogeographical evidence the Asian species of Crassostrea from the Pacific Ocean are assigned to Magallana gen. nov., whereas an integrative taxonomic revision is required for the genera Ostrea and Dendostrea. This study pointed out the suitability of the ITS2 marker for DNA barcoding of oysters and the relevance of using sequence-structure rRNA models and features of the ITS2 folding in molecular phylogenetics and taxonomy. The multilocus approach allowed inferring a robust phylogeny of Ostreidae providing a broad molecular perspective on their systematics. PMID:25250663
STBase: One Million Species Trees for Comparative Biology
McMahon, Michelle M.; Deepak, Akshay; Fernández-Baca, David; Boss, Darren; Sanderson, Michael J.
2015-01-01
Comprehensively sampled phylogenetic trees provide the most compelling foundations for strong inferences in comparative evolutionary biology. Mismatches are common, however, between the taxa for which comparative data are available and the taxa sampled by published phylogenetic analyses. Moreover, many published phylogenies are gene trees, which cannot always be adapted immediately for species level comparisons because of discordance, gene duplication, and other confounding biological processes. A new database, STBase, lets comparative biologists quickly retrieve species level phylogenetic hypotheses in response to a query list of species names. The database consists of 1 million single- and multi-locus data sets, each with a confidence set of 1000 putative species trees, computed from GenBank sequence data for 413,000 eukaryotic taxa. Two bodies of theoretical work are leveraged to aid in the assembly of multi-locus concatenated data sets for species tree construction. First, multiply labeled gene trees are pruned to conflict-free singly-labeled species-level trees that can be combined between loci. Second, impacts of missing data in multi-locus data sets are ameliorated by assembling only decisive data sets. Data sets overlapping with the user’s query are ranked using a scheme that depends on user-provided weights for tree quality and for taxonomic overlap of the tree with the query. Retrieval times are independent of the size of the database, typically a few seconds. Tree quality is assessed by a real-time evaluation of bootstrap support on just the overlapping subtree. Associated sequence alignments, tree files and metadata can be downloaded for subsequent analysis. STBase provides a tool for comparative biologists interested in exploiting the most relevant sequence data available for the taxa of interest. 
It may also serve as a prototype for future species tree oriented databases and as a resource for assembly of larger species phylogenies from precomputed trees. PMID:25679219
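The abstract states that data sets are ranked by a scheme that depends on user-provided weights for tree quality and for taxonomic overlap with the query, but it does not give the formula. The sketch below uses a plain weighted sum as a stand-in. The linear combination, the definition of overlap as the fraction of query taxa present, the function name, and the example data are all assumptions and should not be read as STBase's actual scoring.

```python
def rank_data_sets(data_sets, query, w_quality=0.5, w_overlap=0.5):
    """Rank data sets by a weighted sum of tree quality and query overlap.

    data_sets -- iterable of (name, taxa, quality), with quality in [0, 1]
    query     -- iterable of species names requested by the user
    Assumed score: w_quality * quality + w_overlap * overlap, where
    overlap is the fraction of query taxa found in the data set.
    Returns names ordered from best to worst; ties break by name.
    """
    query = set(query)
    if not query:
        raise ValueError("query must contain at least one taxon")
    scored = []
    for name, taxa, quality in data_sets:
        overlap = len(query & set(taxa)) / len(query)
        scored.append((w_quality * quality + w_overlap * overlap, name))
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [name for _, name in scored]


# Hypothetical data sets: A has the better tree, B covers the whole query.
example_sets = [("A", {"x", "y"}, 0.9), ("B", {"x", "y", "z"}, 0.5)]
example_ranking = rank_data_sets(example_sets, ["x", "y", "z"])
```

Shifting all the weight to overlap (`w_quality=0`, `w_overlap=1`) puts data set B first in this example, which shows how user-provided weights change the ordering.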
Liu, Wenjun; Yu, Jie; Sun, Zhihong; Song, Yuqin; Wang, Xueni; Wang, Hongmei; Wuren, Tuoya; Zha, Musu; Menghe, Bilige; Heping, Zhang
2016-01-01
Lactobacillus delbrueckii ssp. bulgaricus (L. bulgaricus) is well known for its worldwide application in yogurt production. Flavor and acid production are considered the most important characteristics for starter culture screening. To our knowledge, this is the first study applying functional gene sequence multilocus sequence typing technology to predict the fermentation and flavor-producing characteristics of yogurt-producing bacteria. In the present study, phenotypic characteristics of 35 L. bulgaricus strains were quantified during the fermentation of milk to yogurt and during its subsequent storage; these included fermentation time, acidification rate, pH, titratable acidity, and flavor characteristics (acetaldehyde concentration). Furthermore, multilocus sequence typing analysis of 7 functional genes associated with fermentation time, acid production, and flavor formation was done to elucidate the phylogeny and genetic evolution of the same L. bulgaricus isolates. The results showed that strains differed significantly in fermentation time, acidification rate, and acetaldehyde production. Combining functional gene sequence analysis with phenotypic characteristics demonstrated that groups of strains established using genotype data were consistent with groups identified based on their phenotypic traits. This study has established an efficient and rapid molecular genotyping method to identify strains with good fermentation traits; this has the potential to replace time-consuming conventional methods based on direct measurement of phenotypic traits. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Zhou, Li-Wei; Cao, Yun; Wu, Sheng-Hua; Vlasák, Josef; Li, De-Wei; Li, Meng-Jie; Dai, Yu-Cheng
2015-06-01
Species of the Ganoderma lucidum complex are used in many types of health products. However, the taxonomy of this complex has long been chaotic, thus limiting its uses. In the present study, 32 collections of the complex from Asia, Europe and North America were analyzed from both morphological and molecular phylogenetic perspectives. The combined dataset, including an outgroup, comprised 33 ITS, 24 tef1α, 24 rpb1 and 21 rpb2 sequences, of which 19 ITS, 20 tef1α, 20 rpb1 and 17 rpb2 sequences were newly generated. A total of 13 species of the complex were recovered in the multilocus phylogeny. These 13 species were not strongly supported as a single monophyletic lineage, and were further grouped into three lineages that cannot be defined by their geographic distributions. Clade A comprised Ganoderma curtisii, Ganoderma flexipes, Ganoderma lingzhi, Ganoderma multipileum, Ganoderma resinaceum, Ganoderma sessile, Ganoderma sichuanense and Ganoderma tropicum; Clade B comprised G. lucidum, Ganoderma oregonense and Ganoderma tsugae; and Clade C comprised Ganoderma boninense and Ganoderma zonatum. A dichotomous key to the 13 species is provided, and their key morphological characters from context, pores, cuticle cells and basidiospores are presented in a table. The taxonomic positions of these species are briefly discussed. Notably, the epitypification of G. sichuanense is rejected. Copyright © 2014 Elsevier Ltd. All rights reserved.
Nørskov-Lauritsen, Niels; Overballe, Merete D.; Kilian, Mogens
2009-01-01
To obtain more information on the much-debated definition of prokaryotic species, we investigated the borders of Haemophilus influenzae by comparative analysis of H. influenzae reference strains with closely related bacteria including strains assigned to Haemophilus haemolyticus, cryptic genospecies biotype IV, and the never formally validated species “Haemophilus intermedius”. Multilocus sequence phylogeny based on six housekeeping genes separated a cluster encompassing the type and the reference strains of H. influenzae from 31 more distantly related strains. Comparison of 16S rRNA gene sequences supported this delineation but was obscured by a conspicuously high number of polymorphic sites in many of the strains that did not belong to the core group of H. influenzae strains. The division was corroborated by the differential presence of genes encoding H. influenzae adhesion and penetration protein, fuculokinase, and Cu,Zn-superoxide dismutase, whereas immunoglobulin A1 protease activity or the presence of the iga gene was of limited discriminatory value. The existence of porphyrin-synthesizing strains (“H. intermedius”) closely related to H. influenzae was confirmed. Several chromosomally encoded hemin biosynthesis genes were identified, and sequence analysis showed these genes to represent an ancestral genotype rather than recent transfers from, e.g., Haemophilus parainfluenzae. Strains previously assigned to H. haemolyticus formed several separate lineages within a distinct but deeply branching cluster, intermingled with strains of “H. intermedius” and cryptic genospecies biotype IV. Although H. influenzae is phenotypically more homogenous than some other Haemophilus species, the genetic diversity and multicluster structure of strains traditionally associated with H. influenzae make it difficult to define the natural borders of that species. PMID:19060144
Singh, Reema; Schilde, Christina; Schaap, Pauline
2016-11-17
Dictyostelia are a well-studied group of organisms with colonial multicellularity, which are members of the mostly unicellular Amoebozoa. A phylogeny based on SSU rDNA data subdivided all Dictyostelia into four major groups, but left the position of the root and of six group-intermediate taxa unresolved. Recent phylogenies inferred from 30 or 213 proteins from sequenced genomes, positioned the root between two branches, each containing two major groups, but lacked data to position the group-intermediate taxa. Since the positions of these early diverging taxa are crucial for understanding the evolution of phenotypic complexity in Dictyostelia, we sequenced six representative genomes of early diverging taxa. We retrieved orthologs of 47 housekeeping proteins with an average size of 890 amino acids from six newly sequenced and eight published genomes of Dictyostelia and unicellular Amoebozoa and inferred phylogenies from single and concatenated protein sequence alignments. Concatenated alignments of all 47 proteins, and four out of five subsets of nine concatenated proteins all produced the same consensus phylogeny with 100% statistical support. Trees inferred from just two out of the 47 proteins, individually reproduced the consensus phylogeny, highlighting that single gene phylogenies will rarely reflect correct species relationships. However, sets of two or three concatenated proteins again reproduced the consensus phylogeny, indicating that a small selection of genes suffices for low cost classification of as yet unincorporated or newly discovered dictyostelid and amoebozoan taxa by gene amplification. The multi-locus consensus phylogeny shows that groups 1 and 2 are sister clades in branch I, with the group-intermediate taxon D. polycarpum positioned as outgroup to group 2. 
Branch II consists of groups 3 and 4, with the group-intermediate taxon Polysphondylium violaceum positioned as sister to group 4, and the group-intermediate taxon Dictyostelium polycephalum branching at the base of that whole clade. Given the data, the approximately unbiased test rejects all alternative topologies favoured by SSU rDNA and individual proteins with high statistical support. The test also rejects monophyletic origins for the genera Acytostelium, Polysphondylium and Dictyostelium. The current position of Acytostelium ellipticum in the consensus phylogeny indicates that somatic cells were lost twice in Dictyostelia.
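The concatenation step described in this abstract (joining single-protein alignments into one matrix before tree inference) can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the function name `concatenate_alignments`, the input layout, and the choice to pad missing taxa with gap characters are all assumptions made for the example.

```python
def concatenate_alignments(alignments, missing="-"):
    """Join per-locus alignments into one supermatrix.

    alignments: dict mapping locus name -> dict mapping taxon -> aligned sequence.
    Taxa absent from a locus are padded with the `missing` character (an
    assumption of this sketch; other tools may use '?' or drop the taxon).
    Returns (supermatrix dict, list of (locus, start, end) 1-based partitions).
    """
    taxa = sorted({t for aln in alignments.values() for t in aln})
    pieces = {t: [] for t in taxa}
    partitions = []
    offset = 0
    for locus in sorted(alignments):
        aln = alignments[locus]
        lengths = {len(seq) for seq in aln.values()}
        if len(lengths) != 1:
            raise ValueError(f"locus {locus!r} is empty or not aligned")
        length = lengths.pop()
        for t in taxa:
            # Use the real sequence when present, otherwise a block of gaps.
            pieces[t].append(aln.get(t, missing * length))
        partitions.append((locus, offset + 1, offset + length))
        offset += length
    return {t: "".join(parts) for t, parts in pieces.items()}, partitions


# Hypothetical toy data: two proteins, three taxa, one taxon missing per locus.
toy = {
    "protA": {"sp1": "MKV", "sp2": "MRV"},
    "protB": {"sp1": "LL-G", "sp3": "LIAG"},
}
matrix, parts = concatenate_alignments(toy)
```

The partition list records which columns came from which locus, which is what a partitioned analysis or a subset test (such as the nine-protein subsets mentioned above) would need.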
Adherent and Invasive Escherichia coli Is Associated with Granulomatous Colitis in Boxer Dogs
Simpson, Kenneth W.; Dogan, Belgin; Rishniw, Mark; Goldstein, Richard E.; Klaessig, Suzanne; McDonough, Patrick L.; German, Alex J.; Yates, Robin M.; Russell, David G.; Johnson, Susan E.; Berg, Douglas E.; Harel, Josee; Bruant, Guillaume; McDonough, Sean P.; Schukken, Ynte H.
2006-01-01
The mucosa-associated microflora is increasingly considered to play a pivotal role in the pathogenesis of inflammatory bowel disease. This study explored the possibility that an abnormal mucosal flora is involved in the etiopathogenesis of granulomatous colitis of Boxer dogs (GCB). Colonic biopsy samples from affected dogs (n = 13) and controls (n = 38) were examined by fluorescent in situ hybridization (FISH) with a eubacterial 16S rRNA probe. Culture, 16S ribosomal DNA sequencing, and histochemistry were used to guide subsequent FISH. GCB-associated Escherichia coli isolates were evaluated for their ability to invade and persist in cultured epithelial cells and macrophages as well as for serotype, phylogenetic group, genome size, overall genotype, and presence of virulence genes. Intramucosal gram-negative coccobacilli were present in 100% of GCB samples but not controls. Invasive bacteria hybridized with FISH probes to E. coli. Three of four GCB-associated E. coli isolates adhered to, invaded, and replicated within cultured epithelial cells. Invasion triggered a “splash”-type response, was decreased by cytochalasin D, genistein, colchicine, and wortmannin, and paralleled the behavior of the Crohn's disease-associated strain E. coli LF 82. GCB E. coli and LF 82 were diverse in serotype and overall genotype but similar in phylogeny (B2 and D), in virulence gene profiles (fyuA, irp1, irp2, chuA, fepC, ibeA, kpsMII, iss), in having a larger genome size than commensal E. coli, and in the presence of novel multilocus sequence types. We conclude that GCB is associated with selective intramucosal colonization by E. coli. E. coli strains associated with GCB and Crohn's disease have an adherent and invasive phenotype and novel multilocus sequence types and resemble E. coli associated with extraintestinal disease in phylogeny and virulence gene profile. PMID:16861666
Ali, Habib; Muhammad, Abrar; Hou, Youming
2018-05-28
The intracellular bacterium Wolbachia pipientis is widespread in arthropods. Recently, possibilities of novel Wolbachia-mediated hosts, their distribution, and natural rate have been anticipated, and the coconut leaf beetle Brontispa longissima (Gestro) (Coleoptera: Chrysomelidae), which has garnered attention as a serious pest of palms, was subjected to this interrogation. By adopting Wolbachia surface protein (wsp) and multilocus sequence type (MLST) genotypic systems, we determined the Wolbachia infection density within host developmental stages, body parts, and tissues, and the results revealed that all the tested samples of B. longissima were infected with the same Wolbachia strain (wLog), suggesting complete vertical transmission. The MLST profile elucidated two new alleles (ftsZ-234 and coxA-266) that define a new sequence type (ST-483), which indicates the particular genotypic association of B. longissima and Wolbachia. The quantitative real-time PCR analysis revealed a higher infection density in the eggs and adult stage, followed by the abdomen and reproductive tissues, respectively. However, no significant differences were observed in the infection density between sexes. Moreover, the wsp and concatenated MLST alignment analysis of this study with other known Wolbachia-mediated arthropods revealed similar clustering with distinct monophyletic supergroup B. This is the first comprehensive report on the prevalence, infection dynamics, and phylogeny of the Wolbachia endosymbiont in B. longissima, which demonstrated that Wolbachia is ubiquitous across all developmental stages and distributed in the entire body of B. longissima. Understanding the Wolbachia infection dynamics would provide useful insight to build a framework for future investigations, understand its impacts on host physiology, and exploit it as a potential biocontrol agent.
Berenger, Byron M; Berry, Chrystal; Peterson, Trevor; Fach, Patrick; Delannoy, Sabine; Li, Vincent; Tschetter, Lorelee; Nadon, Celine; Honish, Lance; Louie, Marie; Chui, Linda
2015-01-01
A standardised method for determining Escherichia coli O157:H7 strain relatedness using whole genome sequencing or virulence gene profiling is not yet established. We sought to assess the capacity of either high-throughput polymerase chain reaction (PCR) of 49 virulence genes, core-genome single nucleotide variants (SNVs) or k-mer clustering to discriminate between outbreak-associated and sporadic E. coli O157:H7 isolates. Three outbreaks and multiple sporadic isolates from the province of Alberta, Canada were included in the study. Two of the outbreaks occurred concurrently in 2014 and one occurred in 2012. Pulsed-field gel electrophoresis (PFGE) and multilocus variable-number tandem repeat analysis (MLVA) were employed as comparator typing methods. The virulence gene profiles of isolates from the 2012 and 2014 Alberta outbreak events and contemporary sporadic isolates were mostly identical; therefore the set of virulence genes chosen in this study was not discriminatory enough to distinguish between outbreak clusters. Concordant with PFGE and MLVA results, core genome SNV and k-mer phylogenies clustered isolates from the 2012 and 2014 outbreaks as distinct events. k-mer phylogenies demonstrated increased discriminatory power compared with core SNV phylogenies. Prior to the widespread implementation of whole genome sequencing for routine public health use, issues surrounding cost, technical expertise, software standardisation, and data sharing/comparisons must be addressed.
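To make the idea of k-mer-based comparison concrete, the sketch below computes a Jaccard distance between the k-mer sets of two sequences. The abstract does not say which k-mer method or software the study used, so the distance measure, the default `k=4`, and the function names here are illustrative assumptions only.

```python
def kmers(seq, k):
    """Return the set of all overlapping substrings of length k."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard_distance(a, b, k=4):
    """1 - |shared k-mers| / |all k-mers| for two sequences.

    Illustrative choice of distance; real genome-scale tools use much
    larger k and sketching or counting structures instead of plain sets.
    """
    ka, kb = kmers(a, k), kmers(b, k)
    union = ka | kb
    if not union:
        raise ValueError("both sequences are shorter than k")
    return 1.0 - len(ka & kb) / len(union)


def distance_matrix(seqs, k=4):
    """Pairwise distances for a dict of name -> sequence."""
    names = sorted(seqs)
    return {(x, y): jaccard_distance(seqs[x], seqs[y], k) for x in names for y in names}
```

A matrix like this could then be passed to any distance-based clustering or tree-building method; that downstream step is left out here.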
Facey, Paul D.; Méric, Guillaume; Hitchings, Matthew D.; Pachebat, Justin A.; Hegarty, Matt J.; Chen, Xiaorui; Morgan, Laura V.A.; Hoeppner, James E.; Whitten, Miranda M.A.; Kirk, William D.J.; Dyson, Paul J.; Sheppard, Sam K.; Del Sol, Ricardo
2015-01-01
Obligate bacterial symbionts are widespread in many invertebrates, where they are often confined to specialized host cells and are transmitted directly from mother to progeny. Increasing numbers of these bacteria are being characterized but questions remain about their population structure and evolution. Here we take a comparative genomics approach to investigate two prominent bacterial symbionts (BFo1 and BFo2) isolated from geographically separated populations of western flower thrips, Frankliniella occidentalis. Our multifaceted approach to classifying these symbionts includes concatenated multilocus sequence analysis (MLSA) phylogenies, ribosomal multilocus sequence typing (rMLST), construction of whole-genome phylogenies, and in-depth genomic comparisons. We showed that the BFo1 genome clusters more closely with species in the genus Erwinia, and is a putative close relative of Erwinia aphidicola. BFo1 is also likely to have shared a common ancestor with Erwinia pyrifoliae/Erwinia amylovora and the nonpathogenic Erwinia tasmaniensis and has genetic traits similar to Erwinia billingiae. The BFo1 genome contained virulence factors found in the genus Erwinia but represented a divergent lineage. In contrast, we showed that BFo2 belongs within the Enterobacteriales but does not group closely with any currently known bacterial species. Concatenated MLSA phylogenies indicate that it may have shared a common ancestor with the Erwinia and Pantoea genera, and based on the clustering of rMLST genes, it was most closely related to Pantoea ananatis but represented a divergent lineage. We reconstructed a core genome of a putative common ancestor of Erwinia and Pantoea and compared this with the genomes of BFo bacteria. BFo2 possessed none of the virulence determinants that were omnipresent in the Erwinia and Pantoea genera. Taken together, these data are consistent with BFo2 representing a highly novel species that may be related to known Pantoea. PMID:26185096
Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Bybee, Seth M.; Bracken-Grissom, Heather; Haynes, Benjamin D.; Hermansen, Russell A.; Byers, Robert L.; Clement, Mark J.; Udall, Joshua A.; Wilcox, Edward R.; Crandall, Keith A.
2011-01-01
Next-gen sequencing technologies have revolutionized data collection in genetic studies and advanced genome biology to novel frontiers. However, to date, next-gen technologies have been used principally for whole genome sequencing and transcriptome sequencing. Yet many questions in population genetics and systematics rely on sequencing specific genes of known function or diversity levels. Here, we describe a targeted amplicon sequencing (TAS) approach capitalizing on next-gen capacity to sequence large numbers of targeted gene regions from a large number of samples. Our TAS approach is easily scalable, simple in execution, neither time- nor labor-intensive, relatively inexpensive, and can be applied to a broad diversity of organisms and/or genes. Our TAS approach includes a bioinformatic application, BarcodeCrucher, to take raw next-gen sequence reads and perform quality control checks and convert the data into FASTA format organized by gene and sample, ready for phylogenetic analyses. We demonstrate our approach by sequencing targeted genes of known phylogenetic utility to estimate a phylogeny for the Pancrustacea. We generated data from 44 taxa using 68 different 10-bp multiplexing identifiers. The overall quality of data produced was robust and was informative for phylogeny estimation. The potential for this method to produce copious amounts of data from a single 454 plate (e.g., 325 taxa for 24 loci) significantly reduces sequencing expenses incurred from traditional Sanger sequencing. We further discuss the advantages and disadvantages of this method, while offering suggestions to enhance the approach. PMID:22002916
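The sample-sorting step that the abstract attributes to its bioinformatic application (reads tagged with 10-bp multiplexing identifiers, output as FASTA organized by gene and sample) can be illustrated with a small sketch. This is not the published tool's code or interface: `demultiplex`, `to_fasta`, the exact-match rule, and the header layout are hypothetical, and quality control is omitted.

```python
def demultiplex(reads, barcodes, barcode_len=10):
    """Sort reads into samples by their leading barcode.

    reads: iterable of read strings that start with the barcode.
    barcodes: dict mapping barcode string -> sample name.
    Only exact barcode matches are accepted (a simplifying assumption).
    Returns (dict sample -> list of trimmed reads, list of unassigned reads).
    """
    by_sample = {sample: [] for sample in barcodes.values()}
    unassigned = []
    for read in reads:
        tag, insert = read[:barcode_len], read[barcode_len:]
        sample = barcodes.get(tag)
        if sample is None:
            unassigned.append(read)
        else:
            by_sample[sample].append(insert)
    return by_sample, unassigned


def to_fasta(by_sample, gene):
    """Render trimmed reads as FASTA text with gene|sample|readN headers."""
    lines = []
    for sample in sorted(by_sample):
        for i, seq in enumerate(by_sample[sample], start=1):
            lines.append(f">{gene}|{sample}|read{i}")
            lines.append(seq)
    return "\n".join(lines)


# Hypothetical barcodes and reads, for illustration only.
barcodes = {"ACGTACGTAC": "taxonA", "TTGGCCAATT": "taxonB"}
reads = ["ACGTACGTACGGGTTT", "TTGGCCAATTCCCAAA", "NNNNNNNNNNAAAA"]
by_sample, unassigned = demultiplex(reads, barcodes)
fasta_text = to_fasta(by_sample, "18S")
```

Exact matching keeps the sketch short; a production tool would usually tolerate a sequencing error or two in the barcode and would also separate reads by gene-specific primer.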
USDA-ARS's Scientific Manuscript database
A multilocus phylogenetic study was carried out to assess the species distribution in a set of 34 clinical isolates of Aspergillus section Circumdati from the USA, and their in vitro susceptibility to eight antifungal drugs was determined. The genetic markers used were ITS, BenA, CaM...
Genomic insights into the taxonomic status of the Bacillus cereus group
Liu, Yang; Lai, Qiliang; Göker, Markus; Meier-Kolthoff, Jan P.; Wang, Meng; Sun, Yamin; Wang, Lei; Shao, Zongze
2015-01-01
The identification and phylogenetic relationships of bacteria within the Bacillus cereus group are controversial. This study aimed at determining the taxonomic affiliations of these strains using the whole-genome sequence-based Genome BLAST Distance Phylogeny (GBDP) approach. The GBDP analysis clearly separated 224 strains into 30 clusters, representing eleven known, partially merged species and accordingly 19–20 putative novel species. Additionally, 16S rRNA gene analysis, a novel variant of multi-locus sequence analysis (nMLSA) and screening of virulence genes were performed. The 16S rRNA gene sequence was not sufficient to differentiate the bacteria within this group due to its high conservation. The nMLSA results were consistent with GBDP. Moreover, a fast typing method was proposed using the pycA gene, and where necessary, the ccpA gene. The pXO plasmids and cry genes were widely distributed, suggesting little correlation with the phylogenetic positions of the host bacteria. This might explain why classifications based on virulence characteristics proved unsatisfactory in the past. In summary, this is the first large-scale and systematic study of the taxonomic status of the bacteria within the B. cereus group using whole-genome sequences, and is likely to contribute to further insights into their pathogenicity, phylogeny and adaptation to diverse environments. PMID:26373441
Major clades of Agaricales: a multilocus phylogenetic overview.
P. Brandon Matheny; Judd M. Curtis; Valerie Hofstetter; M. Catherine Aime; Jean-Marc Moncalvo; Zai-Wei Ge; Zhu-Liang Yang; Joseph F. Ammirati; Timothy J. Baroni; Neale L. Bougher; Karen W. Hughes; D. Jean Lodge; Richard W. Kerrigan; Michelle T. Seidl; Duur K. Aanen; Matthew DeNitis; Graciela M. Daniele; Dennis E. Desjardin; Bradley R. Kropp; Lorelei L. Norvell; Andrew Parker; Else C. Vellinga; Rytas Vilgalys; David S. Hibbett
2006-01-01
An overview of the phylogeny of the Agaricales is presented based on a multilocus analysis of a six-gene region supermatrix. Bayesian analyses of 5611 nucleotide characters of rpb1, rpb1-intron 2, rpb2 and 18S, 25S, and 5.8S ribosomal RNA genes recovered six major clades, which are recognized informally and labeled the Agaricoid, Tricholomatoid, Marasmioid, Pluteoid,...
Buján, Noemí; Balboa, Sabela; L Romalde, Jesús; E Toranzo, Alicia; Magariños, Beatriz
2018-05-08
At present, the genus Edwardsiella comprises five species: E. tarda, E. hoshinae, E. ictaluri, E. piscicida and E. anguillarum. Some species of this genus, such as E. ictaluri and E. piscicida, are important pathogens of numerous fish species. With the description of the two latter species, the phylogeny of Edwardsiella became more complicated. With the aim of clarifying the relationships among all species in the genus, a multilocus sequence typing (MLST) approach was developed and applied to characterize 56 isolates and 6 reference strains belonging to the five Edwardsiella species. Moreover, several analyses based on the MLST scheme were performed to investigate the evolution within the genus, as well as the influence of recombination and mutation on speciation. Edwardsiella isolates showed high genetic variability: of eighteen sequence types (STs) in total, fourteen were represented by a single isolate. Mutation events were considerably more frequent than recombination, although both contributed approximately equally to genetic diversification. However, speciation within the genus occurred mostly by recombination. The genus Edwardsiella displays a non-clonal population structure with some degree of geographical isolation followed by a population expansion of E. piscicida. A database from this study was created and hosted on pubmlst.org (http://pubmlst.org/edwardsiella/). Copyright © 2018 Elsevier Inc. All rights reserved.
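In MLST, each isolate is summarized as a profile of allele numbers, one per locus, and isolates with identical profiles share a sequence type. The sketch below shows that bookkeeping and how singleton STs (those seen in only one isolate) can be counted. The ST numbers it assigns are local to the example and are not PubMLST identifiers; the function names and toy profiles are assumptions.

```python
from collections import Counter


def assign_sequence_types(profiles):
    """Map each isolate to a sequence type (ST).

    profiles: dict isolate name -> sequence of allele numbers, one per locus.
    Identical profiles receive the same ST; STs are numbered from 1 in order
    of first appearance (an arbitrary local scheme, not a curated database).
    """
    st_of_profile = {}
    st_of_isolate = {}
    for isolate, profile in profiles.items():
        key = tuple(profile)
        if key not in st_of_profile:
            st_of_profile[key] = len(st_of_profile) + 1
        st_of_isolate[isolate] = st_of_profile[key]
    return st_of_isolate


def singleton_sts(st_of_isolate):
    """Return the sorted list of STs represented by exactly one isolate."""
    counts = Counter(st_of_isolate.values())
    return sorted(st for st, n in counts.items() if n == 1)


# Hypothetical three-locus profiles for four isolates.
profiles = {
    "iso1": (1, 1, 2),
    "iso2": (1, 1, 2),
    "iso3": (2, 1, 2),
    "iso4": (3, 4, 1),
}
sts = assign_sequence_types(profiles)
singletons = singleton_sts(sts)
```

With real data, the proportion of singleton STs among all STs gives a quick sense of the kind of variability the abstract reports.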
Frequent gene flow blurred taxonomic boundaries of sections in Lilium L. (Liliaceae)
Liu, Shih-Hui; Chiang, Tzen-Yuh
2017-01-01
Gene flow between species may last a long time in plants. Reticulation inevitably causes difficulties in phylogenetic reconstruction. In this study, we looked into the genetic divergence and phylogeny of 20 Lilium species based on multilocus analyses of 8 genes of chloroplast DNA (cpDNA), the internal transcribed spacer of nuclear ribosomal DNA (nrITS) and 20 loci extracted from the expressed sequence tag (EST) libraries of L. longiflorum Thunb. and L. formosanum Wallace. The phylogeny based on the combined data of the maternally inherited cpDNA and nrITS was largely consistent with the taxonomy of Lilium sections. This phylogeny was deemed the hypothetical species tree and uncovered three groups, i.e., Cluster A consisting of 4 taxa from the sections Pseudolirium and Liriotypus, Cluster B consisting of 4 taxa from the sections Leucolirion, Archelirion and Daurolirion, and Cluster C comprising 10 taxa mostly from the sections Martagon and Sinomartagon. In contrast, systematic inconsistency occurred across the EST loci, with up to 19 genes (95%) displaying tree topologies deviating from the hypothetical species tree. The phylogenetic incongruence was likely attributable to the frequent genetic exchanges between species/sections, as indicated by the high levels of genetic recombination and the IMa analyses with the EST loci. Nevertheless, multilocus analysis could provide complementary information among the loci on the species split and the extent of gene flow between the species. In conclusion, this study not only detected frequent gene flow among Lilium sections that resulted in phylogenetic incongruence but also reconstructed a hypothetical species tree that gave insights into the nature of the complex relationships among Lilium species. PMID:28841664
Shahin, Arwa; Smulders, Marinus J. M.; van Tuyl, Jaap M.; Arens, Paul; Bakker, Freek T.
2014-01-01
Next Generation Sequencing (NGS) may enable estimating relationships among genotypes using allelic variation of multiple nuclear genes simultaneously. We explored the potential and caveats of this strategy in four genetically distant Lilium cultivars to estimate their genetic divergence from transcriptome sequences using three approaches: POFAD (Phylogeny of Organisms from Allelic Data, uses allelic information of sequence data), RAxML (Randomized Accelerated Maximum Likelihood, tree building based on concatenated consensus sequences) and Consensus Network (constructing a network summarizing among gene tree conflicts). Twenty six gene contigs were chosen based on the presence of orthologous sequences in all cultivars, seven of which also had an orthologous sequence in Tulipa, used as out-group. The three approaches generated the same topology. Although the resolution offered by these approaches is high, in this case there was no extra benefit in using allelic information. We conclude that these 26 genes can be widely applied to construct a species tree for the genus Lilium. PMID:25368628
Lescat, Mathilde; Hoede, Claire; Clermont, Olivier; Garry, Louis; Darlu, Pierre; Tuffery, Pierre; Denamur, Erick; Picard, Bertrand
2009-12-29
Previous studies have established a correlation between electrophoretic polymorphism of esterase B, and virulence and phylogeny of Escherichia coli. Strains belonging to the phylogenetic group B2 are more frequently implicated in extraintestinal infections and include esterase B2 variants, whereas phylogenetic groups A, B1 and D contain less virulent strains and include esterase B1 variants. We investigated esterase B as a marker of phylogeny and/or virulence, in a thorough analysis of the esterase B-encoding gene. We identified the gene encoding esterase B as the acetyl-esterase gene (aes) using gene disruption. The analysis of aes nucleotide sequences in a panel of 78 reference strains, including the E. coli reference (ECOR) strains, demonstrated that the gene is under purifying selection. The phylogenetic tree reconstructed from aes sequences showed a strong correlation with the species phylogenetic history, based on multi-locus sequence typing using six housekeeping genes. The unambiguous distinction between variants B1 and B2 by electrophoresis was consistent with Aes amino-acid sequence analysis and protein modelling, which showed that substituted amino acids in the two esterase B variants occurred mostly at different sites on the protein surface. Studies in an experimental mouse model of septicaemia using mutant strains did not reveal a direct link between aes and extraintestinal virulence. Moreover, we did not find any genes in the chromosomal region of aes to be associated with virulence. Our findings suggest that aes does not play a direct role in the virulence of E. coli extraintestinal infection. However, this gene acts as a powerful marker of phylogeny, illustrating the extensive divergence of B2 phylogenetic group strains from the rest of the species.
Listeria monocytogenes sequence type 1 is predominant in ruminant rhombencephalitis
Dreyer, Margaux; Aguilar-Bultet, Lisandra; Rupp, Sebastian; Guldimann, Claudia; Stephan, Roger; Schock, Alexandra; Otter, Arthur; Schüpbach, Gertraud; Brisse, Sylvain; Lecuit, Marc; Frey, Joachim; Oevermann, Anna
2016-01-01
Listeria (L.) monocytogenes is an opportunistic pathogen causing life-threatening infections in diverse mammalian species including humans and ruminants. As little is known on the link between strains and clinicopathological phenotypes, we studied potential strain-associated virulence and organ tropism in L. monocytogenes isolates from well-defined ruminant cases of clinical infections and the farm environment. The phylogeny of isolates and their virulence-associated genes were analyzed by multilocus sequence typing (MLST) and sequence analysis of virulence-associated genes. Additionally, a panel of representative isolates was subjected to in vitro infection assays. Our data suggest the environmental exposure of ruminants to a broad range of strains and yet the strong association of sequence type (ST) 1 from clonal complex (CC) 1 with rhombencephalitis, suggesting increased neurotropism of ST1 in ruminants, which is possibly related to its hypervirulence. This study emphasizes the importance of considering clonal background of L. monocytogenes isolates in surveillance, epidemiological investigation and disease control. PMID:27848981
Reynolds, R Graham; Niemiller, Matthew L; Revell, Liam J
2014-02-01
Snakes in the families Boidae and Pythonidae constitute some of the most spectacular reptiles and comprise an enormous diversity of morphology, behavior, and ecology. While many species of boas and pythons are familiar, taxonomy and evolutionary relationships within these families remain contentious and fluid. A major effort in evolutionary and conservation biology is to assemble a comprehensive Tree-of-Life, or a macro-scale phylogenetic hypothesis, for all known life on Earth. No previously published study has produced a species-level molecular phylogeny for more than 61% of boa species or 65% of python species. Using both novel and previously published sequence data, we have produced a species-level phylogeny for 84.5% of boid species and 82.5% of pythonid species, contextualized within a larger phylogeny of henophidian snakes. We obtained new sequence data for three boid, one pythonid, and two tropidophiid taxa which have never previously been included in a molecular study, in addition to generating novel sequences for seven genes across an additional 12 taxa. We compiled an 11-gene dataset for 127 taxa, consisting of the mitochondrial genes CYTB, 12S, and 16S, and the nuclear genes bdnf, bmp2, c-mos, gpr35, rag1, ntf3, odc, and slc30a1, totaling up to 7561 base pairs per taxon. We analyzed this dataset using both maximum likelihood and Bayesian inference and recovered a well-supported phylogeny for these species. We found significant evidence of discordance between taxonomy and evolutionary relationships in the genera Tropidophis, Morelia, Liasis, and Leiopython, and we found support for elevating two previously suggested boid species. We suggest a revised taxonomy for the boas (13 genera, 58 species) and pythons (8 genera, 40 species), review relationships between our study and the many other molecular phylogenetic studies of henophidian snakes, and present a taxonomic database and alignment which may be easily used and built upon by other researchers. 
Copyright © 2013 Elsevier Inc. All rights reserved.
Tong, Steven Y.C.; Holden, Matthew T.G.; Nickerson, Emma K.; Cooper, Ben S.; Köser, Claudio U.; Cori, Anne; Jombart, Thibaut; Cauchemez, Simon; Fraser, Christophe; Wuthiekanun, Vanaporn; Thaipadungpanit, Janjira; Hongsuwan, Maliwan; Day, Nicholas P.; Limmathurotsakul, Direk; Parkhill, Julian; Peacock, Sharon J.
2015-01-01
Methicillin-resistant Staphylococcus aureus (MRSA) is a major cause of nosocomial infection. Whole-genome sequencing of MRSA has been used to define phylogeny and transmission in well-resourced healthcare settings, yet the greatest burden of nosocomial infection occurs in resource-restricted settings where barriers to transmission are lower. Here, we study the flux and genetic diversity of MRSA on ward and individual patient levels in a hospital where transmission was common. We repeatedly screened all patients on two intensive care units for MRSA carriage over a 3-mo period. All MRSA belonged to multilocus sequence type 239 (ST 239). We defined the population structure and charted the spread of MRSA by sequencing 79 isolates from 46 patients and five members of staff, including the first MRSA-positive screen isolates and up to two repeat isolates where available. Phylogenetic analysis identified a flux of distinct ST 239 clades over time in each intensive care unit. In total, five main clades were identified, which varied in the carriage of plasmids encoding antiseptic and antimicrobial resistance determinants. Sequence data confirmed intra- and interwards transmission events and identified individual patients who were colonized by more than one clade. One patient on each unit was the source of numerous transmission events, and deep sampling of one of these cases demonstrated colonization with a “cloud” of related MRSA variants. The application of whole-genome sequencing and analysis provides novel insights into the transmission of MRSA in under-resourced healthcare settings and has relevance to wider global health. PMID:25491771
Carbone, Ignazio; White, James B; Miadlikowska, Jolanta; Arnold, A Elizabeth; Miller, Mark A; Kauff, Frank; U'Ren, Jana M; May, Georgiana; Lutzoni, François
2017-04-15
High-quality phylogenetic placement of sequence data has the potential to greatly accelerate studies of the diversity, systematics, ecology and functional biology of diverse groups. We developed the Tree-Based Alignment Selector (T-BAS) toolkit to allow evolutionary placement and visualization of diverse DNA sequences representing unknown taxa within a robust phylogenetic context, and to permit the downloading of highly curated, single- and multi-locus alignments for specific clades. In its initial form, T-BAS v1.0 uses a core phylogeny of 979 taxa (including 23 outgroup taxa, as well as 61 orders, 175 families and 496 genera) representing all 13 classes of the largest subphylum of Fungi, Pezizomycotina (Ascomycota), based on sequence alignments for six loci (nr5.8S, nrLSU, nrSSU, mtSSU, RPB1, RPB2). T-BAS v1.0 has three main uses: (i) Users may download alignments and voucher tables for members of the Pezizomycotina directly from the reference tree, facilitating systematics studies of focal clades. (ii) Users may upload sequence files with reads representing unknown taxa and place these on the phylogeny using either BLAST or phylogeny-based approaches, and then use the displayed tree to select reference taxa to include when downloading alignments. The placement of unknowns can be performed for large numbers of Sanger sequences obtained from fungal cultures and for alignable, short reads of environmental amplicons. (iii) User-customizable metadata can be visualized on the tree. T-BAS Version 1.0 is available online at http://tbas.hpc.ncsu.edu. Registration is required to access the CIPRES Science Gateway and NSF XSEDE's large computational resources. Contact: icarbon@ncsu.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
West, Claire; James, Stephen A; Davey, Robert P; Dicks, Jo; Roberts, Ian N
2014-07-01
The ribosomal RNA encapsulates a wealth of evolutionary information, including genetic variation that can be used to discriminate between organisms at a wide range of taxonomic levels. For example, the prokaryotic 16S rDNA sequence is very widely used both in phylogenetic studies and as a marker in metagenomic surveys and the internal transcribed spacer region, frequently used in plant phylogenetics, is now recognized as a fungal DNA barcode. However, this widespread use does not escape criticism, principally due to issues such as difficulties in classification of paralogous versus orthologous rDNA units and intragenomic variation, both of which may be significant barriers to accurate phylogenetic inference. We recently analyzed data sets from the Saccharomyces Genome Resequencing Project, characterizing rDNA sequence variation within multiple strains of the baker's yeast Saccharomyces cerevisiae and its nearest wild relative Saccharomyces paradoxus in unprecedented detail. Notably, both species possess single locus rDNA systems. Here, we use these new variation datasets to assess whether a more detailed characterization of the rDNA locus can alleviate the second of these phylogenetic issues, sequence heterogeneity, while controlling for the first. We demonstrate that a strong phylogenetic signal exists within both datasets and illustrate how they can be used, with existing methodology, to estimate intraspecies phylogenies of yeast strains consistent with those derived from whole-genome approaches. We also describe the use of partial Single Nucleotide Polymorphisms, a type of sequence variation found only in repetitive genomic regions, in identifying key evolutionary features such as genome hybridization events and show their consistency with whole-genome Structure analyses. 
We conclude that our approach can transform rDNA sequence heterogeneity from a problem to a useful source of evolutionary information, enabling the estimation of highly accurate phylogenies of closely related organisms, and discuss how it could be extended to future studies of multilocus rDNA systems. [concerted evolution; genome hybridisation; phylogenetic analysis; ribosomal DNA; whole genome sequencing; yeast]. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Schirtzinger, Erin E.; Tavares, Erika S.; Gonzales, Lauren A.; Eberhard, Jessica R.; Miyaki, Cristina Y.; Sanchez, Juan J.; Hernandez, Alexis; Müeller, Heinrich; Graves, Gary R.; Fleischer, Robert C.; Wright, Timothy F.
2012-01-01
Mitochondrial genomes are generally thought to be under selection for compactness, due to their small size, consistent gene content, and a lack of introns or intergenic spacers. As more animal mitochondrial genomes are fully sequenced, rearrangements and partial duplications are being identified with increasing frequency, particularly in birds (Class Aves). In this study, we investigate the evolutionary history of mitochondrial control region states within the avian order Psittaciformes (parrots and cockatoos). To this aim, we reconstructed a comprehensive multi-locus phylogeny of parrots, used PCR of three diagnostic fragments to classify the mitochondrial control region state as single or duplicated, and mapped these states onto the phylogeny. We further sequenced 44 selected species to validate these inferences of control region state. Ancestral state reconstruction using a range of weighting schemes identified six independent origins of mitochondrial control region duplications within Psittaciformes. Analysis of sequence data showed that varying levels of mitochondrial gene and tRNA homology and degradation were present within a given clade exhibiting duplications. Levels of divergence between control regions within an individual varied from 0–10.9% with the differences occurring mainly between 51 and 225 nucleotides 3′ of the goose hairpin in domain I. Further investigations into the fates of duplicated mitochondrial genes, the potential costs and benefits of having a second control region, and the complex relationship between evolutionary rates, selection, and time since duplication are needed to fully explain these patterns in the mitochondrial genome. PMID:22543055
Carro, Lorena; Spröer, Cathrin; Alonso, Pilar; Trujillo, Martha E
2012-03-01
It was recently reported that Micromonospora inhabits the intracellular tissues of nitrogen fixing nodules of the wild legume Lupinus angustifolius. To determine if Micromonospora populations are also present in nitrogen fixing nodules of cultivated legumes such as Pisum sativum, we carried out the isolation of this actinobacterium from P. sativum plants collected in two man-managed fields in the region of Castilla and León (Spain). In this work, we describe the isolation of 93 Micromonospora strains recovered from nitrogen fixing nodules and the rhizosphere of P. sativum. The genomic diversity of the strains was analyzed by amplified ribosomal DNA restriction analysis (ARDRA). Forty-six isolates and 34 reference strains were further analyzed using a multilocus sequence analysis scheme developed to address the phylogeny of the genus Micromonospora and to evaluate the species distribution in the two studied habitats. The MLSA results were evaluated by DNA-DNA hybridization to determine their usefulness for the delineation of Micromonospora at the species level. In most cases, DDH values below 70% were obtained with strains that shared a sequence similarity of 98.5% or less. Thus, MLSA studies clearly supported the established taxonomy of the genus Micromonospora and indicated that genomic species could be delineated as groups of strains that share > 98.5% sequence similarity based on the 5 genes selected. The species diversity of the strains isolated from both the rhizosphere and nodules was very high and in many cases the new strains could not be related to any of the currently described species. Copyright © 2011 Elsevier GmbH. All rights reserved.
Apablaza, P; Løland, A D; Brevik, Ø J; Ilardi, P; Battaglia, J; Nylund, A
2013-04-01
The aim of the study was to describe the genetic relationship between isolates of Flavobacterium psychrophilum, with the main emphasis on samples from Chile and Norway. The isolates were obtained from farmed salmonids in Norway and Chile, and from wild salmonids in Norway, but isolates from North America and European countries are also included in the analysis. The study is based on phylogenetic analysis of 16S rRNA and seven housekeeping genes (HG), gyrB, atpA, dnaK, trpB, fumC, murG and tuf, and the use of a multilocus sequence typing (MLST) system, based on nucleotide polymorphism in the HG, as an alternative to the phylogenies. The variation within the selected genes was limited, and the phylogenetic analysis gave little resolution between the isolates. The MLST gave much better resolution, resulting in 53 sequence types, where the same sequence types could be found in Chile, North America and European countries, and in different host species. Multilocus sequence typing gives a relatively good separation of different isolates of Fl. psychrophilum and shows that there are no distinct geographical or host-specific isolates in the studied material from Chile, North America and Europe. Nor was it possible to separate isolates from ulcers and systemic infections from isolates from the surface of healthy salmonids. This study shows a wide geographical distribution of Fl. psychrophilum, indicating that the bacterium has a large potential for transmission over long distances, and between different salmonid host species. This knowledge will be important for future management of salmonid diseases connected to Fl. psychrophilum. © 2013 The Society for Applied Microbiology.
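As a rough illustration of how an MLST scheme of this kind turns nucleotide polymorphism into sequence types, the sketch below numbers each distinct sequence at a locus as an allele and each distinct allelic profile as a sequence type. The locus names come from the abstract; the function name, the numbering order, and the toy sequences are hypothetical and do not reproduce the authors' scheme or any curated MLST database.

```python
from typing import Dict, Tuple

# Housekeeping genes named in the abstract.
LOCI: Tuple[str, ...] = ("gyrB", "atpA", "dnaK", "trpB", "fumC", "murG", "tuf")


def assign_sequence_types(isolates: Dict[str, Dict[str, str]]) -> Dict[str, int]:
    """Assign a sequence type (ST) number to each isolate.

    Each distinct sequence at a locus gets an allele number in order of
    first appearance; each distinct tuple of allele numbers gets an ST.
    """
    allele_ids: Dict[str, Dict[str, int]] = {locus: {} for locus in LOCI}
    st_ids: Dict[Tuple[int, ...], int] = {}
    result: Dict[str, int] = {}
    for name, seqs in isolates.items():
        profile = tuple(
            allele_ids[locus].setdefault(seqs[locus], len(allele_ids[locus]) + 1)
            for locus in LOCI
        )
        result[name] = st_ids.setdefault(profile, len(st_ids) + 1)
    return result


# Toy data: made-up sequences, for illustration only.
toy = {
    "isolate_A": {locus: "ACGT" for locus in LOCI},
    "isolate_B": {locus: "ACGT" for locus in LOCI},
    "isolate_C": {**{locus: "ACGT" for locus in LOCI}, "tuf": "ACGA"},
}
sequence_types = assign_sequence_types(toy)
```

Here isolates A and B share one sequence type, while a single substitution in tuf places isolate C in a second one, which is why MLST can separate isolates that a low-resolution gene tree cannot.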
Mitogenomics of 'Old World Acraea' butterflies reveals a highly divergent 'Bematistes'.
Timmermans, M J T N; Lees, D C; Thompson, M J; Sáfián, Sz; Brattström, O
2016-04-01
Afrotropical Acraeini butterflies provide a fascinating potential model system to contrast with the Neotropical Heliconiini, yet their phylogeny remains largely unexplored by molecular methods and their generic level nomenclature is still contentious. To test the potential of mitogenomes in a simultaneous analysis of the radiation, we sequenced the full mitochondrial genomes of 19 African species. Analyses show the potential of mitogenomic phylogeny reconstruction in this group. Inferred relationships are largely congruent with a previous multilocus study. We confirm a monophyletic Telchinia to include the Asiatic Pareba with a complicated paraphylum, traditional (sub)genus Acraea, toward the base. The results suggest that several proposed subgenera and some species groups within Telchinia are not monophyletic, while two other (sub)genera could possibly be combined. Telchinia was recovered without strong support as sister to the potentially interesting system of distasteful model butterflies known as Bematistes, a name that is suppressed in some treatments. Surprisingly, we find that this taxon has remarkably divergent mitogenomes and unexpected synapomorphic tRNA rearrangements. These gene order changes, combined with evidence for deviating dN/dS ratios and evidence for episodic diversifying selection, suggest that the ancestral Bematistes mitogenome has had a turbulent past. Our study adds genetic support for treating this clade as a distinct genus, while the alternative option, adopted by some authors, of Acraea being equivalent to Acraeini merely promotes redundancy. We pave the way for more detailed mitogenomic and multi-locus molecular analyses which can determine how many genera are needed (possibly at least six) to divide Acraeini into monophyletic groups that also facilitate communication about their biology. Copyright © 2016 Elsevier Inc. All rights reserved.
Species limits, phylogeography and reproductive mode in the Metarhizium anisopliae complex
USDA-ARS's Scientific Manuscript database
An essential first step toward understanding the ecology and life histories of Metarhizium anisopliae-group species as entomopathogens, endophytes and soil-adapted fungi is the ability to accurately define species limits and confidently infer a species tree. Here we present a multilocus phylogeny of...
Reconstructing the backbone of the Saccharomycotina yeast phylogeny using genome-scale data
USDA-ARS's Scientific Manuscript database
Understanding the phylogenetic relationships among the yeasts of the subphylum Saccharomycotina is a prerequisite for understanding the evolution of their metabolisms and ecological lifestyles. In the last two decades, the use of rDNA and multi-locus data sets has greatly advanced our understanding ...
Lefoulon, Emilie; Bourret, Jérôme; Junker, Kerstin; Guerrero, Ricardo; Cañizales, Israel; Kuzmin, Yuriy; Satoto, Tri Baskoro T.; Cardenas-Callirgos, Jorge Manuel; de Souza Lima, Sueli; Raccurt, Christian; Mutafchiev, Yasen; Gavotte, Laurent; Martin, Coralie
2015-01-01
During the past twenty years, a number of molecular analyses have been performed to determine the evolutionary relationships of Onchocercidae, a family of filarial nematodes encompassing several species of medical or veterinary importance. However, opportunities for broad taxonomic sampling have been scarce, and analyses were based mainly on 12S rDNA and coxI gene sequences. While being suitable for species differentiation, these mitochondrial genes cannot be used to infer phylogenetic hypotheses at higher taxonomic levels. In the present study, 48 species, representing seven of eight subfamilies within the Onchocercidae, were sampled and sequences of seven gene loci (nuclear and mitochondrial) analysed, resulting in the hitherto largest molecular phylogenetic investigation into this family. Although our data support the current hypothesis that the Oswaldofilariinae, Waltonellinae and Icosiellinae subfamilies separated early from the remaining onchocercids, Setariinae was recovered as a well separated clade. Dirofilaria, Loxodontofilaria and Onchocerca constituted a strongly supported clade despite belonging to different subfamilies (Onchocercinae and Dirofilariinae). Finally, the separation between Splendidofilariinae, Dirofilariinae and Onchocercinae will have to be reconsidered. PMID:26588229
Chavda, Kalyan D.; Chen, Liang; Fouts, Derrick E.; Sutton, Granger; Brinkac, Lauren; Jenkins, Stephen G.; Bonomo, Robert A.
2016-01-01
ABSTRACT Knowledge regarding the genomic structure of Enterobacter spp., the second most prevalent carbapenemase-producing Enterobacteriaceae, remains limited. Here we sequenced 97 clinical Enterobacter species isolates that were both carbapenem susceptible and resistant from various geographic regions to decipher the molecular origins of carbapenem resistance and to understand the changing phylogeny of these emerging and drug-resistant pathogens. Of the carbapenem-resistant isolates, 30 possessed blaKPC-2, 40 had blaKPC-3, 2 had blaKPC-4, and 2 had blaNDM-1. Twenty-three isolates were carbapenem susceptible. Six genomes were sequenced to completion, and their sizes ranged from 4.6 to 5.1 Mbp. Phylogenomic analysis placed 96 of these genomes, 351 additional Enterobacter genomes downloaded from NCBI GenBank, and six newly sequenced type strains into 19 phylogenomic groups—18 groups (A to R) in the Enterobacter cloacae complex and Enterobacter aerogenes. Diverse mechanisms underlying the molecular evolutionary trajectory of these drug-resistant Enterobacter spp. were revealed, including the acquisition of an antibiotic resistance plasmid, followed by clonal spread, horizontal transfer of blaKPC-harboring plasmids between different phylogenomic groups, and repeated transposition of the blaKPC gene among different plasmid backbones. Group A, which comprises multilocus sequence type 171 (ST171), was the most commonly identified (23% of isolates). Genomic analysis showed that ST171 isolates evolved from a common ancestor and formed two different major clusters; each acquiring unique blaKPC-harboring plasmids, followed by clonal expansion. 
The data presented here represent the first comprehensive study of phylogenomic interrogation and the relationship between antibiotic resistance and plasmid discrimination among carbapenem-resistant Enterobacter spp., demonstrating the genetic diversity and complexity of the molecular mechanisms driving antibiotic resistance in this genus. PMID:27965456
O'Donnell, Kerry; Sutton, Deanna A; Fothergill, Annette; McCarthy, Dora; Rinaldi, Michael G; Brandt, Mary E; Zhang, Ning; Geiser, David M
2008-08-01
Members of the species-rich Fusarium solani species complex (FSSC) are responsible for approximately two-thirds of all fusarioses of humans and other animals. In addition, many economically important phytopathogenic species are nested within this complex. Due to their increasing clinical relevance and because most of the human pathogenic and plant pathogenic FSSC lack Latin binomials, we have extended the multilocus haplotype nomenclatural system introduced in a previous study (D. C. Chang, G. B. Grant, K. O'Donnell, K. A. Wannemuehler, J. Noble-Wang, C. Y. Rao, L. M. Jacobson, C. S. Crowell, R. S. Sneed, F. M. T. Lewis, J. K. Schaffzin, M. A. Kainer, C. A. Genese, E. C. Alfonso, D. B. Jones, A. Srinivasan, S. K. Fridkin, and B. J. Park, JAMA 296:953-963, 2006) to all 34 species within the medically important FSSC clade 3 to facilitate global epidemiological studies. The typing scheme is based on polymorphisms in portions of the following three genes: the internal transcribed spacer region and domains D1 plus D2 of the nuclear large-subunit rRNA, the translation elongation factor 1 alpha gene (EF-1alpha), and the second largest subunit of RNA polymerase II gene (RPB2). Of the 251 isolates subjected to multilocus DNA sequence typing, 191 sequence types were differentiated, and these were distributed among three strongly supported clades designated 1, 2, and 3. All of the mycosis-associated isolates were restricted to FSSC clade 3, as previously reported (N. Zhang, K. O'Donnell, D. A. Sutton, F. A Nalim, R. C. Summerbell, A. A. Padhye, and D. M. Geiser, J. Clin. Microbiol. 44:2186-2190, 2006), and these represent at least 20 phylogenetically distinct species. Analyses of the combined DNA sequence data by use of two separate phylogenetic methods yielded the most robust hypothesis of evolutionary relationships and genetic diversity within the FSSC to date. 
The in vitro activities of 10 antifungals tested against 19 isolates representing 18 species that span the breadth of the FSSC phylogeny show that members of this complex are broadly resistant to these drugs.
Population Structure in Nontypeable Haemophilus influenzae
LaCross, Nathan C.; Marrs, Carl F.; Gilsdorf, Janet R.
2013-01-01
Nontypeable Haemophilus influenzae (NTHi) frequently colonize the human pharynx asymptomatically, and are an important cause of otitis media in children. Past studies have identified typeable H. influenzae as being clonal, but the population structure of NTHi has not been extensively characterized. The research presented here investigated the diversity and population structure in a well-characterized collection of NTHi isolated from the middle ears of children with otitis media or the pharynges of healthy children in three disparate geographic regions. Multilocus sequence typing identified 109 unique sequence types among 170 commensal and otitis media-associated NTHi isolates from Finland, Israel, and the US. The largest clonal complex contained only five sequence types, indicating a high level of genetic diversity. The eBURST v3, ClonalFrame 1.1, and structure 2.3.3 programs were used to further characterize diversity and population structure from the sequence typing data. Little clustering was apparent by either disease state (otitis media or commensalism) or geography in the ClonalFrame phylogeny. Population structure was clearly evident, with support for eight populations when all 170 isolates were analyzed. Interestingly, one population contained only commensal isolates, while two others consisted solely of otitis media isolates, suggesting associations between population structure and disease. PMID:23266487
Álvarez, Natalí; Gómez, Giovan F; Naranjo-Díaz, Nelson; Correa, Margarita M
2018-06-18
The Arribalzagia Series of the Anopheles Subgenus comprises morphologically similar species or members of species complexes, which makes correct species identification difficult. Therefore, the aim of this work was to discriminate the morphospecies of the Arribalzagia Series present in Colombia using a multilocus approach based on ITS2, COI and CAD sequences. Specimens of the Arribalzagia Series collected at 32 localities in nine departments were allocated to seven species. Individual and concatenated Bayesian analyses showed high support for each of the species and reinforced the previous report of the Apicimacula species Complex with distribution in the Pacific Coast and northwestern Colombia. In addition, a new molecular operational taxonomic unit (MOTU) was identified, herein denominated near Anopheles peryassui, providing support for the existence of a Peryassui species Complex. Further, the CAD gene, just recently used for Anopheles taxonomy and phylogeny, demonstrated its power in resolving phylogenetic relationships among species of the Arribalzagia Series. The divergence times for these species correspond to the early Pliocene and the Miocene. Considering the epidemiological importance of some species of the Series and their co-occurrence in malaria endemic regions of Colombia, their discrimination constitutes an important step for vector incrimination and control in the country. Copyright © 2018. Published by Elsevier B.V.
USDA-ARS's Scientific Manuscript database
A Multilocus Sequence Typing (MLST) method based on allelic variation of 7 chromosomal loci was developed for characterizing genotypes within the genus Bradyrhizobium. With the method 29 distinct multilocus genotypes (GTs) were identified among 191 culture collection soybean strains. The occupancy ...
Low Divergence of Clonorchis sinensis in China Based on Multilocus Analysis
Sun, Jiufeng; Huang, Yan; Huang, Huaiqiu; Liang, Pei; Wang, Xiaoyun; Mao, Qiang; Men, Jingtao; Chen, Wenjun; Deng, Chuanhuan; Zhou, Chenhui; Lv, Xiaoli; Zhou, Juanjuan; Zhang, Fan; Li, Ran; Tian, Yanli; Lei, Huali; Liang, Chi; Hu, Xuchu; Xu, Jin; Li, Xuerong; Yu, Xinbing
2013-01-01
Clonorchis sinensis, an ancient parasite that infects a number of piscivorous mammals, attracts significant public health interest due to zoonotic exposure risks in Asia. The available studies are insufficient to reflect the prevalence, geographic distribution, and intraspecific genetic diversity of C. sinensis in endemic areas. Here, a multilocus analysis based on eight genes (ITS1, act, tub, ef-1a, cox1, cox3, nad4 and nad5 [4.986 kb]) was employed to explore the intra-species genetic construction of C. sinensis in China. Two hundred and fifty-six C. sinensis isolates were obtained from environmental reservoirs from 17 provinces of China. A total of 254 recognized Multilocus Types (MSTs) showed high diversity among these isolates using multilocus analysis. The comparison analysis of nuclear and mitochondrial phylogeny supports separate clusters in a nuclear dendrogram. Genetic differentiation analysis of three clusters (A, B, and C) showed low divergence within populations. Most isolates from clusters B and C are geographically limited to central China, while cluster A is extraordinarily genetically diverse. Further genetic analyses between different geographic distributions, water bodies and hosts support the low population divergence. The latter haplotype analyses were consistent with the phylogenetic and genetic differentiation results. A recombination network based on concatenated sequences showed a concentrated linkage recombination population in cox1, cox3, nad4 and nad5, with spatial structuring in ITS1. Coupled with the history record and archaeological evidence of C. sinensis infection in mummified desiccated feces, these data point to an ancient origin of C. sinensis in China. In conclusion, we present a likely phylogenetic structure of the C. sinensis population in mainland China, highlighting its possible tendency for biogeographic expansion. Meanwhile, ITS1 was found to be an effective marker for tracking C. sinensis infection worldwide. 
Thus, the present study improves our understanding of the global epidemiology and evolution of C. sinensis. PMID:23825605
USDA-ARS's Scientific Manuscript database
Species of Colletotrichum interact with a vast but as yet undetermined number of plant species as pathogens and as asymptomatic endophytes. It is not known, however, whether these contrasting ecological modes are optional strategies exercised by individual species or whether species ecology is more ...
Merkel, Viktor; Ohder, Barbara; Bielaszewska, Martina; Zhang, Wenlan; Fruth, Angelika; Menge, Christian; Borrmann, Erika; Middendorf, Barbara; Müthing, Johannes; Karch, Helge; Mellmann, Alexander
2010-01-01
eibG in Shiga toxin-producing Escherichia coli (STEC) O91 encodes a protein (EibG) which binds human immunoglobulins G and A and contributes to bacterial chain-like adherence to human epithelial cells. We investigated the prevalence of eibG among STEC, the phylogeny of eibG, and eibG allelic variations and their impact on the adherence phenotype. eibG was found in 15.0% of 240 eae-negative STEC strains but in none of 157 eae-positive STEC strains. The 36 eibG-positive STEC strains belonged to 14 serotypes and to eight multilocus sequence types (STs), with serotype O91:H14/H− and ST33 being the most common. Sequences of the complete eibG gene (1,527 bp in size) from eibG-positive STEC resulted in 21 different alleles with 88.11% to 100% identity to the previously reported eibG sequence; they clustered into three eibG subtypes (eibG-α, eibG-β, and eibG-γ). Strains expressing EibG-α and EibG-β displayed a mostly typical chain-like adherence pattern (CLAP), with formation of long chains on both human and bovine intestinal epithelial cells, whereas strains with EibG-γ adhered in short chains, a pattern we termed atypical CLAP. The same adherence phenotypes were displayed by E. coli BL21(DE3) clones containing the respective eibG-α, eibG-β, and eibG-γ subtypes. We propose two possible evolutionary scenarios for eibG in STEC: a clonal development of eibG in strains with the same phylogenetic background or horizontal transfer of eibG between phylogenetically unrelated STEC strains. PMID:20547747
Rojas, Enith I; Rehner, Stephen A; Samuels, Gary J; Van Bael, Sunshine A; Herre, Edward A; Cannon, Paul; Chen, Rui; Pang, Junfeng; Wang, Ruiwu; Zhang, Yaping; Peng, Yan-Qiong; Sha, Tao
2010-01-01
Colletotrichum interacts with numerous plant species overtly as symptomatic pathogens and cryptically as asymptomatic endophytes. It is not known whether these contrasting ecological modes are optional strategies expressed by individual Colletotrichum species or whether a species' ecology is explicitly pathogenic or endophytic. We explored this question by inferring relationships among 77 C. gloeosporioides s.l. strains isolated from asymptomatic leaves and from anthracnose lesions on leaves and fruits of Theobroma cacao (cacao) and other plants from Panamá. ITS and 5'-tef1 were used to assess diversity and to delineate operational taxonomic units for multilocus phylogenetic analysis. The ITS and 5'-tef1 screens concordantly resolved four strongly supported lineages, clades A-D: Clade A includes the ex type of C. gloeosporioides, clade B includes the ex type ITS sequence of C. boninense, and clades C and D are unidentified. The ITS yielded limited resolution and support within all clades, in particular the C. gloeosporioides clade (A), the focal lineage dealt with in this study. In contrast the 5'-tef1 screen differentiated nine distinctive haplotype subgroups within the C. gloeosporioides clade that were concordant with phylogenetic terminals resolved in a five-locus nuclear phylogeny. Among these were two phylogenetic species associated with symptomatic infections specific to either cacao or mango and five phylogenetic species isolated principally as asymptomatic infections from cacao and other plant hosts. We formally describe two new species, C. tropicale and C. ignotum, that are frequent asymptomatic associates of cacao and other Neotropical plant species, and epitypify C. theobromicola, which is associated with foliar and fruit anthracnose lesions of cacao. Asymptomatic Colletotrichum strains isolated from cacao plants grown in China included six distinct C. gloeosporioides clade taxa, only one of which is known to occur in the Neotropics.
Molecular phylogeny and a new Iranian species of Caudospora (Sydowiellaceae, Diaporthales).
Voglmayr, Hermann; Mehrabi, Mehdi
2018-05-02
For the first time, molecular phylogenetic data on the peculiar diaporthalean genus Caudospora are available. Macro- and microscopic morphology and phylogenetic multilocus analyses of partial nuc SSU-ITS-LSU rDNA, cal, ms204, rpb1, rpb2, tef1 and tub2 sequences revealed two distinct species of Caudospora, which are described and illustrated by light and scanning electron microscopy. Caudospora iranica is described as a new species from corticated dead twigs of Quercus sp. collected in Iran. It differs from the generic type, C. taleola, mainly by coarsely verrucose ascospores. The asexual morph of C. taleola on natural substrate is described and illustrated. Caudospora taleola is neotypified, and it is recorded from Iran for the first time. Phylogenetic analyses of a multigene matrix containing a representative selection of Diaporthales from four loci (ITS, LSU rDNA, rpb2 and tef1) revealed a placement of Caudospora within Sydowiellaceae.
Lerner, Heather R L; Meyer, Matthias; James, Helen F; Hofreiter, Michael; Fleischer, Robert C
2011-11-08
Evolutionary theory has gained tremendous insight from studies of adaptive radiations. High rates of speciation, morphological divergence, and hybridization, combined with low sequence variability, however, have prevented phylogenetic reconstruction for many radiations. The Hawaiian honeycreepers are an exceptional adaptive radiation, with high phenotypic diversity and speciation that occurred within the geologically constrained setting of the Hawaiian Islands. Here we analyze a new data set of 13 nuclear loci and pyrosequencing of mitochondrial genomes that resolves the Hawaiian honeycreeper phylogeny. We show that they are a sister taxon to Eurasian rosefinches (Carpodacus) and probably came to Hawaii from Asia. We use island ages to calibrate DNA substitution rates, which vary substantially among gene regions, and calculate divergence times, showing that the radiation began roughly when the oldest of the current large Hawaiian Islands (Kauai and Niihau) formed, ~5.7 million years ago (mya). We show that most of the lineages that gave rise to distinctive morphologies diverged after Oahu emerged (4.0-3.7 mya) but before the formation of Maui and adjacent islands (2.4-1.9 mya). Thus, the formation of Oahu, and subsequent cycles of colonization and speciation between Kauai and Oahu, played key roles in generating the morphological diversity of the extant honeycreepers. Copyright © 2011 Elsevier Ltd. All rights reserved.
McGowen, Michael R
2011-09-01
Oceanic dolphins (Delphinidae) are the product of a rapid radiation that yielded ∼36 extant species of small to medium-sized cetaceans that first emerged in the Late Miocene. Although they are a charismatic group of organisms that have become poster children for marine conservation, many phylogenetic relationships within Delphinidae remain elusive due to the slow molecular evolution of the group and the difficulty of resolving short branches from successive cladogenic events. Here I combine existing and newly generated sequences from four mitochondrial (mt) genes and 20 nuclear (nu) genes to reconstruct a well-supported phylogenetic hypothesis for Delphinidae. This study compares maximum-likelihood and Bayesian inference methods of several data sets including mtDNA, combined nuDNA, gene trees of individual nuDNA loci, and concatenated mtDNA+nuDNA. In addition, I contrast these standard phylogenetic analyses with the species tree reconstruction method of Bayesian concordance analysis (BCA). Despite finding discordance between mtDNA and individual nuDNA loci, the concatenated matrix recovers a completely resolved and robustly supported phylogeny that is also broadly congruent with BCA trees. This study strongly supports groupings such as Delphininae, Lissodelphininae, Globicephalinae, Sotalia+Delphininae, Steno+Orcaella+Globicephalinae, and Leucopleurus acutus, Lagenorhynchus albirostris, and Orcinus orca as basal delphinid taxa. Copyright © 2011 Elsevier Inc. All rights reserved.
N.J. Brazee; D.L. Lindner
2013-01-01
Phellinus sensu lato (s.l.) is a complex of segregate genera that act as aggressive pathogens of woody plants. Nearly all of the genera in this complex have unresolved taxonomies, including Porodaedalea, which is one of the most important trunk rot pathogens of coniferous trees throughout the northern hemisphere. In an attempt...
Kotsakiozi, Panayiota; Jablonski, Daniel; Ilgaz, Çetin; Kumlutaş, Yusuf; Avcı, Aziz; Meiri, Shai; Itescu, Yuval; Kukushkin, Oleg; Gvoždík, Václav; Scillitani, Giovanni; Roussos, Stephanos A; Jandzik, David; Kasapidis, Panagiotis; Lymberakis, Petros; Poulakakis, Nikos
2018-08-01
Kotschy's Gecko, Mediodactylus kotschyi, is a small gecko native to southeastern Europe and the Levant. It displays great morphological variation with a large number of morphologically recognized subspecies. However, it has been suggested that it constitutes a species complex of several yet unrecognized species. In this study, we used multilocus sequence data (three mitochondrial and three nuclear gene fragments) to estimate the phylogenetic relationships of 174 specimens from 129 sampling localities, covering a substantial part of the distribution range of the species. Our results revealed high genetic diversity of M. kotschyi populations and contributed to our knowledge about the phylogenetic relationships and the estimation of the divergence times between them. Diversification within M. kotschyi began approximately 15 million years ago (Mya) in the Middle Miocene, whereas the diversification within most of the major clades has occurred in the last 5 Mya. Species delimitation analysis suggests there exist five species within the complex, and we propose to tentatively recognize the following taxa as full species: M. kotschyi (mainland Balkans, most of Aegean islands, and Italy), M. orientalis (Levant, Cyprus, southern Anatolia, and south-eastern Aegean islands), M. danilewskii (Black Sea region and south-western Anatolia), M. bartoni (Crete), and M. oertzeni (southern Dodecanese Islands). This newly recognized diversity underlines the complex biogeographical history of the Eastern Mediterranean region. Copyright © 2018 Elsevier Inc. All rights reserved.
Medina, Cintia Débora; Avila, Luciano Javier; Sites, Jack Walter; Santos, Juan; Morando, Mariana
2018-03-01
We present different approaches to a multi-locus phylogeny for the Liolaemus elongatus-kriegi group, including almost all species and recognized lineages. We sequenced two mitochondrial and five nuclear gene regions for 123 individuals from 35 taxa, and compared relationships resolved from concatenated and species tree methods. The L. elongatus-kriegi group was inferred as monophyletic in three of the five analyses (concatenated mitochondrial, concatenated mitochondrial + nuclear gene trees, and SVD quartet species tree). The mitochondrial gene tree resolved four haploclades, three corresponding to the previously recognized complexes: L. elongatus, L. kriegi and L. petrophilus complexes, and the L. punmahuida group. The BEAST species tree approach included the L. punmahuida group within the L. kriegi complex, but the SVD quartet method placed it as sister to the L. elongatus-kriegi group. BEAST inferred species of the L. elongatus and L. petrophilus complexes as one clade, while SVDquartet inferred these two complexes as monophyletic (although with no statistical support for the L. petrophilus complex). The species tree approach also included the L. punmahuida group as part of the L. elongatus-kriegi group. Our study provides detailed multilocus phylogenetic hypotheses for the L. elongatus-kriegi group, and we discuss possible reasons for differences in the concatenation and species tree methods. Copyright © 2017 Elsevier Inc. All rights reserved.
Multilocus sequence analysis of phytopathogenic species of the genus Streptomyces
USDA-ARS's Scientific Manuscript database
The identification and classification of species within the genus Streptomyces is difficult because there are presently 576 validly described species and this number increases every year. The value of the application of multilocus sequence analysis scheme to the systematics of Streptomyces species h...
Crottini, Angelica; Dordel, Janina; Köhler, Jörn; Glaw, Frank; Schmitz, Andreas; Vences, Miguel
2009-10-01
A phylogeny for 29 species of scincine lizards from Madagascar, based on 3693 bp of six mitochondrial and five nuclear genes, revealed multiple parallel evolution of adaptations for a burrowing life, and unexpected relationships of the monotypic genera Androngo and Cryptoscincus. Androngo trivittatus was sister to Pygomeles braconnieri, and Cryptoscincus minimus was deeply nested within the genus Paracontias, all of these being fossorial taxa with elongated bodies and partly or fully reduced limbs. To account for these results, we place Cryptoscincus as a junior synonym of Paracontias, and discuss possible taxonomic consequences that may affect the status of Androngo, once additional data become available.
Welker, Cassiano A D; Souza-Chies, Tatiana T; Longhi-Wagner, Hilda M; Peichoto, Myriam Carolina; McKain, Michael R; Kellogg, Elizabeth A
2016-06-01
Species delimitation is a vital issue concerning evolutionary biology and conservation of biodiversity. However, it is a challenging task for several reasons, including the low interspecies variability of markers currently used in phylogenetic reconstructions and the occurrence of reticulate evolution and polyploidy in many lineages of flowering plants. The first phylogeny of the grass genus Eriochrysis is presented here, focusing on the New World species, in order to examine its relationships to other genera of the subtribe Saccharinae/tribe Andropogoneae and to define the circumscriptions of its taxonomically complicated species. Molecular cloning and sequencing of five regions of four low-copy nuclear genes (apo1, d8, ep2-ex7 and ep2-ex8, kn1) were performed, as well as complete plastome sequencing. Trees were reconstructed using maximum parsimony, maximum likelihood, and Bayesian inference analyses. The present phylogenetic analyses indicate that Eriochrysis is monophyletic and the Old World E. pallida is sister to the New World species. Subtribe Saccharinae is polyphyletic, as is the genus Eulalia. Based on nuclear and plastome sequences plus morphology, we define the circumscriptions of the New World species of Eriochrysis: E. laxa is distinct from E. warmingiana, and E. villosa is distinct from E. cayennensis. Natural hybrids occur between E. laxa and E. villosa. The hybrids are probably tetraploids, based on the number of paralogues in the nuclear gene trees. This is the first record of a polyploid taxon in the genus Eriochrysis. Some incongruities between nuclear genes and plastome analyses were detected and are potentially caused by incomplete lineage sorting and/or ancient hybridization. The set of low-copy nuclear genes used in this study seems to be sufficient to resolve phylogenetic relationships and define the circumscriptions of other species complexes in the grass family and relatives, even in the presence of polyploidy and reticulate evolution. 
Complete plastome sequencing is also a promising tool for phylogenetic inference. Copyright © 2016 Elsevier Inc. All rights reserved.
Holmes, Anne; Allison, Lesley; Ward, Melissa; Dallman, Timothy J; Clark, Richard; Fawkes, Angie; Murphy, Lee; Hanson, Mary
2015-11-01
Detailed laboratory characterization of Escherichia coli O157 is essential to inform epidemiological investigations. This study assessed the utility of whole-genome sequencing (WGS) for outbreak detection and epidemiological surveillance of E. coli O157, and the data were used to identify discernible associations between genotypes and clinical outcomes. One hundred five E. coli O157 strains isolated over a 5-year period from human fecal samples in Lothian, Scotland, were sequenced with the Ion Torrent Personal Genome Machine. A total of 8,721 variable sites in the core genome were identified among the 105 isolates; 47% of the single nucleotide polymorphisms (SNPs) were attributable to six "atypical" E. coli O157 strains and included recombinant regions. Phylogenetic analyses showed that WGS correlated well with the epidemiological data. Epidemiological links existed between cases whose isolates differed by three or fewer SNPs. WGS also correlated well with multilocus variable-number tandem repeat analysis (MLVA) typing data, with only three discordant results observed, all among isolates from cases not known to be epidemiologically related. WGS produced a better-supported, higher-resolution phylogeny than MLVA, confirming that the method is more suitable for epidemiological surveillance of E. coli O157. A combination of in silico analyses (VirulenceFinder, ResFinder, and local BLAST searches) were used to determine stx subtypes, multilocus sequence types (15 loci), and the presence of virulence and acquired antimicrobial resistance genes. There was a high level of correlation between the WGS data and our routine typing methods, although some discordant results were observed, mostly related to the limitation of short sequence read assembly. The data were used to identify sublineages and clades of E. coli O157, and when they were correlated with the clinical outcome data, they showed that one clade, Ic3, was significantly associated with severe disease. 
Together, the results show that WGS data can provide higher resolution of the relationships between E. coli O157 isolates than that provided by MLVA. The method has the potential to streamline the laboratory workflow and provide detailed information for the clinical management of patients and public health interventions. Copyright © 2015, Holmes et al.
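The finding that epidemiologically linked cases differed by three or fewer SNPs can be pictured with a small pairwise-distance sketch. This is a simplified illustration only: the isolate names and sequences are invented, and ignoring positions with an ambiguous "N" call is an assumption, not a rule stated in the abstract.

```python
from itertools import combinations
from typing import Dict, List, Tuple


def snp_distance(a: str, b: str) -> int:
    """Count differing positions between two equal-length core-genome SNP strings."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    # Assumption: positions with an ambiguous call (N) in either isolate are skipped.
    return sum(1 for x, y in zip(a, b) if x != y and "N" not in (x, y))


def linked_pairs(isolates: Dict[str, str], threshold: int = 3) -> List[Tuple[str, str, int]]:
    """Return isolate pairs whose SNP distance is at or below the threshold."""
    pairs = []
    for (n1, s1), (n2, s2) in combinations(sorted(isolates.items()), 2):
        d = snp_distance(s1, s2)
        if d <= threshold:
            pairs.append((n1, n2, d))
    return pairs


# Hypothetical toy isolates; real data would cover thousands of variable sites.
isolates = {
    "iso1": "ACGTACGTAC",
    "iso2": "ACGTACGTAT",
    "iso3": "TCGAACCTAG",
}
links = linked_pairs(isolates)
```

In this toy set only iso1 and iso2 fall within the threshold; iso3 differs from both at four sites.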
Allison, Lesley; Ward, Melissa; Dallman, Timothy J.; Clark, Richard; Fawkes, Angie; Murphy, Lee; Hanson, Mary
2015-01-01
Detailed laboratory characterization of Escherichia coli O157 is essential to inform epidemiological investigations. This study assessed the utility of whole-genome sequencing (WGS) for outbreak detection and epidemiological surveillance of E. coli O157, and the data were used to identify discernible associations between genotypes and clinical outcomes. One hundred five E. coli O157 strains isolated over a 5-year period from human fecal samples in Lothian, Scotland, were sequenced with the Ion Torrent Personal Genome Machine. A total of 8,721 variable sites in the core genome were identified among the 105 isolates; 47% of the single nucleotide polymorphisms (SNPs) were attributable to six “atypical” E. coli O157 strains and included recombinant regions. Phylogenetic analyses showed that WGS correlated well with the epidemiological data. Epidemiological links existed between cases whose isolates differed by three or fewer SNPs. WGS also correlated well with multilocus variable-number tandem repeat analysis (MLVA) typing data, with only three discordant results observed, all among isolates from cases not known to be epidemiologically related. WGS produced a better-supported, higher-resolution phylogeny than MLVA, confirming that the method is more suitable for epidemiological surveillance of E. coli O157. A combination of in silico analyses (VirulenceFinder, ResFinder, and local BLAST searches) were used to determine stx subtypes, multilocus sequence types (15 loci), and the presence of virulence and acquired antimicrobial resistance genes. There was a high level of correlation between the WGS data and our routine typing methods, although some discordant results were observed, mostly related to the limitation of short sequence read assembly. The data were used to identify sublineages and clades of E. coli O157, and when they were correlated with the clinical outcome data, they showed that one clade, Ic3, was significantly associated with severe disease. 
Together, the results show that WGS data can provide higher resolution of the relationships between E. coli O157 isolates than that provided by MLVA. The method has the potential to streamline the laboratory workflow and provide detailed information for the clinical management of patients and public health interventions. PMID:26354815
USDA-ARS's Scientific Manuscript database
Flavobacterium psychrophilum is an important pathogen of salmonids worldwide. Multilocus sequence typing (MLST) has identified a recombinogenic population structure from which emerged a few epidemic clonal complexes particularly threatening for salmonid aquaculture. To date, MLST genotypes for this ...
Molecular epidemiology, phylogeny and evolution of Candida albicans.
McManus, Brenda A; Coleman, David C
2014-01-01
A small number of Candida species form part of the normal microbial flora of mucosal surfaces in humans and may give rise to opportunistic infections when host defences are impaired. Candida albicans is by far the most prevalent commensal and pathogenic Candida species. Several different molecular typing approaches including multilocus sequence typing, multilocus microsatellite typing and DNA fingerprinting using C. albicans-specific repetitive sequence-containing DNA probes have yielded a wealth of information regarding the epidemiology and population structure of this species. Such studies revealed that the C. albicans population structure consists of multiple major and minor clades, some of which exhibit geographical or phenotypic enrichment and that C. albicans reproduction is predominantly clonal. Despite this, losses of heterozygosity by recombination, the existence of a parasexual cycle, toleration of a wide range of aneuploidies and the recent description of viable haploid strains have all demonstrated the extensive plasticity of the C. albicans genome. Recombination and gross chromosomal rearrangements are more common under stressful environmental conditions, and have played a significant role in the evolution of this opportunistic pathogen. Surprisingly, Candida dubliniensis, the closest relative of C. albicans exhibits more karyotype variability than C. albicans, but is significantly less adaptable to unfavourable environments. This disparity most likely reflects the evolutionary processes that occurred during or soon after the divergence of both species from their common ancestor. Whilst C. dubliniensis underwent significant gene loss and pseudogenisation, C. albicans expanded gene families considered to be important in virulence. 
It is likely that technological developments in whole genome sequencing and data analysis in coming years will facilitate its routine use for population structure, epidemiological investigations, and phylogenetic analyses of Candida species. These are likely to reveal more minor C. albicans clades and to enhance our understanding of the population biology of this versatile organism. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.
Tomasello, Salvatore; Álvarez, Inés; Vargas, Pablo; Oberprieler, Christoph
2015-01-01
The present study provides results of multi-species coalescent species tree analyses of DNA sequences sampled from multiple nuclear and plastid regions to infer the phylogenetic relationships among the members of the subtribe Leucanthemopsidinae (Compositae, Anthemideae). The subtribe comprises the annual Castrilanthemum debeauxii (Degen, Hervier & É.Rev.) Vogt & Oberp., one of the rarest flowering plant species of the Iberian Peninsula, two other unispecific genera (Hymenostemma, Prolongoa), and the polyploid complex of the genus Leucanthemopsis. Based on sequence information from two single- to low-copy nuclear regions (C16, D35, characterised by Chapman et al. (2007)), the multi-copy region of the nrDNA internal transcribed spacer regions ITS1 and ITS2, and two intergenic spacer regions of the cpDNA, gene trees were reconstructed using Bayesian inference methods. For the reconstruction of a multi-locus species tree we applied three different methods: (a) analysis of concatenated sequences using Bayesian inference (MrBayes), (b) a tree reconciliation approach by minimizing the number of deep coalescences (PhyloNet), and (c) a coalescent-based species-tree method in a Bayesian framework (*BEAST). All three species tree reconstruction methods unequivocally support the close relationship of the subtribe with the hitherto unclassified genus Phalacrocarpum, the sister-group relationship of Castrilanthemum with the three remaining genera of the subtribe, and the further sister-group relationship of the clade of Hymenostemma+Prolongoa with a monophyletic genus Leucanthemopsis. 
Dating of the *BEAST phylogeny supports the long-lasting (Early Miocene, 15-22 Ma) taxonomic independence of the Castrilanthemum lineage and suggests that the switch from the plesiomorphic perennial to the apomorphic annual life-form assumed for this lineage may have occurred no earlier than the Pliocene (3 Ma), when the establishment of a Mediterranean climate with summer droughts triggered evolution towards annuality. Copyright © 2014 Elsevier Inc. All rights reserved.
Visualizing phylogenetic tree landscapes.
Wilgenbusch, James C; Huang, Wen; Gallivan, Kyle A
2017-02-02
Genomic-scale sequence alignments are increasingly used to infer phylogenies in order to better understand the processes and patterns of evolution. Different partitions within these new alignments (e.g., genes, codon positions, and structural features) often favor hundreds if not thousands of competing phylogenies. Summarizing and comparing phylogenies obtained from multi-source data sets using current consensus tree methods discards valuable information and can disguise potential methodological problems. Discovery of efficient and accurate dimensionality reduction methods that display in 2 or 3 dimensions the relationship among these competing phylogenies will help practitioners diagnose the limits of current evolutionary models and potential problems with phylogenetic reconstruction methods when analyzing large multi-source data sets. We introduce several dimensionality reduction methods to visualize in 2 and 3 dimensions the relationship among competing phylogenies obtained from gene partitions found in three mid- to large-size mitochondrial genome alignments. We test the performance of these dimensionality reduction methods by applying several goodness-of-fit measures. The intrinsic dimensionality of each data set is also estimated to determine whether projections in 2 and 3 dimensions can be expected to reveal meaningful relationships among trees from different data partitions. Several new approaches to aid in the comparison of different phylogenetic landscapes are presented. Curvilinear Components Analysis (CCA) and a stochastic gradient descent (SGD) optimization method give the best representation of the original tree-to-tree distance matrix for each of the three mitochondrial genome alignments and greatly outperformed the method currently used to visualize tree landscapes. The CCA + SGD method converged at least as fast as previously applied methods for visualizing tree landscapes. 
We demonstrate for all three mtDNA alignments that 3D projections significantly increase the fit between the tree-to-tree distances and can facilitate the interpretation of the relationship among phylogenetic trees. We demonstrate that the choice of dimensionality reduction method can significantly influence the spatial relationship among a large set of competing phylogenetic trees. We highlight the importance of selecting a dimensionality reduction method to visualize large multi-locus phylogenetic landscapes and demonstrate that 3D projections of mitochondrial tree landscapes better capture the relationship among the trees being compared.
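The projections discussed above start from a tree-to-tree distance matrix. The sketch below shows one way such a matrix could be built, using the Robinson-Foulds distance on unrooted topologies. The abstract does not name the tree metric used, so Robinson-Foulds is an assumption here, as are the nested-tuple tree encoding and the toy trees; the dimensionality reduction step (CCA + SGD) is not reproduced.

```python
from typing import FrozenSet, List, Set, Tuple, Union

# A tree is a leaf name or a tuple of subtrees (hypothetical encoding).
Tree = Union[str, Tuple["Tree", ...]]


def leaves(tree: Tree) -> FrozenSet[str]:
    """Return the set of leaf names below a node."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaves(child) for child in tree))


def splits(tree: Tree) -> Set[FrozenSet[str]]:
    """Collect non-trivial bipartitions, stored as the side lacking a reference leaf."""
    all_leaves = leaves(tree)
    reference = min(all_leaves)
    found: Set[FrozenSet[str]] = set()

    def visit(node: Tree) -> FrozenSet[str]:
        if isinstance(node, str):
            return frozenset([node])
        below = frozenset().union(*(visit(child) for child in node))
        side = all_leaves - below if reference in below else below
        # Skip trivial splits (a single leaf versus the rest) and the root.
        if 1 < len(side) < len(all_leaves) - 1:
            found.add(side)
        return below

    visit(tree)
    return found


def rf_distance(t1: Tree, t2: Tree) -> int:
    """Robinson-Foulds distance: number of splits present in exactly one tree."""
    return len(splits(t1) ^ splits(t2))


def distance_matrix(trees: List[Tree]) -> List[List[int]]:
    return [[rf_distance(a, b) for b in trees] for a in trees]


# Three hypothetical gene trees on the same five taxa.
trees = [
    ((("A", "B"), "C"), ("D", "E")),
    ((("A", "C"), "B"), ("D", "E")),
    ((("A", "D"), "B"), ("C", "E")),
]
matrix = distance_matrix(trees)
```

A matrix like this is what a projection method would then embed in 2 or 3 dimensions, with goodness-of-fit measures comparing embedded distances against the original entries.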
Alström, Per; Barnes, Keith N; Olsson, Urban; Barker, F Keith; Bloomer, Paulette; Khan, Aleem Ahmed; Qureshi, Masood Ahmed; Guillaumet, Alban; Crochet, Pierre-André; Ryan, Peter G
2013-12-01
The Alaudidae (larks) is a large family of songbirds in the superfamily Sylvioidea. Larks are cosmopolitan, although species-level diversity is by far largest in Africa, followed by Eurasia, whereas Australasia and the New World have only one species each. The present study is the first comprehensive phylogeny of the Alaudidae. It includes 83.5% of all species and representatives from all recognised genera, and was based on two mitochondrial and three nuclear loci (in total 6.4 kbp, although not all loci were available for all species). In addition, a larger sample, comprising several subspecies of some polytypic species was analysed for one of the mitochondrial loci. There was generally good agreement in trees inferred from different loci, although some strongly supported incongruences were noted. The tree based on the concatenated multilocus data was overall well resolved and well supported by the data. We stress the importance of performing single gene as well as combined data analyses, as the latter may obscure significant incongruence behind strong nodal support values. The multilocus tree revealed many unpredicted relationships, including some non-monophyletic genera (Calandrella, Mirafra, Melanocorypha, Spizocorys). The tree based on the extended mitochondrial data set revealed several unexpected deep divergences between taxa presently treated as conspecific (e.g. within Ammomanes cinctura, Ammomanes deserti, Calandrella brachydactyla, Eremophila alpestris), as well as some shallow splits between currently recognised species (e.g. Certhilauda brevirostris-C. semitorquata-C. curvirostris; Calendulauda barlowi-C. erythrochlamys; Mirafra cantillans-M. javanica). Based on our results, we propose a revised generic classification, and comment on some species limits. We also comment on the extraordinary morphological adaptability in larks, which has resulted in numerous examples of parallel evolution (e.g. 
in Melanocorypha mongolica and Alauda leucoptera [both usually placed in Melanocorypha]; Ammomanopsis grayi and Ammomanes cinctura/deserti [former traditionally placed in Ammomanes]; Chersophilus duponti and Certhilauda spp.; Eremopterix hova [usually placed in Mirafra] and several Mirafra spp.), as well as both highly conserved plumages (e.g. within Mirafra) and strongly divergent lineages (e.g. Eremopterix hova vs. other Eremopterix spp.; Calandrella cinerea complex vs. Eremophila spp.; Eremalauda dunni vs. Chersophilus duponti; Melanocorypha mongolica and male M. yeltoniensis vs. other Melanocorypha spp. and female M. yeltoniensis). Sexual plumage dimorphism has evolved multiple times. Few groups of birds show the same level of disagreement between taxonomy based on morphology and phylogenetic relationships as inferred from DNA sequences. Copyright © 2013 Elsevier Inc. All rights reserved.
Ben Said, Mourad; Ben Asker, Alaa; Belkahia, Hanène; Ghribi, Raoua; Selmi, Rachid; Messadi, Lilia
2018-05-12
Anaplasma marginale, which is responsible for bovine anaplasmosis in tropical and subtropical regions, is a tick-borne obligate intraerythrocytic bacterium of cattle and wild ruminants. In Tunisia, information about the genetic diversity and phylogeny of A. marginale strains is limited to msp4 gene analysis. The purpose of this study was to investigate A. marginale isolates infecting 16 cattle located in different bioclimatic areas of northern Tunisia with single gene analysis and multilocus sequence typing methods on the basis of seven partial genes (dnaA, ftsZ, groEL, lipA, secY, recA and sucB). The single gene analysis confirmed the presence of different and novel heterogeneous A. marginale strains infecting cattle from the north of Tunisia. The concatenated sequence analysis showed phylogeographical resolution at the global level and showed that most of the Tunisian sequence types (STs) formed a cluster separate from a South African isolate and from all New World isolates and strains. By combining the characteristics of each single locus with those of the multilocus scheme, these results provide a more detailed understanding of the diversity and evolution of Tunisian A. marginale strains. Copyright © 2018 Elsevier GmbH. All rights reserved.
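The idea behind multilocus sequence typing, where each distinct combination of alleles across the loci defines a sequence type (ST), can be sketched as follows. The seven locus names come from the abstract; the isolate names, the sequences, and the locally assigned allele and ST numbers are hypothetical and do not correspond to any curated typing database.

```python
from typing import Dict, Tuple

LOCI = ["dnaA", "ftsZ", "groEL", "lipA", "secY", "recA", "sucB"]


def assign_sequence_types(isolates: Dict[str, Dict[str, str]]) -> Dict[str, int]:
    """Number alleles per locus and give each distinct allelic profile an ST."""
    allele_ids: Dict[str, Dict[str, int]] = {locus: {} for locus in LOCI}
    profile_ids: Dict[Tuple[int, ...], int] = {}
    result: Dict[str, int] = {}
    for name in sorted(isolates):
        profile = []
        for locus in LOCI:
            seq = isolates[name][locus]
            known = allele_ids[locus]
            # A sequence not seen before at this locus gets the next allele number.
            profile.append(known.setdefault(seq, len(known) + 1))
        key = tuple(profile)
        # A new combination of allele numbers gets the next ST number.
        result[name] = profile_ids.setdefault(key, len(profile_ids) + 1)
    return result


# Toy data: cow3 differs from the others at a single locus (recA).
base = {locus: "ACGT" for locus in LOCI}
isolates = {
    "cow1": dict(base),
    "cow2": dict(base),
    "cow3": {**base, "recA": "ACGA"},
}
sequence_types = assign_sequence_types(isolates)
```

A single-locus change is enough to produce a new ST, which is why combining loci gives finer resolution than any one gene alone.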
USDA-ARS's Scientific Manuscript database
Multi-locus sequence analysis has been demonstrated to be a useful tool for identification of Streptomyces species and was previously applied to phylogenetically differentiate the type strains of species pathogenic on potatoes (Solanum tuberosum L.). The ARS Culture Collection (NRRL) contains 43 str...
Zhang, Bin; He, Kai; Wan, Tao; Chen, Peng; Sun, Guozheng; Liu, Shaoying; Nguyen, Truong Son; Lin, Liangkong; Jiang, Xuelong
2016-12-01
Niviventer is a genus of white-bellied rats that are among the most common rodents in the Indo-Sundaic region. The taxonomy of the genus has undergone extensive revisions and remains controversial. The current phylogeny is unresolved and was developed primarily on the basis of mitochondrial genes. Identification is extremely difficult, and a large number of GenBank sequences seem to be problematic. We extensively sampled specimens of Niviventer in China and neighboring northern Vietnam, including topotypes of the most reported species (n = 6), subspecies (n = 8), and synonyms (n = 4). We estimated phylogenetic relationships on the basis of one mitochondrial and three nuclear genes, using concatenation and coalescent-based approaches. We also employed molecular species delimitation approaches to test the existence of cryptic and putative new species. Our phylogeny was finely resolved, especially for the N. confucianus-like species. Our data provided the first support for N. brahma and N. eha as sister species, an assignment that is congruent with their morphological similarities. Species delimitation analyses provided new insight into species diversity and systematics. Three geographic populations of N. confucianus and one of N. fulvescens were supported as genetically distinct in our species delimitation analyses, while three recognized species (N. coninga, N. huang, and N. lotipes) were not strongly supported as distinct. Our results suggested that several genetically distinct species may be contained within the species currently known as N. confucianus and N. fulvescens. In addition, the results of Bayesian Phylogenetics and Phylogeography (BPP) for N. coninga, N. huang, and N. lotipes indicated that either inter-specific gene flow had occurred or imperfect taxonomy was present. Morphological examinations and morphometric analyses are warranted to examine the molecular results.
Pereira, Anieli G; Sterli, Juliana; Moreira, Filipe R R; Schrago, Carlos G
2017-08-01
Despite their complex evolutionary history and the rich fossil record, the higher level phylogeny and historical biogeography of living turtles have not been investigated in a comprehensive and statistical framework. To tackle these issues, we assembled a large molecular dataset, maximizing both taxonomic and gene sampling. As different models provide alternative biogeographical scenarios, we have explicitly tested such hypotheses in order to reconstruct a robust biogeographical history of Testudines. We scanned publicly available databases for nucleotide sequences and composed a dataset comprising 13 loci for 294 living species of Testudines, which accounts for all living genera and 85% of their extant species diversity. Phylogenetic relationships and species divergence times were estimated using a thorough evaluation of fossil information as calibration priors. We then carried out the analysis of historical biogeography of Testudines in a fully statistical framework. Our study recovered the first large-scale phylogeny of turtles with well-supported relationships following the topology proposed by phylogenomic works. Our dating result consistently indicated that the origin of the main clades, Pleurodira and Cryptodira, occurred in the early Jurassic. The phylogenetic and historical biogeographical inferences permitted us to clarify how geological events affected the evolutionary dynamics of crown turtles. For instance, our analyses support the hypothesis that the breakup of Pangaea would have driven the divergence between the cryptodiran and pleurodiran lineages. The reticulated pattern in the ancestral distribution of the cryptodiran lineage suggests a complex biogeographic history for the clade, which was supposedly related to the complex paleogeographic history of Laurasia. On the other hand, the biogeographical history of Pleurodira indicated a tight correlation with the paleogeography of the Gondwanan landmasses. Copyright © 2017 Elsevier Inc. 
All rights reserved.
Reconstructing the backbone of the Saccharomycotina yeast phylogeny using genome-scale data
Shen, Xing-Xing; Zhou, Xiaofan; Kominek, Jacek; ...
2016-09-26
Understanding the phylogenetic relationships among the yeasts of the subphylum Saccharomycotina is a prerequisite for understanding the evolution of their metabolisms and ecological lifestyles. In the last two decades, the use of rDNA and multilocus data sets has greatly advanced our understanding of the yeast phylogeny, but many deep relationships remain unsupported. In contrast, phylogenomic analyses have involved relatively few taxa and lineages that were often selected with limited considerations for covering the breadth of yeast biodiversity. Here we used genome sequence data from 86 publicly available yeast genomes representing nine of the 11 known major lineages and 10 nonyeast fungal outgroups to generate a 1233-gene, 96-taxon data matrix. Species phylogenies reconstructed using two different methods (concatenation and coalescence) and two data matrices (amino acids or the first two codon positions) yielded identical and highly supported relationships between the nine major lineages. Aside from the lineage comprised by the family Pichiaceae, all other lineages were monophyletic. Most interrelationships among yeast species were robust across the two methods and data matrices. However, eight of the 93 internodes conflicted between analyses or data sets, including the placements of: the clade defined by species that have reassigned the CUG codon to encode serine, instead of leucine; the clade defined by a whole genome duplication; and the species Ascoidea rubescens. These phylogenomic analyses provide a robust roadmap for future comparative work across the yeast subphylum in the disciplines of taxonomy, molecular genetics, evolutionary biology, ecology, and biotechnology. To further this end, we have also provided a BLAST server to query the 86 Saccharomycotina genomes, which can be found at http://y1000plus.org/blast.
Reconstructing the Backbone of the Saccharomycotina Yeast Phylogeny Using Genome-Scale Data
Shen, Xing-Xing; Zhou, Xiaofan; Kominek, Jacek; Kurtzman, Cletus P.; Hittinger, Chris Todd; Rokas, Antonis
2016-01-01
Understanding the phylogenetic relationships among the yeasts of the subphylum Saccharomycotina is a prerequisite for understanding the evolution of their metabolisms and ecological lifestyles. In the last two decades, the use of rDNA and multilocus data sets has greatly advanced our understanding of the yeast phylogeny, but many deep relationships remain unsupported. In contrast, phylogenomic analyses have involved relatively few taxa and lineages that were often selected with limited considerations for covering the breadth of yeast biodiversity. Here we used genome sequence data from 86 publicly available yeast genomes representing nine of the 11 known major lineages and 10 nonyeast fungal outgroups to generate a 1233-gene, 96-taxon data matrix. Species phylogenies reconstructed using two different methods (concatenation and coalescence) and two data matrices (amino acids or the first two codon positions) yielded identical and highly supported relationships between the nine major lineages. Aside from the lineage comprised by the family Pichiaceae, all other lineages were monophyletic. Most interrelationships among yeast species were robust across the two methods and data matrices. However, eight of the 93 internodes conflicted between analyses or data sets, including the placements of: the clade defined by species that have reassigned the CUG codon to encode serine, instead of leucine; the clade defined by a whole genome duplication; and the species Ascoidea rubescens. These phylogenomic analyses provide a robust roadmap for future comparative work across the yeast subphylum in the disciplines of taxonomy, molecular genetics, evolutionary biology, ecology, and biotechnology. To further this end, we have also provided a BLAST server to query the 86 Saccharomycotina genomes, which can be found at http://y1000plus.org/blast. PMID:27672114
Reconstructing the backbone of the Saccharomycotina yeast phylogeny using genome-scale data
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shen, Xing-Xing; Zhou, Xiaofan; Kominek, Jacek
Understanding the phylogenetic relationships among the yeasts of the subphylum Saccharomycotina is a prerequisite for understanding the evolution of their metabolisms and ecological lifestyles. In the last two decades, the use of rDNA and multilocus data sets has greatly advanced our understanding of the yeast phylogeny, but many deep relationships remain unsupported. In contrast, phylogenomic analyses have involved relatively few taxa and lineages that were often selected with limited considerations for covering the breadth of yeast biodiversity. Here we used genome sequence data from 86 publicly available yeast genomes representing nine of the 11 known major lineages and 10 nonyeast fungal outgroups to generate a 1233-gene, 96-taxon data matrix. Species phylogenies reconstructed using two different methods (concatenation and coalescence) and two data matrices (amino acids or the first two codon positions) yielded identical and highly supported relationships between the nine major lineages. Aside from the lineage comprised by the family Pichiaceae, all other lineages were monophyletic. Most interrelationships among yeast species were robust across the two methods and data matrices. However, eight of the 93 internodes conflicted between analyses or data sets, including the placements of: the clade defined by species that have reassigned the CUG codon to encode serine, instead of leucine; the clade defined by a whole genome duplication; and the species Ascoidea rubescens. These phylogenomic analyses provide a robust roadmap for future comparative work across the yeast subphylum in the disciplines of taxonomy, molecular genetics, evolutionary biology, ecology, and biotechnology. To further this end, we have also provided a BLAST server to query the 86 Saccharomycotina genomes, which can be found at http://y1000plus.org/blast.
Rickettsia asembonensis Characterization by Multilocus Sequence Typing of Complete Genes, Peru.
Loyola, Steev; Flores-Mendoza, Carmen; Torre, Armando; Kocher, Claudine; Melendrez, Melanie; Luce-Fedrow, Alison; Maina, Alice N; Richards, Allen L; Leguia, Mariana
2018-05-01
While studying rickettsial infections in Peru, we detected Rickettsia asembonensis in fleas from domestic animals. We characterized 5 complete genomic regions (17kDa, gltA, ompA, ompB, and sca4) and conducted multilocus sequence typing and phylogenetic analyses. The molecular isolate from Peru is distinct from the original R. asembonensis strain from Kenya.
Simultaneous phylogeny reconstruction and multiple sequence alignment
Yue, Feng; Shi, Jian; Tang, Jijun
2009-01-01
Background: A phylogeny is the evolutionary history of a group of organisms. To date, sequence data is still the most used data type for phylogenetic reconstruction. Before any sequences can be used for phylogeny reconstruction, they must be aligned, and the quality of the multiple sequence alignment has been shown to affect the quality of the inferred phylogeny. At the same time, all the current multiple sequence alignment programs use a guide tree to produce the alignment, and experiments have shown that good guide trees can significantly improve the multiple alignment quality. Results: We devise a new algorithm to simultaneously align multiple sequences and search for the phylogenetic tree that leads to the best alignment. We also implemented the algorithm as a C program package, which can handle both DNA and protein data and can use a simple cost model as well as complex substitution matrices, such as PAM250 or BLOSUM62. The performance of the new method is compared with that of other popular multiple sequence alignment tools, including widely used programs such as ClustalW and T-Coffee. Experimental results suggest that this method has good performance in terms of both phylogeny accuracy and alignment quality. Conclusion: We present an algorithm to align multiple sequences and reconstruct the phylogenies that minimize the alignment score, which is based on an efficient algorithm to solve the median problems for three sequences. Our extensive experiments suggest that this method is very promising and can produce high quality phylogenies and alignments. PMID:19208110
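To make the notion of an alignment score under a "simple cost model" concrete, the sketch below computes the minimum global alignment cost between two sequences by dynamic programming (Needleman-Wunsch style, with linear gap costs). This is only a pairwise building block written in Python for illustration; it is not the authors' three-sequence median algorithm or their C package, and the mismatch and gap costs are assumed values.

```python
from typing import List


def alignment_cost(a: str, b: str, mismatch: int = 1, gap: int = 2) -> int:
    """Minimum global alignment cost under a simple linear-gap cost model."""
    # prev[j] holds the cost of aligning the processed prefix of a with b[:j].
    prev: List[int] = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            substitute = prev[j - 1] + (0 if a[i - 1] == b[j - 1] else mismatch)
            delete = prev[j] + gap       # gap in b
            insert = curr[j - 1] + gap   # gap in a
            curr.append(min(substitute, delete, insert))
        prev = curr
    return prev[-1]


identical = alignment_cost("ACGT", "ACGT")
one_gap = alignment_cost("ACGT", "AGT")
one_mismatch = alignment_cost("ACGT", "ACCT")
```

Swapping the fixed mismatch cost for a lookup in a substitution matrix would correspond to the more complex cost models mentioned in the abstract.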
Multilocus Species Trees Show the Recent Adaptive Radiation of the Mimetic Heliconius Butterflies
Kozak, Krzysztof M.; Wahlberg, Niklas; Neild, Andrew F. E.; Dasmahapatra, Kanchon K.; Mallet, James; Jiggins, Chris D.
2015-01-01
Müllerian mimicry among Neotropical Heliconiini butterflies is an excellent example of natural selection, associated with the diversification of a large continental-scale radiation. Some of the processes driving the evolution of mimicry rings are likely to generate incongruent phylogenetic signals across the assemblage, and thus pose a challenge for systematics. We use a data set of 22 mitochondrial and nuclear markers from 92% of species in the tribe, obtained by Sanger sequencing and de novo assembly of short read data, to re-examine the phylogeny of Heliconiini with both supermatrix and multispecies coalescent approaches, characterize the patterns of conflicting signal, and compare the performance of various methodological approaches to reflect the heterogeneity across the data. Despite the large extent of reticulate signal and strong conflict between markers, nearly identical topologies are consistently recovered by most of the analyses, although the supermatrix approach failed to reflect the underlying variation in the history of individual loci. However, the supermatrix represents a useful approximation where multiple rare species represented by short sequences can be incorporated easily. The first comprehensive, time-calibrated phylogeny of this group is used to test the hypotheses of a diversification rate increase driven by the dramatic environmental changes in the Neotropics over the past 23 myr, or changes caused by diversity-dependent effects on the rate of diversification. We find that the rate of diversification has increased on the branch leading to the presently most species-rich genus Heliconius, but the change occurred gradually and cannot be unequivocally attributed to a specific environmental driver. Our study provides comprehensive comparison of philosophically distinct species tree reconstruction methods and provides insights into the diversification of an important insect radiation in the most biodiverse region of the planet. PMID:25634098
Huang, Chih-Wei; Lee, Yen-Chen; Lin, Si-Min; Wu, Wen-Lung
2014-01-01
Abstract Aegista subchinensis (Möllendorff, 1884) is a widely distributed land snail species with morphological variation and endemic to Taiwan. Three genetic markers (partial sequence of the mitochondrial cytochrome c oxidase subunit I [COI], the 16S rDNA and the nuclear internal transcribed spacer 2 [ITS2]) were analysed to infer phylogenetic relationships and genetic divergence of closely related species of the genus Aegista, Aegista vermis (Reeve, 1852) and Aegista oculus (Pfeiffer, 1850). A new species from Aegista subchinensis has been recognized on the basis of phylogenetic and morphological evidences. The nominal new species, Aegista diversifamilia sp. n. is distinguished from Aegista subchinensis (Möllendorff, 1884) by its larger shell size, aperture and apex angle; wider umbilicus and flatter shell shape. The northernmost distribution of Aegista diversifamilia sp. n. is limited by the Lanyang River, which is presumed to mark the geographic barrier between Aegista diversifamilia sp. n. and Aegista subchinensis. PMID:25349506
Sandoval-Denis, Marcelo; Sutton, Deanna A.; Cano-Lira, José F.; Fothergill, Annette W.; Wiederhold, Nathan P.; Guarro, Josep
2014-01-01
A set of 73 isolates of the emerging fungus Trichoderma isolated from human and animal clinical specimens were characterized morphologically and molecularly using a multilocus sequence analysis that included the internal transcribed spacer (ITS) regions of the nuclear ribosomal DNA and fragments of the translation elongation factor 1 alpha (Tef1), endochitinase CHI18-5 (Chi18-5), and actin 1 (Act1) genes. The most frequent species was Trichoderma longibrachiatum (26%), followed by Trichoderma citrinoviride (18%), the Hypocrea lixii/Trichoderma harzianum species complex (15%), the newly described species Trichoderma bissettii (12%), and Trichoderma orientale (11%). The most common anatomical sites of isolation in human clinical specimens were the respiratory tract (40%), followed by deep tissue (30%) and superficial tissues (26%), while all the animal-associated isolates were obtained from superficial tissue samples. Susceptibilities of the isolates to eight antifungal drugs in vitro showed mostly high MICs, except for voriconazole and the echinocandins. PMID:24719448
Hinsinger, Damien Daniel; Basak, Jolly; Gaudeul, Myriam; Cruaud, Corinne; Bertolino, Paola; Frascaria-Lacoste, Nathalie; Bousquet, Jean
2013-01-01
The cosmopolitan genus Fraxinus, which comprises about 40 species of temperate trees and shrubs occupying various habitats in the Northern Hemisphere, represents a useful model to study speciation in long-lived angiosperms. We used nuclear external transcribed spacers (nETS), phantastica gene sequences, and two chloroplast loci (trnH-psbA and rpl32-trnL) in combination with previously published and newly obtained nITS sequences to produce a time-calibrated multi-locus phylogeny of the genus. We then inferred the biogeographic history and evolution of floral morphology. An early dispersal event could be inferred from North America to Asia during the Oligocene, leading to the diversification of the section Melioides sensu lato. Another intercontinental dispersal originating from the Eurasian section of Fraxinus could be dated from the Miocene and resulted in the speciation of F. nigra in North America. In addition, vicariance was inferred to account for the distribution of the other Old World species (sections Sciadanthus, Fraxinus and Ornus). Geographic speciation likely involving dispersal and vicariance could also be inferred from the phylogenetic grouping of geographically close taxa. Molecular dating suggested that the initial divergence of the taxonomical sections occurred during the middle and late Eocene and Oligocene periods, whereas diversification within sections occurred mostly during the late Oligocene and Miocene, which is consistent with the climate warming and accompanying large distributional changes observed during these periods. These various results underline the importance of dispersal and vicariance in promoting geographic speciation and diversification in Fraxinus. Similarities in life history, reproductive and demographic attributes as well as geographical distribution patterns suggest that many other temperate trees should exhibit similar speciation patterns. 
On the other hand, the observed parallel evolution and reversions in floral morphology would imply a major influence of environmental pressure. The phylogeny obtained and its biogeographical implications should facilitate future studies on the evolution of complex adaptive characters, such as habitat preference, and their possible roles in promoting divergent evolution in trees. PMID:24278282
Phylogeny and temporal diversification of darters (Percidae: Etheostomatinae).
Near, Thomas J; Bossu, Christen M; Bradburd, Gideon S; Carlson, Rose L; Harrington, Richard C; Hollingsworth, Phillip R; Keck, Benjamin P; Etnier, David A
2011-10-01
Discussions aimed at resolution of the Tree of Life are most often focused on the interrelationships of major organismal lineages. In this study, we focus on the resolution of some of the most apical branches in the Tree of Life through exploration of the phylogenetic relationships of darters, a species-rich clade of North American freshwater fishes. With a near-complete taxon sampling of close to 250 species, we aim to investigate strategies for efficient multilocus data sampling and the estimation of divergence times using relaxed-clock methods when a clade lacks a fossil record. Our phylogenetic data set comprises a single mitochondrial DNA (mtDNA) gene and two nuclear genes sampled from 245 of the 248 darter species. This dense sampling allows us to determine if a modest amount of nuclear DNA sequence data can resolve relationships among closely related animal species. Darters lack a fossil record to provide age calibration priors in relaxed-clock analyses. Therefore, we use a near-complete species-sampled phylogeny of the perciform clade Centrarchidae, which has a rich fossil record, to assess two distinct strategies of external calibration in relaxed-clock divergence time estimates of darters: using ages inferred from the fossil record and molecular evolutionary rate estimates. Comparison of Bayesian phylogenies inferred from mtDNA and nuclear genes reveals that heterospecific mtDNA is present in approximately 12.5% of all darter species. 
We identify three patterns of mtDNA introgression in darters: proximal mtDNA transfer, which involves the transfer of mtDNA among extant and sympatric darter species, indeterminate introgression, which involves the transfer of mtDNA from a lineage that cannot be confidently identified because the introgressed haplotypes are not clearly referable to mtDNA haplotypes in any recognized species, and deep introgression, which is characterized by species diversification within a recipient clade subsequent to the transfer of heterospecific mtDNA. The results of our analyses indicate that DNA sequences sampled from single-copy nuclear genes can provide appreciable phylogenetic resolution for closely related animal species. A well-resolved near-complete species-sampled phylogeny of darters was estimated with Bayesian methods using a concatenated mtDNA and nuclear gene data set with all identified heterospecific mtDNA haplotypes treated as missing data. The relaxed-clock analyses resulted in very similar posterior age estimates across the three sampled genes and methods of calibration and therefore offer a viable strategy for estimating divergence times for clades that lack a fossil record. In addition, an informative rank-free clade-based classification of darters that preserves the rich history of nomenclature in the group and provides formal taxonomic communication of darter clades was constructed using the mtDNA and nuclear gene phylogeny. On the whole, the appeal of mtDNA for phylogeny inference among closely related animal species is diminished by the observations of extensive mtDNA introgression and by finding appreciable phylogenetic signal in a modest sampling of nuclear genes in our phylogenetic analyses of darters.
Ye, Wenwu; Wang, Yang; Shen, Danyu; Li, Delong; Pu, Tianhuizi; Jiang, Zide; Zhang, Zhengguang; Zheng, Xiaobo; Tyler, Brett M; Wang, Yuanchao
2016-07-01
On the basis of its downy mildew-like morphology, the litchi downy blight pathogen was previously named Peronophythora litchii. Recently, however, it was proposed to transfer this pathogen to Phytophthora clade 4. To better characterize this unusual oomycete species and important fruit pathogen, we obtained the genome sequence of Phytophthora litchii and compared it to those from other oomycete species. P. litchii has a small genome with tightly spaced genes. On the basis of a multilocus phylogenetic analysis, the placement of P. litchii in the genus Phytophthora is strongly supported. Effector proteins predicted included 245 RxLR, 30 necrosis-and-ethylene-inducing protein-like, and 14 crinkler proteins. The typical motifs, phylogenies, and activities of these effectors were typical for a Phytophthora species. However, like the genome features of the analyzed downy mildews, P. litchii exhibited a streamlined genome with a relatively small number of genes in both core and species-specific protein families. The low GC content and slight codon preferences of P. litchii sequences were similar to those of the analyzed downy mildews and a subset of Phytophthora species. Taken together, these observations suggest that P. litchii is a Phytophthora pathogen that is in the process of acquiring downy mildew-like genomic and morphological features. Thus P. litchii may provide a novel model for investigating morphological development and genomic adaptation in oomycete pathogens.
Delorme, Christine; Legravet, Nicolas; Jamet, Emmanuel; Hoarau, Caroline; Alexandre, Bolotin; El-Sharoud, Walid M; Darwish, Mohamed S; Renault, Pierre
2017-02-02
We analyzed 178 Streptococcus thermophilus strains isolated from diverse products, from around the world, over a 60-year period with a new multilocus sequence typing (MLST) scheme. This collection included isolates from two traditional cheese-making sites with different starter-use practices, in sampling campaigns carried out over a three-year period. The nucleotide diversity of the S. thermophilus population was limited, but 116 sequence types (ST) were identified. Phylogenetic analysis of the concatenated sequences of the six housekeeping genes revealed the existence of groups confirmed by eBURST analysis. Deeper analyses performed on 25 strains by CRISPR and whole-genome analysis showed that phylogenies obtained by MLST and whole-genome analysis were in agreement but differed from that inferred by CRISPR analysis. Strains isolated from traditional products could cluster in specific groups indicating their origin, but also be mixed in groups containing industrial starter strains. In the traditional cheese-making sites, we found that S. thermophilus persisted on dairy equipment, but that occasionally added starter strains may become dominant. This underlines the impact of starter use, which may reshape S. thermophilus populations, including in traditional products. This new MLST scheme thus provides a framework for analyses of S. thermophilus populations and the management of its biodiversity. Copyright © 2016 Elsevier B.V. All rights reserved.
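The idea behind a sequence type in an MLST scheme such as the one above can be sketched in a few lines of Python: each strain is described by one allele number per housekeeping locus, and every distinct combination of alleles defines one sequence type. This is only an illustration under stated assumptions. The strain names and allele numbers are hypothetical, and ST numbers are assigned here in order of first appearance, whereas real schemes take them from a curated database.

```python
from typing import Dict, Tuple

# One allele number per locus; six loci, as in a six-gene MLST scheme.
AllelicProfile = Tuple[int, ...]


def assign_sequence_types(profiles: Dict[str, AllelicProfile]) -> Dict[str, int]:
    """Map each strain to a sequence type (ST).

    Strains with identical allelic profiles share an ST. ST numbers are
    arbitrary (order of first appearance), not database-assigned.
    """
    st_of_profile: Dict[AllelicProfile, int] = {}
    st_of_strain: Dict[str, int] = {}
    for strain, profile in profiles.items():
        if profile not in st_of_profile:
            st_of_profile[profile] = len(st_of_profile) + 1
        st_of_strain[strain] = st_of_profile[profile]
    return st_of_strain


if __name__ == "__main__":
    # Hypothetical strains and allele numbers, for illustration only.
    example = {
        "strain_A": (1, 1, 2, 1, 3, 1),
        "strain_B": (1, 1, 2, 1, 3, 1),
        "strain_C": (1, 2, 2, 1, 3, 1),
    }
    print(assign_sequence_types(example))
```

Because a single allele difference at any locus yields a new ST, a population with limited nucleotide diversity can still contain many sequence types.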
Nandi, Tannistha; Holden, Matthew T.G.; Didelot, Xavier; Mehershahi, Kurosh; Boddey, Justin A.; Beacham, Ifor; Peak, Ian; Harting, John; Baybayan, Primo; Guo, Yan; Wang, Susana; How, Lee Chee; Sim, Bernice; Essex-Lopresti, Angela; Sarkar-Tyson, Mitali; Nelson, Michelle; Smither, Sophie; Ong, Catherine; Aw, Lay Tin; Hoon, Chua Hui; Michell, Stephen; Studholme, David J.; Titball, Richard; Chen, Swaine L.; Parkhill, Julian
2015-01-01
Burkholderia pseudomallei (Bp) is the causative agent of the infectious disease melioidosis. To investigate population diversity, recombination, and horizontal gene transfer in closely related Bp isolates, we performed whole-genome sequencing (WGS) on 106 clinical, animal, and environmental strains from a restricted Asian locale. Whole-genome phylogenies resolved multiple genomic clades of Bp, largely congruent with multilocus sequence typing (MLST). We discovered widespread recombination in the Bp core genome, involving hundreds of regions associated with multiple haplotypes. Highly recombinant regions exhibited functional enrichments that may contribute to virulence. We observed clade-specific patterns of recombination and accessory gene exchange, and provide evidence that this is likely due to ongoing recombination between clade members. Reciprocally, interclade exchanges were rarely observed, suggesting mechanisms restricting gene flow between clades. Interrogation of accessory elements revealed that each clade harbored a distinct complement of restriction-modification (RM) systems, predicted to cause clade-specific patterns of DNA methylation. Using methylome sequencing, we confirmed that representative strains from separate clades indeed exhibit distinct methylation profiles. Finally, using an E. coli system, we demonstrate that Bp RM systems can inhibit uptake of non-self DNA. Our data suggest that RM systems borne on mobile elements, besides preventing foreign DNA invasion, may also contribute to limiting exchanges of genetic material between individuals of the same species. Genomic clades may thus represent functional units of genetic isolation in Bp, modulating intraspecies genetic diversity. PMID:25236617
Sharma, Prashant P; Santiago, Marc A; Kriebel, Ricardo; Lipps, Savana M; Buenavente, Perry A C; Diesmos, Arvin C; Janda, Milan; Boyer, Sarah L; Clouse, Ronald M; Wheeler, Ward C
2017-01-01
The taxonomy and systematics of the armored harvestmen (suborder Laniatores) are based on various sets of morphological characters pertaining to shape, armature, pedipalpal setation, and the number of articles of the walking leg tarsi. Few studies have tested the validity of these historical character systems in a comprehensive way, with reference to an independent data class, i.e., molecular sequence data. We examined as a test case the systematics of Podoctidae, a family distributed throughout the Indo-Pacific. We tested the validity of the three subfamilies of Podoctidae using a five-locus phylogeny, and examined the evolution of dorsal shape as a proxy for taxonomic utility, using parametric shape analysis. Here we show that two of the three subfamilies, Ibaloniinae and Podoctinae, are non-monophyletic, with the third subfamily, Erecananinae, recovered as non-monophyletic in a subset of analyses. Various genera were also recovered as non-monophyletic. As first steps toward revision of Podoctidae, the subfamilies Erecananinae Roewer, 1912 and Ibaloniinae Roewer, 1912 are synonymized with Podoctinae Roewer, 1912 new synonymies, thereby abolishing unsubstantiated subfamilial divisions within Podoctidae. We once again synonymize the genus Paralomanius Goodnight & Goodnight, 1948 with Lomanius Roewer, 1923 revalidated. We additionally show that eggs carried on the legs of male Podoctidae are not conspecific to the males, falsifying the hypothesis of paternal care in this group. Copyright © 2016 Elsevier Inc. All rights reserved.
Ota, Yuko; Yamanaka, Takashi; Murata, Hitoshi; Neda, Hitoshi; Ohta, Akira; Kawai, Masataka; Yamada, Akiyoshi; Konno, Miki; Tanaka, Chihiro
2012-01-01
Tricholoma matsutake (S. Ito & S. Imai) Singer and its allied species are referred to as matsutake worldwide and are the most economically important edible mushrooms in Japan. They are widely distributed in the northern hemisphere and establish ectomycorrhizal relationships with conifer and broadleaf trees. To clarify relationships among T. matsutake and its allies, and to delimit phylogenetic species, we analyzed multilocus datasets (ITS, megB1, tef, gpd) with samples that were correctly identified based on morphological characteristics. Phylogenetic analyses clearly identified four major groups: matsutake, T. bakamatsutake, T. fulvocastaneum and T. caligatum; the latter three species were outside the matsutake group. The haplotype analyses and median-joining haplotype network analyses showed that the matsutake group included four closely related but clearly distinct taxa (T. matsutake, T. anatolicum, Tricholoma sp. from Mexico and T. magnivelare) from different geographical regions; these were considered to be distinct phylogenetic species.
Lima, Luciana; Espinosa-Álvarez, Oneida; Ortiz, Paola A; Trejo-Varón, Javier A; Carranza, Julio C; Pinto, C Miguel; Serrano, Myrna G; Buck, Gregory A; Camargo, Erney P; Teixeira, Marta M G
2015-11-01
Trypanosoma cruzi is a complex of phenotypically and genetically diverse isolates distributed in six discrete typing units (DTUs) designated as TcI-TcVI. Five years ago, T. cruzi isolates from Brazilian bats showing unique patterns of traditional ribosomal and spliced leader PCRs not clustering into any of the six DTUs were designated as the Tcbat genotype. In the present study, phylogenies inferred using SSU rRNA (small subunit of ribosomal RNA), gGAPDH (glycosomal glyceraldehyde 3-phosphate dehydrogenase) and Cytb (cytochrome b) genes strongly supported Tcbat as a monophyletic lineage prevalent in Brazil, Panama and Colombia. Providing strong support for Tcbat, sequences from 37 of 47 nuclear and 12 mitochondrial genes (retrieved from a draft genome of Tcbat) and reference strains of all DTUs available in databanks corroborated Tcbat as an independent DTU. Consistent with previous studies, multilocus analysis of most nuclear genes corroborated the evolution of T. cruzi from bat trypanosomes and its divergence into two main phylogenetic lineages: the basal TcII; and the lineage clustering TcIV, the clade comprising TcIII and the sister groups TcI-Tcbat. Most likely, the common ancestor of Tcbat and TcI was a bat trypanosome. However, the results of the present analysis did not support Tcbat as the ancestor of all DTUs. Despite the insights provided by reports of TcIII, TcIV and TcII in bats, including Amazonian bats harbouring TcII, further studies are necessary to understand the roles played by bats in the diversification of all DTUs. We also demonstrated that, in addition to their value as molecular markers for DTU assignment, Cytb, ITS rDNA and the spliced leader (SL) polymorphic sequences suggest spatially structured populations of Tcbat. Phylogenetic and phylogeographical analyses, multiple molecular markers specific to Tcbat, and the degrees of sequence divergence between Tcbat and the accepted DTUs strongly support the definitive classification of Tcbat as a new DTU. 
Copyright © 2015 Elsevier B.V. All rights reserved.
Comparative Analysis of the Orphan CRISPR2 Locus in 242 Enterococcus faecalis Strains
Hullahalli, Karthik; Rodrigues, Marinelle; Schmidt, Brendan D.; Li, Xiang; Bhardwaj, Pooja; Palmer, Kelli L.
2015-01-01
Clustered, Regularly Interspaced Short Palindromic Repeats and their associated Cas proteins (CRISPR-Cas) provide prokaryotes with a mechanism for defense against mobile genetic elements (MGEs). A CRISPR locus is a molecular memory of MGE encounters. It contains an array of short sequences, called spacers, that generally have sequence identity to MGEs. Three different CRISPR loci have been identified among strains of the opportunistic pathogen Enterococcus faecalis. CRISPR1 and CRISPR3 are associated with the cas genes necessary for blocking MGEs, but these loci are present in only a subset of E. faecalis strains. The orphan CRISPR2 lacks cas genes and is ubiquitous in E. faecalis, although its spacer content varies from strain to strain. Because CRISPR2 is a variable locus occurring in all E. faecalis, comparative analysis of CRISPR2 sequences may provide information about the clonality of E. faecalis strains. We examined CRISPR2 sequences from 228 E. faecalis genomes in relationship to subspecies phylogenetic lineages (sequence types; STs) determined by multilocus sequence typing (MLST), and to a genome phylogeny generated for a representative 71 genomes. We found that specific CRISPR2 sequences are associated with specific STs and with specific branches on the genome tree. To explore possible applications of CRISPR2 analysis, we evaluated 14 E. faecalis bloodstream isolates using CRISPR2 analysis and MLST. CRISPR2 analysis identified two groups of clonal strains among the 14 isolates, an assessment that was confirmed by MLST. CRISPR2 analysis was also used to accurately predict the ST of a subset of isolates. We conclude that CRISPR2 analysis, while not a replacement for MLST, is an inexpensive method to assess clonality among E. faecalis isolates, and can be used in conjunction with MLST to identify recombination events occurring between STs. PMID:26398194
Kawasaki, Yuuki; Schuler, Hannes; Stauffer, Christian; Lakatos, Ferenc; Kajimura, Hisashi
2016-05-19
Haplodiploidy is a sex determination system in which fertilized diploid eggs develop into females and unfertilized haploid eggs develop into males. The evolutionary explanations for this phenomenon include the possibility that haplodiploidy can be reinforced by infection with endosymbiotic bacteria, such as Wolbachia. The subfamily Scolytinae contains species with haplodiploid and diploid sex determination systems. Thus, we studied the association with Wolbachia in 12 diploid and 11 haplodiploid scolytine beetles by analyzing wsp and multilocus sequence typing (MLST) of five loci in this endosymbiont. Wolbachia genotypes were compared with mitochondrial (COI) and nuclear (EF) genotypes in the scolytines. Eight of the 23 scolytine species were infected with Wolbachia, with haplodiploids at significantly higher rates than diploid species. Cloning and sequencing detected multiple infections with up to six Wolbachia strains in individual species. Phylogenetic analyses of wsp and five MLST genes revealed different Wolbachia strains in scolytines. Comparisons between the beetle and Wolbachia phylogenies revealed that closely related beetles were infected with genetically different Wolbachia strains. These results suggest the horizontal transmission of multiple Wolbachia strains between scolytines. We discuss these results in terms of the evolution of different sex determination systems in scolytine beetles. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
Szabóová, Dana; Bielik, Peter; Poláková, Silvia; Šoltys, Katarína; Jatzová, Katarína; Szemes, Tomáš
2017-01-01
Abstract The yeast Saccharomyces are widely used to test ecological and evolutionary hypotheses. A large number of nuclear genomic DNA sequences are available, but mitochondrial genomic data are insufficient. We completed mitochondrial DNA (mtDNA) sequencing from Illumina MiSeq reads for all Saccharomyces species. All are circularly mapped molecules decreasing in size with phylogenetic distance from Saccharomyces cerevisiae but with similar gene content including regulatory and selfish elements like origins of replication, introns, free-standing open reading frames or GC clusters. Their most profound feature is species-specific alteration in gene order. The genetic code slightly differs from well-established yeast mitochondrial code as GUG is used rarely as the translation start and CGA and CGC code for arginine. The multilocus phylogeny, inferred from mtDNA, does not correlate with the trees derived from nuclear genes. mtDNA data demonstrate that Saccharomyces cariocanus should be assigned as a separate species and Saccharomyces bayanus CBS 380T should not be considered as a distinct species due to mtDNA nearly identical to Saccharomyces uvarum mtDNA. Apparently, comparison of mtDNAs should not be neglected in genomic studies as it is an important tool to understand the origin and evolutionary history of some yeast species. PMID:28992063
USDA-ARS's Scientific Manuscript database
The strains TII7 and A5 formed an effective and an ineffective symbiosis with Medicago truncatula Jemalong A17, respectively. Using a Multilocus Sequence Typing method, both were shown to have chromosomes identical to those of strains Rm1021 and RCR2011. The 2260 bp segments of DNA stretching from the 3’ end ...
Peeters, Charlotte; Meier-Kolthoff, Jan P.; Verheyde, Bart; De Brandt, Evie; Cooper, Vaughn S.; Vandamme, Peter
2016-01-01
Partial gyrB gene sequence analysis of 17 isolates from human and environmental sources revealed 13 clusters of strains and identified them as Burkholderia glathei clade (BGC) bacteria. The taxonomic status of these clusters was examined by whole-genome sequence analysis, determination of the G+C content, whole-cell fatty acid analysis and biochemical characterization. The whole-genome sequence-based phylogeny was assessed using the Genome Blast Distance Phylogeny (GBDP) method and an extended multilocus sequence analysis (MLSA) approach. The results demonstrated that these 17 BGC isolates represented 13 novel Burkholderia species that could be distinguished by both genotypic and phenotypic characteristics. BGC strains exhibited a broad metabolic versatility and developed beneficial, symbiotic, and pathogenic interactions with different hosts. Our data also confirmed that there is no phylogenetic subdivision in the genus Burkholderia that distinguishes beneficial from pathogenic strains. We therefore propose to formally classify the 13 novel BGC Burkholderia species as Burkholderia arvi sp. nov. (type strain LMG 29317T = CCUG 68412T), Burkholderia hypogeia sp. nov. (type strain LMG 29322T = CCUG 68407T), Burkholderia ptereochthonis sp. nov. (type strain LMG 29326T = CCUG 68403T), Burkholderia glebae sp. nov. (type strain LMG 29325T = CCUG 68404T), Burkholderia pedi sp. nov. (type strain LMG 29323T = CCUG 68406T), Burkholderia arationis sp. nov. (type strain LMG 29324T = CCUG 68405T), Burkholderia fortuita sp. nov. (type strain LMG 29320T = CCUG 68409T), Burkholderia temeraria sp. nov. (type strain LMG 29319T = CCUG 68410T), Burkholderia calidae sp. nov. (type strain LMG 29321T = CCUG 68408T), Burkholderia concitans sp. nov. (type strain LMG 29315T = CCUG 68414T), Burkholderia turbans sp. nov. (type strain LMG 29316T = CCUG 68413T), Burkholderia catudaia sp. nov. (type strain LMG 29318T = CCUG 68411T) and Burkholderia peredens sp. nov. 
(type strain LMG 29314T = CCUG 68415T). Furthermore, we present emended descriptions of the species Burkholderia sordidicola, Burkholderia zhejiangensis and Burkholderia grimmiae. The GenBank/EMBL/DDBJ accession numbers for the 16S rRNA and gyrB gene sequences determined in this study are LT158612-LT158624 and LT158625-LT158641, respectively. PMID:27375597
2013-01-01
Background The genus Uropsilus comprises a group of terrestrial, montane mammals endemic to the Hengduan and adjacent mountains. These animals are the most primitive living talpids. The taxonomy has been primarily based on cursory morphological comparisons and the evolutionary affinities are little known. To provide insight into the systematics of this group, we estimated the first multi-locus phylogeny and conducted species delimitation, including taxon sampling throughout their distribution range. Results We obtained two mitochondrial genes (~1,985 bp) and eight nuclear genes (~4,345 bp) from 56 specimens. Ten distinct evolutionary lineages were recovered from the three recognized species, eight of which were recognized as species/putative species. Five of these putative species were found to be masquerading as the gracile shrew mole. The divergence time estimation results indicated that climate change since the last Miocene and the uplift of the Himalayas may have resulted in the diversification and speciation of Uropsilus. Conclusions The cryptic diversity found in this study indicated that the number of species is strongly underestimated under the current taxonomy. Two synonyms of gracilis (atronates and nivatus) should be given full species status, and the taxonomic status of another three potential species should be evaluated using extensive taxon sampling, comprehensive morphological, and morphometric approaches. Consequently, the conservation status of Uropsilus spp. should also be re-evaluated, as most of the species/potential species have very limited distribution. PMID:24161152
Tavera, Jose; Acero P, Arturo; Wainwright, Peter C
2018-04-01
We present a phylogenetic analysis with divergence time estimates, and an ecomorphological assessment of the role of the benthic-to-pelagic axis of diversification in the history of haemulid fishes. Phylogenetic analyses were performed on 97 grunt species based on sequence data collected from seven loci. Divergence time estimation indicates that Haemulidae originated during the mid Eocene (54.7-42.3 Ma) but that the major lineages were formed during the mid-Oligocene 30-25 Ma. We propose a new classification that reflects the phylogenetic history of grunts. Overall the pattern of morphological and functional diversification in grunts appears to be strongly linked with feeding ecology. Feeding traits and the first principal component of body shape strongly separate species that feed in benthic and pelagic habitats. The benthic-to-pelagic axis has been the major axis of ecomorphological diversification in this important group of tropical shoreline fishes, with about 13 transitions between feeding habitats that have had major consequences for head and body morphology. Copyright © 2017 Elsevier Inc. All rights reserved.
Ahrenfeldt, Johanne; Skaarup, Carina; Hasman, Henrik; Pedersen, Anders Gorm; Aarestrup, Frank Møller; Lund, Ole
2017-01-05
Whole genome sequencing (WGS) is increasingly used in diagnostics and surveillance of infectious diseases. A major application for WGS is to use the data for identifying outbreak clusters, and there is therefore a need for methods that can accurately and efficiently infer phylogenies from sequencing reads. In the present study we describe a new dataset that we have created for the purpose of benchmarking such WGS-based methods for epidemiological data, and also present an analysis where we use the data to compare the performance of some current methods. Our aim was to create a benchmark data set that mimics sequencing data of the sort that might be collected during an outbreak of an infectious disease. This was achieved by letting an E. coli hypermutator strain grow in the lab for 8 consecutive days, each day splitting the culture in two while also collecting samples for sequencing. The result is a data set consisting of 101 whole genome sequences with known phylogenetic relationships. Among the sequenced samples 51 correspond to internal nodes in the phylogeny because they are ancestral, while the remaining 50 correspond to leaves. We also used the newly created data set to compare three methods, available online, that infer phylogenies from whole-genome sequencing reads: NDtree, CSI Phylogeny and REALPHY. One complication when comparing the output of these methods with the known phylogeny is that phylogenetic methods typically build trees where all observed sequences are placed as leaves, even though some of them are in fact ancestral. We therefore devised a method for post-processing the inferred trees by collapsing short branches (thus relocating some leaves to internal nodes), and also present two new measures of tree similarity that take into account the identity of both internal and leaf nodes. 
Based on this analysis we find that, among the investigated methods, CSI Phylogeny had the best performance, correctly identifying 73% of all branches in the tree and 71% of all clades. We have made all data from this experiment (raw sequencing reads, consensus whole-genome sequences, as well as descriptions of the known phylogeny in a variety of formats) publicly available, with the hope that other groups may find this data useful for benchmarking and exploring the performance of epidemiological methods. All data is freely available at: https://cge.cbs.dtu.dk/services/evolution_data.php .
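The post-processing step described above (collapsing short branches so that an ancestral sample labels an internal node instead of sitting on a leaf) can be illustrated with a minimal sketch. It is not the authors' implementation: the `Node` class, the `collapse_short_leaves` function, the threshold value and the toy tree are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """Hypothetical tree node: a name (None if unlabeled) and the branch length to its parent."""
    name: Optional[str] = None
    length: float = 0.0
    children: List["Node"] = field(default_factory=list)


def collapse_short_leaves(node: Node, threshold: float) -> Node:
    """Move a leaf onto its parent when the leaf's branch is shorter than `threshold`.

    Only one sample can label a given internal node; further short leaves stay as leaves.
    """
    kept = []
    for child in node.children:
        collapse_short_leaves(child, threshold)  # process the subtree first
        is_leaf = not child.children
        if is_leaf and child.length < threshold and node.name is None:
            node.name = child.name  # the sampled sequence now labels the internal node
        else:
            kept.append(child)
    node.children = kept
    return node


# Toy tree ((A:0.0,B:0.3,C:0.4):0.2,D:0.5); sample A is really the ancestor of B and C.
root = Node(children=[
    Node(length=0.2, children=[Node("A", 0.0), Node("B", 0.3), Node("C", 0.4)]),
    Node("D", 0.5),
])
collapse_short_leaves(root, threshold=0.01)
inner = root.children[0]
print(inner.name, [c.name for c in inner.children])
```

In this toy case the zero-length leaf A is relocated to the internal node above B and C, while D and the unlabeled root are left unchanged.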
Labeda, David P
2016-03-01
Multi-locus sequence analysis has been demonstrated to be a useful tool for identification of Streptomyces species and was previously applied to phylogenetically differentiate the type strains of species pathogenic on potatoes (Solanum tuberosum L.). The ARS Culture Collection (NRRL) contains 43 strains identified as Streptomyces scabiei deposited at various times since the 1950s and these were subjected to multi-locus sequence analysis utilising partial sequences of the house-keeping genes atpD, gyrB, recA, rpoB and trpB. Phylogenetic analyses confirmed the identity of 17 of these strains as Streptomyces scabiei, 9 of the strains as the potato-pathogenic species Streptomyces europaeiscabiei and 6 strains as potentially new phytopathogenic species. Of the 16 other strains, 12 were identified as members of previously described non-pathogenic Streptomyces species while the remaining 4 strains may represent heretofore unrecognised non-pathogenic species. This study demonstrated the value of this technique for the relatively rapid, simple and sensitive molecular identification of Streptomyces strains held in culture collections.
spads 1.0: a toolbox to perform spatial analyses on DNA sequence data sets.
Dellicour, Simon; Mardulyn, Patrick
2014-05-01
SPADS 1.0 (for 'Spatial and Population Analysis of DNA Sequences') is a population genetic toolbox for characterizing genetic variability within and among populations from DNA sequences. In view of the drastic increase in genetic information available through sequencing methods, SPADS was specifically designed to deal with multilocus data sets of DNA sequences. It computes several summary statistics from populations or groups of populations, performs input file conversions for other population genetic programs and implements locus-by-locus and multilocus versions of two clustering algorithms to study the genetic structure of populations. The toolbox also includes two MATLAB and R functions, GDISPAL and GDIVPAL, to display differentiation and diversity patterns across landscapes. These functions aim to generate interpolating surfaces based on multilocus distance and diversity indices. In the case of multiple loci, such surfaces can represent a useful alternative to the multiple pie-chart maps traditionally used in phylogeography to represent the spatial distribution of genetic diversity. These coloured surfaces can also be used to compare different data sets or different diversity and/or distance measures estimated on the same data set. © 2013 John Wiley & Sons Ltd.
Sharma, Anshul; Kaur, Jasmine; Lee, Sulhee; Park, Young-Seo
2018-06-01
In the present study, 35 Leuconostoc mesenteroides strains isolated from vegetables and food products from South Korea were studied by multilocus sequence typing (MLST) of seven housekeeping genes (atpA, groEL, gyrB, pheS, pyrG, rpoA, and uvrC). The fragment sizes of the seven amplified housekeeping genes ranged from 366 to 1414 bp. Sequence analysis indicated 27 different sequence types (STs), with 25 of them being represented by a single strain, indicating high genetic diversity, whereas the remaining 2 were represented by five strains each. In total, 220 polymorphic nucleotide sites were detected among the seven housekeeping genes. The phylogenetic analysis based on the STs of the seven loci indicated that the 35 strains belonged to two major groups, A (28 strains) and B (7 strains). Split decomposition analysis showed that intraspecies recombination played a role in generating diversity among strains. The minimum spanning tree showed that the evolution of the STs was not correlated with food source. This study shows that multilocus sequence typing is a valuable tool to assess the genetic diversity among L. mesenteroides strains from South Korea and can be used further to monitor evolutionary changes.
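The counts reported above are internally consistent: 25 single-strain STs plus 2 STs of five strains each give 25 + 2 × 5 = 35 strains. How allele profiles map to sequence types can be sketched as follows. This is a minimal illustration, not the study's pipeline: the function name, the strain names and the allele numbers are invented, and the ST numbers are assigned in order of first appearance rather than taken from any MLST database.

```python
from collections import Counter


def assign_sequence_types(profiles):
    """Give every distinct allele profile its own ST number (1, 2, ... in order of appearance).

    `profiles` maps a strain name to a tuple of allele identifiers,
    one per locus, always in the same locus order.
    """
    st_of_profile = {}
    st_of_strain = {}
    for strain, profile in profiles.items():
        if profile not in st_of_profile:
            st_of_profile[profile] = len(st_of_profile) + 1
        st_of_strain[strain] = st_of_profile[profile]
    return st_of_strain


# Hypothetical profiles over seven loci (atpA, groEL, gyrB, pheS, pyrG, rpoA, uvrC).
profiles = {
    "strain1": (1, 1, 1, 1, 1, 1, 1),
    "strain2": (1, 1, 1, 1, 1, 1, 1),  # identical to strain1, so same ST
    "strain3": (1, 2, 1, 1, 1, 1, 1),  # differs at one locus, so a new ST
    "strain4": (2, 2, 3, 1, 1, 1, 1),
}
sts = assign_sequence_types(profiles)
st_sizes = Counter(sts.values())
singletons = sum(1 for n in st_sizes.values() if n == 1)
print(sts, singletons)

# Consistency check on the figures quoted in the abstract.
strains_from_abstract = 25 * 1 + 2 * 5
```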
Phylogenetic relationships of Malassezia species based on multilocus sequence analysis.
Castellá, Gemma; Coutinho, Selene Dall' Acqua; Cabañes, F Javier
2014-01-01
Members of the genus Malassezia are lipophilic basidiomycetous yeasts, which are part of the normal cutaneous microbiota of humans and other warm-blooded animals. Currently, this genus consists of 14 species that have been characterized by phenetic and molecular methods. Although several molecular methods have been used to identify and/or differentiate Malassezia species, the sequencing of the rRNA genes and the chitin synthase-2 gene (CHS2) are the most widely employed. There is little information about the β-tubulin gene in the genus Malassezia, a gene that has been used for the analysis of complex species groups. The aim of the present study was to sequence a fragment of the β-tubulin gene of Malassezia species and analyze their phylogenetic relationship using a multilocus sequence approach based on two rRNA genes (ITS including 5.8S rRNA and D1/D2 region of 26S rRNA) together with two protein encoding genes (CHS2 and β-tubulin). The phylogenetic study of the partial β-tubulin gene sequences indicated that this molecular marker can be used to assess diversity and identify new species. The multilocus sequence analysis of the four loci provides robust support to delineate species at the terminal nodes and could help to estimate divergence times for the origin and diversification of Malassezia species.
2011-01-01
Background The avian family Cettiidae, including the genera Cettia, Urosphena, Tesia, Abroscopus and Tickellia and Orthotomus cucullatus, has recently been proposed based on analysis of a small number of loci and species. The close relationship of most of these taxa was unexpected, and called for a comprehensive study based on multiple loci and dense taxon sampling. In the present study, we infer the relationships of all except one of the species in this family using one mitochondrial and three nuclear loci. We use traditional gene tree methods (Bayesian inference, maximum likelihood bootstrapping, parsimony bootstrapping), as well as a recently developed Bayesian species tree approach (*BEAST) that accounts for lineage sorting processes that might produce discordance between gene trees. We also analyse mitochondrial DNA for a larger sample, comprising multiple individuals and a large number of subspecies of polytypic species. Results There are many topological incongruences among the single-locus trees, although none of these is strongly supported. The multi-locus tree inferred using concatenated sequences and the species tree agree well with each other, and are overall well resolved and well supported by the data. The main discrepancy between these trees concerns the most basal split. Both methods infer the genus Cettia to be highly non-monophyletic, as it is scattered across the entire family tree. Deep intraspecific divergences are revealed, and one or two species and one subspecies are inferred to be non-monophyletic (differences between methods). Conclusions The molecular phylogeny presented here is strongly inconsistent with the traditional, morphology-based classification. The remarkably high degree of non-monophyly in the genus Cettia is likely to be one of the most extraordinary examples of misconceived relationships in an avian genus. 
The phylogeny suggests instances of parallel evolution, as well as highly unequal rates of morphological divergence in different lineages. This complex morphological evolution apparently misled earlier taxonomists. These results underscore the well-known but still often neglected problem of basing classifications on overall morphological similarity. Based on the molecular data, a revised taxonomy is proposed. Although the traditional and species tree methods inferred much the same tree in the present study, the assumption by species tree methods that all species are monophyletic is a limitation in these methods, as some currently recognized species might have more complex histories. PMID:22142197
Saslis-Lagoudakis, C Haris; Klitgaard, Bente B; Forest, Félix; Francis, Louise; Savolainen, Vincent; Williamson, Elizabeth M; Hawkins, Julie A
2011-01-01
The study of traditional knowledge of medicinal plants has led to discoveries that have helped combat diseases and improve healthcare. However, the development of quantitative measures that can assist our quest for new medicinal plants has not greatly advanced in recent years. Phylogenetic tools have entered many scientific fields in the last two decades to provide explanatory power, but have been overlooked in ethnomedicinal studies. Several studies show that medicinal properties are not randomly distributed in plant phylogenies, suggesting that phylogeny shapes ethnobotanical use. Nevertheless, empirical studies that explicitly combine ethnobotanical and phylogenetic information are scarce. In this study, we borrowed tools from community ecology phylogenetics to quantify significance of phylogenetic signal in medicinal properties in plants and identify nodes on phylogenies with high bioscreening potential. To do this, we produced an ethnomedicinal review from extensive literature research and a multi-locus phylogenetic hypothesis for the pantropical genus Pterocarpus (Leguminosae: Papilionoideae). We demonstrate that species used to treat certain conditions, such as malaria, are significantly phylogenetically clumped and we highlight nodes in the phylogeny that are significantly overabundant in species used to treat certain conditions. These cross-cultural patterns in ethnomedicinal usage in Pterocarpus are interpreted in the light of phylogenetic relationships. This study provides techniques that enable the application of phylogenies in bioscreening, but also sheds light on the processes that shape cross-cultural ethnomedicinal patterns. This community phylogenetic approach demonstrates that similar ethnobotanical uses can arise in parallel in different areas where related plants are available. 
With a vast amount of ethnomedicinal and phylogenetic information available, we predict that this field, after further refinement of the techniques, will expand into similar research areas, such as pest management or the search for bioactive plant-based compounds.
Sahl, Jason W; Johnson, J Kristie; Harris, Anthony D; Phillippy, Adam M; Hsiao, William W; Thom, Kerri A; Rasko, David A
2011-06-04
Acinetobacter baumannii has recently emerged as a significant global pathogen, with a surprisingly rapid acquisition of antibiotic resistance and spread within hospitals and health care institutions. This study examines the genomic content of three A. baumannii strains isolated from distinct body sites. Isolates from blood, peri-anal, and wound sources were examined in an attempt to identify genetic features that could be correlated to each isolation source. Pulsed-field gel electrophoresis, multi-locus sequence typing and antibiotic resistance profiles demonstrated genotypic and phenotypic variation. Each isolate was sequenced to high-quality draft status, which allowed for comparative genomic analyses with existing A. baumannii genomes. A high resolution, whole genome alignment method detailed the phylogenetic relationships of sequenced A. baumannii and found no correlation between phylogeny and body site of isolation. This method identified genomic regions unique to both those isolates found on the surface of the skin or in wounds, termed colonization isolates, and those identified from body fluids, termed invasive isolates; these regions may play a role in the pathogenesis and spread of this important pathogen. A PCR-based screen of 74 A. baumannii isolates demonstrated that these unique genes are not exclusive to either phenotype or isolation source; however, a conserved genomic region exclusive to all sequenced A. baumannii was identified and verified. The results of the comparative genome analysis and PCR assay show that A. baumannii is a diverse and genomically variable pathogen that appears to have the potential to cause a range of human disease regardless of the isolation source.
Draft genome sequence of non-shiga toxin-producing Escherichia coli O157 NCCP15738.
Kwon, Taesoo; Kim, Jung-Beom; Bak, Young-Seok; Yu, Young-Bin; Kwon, Ki Sung; Kim, Won; Cho, Seung-Hak
2016-01-01
The non-shiga toxin-producing Escherichia coli (non-STEC) O157 is a pathogenic strain that causes diarrhea but does not cause hemolytic-uremic syndrome or hemorrhagic colitis. Here, we present the 5-Mb draft genome sequence of non-STEC O157 NCCP15738, which was isolated from the feces of a Korean patient with diarrhea, and describe its features and the structural basis for its genome evolution. A total of 565-Mbp paired-end reads were generated using the Illumina-HiSeq 2000 platform. The reads were assembled into 135 scaffolds through de novo assembly. The assembled genome size of NCCP15738 was 5,005,278 bp with an N50 value of 142,450 bp and 50.65 % G+C content. Using Rapid Annotation using Subsystem Technology analysis, we predicted 4780 ORFs and 31 RNA genes. The evolutionary tree was inferred from multiple sequence alignment of 45 E. coli species. The most closely related neighbor of NCCP15738 indicated by whole-genome phylogeny was E. coli UMNK88, but that indicated by multilocus sequence analysis was E. coli DH1(ME8569). A comparison between the NCCP15738 genome and those of reference strains, E. coli K-12 substr. MG1655 and EHEC O157:H7 EDL933, by bioinformatics analyses revealed unique genes in NCCP15738 associated with lysis protein S, two-component signal transduction system, conjugation, the flagellum, nucleotide-binding proteins, and metal-ion binding proteins. Notably, NCCP15738 has a dual flagella system like that in Vibrio parahaemolyticus, Aeromonas spp., and Rhodospirillum centenum. The draft genome sequence and the results of bioinformatics analysis of NCCP15738 provide the basis for understanding the genomic evolution of this strain.
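The assembly statistics quoted above (N50, G+C content) follow standard definitions, which the sketch below spells out. The scaffold lengths and the short sequence are made-up toy inputs, not data from NCCP15738, and the coverage estimate assumes that "565-Mbp" refers to the total read yield in bases.

```python
def n50(lengths):
    """Return the length L such that scaffolds of length >= L hold at least half of all bases."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    return 0  # empty assembly


def gc_percent(seq):
    """G+C content as a percentage of unambiguous bases (A, C, G, T)."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / acgt


toy_scaffolds = [80, 70, 50, 40, 30, 20]  # hypothetical scaffold lengths
toy_n50 = n50(toy_scaffolds)              # 80 + 70 = 150 covers half of the 290 total bases
toy_gc = gc_percent("ATGCGC")             # 4 of 6 bases are G or C

# Rough sequencing depth implied by the abstract's figures (assumption: 565 Mbp of reads in total).
coverage = 565e6 / 5_005_278
print(toy_n50, round(toy_gc, 2), round(coverage, 1))
```

Under that assumption the reported read yield corresponds to a depth of roughly 113x over the 5,005,278 bp assembly.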
Reck-Kortmann, Maikel; Silva-Arias, Gustavo Adolfo; Segatto, Ana Lúcia Anversa; Mäder, Geraldo; Bonatto, Sandro Luis; de Freitas, Loreta Brandão
2014-12-01
The phylogeny of Petunia species has been difficult to resolve, primarily due to the recent diversification of the genus. Several studies have included molecular data in phylogenetic reconstructions of this genus, but all of them have failed to include all taxa and/or analyzed few genetic markers. In the present study, we employed the most inclusive genetic and taxonomic datasets for the genus, aiming to reconstruct the evolutionary history of Petunia based on molecular phylogeny, biogeographic distribution, and character evolution. We included all 20 Petunia morphological species or subspecies in these analyses. Based on nine nuclear and five plastid DNA markers, our phylogenetic analysis reinforces the monophyly of the genus Petunia and supports the hypothesis that the basal divergence is more related to the differentiation of corolla tube length, whereas the geographic distribution of species is more related to divergences within these main clades. Ancestral area reconstructions suggest the Pampas region as the area of origin and earliest divergence in Petunia. The state reconstructions suggest that the ancestor of Petunia might have had a short corolla tube and a bee pollination floral syndrome. Copyright © 2014 Elsevier Inc. All rights reserved.
Ko, Kwan Soo; Oh, Won Sup; Peck, Kyong Ran; Lee, Jang Ho; Lee, Nam Yong; Song, Jae-Hoon
2005-07-01
Non-typeable isolates of Streptococcus pneumoniae collected from Asian countries were characterized by optochin susceptibility test, bile solubility test, multilocus sequence typing of housekeeping genes, amplification of virulence-related genes, 16S rDNA-RsaI digestion, and 16S rDNA sequencing. Six of 54 non-typeable pneumococcal isolates showed divergence of gene sequences of recP and xpt from typical pneumococcal strains. Of these six atypical pneumococcal strains, two showed different results in optochin susceptibility or bile solubility test from typical pneumococcal strains. All six isolates showed high sequence dissimilarities of multilocus sequence typing, 16S rDNA sequences, and lytA sequences from typical S. pneumoniae strains. Data from this study suggest that classic tests such as optochin susceptibility and bile solubility tests may lead to incorrect identification of S. pneumoniae. These atypical strains may belong to different bacterial species from S. pneumoniae.
Siqueira, J. P. Z.; Sutton, D. A.; García, D.; Wiederhold, N.; Peterson, S. W.; Guarro, J.
2017-01-01
ABSTRACT A multilocus phylogenetic study was carried out to assess species identity of a set of 34 clinical isolates from Aspergillus section Circumdati from the United States and to determine their in vitro antifungal susceptibility against eight antifungal drugs. The genetic markers used were the internal transcribed spacer (ITS) region, and fragments of the beta-tubulin (BenA), calmodulin (CaM), and RNA polymerase II second largest subunit (RPB2) genes. The drugs tested were amphotericin B, itraconazole, posaconazole, voriconazole, anidulafungin, caspofungin, micafungin, and terbinafine. The most common species sampled was A. westerdijkiae (29.4%), followed by a novel species, which was described here as A. pseudosclerotiorum (23.5%). Other species identified were A. sclerotiorum (17.6%), A. ochraceus (8.8%), A. subramanianii (8.8%), and A. insulicola and A. ochraceopetaliformis, with two isolates (5.9%) of each. The drugs that showed the most potent activity were caspofungin, micafungin, and terbinafine, while amphotericin B showed the least activity. PMID:28053212
Siqueira, J P Z; Sutton, D A; Gené, J; García, D; Wiederhold, N; Peterson, S W; Guarro, J
2017-03-01
A multilocus phylogenetic study was carried out to assess species identity of a set of 34 clinical isolates from Aspergillus section Circumdati from the United States and to determine their in vitro antifungal susceptibility against eight antifungal drugs. The genetic markers used were the internal transcribed spacer (ITS) region, and fragments of the beta-tubulin (BenA), calmodulin (CaM), and RNA polymerase II second largest subunit (RPB2) genes. The drugs tested were amphotericin B, itraconazole, posaconazole, voriconazole, anidulafungin, caspofungin, micafungin, and terbinafine. The most common species sampled was A. westerdijkiae (29.4%), followed by a novel species, which was described here as A. pseudosclerotiorum (23.5%). Other species identified were A. sclerotiorum (17.6%), A. ochraceus (8.8%), A. subramanianii (8.8%), and A. insulicola and A. ochraceopetaliformis, with two isolates (5.9%) of each. The drugs that showed the most potent activity were caspofungin, micafungin, and terbinafine, while amphotericin B showed the least activity. Copyright © 2017 American Society for Microbiology.
Haendiges, Julie; Jones, Jessica; Myers, Robert A.; Mitchell, Clifford S.; Butler, Erin
2016-01-01
ABSTRACT In the summer of 2010, Vibrio parahaemolyticus caused an outbreak in Maryland linked to the consumption of oysters. Strains isolated from both stool and oyster samples were indistinguishable by pulsed-field gel electrophoresis (PFGE). However, the oysters contained other potentially pathogenic V. parahaemolyticus strains exhibiting different PFGE patterns. In order to assess the identity, genetic makeup, relatedness, and potential pathogenicity of the V. parahaemolyticus strains, we sequenced 11 such strains (2 clinical strains and 9 oyster strains). We analyzed these genomes by in silico multilocus sequence typing (MLST) and determined their phylogeny using a whole-genome MLST (wgMLST) analysis. Our in silico MLST analysis identified six different sequence types (STs) (ST8, ST676, ST810, ST811, ST34, and ST768), with both of the clinical and four of the oyster strains being identified as belonging to ST8. Using wgMLST, we showed that the ST8 strains from clinical and oyster samples were nearly indistinguishable and belonged to the same outbreak, confirming that local oysters were the source of the infections. The remaining oyster strains were genetically diverse, differing in >3,000 loci from the Maryland ST8 strains. eBURST analysis comparing these strains with strains of other STs available at the V. parahaemolyticus MLST website showed that the Maryland ST8 strains belonged to a clonal complex endemic to Asia. This indicates that the ST8 isolates from clinical and oyster sources were likely not endemic to Maryland. Finally, this study demonstrates the utility of whole-genome sequencing (WGS) and associated analyses for source-tracking investigations. IMPORTANCE Vibrio parahaemolyticus is an important foodborne pathogen and the leading cause of bacterial infections in the United States associated with the consumption of seafood. In the summer of 2010, Vibrio parahaemolyticus caused an outbreak in Maryland linked to oyster consumption. 
Strains isolated from stool and oyster samples were indistinguishable by pulsed-field gel electrophoresis (PFGE). The oysters also contained other potentially pathogenic V. parahaemolyticus strains with different PFGE patterns. Since their identity, genetic makeup, relatedness, and potential pathogenicity were unknown, their genomes were determined by using next-generation sequencing. Whole-genome sequencing (WGS) analysis by whole-genome multilocus sequence typing (wgMLST) allowed (i) identification of clinical and oyster strains with matching PFGE profiles as belonging to ST8, (ii) determination of oyster strain diversity, and (iii) identification of the clinical strains as belonging to a clonal complex (CC) described only in Asia. Finally, WGS and associated analyses demonstrated their utility for trace-back investigations. PMID:26994080
Haendiges, Julie; Jones, Jessica; Myers, Robert A; Mitchell, Clifford S; Butler, Erin; Toro, Magaly; Gonzalez-Escalona, Narjol
2016-06-01
In the summer of 2010, Vibrio parahaemolyticus caused an outbreak in Maryland linked to the consumption of oysters. Strains isolated from both stool and oyster samples were indistinguishable by pulsed-field gel electrophoresis (PFGE). However, the oysters contained other potentially pathogenic V. parahaemolyticus strains exhibiting different PFGE patterns. In order to assess the identity, genetic makeup, relatedness, and potential pathogenicity of the V. parahaemolyticus strains, we sequenced 11 such strains (2 clinical strains and 9 oyster strains). We analyzed these genomes by in silico multilocus sequence typing (MLST) and determined their phylogeny using a whole-genome MLST (wgMLST) analysis. Our in silico MLST analysis identified six different sequence types (STs) (ST8, ST676, ST810, ST811, ST34, and ST768), with both of the clinical and four of the oyster strains being identified as belonging to ST8. Using wgMLST, we showed that the ST8 strains from clinical and oyster samples were nearly indistinguishable and belonged to the same outbreak, confirming that local oysters were the source of the infections. The remaining oyster strains were genetically diverse, differing in >3,000 loci from the Maryland ST8 strains. eBURST analysis comparing these strains with strains of other STs available at the V. parahaemolyticus MLST website showed that the Maryland ST8 strains belonged to a clonal complex endemic to Asia. This indicates that the ST8 isolates from clinical and oyster sources were likely not endemic to Maryland. Finally, this study demonstrates the utility of whole-genome sequencing (WGS) and associated analyses for source-tracking investigations. Vibrio parahaemolyticus is an important foodborne pathogen and the leading cause of bacterial infections in the United States associated with the consumption of seafood. In the summer of 2010, Vibrio parahaemolyticus caused an outbreak in Maryland linked to oyster consumption. 
Strains isolated from stool and oyster samples were indistinguishable by pulsed-field gel electrophoresis (PFGE). The oysters also contained other potentially pathogenic V. parahaemolyticus strains with different PFGE patterns. Since their identity, genetic makeup, relatedness, and potential pathogenicity were unknown, their genomes were determined by using next-generation sequencing. Whole-genome sequencing (WGS) analysis by whole-genome multilocus sequence typing (wgMLST) allowed (i) identification of clinical and oyster strains with matching PFGE profiles as belonging to ST8, (ii) determination of oyster strain diversity, and (iii) identification of the clinical strains as belonging to a clonal complex (CC) described only in Asia. Finally, WGS and associated analyses demonstrated their utility for trace-back investigations. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Antonov, V A; Altukhova, V V; Savchenko, S S; Zamaraev, V S; Iliukhin, V I; Alekseev, V V
2007-01-01
Burkholderia mallei is a highly pathogenic microorganism for both humans and animals. In this work, the use of genotyping methods for differentiating between strains of B. mallei was studied. A collection of 14 isolates of B. mallei was characterized using randomly amplified polymorphic DNA (RAPD) and multilocus sequence typing (MLST). RAPD was the better of the two methods for detecting strain differences in B. mallei. It was suggested that this method would be an increasingly useful molecular epidemiological tool.
Xu, Duo; Jaber, Yousef; Pavlidis, Pavlos; Gokcumen, Omer
2017-09-26
Constructing alignments and phylogenies for a given locus from large genome sequencing studies with relevant outgroups allows novel evolutionary and anthropological insights. However, no user-friendly tool has been developed to integrate thousands of recently available and anthropologically relevant genome sequences to construct complete sequence alignments and phylogenies. Here, we provide VCFtoTree, a user-friendly tool with a graphical user interface that directly accesses online databases to download, parse and analyze genome variation data for regions of interest. Our pipeline combines popular sequence datasets and tree building algorithms with custom data parsing to generate accurate alignments and phylogenies using all the individuals from the 1000 Genomes Project, Neanderthal and Denisovan genomes, as well as reference genomes of Chimpanzee and Rhesus Macaque. It can also be applied to other phased human genomes, as well as genomes from other species. The output of our pipeline includes an alignment in FASTA format and a tree file in Newick format. VCFtoTree fulfills the increasing demand for constructing alignments and phylogenies for a given locus from thousands of available genomes. Our software provides a user-friendly interface for a wider audience without prerequisite knowledge in programming. VCFtoTree can be accessed from https://github.com/duoduoo/VCFtoTree_3.0.0 .
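The core idea of turning phased variant calls into a sequence alignment can be sketched in a few lines. This is not VCFtoTree's code and does not use its interface: the function, its arguments and the toy data are hypothetical, only single-nucleotide variants are handled, and real VCF parsing is left out.

```python
def haplotypes_to_fasta(reference, start, snps, genotypes):
    """Build a FASTA alignment by applying each haplotype's SNP alleles to a reference region.

    reference : reference sequence of the region
    start     : 1-based genomic coordinate of the first reference base
    snps      : list of (position, ref_allele, alt_allele), single nucleotides only
    genotypes : haplotype name -> list of 0/1 calls, one per SNP (1 = carries alt allele)
    """
    records = []
    for name, calls in genotypes.items():
        seq = list(reference)
        for (pos, ref, alt), call in zip(snps, calls):
            idx = pos - start
            if reference[idx] != ref:
                raise ValueError(f"reference mismatch at position {pos}")
            if call == 1:
                seq[idx] = alt
        records.append(f">{name}\n{''.join(seq)}")
    return "\n".join(records) + "\n"


# Toy region starting at coordinate 101 with two SNPs and three phased haplotypes.
fasta = haplotypes_to_fasta(
    reference="ACGTAC",
    start=101,
    snps=[(102, "C", "T"), (105, "A", "G")],
    genotypes={"hapA": [0, 0], "hapB": [1, 0], "hapC": [1, 1]},
)
print(fasta)
```

Because substitutions keep every sequence the same length, the output is already an alignment that a tree-building program could read; indels would need extra handling.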
de Gier, Camilla; Kirkham, Lea-Ann S.
2015-01-01
Nonhemolytic variants of Haemophilus haemolyticus are difficult to differentiate from Haemophilus influenzae despite a wide difference in pathogenic potential. A previous investigation characterized a challenging set of 60 clinical strains using multiple PCRs for marker genes and described strains that could not be unequivocally identified as either species. We have analyzed the same set of strains by multilocus sequence analysis (MLSA) and near-full-length 16S rRNA gene sequencing. MLSA unambiguously allocated all study strains to either of the two species, while identification by 16S rRNA sequence was inconclusive for three strains. Notably, the two methods yielded conflicting identifications for two strains. Most of the “fuzzy species” strains were identified as H. influenzae that had undergone complete deletion of the fucose operon. Such strains, which are untypeable by the H. influenzae multilocus sequence type (MLST) scheme, have sporadically been reported and predominantly belong to a single branch of H. influenzae MLSA phylogenetic group II. We also found evidence of interspecies recombination between H. influenzae and H. haemolyticus within the 16S rRNA genes. Establishing an accurate method for rapid and inexpensive identification of H. influenzae is important for disease surveillance and treatment. PMID:26378279
Cholley, Pascal; Stojanov, Milos; Hocquet, Didier; Thouverez, Michelle; Bertrand, Xavier; Blanc, Dominique S
2015-08-01
Reliable molecular typing methods are necessary to investigate the epidemiology of bacterial pathogens. Reference methods such as multilocus sequence typing (MLST) and pulsed-field gel electrophoresis (PFGE) are costly and time consuming. Here, we compared our newly developed double-locus sequence typing (DLST) method for Pseudomonas aeruginosa to MLST and PFGE on a collection of 281 isolates. DLST was as discriminatory as MLST and was able to recognize "high-risk" epidemic clones. Both methods were highly congruent. Not surprisingly, a higher discriminatory power was observed with PFGE. In conclusion, being a simple method (single-strand sequencing of only 2 loci), DLST is valuable as a first-line typing tool for epidemiological investigations of P. aeruginosa. Coupled to a more discriminant method like PFGE or whole genome sequencing, it might represent an efficient typing strategy to investigate or prevent outbreaks. Copyright © 2015 Elsevier Inc. All rights reserved.
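Discriminatory power of typing methods is commonly quantified with Simpson's index of diversity in the Hunter-Gaston form, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where n_j is the number of isolates of type j and N the total. The abstract does not state which index was used, so the sketch below is an assumption about the general technique, and the type labels are made up.

```python
from collections import Counter


def discriminatory_index(types):
    """Hunter-Gaston form of Simpson's index of diversity.

    types: one type label per isolate (e.g. an MLST or DLST type).
    Returns the probability that two isolates drawn at random without
    replacement belong to different types.
    """
    n = len(types)
    if n < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(types)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1 - same_type_pairs / (n * (n - 1))


# Hypothetical type assignments for four isolates
print(discriminatory_index(["A", "A", "B", "C"]))
```

Computing the index for each method on the same isolate collection gives directly comparable values.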
Tóth, Annamária; Hausknecht, Anton; Krisai-Greilhuber, Irmgard; Papp, Tamás; Vágvölgyi, Csaba; Nagy, László G.
2013-01-01
Reconciling traditional classifications, morphology, and the phylogenetic relationships of brown-spored agaric mushrooms has proven difficult in many groups, due to extensive convergence in morphological features. Here, we address the monophyly of the Bolbitiaceae, a family with over 700 described species and examine the higher-level relationships within the family using a newly constructed multilocus dataset (ITS, nrLSU rDNA and EF1-alpha). We tested whether the fast-evolving Internal Transcribed Spacer (ITS) sequences can be accurately aligned across the family, by comparing the outcome of two iterative alignment refining approaches (an automated and a manual) and various indel-treatment strategies. We used PRANK to align sequences in both cases. Our results suggest that – although PRANK successfully evades overmatching of gapped sites, referred previously to as alignment overmatching – it infers an unrealistically high number of indel events with natively generated guide-trees. This 'alignment undermatching' could be avoided by using more rigorous (e.g. ML) guide trees. The trees inferred in this study support the monophyly of the core Bolbitiaceae, with the exclusion of Panaeolus, Agrocybe, and some of the genera formerly placed in the family. Bolbitius and Conocybe were found monophyletic, however, Pholiotina and Galerella require redefinition. The phylogeny revealed that stipe coverage type is a poor predictor of phylogenetic relationships, indicating the need for a revision of the intrageneric relationships within Conocybe. PMID:23418526
Wendel, Andreas F; Meyer, Sebastian; Deenen, René; Köhrer, Karl; Kolbe-Busch, Susanne; Pfeffer, Klaus; Willmann, Matthias; Kaasch, Achim J; MacKenzie, Colin R
2018-05-11
Enterobacter cloacae complex is a common cause of hospital outbreaks. A retrospective and prospective molecular analysis of carbapenem-resistant clinical isolates in a tertiary care center demonstrated an outbreak of a German-imipenemase-1 (GIM-1) metallo-beta-lactamase-producing Enterobacter hormaechei ssp. steigerwaltii affecting 23 patients between 2009 and 2016. Thirty-three isolates were sequence type 89 by conventional multilocus sequence typing (MLST) and displayed a maximum difference of 49 out of 3,643 targets in the ad-hoc core-genome MLST (cgMLST) scheme (SeqSphere+ software; Ridom, Münster, Germany). The relatedness of all isolates was confirmed by further maximum-likelihood phylogeny. One clonal complex of highly related isolates (≤15 allele difference in cgMLST) contained 17 patients, but epidemiological data only suggested five transmission events. The blaGIM-1 gene was embedded in a class-1-integron (In770) and the Tn21-subgroup transposon Tn6216 (KC511628) on a 25-kb plasmid. Environmental screening detected one colonized sink trap in a service room. The outbreak was self-limited as no further blaGIM-1-positive E. hormaechei has been isolated since 2016. Routine molecular screening of carbapenem-nonsusceptible gram-negative isolates detected a long-term, low-frequency outbreak of a GIM-1-producing E. hormaechei ssp. steigerwaltii clone. This highlights the necessity of molecular surveillance.
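The allele-difference threshold used above to define a clonal complex can be illustrated with a small sketch. It assumes allele profiles stored as lists of allele numbers (with `None` for missing targets) and groups isolates by single linkage. The abstract does not state the linkage rule applied in SeqSphere+, so that choice, the function names, and the toy profiles are assumptions.

```python
def allele_distance(p, q):
    """Count targets with different alleles, skipping targets missing in either profile."""
    return sum(
        1 for a, b in zip(p, q)
        if a is not None and b is not None and a != b
    )


def single_linkage_clusters(profiles, threshold):
    """Group isolates connected by chains of pairwise distances <= threshold."""
    names = list(profiles)
    parent = {n: n for n in names}

    def find(x):
        # Union-find root lookup with path halving
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if allele_distance(profiles[a], profiles[b]) <= threshold:
                parent[find(a)] = find(b)

    clusters = {}
    for n in names:
        clusters.setdefault(find(n), []).append(n)
    return sorted(sorted(c) for c in clusters.values())


# Toy four-target profiles; a real scheme would have thousands of targets
toy = {
    "iso1": [1, 1, 1, 1],
    "iso2": [1, 1, 1, 2],
    "iso3": [1, 1, 2, 2],
    "iso4": [5, 6, 7, 8],
}
print(single_linkage_clusters(toy, threshold=1))
```

With single linkage, two isolates can share a cluster even if their direct distance exceeds the threshold, as long as intermediate isolates bridge them. That is one reason a genomic cluster can contain more patients than epidemiology can link.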
Development of Mycoplasma synoviae (MS) core genome multilocus sequence typing (cgMLST) scheme.
Ghanem, Mostafa; El-Gazzar, Mohamed
2018-05-01
Mycoplasma synoviae (MS) is a poultry pathogen with reported increased prevalence and virulence in recent years. MS strain identification is essential for prevention, control efforts and epidemiological outbreak investigations. Multiple multilocus based sequence typing schemes have been developed for MS, yet the resolution of these schemes could be limited for outbreak investigation. The cost of whole genome sequencing has become close to that of sequencing the seven MLST targets; however, there is no standardized method for typing MS strains based on whole genome sequences. In this paper, we propose a core genome multilocus sequence typing (cgMLST) scheme as a standardized and reproducible method for typing MS based on whole genome sequences. A diverse set of 25 MS whole genome sequences were used to identify 302 core genome genes as cgMLST targets (35.5% of MS genome) and 44 whole genome sequences of MS isolates from six countries in four continents were used for typing applying this scheme. cgMLST based phylogenetic trees displayed a high degree of agreement with core genome SNP based analysis and available epidemiological information. cgMLST allowed evaluation of two conventional MLST schemes of MS. The high discriminatory power of cgMLST allowed differentiation between samples of the same conventional MLST type. cgMLST represents a standardized, accurate, highly discriminatory, and reproducible method for differentiation between MS isolates. Like conventional MLST, it provides stable and expandable nomenclature, allowing for comparing and sharing the typing results between different laboratories worldwide. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Michael DeGiorgio; John Syring; Andrew J. Eckert; Aaron Liston; Richard Cronn; David B. Neale; Noah A. Rosenberg
2014-01-01
Background: As it becomes increasingly possible to obtain DNA sequences of orthologous genes from diverse sets of taxa, species trees are frequently being inferred from multilocus data. However, the behavior of many methods for performing this inference has remained largely unexplored. Some methods have been proven to be consistent given certain evolutionary models,...
Xiong, H; Campelo, D; Pollack, R J; Raoult, D; Shao, R; Alem, M; Ali, J; Bilcha, K; Barker, S C
2014-08-01
The Illumina Hiseq platform was used to sequence the entire mitochondrial coding-regions of 20 body lice, Pediculus humanus Linnaeus, and head lice, P. capitis De Geer (Phthiraptera: Pediculidae), from eight towns and cities in five countries: Ethiopia, France, China, Australia and the U.S.A. These data (∼310 kb) were used to see how much more informative entire mitochondrial coding-region sequences were than partial mitochondrial coding-region sequences, and thus to guide the design of future studies of the phylogeny, origin, evolution and taxonomy of body lice and head lice. Phylogenies were compared from entire coding-region sequences (∼15.4 kb), entire cox1 (∼1.5 kb), partial cox1 (∼700 bp) and partial cytb (∼600 bp) sequences. On the one hand, phylogenies from entire mitochondrial coding-region sequences (∼15.4 kb) were much more informative than phylogenies from entire cox1 sequences (∼1.5 kb) and partial gene sequences (∼600 to ∼700 bp). For example, 19 branches had > 95% bootstrap support in our maximum likelihood tree from the entire mitochondrial coding-regions (∼15.4 kb) whereas the tree from 700 bp cox1 had only two branches with bootstrap support > 95%. Yet, by contrast, partial cytb (∼600 bp) and partial cox1 (∼486 bp) sequences were sufficient to genotype lice to Clade A, B or C. The sequences of the mitochondrial genomes of the P. humanus, P. capitis and P. schaeffi Fahrenholz studied are in NCBI GenBank under the accession numbers KC660761-800, KC685631-6330, KC241882-97, EU219988-95, HM241895-8 and JX080388-407. © 2014 The Royal Entomological Society.
Bergin, Sarah M; Periaswamy, Balamurugan; Barkham, Timothy; Chua, Hong Choon; Mok, Yee Ming; Fung, Daniel Shuen Sheng; Su, Alex Hsin Chuan; Lee, Yen Ling; Chua, Ming Lai Ivan; Ng, Poh Yong; Soon, Wei Jia Wendy; Chu, Collins Wenhan; Tan, Siyun Lucinda; Meehan, Mary; Ang, Brenda Sze Peng; Leo, Yee Sin; Holden, Matthew T G; De, Partha; Hsu, Li Yang; Chen, Swaine L; de Sessions, Paola Florez; Marimuthu, Kalisvar
2018-05-09
OBJECTIVE: We report the utility of whole-genome sequencing (WGS) conducted in a clinically relevant time frame (i.e., sufficient for guiding management decision), in managing a Streptococcus pyogenes outbreak, and present a comparison of its performance with emm typing. SETTING: A 2,000-bed tertiary-care psychiatric hospital. METHODS: Active surveillance was conducted to identify new cases of S. pyogenes. WGS guided targeted epidemiological investigations, and infection control measures were implemented. Single-nucleotide polymorphism (SNP)-based genome phylogeny, emm typing, and multilocus sequence typing (MLST) were performed. We compared the ability of WGS and emm typing to correctly identify person-to-person transmission and to guide the management of the outbreak. RESULTS: The study included 204 patients and 152 staff. We identified 35 patients and 2 staff members with S. pyogenes. WGS revealed polyclonal S. pyogenes infections with 3 genetically distinct phylogenetic clusters (C1-C3). Cluster C1 isolates were all emm type 4, sequence type 915 and had pairwise SNP differences of 0-5, which suggested recent person-to-person transmissions. Epidemiological investigation revealed that cluster C1 was mediated by dermal colonization and transmission of S. pyogenes in a male residential ward. Clusters C2 and C3 were genomically diverse, with pairwise SNP differences of 21-45 and 26-58, and emm 11 and mostly emm120, respectively. Clusters C2 and C3, which may have been considered person-to-person transmissions by emm typing, were shown by WGS to be unlikely by integrating pairwise SNP differences with epidemiology. CONCLUSIONS: WGS had higher resolution than emm typing in identifying clusters with recent and ongoing person-to-person transmissions, which allowed implementation of targeted intervention to control the outbreak. Infect Control Hosp Epidemiol 2018;1-9.
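The per-cluster ranges of pairwise SNP differences reported above can be computed from aligned sequences along these lines. The sketch assumes equal-length aligned strings and ignores positions where either sequence has a non-ACGT character. The function names and toy sequences are illustrative, and the study's actual variant-calling pipeline is not reproduced here.

```python
from itertools import combinations


def snp_distance(a, b):
    """Number of aligned positions where both sequences have A/C/G/T and differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(
        1 for x, y in zip(a, b)
        if x != y and x in "ACGT" and y in "ACGT"
    )


def pairwise_snp_range(sequences):
    """Return (min, max) pairwise SNP distance among the isolates of one cluster."""
    distances = [
        snp_distance(a, b) for a, b in combinations(sequences.values(), 2)
    ]
    if not distances:
        raise ValueError("at least two isolates are required")
    return min(distances), max(distances)


# Toy cluster of three isolates (illustrative sequences only)
cluster = {"p1": "ACGTACGT", "p2": "ACGTACGA", "p3": "ACGTACGT"}
print(pairwise_snp_range(cluster))
```

A narrow range starting at zero, like that reported for cluster C1, is the pattern usually read as recent transmission. The interpretation threshold depends on the organism and is not fixed by this calculation.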
Yap, Kien-Pong; Ho, Wing S; Gan, Han M; Chai, Lay C; Thong, Kwai L
2016-01-01
Typhoid fever, caused by Salmonella enterica serovar Typhi, remains an important public health burden in Southeast Asia and other endemic countries. Various genotyping methods have been applied to study the genetic variations of this human-restricted pathogen. Multilocus sequence typing (MLST) is one of the widely accepted methods, and recently, there is a growing interest in the re-application of MLST in the post-genomic era. In this study, we provide the global MLST distribution of S. Typhi utilizing both publicly available 1,826 S. Typhi genome sequences in addition to performing conventional MLST on S. Typhi strains isolated from various endemic regions spanning over a century. Our global MLST analysis confirms the predominance of two sequence types (ST1 and ST2) co-existing in the endemic regions. Interestingly, S. Typhi strains with ST8 are currently confined within the African continent. Comparative genomic analyses of ST8 and other rare STs with genomes of ST1/ST2 revealed unique mutations in important virulence genes such as flhB, sipC, and tviD that may explain the variations that differentiate between seemingly successful (widespread) and unsuccessful (poor dissemination) S. Typhi populations. Large scale whole-genome phylogeny demonstrated evidence of phylogeographical structuring and showed that ST8 may have diverged from the earlier ancestral population of ST1 and ST2, which later lost some of its fitness advantages, leading to poor worldwide dissemination. In response to the unprecedented increase in genomic data, this study demonstrates and highlights the utility of large-scale genome-based MLST as a quick and effective approach to narrow the scope of in-depth comparative genomic analysis and consequently provide new insights into the fine scale of pathogen evolution and population structure.
Mendes, Joana; Harris, D James; Carranza, Salvador; Salvi, Daniele
2016-07-01
Estimating the phylogeny of lacertid lizards, and particularly the tribe Lacertini has been challenging, possibly due to the fast radiation of this group resulting in a hard polytomy. However this is still an open question, as concatenated data primarily from mitochondrial markers have been used so far whereas in a recent phylogeny based on a compilation of these data within a squamate supermatrix the basal polytomy seems to be resolved. In this study, we estimate phylogenetic relationships between all Lacertini genera using for the first time DNA sequences from five fast evolving nuclear genes (acm4, mc1r, pdc, βfib and reln) and two mitochondrial genes (nd4 and 12S). We generated a total of 529 sequences from 88 species and used Maximum Likelihood and Bayesian Inference methods based on concatenated multilocus dataset as well as a coalescent-based species tree approach with the aim of (i) shedding light on the basal relationships of Lacertini (ii) assessing the monophyly of genera which were previously questioned, and (iii) discussing differences between estimates from this and previous studies based on different markers, and phylogenetic methods. Results uncovered (i) a new phylogenetic clade formed by the monotypic genera Archaeolacerta, Zootoca, Teira and Scelarcis; and (ii) support for the monophyly of the Algyroides clade, with two sister species pairs represented by western (A. marchi and A. fitzingeri) and eastern (A. nigropunctatus and A. moreoticus) lineages. In both cases the members of these groups show peculiar morphology and very different geographical distributions, suggesting that they are relictual groups that were once diverse and widespread. They probably originated about 11-13 million years ago during early events of speciation in the tribe, and the split between their members is estimated to be only slightly older. 
This scenario may explain why mitochondrial markers (possibly saturated at higher divergence levels) or slower nuclear markers used in previous studies (likely lacking enough phylogenetic signal) failed to recover these relationships. Finally, the phylogenetic position of most remaining genera was unresolved, corroborating the hypothesis of a hard polytomy in the Lacertini phylogeny due to a fast radiation. This is in agreement with all previous studies but in sharp contrast with a recent squamate megaphylogeny. We show that the supermatrix approach may provide high support for incorrect nodes that are not supported either by original sequence data or by new data from this study. This finding suggests caution when using megaphylogenies to integrate inter-generic relationships in comparative ecological and evolutionary studies. Copyright © 2016 Elsevier Inc. All rights reserved.
Reconstructing metastatic seeding patterns of human cancers
Reiter, Johannes G.; Makohon-Moore, Alvin P.; Gerold, Jeffrey M.; Bozic, Ivana; Chatterjee, Krishnendu; Iacobuzio-Donahue, Christine A.; Vogelstein, Bert; Nowak, Martin A.
2017-01-01
Reconstructing the evolutionary history of metastases is critical for understanding their basic biological principles and has profound clinical implications. Genome-wide sequencing data has enabled modern phylogenomic methods to accurately dissect subclones and their phylogenies from noisy and impure bulk tumour samples at unprecedented depth. However, existing methods are not designed to infer metastatic seeding patterns. Here we develop a tool, called Treeomics, to reconstruct the phylogeny of metastases and map subclones to their anatomic locations. Treeomics infers comprehensive seeding patterns for pancreatic, ovarian, and prostate cancers. Moreover, Treeomics correctly disambiguates true seeding patterns from sequencing artifacts; 7% of variants were misclassified by conventional statistical methods. These artifacts can skew phylogenies by creating illusory tumour heterogeneity among distinct samples. In silico benchmarking on simulated tumour phylogenies across a wide range of sample purities (15–95%) and sequencing depths (25-800 × ) demonstrates the accuracy of Treeomics compared with existing methods. PMID:28139641
Inquiry-Based Learning of Molecular Phylogenetics
ERIC Educational Resources Information Center
Campo, Daniel; Garcia-Vazquez, Eva
2008-01-01
Reconstructing phylogenies from nucleotide sequences is a challenge for students because it strongly depends on evolutionary models and computer tools that are frequently updated. We present here an inquiry-based course aimed at learning how to trace a phylogeny based on sequences existing in public databases. Computer tools are freely available…
Estimation of relative effectiveness of phylogenetic programs by machine learning.
Krivozubov, Mikhail; Goebels, Florian; Spirin, Sergei
2014-04-01
Reconstruction of phylogeny of a protein family from a sequence alignment can produce results of different quality. Our goal is to predict the quality of phylogeny reconstruction based on features that can be extracted from the input alignment. We used the Fitch-Margoliash (FM) method of phylogeny reconstruction and random forest as a predictor. For training and testing the predictor, alignments of orthologous series (OS) were used, for which the result of phylogeny reconstruction can be evaluated by comparison with trees of corresponding organisms. Our results show that the quality of phylogeny reconstruction can be predicted with more than 80% precision. Also, we tried to predict which phylogeny reconstruction method, FM or UPGMA, is better for a particular alignment. With the set of features used, among alignments for which the obtained predictor predicts a better performance of UPGMA, 56% really give a better result with UPGMA. Taking into account that in our testing set UPGMA performs better for only 34% of alignments, this result shows that it is possible in principle to predict the better phylogeny reconstruction method based on features of a sequence alignment.
Rapid diversification and dispersal during periods of global warming by plethodontid salamanders
Vieites, David R.; Min, Mi-Sook; Wake, David B.
2007-01-01
A phylogeny and timescale derived from analyses of multilocus nuclear DNA sequences for Holarctic genera of plethodontid salamanders reveal them to be an old radiation whose common ancestor diverged from sister taxa in the late Jurassic and underwent rapid diversification during the late Cretaceous. A North American origin of plethodontids was followed by a continental-wide diversification, not necessarily centered only in the Appalachian region. The colonization of Eurasia by plethodontids most likely occurred once, by dispersal during the late Cretaceous. Subsequent diversification in Asia led to the origin of Hydromantes and Karsenia, with the former then dispersing both to Europe and back to North America. Salamanders underwent rapid episodes of diversification and dispersal that coincided with major global warming events during the late Cretaceous and again during the Paleocene–Eocene thermal optimum. The major clades of plethodontids were established during these episodes, contemporaneously with similar phenomena in angiosperms, arthropods, birds, and mammals. Periods of global warming may have promoted diversification and both inter- and transcontinental dispersal in northern hemisphere salamanders by making available terrain that shortened dispersal routes and offered new opportunities for adaptive and vicariant evolution. PMID:18077422
An experimental phylogeny to benchmark ancestral sequence reconstruction
Randall, Ryan N.; Radford, Caelan E.; Roof, Kelsey A.; Natarajan, Divya K.; Gaucher, Eric A.
2016-01-01
Ancestral sequence reconstruction (ASR) is a still-burgeoning method that has revealed many key mechanisms of molecular evolution. One criticism of the approach is an inability to validate its algorithms within a biological context as opposed to a computer simulation. Here we build an experimental phylogeny using the gene of a single red fluorescent protein to address this criticism. The evolved phylogeny consists of 19 operational taxonomic units (leaves) and 17 ancestral bifurcations (nodes) that display a wide variety of fluorescent phenotypes. The 19 leaves then serve as ‘modern' sequences that we subject to ASR analyses using various algorithms and to benchmark against the known ancestral genotypes and ancestral phenotypes. We confirm computer simulations that show all algorithms infer ancient sequences with high accuracy, yet we also reveal wide variation in the phenotypes encoded by incorrectly inferred sequences. Specifically, Bayesian methods incorporating rate variation significantly outperform the maximum parsimony criterion in phenotypic accuracy. Subsampling of extant sequences had minor effect on the inference of ancestral sequences. PMID:27628687
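As background for the maximum parsimony criterion mentioned above, the bottom-up pass of Fitch's small-parsimony algorithm for a single alignment site can be sketched as follows. This is a textbook illustration on a hypothetical four-leaf tree. It is not the software used in the study, and it returns only the candidate state set at the root rather than a full traceback of every ancestral node.

```python
def fitch_sets(tree, leaf_states):
    """Bottom-up Fitch pass for one site on a rooted binary tree.

    tree        : a leaf name (str) or a pair (left_subtree, right_subtree)
    leaf_states : mapping from leaf name to its observed character
    Returns (candidate_states, minimum_number_of_changes) for the subtree root.
    """
    if isinstance(tree, str):
        return {leaf_states[tree]}, 0
    left, right = tree
    left_set, left_cost = fitch_sets(left, leaf_states)
    right_set, right_cost = fitch_sets(right, leaf_states)
    common = left_set & right_set
    if common:
        # Children agree on at least one state: no extra change needed
        return common, left_cost + right_cost
    # Children disagree: keep both options and count one substitution
    return left_set | right_set, left_cost + right_cost + 1


# Hypothetical tree ((A,B),(C,D)) with one observed nucleotide per leaf
toy_tree = (("A", "B"), ("C", "D"))
toy_states = {"A": "G", "B": "G", "C": "T", "D": "G"}
print(fitch_sets(toy_tree, toy_states))
```

Parsimony treats all changes as equally costly and ignores branch lengths, which is one plausible reason model-based methods with rate variation can differ from it on difficult sites.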
Molecular characterization of Giardia psittaci by multilocus sequence analysis.
Abe, Niichiro; Makino, Ikuko; Kojima, Atsushi
2012-12-01
Multilocus sequence analyses targeting small subunit ribosomal DNA (SSU rDNA), elongation factor 1 alpha (ef1α), glutamate dehydrogenase (gdh), and beta giardin (β-giardin) were performed on Giardia psittaci isolates from three Budgerigars (Melopsittacus undulatus) and four Barred parakeets (Bolborhynchus lineola) kept in individual households or imported from overseas. Nucleotide differences and phylogenetic analyses at four loci indicate the distinction of G. psittaci from the other known Giardia species: Giardia muris, Giardia microti, Giardia ardeae, and Giardia duodenalis assemblages. Furthermore, G. psittaci was related more closely to G. duodenalis than to the other known Giardia species, except for G. microti. Conflicting signals regarded as "double peaks" were found at the same nucleotide positions of the ef1α in all isolates. However, the sequences of the other three loci, including gdh and β-giardin, which are known to be highly variable, from all isolates were also mutually identical at every locus. They showed no double peaks. These results suggest that double peaks found in the ef1α sequences are caused not by mixed infection with genetically different G. psittaci isolates but by allelic sequence heterogeneity (ASH), which is observed in diplomonad lineages including G. duodenalis. No sequence difference was found in any G. psittaci isolates at the gdh and β-giardin, suggesting that G. psittaci is indeed not more diverse genetically than other Giardia species. This report is the first to provide evidence related to the genetic characteristics of G. psittaci obtained using multilocus sequence analysis. Copyright © 2012 Elsevier B.V. All rights reserved.
Saslis-Lagoudakis, C. Haris; Klitgaard, Bente B.; Forest, Félix; Francis, Louise; Savolainen, Vincent; Williamson, Elizabeth M.; Hawkins, Julie A.
2011-01-01
Background: The study of traditional knowledge of medicinal plants has led to discoveries that have helped combat diseases and improve healthcare. However, the development of quantitative measures that can assist our quest for new medicinal plants has not greatly advanced in recent years. Phylogenetic tools have entered many scientific fields in the last two decades to provide explanatory power, but have been overlooked in ethnomedicinal studies. Several studies show that medicinal properties are not randomly distributed in plant phylogenies, suggesting that phylogeny shapes ethnobotanical use. Nevertheless, empirical studies that explicitly combine ethnobotanical and phylogenetic information are scarce. Methodology/Principal Findings: In this study, we borrowed tools from community ecology phylogenetics to quantify significance of phylogenetic signal in medicinal properties in plants and identify nodes on phylogenies with high bioscreening potential. To do this, we produced an ethnomedicinal review from extensive literature research and a multi-locus phylogenetic hypothesis for the pantropical genus Pterocarpus (Leguminosae: Papilionoideae). We demonstrate that species used to treat certain conditions, such as malaria, are significantly phylogenetically clumped and we highlight nodes in the phylogeny that are significantly overabundant in species used to treat certain conditions. These cross-cultural patterns in ethnomedicinal usage in Pterocarpus are interpreted in the light of phylogenetic relationships. Conclusions/Significance: This study provides techniques that enable the application of phylogenies in bioscreening, but also sheds light on the processes that shape cross-cultural ethnomedicinal patterns. This community phylogenetic approach demonstrates that similar ethnobotanical uses can arise in parallel in different areas where related plants are available. 
With a vast amount of ethnomedicinal and phylogenetic information available, we predict that this field, after further refinement of the techniques, will expand into similar research areas, such as pest management or the search for bioactive plant-based compounds. PMID:21789247
Gardner, Shea N; Wagner, Mark C
2005-01-01
Background: Microbial forensics is important in tracking the source of a pathogen, whether the disease is a naturally occurring outbreak or part of a criminal investigation. Results: A method and SPR Opt (SNP and PCR-RFLP Optimization) software to perform a comprehensive, whole-genome analysis to forensically discriminate multiple sequences is presented. Tools for the optimization of forensic typing using Single Nucleotide Polymorphism (SNP) and PCR-Restriction Fragment Length Polymorphism (PCR-RFLP) analyses across multiple isolate sequences of a species are described. The PCR-RFLP analysis includes prediction and selection of optimal primers and restriction enzymes to enable maximum isolate discrimination based on sequence information. SPR Opt calculates all SNP or PCR-RFLP variations present in the sequences, groups them into haplotypes according to their co-segregation across those sequences, and performs combinatoric analyses to determine which sets of haplotypes provide maximal discrimination among all the input sequences. Those set combinations requiring that membership in the fewest haplotypes be queried (i.e. the fewest assays be performed) are found. These analyses highlight variable regions based on existing sequence data. These markers may be heterogeneous among unsequenced isolates as well, and thus may be useful for characterizing the relationships among unsequenced as well as sequenced isolates. The predictions are multi-locus. Analyses of mumps and SARS viruses are summarized. Phylogenetic trees created based on SNPs, PCR-RFLPs, and full genomes are compared for SARS virus, illustrating that purported phylogenies based only on SNP or PCR-RFLP variations do not match those based on multiple sequence alignment of the full genomes. Conclusion: This is the first software to optimize the selection of forensic markers to maximize information gained from the fewest assays, accepting whole or partial genome sequence data as input. 
As more sequence data becomes available for multiple strains and isolates of a species, automated, computational approaches such as those described here will be essential to make sense of large amounts of information, and to guide and optimize efforts in the laboratory. The software and source code for SPR Opt is publicly available and free for non-profit use at . PMID:15904493
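The grouping and selection steps described in this abstract (collapse co-segregating markers into haplotypes, then find the fewest haplotypes that still separate the isolates) can be sketched as below. This is an illustration of the general idea under stated assumptions, not SPR Opt's implementation: the function names and toy SNP matrix are invented, and the exhaustive subset search is only practical for small numbers of haplotypes.

```python
from itertools import combinations


def group_haplotypes(snp_matrix):
    """Group markers that split the isolates in exactly the same way.

    snp_matrix: marker name -> tuple of alleles, one per isolate.
    Returns: partition pattern -> list of markers sharing that pattern.
    """
    groups = {}
    for marker, alleles in snp_matrix.items():
        labels = {}
        # Relabel alleles by order of first appearance so that A/G and C/T
        # markers with the same split get the same pattern
        pattern = tuple(labels.setdefault(a, len(labels)) for a in alleles)
        groups.setdefault(pattern, []).append(marker)
    return groups


def n_distinguished(patterns, n_isolates):
    """Number of distinct isolate signatures produced by the chosen patterns."""
    return len({tuple(p[i] for p in patterns) for i in range(n_isolates)})


def smallest_discriminating_set(groups, n_isolates):
    """Smallest set of haplotypes giving the same discrimination as all markers."""
    patterns = list(groups)
    target = n_distinguished(patterns, n_isolates)
    for size in range(1, len(patterns) + 1):
        for subset in combinations(patterns, size):
            if n_distinguished(subset, n_isolates) == target:
                return [groups[p] for p in subset]
    return []


# Toy matrix: four markers typed in four isolates (illustrative values)
toy = {
    "snp1": ("A", "A", "G", "G"),
    "snp2": ("C", "C", "T", "T"),
    "snp3": ("A", "T", "A", "T"),
    "snp4": ("G", "G", "G", "C"),
}
print(smallest_discriminating_set(group_haplotypes(toy), n_isolates=4))
```

Each inner list in the result is one haplotype, and assaying any single marker from it is enough, which is how the number of assays is kept low.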
Callahan, Melissa S; McPeek, Mark A
2016-01-01
Reconstructing evolutionary patterns of species and populations provides a framework for asking questions about the impacts of climate change. Here we use a multilocus dataset to estimate gene trees under maximum likelihood and Bayesian models to obtain a robust estimate of relationships for a genus of North American damselflies, Enallagma. Using a relaxed molecular clock, we estimate the divergence times for this group. Furthermore, to account for the fact that gene tree analyses can overestimate ages of population divergences, we use a multi-population coalescent model to gain a more accurate estimate of divergence times. We also infer diversification rates using a method that allows for variation in diversification rate through time and among lineages. Our results reveal a complex evolutionary history of Enallagma, in which divergence events both predate and occur during Pleistocene climate fluctuations. There is also evidence of diversification rate heterogeneity across the tree. These divergence time estimates provide a foundation for addressing the relative significance of historical climatic events in the diversification of this genus. Copyright © 2015 Elsevier Inc. All rights reserved.
USDA-ARS's Scientific Manuscript database
The ARS Microbial Genome Sequence Database (http://199.133.98.43), a web-based database server, was established utilizing the BIGSdb (Bacterial Isolate Genomics Sequence Database) software package, developed at Oxford University, as a tool to manage multi-locus sequence data for the family Streptomy...
Tanabe, Akifumi S
2011-09-01
Proportional and separate models able to apply different combinations of substitution rate matrix (SRM) and among-site rate variation model (ASRVM) to each locus are frequently used in phylogenetic studies of multilocus data. A proportional model assumes that branch lengths are proportional among partitions and a separate model assumes that each partition has an independent set of branch lengths. However, the selection from among nonpartitioned (i.e., a common combination of models is applied to all-loci concatenated sequences), proportional and separate models is usually based on the researcher's preference rather than on any information criteria. This study describes two programs, 'Kakusan4' (for DNA sequences) and 'Aminosan' (for amino-acid sequences), which allow the selection of evolutionary models based on several types of information criteria. The programs can handle both multilocus and single-locus data, in addition to providing an easy-to-use wizard interface and a noninteractive command line interface. In the case of multilocus data, SRMs and ASRVMs are compared at each locus and at all-loci concatenated sequences, after which nonpartitioned, proportional and separate models are compared based on information criteria. The programs also provide model configuration files for MrBayes, PAUP*, PhyML, RAxML and Treefinder to support further phylogenetic analysis using a selected model. When likelihoods are optimized by Treefinder, the best-fit models were found to differ depending on the data set. Furthermore, differences in the information criteria among nonpartitioned, proportional and separate models were much larger than those among the nonpartitioned models. These findings suggest that selecting from nonpartitioned, proportional and separate models results in a better phylogenetic tree. Kakusan4 and Aminosan are available at http://www.fifthdimension.jp/. They are licensed under GNU GPL Ver. 2, and are able to run on Windows, MacOS X and Linux. 
© 2011 Blackwell Publishing Ltd.
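The kind of comparison described above, choosing among nonpartitioned, proportional and separate models by information criteria, rests on standard formulas: AIC = 2k - 2 ln L, AICc = AIC + 2k(k + 1)/(n - k - 1), and BIC = k ln n - 2 ln L, where k is the number of free parameters and n the sample size. The sketch below applies them to made-up log-likelihoods and parameter counts. It is not Kakusan4's code, and taking n as the number of alignment sites is an assumption of this example.

```python
import math


def aic(lnl, k):
    """Akaike information criterion."""
    return 2 * k - 2 * lnl


def aicc(lnl, k, n):
    """AIC with the small-sample correction; requires n > k + 1."""
    if n - k - 1 <= 0:
        raise ValueError("AICc is undefined when n <= k + 1")
    return aic(lnl, k) + (2 * k * (k + 1)) / (n - k - 1)


def bic(lnl, k, n):
    """Bayesian information criterion."""
    return k * math.log(n) - 2 * lnl


def best_model(candidates, n, criterion="bic"):
    """Return the name of the candidate with the lowest criterion value.

    candidates: model name -> (log-likelihood, number of free parameters)
    """
    score = {
        "aic": lambda lnl, k: aic(lnl, k),
        "aicc": lambda lnl, k: aicc(lnl, k, n),
        "bic": lambda lnl, k: bic(lnl, k, n),
    }[criterion]
    return min(candidates, key=lambda name: score(*candidates[name]))


# Hypothetical fits for a 3000-site concatenated alignment
fits = {
    "nonpartitioned": (-10500.0, 30),
    "proportional": (-10320.0, 45),
    "separate": (-10300.0, 90),
}
print(best_model(fits, n=3000, criterion="bic"))
```

The separate model always has the highest likelihood because it has the most parameters, and the penalty term is what lets a more constrained model win when the extra branch-length sets are not supported by the data.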
Leaché, Adam D.; Chavez, Andreas S.; Jones, Leonard N.; Grummer, Jared A.; Gottscho, Andrew D.; Linkem, Charles W.
2015-01-01
Sequence capture and restriction site associated DNA sequencing (RADseq) are popular methods for obtaining large numbers of loci for phylogenetic analysis. These methods are typically used to collect data at different evolutionary timescales; sequence capture is primarily used for obtaining conserved loci, whereas RADseq is designed for discovering single nucleotide polymorphisms (SNPs) suitable for population genetic or phylogeographic analyses. Phylogenetic questions that span both “recent” and “deep” timescales could benefit from either type of data, but studies that directly compare the two approaches are lacking. We compared phylogenies estimated from sequence capture and double digest RADseq (ddRADseq) data for North American phrynosomatid lizards, a species-rich and diverse group containing nine genera that began diversifying approximately 55 Ma. Sequence capture resulted in 584 loci that provided a consistent and strong phylogeny using concatenation and species tree inference. However, the phylogeny estimated from the ddRADseq data was sensitive to the bioinformatics steps used for determining homology, detecting paralogs, and filtering missing data. The topological conflicts among the SNP trees were not restricted to any particular timescale, but instead were associated with short internal branches. Species tree analysis of the largest SNP assembly, which also included the most missing data, supported a topology that matched the sequence capture tree. This preferred phylogeny provides strong support for the paraphyly of the earless lizard genera Holbrookia and Cophosaurus, suggesting that the earless morphology either evolved twice or evolved once and was subsequently lost in Callisaurus. PMID:25663487
Sun, Zhihong; Liu, Wenjun; Song, Yuqin; Xu, Haiyan; Yu, Jie; Bilige, Menghe; Zhang, Heping; Chen, Yongfu
2015-05-01
Lactobacillus helveticus is an economically important lactic acid bacterium used in industrial dairy fermentation. In the present study, the population structure of 245 isolates of L. helveticus from different naturally fermented dairy products in China and Mongolia was investigated using a multilocus sequence typing scheme with 11 housekeeping genes. A total of 108 sequence types were detected, which formed 8 clonal complexes and 27 singletons. Results from Structure, SplitsTree, and ClonalFrame software analyses demonstrated the presence of 3 subpopulations in the L. helveticus isolates used in our study, namely koumiss, kurut-tarag, and panmictic lineages. Most L. helveticus isolates from particular ecological origins had specific population structures. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Kotetishvili, Mamuka; Stine, O. Colin; Chen, Yuansha; Kreger, Arnold; Sulakvelidze, Alexander; Sozhamannan, Shanmuga; Morris, Jr., J. Glenn
2003-01-01
Twenty-two Vibrio cholerae isolates, including some from “epidemic” (O1 and O139) and “nonepidemic” serogroups, were characterized by pulsed-field gel electrophoresis (PFGE) and multilocus sequence typing (MLST) by using three housekeeping genes, gyrB, pgm, and recA; sequence data were also obtained for the virulence-associated genes tcpA, ctxA, and ctxB. Even with the small number of loci used, MLST had better discriminatory ability than did PFGE. On MLST analysis, there was clear clustering of epidemic serogroups; much greater diversity was seen among tcpA- and ctxAB-positive V. cholerae strains from other, nonepidemic serogroups, with a number of tcpA and ctxAB alleles identified. PMID:12734277
Sanz, Yolanda
2017-01-01
The miniaturized and portable DNA sequencer MinION™ has demonstrated great potential in different analyses such as genome-wide sequencing, pathogen outbreak detection and surveillance, human genome variability, and microbial diversity. In this study, we tested the ability of the MinION™ platform to perform long amplicon sequencing in order to design new approaches to study microbial diversity using a multi-locus approach. After compiling a robust database by parsing and extracting the rrn bacterial region from more than 67000 complete or draft bacterial genomes, we demonstrated that the data obtained during sequencing of the long amplicon in the MinION™ device using R9 and R9.4 chemistries were sufficient to study 2 mock microbial communities in a multiplex manner and to almost completely reconstruct the microbial diversity contained in the HM782D and D6305 mock communities. Although nanopore-based sequencing produces reads with lower per-base accuracy compared with other platforms, we presented a novel approach consisting of multi-locus and long amplicon sequencing using the MinION™ MkIb DNA sequencer and R9 and R9.4 chemistries that helps to overcome the main disadvantage of this portable sequencing platform. Furthermore, the nanopore sequencing library, constructed with the latest releases of pore chemistry (R9.4) and sequencing kit (SQK-LSK108), permitted the retrieval of a higher level of 1D read accuracy, sufficient to characterize the microbial species present in each mock community analysed. Improvements in nanopore chemistry, such as minimizing base-calling errors and new library protocols able to produce rapid 1D libraries, will provide more reliable information in the near future. Such data will be useful for more comprehensive and faster specific detection of microbial species and strains in complex ecosystems. PMID:28605506
USDA-ARS's Scientific Manuscript database
Reconstructing the phylogeny of Pyrus has been difficult due to the wide distribution of the genus and lack of informative data. In this study, we collected 110 accessions representing 25 Pyrus species and constructed both phylogenetic trees and phylogenetic networks based on multiple DNA sequence d...
Swain, Timothy D
2018-01-01
The recent rapid proliferation of novel taxon identification in the Zoanthidea has been accompanied by a parallel propagation of gene trees as a tool of species discovery, but not a corresponding increase in our understanding of phylogeny. This disparity is caused by the trade-off between the capabilities of automated DNA sequence alignment and data content of genes applied to phylogenetic inference in this group. Conserved genes or segments are easily aligned across the order, but produce poorly resolved trees; hypervariable genes or segments contain the evolutionary signal necessary for resolution and robust support, but sequence alignment is daunting. Staggered alignments are a form of phylogeny-informed sequence alignment composed of a mosaic of local and universal regions that allow phylogenetic inference to be applied to all nucleotides from both hypervariable and conserved gene segments. Comparisons between species tree phylogenies inferred from all data (staggered alignment) and hypervariable-excluded data (standard alignment) demonstrate improved confidence and greater topological agreement with other sources of data for the complete-data tree. This novel phylogeny is the most comprehensive to date (in terms of taxa and data) and can serve as an expandable tool for evolutionary hypothesis testing in the Zoanthidea. Spanish language abstract available in Text S1. Translation by L. O. Swain, DePaul University, Chicago, Illinois, 60604, USA. Copyright © 2017 Elsevier Inc. All rights reserved.
GASP: Gapped Ancestral Sequence Prediction for proteins
Edwards, Richard J; Shields, Denis C
2004-01-01
Background: The prediction of ancestral protein sequences from multiple sequence alignments is useful for many bioinformatics analyses. Predicting ancestral sequences is not a simple procedure and relies on accurate alignments and phylogenies. Several algorithms exist based on Maximum Parsimony or Maximum Likelihood methods but many current implementations are unable to process residues with gaps, which may represent insertion/deletion (indel) events or sequence fragments. Results: Here we present a new algorithm, GASP (Gapped Ancestral Sequence Prediction), for predicting ancestral sequences from phylogenetic trees and the corresponding multiple sequence alignments. Alignments may be of any size and contain gaps. GASP first assigns the positions of gaps in the phylogeny before using a likelihood-based approach centred on amino acid substitution matrices to assign ancestral amino acids. Important outgroup information is used by first working down from the tips of the tree to the root, using descendant data only to assign probabilities, and then working back up from the root to the tips using descendant and outgroup data to make predictions. GASP was tested on a number of simulated datasets based on real phylogenies. Prediction accuracy for ungapped data was similar to three alternative algorithms tested, with GASP performing better in some cases and worse in others. Adding simple insertions and deletions to the simulated data did not have a detrimental effect on GASP accuracy. Conclusions: GASP (Gapped Ancestral Sequence Prediction) will predict ancestral sequences from multiple protein alignments of any size. Although not as accurate in all cases as some of the more sophisticated maximum likelihood approaches, it can process a wide range of input phylogenies and will predict ancestral sequences for gapped and ungapped residues alike. PMID:15350199
Knowles, Lacey L; Klimov, Pavel B
2011-11-01
With the increased availability of multilocus sequence data, the lack of concordance of gene trees estimated for independent loci has focused attention on both the biological processes producing the discord and the methodologies used to estimate phylogenetic relationships. What has emerged is a suite of new analytical tools for phylogenetic inference--species tree approaches. In contrast to traditional phylogenetic methods that are stymied by the idiosyncrasies of gene trees, approaches for estimating species trees explicitly take into account the cause of discord among loci and, in the process, provide a direct estimate of phylogenetic history (i.e. the history of species divergence, not divergence of specific loci). We illustrate the utility of species tree estimates with an analysis of a diverse group of feather mites, the pinnatus species group (genus Proctophyllodes). Discord among four sequenced nuclear loci is consistent with theoretical expectations, given the short time separating speciation events (as evident by short internodes relative to terminal branch lengths in the trees). Nevertheless, many of the relationships are well resolved in a Bayesian estimate of the species tree; the analysis also highlights ambiguous aspects of the phylogeny that require additional loci. The broad utility of species tree approaches is discussed, and specifically, their application to groups with high speciation rates--a history of diversification with particular prevalence in host/parasite systems where species interactions can drive rapid diversification.
Spooner, David M; Ruess, Holly; Iorizzo, Massimo; Senalik, Douglas; Simon, Philipp
2017-02-01
We explored the phylogenetic utility of entire plastid DNA sequences in Daucus and compared the results with prior phylogenetic results using plastid and nuclear DNA sequences. We used Illumina sequencing to obtain full plastid sequences of 37 accessions of 20 Daucus taxa and outgroups, analyzed the data with phylogenetic methods, and examined evidence for mitochondrial DNA transfer to the plastid (DcMP). Our phylogenetic trees of the entire data set were highly resolved, with 100% bootstrap support for most of the external and many of the internal clades, except for the clade of D. carota and its most closely related species D. syrticus. Subsets of the data, including regions traditionally used as phylogenetically informative regions, provide various degrees of soft congruence with the entire data set. There are areas of hard incongruence, however, with phylogenies using nuclear data. We extended knowledge of a mitochondrial to plastid DNA insertion sequence previously named DcMP and identified the first instance in flowering plants of a sequence of potential nuclear genome origin inserted into the plastid genome. There is a relationship of inverted repeat junction classes and repeat DNA to phylogeny, but no such relationship with nonsynonymous mutations. Our data have allowed us to (1) produce a well-resolved plastid phylogeny of Daucus, (2) evaluate subsets of the entire plastid data for phylogeny, (3) examine evidence for plastid and nuclear DNA phylogenetic incongruence, and (4) examine mitochondrial and nuclear DNA insertion into the plastid. © 2017 Spooner et al. Published by the Botanical Society of America. This work is licensed under a Creative Commons public domain license (CC0 1.0).
KGCAK: a K-mer based database for genome-wide phylogeny and complexity evaluation.
Wang, Dapeng; Xu, Jiayue; Yu, Jun
2015-09-16
The K-mer approach, treating genomic sequences as simple characters and counting the relative abundance of each string upon a fixed K, has been extensively applied to phylogeny inference for genome assembly, annotation, and comparison. To meet increasing demands for comparing large genome sequences and to promote the use of the K-mer approach, we develop a versatile database, KGCAK (http://kgcak.big.ac.cn/KGCAK/), containing ~8,000 genomes that include genome sequences of diverse life forms (viruses, prokaryotes, protists, animals, and plants) and cellular organelles of eukaryotic lineages. It builds phylogeny based on genomic elements in an alignment-free fashion and provides in-depth data processing enabling users to compare the complexity of genome sequences based on K-mer distribution. We hope that KGCAK becomes a powerful tool for exploring relationship within and among groups of species in a tree of life based on genomic data.
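Counting the relative abundance of each string for a fixed K can be shown in a few lines. The sketch below is not taken from KGCAK; the function name `kmer_frequencies` and the choice to skip windows containing characters other than A, C, G or T are assumptions made for illustration.

```python
from collections import Counter

DNA_ALPHABET = set("ACGT")


def kmer_frequencies(seq, k):
    """Return the relative abundance of every K-mer in seq.

    Windows containing ambiguous characters (anything other than
    A, C, G, T) are skipped; this is an illustrative choice.
    """
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if set(window) <= DNA_ALPHABET:
            counts[window] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {kmer: n / total for kmer, n in counts.items()}


if __name__ == "__main__":
    # "ACGTAC" contains the 2-mers AC, CG, GT, TA, AC.
    print(kmer_frequencies("ACGTAC", 2))
```

Frequency profiles of this kind can be compared between genomes without aligning them, which is the sense in which the approach is alignment-free.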
Pneumocystis jirovecii multilocus gene sequencing: findings and implications.
Matos, Olga; Esteves, Francisco
2010-08-01
Pneumocystis jirovecii pneumonia (PcP) remains a major cause of respiratory illness among immunocompromised patients, especially patients infected with HIV, but it has also been isolated from immunocompetent persons. This article discusses the application of multilocus genotyping analysis to the study of the genetic diversity of P. jirovecii and its epidemiological and clinical parameters, and the important concepts achieved to date with these approaches. The multilocus typing studies performed until now have shown that there is an important genetic diversity of stable and ubiquitous P. jirovecii genotypes; infection with P. jirovecii is not necessarily clonal, and recombination between some P. jirovecii multilocus genotypes has been suggested. P. jirovecii-specific multilocus genotypes can be associated with severity of PcP. Patients infected with P. jirovecii, regardless of the form of infection they present with, are part of a common human reservoir for future infections. The CYB, DHFR, DHPS, mtLSU rRNA, SOD and the ITS loci are suitable genetic targets to be used in further epidemiological studies focused on the identification and characterization of P. jirovecii haplotypes correlated with drug resistance and PcP outcome.
Correcting for sequencing error in maximum likelihood phylogeny inference.
Kuhner, Mary K; McGill, James
2014-11-04
Accurate phylogenies are critical to taxonomy as well as studies of speciation processes and other evolutionary patterns. Accurate branch lengths in phylogenies are critical for dating and rate measurements. Such accuracy may be jeopardized by unacknowledged sequencing error. We use simulated data to test a correction for DNA sequencing error in maximum likelihood phylogeny inference. Over a wide range of data polymorphism and true error rate, we found that correcting for sequencing error improves recovery of the branch lengths, even if the assumed error rate is up to twice the true error rate. Low error rates have little effect on recovery of the topology. When error is high, correction improves topological inference; however, when error is extremely high, using an assumed error rate greater than the true error rate leads to poor recovery of both topology and branch lengths. The error correction approach tested here was proposed in 2004 but has not been widely used, perhaps because researchers do not want to commit to an estimate of the error rate. This study shows that correction with an approximate error rate is generally preferable to ignoring the issue. Copyright © 2014 Kuhner and McGill.
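The abstract does not spell out the form of the correction. One common way to account for sequencing error in likelihood calculations, assumed here and not confirmed as the authors' exact method, is to replace the usual 0/1 indicator at each tip with the probability of the observed base given each possible true base. The sketch assumes a uniform error model in which a miscalled base is equally likely to be any of the three other nucleotides; the function name `tip_likelihoods` is hypothetical.

```python
BASES = "ACGT"


def tip_likelihoods(observed, error_rate):
    """Partial likelihoods at a tip under an assumed uniform error model.

    Returns P(observed base | true base) for each true base in BASES:
    1 - error_rate when the true base matches the observation, and
    error_rate / 3 for each of the other three bases.
    """
    if observed not in BASES:
        raise ValueError(f"unexpected base: {observed!r}")
    if not 0.0 <= error_rate < 0.75:
        raise ValueError("error_rate must lie in [0, 0.75)")
    return [
        1.0 - error_rate if base == observed else error_rate / 3.0
        for base in BASES
    ]


if __name__ == "__main__":
    # Without correction the tip vector is a hard 0/1 indicator.
    print(tip_likelihoods("A", 0.0))
    # With an assumed 3% error rate, some weight moves to the other bases.
    print(tip_likelihoods("A", 0.03))
```

These vectors would take the place of the indicator vectors at the leaves of a standard pruning computation. The assumed error rate is an input, which matches the abstract's point that an approximate rate is usually better than ignoring error altogether.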
Genome-Scale Phylogeny of the Alphavirus Genus Suggests a Marine Origin
Palacios, G.; Tesh, R. B.; Savji, N.; Guzman, H.; Sherman, M.; Weaver, S. C.; Lipkin, W. I.
2012-01-01
The genus Alphavirus comprises a diverse group of viruses, including some that cause severe disease. Using full-length sequences of all known alphaviruses, we produced a robust and comprehensive phylogeny of the Alphavirus genus, presenting a more complete evolutionary history of these viruses compared to previous studies based on partial sequences. Our phylogeny suggests the origin of the alphaviruses occurred in the southern oceans and spread equally through the Old and New World. Since lice appear to be involved in aquatic alphavirus transmission, it is possible that we are missing a louse-borne branch of the alphaviruses. Complete genome sequencing of all members of the genus also revealed conserved residues forming the structural basis of the E1 and E2 protein dimers. PMID:22190718
Ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses.
Fouquier, Jennifer; Rideout, Jai Ram; Bolyen, Evan; Chase, John; Shiffer, Arron; McDonald, Daniel; Knight, Rob; Caporaso, J Gregory; Kelley, Scott T
2016-02-24
Fungi play critical roles in many ecosystems, cause serious diseases in plants and animals, and pose significant threats to human health and structural integrity problems in built environments. While most fungal diversity remains unknown, the development of PCR primers for the internal transcribed spacer (ITS) combined with next-generation sequencing has substantially improved our ability to profile fungal microbial diversity. Although the high sequence variability in the ITS region facilitates more accurate species identification, it also makes multiple sequence alignment and phylogenetic analysis unreliable across evolutionarily distant fungi because the sequences are hard to align accurately. To address this issue, we created ghost-tree, a bioinformatics tool that integrates sequence data from two genetic markers into a single phylogenetic tree that can be used for diversity analyses. Our approach starts with a "foundation" phylogeny based on one genetic marker whose sequences can be aligned across organisms spanning divergent taxonomic groups (e.g., fungal families). Then, "extension" phylogenies are built for more closely related organisms (e.g., fungal species or strains) using a second more rapidly evolving genetic marker. These smaller phylogenies are then grafted onto the foundation tree by mapping taxonomic names such that each corresponding foundation-tree tip would branch into its new "extension tree" child. We applied ghost-tree to graft fungal extension phylogenies derived from ITS sequences onto a foundation phylogeny derived from fungal 18S sequences. Our analysis of simulated and real fungal ITS data sets found that phylogenetic distances between fungal communities computed using ghost-tree phylogenies explained significantly more variance than non-phylogenetic distances. 
The phylogenetic metrics also improved our ability to distinguish small differences (effect sizes) between microbial communities, though results were similar to non-phylogenetic methods for larger effect sizes. The Silva/UNITE-based ghost tree presented here can be easily integrated into existing fungal analysis pipelines to enhance the resolution of fungal community differences and improve understanding of these communities in built environments. The ghost-tree software package can also be used to develop phylogenetic trees for other marker gene sets that afford different taxonomic resolution, or for bridging genome trees with amplicon trees. ghost-tree is pip-installable. All source code, documentation, and test code are available under the BSD license at https://github.com/JTFouquier/ghost-tree .
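The grafting step described above can be sketched with a toy tree representation. This is not the ghost-tree package's API: trees are modeled as nested tuples of tip names, branch lengths are ignored, and the names `graft`, `to_newick` and the example taxa are hypothetical. Each foundation tip that has a matching extension tree is replaced by that subtree; tips with no extension are kept as they are.

```python
def graft(foundation, extensions):
    """Replace foundation tips with extension subtrees matched by name.

    foundation: nested tuples of tip-name strings (a toy tree).
    extensions: dict mapping a foundation tip name to a nested-tuple tree.
    """
    if isinstance(foundation, str):
        return extensions.get(foundation, foundation)
    return tuple(graft(child, extensions) for child in foundation)


def to_newick(tree):
    """Serialize a nested-tuple tree as a Newick string without lengths."""
    if isinstance(tree, str):
        return tree
    return "(" + ",".join(to_newick(child) for child in tree) + ")"


if __name__ == "__main__":
    # Foundation tree from a slowly evolving marker (tips are genera).
    foundation = (("Aspergillus", "Penicillium"), "Candida")
    # Extension tree from a faster marker (tips are species).
    extensions = {"Aspergillus": ("A_niger", "A_flavus")}
    print(to_newick(graft(foundation, extensions)) + ";")
    # prints (((A_niger,A_flavus),Penicillium),Candida);
```

Matching by taxonomic name is what lets sequences that cannot be aligned across the whole foundation tree still be placed within it.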
USDA-ARS's Scientific Manuscript database
The ARS Culture Collection (NRRL) currently contains 7569 strains within the family Streptomycetaceae but 4368 of them have not been characterized to the species level. A gene sequence database using the Bacterial Isolate Genomic Sequence Database package (BIGSdb) (Jolley & Maiden, 2010) is availabl...
Taxonomic evaluation of Streptomyces albus and related species using multilocus sequence analysis
USDA-ARS's Scientific Manuscript database
In phylogenetic analyses of the genus Streptomyces using 16S rRNA gene sequences, Streptomyces albus subsp. albus NRRL B-1811T formed a cluster with 5 other species having identical or nearly identical 16S rRNA gene sequences. Moreover, the morphological and physiological characteristics of these ot...
Gottscho, Andrew D.; Wood, Dustin A.; Vandergast, Amy; Lemos Espinal, Julio A.; Gatesy, John; Reeder, Tod
2017-01-01
Multi-locus nuclear DNA data were used to delimit species of fringe-toed lizards of the Uma notata complex, which are specialized for living in wind-blown sand habitats in the deserts of southwestern North America, and to infer whether Quaternary glacial cycles or Tertiary geological events were important in shaping the historical biogeography of this group. We analyzed ten nuclear loci collected using Sanger sequencing and genome-wide sequence and single-nucleotide polymorphism (SNP) data collected using restriction-associated DNA (RAD) sequencing. A combination of species discovery methods (concatenated phylogenies, parametric and non-parametric clustering algorithms) and species validation approaches (coalescent-based species tree/isolation-with-migration models) were used to delimit species, infer phylogenetic relationships, and to estimate effective population sizes, migration rates, and speciation times. Uma notata, U. inornata, U. cowlesi, and an undescribed species from Mohawk Dunes, Arizona (U. sp.) were supported as distinct in the concatenated analyses and by clustering algorithms, and all operational taxonomic units were decisively supported as distinct species by ranking hierarchical nested speciation models with Bayes factors based on coalescent-based species tree methods. However, significant unidirectional gene flow (2NM >1) from U. cowlesi and U. notata into U. rufopunctata was detected under the isolation-with-migration model. Therefore, we conservatively delimit four species-level lineages within this complex (U. inornata, U. notata, U. cowlesi, and U. sp.), treating U. rufopunctata as a hybrid population (U. notata x cowlesi). Both concatenated and coalescent-based estimates of speciation times support the hypotheses that speciation within the complex occurred during the late Pleistocene, and that the geological evolution of the Colorado River delta during this period was an important process shaping the observed phylogeographic patterns.
McRobb, Evan; Sarovich, Derek S; Price, Erin P; Kaestli, Mirjam; Mayo, Mark; Keim, Paul; Currie, Bart J
2015-04-01
Melioidosis, a disease of public health importance in Southeast Asia and northern Australia, is caused by the Gram-negative soil bacillus Burkholderia pseudomallei. Melioidosis is typically acquired through environmental exposure, and case clusters are rare, even in regions where the disease is endemic. B. pseudomallei is classed as a tier 1 select agent by the Centers for Disease Control and Prevention; from a biodefense perspective, source attribution is vital in an outbreak scenario to rule out a deliberate release. Two cases of melioidosis within a 3-month period at a residence in rural northern Australia prompted an investigation to determine the source of exposure. B. pseudomallei isolates from the property's groundwater supply matched the multilocus sequence type of the clinical isolates. Whole-genome sequencing confirmed the water supply as the probable source of infection in both cases, with the clinical isolates differing from the likely infecting environmental strain by just one single nucleotide polymorphism (SNP) each. For the first time, we report a phylogenetic analysis of genomewide insertion/deletion (indel) data, an approach conventionally viewed as problematic due to high mutation rates and homoplasy. Our whole-genome indel analysis was concordant with the SNP phylogeny, and these two combined data sets provided greater resolution and a better fit with our epidemiological chronology of events. Collectively, this investigation represents a highly accurate account of source attribution in a melioidosis outbreak and gives further insight into a frequently overlooked reservoir of B. pseudomallei. Our methods and findings have important implications for outbreak source tracing of this bacterium and other highly recombinogenic pathogens. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Unravelling the Genetic Diversity among Cassava Bemisia tabaci Whiteflies Using NextRAD Sequencing.
Wosula, Everlyne N; Chen, Wenbo; Fei, Zhangjun; Legg, James P
2017-11-01
Bemisia tabaci threatens production of cassava in Africa through vectoring viruses that cause cassava mosaic disease (CMD) and cassava brown streak disease (CBSD). B. tabaci sampled from cassava in eight countries in Africa were genotyped using NextRAD sequencing, and their phylogeny and population genetics were investigated using the resultant single nucleotide polymorphism (SNP) markers. SNP marker data and short sequences of mitochondrial DNA cytochrome oxidase I (mtCOI) obtained from the same insect were compared. Eight genetically distinct groups were identified based on mtCOI, whereas phylogenetic analysis using SNPs identified six major groups, which were further confirmed by PCA and multidimensional analyses. STRUCTURE analysis identified four ancestral B. tabaci populations that have contributed alleles to the six SNP-based groups. Significant gene flows were detected between several of the six SNP-based groups. Evidence of gene flow was strongest for SNP-based groups occurring in central Africa. Comparison of the mtCOI and SNP identities of sampled insects provided a strong indication that hybrid populations are emerging in parts of Africa recently affected by the severe CMD pandemic. This study reveals that mtCOI is not an effective marker at distinguishing cassava-colonizing B. tabaci haplogroups, and that more robust SNP-based multilocus markers should be developed. Significant gene flows between populations could lead to the emergence of haplogroups that might alter the dynamics of cassava virus spread and disease severity in Africa. Continuous monitoring of genetic compositions of whitefly populations should be an essential component in efforts to combat cassava viruses in Africa. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
McRobb, Evan; Kaestli, Mirjam; Mayo, Mark; Keim, Paul
2015-01-01
Melioidosis, a disease of public health importance in Southeast Asia and northern Australia, is caused by the Gram-negative soil bacillus Burkholderia pseudomallei. Melioidosis is typically acquired through environmental exposure, and case clusters are rare, even in regions where the disease is endemic. B. pseudomallei is classed as a tier 1 select agent by the Centers for Disease Control and Prevention; from a biodefense perspective, source attribution is vital in an outbreak scenario to rule out a deliberate release. Two cases of melioidosis within a 3-month period at a residence in rural northern Australia prompted an investigation to determine the source of exposure. B. pseudomallei isolates from the property's groundwater supply matched the multilocus sequence type of the clinical isolates. Whole-genome sequencing confirmed the water supply as the probable source of infection in both cases, with the clinical isolates differing from the likely infecting environmental strain by just one single nucleotide polymorphism (SNP) each. For the first time, we report a phylogenetic analysis of genomewide insertion/deletion (indel) data, an approach conventionally viewed as problematic due to high mutation rates and homoplasy. Our whole-genome indel analysis was concordant with the SNP phylogeny, and these two combined data sets provided greater resolution and a better fit with our epidemiological chronology of events. Collectively, this investigation represents a highly accurate account of source attribution in a melioidosis outbreak and gives further insight into a frequently overlooked reservoir of B. pseudomallei. Our methods and findings have important implications for outbreak source tracing of this bacterium and other highly recombinogenic pathogens. PMID:25631791
DOE Office of Scientific and Technical Information (OSTI.GOV)
Genomic diversity within the haloalkaliphilic genus Thioalkalivibrio
Ahn, Anne-Catherine; Meier-Kolthoff, Jan P.; Overmars, Lex; ...
2017-03-10
Thioalkalivibrio is a genus of obligate chemolithoautotrophic haloalkaliphilic sulfur-oxidizing bacteria. Their habitats are soda lakes, which are dual extreme environments with a pH range from 9.5 to 11 and salt concentrations up to saturation. More than 100 strains of this genus have been isolated from various soda lakes all over the world, but only ten species have been effectively described yet. Therefore, the assignment of the remaining strains to either existing or novel species is important and will further elucidate their genomic diversity as well as give a better general understanding of this genus. Recently, the genomes of 76 Thioalkalivibrio strains were sequenced. On these, we applied different methods including (i) 16S rRNA gene sequence analysis, (ii) Multilocus Sequence Analysis (MLSA) based on eight housekeeping genes, (iii) Average Nucleotide Identity based on BLAST (ANIb) and MUMmer (ANIm), (iv) Tetranucleotide frequency correlation coefficients (TETRA), (v) digital DNA:DNA hybridization (dDDH) as well as (vi) nucleotide- and amino acid-based Genome BLAST Distance Phylogeny (GBDP) analyses. We detected a high genomic diversity by revealing 15 new "genomic" species and 16 new "genomic" subspecies in addition to the ten already described species. Phylogenetic and phylogenomic analyses showed that the genus is not monophyletic, because four strains were clearly separated from the other Thioalkalivibrio by type strains from other genera. Therefore, it is recommended to classify the latter group as a novel genus. The biogeographic distribution of Thioalkalivibrio suggested that the different "genomic" species can be classified as candidate disjunct or candidate endemic species. This study is a detailed genome-based classification and identification of members within the genus Thioalkalivibrio. However, future phenotypical and chemotaxonomical studies will be needed for a full species description of this genus.
Recapitulating phylogenies using k-mers: from trees to networks.
Bernard, Guillaume; Ragan, Mark A; Chan, Cheong Xin
2016-01-01
Ernst Haeckel based his landmark Tree of Life on the supposed ontogenic recapitulation of phylogeny, i.e. that successive embryonic stages during the development of an organism re-trace the morphological forms of its ancestors over the course of evolution. Much of this idea has since been discredited. Today, phylogenies are often based on families of molecular sequences. The standard approach starts with a multiple sequence alignment, in which the sequences are arranged relative to each other in a way that maximises a measure of similarity position-by-position along their entire length. A tree (or sometimes a network) is then inferred. Rigorous multiple sequence alignment is computationally demanding, and evolutionary processes that shape the genomes of many microbes (bacteria, archaea and some morphologically simple eukaryotes) can add further complications. In particular, recombination, genome rearrangement and lateral genetic transfer undermine the assumptions that underlie multiple sequence alignment, and imply that a tree-like structure may be too simplistic. Here, using genome sequences of 143 bacterial and archaeal genomes, we construct a network of phylogenetic relatedness based on the number of shared k-mers (subsequences at fixed length k). Our findings suggest that the network captures not only key aspects of microbial genome evolution as inferred from a tree, but also features that are not treelike. The method is highly scalable, allowing for investigation of genome evolution across a large number of genomes. Instead of using specific regions or sequences from genome sequences, or indeed Haeckel's idea of ontogeny, we argue that genome phylogenies can be inferred using k-mers from whole-genome sequences. Representing these networks dynamically allows biological questions of interest to be formulated and addressed quickly and in a visually intuitive manner.
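A network built from shared k-mers can be sketched as a list of weighted edges between genomes. The code below is a simplification and not the authors' pipeline: it counts distinct k-mers present in both genomes (set intersection) and ignores multiplicity and reverse complements, and the names `kmer_set`, `shared_kmer_edges` and the threshold `min_shared` are hypothetical.

```python
from itertools import combinations


def kmer_set(seq, k):
    """Return the set of distinct k-mers (substrings of length k) in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def shared_kmer_edges(genomes, k, min_shared=1):
    """Build network edges weighted by the number of shared distinct k-mers.

    genomes: dict mapping a genome name to its sequence string.
    Returns a list of (name_a, name_b, shared_count) tuples for every
    pair sharing at least min_shared k-mers.
    """
    sets = {name: kmer_set(seq, k) for name, seq in genomes.items()}
    edges = []
    for a, b in combinations(sorted(sets), 2):
        shared = len(sets[a] & sets[b])
        if shared >= min_shared:
            edges.append((a, b, shared))
    return edges


if __name__ == "__main__":
    toy_genomes = {"g1": "ACGTAC", "g2": "CGTACC", "g3": "TTTTTT"}
    print(shared_kmer_edges(toy_genomes, k=3))
    # prints [('g1', 'g2', 3)]
```

Because every pair of genomes can be connected, a genome may link strongly to members of more than one group, which is the kind of non-treelike signal a single tree cannot show.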
Schröder, Christiane; Bleidorn, Christoph; Hartmann, Stefanie; Tiedemann, Ralph
2009-12-15
Investigating the dog genome we found 178965 introns with a moderate length of 200-1000 bp. A screening of these sequences against 23 different repeat libraries to find insertions of short interspersed elements (SINEs) detected 45276 SINEs. Virtually all of these SINEs (98%) belong to the tRNA-derived Can-SINE family. Can-SINEs arose about 55 million years ago before Carnivora split into two basal groups, the Caniformia (dog-like carnivores) and the Feliformia (cat-like carnivores). Genome comparisons of dog and cat recovered 506 putatively informative SINE loci for caniformian phylogeny. In this study we show how to use such genome information of model organisms to research the phylogeny of related non-model species of interest. Investigating a dataset including representatives of all major caniformian lineages, we analysed 24 randomly chosen loci for 22 taxa. All loci were amplifiable and revealed 17 parsimony-informative SINE insertions. The screening for informative SINE insertions yields a large amount of sequence information, in particular of introns, which contain reliable phylogenetic information as well. A phylogenetic analysis of intron- and SINE sequence data provided a statistically robust phylogeny which is congruent with the absence/presence pattern of our SINE markers. This phylogeny strongly supports a sistergroup relationship of Musteloidea and Pinnipedia. Within Pinnipedia, we see strong support from bootstrapping and the presence of a SINE insertion for a sistergroup relationship of the walrus with the Otariidae.
den Bakker, Henk C; Manuel, Clyde S; Fortes, Esther D; Wiedmann, Martin; Nightingale, Kendra K
2013-09-01
Twenty Listeria-like isolates were obtained from environmental samples collected on a cattle ranch in northern Colorado; all of these isolates were found to share an identical partial sigB sequence, suggesting close relatedness. The isolates were similar to members of the genus Listeria in that they were Gram-stain-positive, short rods, oxidase-negative and catalase-positive; the isolates were similar to Listeria fleischmannii because they were non-motile at 25 °C. 16S rRNA gene sequencing for representative isolates and whole genome sequencing for one isolate were performed. The genome of the type strain of Listeria fleischmannii (strain LU2006-1(T)) was also sequenced. The draft genomes were very similar in size and the average MUMmer nucleotide identity across 91% of the genomes was 95.16%. Genome sequence data were used to design primers for a six-gene multi-locus sequence analysis (MLSA) scheme. Phylogenies based on (i) the near-complete 16S rRNA gene, (ii) 31 core genes and (iii) six housekeeping genes illustrated the close relationship of these Listeria-like isolates to Listeria fleischmannii LU2006-1(T). Sufficient genetic divergence of the Listeria-like isolates from the type strain of Listeria fleischmannii and differing phenotypic characteristics warrant these isolates to be classified as members of a distinct infraspecific taxon, for which the name Listeria fleischmannii subsp. coloradonensis subsp. nov. is proposed. The type strain is TTU M1-001(T) (=BAA-2414(T) =DSM 25391(T)). The isolates of Listeria fleischmannii subsp. coloradonensis subsp. nov. differ from the nominate subspecies by the inability to utilize melezitose, turanose and sucrose, and the ability to utilize inositol. The results also demonstrate the utility of whole genome sequencing to facilitate identification of novel taxa within a well-described genus. The genomes of both subspecies of Listeria fleischmannii contained putative enhancin genes; the Listeria fleischmannii subsp. coloradonensis subsp. nov. genome also encoded a putative mosquitocidal toxin. The presence of these genes suggests possible adaptation to an insect host, and further studies are needed to probe niche adaptation of Listeria fleischmannii.
Thirugnanasambantham, Krishnaraj; Saravanan, Subramanian; Karikalan, Kulandaivelu; Bharanidharan, Rajaraman; Lalitha, Perumal; Ilango, S; HairulIslam, Villianur Ibrahim
2015-10-01
Momordica charantia (bitter gourd, bitter melon) is a monoecious member of the Cucurbitaceae with anti-oxidant, anti-microbial, anti-viral and anti-diabetic potential. Molecular studies on this economically valuable plant are essential to understand its phylogeny and evolution. MicroRNAs (miRNAs) are conserved, small, non-coding RNAs that regulate gene expression by binding the 3' UTR of target mRNAs and that evolve at different rates in different plant species. In this study we used a homology-based computational approach and identified 27 mature miRNAs for the first time from this biomedically important plant. The phylogenetic tree built from binary presence/absence data for the identified miRNAs was found to be uncertain and biased. Most of the identified miRNAs were highly conserved among plant species, and sequence-based phylogenetic analysis of the miRNAs resolved these difficulties. Predicted gene targets of the identified miRNAs revealed their importance in the regulation of plant developmental processes. The reported miRNAs showed sequence conservation in the mature miRNAs, and detailed phylogenetic analysis of pre-miRNA sequences revealed genus-specific segregation of clusters. Copyright © 2015 Elsevier Ltd. All rights reserved.
Phylogeny, divergence time and historical biogeography of Laetiporus (Basidiomycota, Polyporales).
Song, Jie; Cui, Bao-Kai
2017-04-20
The aim of this study was to characterize the molecular relationship, origin and historical biogeography of the species in important brown rot fungal genus Laetiporus from East Asia, Europe, Pan-America, Hawaii and South Africa. We used six genetic markers to estimate a genus-level phylogeny including (1) the internal transcribed spacer (ITS), (2) nuclear large subunit rDNA (nrLSU), (3) nuclear small subunit rDNA (nrSSU), (4) translation elongation factor 1-α (EF-1α), (5) DNA-directed RNA polymerase II subunit 2 (RPB2), and (6) mitochondrial small subunit rDNA (mtSSU). Results of multi-locus phylogenetic analyses show clade support for at least seventeen species-level lineages including two new Laetiporus in China. Molecular dating using BEAST estimated the present crown group diverged approximately 20.16 million years ago (Mya) in the early Miocene. Biogeographic analyses using RASP indicated that Laetiporus most likely originated in temperate zones with East Asia and North America having the highest probability (48%) of being the ancestral area. Four intercontinental dispersal routes and a possible concealed dispersal route were established for the first time.
A “Shallow Phylogeny” of Shallow Barnacles (Chthamalus)
Wares, John P.; Pankey, M. Sabrina; Pitombo, Fabio; Daglio, Liza Gómez; Achituv, Yair
2009-01-01
Background: We present a multi-locus phylogenetic analysis of the shallow water (high intertidal) barnacle genus Chthamalus, focusing on member species in the western hemisphere. Understanding the phylogeny of this group improves interpretation of classical ecological work on competition, distributional changes associated with climate change, and the morphological evolution of complex cirripede phenotypes. Methodology and Findings: We use traditional and Bayesian phylogenetic and ‘deep coalescent’ approaches to identify a phylogeny that supports the monophyly of the mostly American ‘fissus group’ of Chthamalus, but that also supports a need for taxonomic revision of Chthamalus and Microeuraphia. Two deep phylogeographic breaks were also found within the range of two tropical American taxa (C. angustitergum and C. southwardorum). Conclusions: Our data, which include two novel gene regions for phylogenetic analysis of cirripedes, suggest that much more evaluation of the morphological evolutionary history and taxonomy of Chthamalid barnacles is necessary. These data and associated analyses also indicate that the radiation of species in the late Pliocene and Pleistocene was very rapid, and may provide new insights toward speciation via transient allopatry or ecological barriers. PMID:19440543
Ritz, C M; Reiker, J; Charles, G; Hoxey, P; Hunt, D; Lowry, M; Stuppy, W; Taylor, N
2012-11-01
The cacti of tribe Tephrocacteae (Cactaceae-Opuntioideae) are adapted to diverse climatic conditions over a wide area of the southern Andes and adjacent lowlands. They exhibit a range of life forms from geophytes and cushion-plants to dwarf shrubs, shrubs or small trees. To confirm or challenge previous morphology-based classifications and molecular phylogenies, we sampled DNA sequences from the chloroplast trnK/matK region and the nuclear low copy gene phyC and compared the resulting phylogenies with previous data gathered from nuclear ribosomal DNA sequences. The chloroplast and nuclear low copy gene phylogenies presented here were mutually congruent and broadly coincident with the classification based on gross morphology and seed micro-morphology and anatomy. Reconstruction of hypothetical ancestral character states suggested that geophytes and cushion-forming species probably evolved several times from dwarf shrubby precursors. We also traced an increase of embryo size at the expense of the nucellus-derived storage tissue during the evolution of the Tephrocacteae, which is thought to be an evolutionary advantage because nutrients are then more rapidly accessible for the germinating embryo. In contrast to these highly concordant phylogenies, nuclear ribosomal DNA data sampled by a previous study yielded conflicting phylogenetic signals. Secondary structure predictions of ribosomal transcribed spacers suggested that this phylogeny is strongly influenced by the inclusion of paralogous sequences that probably arose by genome duplication during the evolution of this plant group. Copyright © 2012 Elsevier Inc. All rights reserved.
Brun, Sophie; Madrid, Hugo; Gerrits Van Den Ende, Bert; Andersen, Birgitte; Marinach-Patrice, Carine; Mazier, Dominique; De Hoog, G Sybren
2013-01-01
The genus Alternaria includes numerous phytopathogenic species, many of which are economically relevant. Traditionally, identification has been based on morphology, but is often hampered by the tendency of some strains to become sterile in culture and by the existence of species-complexes of morphologically similar taxa. This study aimed to assess if strains of four closely-related plant pathogens, i.e., Alternaria dauci (ten strains), Alternaria porri (six), Alternaria solani (ten), and Alternaria tomatophila (ten), could be identified accurately using multilocus phylogenetic analysis and Matrix-Assisted Laser Desorption Ionisation Time of Flight (MALDI-TOF) profiling of proteins. Phylogenetic analyses were performed on three loci, i.e., the internal transcribed spacer (ITS) region of rRNA, and the glyceraldehyde-3-phosphate dehydrogenase (gpd) and Alternaria major antigen (Alt a 1) genes. Phylogenetic trees based on ITS sequences did not differentiate strains of A. solani, A. tomatophila, and A. porri, but these three species formed a clade separate from strains of A. dauci. The resolution improved in trees based on gpd and Alt a 1, which distinguished strains of the four species as separate clades. However, none provided significant bootstrap support for all four species, which could only be achieved when results for the three loci were combined. MALDI-TOF-based dendrograms showed three major clusters. The first comprised all A. dauci strains, the second included five strains of A. porri and one of A. solani, and the third included all strains of A. tomatophila, as well as all but one strain of A. solani, and one strain of A. porri. Thus, this study shows the usefulness of MALDI-TOF mass spectrometry as a promising tool for identification of these four species of Alternaria which are closely-related plant pathogens. Copyright © 2012 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
Cavalier-Smith, Thomas
2015-04-01
Contradictory and confusing results can arise if sequenced 'monoprotist' samples really contain DNA of very different species. Eukaryote-wide phylogenetic analyses using five genes from the amoeboflagellate culture ATCC 50646 previously implied it was an undescribed percolozoan related to percolatean flagellates (Stephanopogon, Percolomonas). Contrastingly, three phylogenetic analyses of 18S rRNA alone, did not place it within Percolozoa, but as an isolated deep-branching excavate. I resolve that contradiction by sequence phylogenies for all five genes individually, using up to 652 taxa. Its 18S rRNA sequence (GQ377652) is near-identical to one from stained-glass windows, somewhat more distant from one from cooling-tower water, all three related to terrestrial actinocephalid gregarines Hoplorhynchus and Pyxinia. All four protein-gene sequences (Hsp90; α-tubulin; β-tubulin; actin) are from an amoeboflagellate heterolobosean percolozoan, not especially deeply branching. Contrary to previous conclusions from trees combining protein and rRNA sequences or rDNA trees including Eozoa only, this culture does not represent a major novel deep-branching eukaryote lineage distinct from Heterolobosea, and thus lacks special significance for deep eukaryote phylogeny, though the rDNA sequence is important for gregarine phylogeny. α-Tubulin trees for over 250 eukaryotes refute earlier suggestions of lateral gene transfer within eukaryotes, being largely congruent with morphology and other gene trees. Copyright © 2015. Published by Elsevier GmbH.
Killgore, George; Thompson, Angela; Johnson, Stuart; Brazier, Jon; Kuijper, Ed; Pepin, Jacques; Frost, Eric H; Savelkoul, Paul; Nicholson, Brad; van den Berg, Renate J; Kato, Haru; Sambol, Susan P; Zukowski, Walter; Woods, Christopher; Limbago, Brandi; Gerding, Dale N; McDonald, L Clifford
2008-02-01
Using 42 isolates contributed by laboratories in Canada, The Netherlands, the United Kingdom, and the United States, we compared the results of analyses done with seven Clostridium difficile typing techniques: multilocus variable-number tandem-repeat analysis (MLVA), amplified fragment length polymorphism (AFLP), surface layer protein A gene sequence typing (slpAST), PCR-ribotyping, restriction endonuclease analysis (REA), multilocus sequence typing (MLST), and pulsed-field gel electrophoresis (PFGE). We assessed the discriminating ability and typeability of each technique as well as the agreement among techniques in grouping isolates by allele profile A (AP-A) through AP-F, which are defined by toxinotype, the presence of the binary toxin gene, and deletion in the tcdC gene. We found that all isolates were typeable by all techniques and that discrimination index scores for the techniques tested ranged from 0.964 to 0.631 in the following order: MLVA, REA, PFGE, slpAST, PCR-ribotyping, MLST, and AFLP. All the techniques were able to distinguish the current epidemic strain of C. difficile (BI/027/NAP1) from other strains. All of the techniques showed multiple types for AP-A (toxinotype 0, binary toxin negative, and no tcdC gene deletion). REA, slpAST, MLST, and PCR-ribotyping all included AP-B (toxinotype III, binary toxin positive, and an 18-bp deletion in tcdC) in a single group that excluded other APs. PFGE, AFLP, and MLVA grouped two, one, and two different non-AP-B isolates, respectively, with their AP-B isolates. All techniques appear to be capable of detecting outbreak strains, but only REA and MLVA showed sufficient discrimination to distinguish strains from different outbreaks.
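The abstract above ranks typing techniques by a discrimination index but does not say how the index was computed. A common choice is Simpson's index of diversity in the Hunter and Gaston formulation, D = 1 - [1 / (N(N-1))] * sum_j n_j(n_j - 1), where N is the number of isolates and n_j the number of isolates assigned to type j. The sketch below assumes that formula; the function name and the toy type labels are illustrative only and are not data from the study.

```python
from collections import Counter


def discriminatory_index(type_labels):
    """Simpson's index of diversity (Hunter-Gaston form).

    type_labels holds one type assignment per isolate. The result is the
    probability that two isolates drawn without replacement are assigned
    to different types by the typing method.
    """
    n = len(type_labels)
    if n < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(type_labels)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1 - same_type_pairs / (n * (n - 1))


# Toy example: four isolates, two of which share a type.
d_index = discriminatory_index(["A", "A", "B", "C"])
```

A method that gives every isolate its own type scores 1.0, and a method that lumps all isolates into one type scores 0.0, so values can be compared across techniques only when they are applied to the same isolate panel.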
Gorgé, Olivier; Lopez, Stéphanie; Hilaire, Valérie; Lisanti, Olivier; Ramisse, Vincent; Vergnaud, Gilles
2008-01-01
The Shigella genus has historically been separated into four species, based on biochemical assays. The classification within each species relies on serotyping. Recently, genome sequencing and DNA assays, in particular the multilocus sequence typing (MLST) approach, greatly improved the current knowledge of the origin and phylogenetic evolution of Shigella spp. The Shigella and Escherichia genera are now considered to belong to a unique genomospecies. Multilocus variable-number tandem-repeat (VNTR) analysis (MLVA) provides valuable polymorphic markers for genotyping and performing phylogenetic analyses of highly homogeneous bacterial pathogens. Here, we assess the capability of MLVA for Shigella typing. Thirty-two potentially polymorphic VNTRs were selected by analyzing in silico five Shigella genomic sequences and subsequently evaluated. Eventually, a panel of 15 VNTRs was selected (i.e., MLVA15 analysis). MLVA15 analysis of 78 strains or genome sequences of Shigella spp. and 11 strains or genome sequences of Escherichia coli distinguished 83 genotypes. Shigella population cluster analysis gave consistent results compared to MLST. MLVA15 analysis showed capabilities for E. coli typing, providing classification among pathogenic and nonpathogenic E. coli strains included in the study. The resulting data can be queried on our genotyping webpage (http://mlva.u-psud.fr). The MLVA15 assay is rapid, highly discriminatory, and reproducible for Shigella and Escherichia strains, suggesting that it could significantly contribute to epidemiological trace-back analysis of Shigella infections and pathogenic Escherichia outbreaks. Typing was performed on strains obtained mostly from collections. Further studies should include strains of much more diverse origins, including all pathogenic E. coli types. PMID:18216214
USDA-ARS's Scientific Manuscript database
The predominantly holarctic bee genus Osmia is species-rich and behaviorally diverse. A robust phylogeny of this genus is important for understanding the evolution of the immense variety of morphological and behavioral traits exhibited by this group. We infer a phylogeny of Osmia using DNA sequenc...
Chavda, Kalyan D; Chen, Liang; Fouts, Derrick E; Sutton, Granger; Brinkac, Lauren; Jenkins, Stephen G; Bonomo, Robert A; Adams, Mark D; Kreiswirth, Barry N
2016-12-13
Knowledge regarding the genomic structure of Enterobacter spp., the second most prevalent carbapenemase-producing Enterobacteriaceae, remains limited. Here we sequenced 97 clinical Enterobacter species isolates that were both carbapenem susceptible and resistant from various geographic regions to decipher the molecular origins of carbapenem resistance and to understand the changing phylogeny of these emerging and drug-resistant pathogens. Of the carbapenem-resistant isolates, 30 possessed blaKPC-2, 40 had blaKPC-3, 2 had blaKPC-4, and 2 had blaNDM-1. Twenty-three isolates were carbapenem susceptible. Six genomes were sequenced to completion, and their sizes ranged from 4.6 to 5.1 Mbp. Phylogenomic analysis placed 96 of these genomes, 351 additional Enterobacter genomes downloaded from NCBI GenBank, and six newly sequenced type strains into 19 phylogenomic groups: 18 groups (A to R) in the Enterobacter cloacae complex and Enterobacter aerogenes. Diverse mechanisms underlying the molecular evolutionary trajectory of these drug-resistant Enterobacter spp. were revealed, including the acquisition of an antibiotic resistance plasmid, followed by clonal spread, horizontal transfer of blaKPC-harboring plasmids between different phylogenomic groups, and repeated transposition of the blaKPC gene among different plasmid backbones. Group A, which comprises multilocus sequence type 171 (ST171), was the most commonly identified (23% of isolates). Genomic analysis showed that ST171 isolates evolved from a common ancestor and formed two different major clusters, each acquiring unique blaKPC-harboring plasmids, followed by clonal expansion.
The data presented here represent the first comprehensive study of phylogenomic interrogation and the relationship between antibiotic resistance and plasmid discrimination among carbapenem-resistant Enterobacter spp., demonstrating the genetic diversity and complexity of the molecular mechanisms driving antibiotic resistance in this genus. Enterobacter spp., especially carbapenemase-producing Enterobacter spp., have emerged as a clinically significant cause of nosocomial infections. However, only limited information is available on the distribution of carbapenem resistance across this genus. Augmenting this problem is an erroneous identification of Enterobacter strains because of ambiguous typing methods and imprecise taxonomy. In this study, we used a whole-genome-based comparative phylogenetic approach to (i) revisit and redefine the genus Enterobacter and (ii) unravel the emergence and evolution of the Klebsiella pneumoniae carbapenemase-harboring Enterobacter spp. Using genomic analysis of 447 sequenced strains, we developed an improved understanding of the species designations within this complex genus and identified the diverse mechanisms driving the molecular evolution of carbapenem resistance. The findings in this study provide a solid genomic framework that will serve as an important resource in the future development of molecular diagnostics and in supporting drug discovery programs. Copyright © 2016 Chavda et al.
[Standard algorithm of molecular typing of Yersinia pestis strains].
Eroshenko, G A; Odinokov, G N; Kukleva, L M; Pavlova, A I; Krasnov, Ia M; Shavina, N Iu; Guseva, N P; Vinogradova, N A; Kutyrev, V V
2012-01-01
Development of the standard algorithm of molecular typing of Yersinia pestis that ensures establishing of subspecies, biovar and focus membership of the studied isolate. Determination of the characteristic strain genotypes of plague infectious agent of main and nonmain subspecies from various natural foci of plague of the Russian Federation and the near abroad. Genotyping of 192 natural Y. pestis strains of main and nonmain subspecies was performed by using PCR methods, multilocus sequencing and multilocus analysis of variable tandem repeat number. A standard algorithm of molecular typing of plague infectious agent including several stages of Yersinia pestis differentiation by membership: in main and nonmain subspecies, various biovars of the main subspecies, specific subspecies; natural foci and geographic territories was developed. The algorithm is based on 3 typing methods--PCR, multilocus sequence typing and multilocus analysis of variable tandem repeat number using standard DNA targets--life support genes (terC, ilvN, inv, glpD, napA, rhaS and araC) and 7 loci of variable tandem repeats (ms01, ms04, ms06, ms07, ms46, ms62, ms70). The effectiveness of the developed algorithm is shown on the large number of natural Y. pestis strains. Characteristic sequence types of Y. pestis strains of various subspecies and biovars as well as MLVA7 genotypes of strains from natural foci of plague of the Russian Federation and the near abroad were established. The application of the developed algorithm will increase the effectiveness of epidemiologic monitoring of plague infectious agent, and analysis of epidemics and outbreaks of plague with establishing the source of origin of the strain and routes of introduction of the infection.
New Vibrio species associated to molluscan microbiota: a review
Romalde, Jesús L.; Dieguez, Ana L.; Lasa, Aide; Balboa, Sabela
2014-01-01
The genus Vibrio consists of more than 100 species grouped in 14 clades that are widely distributed in aquatic environments such as estuarine, coastal waters, and sediments. A large number of species of this genus are associated with marine organisms like fish, molluscs and crustaceans, in commensal or pathogenic relations. In the last decade, more than 50 new species have been described in the genus Vibrio, due to the introduction of new molecular techniques in bacterial taxonomy, such as multilocus sequence analysis or fluorescent amplified fragment length polymorphism. On the other hand, the increasing number of environmental studies has contributed to improve the knowledge about the family Vibrionaceae and its phylogeny. Vibrio crassostreae, V. breoganii, V. celticus are some of the new Vibrio species described as forming part of the molluscan microbiota. Some of them have been associated with mortalities of different molluscan species, seriously affecting their culture and causing high losses in hatcheries as well as in natural beds. For other species, ecological importance has been demonstrated being highly abundant in different marine habitats and geographical regions. The present work provides an updated overview of the recently characterized Vibrio species isolated from molluscs. In addition, their pathogenic potential and/or environmental importance is discussed. PMID:24427157
Song, Bao-Hua; Windsor, Aaron J.; Schmid, Karl J.; Ramos-Onsins, Sebastian; Schranz, M. Eric; Heidel, Andrew J.; Mitchell-Olds, Thomas
2009-01-01
Information about polymorphism, population structure, and linkage disequilibrium (LD) is crucial for association studies of complex trait variation. However, most genomewide studies have focused on model systems, with very few analyses of undisturbed natural populations. Here, we sequenced 86 mapped nuclear loci for a sample of 46 genotypes of Boechera stricta and two individuals of B. holboellii, both wild relatives of Arabidopsis. Isolation by distance was significant across the species range of B. stricta, and three geographic groups were identified by structure analysis, principal coordinates analysis, and distance-based phylogeny analyses. The allele frequency spectrum indicated a genomewide deviation from an equilibrium neutral model, with silent nucleotide diversity averaging 0.004. LD decayed rapidly, declining to background levels in ∼10 kb or less. For tightly linked SNPs separated by <1 kb, LD was dependent on the reference population. LD was lower in the specieswide sample than within populations, suggesting that low levels of LD found in inbreeding species such as B. stricta, Arabidopsis thaliana, and barley may result from broad geographic sampling that spans heterogeneous genetic groups. Finally, analyses also showed that inbreeding B. stricta and A. thaliana have ∼45% higher recombination per kilobase than outcrossing A. lyrata. PMID:19104077
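Nucleotide diversity, reported above as an average of 0.004 at silent sites, is the mean number of pairwise differences per site among sampled sequences. The following is a minimal sketch of that calculation over a toy alignment. It assumes equal-length, gap-free sequences and counts every site, so it does not reproduce the silent-site restriction used in the study; the function name and the sequences are invented for illustration.

```python
from itertools import combinations


def nucleotide_diversity(aligned_seqs):
    """Mean pairwise differences per site over all pairs of sequences.

    Assumes the sequences are already aligned, have equal length, and
    contain no gaps or ambiguity codes (a simplification).
    """
    n = len(aligned_seqs)
    if n < 2:
        raise ValueError("at least two sequences are required")
    length = len(aligned_seqs[0])
    if length == 0 or any(len(s) != length for s in aligned_seqs):
        raise ValueError("sequences must be non-empty and of equal length")
    total_differences = 0
    n_pairs = 0
    for a, b in combinations(aligned_seqs, 2):
        total_differences += sum(x != y for x, y in zip(a, b))
        n_pairs += 1
    return total_differences / (n_pairs * length)


# Toy alignment: three sequences, four sites, one variable site.
toy_alignment = ["ACGT", "ACGA", "ACGT"]
pi = nucleotide_diversity(toy_alignment)
```

In the toy alignment two of the three pairs differ at one of four sites, giving 2 / (3 x 4), or about 0.167.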
USDA-ARS's Scientific Manuscript database
Phylogenetic analyses of species of Streptomyces based on 16S rRNA gene sequences resulted in a statistically well-supported clade (100% bootstrap value) containing 8 species having very similar gross morphology. These species, including Streptomyces bambergiensis, Streptomyces chlorus, Streptomyces...
[Multilocus Sequence Typing analysis of human Campylobacter coli in Granada (Spain)].
Carrillo-Ávila, J A; Sorlózano-Puerto, A; Pérez-Ruiz, M; Gutiérrez-Fernández, J
2016-12-01
Different subtypes of Campylobacter spp. have been associated with diarrhoea and a Multilocus Sequence Typing (MLST) method has been performed for subtyping. In the present work, MLST was used to analyse the genetic diversity of eight strains of Campylobacter coli. Nineteen genetic markers were amplified for MLST analysis: AnsB, DmsA, ggt, Cj1585c, CJJ81176-1367/1371, Tlp7, cj1321-cj1326, fucP, cj0178, cj0755/cfrA, ceuE, pldA, cstII, cstIII. After comparing the obtained sequences with the Campylobacter MLST database, the allele numbers, sequence types (STs) and clonal complexes (CCs) were assigned. The 8 C. coli isolates yielded 4 different STs belonging to 2 CCs. Seven isolates belong to the ST-828 clonal complex and only one isolate belongs to ST-21. Two samples came from the same patient, but were isolated in two different periods of time. MLST can be useful for taxonomic characterization of C. coli isolates.
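The assignment step described above, in which allele numbers are combined into a profile and looked up to obtain a sequence type and clonal complex, can be sketched as a simple table lookup. Everything in the example is hypothetical: the locus names, allele numbers, and ST/CC labels are placeholders, and a real analysis would query the curated Campylobacter MLST database rather than a hard-coded dictionary.

```python
# Hypothetical seven-locus scheme; locus names are placeholders.
LOCI = ("locus1", "locus2", "locus3", "locus4", "locus5", "locus6", "locus7")

# Toy lookup table: allele-number profile -> (sequence type, clonal complex).
# All values are invented for illustration.
ST_TABLE = {
    (1, 1, 1, 1, 1, 1, 1): ("ST-A", "CC-A"),
    (1, 1, 2, 1, 1, 1, 1): ("ST-B", "CC-A"),
    (5, 4, 3, 2, 1, 6, 7): ("ST-C", "CC-B"),
}


def assign_sequence_type(profile, loci=LOCI, table=ST_TABLE):
    """Map a {locus: allele_number} profile to (ST, CC).

    Returns (None, None) when the allele combination is not in the table,
    which in practice would indicate a novel or unassigned profile.
    """
    key = tuple(profile[locus] for locus in loci)
    return table.get(key, (None, None))


isolate = dict(zip(LOCI, (1, 1, 2, 1, 1, 1, 1)))
st, cc = assign_sequence_type(isolate)
```

In this toy table, two STs that differ at a single locus are placed in the same clonal complex, which mirrors how several STs can be grouped into one CC.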
Charpentier, Elena; Garnaud, Cécile; Wintenberger, Claire; Bailly, Sébastien; Murat, Jean-Benjamin; Rendu, John; Pavese, Patricia; Drouet, Thibault; Augier, Caroline; Malvezzi, Paolo; Thiébaut-Bertrand, Anne; Mallaret, Marie-Reine; Epaulard, Olivier; Cornet, Muriel; Larrat, Sylvie; Maubon, Danièle
2017-08-01
Pneumocystis jirovecii is a major threat for immunocompromised patients, and clusters of pneumocystis pneumonia (PCP) have been increasingly described in transplant units during the past decade. Exploring an outbreak transmission network requires complementary spatiotemporal and strain-typing approaches. We analyzed a PCP outbreak and demonstrated the added value of next-generation sequencing (NGS) for the multilocus sequence typing (MLST) study of P. jirovecii strains. Thirty-two PCP patients were included. Among the 12 solid organ transplant patients, 5 shared a major and unique genotype that was also found as a minor strain in a sixth patient. A transmission map analysis strengthened the suspicion of nosocomial acquisition of this strain for the 6 patients. NGS-MLST enables accurate determination of subpopulation, which allowed excluding other patients from the transmission network. NGS-MLST genotyping approach was essential to deciphering this outbreak. This innovative approach brings new insights for future epidemiologic studies on this uncultivable opportunistic fungus. PMID:28726611
Wang, Tao; Li, Hua; Wang, Hua; Su, Jing
2015-04-16
The present study established a typing method with NotI-based pulsed-field gel electrophoresis (PFGE) and stress response gene schemed multilocus sequence typing (MLST) for 55 Oenococcus oeni strains isolated from six individual regions in China and two model strains PSU-1 (CP000411) and ATCC BAA-1163 (AAUV00000000). Seven stress response genes, cfa, clpL, clpP, ctsR, mleA, mleP and omrA, were selected for MLST testing, and positive selective pressure was detected for these genes. Furthermore, both methods separated the strains into two clusters. The PFGE clusters are correlated with the region, whereas the sequence types (STs) formed by the MLST confirm the two clusters identified by PFGE. In addition, the population structure was a mixture of evolutionary pathways, and the strains exhibited both clonal and panmictic characteristics. Copyright © 2015 Elsevier B.V. All rights reserved.
Bouchez, Valérie; Guglielmini, Julien; Dazas, Mélody; Landier, Annie; Toubiana, Julie; Guillot, Sophie; Criscuolo, Alexis; Brisse, Sylvain
2018-06-01
Bordetella pertussis causes whooping cough, a highly contagious respiratory disease that is reemerging in many world regions. The spread of antigen-deficient strains may threaten acellular vaccine efficacy. Dynamics of strain transmission are poorly defined because of shortcomings in current strain genotyping methods. Our objective was to develop a whole-genome genotyping strategy with sufficient resolution for local epidemiologic questions and sufficient reproducibility to enable international comparisons of clinical isolates. We defined a core genome multilocus sequence typing scheme comprising 2,038 loci and demonstrated its congruence with whole-genome single-nucleotide polymorphism variation. Most cases of intrafamilial groups of isolates or of multiple isolates recovered from the same patient were distinguished from temporally and geographically cocirculating isolates. However, epidemiologically unrelated isolates were sometimes nearly undistinguishable. We set up a publicly accessible core genome multilocus sequence typing database to enable global comparisons of B. pertussis isolates, opening the way for internationally coordinated surveillance.
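Comparing isolates under a core genome MLST scheme usually comes down to counting the loci at which two allele profiles differ. The sketch below shows one way to do that while skipping loci that were not called in one of the isolates. The locus names, allele numbers, and the use of `None` for a missing call are assumptions made for illustration and are not taken from the published scheme or its database.

```python
def allelic_distance(profile_a, profile_b):
    """Count differing loci between two cgMLST allele profiles.

    Profiles are {locus: allele_number} dicts; None marks a locus with no
    allele call. Loci missing or uncalled in either profile are skipped.
    Returns (n_mismatches, n_loci_compared).
    """
    compared = 0
    mismatches = 0
    for locus in profile_a.keys() & profile_b.keys():
        a, b = profile_a[locus], profile_b[locus]
        if a is None or b is None:
            continue  # uncalled locus: carries no information for this pair
        compared += 1
        if a != b:
            mismatches += 1
    return mismatches, compared


# Toy profiles over four hypothetical loci (the real scheme has 2,038).
iso1 = {"loc1": 1, "loc2": 3, "loc3": 2, "loc4": None}
iso2 = {"loc1": 1, "loc2": 4, "loc3": 2, "loc4": 7}
result = allelic_distance(iso1, iso2)
```

Returning the number of loci actually compared alongside the mismatch count makes it possible to tell a genuinely close pair from one that only looks close because many loci were uncalled.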
Eight new Arthrinium species from China
Wang, Mei; Tan, Xiao-Ming; Liu, Fang; Cai, Lei
2018-01-01
The genus Arthrinium includes important plant pathogens, endophytes and saprobes with a wide host range and geographic distribution. In this paper, 74 Arthrinium strains isolated from various substrates such as bamboo leaves, tea plants, soil and air from karst caves in China were examined using a multi-locus phylogeny based on a combined dataset of ITS rDNA, TEF1 and TUB2, in conjunction with morphological characters, host association and ecological distribution. Eight new species were described based on their distinct phylogenetic relationships and morphological characters. Our results indicated a high species diversity of Arthrinium with wide host ranges, amongst which, Poaceae and Cyperaceae were the major host plant families of Arthrinium species. PMID:29755262
2013-01-01
aquatic plants and subsequent ecological consequences. The authors of this technical note have linked avian vacuolar myelinopathy (AVM), a disease...additional cyanobacteria sequences to determine designations for probe development, to advance understanding of the species’ phylogeny , and to lay...groundwork for its formal description. Phylogeny data confirm that the species is in section V, order Stigonematales. Phylogeny also infers that the
Ghosh, Jayadri Sekhar; Bhattacharya, Samik; Pal, Amita
2017-06-01
The unavailability of the reproductive structure and unpredictability of vegetative characters for the identification and phylogenetic study of bamboo prompted the application of molecular techniques for greater resolution and consensus. We first employed internal transcribed spacer (ITS1, 5.8S rRNA and ITS2) sequences to construct the phylogenetic tree of 21 tropical bamboo species. While the sequence alone could grossly reconstruct the traditional phylogeny amongst the 21-tropical species studied, some anomalies were encountered that prompted a further refinement of the phylogenetic analyses. Therefore, we integrated the secondary structure of the ITS sequences to derive individual sequence-structure matrix to gain more resolution on the phylogenetic reconstruction. The results showed that ITS sequence-structure is the reliable alternative to the conventional phenotypic method for the identification of bamboo species. The best-fit topology obtained by the sequence-structure based phylogeny over the sole sequence based one underscores closer clustering of all the studied Bambusa species (Sub-tribe Bambusinae), while Melocanna baccifera, which belongs to Sub-Tribe Melocanneae, disjointedly clustered as an out-group within the consensus phylogenetic tree. In this study, we demonstrated the dependability of the combined (ITS sequence+structure-based) approach over the only sequence-based analysis for phylogenetic relationship assessment of bamboo.
Phylogenetic relationships in the family Streptomycetaceae using multi-locus sequence analysis
USDA-ARS's Scientific Manuscript database
The family Streptomycetaceae, notably species in the genus Streptomyces, have long been the subject of investigation due to their well-known ability to produce secondary metabolites. The emergence of drug resistant pathogens and the relative ease of producing genome sequences has renewed the importa...
'Inter-Arrival Time' Inspired Algorithm and its Application in Clustering and Molecular Phylogeny
NASA Astrophysics Data System (ADS)
Kolekar, Pandurang S.; Kale, Mohan M.; Kulkarni-Kale, Urmila
2010-10-01
Bioinformatics, being a multidisciplinary field, applies methods from allied areas of science to data mining using computational approaches. Clustering and molecular phylogeny are key areas in bioinformatics that help in the study of the classification and evolution of organisms. Molecular phylogeny algorithms can be divided into distance-based and character-based methods. Most of these methods, however, depend on pre-alignment of sequences and become computationally intensive as the size of the data increases, and hence demand efficient alternative approaches. The 'inter-arrival time distribution' (IATD) is a popular concept in the theory of stochastic system modeling, but its potential in molecular data analysis has not been fully explored. The present study reports the application of IATD in bioinformatics for clustering and molecular phylogeny. The proposed method provides IATDs of nucleotides in genomic sequences. A distance function based on statistical parameters of IATDs is proposed, and the distance matrix thus obtained is used for clustering and molecular phylogeny. The method is applied to a dataset of 3' non-coding region (NCR) sequences of Dengue virus type 3 (DENV-3), subtype III, reported in 2008. The phylogram thus obtained revealed the geographical distribution of DENV-3 isolates. Sri Lankan DENV-3 isolates were further observed to cluster in two sub-clades corresponding to the pre- and post-Dengue hemorrhagic fever emergence groups. These results are consistent with those reported earlier, which were obtained using pre-aligned sequence data as input. These findings encourage applications of the IATD-based method in molecular phylogenetic analysis in particular and data mining in general.
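The alignment-free idea in the abstract above can be illustrated with a minimal Python sketch. The abstract does not say which statistical parameters of the IATDs enter the distance function, so the choice here (mean and standard deviation of the gaps for each nucleotide, compared by Euclidean distance) is an assumption made for illustration, and all function names are hypothetical.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, pstdev

NUCLEOTIDES = "ACGT"

def inter_arrival_times(seq, base):
    """Gaps between successive occurrences of `base` in `seq`."""
    positions = [i for i, ch in enumerate(seq.upper()) if ch == base]
    return [b - a for a, b in zip(positions, positions[1:])]

def iatd_features(seq):
    """Mean and standard deviation of the gaps for each nucleotide.

    Assumption: these two summary statistics stand in for the unspecified
    'statistical parameters of IATDs' mentioned in the abstract.
    """
    features = []
    for base in NUCLEOTIDES:
        gaps = inter_arrival_times(seq, base)
        if gaps:
            features.extend([mean(gaps), pstdev(gaps)])
        else:
            # A base seen fewer than twice has no gaps to summarize.
            features.extend([0.0, 0.0])
    return features

def iatd_distance(seq_a, seq_b):
    """Euclidean distance between the two feature vectors (assumed metric)."""
    fa, fb = iatd_features(seq_a), iatd_features(seq_b)
    return sqrt(sum((x - y) ** 2 for x, y in zip(fa, fb)))

def distance_matrix(seqs):
    """Symmetric distance matrix (nested dicts) for a {name: sequence} mapping."""
    names = list(seqs)
    matrix = {name: {name: 0.0} for name in names}
    for a, b in combinations(names, 2):
        d = iatd_distance(seqs[a], seqs[b])
        matrix[a][b] = matrix[b][a] = d
    return matrix
```

The resulting matrix could be passed to any distance-based clustering or tree-building routine; no pre-alignment is needed because each sequence is summarized independently.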
Fast alignment-free sequence comparison using spaced-word frequencies.
Leimeister, Chris-Andre; Boden, Marcus; Horwege, Sebastian; Lindner, Sebastian; Morgenstern, Burkhard
2014-07-15
Alignment-free methods for sequence comparison are increasingly used for genome analysis and phylogeny reconstruction; they circumvent various difficulties of traditional alignment-based approaches. In particular, alignment-free methods are much faster than pairwise or multiple alignments. They are, however, less accurate than methods based on sequence alignment. Most alignment-free approaches work by comparing the word composition of sequences. A well-known problem with these methods is that neighbouring word matches are far from independent. To reduce the statistical dependency between adjacent word matches, we propose to use 'spaced words', defined by patterns of 'match' and 'don't care' positions, for alignment-free sequence comparison. We describe a fast implementation of this approach using recursive hashing and bit operations, and we show that further improvements can be achieved by using multiple patterns instead of single patterns. To evaluate our approach, we use spaced-word frequencies as a basis for fast phylogeny reconstruction. Using real-world and simulated sequence data, we demonstrate that our multiple-pattern approach produces better phylogenies than approaches relying on contiguous words. Our program is freely available at http://spaced.gobics.de/. © The Author 2014. Published by Oxford University Press.
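A minimal Python sketch of spaced-word counting may help make the 'match' / 'don't care' patterns concrete. It is deliberately naive: the abstract describes a fast implementation based on recursive hashing and bit operations, which is not reproduced here. The pattern encoding ('1' for match, '0' for don't care), the Euclidean distance, and the averaging over patterns are assumptions for illustration, not the published program's interface.

```python
from collections import Counter
from math import sqrt

def spaced_words(seq, pattern):
    """Yield, for each window of len(pattern), the characters at the '1' positions."""
    match_positions = [i for i, flag in enumerate(pattern) if flag == "1"]
    span = len(pattern)
    for start in range(len(seq) - span + 1):
        window = seq[start:start + span]
        yield "".join(window[i] for i in match_positions)

def spaced_word_frequencies(seq, pattern):
    """Relative frequency of every spaced word of `seq` under `pattern`."""
    counts = Counter(spaced_words(seq, pattern))
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()} if total else {}

def frequency_distance(seq_a, seq_b, patterns):
    """Euclidean distance between frequency vectors, averaged over several patterns."""
    total = 0.0
    for pattern in patterns:
        fa = spaced_word_frequencies(seq_a, pattern)
        fb = spaced_word_frequencies(seq_b, pattern)
        words = set(fa) | set(fb)
        total += sqrt(sum((fa.get(w, 0.0) - fb.get(w, 0.0)) ** 2 for w in words))
    return total / len(patterns)

# With pattern "101" the middle position of each 3-letter window is ignored.
print(list(spaced_words("ACGTAC", "101")))  # ['AG', 'CT', 'GA', 'TC']
```

Passing several patterns to `frequency_distance` mirrors the multiple-pattern idea in the abstract: each pattern samples different positions, so the averaged distance depends less on any single choice of pattern.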
Kelley, Scott T; Cassirer, E Frances; Weiser, Glen C; Safaee, Shirin
2007-01-01
Wild and domestic animal populations are known to be sources and reservoirs of emerging diseases. There is also a growing recognition that horizontal genetic transfer (HGT) plays an important role in bacterial pathogenesis. We used molecular phylogenetic methods to assess diversity and cross-transmission rates of Pasteurellaceae bacteria in populations of bighorn sheep, Dall's sheep, domestic sheep and domestic goats. Members of the Pasteurellaceae cause an array of deadly illnesses including bacterial pneumonia known as "pasteurellosis", a particularly devastating disease for bighorn sheep. A phylogenetic analysis of a combined dataset of two RNA genes (16S ribosomal RNA and RNAse P RNA) revealed remarkable evolutionary diversity among Pasteurella trehalosi and Mannheimia (Pasteurella) haemolytica bacteria isolated from sheep and goats. Several phylotypes appeared to associate with particular host species, though we found numerous instances of apparent cross-transmission among species and populations. Statistical analyses revealed that host species, geographic locale and biovariant classification, but not virulence, correlated strongly with Pasteurellaceae phylogeny. Sheep host species correlated with P. trehalosi isolates phylogeny (PTP test; P=0.002), but not with the phylogeny of M. haemolytica isolates, suggesting that P. trehalosi bacteria may be more host specific. With regards to populations within species, we also discovered a strong correlation between geographic locale and isolate phylogeny in the Rocky Mountain bighorn sheep (PTP test; P=0.001). We also investigated the potential for HGT of the leukotoxin A (lktA) gene, which produces a toxin that plays an integral role in causing disease. Comparative analysis of the combined RNA gene phylogeny and the lktA phylogenies revealed considerable incongruence between the phylogenies, suggestive of HGT. 
Furthermore, we found identical lktA alleles in unrelated bacterial species, some of which had been isolated from sheep in distantly removed populations. For example, lktA sequences from P. trehalosi isolated from remote Alaskan Dall's sheep were 100% identical over a 900-nucleotide stretch to sequences determined from M. haemolytica isolated from domestic sheep in the UK. This extremely high degree of sequence similarity of lktA sequences among distinct bacterial species suggests that HGT has played a role in the evolution of lktA in wild hosts.
The multilocus sequence typing network: mlst.net.
Aanensen, David M; Spratt, Brian G
2005-07-01
The unambiguous characterization of strains of a pathogen is crucial for addressing questions relating to its epidemiology, population and evolutionary biology. Multilocus sequence typing (MLST), which defines strains from the sequences at seven house-keeping loci, has become the method of choice for molecular typing of many bacterial and fungal pathogens (and non-pathogens), and MLST schemes and strain databases are available for a growing number of prokaryotic and eukaryotic organisms. Sequence data are ideal for strain characterization as they are unambiguous, meaning strains can readily be compared between laboratories via the Internet. Laboratories undertaking MLST can quickly progress from sequencing the seven gene fragments to characterizing their strains and relating them to those submitted by others and to the population as a whole. We provide the gateway to a number of MLST schemes, each of which contain a set of tools for the initial characterization of strains, and methods for relating query strains to other strains of the species, including clustering based on differences in allelic profiles, phylogenetic trees based on concatenated sequences, and a recently developed method (eBURST) for identifying clonal complexes within a species and displaying the overall structure of the population. This network of MLST websites is available at http://www.mlst.net.
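The workflow described above (seven allele numbers form an allelic profile, the profile maps to a sequence type, and related types are grouped) can be sketched in Python. This is a toy illustration under stated assumptions: the profile table is invented, the function names are hypothetical, and the grouping simply links profiles that differ at one locus. It follows the spirit of eBURST but omits founder prediction and everything else the real tool does.

```python
def allelic_distance(profile_a, profile_b):
    """Number of loci at which two allelic profiles differ."""
    return sum(a != b for a, b in zip(profile_a, profile_b))

def assign_sequence_type(profile, st_table):
    """Return the ST for a known profile, or None for a novel profile."""
    return st_table.get(tuple(profile))

def single_locus_variants(query, st_table):
    """STs whose profile differs from `query` at exactly one locus."""
    return sorted(st for prof, st in st_table.items()
                  if allelic_distance(query, prof) == 1)

def clonal_groups(st_table):
    """Group STs connected by chains of single-locus variants (union-find)."""
    profiles = list(st_table.items())
    parent = {st: st for _, st in profiles}

    def find(st):
        while parent[st] != st:
            parent[st] = parent[parent[st]]  # path compression
            st = parent[st]
        return st

    for i, (prof_a, st_a) in enumerate(profiles):
        for prof_b, st_b in profiles[i + 1:]:
            if allelic_distance(prof_a, prof_b) == 1:
                parent[find(st_a)] = find(st_b)

    groups = {}
    for _, st in profiles:
        groups.setdefault(find(st), []).append(st)
    return sorted(sorted(group) for group in groups.values())

# Invented seven-locus profiles, for illustration only.
ST_TABLE = {
    (1, 1, 1, 1, 1, 1, 1): 1,
    (1, 1, 1, 1, 1, 1, 2): 2,      # single-locus variant of ST 1
    (1, 1, 1, 1, 1, 3, 2): 3,      # single-locus variant of ST 2
    (5, 6, 7, 8, 9, 10, 11): 4,    # unrelated
}

print(assign_sequence_type((1, 1, 1, 1, 1, 1, 2), ST_TABLE))  # 2
print(clonal_groups(ST_TABLE))                                 # [[1, 2, 3], [4]]
```

Because profiles are plain tuples of integers, they can be exchanged and compared between laboratories without ambiguity, which is the portability argument the abstract makes for sequence-based typing.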
Msaddak, Abdelhakim; Rejili, Mokhtar; Durán, David; Rey, Luis; Imperial, Juan; Palacios, Jose Manuel; Ruiz-Argüeso, Tomas; Mars, Mohamed
2017-06-01
The genetic diversity of bacterial populations nodulating Lupinus luteus (yellow lupine) in Northern Tunisia was examined. Phylogenetic analyses of 43 isolates based on recA and gyrB partial sequences grouped them in three clusters, two of which belong to genus Bradyrhizobium (41 isolates) and one, remarkably, to Microvirga (2 isolates), a genus never previously described as microsymbiont of this lupine species. Representatives of the three clusters were analysed in-depth by multilocus sequence analysis of five housekeeping genes (rrs, recA, glnII, gyrB and dnaK). Surprisingly, the Bradyrhizobium cluster with the two isolates LluI4 and LluTb2 may constitute a new species defined by a separate position between Bradyrhizobium manausense and B. denitrificans. A nodC-based phylogeny identified only two groups: one formed by Bradyrhizobium strains included in the symbiovar genistearum and the other by the Microvirga strains. Symbiotic behaviour of representative isolates was tested, and among the seven legumes inoculated only one difference was observed, i.e., the Bradyrhizobium strains nodulated Ornithopus compressus, unlike the two strains of Microvirga. On the basis of these data, we conclude that L. luteus root nodule symbionts in Northern Tunisia are mostly strains within the B. canariense/B. lupini lineages, and the remaining strains belong to two groups not previously identified as L. luteus endosymbionts: one corresponding to a new clade of Bradyrhizobium and the other to the genus Microvirga. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
USDA-ARS's Scientific Manuscript database
Fifty-eight fusaria isolated from 52 Italian patients between 2004 and 2007 were subject to multilocus DNA sequence typing to characterize the spectrum of species and circulating sequence types (STs) associated with dermatological infections, especially onychomycoses and paronychia, and other fusari...
Pathogenic Leptospira Species in Insectivorous Bats, China, 2015.
Han, Hui-Ju; Wen, Hong-Ling; Liu, Jian-Wei; Qin, Xiang-Rong; Zhao, Min; Wang, Li-Jun; Luo, Li-Mei; Zhou, Chuan-Min; Zhu, Ye-Lei; Qi, Rui; Li, Wen-Qian; Yu, Hao; Yu, Xue-Jie
2018-06-01
PCR amplification of the rrs2 gene indicated that 50% (62/124) of insectivorous bats from eastern China were infected with Leptospira borgpetersenii, L. kirschneri, and several potentially new Leptospira species. Multilocus sequence typing defined 3 novel sequence types in L. kirschneri, suggesting that bats are major carriers of Leptospira.
Kress, W John; Erickson, David L; Swenson, Nathan G; Thompson, Jill; Uriarte, Maria; Zimmerman, Jess K
2010-11-09
Species number, functional traits, and phylogenetic history all contribute to characterizing the biological diversity in plant communities. The phylogenetic component of diversity has been particularly difficult to quantify in species-rich tropical tree assemblages. The compilation of previously published (and often incomplete) data on evolutionary relationships of species into a composite phylogeny of the taxa in a forest, through such programs as Phylomatic, has proven useful in building community phylogenies although often of limited resolution. Recently, DNA barcodes have been used to construct a robust community phylogeny for nearly 300 tree species in a forest dynamics plot in Panama using a supermatrix method. In that study sequence data from three barcode loci were used to generate a well-resolved species-level phylogeny. Here we expand upon this earlier investigation and present results on the use of a phylogenetic constraint tree to generate a community phylogeny for a diverse, tropical forest dynamics plot in Puerto Rico. This enhanced method of phylogenetic reconstruction insures the congruence of the barcode phylogeny with broadly accepted hypotheses on the phylogeny of flowering plants (i.e., APG III) regardless of the number and taxonomic breadth of the taxa sampled. We also compare maximum parsimony versus maximum likelihood estimates of community phylogenetic relationships as well as evaluate the effectiveness of one- versus two- versus three-gene barcodes in resolving community evolutionary history. As first demonstrated in the Panamanian forest dynamics plot, the results for the Puerto Rican plot illustrate that highly resolved phylogenies derived from DNA barcode sequence data combined with a constraint tree based on APG III are particularly useful in comparative analysis of phylogenetic diversity and will enhance research on the interface between community ecology and evolution.
Yu, Li; Li, Yi-Wei; Ryder, Oliver A; Zhang, Ya-Ping
2007-10-24
Background Despite the small number of ursid species, bear phylogeny has long been a focus of study due to their conservation value, as all bear genera have been classified as endangered at either the species or subspecies level. The Ursidae family represents a typical example of rapid evolutionary radiation. Previous analyses with a single mitochondrial (mt) gene or a small number of mt genes either provide weak support or a large unresolved polytomy for ursids. We revisit the contentious relationships within Ursidae by analyzing complete mt genome sequences and evaluating the performance of both entire mt genomes and constituent mtDNA genes in recovering a phylogeny of extremely recent speciation events. Results This mitochondrial genome-based phylogeny provides strong evidence that the spectacled bear diverged first, while within the genus Ursus, the sloth bear is the sister taxon of all the other five ursines. The latter group is divided into the brown bear/polar bear and the two black bears/sun bear assemblages. These findings resolve the previous conflicts between trees using partial mt genes. The ability of different categories of mt protein coding genes to recover the correct phylogeny is concordant with previous analyses for taxa with deep divergence times. This study provides a robust Ursidae phylogenetic framework for future validation by additional independent evidence, and also has significant implications for assisting in the resolution of other similarly difficult phylogenetic investigations. Conclusion Identification of base composition bias and utilization of the combined data of whole mitochondrial genome sequences has allowed recovery of a strongly supported phylogeny that is upheld when using multiple alternative outgroups for the Ursidae, a mammalian family that underwent a rapid radiation since the mid- to late Pliocene. 
It remains to be seen if the reliability of mt genome analysis will hold up in studies of other difficult phylogenetic issues. Although the whole mitochondrial DNA sequence based phylogeny is robust, it remains in conflict with phylogenetic relationships suggested by analysis of limited nuclear-encoded data, a situation that will require gathering more nuclear DNA sequence information. PMID:17956639
Delamuta, Jakeline Renata Marçon; Ribeiro, Renan Augusto; Ormeño-Orrillo, Ernesto; Parma, Marcia Maria; Melo, Itamar Soares; Martínez-Romero, Esperanza; Hungria, Mariangela
2015-12-01
Biological nitrogen fixation is a key process for agricultural production and environmental sustainability, but there are comparatively few studies of symbionts of tropical pasture legumes, as well as few described species of the genus Bradyrhizobium, although it is the predominant rhizobial genus in the tropics. A detailed polyphasic study was conducted with two strains of the genus Bradyrhizobium used in commercial inoculants for tropical pastures in Brazil, CNPSo 1112T, isolated from perennial soybean (Neonotonia wightii), and CNPSo 2833T, from desmodium (Desmodium heterocarpon). Based on 16S-rRNA gene phylogeny, both strains were grouped in the Bradyrhizobium elkanii superclade, but were not clearly clustered with any known species. Multilocus sequence analysis of three (glnII, gyrB and recA) and five (plus atpD and dnaK) housekeeping genes confirmed that the strains are positioned in two distinct clades. Comparison with intergenic transcribed spacer sequences of type strains of described species of the genus Bradyrhizobium showed similarity lower than 93.1 %, and differences were confirmed by BOX-PCR analysis. Nucleotide identity of three housekeeping genes with type strains of described species ranged from 88.1 to 96.2 %. Average nucleotide identity of genome sequences showed values below the threshold for distinct species of the genus Bradyrhizobium ( < 90.6 %), and the value between the two strains was also below this threshold (91.2 %). Analysis of nifH and nodC gene sequences positioned the two strains in a clade distinct from other species of the genus Bradyrhizobium. Morphophysiological, genotypic and genomic data supported the description of two novel species in the genus Bradyrhizobium, Bradyrhizobium tropiciagri sp. nov. (type strain CNPSo 1112T = SMS 303T = BR 1009T = SEMIA 6148T = LMG 28867T) and Bradyrhizobium embrapense sp. nov. (type strain CNPSo 2833T = CIAT 2372T = BR 2212T = SEMIA 6208T = U674T = LMG 2987).
Meats, Emma; Feil, Edward J.; Stringer, Suzanna; Cody, Alison J.; Goldstein, Richard; Kroll, J. Simon; Popovic, Tanja; Spratt, Brian G.
2003-01-01
A multilocus sequence typing (MLST) scheme has been developed for the unambiguous characterization of encapsulated and noncapsulated Haemophilus influenzae isolates. The sequences of internal fragments of seven housekeeping genes were determined for 131 isolates, comprising a diverse set of 104 serotype a, b, c, d, e, and f isolates and 27 noncapsulated isolates. Many of the encapsulated isolates had previously been characterized by multilocus enzyme electrophoresis (MLEE), and the validity of the MLST scheme was established by the very similar clustering of isolates obtained by these methods. Isolates of serotypes c, d, e, and f formed monophyletic groups on a dendrogram constructed from the differences in the allelic profiles of the isolates, whereas there were highly divergent lineages of both serotype a and b isolates. Noncapsulated isolates were distinct from encapsulated isolates and, with one exception, were within two highly divergent clusters. The relationships between the major lineages of encapsulated H. influenzae inferred from MLEE data could not be discerned on a dendrogram constructed from differences in the allelic profiles, but were apparent on a tree reconstructed from the concatenated nucleotide sequences. Recombination has not therefore completely eliminated phylogenetic signal, and in support of this, for encapsulated isolates, there was significant congruence between many of the trees reconstructed from the sequences of the seven individual loci. Congruence was less apparent for noncapsulated isolates, suggesting that the impact of recombination is greater among noncapsulated than encapsulated isolates. The H. influenzae MLST scheme is available at www.mlst.net, it allows any isolate to be compared with those in the MLST database, and (for encapsulated isolates) it assigns isolates to their phylogenetic lineage, via the Internet. PMID:12682154
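The contrast drawn above between a dendrogram built from allelic-profile differences and a tree built from concatenated nucleotide sequences can be shown with a small Python sketch. The locus names and sequences are invented, and the two distance functions are generic textbook measures rather than the procedure used in the study.

```python
def concatenate(loci_seqs, locus_order):
    """Join per-locus sequences in a fixed locus order."""
    return "".join(loci_seqs[locus] for locus in locus_order)

def p_distance(seq_a, seq_b):
    """Proportion of differing sites between two equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(seq_a, seq_b)) / len(seq_a)

def allelic_profile_distance(iso_a, iso_b, locus_order):
    """Proportion of loci whose alleles (whole sequences) differ."""
    return sum(iso_a[l] != iso_b[l] for l in locus_order) / len(locus_order)

# Hypothetical loci and toy sequences, for illustration only.
LOCI = ("locus1", "locus2", "locus3")
ISOLATE_A = {"locus1": "ACGTACGT", "locus2": "GGGGCCCC", "locus3": "TTTTAAAA"}
ISOLATE_B = {"locus1": "ACGTACGA", "locus2": "GGGGCCCC", "locus3": "TTTTAAAA"}  # 1 substitution
ISOLATE_C = {"locus1": "TGCAACGT", "locus2": "GGGGCCCC", "locus3": "TTTTAAAA"}  # 4 substitutions

# Allelic profiles see B and C as equally distant from A (one locus differs) ...
print(allelic_profile_distance(ISOLATE_A, ISOLATE_B, LOCI))
print(allelic_profile_distance(ISOLATE_A, ISOLATE_C, LOCI))

# ... while concatenated sequences separate a point mutation from a divergent allele.
print(p_distance(concatenate(ISOLATE_A, LOCI), concatenate(ISOLATE_B, LOCI)))
print(p_distance(concatenate(ISOLATE_A, LOCI), concatenate(ISOLATE_C, LOCI)))
```

In this toy case, allelic profiles count any change at a locus as one difference, whereas the concatenated sequences retain how much each allele has diverged. That is one way to read the abstract's observation that deeper lineage relationships were apparent only on the tree reconstructed from concatenated sequences.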
Roisin, S; Gaudin, C; De Mendonça, R; Bellon, J; Van Vaerenbergh, K; De Bruyne, K; Byl, B; Pouseele, H; Denis, O; Supply, P
2016-06-01
We used a two-step whole genome sequencing analysis for resolving two concurrent outbreaks in two neonatal services in Belgium, caused by exfoliative toxin A-encoding-gene-positive (eta+) methicillin-susceptible Staphylococcus aureus with an otherwise sporadic spa-type t209 (ST-109). Outbreak A involved 19 neonates and one healthcare worker in a Brussels hospital from May 2011 to October 2013. After a first episode interrupted by decolonization procedures applied over 7 months, the outbreak resumed concomitantly with the onset of outbreak B in a hospital in Asse, comprising 11 neonates and one healthcare worker from mid-2012 to January 2013. Pan-genome multilocus sequence typing, defined on the basis of 42 core and accessory reference genomes, and single-nucleotide polymorphisms mapped on an outbreak-specific de novo assembly were used to compare 28 available outbreak isolates and 19 eta+/spa-type t209 isolates identified by routine or nationwide surveillance. Pan-genome multilocus sequence typing showed that the outbreaks were caused by independent clones not closely related to any of the surveillance isolates. Isolates from only ten cases with overlapping stays in outbreak A, including four pairs of twins, showed no or only a single nucleotide polymorphism variation, indicating limited sequential transmission. Detection of larger genomic variation, even from the start of the outbreak, pointed to sporadic seeding from a pre-existing exogenous source, which persisted throughout the whole course of outbreak A. Whole genome sequencing analysis can provide unique fine-tuned insights into transmission pathways of complex outbreaks even at their inception, which, with timely use, could valuably guide efforts for early source identification. Copyright © 2016 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.
Higher-level phylogeny of paraneopteran insects inferred from mitochondrial genome sequences
Li, Hu; Shao, Renfu; Song, Nan; Song, Fan; Jiang, Pei; Li, Zhihong; Cai, Wanzhi
2015-01-01
Mitochondrial (mt) genome data have been proven to be informative for animal phylogenetic studies but may also suffer from systematic errors, due to the effects of accelerated substitution rate and compositional heterogeneity. We analyzed the mt genomes of 25 insect species from the four paraneopteran orders, aiming to better understand how accelerated substitution rate and compositional heterogeneity affect the inferences of the higher-level phylogeny of this diverse group of hemimetabolous insects. We found substantial heterogeneity in base composition and contrasting rates in nucleotide substitution among these paraneopteran insects, which complicate the inference of higher-level phylogeny. The phylogenies inferred with concatenated sequences of mt genes using maximum likelihood and Bayesian methods and homogeneous models failed to recover Psocodea and Hemiptera as monophyletic groups but grouped, instead, the taxa that had accelerated substitution rates together, including Sternorrhyncha (a suborder of Hemiptera), Thysanoptera, Phthiraptera and Liposcelididae (a family of Psocoptera). Bayesian inference with nucleotide sequences and heterogeneous models (CAT and CAT + GTR), however, recovered Psocodea, Thysanoptera and Hemiptera each as a monophyletic group. Within Psocodea, Liposcelididae is more closely related to Phthiraptera than to other species of Psocoptera. Furthermore, Thysanoptera was recovered as the sister group to Hemiptera. PMID:25704094
Tellapragada, Chaitanya; Kamthan, Aayushi; Shaw, Tushar; Ke, Vandana; Kumar, Subodh; Bhat, Vinod; Mukhopadhyay, Chiranjay
2016-01-01
There has been a slow but steady rise in the case detection rates of melioidosis from various parts of the Indian sub-continent in the past two decades. However, the epidemiology of the disease in India and the surrounding South Asian countries remains far from well elucidated. Multi-locus sequence typing (MLST) is a useful epidemiological tool to study the genetic relatedness of bacterial isolates both within and across countries. With this background, we studied the molecular epidemiology of 32 Burkholderia pseudomallei isolates (31 clinical and 1 soil isolate) obtained during 2006-2015 from various parts of south India using multi-locus sequence typing and analysis. Of the 32 isolates included in the analysis, 30 (93.7%) had novel allelic profiles that were not reported previously. Sequence type (ST) 1368 (n = 15, 46.8%) with allelic profile (1, 4, 6, 4, 1, 1, 3) was the most common genotype observed. We did not observe a genotypic association of STs with geographical location, type of infection and year of isolation in the present study. The measure of genetic differentiation (FST) between Indian and the rest of world isolates was 0.14413. The occurrence of the same ST across three adjacent states of south India suggests the dispersion of B. pseudomallei across the south western coastal part of India with limited geographical clustering. However, the majority of the STs reported from the present study remained as "outliers" on the eBURST "Population snapshot", suggesting the genetic diversity of Indian isolates from the Australasian and Southeast Asian isolates.
Molecular taxonomy and phylogeny
USDA-ARS's Scientific Manuscript database
The cyst nematodes comprise a group of sedentary endoparasitic nematodes that impact a wide range of crops in both tropical and temperate regions of the world. This chapter updates the taxonomy and phylogeny of this group and describes the nuclear protein coding, ribosomal, and mitochondrial sequenc...
Aveskamp, M.M.; de Gruyter, J.; Woudenberg, J.H.C.; Verkley, G.J.M.; Crous, P.W.
2010-01-01
Fungal taxonomists routinely encounter problems when dealing with asexual fungal species due to poly- and paraphyletic generic phylogenies, and unclear species boundaries. These problems are aptly illustrated in the genus Phoma. This phytopathologically significant fungal genus is currently subdivided into nine sections which are mainly based on a single or just a few morphological characters. However, this subdivision is ambiguous as several of the section-specific characters can occur within a single species. In addition, many teleomorph genera have been linked to Phoma, three of which are recognised here. In this study it is attempted to delineate generic boundaries, and to come to a generic circumscription which is more correct from an evolutionary point of view by means of multilocus sequence typing. Therefore, multiple analyses were conducted utilising sequences obtained from 28S nrDNA (Large Subunit - LSU), 18S nrDNA (Small Subunit - SSU), the Internal Transcribed Spacer regions 1 & 2 and 5.8S nrDNA (ITS), and part of the β-tubulin (TUB) gene region. A total of 324 strains were included in the analyses of which most belonged to Phoma taxa, whilst 54 to related pleosporalean fungi. In total, 206 taxa were investigated, of which 159 are known to have affinities to Phoma. The phylogenetic analysis revealed that the current Boeremaean subdivision is incorrect from an evolutionary point of view, revealing the genus to be highly polyphyletic. Phoma species are retrieved in six distinct clades within the Pleosporales, and appear to reside in different families. The majority of the species, however, including the generic type, clustered in a recently established family, Didymellaceae. In the second part of this study, the phylogenetic variation of the species and varieties in this clade was further assessed. Next to the genus Didymella, which is considered to be the sole teleomorph of Phoma s. 
str., we also retrieved taxa belonging to the teleomorph genera Leptosphaerulina and Macroventuria in this clade. Based on the sequence data obtained, the Didymellaceae segregate into at least 18 distinct clusters, of which many can be associated with several specific taxonomic characters. Four of these clusters were defined well enough by means of phylogeny and morphology, so that the associated taxa could be transferred to separate genera. Additionally, this study addresses the taxonomic description of eight species and two varieties that are novel to science, and the recombination of 61 additional taxa. PMID:20502538
Smith, Adam R.; Proffitt, Melissa R.; Ho, Winnie W.; Mullaney, Claire B.; Maldonado-Ocampo, Javier A.; Lovejoy, Nathan R.; Alves-Gomes, José A.; Smith, G. Troy
2018-01-01
The electric communication signals of weakly electric ghost knifefishes (Gymnotiformes: Apteronotidae) provide a valuable model system for understanding the evolution and physiology of behavior. Apteronotids produce continuous wave-type electric organ discharges (EODs) that are used for electrolocation and communication. The frequency and waveform of EODs, as well as the structure of transient EOD modulations (chirps), vary substantially across species. Understanding how these signals have evolved, however, has been hampered by the lack of a well-supported phylogeny for this family. We constructed a molecular phylogeny for the Apteronotidae by using sequence data from three genes (cytochrome c oxidase subunit 1, recombination activating gene 2, and cytochrome oxidase B) in 32 species representing 13 apteronotid genera. This phylogeny and an extensive database of apteronotid signals allowed us to examine signal evolution by using ancestral state reconstruction (ASR) and phylogenetic generalized least squares (PGLS) models. Our molecular phylogeny largely agrees with another recent sequence-based phylogeny and identified five robust apteronotid clades: (i) Sternarchorhamphus + Orthosternarchus, (ii) Adontosternarchus, (iii) Apteronotus + Parapteronotus, (iv) Sternarchorhynchus, and (v) a large clade including Porotergus, ‘Apteronotus’, Compsaraia, Sternarchogiton, Sternarchella, and Magosternarchus. We analyzed novel chirp recordings from two apteronotid species (Orthosternarchus tamandua and Sternarchorhynchus mormyrus), and combined data from these species with that from previously recorded species in our phylogenetic analyses. Some signal parameters in O. tamandua were plesiomorphic (e.g., low frequency EODs and chirps with little frequency modulation that nevertheless interrupt the EOD), suggesting that ultra-high frequency EODs and "big" chirps evolved after apteronotids diverged from other gymnotiforms. 
In contrast to previous studies, our PGLS analyses using the new phylogeny indicated the presence of phylogenetic signals in the relationships between some EOD and chirp parameters. The ASR demonstrated that most EOD and chirp parameters are evolutionarily labile and have often diversified even among closely related species. PMID:27769924
Smith, Adam R; Proffitt, Melissa R; Ho, Winnie W; Mullaney, Claire B; Maldonado-Ocampo, Javier A; Lovejoy, Nathan R; Alves-Gomes, José A; Smith, G Troy
2016-10-01
The electric communication signals of weakly electric ghost knifefishes (Gymnotiformes: Apteronotidae) provide a valuable model system for understanding the evolution and physiology of behavior. Apteronotids produce continuous wave-type electric organ discharges (EODs) that are used for electrolocation and communication. The frequency and waveform of EODs, as well as the structure of transient EOD modulations (chirps), vary substantially across species. Understanding how these signals have evolved, however, has been hampered by the lack of a well-supported phylogeny for this family. We constructed a molecular phylogeny for the Apteronotidae by using sequence data from three genes (cytochrome c oxidase subunit 1, recombination activating gene 2, and cytochrome oxidase B) in 32 species representing 13 apteronotid genera. This phylogeny and an extensive database of apteronotid signals allowed us to examine signal evolution by using ancestral state reconstruction (ASR) and phylogenetic generalized least squares (PGLS) models. Our molecular phylogeny largely agrees with another recent sequence-based phylogeny and identified five robust apteronotid clades: (i) Sternarchorhamphus+Orthosternarchus, (ii) Adontosternarchus, (iii) Apteronotus+Parapteronotus, (iv) Sternarchorhynchus, and (v) a large clade including Porotergus, 'Apteronotus', Compsaraia, Sternarchogiton, Sternarchella, and Magosternarchus. We analyzed novel chirp recordings from two apteronotid species (Orthosternarchus tamandua and Sternarchorhynchus mormyrus), and combined data from these species with that from previously recorded species in our phylogenetic analyses. Some signal parameters in O. tamandua were plesiomorphic (e.g., low frequency EODs and chirps with little frequency modulation that nevertheless interrupt the EOD), suggesting that ultra-high frequency EODs and "big" chirps evolved after apteronotids diverged from other gymnotiforms. 
In contrast to previous studies, our PGLS analyses using the new phylogeny indicated the presence of phylogenetic signals in the relationships between some EOD and chirp parameters. The ASR demonstrated that most EOD and chirp parameters are evolutionarily labile and have often diversified even among closely related species. Published by Elsevier Ltd.
Couto, Natacha; Chlebowicz, Monika A; Raangs, Erwin C; Friedrich, Alex W; Rossen, John W
2018-04-05
The emergence of nosocomial infections by multidrug-resistant Staphylococcus haemolyticus isolates has been reported in several European countries. Here, we report the first two complete genome sequences of S. haemolyticus sequence type 25 (ST25) isolates 83131A and 83131B. Both isolates were isolated from the same clinical sample and were first identified through shotgun metagenomics. Copyright © 2018 Couto et al.
A Framework Phylogeny of the American Oak Clade Based on Sequenced RAD Data
Hipp, Andrew L.; Eaton, Deren A. R.; Cavender-Bares, Jeannine; Fitzek, Elisabeth; Nipper, Rick; Manos, Paul S.
2014-01-01
Previous phylogenetic studies in oaks (Quercus, Fagaceae) have failed to resolve the backbone topology of the genus with strong support. Here, we utilize next-generation sequencing of restriction-site associated DNA (RAD-Seq) to resolve a framework phylogeny of a predominantly American clade of oaks whose crown age is estimated at 23–33 million years old. Using a recently developed analytical pipeline for RAD-Seq phylogenetics, we created a concatenated matrix of 1.40 E06 aligned nucleotides, constituting 27,727 sequence clusters. RAD-Seq data were readily combined across runs, with no difference in phylogenetic placement between technical replicates, which overlapped by only 43–64% in locus coverage. 17% (4,715) of the loci we analyzed could be mapped with high confidence to one or more expressed sequence tags in NCBI Genbank. A concatenated matrix of the loci that BLAST to at least one EST sequence provides approximately half as many variable or parsimony-informative characters as equal-sized datasets from the non-EST loci. The EST-associated matrix is more complete (fewer missing loci) and has slightly lower homoplasy than non-EST subsampled matrices of the same size, but there is no difference in phylogenetic support or relative attribution of base substitutions to internal versus terminal branches of the phylogeny. We introduce a partitioned RAD visualization method (implemented in the R package RADami; http://cran.r-project.org/web/packages/RADami) to investigate the possibility that suboptimal topologies supported by large numbers of loci—due, for example, to reticulate evolution or lineage sorting—are masked by the globally optimal tree. We find no evidence for strongly-supported alternative topologies in our study, suggesting that the phylogeny we recover is a robust estimate of large-scale phylogenetic patterns in the American oak clade. 
Our study is one of the first to demonstrate the utility of RAD-Seq data for inferring phylogeny in a 23–33 million year-old clade. PMID:24705617
Yao, Lu; Li, Hongjie; Martin, Robert D; Moreau, Corrie S; Malhi, Ripan S
2017-11-01
The biogeographical history of Southeast Asia is complicated due to the continuous emergences and disappearances of land bridges throughout the Pleistocene. Here, we use long-tailed macaques (Macaca fascicularis), which are widely distributed throughout the mainland and islands of Southeast Asia, as a model for better understanding the biogeographical patterns of diversification in this geographically complex region. A reliable intraspecific phylogeny including individuals from localities on oceanic islands, continental islands, and the mainland is needed to trace relatedness along with the pattern and timing of colonization in this region. We used high-throughput sequencing techniques to sequence mitochondrial genomes (mitogenomes) from 95 Southeast Asian M. fascicularis specimens housed at natural history museums around the world. To achieve a comprehensive picture, we more than tripled the mitogenome sample size for M. fascicularis from previous studies, and for the first time included documented samples from the Philippines and several small Indonesian islands. Confirming the result from a previous, recent intraspecific phylogeny for M. fascicularis, the newly reconstructed phylogeny of 135 specimens divides the samples into two major clades: Clade A includes haplotypes from the mainland and some from northern Sumatra, while Clade B includes all insular haplotypes along with lineages from southern Sumatra. This study resolves a previous disparity by revealing a disjunction in the origin of Sumatran macaques, with separate lineages originating within the two major clades, suggesting that at least two major migrations to Sumatra occurred. However, our dated phylogeny reveals that the two major clades split ∼1.88Ma, which is earlier than in previously published phylogenies. Our new data reveal that most Philippine macaque lineages diverged from the Borneo stock within the last ∼0.06-0.43Ma. 
Finally, our study provides insight into successful sequencing of DNA across museums and shotgun sequencing of DNA specimens as a method to sequence the mitogenome. Copyright © 2017 Elsevier Inc. All rights reserved.
Uncommonly isolated clinical Pseudomonas: identification and phylogenetic assignation.
Mulet, M; Gomila, M; Ramírez, A; Cardew, S; Moore, E R B; Lalucat, J; García-Valdés, E
2017-02-01
Fifty-two Pseudomonas strains that were difficult to identify at the species level in the phenotypic routine characterizations employed by clinical microbiology laboratories were selected for genotypic-based analysis. Species level identifications were done initially by partial sequencing of the DNA dependent RNA polymerase sub-unit D gene (rpoD). Two other gene sequences, for the small sub-unit ribosomal RNA (16S rRNA) and for DNA gyrase sub-unit B (gyrB), were added in a multilocus sequence analysis (MLSA) study to confirm the species identifications. These sequences were analyzed with a collection of reference sequences from the type strains of 161 Pseudomonas species within an in-house multi-locus sequence analysis database. Whole-cell matrix-assisted laser-desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) analyses of these strains complemented the DNA sequence-based phylogenetic analyses and were observed to be in accordance with the results of the sequence data. Twenty-three out of 52 strains were assigned to 12 recognized species not commonly detected in clinical specimens and 29 (56%) were considered representatives of at least ten putative new species. Most strains were distributed within the P. fluorescens and P. aeruginosa lineages. The value of rpoD sequences in species-level identifications for Pseudomonas is emphasized. Correct species identification of clinical strains is essential for establishing intrinsic antibiotic resistance patterns and improving treatment plans.
USDA-ARS's Scientific Manuscript database
A wild badger (Meles meles) with a severe nodular dermatitis was presented for post mortem examination. Numerous cutaneous granulomas with superficial ulceration, especially on the head, dorsum, and forearms, were found at necropsy. Histopathological examination of the skin revealed a severe ...
Diversity of the Cronobacter Genus as Revealed by Multilocus Sequence Typing
Joseph, S.; Sonbol, H.; Hariri, S.; Desai, P.; McClelland, M.
2012-01-01
Cronobacter (previously known as Enterobacter sakazakii) is a diverse bacterial genus consisting of seven species: C. sakazakii, C. malonaticus, C. turicensis, C. universalis, C. muytjensii, C. dublinensis, and C. condimenti. In this study, we have used a multilocus sequence typing (MLST) approach employing the alleles of 7 genes (atpD, fusA, glnS, gltB, gyrB, infB, and ppsA; total length, 3,036 bp) to investigate the phylogenetic relationship of 325 Cronobacter species isolates. Strains were chosen on the basis of their species, geographic and temporal distribution, source, and clinical outcome. The earliest strain was isolated from milk powder in 1950, and the earliest clinical strain was isolated in 1953. The existence of seven species was supported by MLST. Intraspecific variation ranged from low diversity in C. sakazakii to extensive diversity within some species, such as C. muytjensii and C. dublinensis, including evidence of gene conversion between species. The predominant species from clinical sources was found to be C. sakazakii. C. sakazakii sequence type 4 (ST4) was the predominant sequence type of cerebral spinal fluid isolates from cases of meningitis. PMID:22785185
de la Estrella, Manuel; Forest, Félix; Klitgård, Bente; Lewis, Gwilym P; Mackinder, Barbara A; de Queiroz, Luciano P; Wieringa, Jan J; Bruneau, Anne
2018-05-02
Detarioideae (81 genera, c. 760 species) is one of the six Leguminosae subfamilies recently reinstated by the Legume Phylogeny Working Group. This subfamily displays high morphological variability and is one of the early branching clades in the evolution of legumes. Using previously published and newly generated sequences from four loci (matK-trnK, rpL16, trnG-trnG2G and ITS), we develop a new densely sampled phylogeny to assess generic relationships and tribal delimitations within Detarioideae. The ITS phylogenetic trees are poorly resolved, but the plastid data recover several strongly supported clades, which also are supported in a concatenated plastid + ITS sequence analysis. We propose a new phylogeny-based tribal classification for Detarioideae that includes six tribes: re-circumscribed Detarieae and Amherstieae, and the four new tribes Afzelieae, Barnebydendreae, Saraceae and Schotieae. An identification key and descriptions for each of the tribes are also provided.
Kang, Hahk-Soo
2017-02-01
Genomics-based methods are now commonplace in natural products research. A phylogeny-guided mining approach provides a means to quickly screen a large number of microbial genomes or metagenomes in search of new biosynthetic gene clusters of interest. In this approach, biosynthetic genes serve as molecular markers, and phylogenetic trees built with known and unknown marker gene sequences are used to quickly prioritize biosynthetic gene clusters for characterization of their metabolites. Use of this approach has increased over the last couple of years along with the emergence of low-cost sequencing technologies. The aim of this review is to discuss the basic concept of a phylogeny-guided mining approach, and also to provide examples in which this approach was successfully applied to discover new natural products from microbial genomes and metagenomes. I believe that the phylogeny-guided mining approach will continue to play an important role in genomics-based natural products research.
Barony, Gustavo M; Tavares, Guilherme C; Pereira, Felipe L; Carvalho, Alex F; Dorella, Fernanda A; Leal, Carlos A G; Figueiredo, Henrique C P
2017-10-19
Streptococcus agalactiae is a major pathogen and a hindrance to tilapia farming worldwide. The aims of this work were to analyze the genomic evolution of Brazilian strains of S. agalactiae and to establish spatial and temporal relations between strains isolated from different outbreaks of streptococcosis. A total of 39 strains were obtained from outbreaks and their whole genomes were sequenced and annotated for comparative analysis of multilocus sequence typing, genomic similarity and whole genome multilocus sequence typing (wgMLST). The Brazilian strains presented two sequence types, including a newly described ST, and a non-typeable lineage. The use of wgMLST could differentiate each strain as a single clone and was used to establish temporal and geographical correlations among strains. Bayesian phylogenomic analysis suggests that the studied Brazilian population was co-introduced in the country with their host, approximately 60 years ago. Brazilian strains of S. agalactiae were shown to be heterogeneous in their genome sequences and were distributed in different regions of the country according to their genotype, which allowed the use of wgMLST analysis to track each outbreak event individually.
Hall, Miquette; Chattaway, Marie A.; Reuter, Sandra; Savin, Cyril; Strauch, Eckhard; Carniel, Elisabeth; Connor, Thomas; Van Damme, Inge; Rajakaruna, Lakshani; Rajendram, Dunstan; Jenkins, Claire; Thomson, Nicholas R.
2014-01-01
The genus Yersinia is a large and diverse bacterial genus consisting of human-pathogenic species, a fish-pathogenic species, and a large number of environmental species. Recently, the phylogenetic and population structure of the entire genus was elucidated through the genome sequence data of 241 strains encompassing every known species in the genus. Here we report the mining of this enormous data set to create a multilocus sequence typing-based scheme that can identify Yersinia strains to the species level to a level of resolution equal to that for whole-genome sequencing. Our assay is designed to be able to accurately subtype the important human-pathogenic species Yersinia enterocolitica to whole-genome resolution levels. We also report the validation of the scheme on 386 strains from reference laboratory collections across Europe. We propose that the scheme is an important molecular typing system to allow accurate and reproducible identification of Yersinia isolates to the species level, a process often inconsistent in nonspecialist laboratories. Additionally, our assay is the most phylogenetically informative typing scheme available for Y. enterocolitica. PMID:25339391
Bastien, Olivier; Ortet, Philippe; Roy, Sylvaine; Maréchal, Eric
2005-03-10
Popular methods to reconstruct molecular phylogenies are based on multiple sequence alignments, in which addition or removal of data may change the resulting tree topology. We have sought a representation of homologous proteins that would conserve the information of pair-wise sequence alignments, respect probabilistic properties of Z-scores (Monte Carlo methods applied to pair-wise comparisons) and be the basis for a novel method of consistent and stable phylogenetic reconstruction. We have built up a spatial representation of protein sequences using concepts from particle physics (configuration space) and respecting a frame of constraints deduced from pair-wise alignment score properties in information theory. The obtained configuration space of homologous proteins (CSHP) allows the representation of real and shuffled sequences, and thereupon an expression of the TULIP theorem for Z-score probabilities. Based on the CSHP, we propose a phylogeny reconstruction using Z-scores. Deduced trees, called TULIP trees, are consistent with multiple-alignment based trees. Furthermore, the TULIP tree reconstruction method provides a solution for some previously reported incongruent results, such as the apicomplexan enolase phylogeny. The CSHP is a unified model that conserves mutual information between proteins in the way physical models conserve energy. Applications include the reconstruction of evolutionary consistent and robust trees, the topology of which is based on a spatial representation that is not reordered after addition or removal of sequences. The CSHP and its assigned phylogenetic topology, provide a powerful and easily updated representation for massive pair-wise genome comparisons based on Z-score computations.
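The Z-score idea mentioned in this abstract (a Monte Carlo comparison of a real pair-wise alignment score against scores obtained after shuffling one sequence) can be illustrated with a minimal Python sketch. This is not the authors' CSHP or TULIP implementation: the simple local-alignment scoring scheme (match 2, mismatch -1, linear gap -2), the number of shuffles, and the function names `local_score` and `z_score` are all assumptions chosen for illustration.

```python
import random
import statistics


def local_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best Smith-Waterman-style local alignment score (linear gap penalty)."""
    prev = [0] * (len(b) + 1)
    best = 0
    for ca in a:
        cur = [0]
        for j, cb in enumerate(b, start=1):
            diag = prev[j - 1] + (match if ca == cb else mismatch)
            val = max(0, diag, prev[j] + gap, cur[j - 1] + gap)
            cur.append(val)
            best = max(best, val)
        prev = cur
    return best


def z_score(a, b, n_shuffles=200, seed=0):
    """Z-score of the real score against scores for shuffled copies of b."""
    rng = random.Random(seed)
    real = local_score(a, b)
    chars = list(b)
    null_scores = []
    for _ in range(n_shuffles):
        rng.shuffle(chars)  # preserves composition, destroys order
        null_scores.append(local_score(a, "".join(chars)))
    sd = statistics.pstdev(null_scores)
    if sd == 0:
        raise ValueError("shuffled scores have zero variance")
    return (real - statistics.mean(null_scores)) / sd


# Hypothetical example sequence, compared with itself.
protein = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"
z_self = z_score(protein, protein)
```

A large positive Z-score indicates that the real alignment scores far better than alignments to composition-matched random sequences; in practice a substitution matrix and affine gap penalties would replace the toy scoring used here.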
Pyvolve: A Flexible Python Module for Simulating Sequences along Phylogenies.
Spielman, Stephanie J; Wilke, Claus O
2015-01-01
We introduce Pyvolve, a flexible Python module for simulating genetic data along a phylogeny using continuous-time Markov models of sequence evolution. Easily incorporated into Python bioinformatics pipelines, Pyvolve can simulate sequences according to most standard models of nucleotide, amino-acid, and codon sequence evolution. All model parameters are fully customizable. Users can additionally specify custom evolutionary models, with custom rate matrices and/or states to evolve. This flexibility makes Pyvolve a convenient framework not only for simulating sequences under a wide variety of conditions, but also for developing and testing new evolutionary models. Pyvolve is an open-source project under a FreeBSD license, and it is available for download, along with a detailed user-manual and example scripts, from http://github.com/sjspielman/pyvolve.
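To make the underlying idea concrete, the following is a minimal, standard-library Python sketch of simulating nucleotide sequences along a small tree with a continuous-time Markov model. It does not use Pyvolve or reproduce its API; the Jukes-Cantor (JC69) model, the tuple-based tree layout `(name, branch_length, children)`, and the names `jc69_evolve` and `simulate` are assumptions made for illustration only.

```python
import math
import random

NUCS = "ACGT"


def jc69_evolve(seq, branch_length, rng):
    """Evolve seq along one branch under JC69.

    branch_length is in expected substitutions per site.
    """
    # Probability that a site ends in a state different from its start state.
    p_change = 0.75 * (1.0 - math.exp(-4.0 * branch_length / 3.0))
    out = []
    for base in seq:
        if rng.random() < p_change:
            # Under JC69 the three alternative bases are equally likely.
            out.append(rng.choice([n for n in NUCS if n != base]))
        else:
            out.append(base)
    return "".join(out)


def simulate(node, parent_seq, rng, tips):
    """Recursively evolve sequences down the tree; collect tip sequences."""
    name, length, children = node
    seq = jc69_evolve(parent_seq, length, rng)
    if not children:
        tips[name] = seq
    for child in children:
        simulate(child, seq, rng, tips)
    return tips


rng = random.Random(42)
root_seq = "".join(rng.choice(NUCS) for _ in range(60))
# Hypothetical three-taxon tree: (A:0.1,(B:0.2,C:0.2):0.05)
tree = ("root", 0.0, [("A", 0.1, []),
                      ("inner", 0.05, [("B", 0.2, []), ("C", 0.2, [])])])
tips = simulate(tree, root_seq, rng, {})
```

The same recursion extends to other substitution models by replacing `jc69_evolve` with a function that draws the end state from the appropriate transition probability matrix for the branch length.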
Li, Zhirong; Liu, Xiaolei; Zhao, Jianhong; Xu, Kaiyue; Tian, Tiantian; Yang, Jing; Qiang, Cuixin; Shi, Dongyan; Wei, Honglian; Sun, Suju; Cui, Qingqing; Li, Ruxin; Niu, Yanan; Huang, Bixing
2018-04-01
Clostridium difficile is the causative pathogen for antibiotic-related nosocomial diarrhea. For epidemiological study and identification of virulent clones, a new binary typing method was developed for C. difficile in this study. The usefulness of this newly developed optimized 10-loci binary typing method was compared with two widely used methods ribotyping and multilocus sequence typing (MLST) in 189 C. difficile samples. The binary typing, ribotyping and MLST typed the samples into 53 binary types (BTs), 26 ribotypes (RTs), and 33 MLST sequence types (STs), respectively. The typing ability of the binary method was better than that of either ribotyping or MLST expressed in Simpson Index (SI) at 0.937, 0.892 and 0.859, respectively. The ease of testing, portability and cost-effectiveness of the new binary typing would make it a useful typing alternative for outbreak investigations within healthcare facilities and epidemiological research. Copyright © 2018 Elsevier B.V. All rights reserved.
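The discriminatory power comparison above rests on Simpson's index of diversity. As a minimal sketch, the Python function below computes the index in the form commonly used for typing methods, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where n_j is the number of isolates of type j and N is the total. The abstract does not state which variant of the formula the authors used, so this form is an assumption, and the type labels in the example are hypothetical.

```python
from collections import Counter


def simpson_diversity(type_labels):
    """Simpson's index of diversity for a list of per-isolate type labels."""
    counts = Counter(type_labels)
    n = sum(counts.values())
    if n < 2:
        raise ValueError("need at least two isolates")
    # Number of ordered isolate pairs that share a type.
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Hypothetical assignments of six isolates to three binary types.
labels = ["BT1", "BT1", "BT1", "BT2", "BT2", "BT3"]
index = simpson_diversity(labels)
```

The index is the probability that two isolates drawn at random without replacement belong to different types, so values closer to 1 indicate a more discriminatory typing method.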
Multilocus sequence type profiles of Bacillus cereus isolates from infant formula in China.
Yang, Yong; Yu, Xiaofeng; Zhan, Li; Chen, Jiancai; Zhang, Yunyi; Zhang, Junyan; Chen, Honghu; Zhang, Zheng; Zhang, Yanjun; Lu, Yiyu; Mei, Lingling
2017-04-01
Bacillus cereus sensu stricto is an opportunistic foodborne pathogen. The multilocus sequence types (MLST) of 74 B. cereus isolates from 513 non-random infant formula samples in China were analyzed. Of 64 sequence types (STs) detected, 50 STs and 6 alleles were new to the PubMLST database. All isolates except for one singleton (ST-1049) were classified into 7 clonal complexes (CC) by BURST (n-4), in which CC1, with core ancestral clone ST-26, was the largest group, including 86% of isolates; CC2, 3, 9, 10 and 13 were reported in China for the first time. MLST profiles of the isolates from 8 infant formula brands were compared. It was found that the brands might potentially be tracked by the variety of STs, such as the singleton ST-1049 and ST-1062 from an isolate of goat milk origin, though they could not be easily tracked by the clonal complex types of the isolates alone. Copyright © 2016 Elsevier Ltd. All rights reserved.
Multilocus Sequence Types of Campylobacter jejuni Isolates from Different Sources in Eastern China.
Zhang, Gong; Zhang, Xiaoyan; Hu, Yuanqing; Jiao, Xin-An; Huang, Jinlin
2015-09-01
Campylobacter jejuni is a major food-borne pathogen that causes human gastroenteritis in many developed countries. In our study, we applied multilocus sequence typing (MLST) technology to 167 C. jejuni isolates from diverse sources in Eastern China to examine their genetic diversity. MLST defined 94 sequence types (STs) belonging to 18 clonal complexes (CCs). Forty-five STs from 60 isolates (36%) and 22 alleles have not been previously documented in an international database. One hundred and two isolates, accounting for 61.1% of all isolates, belonged to eight clonal complexes. The eight major CCs were also the most common complexes from different sources. The most common ST among isolates from humans and food was ST-353. The dominant ST in chicken and foods was ST-354. Among 21 STs that contained isolates from two or more different sources, 15 STs contained human isolates together with isolates from other sources, suggesting that potentially pathogenic strains are not restricted to specific lineages.
Pérez-Escobar, Oscar Alejandro; Balbuena, Juan Antonio; Gottschling, Marc
2016-01-01
Phylogenetic relationships inferred from multilocus organellar and nuclear DNA data are often difficult to resolve because of evolutionary conflicts among gene trees. However, conflicting or "outlier" associations (i.e., linked pairs of "operational terminal units" in two phylogenies) among these data sets often provide valuable information on evolutionary processes such as chloroplast capture following hybridization, incomplete lineage sorting, and horizontal gene transfer. Statistical tools that to date have been used in cophylogenetic studies only also have the potential to test for the degree of topological congruence between organellar and nuclear data sets and reliably detect outlier associations. Two distance-based methods, namely ParaFit and Procrustean Approach to Cophylogeny (PACo), were used in conjunction to detect those outliers contributing to conflicting phylogenies independently derived from chloroplast and nuclear sequence data. We explored their efficiency of retrieving outlier associations, and the impact of input data (unit branch length and additive trees) between data sets, by using several simulation approaches. To test their performance using real data sets, we additionally inferred the phylogenetic relationships within Neotropical Catasetinae (Epidendroideae, Orchidaceae), which is a suitable group to investigate phylogenetic incongruence because of hybridization processes between some of its constituent species. A comparison between trees derived from chloroplast and nuclear sequence data reflected strong, well-supported incongruence within Catasetum, Cycnoches, and Mormodes. As a result, outliers among chloroplast and nuclear data sets, and in experimental simulations, were successfully detected by PACo when using patristic distance matrices obtained from phylograms, but not from unit branch length trees. The performance of ParaFit was overall inferior compared to PACo, using either phylograms or unit branch lengths as input data. 
Because workflows for applying cophylogenetic analyses are not standardized yet, we provide a pipeline for executing PACo and ParaFit as well as displaying outlier associations in plots and trees by using the software R. The pipeline renders a method to identify outliers with high reliability and to assess the combinability of the independently derived data sets by means of statistical analyses. © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Relationship of Individual and Group Change: Ontogeny and Phylogeny in Biology.
ERIC Educational Resources Information Center
Gould, Steven Jay
1984-01-01
Considers the issue of parallels between ontogeny and phylogeny from an historical perspective. Discusses such parallels in relationship to two ontogenetic principles concerning recapitulation and sequence of stages. Differentiates between Piaget's use of the idea of recapitulation and Haeckel's biogenetic law. (Author/RH)
Multilocus sequence typing of total-genome-sequenced bacteria.
Larsen, Mette V; Cosentino, Salvatore; Rasmussen, Simon; Friis, Carsten; Hasman, Henrik; Marvig, Rasmus Lykke; Jelsbak, Lars; Sicheritz-Pontén, Thomas; Ussery, David W; Aarestrup, Frank M; Lund, Ole
2012-04-01
Accurate strain identification is essential for anyone working with bacteria. For many species, multilocus sequence typing (MLST) is considered the "gold standard" of typing, but it is traditionally performed in an expensive and time-consuming manner. As the costs of whole-genome sequencing (WGS) continue to decline, it becomes increasingly available to scientists and routine diagnostic laboratories. Currently, the cost is below that of traditional MLST. The new challenges will be how to extract the relevant information from the large amount of data so as to allow for comparison over time and between laboratories. Ideally, this information should also allow for comparison to historical data. We developed a Web-based method for MLST of 66 bacterial species based on WGS data. As input, the method uses short sequence reads from four sequencing platforms or preassembled genomes. Updates from the MLST databases are downloaded monthly, and the best-matching MLST alleles of the specified MLST scheme are found using a BLAST-based ranking method. The sequence type is then determined by the combination of alleles identified. The method was tested on preassembled genomes from 336 isolates covering 56 MLST schemes, on short sequence reads from 387 isolates covering 10 schemes, and on a small test set of short sequence reads from 29 isolates for which the sequence type had been determined by traditional methods. The method presented here enables investigators to determine the sequence types of their isolates on the basis of WGS data. This method is publicly available at www.cbs.dtu.dk/services/MLST.
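The final step described above, determining the sequence type from the combination of alleles identified, amounts to a profile lookup. The Python sketch below illustrates it under stated assumptions: the three-locus scheme, the profile table, the hit tuples, and the tie-breaking rule in `best_allele` (highest identity, then longest alignment) are all hypothetical and are not taken from the published Web service or any real MLST database.

```python
# Hypothetical scheme: locus names and allele-profile table are illustrative.
LOCI = ("locusA", "locusB", "locusC")
PROFILES = {
    (1, 1, 1): 1,
    (1, 2, 1): 2,
    (3, 2, 4): 7,
}


def best_allele(hits):
    """Pick the best-matching allele from (allele_id, identity, length) hits.

    Ranking by identity and then alignment length is an assumed rule.
    """
    if not hits:
        return None
    return max(hits, key=lambda h: (h[1], h[2]))[0]


def sequence_type(alleles, loci=LOCI, profiles=PROFILES):
    """Return the ST for a complete allele profile, or None if unknown."""
    if any(alleles.get(locus) is None for locus in loci):
        return None  # at least one locus could not be typed
    return profiles.get(tuple(alleles[locus] for locus in loci))


# Hypothetical similarity-search hits per locus for one isolate.
hits_per_locus = {
    "locusA": [(1, 98.7, 450), (3, 100.0, 450)],
    "locusB": [(2, 100.0, 399)],
    "locusC": [(4, 100.0, 477), (1, 99.4, 477)],
}
called = {locus: best_allele(h) for locus, h in hits_per_locus.items()}
st = sequence_type(called)
```

A profile that is complete but absent from the table returns `None`, which in a real workflow would flag a potentially novel sequence type for submission to the scheme's curators.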
Kelly, S; Wickstead, B; Gull, K
2011-04-07
We have developed a machine-learning approach to identify 3537 discrete orthologue protein sequence groups distributed across all available archaeal genomes. We show that treating these orthologue groups as binary detection/non-detection data is sufficient to capture the majority of archaeal phylogeny. We subsequently use the sequence data from these groups to infer a method and substitution-model-independent phylogeny. By holding this phylogeny constrained and interrogating the intersection of this large dataset with both the Eukarya and the Bacteria using Bayesian and maximum-likelihood approaches, we propose and provide evidence for a methanogenic origin of the Archaea. By the same criteria, we also provide evidence in support of an origin for Eukarya either within or as sisters to the Thaumarchaea.
USDA-ARS's Scientific Manuscript database
In phylogenetic analyses of the genus Streptomyces using 16S rRNA gene sequences, Streptomyces albus subsp. albus NRRL B-1811T forms a cluster with 5 other species having identical or nearly identical 16S rRNA gene sequences. Moreover, the morphological and physiological characteristics of these oth...
Rapid Multi-Locus Sequence Typing Using Microfluidic Biochips
2010-05-12
Sequence Types. The evolutionary history of all the B. cereus MLST concatenated Sequence Types (545 taxa, 2,394 nucleotide positions) was inferred using...the Neighbor-Joining method [28]. The bootstrap consensus tree inferred from 100 replicates was taken to represent the evolutionary history of the... Chlamydia (manuscript in preparation) and performed pilot studies on Staphylococcus aureus and Streptococcus pneumoniae (Data S4 and Text S2). Another potential
Prolonged and mixed non-O157 Escherichia coli infection in an Australian household.
Staples, M; Graham, R M A; Doyle, C J; Smith, H V; Jennison, A V
2012-05-01
An Australian family was identified through a Public Health follow-up on a Shiga-toxigenic Escherichia coli (STEC) positive bloody diarrhoea case, with three of the four family members experiencing either symptomatic or asymptomatic STEC shedding. Bacterial isolates were submitted to stx sequence sub-typing, multi-locus variable number tandem repeat analysis (MLVA), multi-locus sequence typing (MLST) and binary typing. The analysis revealed that there were multiple strains of STEC being shed by the family members, with similar virulence gene profiles and the same serogroup but differing in their MLVA and MLST profiles. This study illustrates the potentially complicated nature of non-O157 STEC infections and the importance of molecular epidemiology in understanding disease clusters. © 2012 QUEENSLAND HEALTH. Clinical Microbiology and Infection © 2012 European Society of Clinical Microbiology and Infectious Diseases.
Phylogeny of anaerobic fungi (phylum Neocallimastigomycota), with contributions from yak in China.
Wang, Xuewei; Liu, Xingzhong; Groenewald, Johannes Z
2017-01-01
The phylum Neocallimastigomycota contains eight genera (about 20 species) of strictly anaerobic fungi. The evolutionary relationships of these genera are uncertain due to insufficient sequence data to infer their phylogenies. Based on morphology and molecular phylogeny, thirteen isolates obtained from yak faeces and rumen digesta in China were assigned to Neocallimastix frontalis (nine isolates), Orpinomyces joyonii (two isolates) and Caecomyces sp. (two isolates), respectively. The phylogenetic relationships of the eight genera were evaluated using complete ITS and partial LSU sequences, compared to the ITS1 region which has been widely used in this phylum in the past. Five monophyletic lineages corresponding to six of the eight genera were statistically supported. Isolates of Caecomyces and Cyllamyces were present in a single lineage and could not be separated properly. Members of Neocallimastigomycota with uniflagellate zoospores represented by Piromyces were polyphyletic. The Piromyces-like genus Oontomyces was consistently closely related to the traditional Anaeromyces, and separated the latter genus into two clades. The phylogenetic position of the Piromyces-like genus Buwchfawromyces remained unresolved. Orpinomyces and Neocallimastix, sharing polyflagellate zoospores, were supported as sister genera in the LSU phylogeny. Apparently ITS, specifically ITS1 alone, is not a good marker to resolve the generic affinities of the studied fungi. The LSU sequences are easier to align and appear to work well to resolve generic relationships. This study provides a comparative phylogenetic revision of Neocallimastigomycota isolates known from culture and sequence data.
Phylogeny and biogeography of North-American wild rice (Zizania L.Poaceae)
USDA-ARS's Scientific Manuscript database
The wild-rice genus Zizania includes four species disjunctly distributed in eastern Asia and North America, with three species (Z. aquatica, Z. palustris, and Z. texana) in North America and one (Z. latifolia) in eastern Asia. The phylogeny and biogeography of Zizania were explored using sequences o...
Phylogeny of Cirsium spp. in North America: host specificity does not follow phylogeny
USDA-ARS's Scientific Manuscript database
Weedy invasive Cirsium spp. are widespread in temperate regions of North America and some of their biological control agents have attacked native Cirsium spp. A phylogenetic tree was developed from DNA sequences for the internal transcribed spacer and external transcribed spacer regions from native ...
USDA-ARS's Scientific Manuscript database
The increase in the consumption of fresh produce in the United States has correlated with a rise in the number of reported foodborne illnesses. To identify potential risk factors associated with post-harvest practices, the present study employed multilocus sequence typing (MLST) for the genotypic c...
USDA-ARS's Scientific Manuscript database
Previous phylogenetic analyses of species of Streptomyces based on 16S rRNA gene sequences resulted in a statistically well-supported clade (100% bootstrap value) containing 8 species that exhibited very similar gross morphology in producing open looped (Retinaculum-Apertum) to spiral (Spira) chains...
Hughes, L A; Wigley, P; Bennett, M; Chantrey, J; Williams, N
2010-10-01
Recent studies have suggested that Salmonella Typhimurium strains associated with mortality in UK garden birds are significantly different from strains that cause disease in humans and livestock and that wild bird strains may be host adapted. However, without further genomic characterization of these strains, it is not possible to determine whether they are host adapted. The aim of this study was to characterize a representative sample of Salm. Typhimurium strains detected in wild garden birds using multi-locus sequence typing (MLST) to investigate evolutionary relationships between them. Multi-locus sequence typing was performed on nine Salm. Typhimurium strains isolated from wild garden birds. Two sequence types were identified, the most common of which was ST568. Examination of the public Salmonella enterica MLST database revealed that only three other ST568 isolates had been cultured from a human in Scotland. Two further isolates of Salm. Typhimurium were determined to be ST19. Results of MLST analysis suggest that there is a predominant strain of Salm. Typhimurium circulating among garden bird populations in the United Kingdom, which is rarely detected in other species, supporting the hypothesis that this strain is host adapted. Host-pathogen evolution is often assumed to lead to pathogens becoming less virulent to avoid the death of their host; however, infection with ST568 led to high mortality rates among the wild birds examined, which were all found dead at wild bird-feeding stations. We hypothesize that by attracting unnaturally high densities of birds, wild bird-feeding stations may facilitate the transmission of ST568 between wild birds, therefore reducing the evolutionary cost of this pathogen killing its host, resulting in a host-adapted strain with increased virulence.
Dall'Agnol, Rebeca Fuzinatto; Bournaud, Caroline; de Faria, Sérgio Miana; Béna, Gilles; Moulin, Lionel; Hungria, Mariangela
2017-04-01
Some species of the genus Paraburkholderia that are able to nodulate and fix nitrogen in symbiosis with legumes are called β-rhizobia and represent a group of ecological and biotechnological importance. We used Mimosa pudica and Phaseolus vulgaris to trap 427 rhizobial isolates from rhizospheric soil of Mimoseae trees in the Brazilian Atlantic Forest. Eighty-four representative strains were selected according to the 16S rRNA haplotypes and taxonomically characterized using a concatenated 16S rRNA-recA phylogeny. Most strains were assembled in the genus Paraburkholderia, including Paraburkholderia sabiae and Pa. nodosa. Mesorhizobium (α-rhizobia) and Cupriavidus (β-rhizobia) were also isolated, but in smaller proportions. Multilocus sequence analysis and BOX-PCR analyses indicated that six clusters of Paraburkholderia represent potential new species. In the phylogenetic analysis of the nodC gene, the majority of the strains were positioned in the same groups as in the 16S rRNA-recA tree, indicative of stability and vertical inheritance, but we also identified horizontal transfer of nodC in Pa. sabiae. All α- and β-rhizobial species were trapped by both legumes, although preferences of the host plants for specific rhizobial species have been observed. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Three ambitious (and rather unorthodox) assignments for the field of biodiversity genetics
Avise, John C.
2008-01-01
The field of molecular genetics has many roles in biodiversity assessment and conservation. I summarize three of those standard roles and propose logical extensions of each. First, many biologists suppose that a comprehensive picture of the Tree of Life will soon emerge from multilocus DNA sequence data interpreted in concert with fossils and other evidence. If nonreticulate trees are indeed valid metaphors for life's history, then a well dated global phylogeny will offer an opportunity to erect a universally standardized scheme of biological classification. If life's history proves to be somewhat reticulate, a web-like phylogenetic pattern should become evident and will offer opportunities to reevaluate the fundamental nature of evolutionary processes. Second, extensive networks of wildlife sanctuaries offer some hope for shepherding appreciable biodiversity through the ongoing extinction crisis, and molecular genetics can assist in park design by helping to identify key species, historically important biotic areas, and biodiversity hotspots. An opportunity centers on the concept of Pleistocene Parks that could protect “legacy biotas” in much the same way that traditional national parks preserve special geological features and historical landmarks honor legacy events in human affairs. Third, genetic perspectives have become an integral part of many focused conservation efforts by unveiling ecological, behavioral, or evolutionary phenomena relevant to population management. They also can open opportunities to educate the public about the many intellectual gifts and aesthetic marvels of the natural world. PMID:18695224
Stephan, Roger; Grim, Christopher J; Gopinath, Gopal R; Mammel, Mark K; Sathyamoorthy, Venugopal; Trach, Larisa H; Chase, Hannah R; Fanning, Séamus; Tall, Ben D
2014-10-01
Recently, a taxonomical re-evaluation of the genus Enterobacter, based on multi-locus sequence typing (MLST) analysis, has led to the proposal that the species Enterobacter pulveris, Enterobacter helveticus and Enterobacter turicensis should be reclassified as novel species of the genus Cronobacter. In the present work, new genome-scale analyses, including average nucleotide identity, genome-scale phylogeny and k-mer analysis, coupled with previously reported DNA-DNA hybridization values and biochemical characterization strongly indicate that these three species of the genus Enterobacter are not members of the genus Cronobacter, nor do they belong to the re-evaluated genus Enterobacter. Furthermore, data from this polyphasic study indicated that all three species constitute two new genera. We propose reclassifying Enterobacter pulveris and Enterobacter helveticus in the genus Franconibacter gen. nov. as Franconibacter pulveris comb. nov. (type strain 601/05(T) = LMG 24057(T) = DSM 19144(T)) and Franconibacter helveticus comb. nov. (type strain 513/05(T) = LMG 23732(T) = DSM 18396(T)), respectively, and Enterobacter turicensis in the genus Siccibacter gen. nov. as Siccibacter turicensis comb. nov. (type strain 508/05(T) = LMG 23730(T) = DSM 18397(T)).
Grim, Christopher J.; Gopinath, Gopal R.; Mammel, Mark K.; Sathyamoorthy, Venugopal; Trach, Larisa H.; Chase, Hannah R.; Fanning, Séamus; Tall, Ben D.
2014-01-01
Recently, a taxonomical re-evaluation of the genus Enterobacter, based on multi-locus sequence typing (MLST) analysis, has led to the proposal that the species Enterobacter pulveris, Enterobacter helveticus and Enterobacter turicensis should be reclassified as novel species of the genus Cronobacter. In the present work, new genome-scale analyses, including average nucleotide identity, genome-scale phylogeny and k-mer analysis, coupled with previously reported DNA–DNA hybridization values and biochemical characterization strongly indicate that these three species of the genus Enterobacter are not members of the genus Cronobacter, nor do they belong to the re-evaluated genus Enterobacter. Furthermore, data from this polyphasic study indicated that all three species constitute two new genera. We propose reclassifying Enterobacter pulveris and Enterobacter helveticus in the genus Franconibacter gen. nov. as Franconibacter pulveris comb. nov. (type strain 601/05T = LMG 24057T = DSM 19144T) and Franconibacter helveticus comb. nov. (type strain 513/05T = LMG 23732T = DSM 18396T), respectively, and Enterobacter turicensis in the genus Siccibacter gen. nov. as Siccibacter turicensis comb. nov. (type strain 508/05T = LMG 23730T = DSM 18397T). PMID:25028159
Genomewide Function Conservation and Phylogeny in the Herpesviridae
Albà, M. Mar; Das, Rhiju; Orengo, Christine A.; Kellam, Paul
2001-01-01
The Herpesviridae are a large group of well-characterized double-stranded DNA viruses for which many complete genome sequences have been determined. We have extracted protein sequences from all predicted open reading frames of 19 herpesvirus genomes. Sequence comparison and protein sequence clustering methods have been used to construct herpesvirus protein homologous families. This resulted in 1692 proteins being clustered into 243 multiprotein families and 196 singleton proteins. Predicted functions were assigned to each homologous family based on genome annotation and published data and each family classified into seven broad functional groups. Phylogenetic profiles were constructed for each herpesvirus from the homologous protein families and used to determine conserved functions and genomewide phylogenetic trees. These trees agreed with molecular-sequence-derived trees and allowed greater insight into the phylogeny of ungulate and murine gammaherpesviruses. PMID:11156614
Phylogeny of mycoplasmalike organisms (phytoplasmas): a basis for their classification.
Gundersen, D E; Lee, I M; Rehner, S A; Davis, R E; Kingsbury, D T
1994-01-01
A global phylogenetic analysis using parsimony of 16S rRNA gene sequences from 46 mollicutes, 19 mycoplasmalike organisms (MLOs) (new trivial name, phytoplasmas), and several related bacteria placed the MLOs definitively among the members of the class Mollicutes and revealed that MLOs form a large discrete monophyletic clade, paraphyletic to the Acholeplasma species, within the Anaeroplasma clade. Within the MLO clade resolved in the global mollicutes phylogeny and a comprehensive MLO phylogeny derived by parsimony analyses of 16S rRNA gene sequences from 30 diverse MLOs representative of nearly all known distinct MLO groups, five major phylogenetic groups with a total of 11 distinct subclades (monophyletic groups or taxa) could be recognized. These MLO subclades (roman numerals) and designated type strains were as follows: i, Maryland aster yellows AY1; ii, apple proliferation AP-A; iii, peanut witches'-broom PnWB; iv, Canada peach X CX; v, rice yellow dwarf RYD; vi, pigeon pea witches'-broom PPWB; vii, palm lethal yellowing LY; viii, ash yellows AshY; ix, clover proliferation CP; x, elm yellows EY; and xi, loofah witches'-broom LfWB. The designations of subclades and their phylogenetic positions within the MLO clade were supported by a congruent phylogeny derived by parsimony analyses of ribosomal protein L22 gene sequences from most representative MLOs. On the basis of the phylogenies inferred in the present study, we propose that MLOs should be represented taxonomically at the minimal level of genus and that each phylogenetically distinct MLO subclade identified should represent at least a distinct species under this new genus. PMID:8071198
Huang, Jie; Chen, Zigui; Song, Weibo; Berger, Helmut
2014-01-01
Classifications of the Urostyloidea were mainly based on morphology and morphogenesis. Because molecular phylogenies have largely relied on limited sampling and mostly single-gene information, incongruences between morphological data and gene sequences have arisen. In this work, three-gene data (SSU-rDNA, ITS1-5.8S-ITS2 and LSU-rDNA) from 12 genera in the “core urostyloids” were sequenced, and the phylogenies based on these different markers are compared using maximum-likelihood and Bayesian algorithms and tested by unconstrained and constrained analyses. The molecular phylogeny supports the following conclusions: (1) the monophyly of the core group of Urostyloidea is well supported while the whole Urostyloidea is not monophyletic; (2) Thigmokeronopsis and Apokeronopsis are clearly separated from the pseudokeronopsids in analyses of all three gene markers, supporting their exclusion from the Pseudokeronopsidae and the inclusion in the Urostylidae; (3) Diaxonella and Apobakuella should be assigned to the Urostylidae; (4) Bergeriella, Monocoronella and Neourostylopsis flavicana share a most recent common ancestor; (5) all molecular trees support the transfer of Metaurostylopsis flavicana to the recently proposed genus Neourostylopsis; (6) all molecular phylogenies fail to separate the morphologically well-defined genera Uroleptopsis and Pseudokeronopsis; and (7) Arcuseries gen. nov. containing three distinctly deviating Anteholosticha species is established. PMID:24140978
Host-Nonspecific Iron Acquisition Systems and Virulence in the Zoonotic Serovar of Vibrio vulnificus
Pajuelo, David; Lee, Chung-Te; Roig, Francisco J.; Lemos, Manuel L.; Hor, Lien-I
2014-01-01
The zoonotic serovar of Vibrio vulnificus (known as biotype 2 serovar E) is the etiological agent of human and fish vibriosis. The aim of the present work was to discover the role of the vulnibactin- and hemin-dependent iron acquisition systems in the pathogenicity of this zoonotic serovar under the hypothesis that both are host-nonspecific virulence factors. To this end, we selected three genes for three outer membrane receptors (vuuA, a receptor for ferric vulnibactin, and hupA and hutR, two hemin receptors), obtained single and multiple mutants as well as complemented strains, and tested them in a series of in vitro and in vivo assays, using eels and mice as animal models. The overall results confirm that hupA and vuuA, but not hutR, are host-nonspecific virulence genes and suggest that a third undescribed host-specific plasmid-encoded system could also be used by the zoonotic serovar in fish. hupA and vuuA were expressed in the internal organs of the animals in the first 24 h of infection, suggesting that they may be needed to achieve the population size required to trigger fatal septicemia. vuuA and hupA were sequenced in strains representative of the genetic diversity of this species, and their phylogenies were reconstructed by multilocus sequence analysis of selected housekeeping and virulence genes as a reference. Given the overall results, we suggest that both genes might form part of the core genes essential not only for disease development but also for the survival of this species in its natural reservoir, the aquatic environment. PMID:24478087
Rutschmann, Sereina; Detering, Harald; Simon, Sabrina; Funk, David H; Gattolliat, Jean-Luc; Hughes, Samantha J; Raposeiro, Pedro M; DeSalle, Rob; Sartori, Michel; Monaghan, Michael T
2017-02-01
The study of processes driving diversification requires a fully sampled and well resolved phylogeny, although a lack of phylogenetic markers remains a limitation for many non-model groups. Multilocus approaches to the study of recent diversification provide a powerful means to study the evolutionary process, but their application remains restricted because multiple unlinked loci with suitable variation for phylogenetic or coalescent analysis are not available for most non-model taxa. Here we identify novel, putative single-copy nuclear DNA (nDNA) phylogenetic markers to study the colonization and diversification of an aquatic insect species complex, Cloeon dipterum L. 1761 (Ephemeroptera: Baetidae), in Macaronesia. Whole-genome sequencing data from one member of the species complex were used to identify 59 nDNA loci (32,213 base pairs), followed by Sanger sequencing of 29 individuals sampled from 13 islands of three Macaronesian archipelagos. Multispecies coalescent analyses established six putative species. Three island species formed a monophyletic clade, with one species occurring on the Azores, Europe and North America. Ancestral state reconstruction indicated at least two colonization events from the mainland (to the Canaries and to the Azores, respectively) and one within the archipelago (between Madeira and the Canaries). Random subsets of the 59 loci showed a positive linear relationship between number of loci and node support. In contrast, node support in the multispecies coalescent tree was negatively correlated with mean number of phylogenetically informative sites per locus, suggesting a complex relationship between tree resolution and marker variability. Our approach highlights the value of combining genomics, coalescent-based phylogeography, species delimitation, and phylogenetic reconstruction to resolve recent diversification events in an archipelago species complex. Copyright © 2016 Elsevier Inc. All rights reserved.
Multiplex Touchdown PCR for Rapid Typing of the Opportunistic Pathogen Propionibacterium acnes
Barnard, Emma; Nagy, István; Hunyadkürti, Judit; Patrick, Sheila
2015-01-01
The opportunistic human pathogen Propionibacterium acnes is composed of a number of distinct phylogroups, designated types IA1, IA2, IB, IC, II, and III, which vary in their production of putative virulence factors, their inflammatory potential, and their biochemical, aggregative, and morphological characteristics. Although multilocus sequence typing (MLST) currently represents the gold standard for unambiguous phylogroup classification and individual strain identification, it is a labor-intensive and time-consuming technique. As a consequence, we developed a multiplex touchdown PCR assay that in a single reaction can confirm the species identity and phylogeny of an isolate based on its pattern of reaction with six primer sets that target the 16S rRNA gene (all isolates), ATPase (types IA1, IA2, and IC), sodA (types IA2 and IB), atpD (type II), and recA (type III) housekeeping genes, as well as a Fic family toxin gene (type IC). When applied to 312 P. acnes isolates previously characterized by MLST and representing types IA1 (n = 145), IA2 (n = 20), IB (n = 65), IC (n = 7), II (n = 45), and III (n = 30), the multiplex displayed 100% sensitivity and 100% specificity for detecting isolates within each targeted phylogroup. No cross-reactivity with isolates from other bacterial species was observed. This multiplex assay will provide researchers with a rapid, high-throughput, and technically undemanding typing method for epidemiological and phylogenetic investigations. It will facilitate studies investigating the association of lineages with various infections and clinical conditions, and it will serve as a prescreening tool to maximize the number of genetically diverse isolates selected for downstream higher-resolution sequence-based analyses. PMID:25631794
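The reaction patterns listed in the abstract above amount to a lookup from the set of amplified targets to a phylogroup, which can be sketched in a few lines of Python. The expected band sets below are derived only from the target-to-type assignments stated in the abstract; the short band labels (for example "Fic" for the Fic family toxin gene) and the handling of unexpected patterns are assumptions, not the authors' published interpretation rules.

```python
from typing import Dict, FrozenSet, Iterable, Optional

# Expected amplicon sets per phylogroup, derived from the abstract:
# 16S rRNA (all isolates), ATPase (IA1, IA2, IC), sodA (IA2, IB),
# atpD (II), recA (III), Fic family toxin gene (IC).
EXPECTED: Dict[FrozenSet[str], str] = {
    frozenset({"16S", "ATPase"}): "IA1",
    frozenset({"16S", "ATPase", "sodA"}): "IA2",
    frozenset({"16S", "sodA"}): "IB",
    frozenset({"16S", "ATPase", "Fic"}): "IC",
    frozenset({"16S", "atpD"}): "II",
    frozenset({"16S", "recA"}): "III",
}


def phylogroup(bands: Iterable[str]) -> Optional[str]:
    """Map the set of observed bands to a phylogroup, or None if no pattern fits."""
    observed = frozenset(bands)
    if "16S" not in observed:
        # Without the species-level band the isolate is not called at all.
        return None
    return EXPECTED.get(observed)


print(phylogroup(["16S", "ATPase", "sodA"]))
print(phylogroup(["16S", "recA"]))
print(phylogroup(["16S"]))
```

Requiring an exact match keeps the sketch strict: any pattern that is not one of the six expected sets returns None instead of a best guess.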
Kelly, S.; Wickstead, B.; Gull, K.
2011-01-01
We have developed a machine-learning approach to identify 3537 discrete orthologue protein sequence groups distributed across all available archaeal genomes. We show that treating these orthologue groups as binary detection/non-detection data is sufficient to capture the majority of archaeal phylogeny. We subsequently use the sequence data from these groups to infer a method and substitution-model-independent phylogeny. By holding this phylogeny constrained and interrogating the intersection of this large dataset with both the Eukarya and the Bacteria using Bayesian and maximum-likelihood approaches, we propose and provide evidence for a methanogenic origin of the Archaea. By the same criteria, we also provide evidence in support of an origin for Eukarya either within or as sisters to the Thaumarchaea. PMID:20880885
USDA-ARS's Scientific Manuscript database
Premise of the study: Prunus L. phylogeny has been extensively studied using cpDNA sequences. CpDNA has a slow rate of evolution, which is beneficial for determining species relationships at a deeper level. However, a limitation of chloroplast-based phylogenies is the transfer of cpDNA by interspecific hybridizat...
Singh, Prashant; Singh, Satya Shila; Elster, Josef; Mishra, Arun Kumar
2013-06-01
In order to assess phylogeny, population genetics, and approximation of the future course of cyanobacterial evolution based on nifH gene sequences, 41 heterocystous cyanobacterial strains collected from all over India have been used in the present study. NifH gene sequence analysis data confirm that the heterocystous cyanobacteria are monophyletic while the Stigonematales show a polyphyletic origin with extensive intermixing. Further, analysis of nifH gene sequence data using intricate mathematical extrapolations revealed that the nucleotide diversity and recombination frequency are much greater in the Nostocales than in the Stigonematales. Similarly, DNA divergence studies showed significant values of divergence with greater gene conversion tracts in the unbranched (Nostocales) than the branched (Stigonematales) strains. Our data strongly support the origin of true branching cyanobacterial strains from the unbranched strains.
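Nucleotide diversity, one of the quantities compared above, is commonly estimated as the average number of pairwise differences per site among aligned sequences. The abstract does not say which estimator or software the authors used, so the following is only a minimal sketch of that textbook definition; the toy sequences are invented, and gaps or ambiguous bases are simply compared as ordinary characters.

```python
from itertools import combinations
from typing import Sequence


def nucleotide_diversity(seqs: Sequence[str]) -> float:
    """Average pairwise differences per site for equal-length aligned sequences."""
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned, non-empty, and of equal length")
    pairs = list(combinations(seqs, 2))
    # Count mismatching positions over every pair of sequences.
    total_diffs = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs)
    return total_diffs / (len(pairs) * length)


# Invented toy alignment, for illustration only.
ALIGNMENT = ["AAAA", "AAAT", "AATT"]
print(nucleotide_diversity(ALIGNMENT))
```

Comparing this value between two groups of strains, as the study does for branched and unbranched forms, would mean running the function once per group on that group's aligned sequences.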
Two Atypical Cases of Kingella kingae Invasive Infection with Concomitant Human Rhinovirus Infection
Basmaci, Romain; Ilharreborde, Brice; Doit, Catherine; Presedo, Ana; Lorrot, Mathie; Alison, Marianne; Mazda, Keyvan; Bidet, Philippe
2013-01-01
We describe two atypical cases of Kingella kingae infection in children diagnosed by PCR, one case involving a soft tissue abscess and one case a femoral Brodie abscess. Both patients had concomitant human rhinovirus infection. K. kingae strains, isolated from an oropharyngeal swab, were characterized by multilocus sequence typing and rtxA sequencing. PMID:23784119
Chaloner, Gemma L.; Harrison, Timothy G.; Coyne, Karen P.; Aanensen, David M.; Birtles, Richard J.
2011-01-01
Bartonella henselae is one of the most common zoonotic agents acquired from companion animals (cats) in industrialized countries. Nonetheless, although the prevalence of infections in cats is high, the number of human cases reported is relatively low. One hypothesis for this discrepancy is that B. henselae strains vary in their zoonotic potential. To test this hypothesis, we employed structured sampling to explore the population structure of B. henselae in the United Kingdom and to determine the distribution of strains associated with zoonotic disease within this structure. A total of 118 B. henselae strains were delineated into 12 sequence types (STs) using multilocus sequence typing. We observed that most (85%) of the zoonosis-associated strains belonged to only three genotypes, i.e., ST2, ST5, and ST8. Conversely, most (74%) of the feline isolates belonged to ST4, ST6, and ST7. The difference in host association of ST2, ST5, and ST8 (zoonosis associated) and ST6 (feline) was statistically significant (P < 0.05), indicating that a few, uncommon STs were responsible for the majority of symptomatic human infections. PMID:21471345
Nunney, L; Elfekih, S; Stouthamer, R
2012-05-01
Microbial identification methods have evolved rapidly over the last few decades. One such method is multilocus sequence typing (MLST). MLST is a powerful tool for understanding the evolutionary dynamics of pathogens and to gain insight into their genetic diversity. We illustrate the importance of accurate typing by reporting on three problems that have arisen in the study of a single bacterial species, the plant pathogen Xylella fastidiosa. Two of these were particularly serious since they concerned contamination of important research material that has had detrimental consequences for Xylella research: the contamination of DNA used in the sequencing of an X. fastidiosa genome (Ann-1) with DNA from another X. fastidiosa strain, and the unrecognized mislabeling of a strain (Temecula1) distributed from a culture collection (ATCC). We advocate the routine use of MLST to define strains maintained in culture collections and emphasize the importance of confirming the purity of DNA submitted for sequencing. We also present a third example that illustrates the value of MLST in guiding the choice of taxonomic types. Beyond these situations, there is a strong case for MLST whenever an isolate is used experimentally, especially where genotypic differences are suspected to influence the outcome.
Pinho, Marcos D; Erol, Erdal; Ribeiro-Gonçalves, Bruno; Mendes, Catarina I; Carriço, João A; Matos, Sandra C; Preziuso, Silvia; Luebke-Becker, Antina; Wieler, Lothar H; Melo-Cristino, Jose; Ramirez, Mario
2016-08-17
The pathogenic role of beta-hemolytic Streptococcus dysgalactiae in the equine host is increasingly recognized. A collection of 108 Lancefield group C (n = 96) or L (n = 12) horse isolates recovered in the United States and in three European countries presented multilocus sequence typing (MLST) alleles, sequence types and emm types (only 56% of the isolates could be emm typed) that were, with few exceptions, distinct from those previously found in human Streptococcus dysgalactiae subsp. equisimilis. Characterization of a subset of horse isolates by multilocus sequence analysis (MLSA) and 16S rRNA gene sequence showed that most equine isolates could also be differentiated from S. dysgalactiae strains from other animal species, supporting the existence of a horse specific genomovar. Draft genome information confirms the distinctiveness of the horse genomovar and indicates the presence of potentially horse-specific virulence factors. While this genomovar represents most of the isolates recovered from horses, a smaller MLST and MLSA defined sub-population seems to be able to cause infections in horses, other animals and humans, indicating that transmission between hosts of strains belonging to this group may occur.
Li, Xiang; Tambong, James; Yuan, Kat (Xiaoli); Chen, Wen; Xu, Huimin; Lévesque, C. André; De Boer, Solke H.
2018-01-01
Although the genus Clavibacter was originally proposed to accommodate all phytopathogenic coryneform bacteria containing B2γ diaminobutyrate in the peptidoglycan, reclassification of all but one species into other genera has resulted in the current monospecific status of the genus. The single species in the genus, Clavibacter michiganensis, has multiple subspecies, which are all highly host-specific plant pathogens. Whole genome analysis based on average nucleotide identity and digital DNA–DNA hybridization as well as multi-locus sequence analysis (MLSA) of seven housekeeping genes support raising each of the C. michiganensis subspecies to species status. On the basis of whole genome and MLSA data, we propose the establishment of two new species and three new combinations: Clavibacter capsici sp. nov., comb. nov. and Clavibacter tessellarius sp. nov., comb. nov., and Clavibacter insidiosus comb. nov., Clavibacter nebraskensis comb. nov. and Clavibacter sepedonicus comb. nov. PMID:29160202
Vite-Garín, Tania; Estrada-Bárcenas, Daniel Alfonso; Cifuentes, Joaquín; Taylor, Maria Lucia
2014-01-01
Advances in the classification of the human pathogen Histoplasma capsulatum (H. capsulatum) (ascomycete) are sustained by the results of several genetic analyses that support the high diversity of this dimorphic fungus. The present mini-review highlights the great genetic plasticity of H. capsulatum. Important records with different molecular tools, mainly single- or multi-locus sequence analyses developed with this fungus, are discussed. Recent phylogenetic data with a multi-locus sequence analysis using 5 polymorphic loci support a new clade and/or phylogenetic species of H. capsulatum for the Americas, which was associated with fungal isolates obtained from the migratory bat Tadarida brasiliensis. This manuscript is part of the series of works presented at the "V International Workshop: Molecular genetic approaches to the study of human pathogenic fungi" (Oaxaca, Mexico, 2012). Copyright © 2013 Revista Iberoamericana de Micología. Published by Elsevier Espana. All rights reserved.
Sun, Mingjun; Jing, Zhigang; Di, Dongdong; Yan, Hao; Zhang, Zhicheng; Xu, Quangang; Zhang, Xiyue; Wang, Xun; Ni, Bo; Sun, Xiangxiang; Yan, Chengxu; Yang, Zhen; Tian, Lili; Li, Jinping; Fan, Weixing
2017-01-01
Brucellosis is a worldwide zoonotic disease caused by Brucella spp. In China, brucellosis is recognized as a reemerging disease caused mainly by Brucella melitensis. To better understand the B. melitensis strains currently endemic in China, three Brucella genotyping methods were applied to 110 B. melitensis strains obtained over the past several years. MLVA genotyping identified five MLVA-8 genotypes, among which genotype 42 (1-5-3-13-2-2-3-2) was predominant, while genotype 63 (1-5-3-13-2-3-3-2) and a novel genotype, 1-5-3-13-2-4-3-2, were the second most frequently observed. MLVA-16 discerned a total of 57 MLVA-16 genotypes among these Brucella strains, 41 of them detected for the first time and the other 16 previously reported. BruMLSA21 typing identified six sequence types (STs): ST8 was the most frequently seen in China, while the other five STs were detected for the first time and designated ST137, ST138, ST139, ST140, and ST141 by the international multilocus sequence typing database. Whole-genome sequence (WGS) single-nucleotide polymorphism (SNP)-based typing and phylogenetic analysis resolved the Chinese B. melitensis strains into five clusters, reflecting the existence of multiple lineages among these strains. Phylogenetically, the Chinese lineages are more closely related to strains collected from Eastern Mediterranean and Middle Eastern countries, such as Turkey, Kuwait, and Iraq. In the next few years, MLVA typing will certainly remain an important epidemiological tool for Brucella infection analysis, as it displays high discriminatory ability and achieves results largely in agreement with WGS-SNP-based typing. However, WGS-SNP-based typing was found to be the most powerful and reliable method for discerning Brucella strains and will be widely used in the future.
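The MLVA-8 genotypes quoted in this abstract are repeat-count profiles across eight loci, so the relationship between two genotypes can be illustrated by counting the positions at which their profiles differ. The following is a minimal Python sketch using only the three profiles given in the abstract; the function names and the "novel" label are illustrative assumptions and do not correspond to any published MLVA tool.

```python
# Illustrative only: compare the MLVA-8 repeat-count profiles quoted in the abstract.

def parse_profile(text):
    """Turn a dash-separated profile such as '1-5-3-13-2-2-3-2' into a tuple of ints."""
    return tuple(int(part) for part in text.split("-"))


def differing_loci(profile_a, profile_b):
    """Return the 1-based positions at which two equal-length profiles differ."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same number of loci")
    return [
        position
        for position, (a, b) in enumerate(zip(profile_a, profile_b), start=1)
        if a != b
    ]


# Profiles copied from the abstract; the key "novel" is a placeholder label.
PROFILES = {
    "42": parse_profile("1-5-3-13-2-2-3-2"),
    "63": parse_profile("1-5-3-13-2-3-3-2"),
    "novel": parse_profile("1-5-3-13-2-4-3-2"),
}

if __name__ == "__main__":
    for name in ("63", "novel"):
        print(name, differing_loci(PROFILES["42"], PROFILES[name]))
```

In the order the profiles are written, genotype 63 and the novel genotype each differ from genotype 42 only at the sixth position, that is, they are single-locus variants of the predominant genotype.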
Pervasiveness of UVC254-resistant Geobacillus strains in extreme environments.
Carlson, Courtney; Singh, Nitin K; Bibra, Mohit; Sani, Rajesh K; Venkateswaran, Kasthuri
2018-02-01
We have characterized a broad collection of extremophilic bacterial isolates from a deep subsurface mine, compost dumping sites, and several hot spring ecosystems. Spore-forming strains isolated from these environments comprised both obligate thermophiles/thermotolerant species (growing at > 55 °C; 240 strains) and mesophiles (growing at 15 to 40 °C; 12 strains). An overwhelming abundance of Geobacillus (81.3%) and Bacillus (18.3%) species was observed among the tested isolates. 16S rRNA sequence analysis documented the presence of 24 species among these isolates, but the 16S rRNA gene was shown to possess insufficient resolution to reliably discern Geobacillus phylogeny. gyrB-based phylogenetic analyses of nine strains revealed the presence of six known Geobacillus and one novel species. Multilocus sequence typing analyses based on seven different housekeeping genes deduced from whole genome sequencing of nine strains revealed the presence of three novel Geobacillus species. The vegetative cells of 41 Geobacillus strains were exposed to UVC254, and most (34 strains) survived 120 J/m², while seven strains survived 300 J/m², and cells of only one Geobacillus strain isolated from a compost facility survived 600 J/m². Additionally, the UVC254 inactivation kinetics of spores from four Geobacillus strains isolated from three distinct geographical regions were evaluated and compared to that of a spacecraft assembly facility (SAF) clean room Geobacillus strain. The purified spores of the thermophilic SAF strain exhibited resistance to 2000 J/m², whereas spores of two environmental Geobacillus strains showed resistance to 1000 J/m². This study is the first to investigate UV resistance of environmental, obligately thermophilic Geobacillus strains, and also lays the foundation for advanced understanding of necessary sterilization protocols practiced in food, medical, pharmaceutical, and aerospace industries.
Cangi, Nídia; Gordon, Jonathan L; Bournez, Laure; Pinarello, Valérie; Aprelon, Rosalie; Huber, Karine; Lefrançois, Thierry; Neves, Luís; Meyer, Damien F; Vachiéry, Nathalie
2016-01-01
The disease heartwater, caused by the Anaplasmataceae bacterium E. ruminantium, represents a major problem for tropical livestock and wild ruminants. Up to now, no effective vaccine has been available due to limited cross-protection of vaccinal strains against field strains and a high genetic diversity of Ehrlichia ruminantium within geographical locations. To address this issue, we inferred the genetic diversity and population structure of 194 E. ruminantium isolates circulating worldwide using multilocus sequence typing based on the lipA, lipB, secY, sodB, and sucA genes. Phylogenetic trees and networks were generated using BEAST and SplitsTree, respectively, and recombination between the different genetic groups was tested using the PHI test for recombination. Our study reveals the repeated occurrence of recombination between E. ruminantium strains, suggesting that it may occur frequently in the genome and has likely played an important role in the maintenance of genetic diversity and the evolution of E. ruminantium. Despite the unclear phylogeny and phylogeography, E. ruminantium isolates are clustered into two main groups: Group 1 (West Africa) and Group 2 (worldwide), which is represented by West, East, and Southern Africa, Indian Ocean, and Caribbean strains. Some sequence types are common between West Africa and Caribbean strains and between Southern Africa and Indian Ocean strains. These common sequence types highlight two main introduction events due to the movement of cattle: from West Africa to the Caribbean and from Southern Africa to the Indian Ocean islands. Due to the long branch lengths between Group 1 and Group 2, and the propensity for recombination between these groups, it seems that the West African clusters of Subgroup 2 arrived there more recently than the original divergence of the two groups, possibly with the original waves of domesticated ruminants that spread across the African continent several thousand years ago.
Probing genomic diversity and evolution of Escherichia coli O157 by single nucleotide polymorphisms.
Zhang, Wei; Qi, Weihong; Albert, Thomas J.; Motiwala, Alifiya S.; Alland, David; Hyytia-Trees, Eija K.; Ribot, Efrain M.; Fields, Patricia I.; Whittam, Thomas S.; Swaminathan, Bala
2006-06-01
Infections by Shiga toxin-producing Escherichia coli O157:H7 (STEC O157) are the predominant cause of bloody diarrhea and hemolytic uremic syndrome in the United States. In silico comparison of the two complete STEC O157 genomes (Sakai and EDL933) revealed a strikingly high level of sequence identity in orthologous protein-coding genes, limiting the use of nucleotide sequences to study the evolution and epidemiology of this bacterial pathogen. To systematically examine single nucleotide polymorphisms (SNPs) at a genome scale, we designed comparative genome sequencing microarrays and analyzed 1199 chromosomal genes (a total of 1,167,948 bp) and 92,721 bp of the large virulence plasmid (pO157) of eleven outbreak-associated STEC O157 strains. We discovered 906 SNPs in 523 chromosomal genes and observed a high level of DNA polymorphisms among the pO157 plasmids. Based on a uniform rate of synonymous substitution for Escherichia coli and Salmonella enterica (4.7 × 10^-9 per site per year), we estimate that the most recent common ancestor of the contemporary β-glucuronidase-negative, non-sorbitol-fermenting STEC O157 strains existed ca. 40 thousand years ago. The phylogeny of the STEC O157 strains based on the informative synonymous SNPs was compared to the maximum parsimony trees inferred from pulsed-field gel electrophoresis and multilocus variable numbers of tandem repeats analysis. The topological discrepancies indicate that, in contrast to the synonymous mutations, parts of STEC O157 genomes have evolved through different mechanisms with highly variable divergence rates. The SNP loci reported here will provide useful genetic markers for developing high-throughput methods for fine-resolution genotyping of STEC O157. Functional characterization of nucleotide polymorphisms should shed new insights on the evolution, epidemiology, and pathogenesis of STEC O157 and related pathogens. PMID:16606700
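The age estimate in this abstract rests on a molecular clock: under a uniform substitution rate, two lineages separated by a given per-site synonymous divergence shared an ancestor roughly divergence / (2 × rate) years ago. The abstract does not state the authors' exact calculation or their divergence values, so the following Python sketch is only an illustration of the arithmetic; the function name and the example divergence are hypothetical.

```python
# Illustrative molecular-clock arithmetic; not the authors' actual calculation.

SYNONYMOUS_RATE = 4.7e-9  # substitutions per site per year, as quoted in the abstract


def divergence_time_years(pairwise_divergence, rate=SYNONYMOUS_RATE):
    """Time to the common ancestor of two lineages under a strict clock.

    Both lineages accumulate substitutions independently, so the pairwise
    divergence per site is divided by twice the rate.
    """
    if rate <= 0:
        raise ValueError("rate must be positive")
    if pairwise_divergence < 0:
        raise ValueError("divergence cannot be negative")
    return pairwise_divergence / (2.0 * rate)


if __name__ == "__main__":
    hypothetical_divergence = 3.76e-4  # assumed value, NOT taken from the study
    years = divergence_time_years(hypothetical_divergence)
    print(f"{years:,.0f} years")
```

With the assumed divergence of 3.76 × 10^-4 synonymous substitutions per site, the sketch returns about 40,000 years, which shows the order of divergence that a date of that magnitude would imply at the quoted rate.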
Jones, Christopher M; Stres, Blaz; Rosenquist, Magnus; Hallin, Sara
2008-09-01
Denitrification is a facultative respiratory pathway in which nitrite (NO2-), nitric oxide (NO), and nitrous oxide (N2O) are successively reduced to nitrogen gas (N2), effectively closing the nitrogen cycle. The ability to denitrify is widely dispersed among prokaryotes, and this polyphyletic distribution has raised the possibility of horizontal gene transfer (HGT) having a substantial role in the evolution of denitrification. Comparisons of 16S rRNA and denitrification gene phylogenies in recent studies support this possibility; however, these results remain speculative as they are based on visual comparisons of phylogenies from partial sequences. We reanalyzed publicly available nirS, nirK, norB, and nosZ partial sequences using Bayesian and maximum likelihood phylogenetic inference. Concomitant analysis of denitrification genes with 16S rRNA sequences from the same organisms showed substantial differences between the trees, which were supported by examining the posterior probability of monophyletic constraints at different taxonomic levels. Although these differences suggest HGT of denitrification genes, the presence of structural variants for nirK, norB, and nosZ makes it difficult to distinguish HGT from other evolutionary events. Additional analysis using phylogenetic networks and likelihood ratio tests of phylogenies based on full-length sequences retrieved from genomes also revealed significant differences in tree topologies among denitrification and 16S rRNA gene phylogenies, with the exception of the nosZ gene phylogeny within the data set of the nirK-harboring genomes. However, inspection of codon usage and G + C content plots from complete genomes gave no evidence for recent HGT. Instead, the close proximity of denitrification gene copies in the genomes of several denitrifying bacteria suggests duplication. Although HGT cannot be ruled out as a factor in the evolution of denitrification genes, our analysis suggests that other phenomena, such as gene duplication/divergence and lineage sorting, may have differently influenced the evolution of each denitrification gene.
Streptococcus mutans clonal variation revealed by multilocus sequence typing.
Nakano, Kazuhiko; Lapirattanakul, Jinthana; Nomura, Ryota; Nemoto, Hirotoshi; Alaluusua, Satu; Grönroos, Lisa; Vaara, Martti; Hamada, Shigeyuki; Ooshima, Takashi; Nakagawa, Ichiro
2007-08-01
Streptococcus mutans is the major pathogen of dental caries, a biofilm-dependent infectious disease, and occasionally causes infective endocarditis. S. mutans strains have been classified into four serotypes (c, e, f, and k). However, little is known about the S. mutans population, including the clonal relationships among strains of S. mutans, in relation to the particular clones that cause systemic diseases. To address this issue, we have developed a multilocus sequence typing (MLST) scheme for S. mutans. Eight housekeeping gene fragments were sequenced from each of 102 S. mutans isolates collected from the four serotypes in Japan and Finland. Between 14 and 23 alleles per locus were identified, allowing us theoretically to distinguish more than 1.2 × 10^10 sequence types. We identified 92 sequence types in these 102 isolates, indicating that S. mutans contains a diverse population. Whereas serotype c strains were widely distributed in the dendrogram, serotype e, f, and k strains were differentiated into clonal complexes. Therefore, we conclude that the ancestral strain of S. mutans was serotype c. No geographic specificity was identified. However, the distribution of the collagen-binding protein gene (cnm) and direct evidence of mother-to-child transmission were clearly evident. In conclusion, the superior discriminatory capacity of this MLST scheme for S. mutans may have important practical implications.
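The theoretical number of sequence types in an MLST scheme is the product of the allele counts at each locus, since any combination of alleles defines a distinct profile. The abstract gives only the range of alleles per locus (14 to 23) across eight loci, not the individual counts, so the Python sketch below brackets the reported figure rather than reproducing it; the function name is an illustrative assumption.

```python
import math

# Illustrative only: the per-locus allele counts are not given in the abstract,
# so the scheme is bracketed using the reported minimum and maximum.


def theoretical_sequence_types(alleles_per_locus):
    """Number of distinct allelic profiles: the product of allele counts per locus."""
    counts = list(alleles_per_locus)
    if not counts or any(n < 1 for n in counts):
        raise ValueError("every locus needs at least one allele")
    return math.prod(counts)


N_LOCI = 8  # eight housekeeping gene fragments, as stated in the abstract
LOWER_BOUND = theoretical_sequence_types([14] * N_LOCI)  # all loci at the minimum
UPPER_BOUND = theoretical_sequence_types([23] * N_LOCI)  # all loci at the maximum

if __name__ == "__main__":
    print(f"lower bound: {LOWER_BOUND:.2e}")
    print(f"upper bound: {UPPER_BOUND:.2e}")
```

The bounds are 14^8 = 1,475,789,056 (about 1.5 × 10^9) and 23^8 = 78,310,985,281 (about 7.8 × 10^10), so the reported value of more than 1.2 × 10^10 is consistent with a mix of allele counts inside the stated range.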
Bouvet, Philippe; Ferraris, Laurent; Dauphin, Brunhilde; Popoff, Michel-Robert; Butel, Marie Jose
2014-01-01
In 2002, an outbreak of necrotizing enterocolitis in a Canadian neonatal intensive care unit was associated with a proposed novel species of Clostridium, “Clostridium neonatale.” To date, there are no data about the isolation, identification, or clinical significance of this species. Additionally, C. neonatale has not been formally classified as a new species, rendering its identification challenging. Indeed, the C. neonatale 16S rRNA gene sequence shows high similarity to another Clostridium species involved in neonatal necrotizing enterocolitis, Clostridium butyricum. By performing a polyphasic study combining phylogenetic analysis (16S rRNA gene sequencing and multilocus sequence analysis) and phenotypic characterization with mass spectrometry, we demonstrated that C. neonatale is a new species within the Clostridium genus sensu stricto, for which we propose the name Clostridium neonatale sp. nov. Now that the status of C. neonatale has been clarified, matrix-assisted laser desorption ionization–time of flight mass spectrometry (MALDI-TOF MS) can be used for better differential identification of C. neonatale and C. butyricum clinical isolates. This is necessary to precisely define the role and clinical significance of C. neonatale, a species that may have been misidentified and underrepresented during previous neonatal necrotizing enterocolitis studies. PMID:25232167
Rubin, D A; Dores, R M
1995-06-01
In order to obtain a more resolute phylogeny of teleosts based on growth hormone (GH) sequences, phylogenetic analyses were performed in which deletions (gaps), which appear to be order specific, were upheld to maintain GH's structural information. Sequences were analyzed at 194 amino acid positions. In addition, the two closest genealogically related groups to the teleosts, Amia calva and Acipenser guldenstadti, were used as outgroups. Modified sequence alignments were also analyzed to determine clade stability. Analyses indicated, in the most parsimonious cladogram, that molecular and morphological relationships for the orders of fishes are congruent. With GH molecular sequence data it was possible to resolve all clades at the familial level. Analyses of the primary sequence data indicate that: (a) the halecomorphean and chondrostean GH sequences are the appropriate outgroups for generating the most parsimonious cladogram for teleosts; (b) proper alignment of teleost GH sequence by the inclusion of gaps is necessary for resolution of the Percomorpha; and (c) removal of sequence information by deleting improperly aligned sequence decreases the phylogenetic signal obtained.
Pestalotiopsis and allied genera from Camellia, with description of 11 new species from China.
Liu, Fang; Hou, Lingwei; Raza, Mubashar; Cai, Lei
2017-04-13
A total of 124 Pestalotiopsis-like isolates associated with symptomatic and asymptomatic tissues of Camellia sinensis and other Camellia spp. from eight provinces in China were investigated. Based on single- and multi-locus (ITS, TEF, TUB2) phylogenies, as well as morphological characters, host associations and geographical distributions, they were classified into at least 19 species in three genera, i.e. Neopestalotiopsis, Pestalotiopsis and Pseudopestalotiopsis. Eight novel species in Pestalotiopsis and three novel species in Pseudopestalotiopsis were described. Our data suggested that the loci currently widely used for Pestalotiopsis-like genera do not consistently provide stable tree topologies with sufficient resolution, especially for Neopestalotiopsis. Moreover, the number, branch pattern and length of the conidial basal appendages were revealed to be phylogenetically informative characters in Pestalotiopsis.
Phylogeny of economically important insect pests that infesting several crops species in Malaysia
NASA Astrophysics Data System (ADS)
Ghazali, Siti Zafirah; Zain, Badrul Munir Md.; Yaakop, Salmah
2014-09-01
This paper reports molecular data on insect pests of commercial crops in Peninsular Malaysia. Fifteen insect pest species were sampled: pests (Metisa plana, Calliteara horsefeldii, Cotesia vestalis, Bactrocera papayae, Bactrocera carambolae, Bactrocera latifrons, Conopomorpha cramella, Sesamia inferens, Chilo polychrysa, Rhynchophorus vulneratus, and Rhynchophorus ferrugineus) of nine crops (oil palm, coconut, paddy, cocoa, starfruit, angled loofah, guava, chili and mustard), together with four species comprising a fern pest (Herpetogramma platycapna) and storage and rice pests (Tribolium castaneum, Oryzaephilus surinamensis and Cadra cautella). The presented phylogeny summarizes an initial phylogenetic hypothesis for these economically important insect pests. Phylogenetic relationships among 39 individuals of 15 species, belonging to 12 genera in three orders, were inferred from DNA sequences of a mitochondrial marker, cytochrome oxidase subunit I (COI), and a nuclear marker, the ribosomal DNA 28S D2 region. The phylogenies resulting from the analyses of the two genes are relatively similar but differ in the sequence of evolution. Interestingly, Bayesian inference analysis of the COI sequence data yielded a better-resolved phylogeny than the 28S sequences, one that corroborates traditional hypotheses of holometabolan relationships and most recent molecular studies. This finding provides information on the relationships of pest species that infest several crops in Malaysia and an estimate of the relationships among the orders of Holometabola. Supported by the phylogenetic tree, the larval stages of insect pests could be identified accurately without waiting for the emergence of adults.
Concordance and discordance of sequence survey methods for molecular epidemiology
Hasan, Nur A.; Cebula, Thomas A.; Colwell, Rita R.; Robison, Richard A.; Johnson, W. Evan; Crandall, Keith A.
2015-01-01
The post-genomic era is characterized by the direct acquisition and analysis of genomic data with many applications, including the enhancement of the understanding of microbial epidemiology and pathology. However, there are a number of molecular approaches to survey pathogen diversity, and the impact of these different approaches on parameter estimation and inference are not entirely clear. We sequenced whole genomes of bacterial pathogens, Burkholderia pseudomallei, Yersinia pestis, and Brucella spp. (60 new genomes), and combined them with 55 genomes from GenBank to address how different molecular survey approaches (whole genomes, SNPs, and MLST) impact downstream inferences on molecular evolutionary parameters, evolutionary relationships, and trait character associations. We selected isolates for sequencing to represent temporal, geographic origin, and host range variability. We found that substitution rate estimates vary widely among approaches, and that SNP and genomic datasets yielded different but strongly supported phylogenies. MLST yielded poorly supported phylogenies, especially in our low diversity dataset, i.e., Y. pestis. Trait associations showed that B. pseudomallei and Y. pestis phylogenies are significantly associated with geography, irrespective of the molecular survey approach used, while Brucella spp. phylogeny appears to be strongly associated with geography and host origin. We contrast inferences made among monomorphic (clonal) and non-monomorphic bacteria, and between intra- and inter-specific datasets. We also discuss our results in light of underlying assumptions of different approaches. PMID:25737810
Ribosomal RNA: a key to phylogeny
NASA Technical Reports Server (NTRS)
Olsen, G. J.; Woese, C. R.
1993-01-01
As molecular phylogeny increasingly shapes our understanding of organismal relationships, no molecule has been applied to more questions than have ribosomal RNAs. We review this role of the rRNAs and some of the insights that have been gained from them. We also offer some of the practical considerations in extracting the phylogenetic information from the sequences. Finally, we stress the importance of comparing results from multiple molecules, both as a method for testing the overall reliability of the organismal phylogeny and as a method for more broadly exploring the history of the genome.
Ba, Hengxing; Yang, Fuhe; Xing, Xiumei; Li, Chunyi
2015-06-01
To further refine the classification and phylogeny of sika deer subspecies, the well-annotated sequences of the complete mitochondrial DNA (mtDNA) control region of 13 sika deer subspecies were downloaded from GenBank, aligned and analyzed in this study. Reconstruction of the phylogenetic tree with an extended sample set revealed a split between Northern and Southern Mainland Asia/Taiwan lineages and, moreover, the presence of two subspecies, C. n. mantchuricus and C. n. hortulorum, in Northern Mainland Asia. Unexpectedly, Dybowskii's sika deer, which was thought to originate from Northern Mainland Asia, joins the Southern Mainland Asia/Taiwan lineage. Genetic divergence between Dybowskii's sika deer and all the other established subspecies ranged from 2.1% to 4.7% at the mtDNA sequence level, which suggests that the maternal lineage of uncertain sika subspecies in Europe has been maintained until today. This study also provides a better understanding of the classification, phylogeny and phylogeographic history of sika deer subspecies.
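Percent divergences such as the 2.1% to 4.7% quoted in this abstract are commonly obtained by comparing aligned sequences site by site. The abstract does not say which distance measure was used, so the Python sketch below assumes the simplest one, the uncorrected p-distance (proportion of differing sites); the function name and the toy sequences in the usage lines are hypothetical.

```python
# Illustrative only: uncorrected p-distance between two aligned sequences.
# Which distance measure the study used is an assumption, not stated in the abstract.


def p_distance(seq_a, seq_b, ignore="-N"):
    """Proportion of compared alignment columns at which two sequences differ.

    Columns where either sequence has a gap or ambiguous base are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in ignore or b in ignore:
            continue
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


if __name__ == "__main__":
    # Toy aligned fragments, invented for illustration.
    first = "ACGTACGTAC"
    second = "ACGTACGTAT"
    print(f"{100 * p_distance(first, second):.1f}% divergence")
```

Skipping gapped columns is one design choice among several; treating gaps as differences, or applying a substitution-model correction, would give somewhat different values for the same alignment.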
Chen, Zhi-Teng; Zhao, Meng-Yuan; Xu, Cheng; Du, Yu-Zhou
2018-05-01
The infraorder Systellognatha is the most species-rich clade in the insect order Plecoptera and includes six families in two superfamilies: Pteronarcyoidea (Pteronarcyidae, Peltoperlidae, and Styloperlidae) and Perloidea (Perlidae, Perlodidae, and Chloroperlidae). To resolve the debatable phylogeny of Systellognatha, we carried out the first mitochondrial phylogenetic analysis covering all the six families, including three newly sequenced mitogenomes from two families (Perlodidae and Peltoperlidae) and 15 published mitogenomes. The three newly reported mitogenomes share conserved mitogenomic features with other sequenced stoneflies. For phylogenetic analyses, we assembled five datasets with two inference methods to assess their influence on topology and nodal support within Systellognatha. The results indicated that inclusion of the third codon positions of PCGs, exclusion of rRNA genes, the use of nucleotide datasets and Bayesian inference could improve the phylogenetic reconstruction of Systellognatha. The monophyly of Perloidea was supported in the mitochondrial phylogeny, but Pteronarcyoidea was recovered as paraphyletic and remained controversial. In this mitochondrial phylogenetic study, the relationships within Systellognatha were recovered as (((Perlidae + (Perlodidae + Chloroperlidae)) + (Pteronarcyidae + Styloperlidae)) + Peltoperlidae). Copyright © 2018 Elsevier B.V. All rights reserved.
Molecular phylogeny of choanoflagellates, the sister group to Metazoa
Carr, M.; Leadbeater, B. S. C.; Hassan, R.; Nelson, M.; Baldauf, S. L.
2008-01-01
Choanoflagellates are single-celled aquatic flagellates with a unique morphology consisting of a cell with a single flagellum surrounded by a “collar” of microvilli. They have long interested evolutionary biologists because of their striking resemblance to the collared cells (choanocytes) of sponges. Molecular phylogeny has confirmed a close relationship between choanoflagellates and Metazoa, and the first choanoflagellate genome sequence has recently been published. However, molecular phylogenetic studies within choanoflagellates are still extremely limited. Thus, little is known about choanoflagellate evolution or the exact nature of the relationship between choanoflagellates and Metazoa. We have sequenced four genes from a broad sampling of the morphological diversity of choanoflagellates including most species currently available in culture. Phylogenetic analyses of these sequences, alone and in combination, reject much of the traditional taxonomy of the group. The molecular data also strongly support choanoflagellate monophyly rejecting proposals that Metazoa were derived from a true choanoflagellate ancestor. Mapping of a complementary matrix of morphological and ecological traits onto the phylogeny allows a reinterpretation of choanoflagellate character evolution and predicts the nature of their last common ancestor. PMID:18922774
Dan, Tong; Liu, Wenjun; Sun, Zhihong; Lv, Qiang; Xu, Haiyan; Song, Yuqin; Zhang, Heping
2014-06-09
Economically, Leuconostoc lactis is one of the most important species in the genus Leuconostoc. It plays an important role in the food industry including the production of dextrans and bacteriocins. Currently, traditional molecular typing approaches for characterisation of this species at the isolate level are either unavailable or are not sufficiently reliable for practical use. Multilocus sequence typing (MLST) is a robust and reliable method for characterising bacterial and fungal species at the molecular level. In this study, a novel MLST protocol was developed for 50 L. lactis isolates from Mongolia and China. Sequences from eight targeted genes (groEL, carB, recA, pheS, murC, pyrG, rpoB and uvrC) were obtained. Sequence analysis indicated 20 different sequence types (STs), with 13 of them being represented by a single isolate. Phylogenetic analysis based on the sequences of eight MLST loci indicated that the isolates belonged to two major groups, A (34 isolates) and B (16 isolates). Linkage disequilibrium analyses indicated that recombination occurred at a low frequency in L. lactis, indicating a clonal population structure. Split-decomposition analysis indicated that intraspecies recombination played a role in generating genotypic diversity amongst isolates. Our results indicated that MLST is a valuable tool for typing L. lactis isolates that can be used for further monitoring of evolutionary changes and population genetics.
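In MLST, each isolate is reduced to an allelic profile (one allele number per locus), and isolates with identical profiles share a sequence type. The Python sketch below illustrates that grouping step and the counting of STs represented by a single isolate; the isolate names, allele numbers and ST numbering are hypothetical and are not the study's data.

```python
from collections import Counter

# Illustrative only: group isolates into sequence types (STs) by allelic profile.


def assign_sequence_types(profiles):
    """Map each isolate to an ST number; identical profiles get the same ST.

    ST numbers are assigned in order of first appearance, starting at 1.
    """
    st_of_profile = {}
    assignments = {}
    for isolate, profile in profiles.items():
        key = tuple(profile)
        if key not in st_of_profile:
            st_of_profile[key] = len(st_of_profile) + 1
        assignments[isolate] = st_of_profile[key]
    return assignments


def singleton_sts(assignments):
    """Return the STs that are represented by exactly one isolate."""
    counts = Counter(assignments.values())
    return sorted(st for st, n in counts.items() if n == 1)


# Hypothetical eight-locus profiles in the locus order
# groEL, carB, recA, pheS, murC, pyrG, rpoB, uvrC.
EXAMPLE_PROFILES = {
    "isolate_A": (1, 1, 2, 1, 3, 1, 1, 2),
    "isolate_B": (1, 1, 2, 1, 3, 1, 1, 2),
    "isolate_C": (2, 1, 2, 1, 3, 1, 1, 2),
    "isolate_D": (1, 4, 2, 1, 1, 1, 5, 2),
}

if __name__ == "__main__":
    sts = assign_sequence_types(EXAMPLE_PROFILES)
    print(sts)
    print("singleton STs:", singleton_sts(sts))
```

Applied to real data, the same grouping would yield counts of the kind the abstract reports, for example the total number of STs and how many of them are represented by a single isolate.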
Otero, Verónica; Rodríguez-Calleja, José-María; Otero, Andrés; García-López, María-Luisa
2013-01-01
A collection of 81 isolates of enteropathogenic Escherichia coli (EPEC) was obtained from samples of bulk tank sheep milk (62 isolates), ovine feces (4 isolates), sheep farm environment (water, 4 isolates; air, 1 isolate), and human stool samples (9 isolates). The strains were considered atypical EPEC organisms, carrying the eae gene without harboring the pEAF plasmid. Multilocus sequence typing (MLST) was carried out with seven housekeeping genes and 19 sequence types (ST) were detected, with none of them having been previously reported for atypical EPEC. The most frequent ST included 41 strains isolated from milk and human stool samples. Genetic typing by pulsed-field gel electrophoresis (PFGE) resulted in 57 patterns which grouped in 24 clusters. Comparison of strains isolated from the different samples showed phylogenetic relationships between milk and human isolates and also between milk and water isolates. The results obtained show a possible risk for humans due to the presence of atypical EPEC in ewes' milk and suggest a transmission route for this emerging pathogen through contaminated water. PMID:23872571
Bernhardt, A; Sedlacek, L; Wagner, S; Schwarz, C; Würstl, B; Tintelnot, K
2013-12-01
Scedosporium and Pseudallescheria species are the second most common lung-colonising fungi in cystic fibrosis (CF) patients. For epidemiological reasons it is important to trace sources of infection, routes of transmission and to determine whether these fungi are transient or permanent colonisers of the respiratory tract. Molecular typing methods like multilocus sequence typing (MLST) help provide this data. Clinical isolates of the P. boydii complex (including S. apiospermum and P. boydii) from CF patients in different regions of Germany were studied using MLST. Five gene loci, ACT, CAL, RPB2, BT2 and SOD2, were analysed. The S. apiospermum isolates from 34 patients were assigned to 32 sequence types (STs), and the P. boydii isolates from 14 patients to 8 STs. The results revealed that patients can be colonised by individual strains for years. The MLST scheme developed for S. apiospermum and P. boydii is a highly effective tool for epidemiologic studies worldwide. The MLST data are accessible at http://mlst.mycologylab.org/. Copyright © 2013 European Cystic Fibrosis Society. Published by Elsevier B.V. All rights reserved.
Klein, Günter
2011-07-01
Bacillus cereus var. toyoi strain NCIMB 40112 (Toyocerin), a probiotic authorized in the European Union as feed additive for swine, bovines, poultry, and rabbits, was characterized by DNA fingerprinting applying pulsed-field gel electrophoresis and multilocus sequence typing and was compared with reference strains (of clinical and environmental origins). The probiotic strain was clearly characterized by pulsed-field gel electrophoresis using the restriction enzymes Apa I and Sma I resulting in unique DNA patterns. The comparison to the clinical reference strain B. cereus DSM 4312 was done with the same restriction enzymes, and again a clear differentiation of the two strains was possible by the resulting DNA patterns. The use of the restriction enzymes Apa I and Sma I is recommended for further studies. Furthermore, multilocus sequence typing analysis revealed a sequence type (ST 111) that was different from all known STs of B. cereus strains from food poisoning incidents. Thus, a strain characterization and differentiation from food poisoning strains for the probiotic strain was possible. Copyright ©, International Association for Food Protection
Marco, Jorge D; Bhutto, Abdul M; Soomro, Farooq R; Baloch, Javed H; Barroso, Paola A; Kato, Hirotomo; Uezato, Hiroshi; Katakura, Ken; Korenaga, Masataka; Nonaka, Shigeo; Hashiguchi, Yoshihisa
2006-08-01
Seventeen Leishmania stocks isolated from cutaneous lesions of Pakistani patients were studied by multilocus enzyme electrophoresis and by polymerase chain reaction amplification and sequencing of the cytochrome b (Cyt b) gene. Eleven stocks that expressed nine zymodemes were assigned to L. (Leishmania) major. All of them were isolated from patients in the lowlands of Larkana district and Sibi city in Sindh and Balochistan provinces, respectively. The remaining six, distributed in two zymodemes (five and one), isolated from the highland of Quetta city, Balochistan, were identified as L. (L.) tropica. The same result at species level was obtained by the Cyt b sequencing for all the stocks examined. No clear-cut association between the clinical features (wet or dry type lesions) and the Leishmania species involved was found. Leishmania (L.) major was highly polymorphic compared with L. (L.) tropica. This difference may be explained by the fact that humans may act as a sole reservoir of L. (L.) tropica in anthroponotic cycles; however, many wild mammals can be reservoirs of L. (L.) major in zoonotic cycles.
Housworth, E A; Martins, E P
2001-01-01
Statistical randomization tests in evolutionary biology often require a set of random, computer-generated trees. For example, earlier studies have shown how large numbers of computer-generated trees can be used to conduct phylogenetic comparative analyses even when the phylogeny is uncertain or unknown. These methods were limited, however, in that (in the absence of molecular sequence or other data) they allowed users to assume that no phylogenetic information was available or that all possible trees were known. Intermediate situations where only a taxonomy or other limited phylogenetic information (e.g., polytomies) are available are technically more difficult. The current study describes a procedure for generating random samples of phylogenies while incorporating limited phylogenetic information (e.g., four taxa belong together in a subclade). The procedure can be used to conduct comparative analyses when the phylogeny is only partially resolved or can be used in other randomization tests in which large numbers of possible phylogenies are needed.
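As a rough illustration of the idea in the abstract above (random phylogenies that still honor limited prior information such as "these four taxa form a subclade"), here is a minimal Python sketch. It is not the authors' procedure: the function names, the nested-tuple tree representation, and the random-joining scheme are all assumptions made for illustration, and no claim is made about the probability distribution it induces over trees.

```python
import random

def random_tree(tips, rng):
    """Build a random bifurcating tree (nested tuples) by repeatedly
    joining two randomly chosen lineages. Hypothetical helper."""
    nodes = list(tips)
    while len(nodes) > 1:
        a = nodes.pop(rng.randrange(len(nodes)))
        b = nodes.pop(rng.randrange(len(nodes)))
        nodes.append((a, b))
    return nodes[0]

def random_tree_with_clade(taxa, clade, rng):
    """Random tree in which the taxa in `clade` are forced to form a subclade:
    resolve the constrained group first, then treat it as a single tip."""
    clade = set(clade)
    subtree = random_tree([t for t in taxa if t in clade], rng)
    rest = [t for t in taxa if t not in clade]
    return random_tree(rest + [subtree], rng)

def leaves(node):
    """Set of tip labels below a node."""
    if isinstance(node, tuple):
        return leaves(node[0]) | leaves(node[1])
    return {node}

def has_clade(node, clade):
    """True if some node of the tree has exactly `clade` as its tip set."""
    if leaves(node) == set(clade):
        return True
    return isinstance(node, tuple) and any(has_clade(c, clade) for c in node)

rng = random.Random(0)                 # fixed seed so the sample is repeatable
taxa = list("ABCDEFGH")                # toy taxon labels (assumed)
clade = {"A", "B", "C", "D"}           # the partial phylogenetic information
trees = [random_tree_with_clade(taxa, clade, rng) for _ in range(100)]
print(all(has_clade(t, clade) for t in trees))
```

Each sampled tree could then be fed to a comparative analysis, with results summarized across the whole sample.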
Veterinary Fusarioses within the United States
USDA-ARS's Scientific Manuscript database
Multilocus DNA sequence data was used to retrospectively assess the genetic diversity and evolutionary relationships of 67 Fusarium strains from veterinary sources, most of which were from the United States. Molecular phylogenetic analyses revealed that the strains comprised 23 phylogenetically dist...
Phylogeny of the owlet-nightjars (Aves: Aegothelidae) based on mitochondrial DNA sequence
Dumbacher, J.P.; Pratt, T.K.; Fleischer, R.C.
2003-01-01
The avian family Aegothelidae (Owlet-nightjars) comprises nine extant species and one extinct species, all of which are currently classified in a single genus, Aegotheles. Owlet-nightjars are secretive nocturnal birds of the South Pacific. They are relatively poorly studied and some species are known from only a few specimens. Furthermore, their confusing morphological variation has made it difficult to cluster existing specimens unambiguously into hierarchical taxonomic units. Here we sample all extant owlet-nightjar species and all but three currently recognized subspecies. We use DNA extracted primarily from museum specimens to obtain mitochondrial gene sequences and construct a molecular phylogeny. Our phylogeny suggests that most species are reciprocally monophyletic; however, A. albertisi appears paraphyletic. Our data also suggest splitting A. bennettii into two species and splitting A. insignis and A. tatei as suggested in another recent paper. © 2003 Elsevier Science (USA). All rights reserved.
Wymant, Chris; Colijn, Caroline; Danaviah, Siva; Essex, Max; Frost, Simon; Gall, Astrid; Gaseitsiwe, Simani; Grabowski, Mary K.; Gray, Ronald; Guindon, Stephane; von Haeseler, Arndt; Kaleebu, Pontiano; Kendall, Michelle; Kozlov, Alexey; Manasa, Justen; Minh, Bui Quang; Moyo, Sikhulile; Novitsky, Vlad; Nsubuga, Rebecca; Pillay, Sureshnee; Quinn, Thomas C.; Serwadda, David; Ssemwanga, Deogratius; Stamatakis, Alexandros; Trifinopoulos, Jana; Wawer, Maria; Brown, Andy Leigh; de Oliveira, Tulio; Kellam, Paul; Pillay, Deenan; Fraser, Christophe
2017-01-01
To characterize HIV-1 transmission dynamics in regions where the burden of HIV-1 is greatest, the “Phylogenetics and Networks for Generalised HIV Epidemics in Africa” consortium (PANGEA-HIV) is sequencing full-genome viral isolates from across sub-Saharan Africa. We report the first 3,985 PANGEA-HIV consensus sequences from four cohort sites (Rakai Community Cohort Study, n = 2,833; MRC/UVRI Uganda, n = 701; Mochudi Prevention Project, n = 359; Africa Health Research Institute Resistance Cohort, n = 92). Next-generation sequencing success rates varied: more than 80% of the viral genome from the gag to the nef genes could be determined for all sequences from South Africa, 75% of sequences from Mochudi, 60% of sequences from MRC/UVRI Uganda, and 22% of sequences from Rakai. Partial sequencing failure was primarily associated with low viral load, increased for amplicons closer to the 3′ end of the genome, was not associated with subtype diversity except HIV-1 subtype D, and remained significantly associated with sampling location after controlling for other factors. We assessed the impact of the missing data patterns in PANGEA-HIV sequences on phylogeny reconstruction in simulations. We found a threshold in terms of taxon sampling below which the patchy distribution of missing characters in next-generation sequences (NGS) has an excess negative impact on the accuracy of HIV-1 phylogeny reconstruction, which is attributable to tree reconstruction artifacts that accumulate when branches in viral trees are long. The large number of PANGEA-HIV sequences provides unprecedented opportunities for evaluating HIV-1 transmission dynamics across sub-Saharan Africa and identifying prevention opportunities. Molecular epidemiological analyses of these data must proceed cautiously because sequence sampling remains below the identified threshold and a considerable negative impact of missing characters on phylogeny reconstruction is expected. PMID:28540766
Ratmann, Oliver; Wymant, Chris; Colijn, Caroline; Danaviah, Siva; Essex, M; Frost, Simon D W; Gall, Astrid; Gaiseitsiwe, Simani; Grabowski, Mary; Gray, Ronald; Guindon, Stephane; von Haeseler, Arndt; Kaleebu, Pontiano; Kendall, Michelle; Kozlov, Alexey; Manasa, Justen; Minh, Bui Quang; Moyo, Sikhulile; Novitsky, Vladimir; Nsubuga, Rebecca; Pillay, Sureshnee; Quinn, Thomas C; Serwadda, David; Ssemwanga, Deogratius; Stamatakis, Alexandros; Trifinopoulos, Jana; Wawer, Maria; Leigh Brown, Andrew; de Oliveira, Tulio; Kellam, Paul; Pillay, Deenan; Fraser, Christophe
2017-05-25
To characterize HIV-1 transmission dynamics in regions where the burden of HIV-1 is greatest, the 'Phylogenetics and Networks for Generalised HIV Epidemics in Africa' consortium (PANGEA-HIV) is sequencing full-genome viral isolates from across sub-Saharan Africa. We report the first 3,985 PANGEA-HIV consensus sequences from four cohort sites (Rakai Community Cohort Study, n=2,833; MRC/UVRI Uganda, n=701; Mochudi Prevention Project, n=359; Africa Health Research Institute Resistance Cohort, n=92). Next-generation sequencing success rates varied: more than 80% of the viral genome from the gag to the nef genes could be determined for all sequences from South Africa, 75% of sequences from Mochudi, 60% of sequences from MRC/UVRI Uganda, and 22% of sequences from Rakai. Partial sequencing failure was primarily associated with low viral load, increased for amplicons closer to the 3' end of the genome, was not associated with subtype diversity except HIV-1 subtype D, and remained significantly associated with sampling location after controlling for other factors. We assessed the impact of the missing data patterns in PANGEA-HIV sequences on phylogeny reconstruction in simulations. We found a threshold in terms of taxon sampling below which the patchy distribution of missing characters in next-generation sequences has an excess negative impact on the accuracy of HIV-1 phylogeny reconstruction, which is attributable to tree reconstruction artifacts that accumulate when branches in viral trees are long. The large number of PANGEA-HIV sequences provides unprecedented opportunities for evaluating HIV-1 transmission dynamics across sub-Saharan Africa and identifying prevention opportunities. Molecular epidemiological analyses of these data must proceed cautiously because sequence sampling remains below the identified threshold and a considerable negative impact of missing characters on phylogeny reconstruction is expected.
Phylogenetic estimates of diversification rate are affected by molecular rate variation.
Duchêne, D A; Hua, X; Bromham, L
2017-10-01
Molecular phylogenies are increasingly being used to investigate the patterns and mechanisms of macroevolution. In particular, node heights in a phylogeny can be used to detect changes in rates of diversification over time. Such analyses rest on the assumption that node heights in a phylogeny represent the timing of diversification events, which in turn rests on the assumption that evolutionary time can be accurately predicted from DNA sequence divergence. But there are many influences on the rate of molecular evolution, which might also influence node heights in molecular phylogenies, and thus affect estimates of diversification rate. In particular, a growing number of studies have revealed an association between the net diversification rate estimated from phylogenies and the rate of molecular evolution. Such an association might, by influencing the relative position of node heights, systematically bias estimates of diversification time. We simulated the evolution of DNA sequences under several scenarios where rates of diversification and molecular evolution vary through time, including models where diversification and molecular evolutionary rates are linked. We show that commonly used methods, including metric-based, likelihood and Bayesian approaches, can have a low power to identify changes in diversification rate when molecular substitution rates vary. Furthermore, the association between the rates of speciation and molecular evolution rate can cause the signature of a slowdown or speedup in speciation rates to be lost or misidentified. These results suggest that the multiple sources of variation in molecular evolutionary rates need to be considered when inferring macroevolutionary processes from phylogenies. © 2017 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2017 European Society For Evolutionary Biology.
Kropáčková, Lucie; Těšický, Martin; Albrecht, Tomáš; Kubovčiak, Jan; Čížková, Dagmar; Tomášek, Oldřich; Martin, Jean-François; Bobek, Lukáš; Králová, Tereza; Procházka, Petr; Kreisinger, Jakub
2017-10-01
Vertebrate gut microbiota (GM) is comprised of a taxonomically diverse consortium of symbiotic and commensal microorganisms that have a pronounced effect on host physiology, immune system function and health status. Despite much research on interactions between hosts and their GM, the factors affecting inter- and intraspecific GM variation in wild populations are still poorly known. We analysed data on faecal microbiota composition in 51 passerine species (319 individuals) using Illumina MiSeq sequencing of bacterial 16S rRNA (V3-V4 variable region). Despite pronounced interindividual variation, GM composition exhibited significant differences at the interspecific level, accounting for approximately 20%-30% of total GM variation. We also observed a significant correlation between GM composition divergence and host's phylogenetic divergence, with strength of correlation higher than that of GM vs. ecological or life history traits and geographic variation. The effect of host's phylogeny on GM composition was significant, even after statistical control for these confounding factors. Hence, our data do not support codiversification of GM and passerine phylogeny solely as a by-product of their ecological divergence. Furthermore, our findings do not support that GM vs. host's phylogeny codiversification is driven primarily through trans-generational GM transfer as the GM vs. phylogeny correlation does not increase with higher sequence similarity used when delimiting operational taxonomic units. Instead, we hypothesize that the GM vs. phylogeny correlation may arise as a consequence of interspecific divergence of genes that directly or indirectly modulate composition of GM. © 2017 John Wiley & Sons Ltd.
Testing the new animal phylogeny: a phylum level molecular analysis of the animal kingdom.
Bourlat, Sarah J; Nielsen, Claus; Economou, Andrew D; Telford, Maximilian J
2008-10-01
The new animal phylogeny inferred from ribosomal genes some years ago has prompted a number of radical rearrangements of the traditional, morphology based metazoan tree. The two main bilaterian clades, Deuterostomia and Protostomia, find strong support, but the protostomes consist of two sister groups, Ecdysozoa and Lophotrochozoa, not seen in morphology based trees. Although widely accepted, not all recent molecular phylogenetic analyses have supported the tripartite structure of the new animal phylogeny. Furthermore, even if the small ribosomal subunit (SSU) based phylogeny is correct, there is a frustrating lack of resolution of relationships between the phyla that make up the three clades of this tree. To address this issue, we have assembled a dataset including a large number of aligned sequence positions as well as a broad sampling of metazoan phyla. Our dataset consists of sequence data from ribosomal and mitochondrial genes combined with new data from protein coding genes (5139 amino acid and 3524 nucleotide positions in total) from 37 representative taxa sampled across the Metazoa. Our data show strong support for the basic structure of the new animal phylogeny as well as for the Mandibulata including Myriapoda. We also provide some resolution within the Lophotrochozoa, where we confirm support for a monophyletic clade of Echiura, Sipuncula and Annelida and surprising evidence of a close relationship between Brachiopoda and Nemertea.
Huang, Jie; Chen, Zigui; Song, Weibo; Berger, Helmut
2014-01-01
Classifications of the Urostyloidea were mainly based on morphology and morphogenesis. Because molecular phylogenies have largely relied on limited sampling and mostly single-gene information, incongruences between morphological data and gene sequences have arisen. In this work, three-gene data (SSU-rDNA, ITS1-5.8S-ITS2 and LSU-rDNA) comprising 12 genera in the "core urostyloids" are sequenced, and the phylogenies based on these different markers are compared using maximum-likelihood and Bayesian algorithms and tested by unconstrained and constrained analyses. The molecular phylogeny supports the following conclusions: (1) the monophyly of the core group of Urostyloidea is well supported while the whole Urostyloidea is not monophyletic; (2) Thigmokeronopsis and Apokeronopsis are clearly separated from the pseudokeronopsids in analyses of all three gene markers, supporting their exclusion from the Pseudokeronopsidae and the inclusion in the Urostylidae; (3) Diaxonella and Apobakuella should be assigned to the Urostylidae; (4) Bergeriella, Monocoronella and Neourostylopsis flavicana share a most recent common ancestor; (5) all molecular trees support the transfer of Metaurostylopsis flavicana to the recently proposed genus Neourostylopsis; (6) all molecular phylogenies fail to separate the morphologically well-defined genera Uroleptopsis and Pseudokeronopsis; and (7) Arcuseries gen. nov. containing three distinctly deviating Anteholosticha species is established. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
Schirtzinger, Erin E.; Matsumoto, Tania; Eberhard, Jessica R.; Graves, Gary R.; Sanchez, Juan J.; Capelli, Sara; Müller, Heinrich; Scharpegge, Julia; Chambers, Geoffrey K.; Fleischer, Robert C.
2008-01-01
The question of when modern birds (Neornithes) first diversified has generated much debate among avian systematists. Fossil evidence generally supports a Tertiary diversification, whereas estimates based on molecular dating favor an earlier diversification in the Cretaceous period. In this study, we used an alternate approach, the inference of historical biogeographic patterns, to test the hypothesis that the initial radiation of the Order Psittaciformes (the parrots and cockatoos) originated on the Gondwana supercontinent during the Cretaceous. We utilized broad taxonomic sampling (representatives of 69 of the 82 extant genera and 8 outgroup taxa) and multilocus molecular character sampling (3,941 bp from mitochondrial DNA (mtDNA) genes cytochrome oxidase I and NADH dehydrogenase 2 and nuclear introns of rhodopsin intron 1, tropomyosin alpha-subunit intron 5, and transforming growth factor ß-2) to generate phylogenetic hypotheses for the Psittaciformes. Analyses of the combined character partitions using maximum parsimony, maximum likelihood, and Bayesian criteria produced well-resolved and topologically similar trees in which the New Zealand taxa Strigops and Nestor (Psittacidae) were sister to all other psittaciforms and the cockatoo clade (Cacatuidae) was sister to a clade containing all remaining parrots (Psittacidae). Within this large clade of Psittacidae, some traditionally recognized tribes and subfamilies were monophyletic (e.g., Arini, Psittacini, and Loriinae), whereas several others were polyphyletic (e.g., Cyclopsittacini, Platycercini, Psittaculini, and Psittacinae). Ancestral area reconstructions using our Bayesian phylogenetic hypothesis and current distributions of genera supported the hypothesis of an Australasian origin for the Psittaciformes. 
Separate analyses of the timing of parrot diversification constructed with both Bayesian relaxed-clock and penalized likelihood approaches showed better agreement between geologic and diversification events in the chronograms based on a Cretaceous dating of the basal split within parrots than the chronograms based on a Tertiary dating of this split, although these data are more equivocal. Taken together, our results support a Cretaceous origin of Psittaciformes in Gondwana after the separation of Africa and the India/Madagascar block with subsequent diversification through both vicariance and dispersal. These well-resolved molecular phylogenies will be of value for comparative studies of behavior, ecology, and life history in parrots. PMID:18653733
Kahlke, Tim; Goesmann, Alexander; Hjerde, Erik; Willassen, Nils Peder; Haugen, Peik
2012-05-10
The criteria for defining bacterial species and even the concept of bacterial species itself are under debate, and the discussion is apparently intensifying as more genome sequence data become available. However, it is still unclear how the new advances in genomics should be used most efficiently to address this question. In this study we identify genes that are common to any group of genomes in our dataset, to determine whether genes specific to a particular taxon exist and to investigate their potential role in adaptation of bacteria to their specific niche. These genes were named unique core genes. Additionally, we investigate the existence and importance of unique core genes that are found in isolates of phylogenetically non-coherent groups. These groups of isolates, which share a genetic feature without sharing a closest common ancestor, are termed genophyletic groups. The bacterial family Vibrionaceae was used as the model, and we compiled and compared genome sequences of 64 different isolates. Using the software orthoMCL we determined clusters of homologous genes among the investigated genome sequences. We used multilocus sequence analysis to build a host phylogeny and mapped the numbers of unique core genes of all distinct groups of isolates onto the tree. The results show that unique core genes are more likely to be found in monophyletic groups of isolates. Genophyletic groups of isolates, in contrast, are less common, especially for large groups of isolates. The subsequent annotation of unique core genes that are present in genophyletic groups indicates a high degree of horizontally transferred genes. Finally, the annotation of the unique core genes of Vibrio cholerae revealed genes involved in aerotaxis and biosynthesis of the iron-chelator vibriobactin. The presented work indicates that genes specific for any taxon inside the bacterial family Vibrionaceae exist. 
These unique core genes encode conserved metabolic functions that can shed light on the adaptation of a species to its ecological niche. Additionally, our study suggests that unique core genes can be used to aid classification of bacteria and contribute to a bacterial species definition on a genomic level. Furthermore, these genes may be of importance in clinical diagnostics and drug development.
High-Resolution Melting Analysis for Rapid Detection of Sequence Type 131 Escherichia coli.
Harrison, Lucas B; Hanson, Nancy D
2017-06-01
Escherichia coli isolates belonging to the sequence type 131 (ST131) clonal complex have been associated with the global distribution of fluoroquinolone and β-lactam resistance. Whole-genome sequencing and multilocus sequence typing identify sequence type but are expensive when evaluating large numbers of samples. This study was designed to develop a cost-effective screening tool using high-resolution melting (HRM) analysis to differentiate ST131 from non-ST131 E. coli in large sample populations in the absence of sequence analysis. The method was optimized using DNA from 12 E. coli isolates. Singleplex PCR was performed using 10 ng of DNA, Type-it HRM buffer, and multilocus sequence typing primers and was followed by multiplex PCR. The amplicon sizes ranged from 630 to 737 bp. Melt temperature peaks were determined by performing HRM analysis at 0.1°C resolution from 50 to 95°C on a Rotor-Gene Q 5-plex HRM system. Derivative melt curves were compared between sequence types and analyzed by principal component analysis. A blinded study of 191 E. coli isolates of ST131 and unknown sequence types validated this methodology. This methodology returned 99.2% specificity (124 true negatives and 1 false positive) and 100% sensitivity (66 true positives and 0 false negatives). This HRM methodology distinguishes ST131 from non-ST131 E. coli without sequence analysis. The analysis can be accomplished in about 3 h in any laboratory with an HRM-capable instrument and principal component analysis software. Therefore, this assay is a fast and cost-effective alternative to sequencing-based ST131 identification. Copyright © 2017 Harrison and Hanson.
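The sensitivity and specificity figures quoted in the abstract above follow directly from the reported counts. The short Python check below is only a worked illustration of that arithmetic; the function and variable names are assumptions, and the four counts are the ones given in the abstract.

```python
def sensitivity(tp, fn):
    """Fraction of true ST131 isolates that the assay calls ST131."""
    return tp / (tp + fn)

def specificity(tn, fp):
    """Fraction of non-ST131 isolates that the assay calls non-ST131."""
    return tn / (tn + fp)

# Counts reported for the blinded validation set
tp, fn = 66, 0      # true positives, false negatives
tn, fp = 124, 1     # true negatives, false positives

sens = sensitivity(tp, fn)
spec = specificity(tn, fp)
total = tp + fn + tn + fp

print(f"sensitivity = {sens:.1%}")   # 66 / 66
print(f"specificity = {spec:.1%}")   # 124 / 125
print(f"isolates    = {total}")
```

The four counts sum to 191, matching the number of isolates in the blinded study, and 124/125 rounds to the reported 99.2%.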
Md. Iqbal Hosen; Tai-Hui Li; D. Jean Lodge; Alan Rockefeller
2016-01-01
This is the first internal transcribed spacer (ITS) phylogeny of the enigmatic genus Cantharocybe and includes ITS sequences from two out of the three holotype collections. Two species are reported from the Americas and only a single species from Asia. Additionally, a collection of Cantharocybe virosa collected from tropical...
Entomopathogen ID: A multi-locus sequence alignment resource for entomopathogenic fungi
USDA-ARS's Scientific Manuscript database
The ability to correctly identify entomopathogenic fungi is an important step in developing biopesticides and effectively communicating research results. Over the years, identifying entomopathogenic fungi has evolved from a system based on diagnostic morphological and physiological characters to mol...
Multilocus sequence typing reveals a novel subspeciation of Lactobacillus delbrueckii.
Tanigawa, Kana; Watanabe, Koichi
2011-03-01
Currently, the species Lactobacillus delbrueckii is divided into four subspecies, L. delbrueckii subsp. delbrueckii, L. delbrueckii subsp. bulgaricus, L. delbrueckii subsp. indicus and L. delbrueckii subsp. lactis. These classifications were based mainly on phenotypic identification methods and few studies have used genotypic identification methods. As a result, these subspecies have not yet been reliably delineated. In this study, the four subspecies of L. delbrueckii were discriminated by phenotype and by genotypic identification [amplified-fragment length polymorphism (AFLP) and multilocus sequence typing (MLST)] methods. The MLST method developed here was based on the analysis of seven housekeeping genes (fusA, gyrB, hsp60, ileS, pyrG, recA and recG). The MLST method had good discriminatory ability: the 41 strains of L. delbrueckii examined were divided into 34 sequence types, with 29 sequence types represented by only a single strain. The sequence types were divided into eight groups. These groups could be discriminated as representing different subspecies. The results of the AFLP and MLST analyses were consistent. The type strain of L. delbrueckii subsp. delbrueckii, YIT 0080(T), was clearly discriminated from the other strains currently classified as members of this subspecies, which were located close to strains of L. delbrueckii subsp. lactis. The MLST scheme developed in this study should be a useful tool for the identification of strains of L. delbrueckii to the subspecies level.
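The step from per-locus sequences to sequence types, which underlies counts such as "34 sequence types, with 29 represented by only a single strain", can be sketched as follows. This is a minimal sketch under stated assumptions, not the scheme's actual implementation: allele numbers and ST numbers are simply assigned in order of first appearance, and the strain names and toy sequences are hypothetical. Only the seven locus names come from the abstract above.

```python
from collections import Counter

# The seven housekeeping loci named in the abstract
LOCI = ["fusA", "gyrB", "hsp60", "ileS", "pyrG", "recA", "recG"]

def assign_sequence_types(strains):
    """Map each strain to a sequence type (ST).

    `strains` is a dict: strain name -> {locus: sequence string}.
    Each distinct sequence at a locus gets an allele number; each distinct
    combination of allele numbers (the allelic profile) gets an ST number.
    Numbering by first appearance is an assumption of this sketch.
    """
    allele_ids = {locus: {} for locus in LOCI}
    st_ids = {}
    result = {}
    for name, seqs in strains.items():
        profile = []
        for locus in LOCI:
            table = allele_ids[locus]
            seq = seqs[locus]
            if seq not in table:
                table[seq] = len(table) + 1
            profile.append(table[seq])
        profile = tuple(profile)
        if profile not in st_ids:
            st_ids[profile] = len(st_ids) + 1
        result[name] = st_ids[profile]
    return result

# Toy data (hypothetical): strain_C differs from the others at recA only
base = {locus: "ACGT" for locus in LOCI}
strains = {
    "strain_A": dict(base),
    "strain_B": dict(base),
    "strain_C": {**base, "recA": "ACGA"},
}

sts = assign_sequence_types(strains)
# STs represented by exactly one strain
singletons = [st for st, n in Counter(sts.values()).items() if n == 1]
print(sts)
print(singletons)
```

A single nucleotide difference at one locus is enough to create a new allele and therefore a new ST, which is why a scheme with seven loci can separate closely related strains.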
Multilocus sequence typing scheme for the Mycobacterium abscessus complex.
Macheras, Edouard; Konjek, Julie; Roux, Anne-Laure; Thiberge, Jean-Michel; Bastian, Sylvaine; Leão, Sylvia Cardoso; Palaci, Moises; Sivadon-Tardy, Valérie; Gutierrez, Cristina; Richter, Elvira; Rüsch-Gerdes, Sabine; Pfyffer, Gaby E; Bodmer, Thomas; Jarlier, Vincent; Cambau, Emmanuelle; Brisse, Sylvain; Caro, Valérie; Rastogi, Nalin; Gaillard, Jean-Louis; Heym, Beate
2014-01-01
We developed a multilocus sequence typing (MLST) scheme for Mycobacterium abscessus sensu lato, based on the partial sequencing of seven housekeeping genes: argH, cya, glpK, gnd, murC, pta and purH. This scheme was used to characterize a collection of 227 isolates recovered between 1994 and 2010 in France, Germany, Switzerland and Brazil. We identified 100 different sequence types (STs), which were distributed into three groups on the tree obtained by concatenating the sequences of the seven housekeeping gene fragments (3576bp): the M. abscessus sensu stricto group (44 STs), the "M. massiliense" group (31 STs) and the "M. bolletii" group (25 STs). SplitTree analysis showed a degree of intergroup lateral transfers. There was also evidence of lateral transfer events involving rpoB. The most prevalent STs in our collection were ST1 (CC5; 20 isolates) and ST23 (CC3; 31 isolates). Both STs were found in Europe and Brazil, and the latter was implicated in a large post-surgical procedure outbreak in Brazil. Respiratory isolates from patients with cystic fibrosis belonged to a large variety of STs; however, ST2 was predominant in this group of patients. Our MLST scheme, publicly available at www.pasteur.fr/mlst, offers investigators a valuable typing tool for M. abscessus sensu lato in future epidemiological studies throughout the world. Copyright © 2013 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
Dan, Tong; Liu, Wenjun; Song, Yuqin; Xu, Haiyan; Menghe, Bilige; Zhang, Heping; Sun, Zhihong
2015-05-20
Lactobacillus fermentum is economically important in the production and preservation of fermented foods. A repeatable and discriminative typing method was devised to characterize L. fermentum at the molecular level. The multilocus sequence typing (MLST) scheme developed was based on analysis of the internal sequence of 11 housekeeping gene fragments (clpX, dnaA, dnaK, groEL, murC, murE, pepX, pyrG, recA, rpoB, and uvrC). MLST analysis of 203 isolates of L. fermentum from Mongolia and seven provinces/autonomous regions in China identified 57 sequence types (ST), 27 of which were represented by only a single isolate, indicating high genetic diversity. Phylogenetic analyses based on the sequence of the 11 housekeeping gene fragments indicated that the L. fermentum isolates analyzed belonged to two major groups. A standardized index of association (I_A^S) indicated a weak clonal population structure in L. fermentum. Split decomposition analysis indicated that recombination played an important role in generating the genetic diversity observed in L. fermentum. The results from the minimum spanning tree strongly suggested that evolution of L. fermentum STs was not correlated with geography or food-type. The MLST scheme developed will be valuable for further studies on the evolution and population structure of L. fermentum isolates used in food products.
Solving the problem of comparing whole bacterial genomes across different sequencing platforms.
Kaas, Rolf S; Leekitcharoenphon, Pimlapas; Aarestrup, Frank M; Lund, Ole
2014-01-01
Whole genome sequencing (WGS) shows great potential for real-time monitoring and identification of infectious disease outbreaks. However, rapid and reliable comparison of data generated in multiple laboratories and using multiple technologies is essential. So far, studies have focused on using one technology because each technology has a systematic bias, making integration of data generated from different platforms difficult. We developed two different procedures for identifying variable sites and inferring phylogenies in WGS data across multiple platforms. The methods were evaluated on three bacterial data sets sequenced on three different platforms (Illumina, 454, Ion Torrent). We show that the methods are able to overcome the systematic biases caused by the sequencers and infer the expected phylogenies. We conclude that the success of these new procedures is due to validation of all informative sites that are included in the analysis. The procedures are available as web tools.
Redberg, G.L.; Hibbett, D.S.; Ammirati, J.F.; Rodriguez, R.J.
2003-01-01
The genetic diversity and phylogeny of Bridgeoporus nobilissimus have been analyzed. DNA was extracted from spores collected from individual fruiting bodies representing six geographically distinct populations in Oregon and Washington. Spore samples collected contained low levels of bacteria, yeast and a filamentous fungal species. Using taxon-specific PCR primers, it was possible to discriminate among rDNA from bacteria, yeast, a filamentous associate and B. nobilissimus. Nuclear rDNA internal transcribed spacer (ITS) region sequences of B. nobilissimus were compared among individuals representing six populations and were found to have less than 2% variation. These sequences also were used to design dual and nested PCR primers for B. nobilissimus-specific amplification. Mitochondrial small-subunit rDNA sequences were used in a phylogenetic analysis that placed B. nobilissimus in the hymenochaetoid clade, where it was associated with Oxyporus and Schizopora.
First isolation of Actinobacillus genomospecies 2 in Japan.
Murakami, Miyuki; Shimonishi, Yoshimasa; Hobo, Seiji; Niwa, Hidekazu; Ito, Hiroya
2016-05-03
We describe here the first isolation of Actinobacillus genomospecies 2 in Japan. The isolate was found in a septicemic foal and characterized by phenotypic and genetic analyses, with the latter consisting of 16S rDNA nucleotide sequence analysis plus multilocus sequence analysis using three housekeeping genes, recN, rpoA and thdF, that have been proposed for use as a genomic tool in place of DNA-DNA hybridization.
Clonality and serotypes of Streptococcus mutans among children by multilocus sequence typing
Momeni, Stephanie S.; Whiddon, Jennifer; Cheon, Kyounga; Moser, Stephen A.; Childers, Noel K.
2015-01-01
Studies using multilocus sequence typing (MLST) have demonstrated that Streptococcus mutans isolates are genetically diverse. Our laboratory previously demonstrated clonality of S. mutans using MLST but could not discount the possibility of sampling bias. In this study, the clonality of randomly selected S. mutans plaque isolates from African American children was examined using MLST. Serotype and presence of collagen-binding proteins (CBP) cnm/cbm were also assessed. One hundred S. mutans isolates were randomly selected for MLST analysis. Sequence analysis was performed and phylogenetic trees were generated using START2 and MEGA. Thirty-four sequence types (ST) were identified of which 27 were unique to this population. Seventy-five percent of the isolates clustered into 16 clonal groups. Serotypes observed were c (n=84), e (n=3), and k (n=11). The prevalence of S. mutans isolates serotype k was notably high at 17.5%. All isolates were cnm/cbm negative. The clonality of S. mutans demonstrated in this study illustrates the importance of localized population studies and is consistent with transmission. The prevalence of serotype k, a recently proposed systemic pathogen, observed in this study is higher than reported in most populations and is the first report of S. mutans serotype k in a US population. PMID:26443288
Platonov, A E; Mironov, K O; Iatsyshina, S B; Koroleva, I S; Platonova, O V; Gushchin, A E; Shipulin, G A
2003-01-01
Haemophilus influenzae type b (Hib) bacteria were genotyped by multilocus sequence typing (MLST) using 5 loci (adk, fucK, mdh, pgi, recA). 42 Moscow Hib strains (including 38 isolates from cerebrospinal fluid of children who had purulent meningitis in 1999-2001, and 4 strains isolated from healthy carriers of Hib), as well as 2 strains from Yekaterinburg, were studied. In MLST a strain is characterized by its alleles and their combination (an allele profile), also referred to as a sequence type (ST). 9 STs were identified within the Russian Hib bacteria: ST-1 was found in 25 strains (57%), ST-12 was found in 8 strains (18%), ST-11 was found in 4 strains (9%) and ST-15 was found in 2 strains (4.5%); all other STs (13, 14, 16, 17, 51) were each found in a single strain (2.3%). A comparison of allelic profiles and of nucleotide sequences showed that 93% of Russian isolates, i.e. strains with ST-1, 11, 12, 13, 15 and 17, belong to one and the same clonal complex. 2 isolates from Norway and Sweden from among 7 foreign Hib strains studied up to now can be described as belonging to the same clonal complex; 5 Hib strains were different from the Russian ones.
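The allele-profile idea in this abstract can be sketched in a few lines of Python. This is only an illustration, not the authors' pipeline: the locus names come from the abstract, while the strain names, the allele numbers, and the helper functions `assign_sequence_types` and `st_frequencies` are hypothetical, and the ST labels are assigned arbitrarily in order of first appearance rather than taken from a curated MLST database.

```python
from collections import Counter

# Loci named in the abstract; every allele number below is invented for illustration.
LOCI = ("adk", "fucK", "mdh", "pgi", "recA")

def assign_sequence_types(profiles):
    """Label each distinct allele profile with an arbitrary ST number (first seen = 1)."""
    st_of_profile = {}
    st_of_strain = {}
    for strain, alleles in profiles.items():
        # The allele profile is the ordered combination of allele numbers over all loci.
        profile = tuple(alleles[locus] for locus in LOCI)
        if profile not in st_of_profile:
            st_of_profile[profile] = len(st_of_profile) + 1
        st_of_strain[strain] = st_of_profile[profile]
    return st_of_strain

def st_frequencies(st_of_strain):
    """Return the percentage of strains carrying each ST, rounded to one decimal."""
    counts = Counter(st_of_strain.values())
    total = len(st_of_strain)
    return {st: round(100 * n / total, 1) for st, n in counts.items()}

# Hypothetical strains: strain_C differs from the others at a single locus (fucK).
profiles = {
    "strain_A": {"adk": 1, "fucK": 1, "mdh": 2, "pgi": 1, "recA": 3},
    "strain_B": {"adk": 1, "fucK": 1, "mdh": 2, "pgi": 1, "recA": 3},
    "strain_C": {"adk": 1, "fucK": 4, "mdh": 2, "pgi": 1, "recA": 3},
    "strain_D": {"adk": 1, "fucK": 1, "mdh": 2, "pgi": 1, "recA": 3},
}
sts = assign_sequence_types(profiles)
freqs = st_frequencies(sts)
print(sts)
print(freqs)
```

Strains sharing every allele receive the same ST, and a change at any one locus yields a new ST; percentages such as those quoted in the abstract are then simple counts over the typed strains.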
Development of Multilocus Sequence Typing (MLST) for Mycoplasma synoviae.
El-Gazzar, Mohamed; Ghanem, Mostafa; McDonald, Kristina; Ferguson-Noel, Naola; Raviv, Ziv; Slemons, Richard D
2017-03-01
Mycoplasma synoviae (MS) is a poultry pathogen that has had an increasing incidence and economic impact over the past few years. Strain identification is necessary for outbreak investigation, infection source identification, and facilitating prevention and control as well as eradication efforts. Currently, a segment of the variable lipoprotein hemagglutinin A (vlhA) gene (420 bp) is the only target that is used for MS strain identification. A major limitation of this assay is that clonality of typed samples can only be inferred if their vlhA sequences are identical; however, if their sequences are different, the degree of relatedness is uncertain. In this study we propose a multilocus sequence typing (MLST) assay to further refine MS strain identification. After initial screening of 24 housekeeping genes as potential targets, seven genes were selected for the MLST assay. An internal segment (450-711 bp) from each of the seven genes was successfully amplified and sequenced from 58 different MS strains and field isolates (n = 30) or positive clinical samples (n = 28). The collective sequence of all seven gene segments (3960 bp total) was used for MS sequence typing. The 58 tested MS samples were typed into 30 different sequence types using the MLST assay and, coincidentally, all the samples were typed into 30 sequence types using the vlhA assay. However, the phylogenetic tree generated using the MLST data was more congruent to the epidemiologic information than was the tree generated by the vlhA assay. We suggest that the newly developed MLST assay and the vlhA assay could be used in tandem for MS typing. The MLST assay will be a valuable and more reliable tool for MS sequence typing, providing better understanding of the epidemiology of MS infection. This in turn will aid disease prevention, control, and eradication efforts.
USDA-ARS's Scientific Manuscript database
We explored the phylogenetic utility of entire plastid DNA sequences in Daucus and compared the results to prior phylogenetic results using plastid, nuclear, and mitochondrial DNA sequences. We obtained, using Illumina sequencing, full plastid sequences of 37 accessions of 20 Daucus taxa and outgrou...
Coipan, E Claudia; Jahfari, Setareh; Fonville, Manoj; Oei, G Anneke; Spanjaard, Lodewijk; Takumi, Katsuhisa; Hovius, Joppe W R; Sprong, Hein
2016-08-01
In this study we used typing based on the eight multilocus sequence typing scheme housekeeping genes (MLST) and 5S-23S rDNA intergenic spacer (IGS) to explore the population structure of Borrelia burgdorferi sensu lato isolates from patients with Lyme borreliosis (LB) and to test the association between the B. burgdorferi s.l. sequence types (ST) and the clinical manifestations they cause in humans. Isolates of B. burgdorferi from 183 LB cases across Europe, with distinct clinical manifestations, and 257 Ixodes ricinus lysates from The Netherlands, were analyzed for this study alone. For completeness, we also incorporated into our analysis 335 European B. burgdorferi s.l. MLST profiles retrieved from literature. Borrelia afzelii and Borrelia bavariensis were associated with human cases of LB while Borrelia garinii, Borrelia lusitaniae and Borrelia valaisiana were associated with questing I. ricinus ticks. B. afzelii was associated with acrodermatitis chronica atrophicans, while B. garinii and B. bavariensis were associated with neuroborreliosis. The samples in our study belonged to 251 different STs, of which 94 are newly described, adding to the overall picture of the genetic diversity of Borrelia genospecies. The fraction of STs that were isolated from human samples was significantly higher for the genospecies that are known to be maintained in enzootic cycles by mammals (B. afzelii, B. bavariensis, and Borrelia spielmanii) than for genospecies that are maintained by birds (B. garinii and B. valaisiana) or lizards (B. lusitaniae). We found six multilocus sequence types that were significantly associated with clinical manifestations in humans and five IGS haplotypes that were associated with the human LB cases. While IGS could perform just as well as the housekeeping genes in the MLST scheme for predicting the infectivity of B. burgdorferi s.l., the advantage of MLST is that it can also capture the differential invasiveness of the various STs.
Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Phylogenetic Analysis of Enterohemorrhagic Escherichia coli O157, Germany, 1987–2008
Jenke, Christian; Harmsen, Dag; Weniger, Thomas; Rothgänger, Jörg; Hyytiä-Trees, Eija; Bielaszewska, Martina; Karch, Helge
2010-01-01
Multilocus variable number tandem repeat analysis (MLVA) is a subtyping technique for characterizing human pathogenic bacteria such as enterohemorrhagic Escherichia coli (EHEC) O157. We determined the phylogeny of 202 epidemiologically unrelated EHEC O157:H7/H– clinical isolates through 8 MLVA loci obtained in Germany during 1987–2008. Biodiversity in the loci ranged from 0.66 to 0.90. Four of 8 loci showed null alleles and a frequency <44.1%. These loci were distributed among 48.5% of all strains. Overall, 141 MLVA profiles were identified. Phylogenetic analysis assigned 67.3% of the strains to 19 MLVA clusters. Specific MLVA profiles with an evolutionary persistence were identified, particularly within sorbitol-fermenting EHEC O157:H–. These pathogens belonged to the same MLVA cluster. Our findings indicate successful persistence of this clone. PMID:20350374
Phylogenetic analysis of enterohemorrhagic Escherichia coli O157, Germany, 1987-2008.
Jenke, Christian; Harmsen, Dag; Weniger, Thomas; Rothganger, Jorg; Hyytia-Trees, Eija; Bielaszewska, Martina; Karch, Helge; Mellmann, Alexander
2010-04-01
Multilocus variable number tandem repeat analysis (MLVA) is a subtyping technique for characterizing human pathogenic bacteria such as enterohemorrhagic Escherichia coli (EHEC) O157. We determined the phylogeny of 202 epidemiologically unrelated EHEC O157:H7/H- clinical isolates through 8 MLVA loci obtained in Germany during 1987-2008. Biodiversity in the loci ranged from 0.66 to 0.90. Four of 8 loci showed null alleles and a frequency ≤44.1%. These loci were distributed among 48.5% of all strains. Overall, 141 MLVA profiles were identified. Phylogenetic analysis assigned 67.3% of the strains to 19 MLVA clusters. Specific MLVA profiles with an evolutionary persistence were identified, particularly within sorbitol-fermenting EHEC O157:H-. These pathogens belonged to the same MLVA cluster. Our findings indicate successful persistence of this clone.
Glynou, Kyriaki; Ali, Tahir; Kia, Sevda Haghi; Thines, Marco; Maciá-Vicente, Jose G
2017-09-01
Studying community structure and dynamics of plant-associated fungi is the basis for unravelling their interactions with hosts and ecosystem functions. A recent sampling revealed that only a few fungal groups, as defined by internal transcribed spacer region (ITS) sequence similarity, dominate culturable root endophytic communities of nonmycorrhizal Microthlaspi spp. plants across Europe. Strains of these fungi display a broad phenotypic and functional diversity, which suggests a genetic variability masked by ITS clustering into operational taxonomic units (OTUs). The aims of this study were to identify how genetic similarity patterns of these fungi change across environments and to evaluate their ability to disperse and adapt to ecological conditions. A first ITS-based haplotype analysis of ten widespread OTUs mostly showed a low to moderate genotypic differentiation, with the exception of a group identified as Cadophora sp. that was highly diverse. A multilocus phylogeny based on additional genetic loci (partial translation elongation factor 1α, beta-tubulin and actin) and amplified fragment length polymorphism profiling of 185 strains representative of the five dominant OTUs revealed a weak association of genetic differences with geography and environmental conditions, including bioclimatic and soil factors. Our findings suggest that dominant culturable root endophytic fungi have efficient dispersal capabilities, and that their distribution is little affected by environmental filtering. Other processes, such as inter- and intraspecific biotic interactions, may be more important for the local assembly of their communities. © 2017 John Wiley & Sons Ltd.
Blanchard, Adam M; Jolley, Keith A; Maiden, Martin C J; Coffey, Tracey J; Maboni, Grazieli; Staley, Ceri E; Bollard, Nicola J; Warry, Andrew; Emes, Richard D; Davies, Peers L; Tötemeyer, Sabine
2018-01-01
Dichelobacter nodosus (D. nodosus) is the causative pathogen of ovine footrot, a disease that has a significant welfare and financial impact on the global sheep industry. Previous studies into the phylogenetics of D. nodosus have focused on Australia and Scandinavia, meaning the current diversity in the United Kingdom (U.K.) population, and its relationship to the global population, is poorly understood. Numerous epidemiological methods are available for bacterial typing; however, few account for whole genome diversity or provide the opportunity for future application of new computational techniques. Multilocus sequence typing (MLST) measures nucleotide variations within several loci with slow accumulation of variation to enable the designation of allele numbers to determine a sequence type. The usage of whole genome sequence data enables the application of MLST, but also core and whole genome MLST for higher levels of strain discrimination with a negligible increase in experimental cost. An MLST database was developed alongside a seven loci scheme using publicly available whole genome data from the sequence read archive. Sequence type designation and strain discrimination were compared to previously published data to ensure reproducibility. Multiple D. nodosus isolates from U.K. farms were directly compared to populations from other countries. The U.K. isolates define new clades within the global population of D. nodosus and predominantly consist of serogroups A, B and H; however, serogroups C, D, E, and I were also found. The scheme is publicly available at https://pubmlst.org/dnodosus/.
2014-01-01
Background Economically, Leuconostoc lactis is one of the most important species in the genus Leuconostoc. It plays an important role in the food industry including the production of dextrans and bacteriocins. Currently, traditional molecular typing approaches for characterisation of this species at the isolate level are either unavailable or are not sufficiently reliable for practical use. Multilocus sequence typing (MLST) is a robust and reliable method for characterising bacterial and fungal species at the molecular level. In this study, a novel MLST protocol was developed for 50 L. lactis isolates from Mongolia and China. Results Sequences from eight targeted genes (groEL, carB, recA, pheS, murC, pyrG, rpoB and uvrC) were obtained. Sequence analysis indicated 20 different sequence types (STs), with 13 of them being represented by a single isolate. Phylogenetic analysis based on the sequences of eight MLST loci indicated that the isolates belonged to two major groups, A (34 isolates) and B (16 isolates). Linkage disequilibrium analyses indicated that recombination occurred at a low frequency in L. lactis, indicating a clonal population structure. Split-decomposition analysis indicated that intraspecies recombination played a role in generating genotypic diversity amongst isolates. Conclusions Our results indicated that MLST is a valuable tool for typing L. lactis isolates that can be used for further monitoring of evolutionary changes and population genetics. PMID:24912963
A Molecular Phylogeny of Hemiptera Inferred from Mitochondrial Genome Sequences
Song, Nan; Liang, Ai-Ping; Bu, Cui-Ping
2012-01-01
Classically, Hemiptera is comprised of two suborders: Homoptera and Heteroptera. Homoptera includes Cicadomorpha, Fulgoromorpha and Sternorrhyncha. However, according to previous molecular phylogenetic studies based on 18S rDNA, Fulgoromorpha has a closer relationship to Heteroptera than to other hemipterans, leaving Homoptera as paraphyletic. Therefore, the position of Fulgoromorpha is important for studying phylogenetic structure of Hemiptera. We inferred the evolutionary affiliations of twenty-five superfamilies of Hemiptera using mitochondrial protein-coding genes and rRNAs. We sequenced three mitogenomes, from Pyrops candelaria, Lycorma delicatula and Ricania marginalis, representing two additional families in Fulgoromorpha. Pyrops and Lycorma are representatives of an additional major family Fulgoridae in Fulgoromorpha, whereas Ricania is a second representative of the highly derived clade Ricaniidae. The organization and size of these mitogenomes are similar to those of the sequenced fulgoroid species. Our consensus phylogeny of Hemiptera largely supported the relationships (((Fulgoromorpha,Sternorrhyncha),Cicadomorpha),Heteroptera), and thus supported the classic phylogeny of Hemiptera. Selection of optimal evolutionary models (exclusion and inclusion of two rRNA genes or of third codon positions of protein-coding genes) demonstrated that rapidly evolving and saturated sites should be removed from the analyses. PMID:23144967
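The parenthesized topology quoted in this abstract can be read mechanically. The sketch below is an illustration only and not part of the authors' analysis: `parse_topology`, `clades`, and `is_monophyletic` are hypothetical helper names, and the parser handles only bare nested parentheses with taxon names (no branch lengths or support values). It checks whether the three groups that the abstract assigns to Homoptera form a single clade in the reported topology.

```python
def parse_topology(text):
    """Parse a bare nested-parenthesis topology (names only) into nested tuples."""
    text = text.strip()
    pos = 0

    def node():
        nonlocal pos
        if text[pos] == "(":
            pos += 1  # consume "("
            children = [node()]
            while text[pos] == ",":
                pos += 1  # consume ","
                children.append(node())
            if text[pos] != ")":
                raise ValueError(f"expected ')' at position {pos}")
            pos += 1  # consume ")"
            return tuple(children)
        # Otherwise read a taxon name up to the next delimiter.
        start = pos
        while pos < len(text) and text[pos] not in "(),":
            pos += 1
        return text[start:pos].strip()

    tree = node()
    if pos != len(text):
        raise ValueError("trailing characters after tree")
    return tree

def clades(tree):
    """Collect the set of leaf names below every internal node."""
    found = []

    def leaves(node):
        if isinstance(node, str):
            return frozenset([node])
        under = frozenset().union(*(leaves(child) for child in node))
        found.append(under)
        return under

    leaves(tree)
    return found

def is_monophyletic(tree, group):
    """True if the taxa in `group` make up exactly one clade of `tree`."""
    return frozenset(group) in clades(tree)

# Topology exactly as written in the abstract.
tree = parse_topology("(((Fulgoromorpha,Sternorrhyncha),Cicadomorpha),Heteroptera)")
homoptera = {"Fulgoromorpha", "Sternorrhyncha", "Cicadomorpha"}
print(is_monophyletic(tree, homoptera))
print(is_monophyletic(tree, {"Fulgoromorpha", "Heteroptera"}))
```

In this topology the three homopteran groups sit together under one internal node, which is what the abstract means by supporting the classic phylogeny; a grouping of Fulgoromorpha with Heteroptera, as suggested by the earlier 18S rDNA studies, does not appear as a clade.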
Lee, I M; Bartoszyk, I M; Gundersen-Rindal, D E; Davis, R E
1997-07-01
A phylogenetic analysis by parsimony of 16S rRNA gene sequences (16S rDNA) revealed that species and subspecies of Clavibacter and Rathayibacter form a discrete monophyletic clade, paraphyletic to Corynebacterium species. Within the Clavibacter-Rathayibacter clade, four major phylogenetic groups (subclades) with a total of 10 distinct taxa were recognized: (I) species C. michiganensis; (II) species C. xyli; (III) species R. iranicus and R. tritici; and (IV) species R. rathayi. The first three groups form a monophyletic cluster, paraphyletic to R. rathayi. On the basis of the phylogeny inferred, reclassification of members of Clavibacter-Rathayibacter group is proposed. A system for classification of taxa in Clavibacter and Rathayibacter was developed based on restriction fragment length polymorphism (RFLP) analysis of the PCR-amplified 16S rDNA sequences. The groups delineated on the basis of RFLP patterns of 16S rDNA coincided well with the subclades delineated on the basis of phylogeny. In contrast to previous classification systems, which are based primarily on phenotypic properties and are laborious, the RFLP analyses allow for rapid differentiation among species and subspecies in the two genera.
Ekaphan Kraichak; Sittiporn Parnmen; Robert Lücking; Eimy Rivas Plata; Andre Aptroot; Marcela E.S. Caceres; Damien Ertz; Armin Mangold; Joel A. Mercado-Diaz; Khwanruan Papong; Dries Van der Broeck; Gothamie Weerakoon; H. Thorsten Lumbsch
2014-01-01
We present an updated 3-locus molecular phylogeny of tribe Ocellularieae, the second largest tribe within subfamily Graphidoideae in the Graphidaceae. Adding 165 newly generated sequences from the mitochondrial small subunit rDNA (mtSSU), the nuclear large subunit rDNA (nuLSU), and the second largest subunit of the DNA-directed RNA polymerase II (RPB2), we currently...
mec-associated dru typing in the epidemiological analysis of ST239 MRSA in Malaysia.
Ghaznavi-Rad, E; Goering, R V; Nor Shamsudin, M; Weng, P L; Sekawi, Z; Tavakol, M; van Belkum, A; Neela, V
2011-11-01
The usefulness of mec-associated dru typing in the epidemiological analysis of methicillin-resistant Staphylococcus aureus (MRSA) isolated in Malaysia was investigated and compared with pulsed-field gel electrophoresis (PFGE), multilocus sequence typing (MLST), and spa and SCCmec typing. The isolates studied included all MRSA types in Malaysia. Multilocus sequence type ST188 and ST1 isolates were highly clonal by all typing methods. However, the dru typing of ST239 isolates produced the clearest discrimination between SCCmec IIIa and III isolates, yielding more subtypes than any other method. Evaluation of the discriminatory power for each method identified dru typing and PFGE as the most discriminatory, with Simpson's index of diversity (SID) values over 89%, including an isolate which was non-typeable by spa, but dru-typed as dt13j. The discriminatory ability of dru typing, especially with closely related MRSA ST239 strains (e.g., Brazilian and Hungarian), underscores its utility as a tool for the epidemiological investigation of MRSA.
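Discriminatory power of a typing method is commonly summarized with Simpson's index of diversity. The sketch below uses the Hunter-Gaston form, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), where n_j is the number of isolates of type j and N is the total; the abstract does not say which variant the authors computed, so this choice is an assumption, and the type labels are invented for illustration.

```python
from collections import Counter

def simpsons_index_of_diversity(type_labels):
    """Hunter-Gaston form of Simpson's index for a list of per-isolate type labels."""
    n = len(type_labels)
    if n < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(type_labels).values()
    # Probability that two isolates drawn without replacement share a type.
    same_type = sum(c * (c - 1) for c in counts) / (n * (n - 1))
    return 1 - same_type

# Hypothetical results for the same 10 isolates typed by two methods.
coarse_method = ["t1"] * 6 + ["t2"] * 4
fine_method = ["a", "a", "b", "b", "c", "c", "d", "d", "e", "f"]

sid_coarse = simpsons_index_of_diversity(coarse_method)
sid_fine = simpsons_index_of_diversity(fine_method)
print(f"coarse: {sid_coarse:.3f}, fine: {sid_fine:.3f}")
```

A method that splits the same isolates into more, smaller groups scores closer to 1, which is the sense in which the abstract ranks dru typing and PFGE as the most discriminatory.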
Suchan, Tomasz; Espíndola, Anahí; Rutschmann, Sereina; Emerson, Brent C; Gori, Kevin; Dessimoz, Christophe; Arrigo, Nils; Ronikier, Michał; Alvarez, Nadir
2017-09-01
Determining phylogenetic relationships among recently diverged species has long been a challenge in evolutionary biology. Cytoplasmic DNA markers, which have been widely used, notably in the context of molecular barcoding, have not always proved successful in resolving such phylogenies. However, with the advent of next-generation-sequencing technologies and associated techniques of reduced genome representation, phylogenies of closely related species have been resolved at a much higher detail in the last couple of years. Here we examine the potential and limitations of one such technique, Restriction-site Associated DNA (RAD) sequencing, a method that produces thousands of (mostly) anonymous nuclear markers, in disentangling the phylogeny of the fly genus Chiastocheta (Diptera: Anthomyiidae). In Europe, this genus encompasses seven species of seed predators, which have been widely studied in the context of their ecological and evolutionary interactions with the plant Trollius europaeus (Ranunculaceae). So far, phylogenetic analyses using mitochondrial markers failed to resolve monophyly of most of the species from this recently diversified genus, suggesting that their taxonomy may need a revision. However, relying on a single, non-recombining marker and ignoring potential incongruences between mitochondrial and nuclear loci may provide an incomplete account of the lineage history. In this study, we applied both classical Sanger sequencing of three mtDNA regions and RAD-sequencing, for reconstructing the phylogeny of the genus. Contrasting with results based on mitochondrial markers, RAD-sequencing analyses retrieved the monophyly of all seven species, in agreement with the morphological species assignment. We found robust nuclear-based species assignment of individual samples, and low levels of estimated contemporary gene flow among them.
However, despite recovering species' monophyly, interspecific relationships varied depending on the set of RAD loci considered, producing contradictory topologies. Moreover, coalescence-based phylogenetic analyses revealed low supports for most of the interspecific relationships. Our results indicate that despite the higher performance of RAD-sequencing in terms of species trees resolution compared to cytoplasmic markers, reconstructing inter-specific relationships among recently-diverged lineages may lie beyond the possibilities offered by large sets of RAD-sequencing markers in cases of strong gene tree incongruence. Copyright © 2017 Elsevier Inc. All rights reserved.
Enterohemorrhagic Escherichia coli as Causes of Hemolytic Uremic Syndrome in the Czech Republic
Marejková, Monika; Bláhová, Květa; Janda, Jan; Fruth, Angelika; Petráš, Petr
2013-01-01
Background Enterohemorrhagic Escherichia coli (EHEC) cause diarrhea-associated hemolytic uremic syndrome (D+ HUS) worldwide, but no systematic study of EHEC as the causative agents of HUS was performed in the Czech Republic. We analyzed stools of all patients with D+ HUS in the Czech Republic between 1998 and 2012 for evidence of EHEC infection. We determined virulence profiles, phenotypes, antimicrobial susceptibilities and phylogeny of the EHEC isolates. Methodology/Principal Findings Virulence loci were identified using PCR, phenotypes and antimicrobial susceptibilities were determined using standard procedures, and phylogeny was assessed using multilocus sequence typing. During the 15-year period, EHEC were isolated from stools of 39 (69.4%) of 56 patients. The strains belonged to serotypes [fliC types] O157:H7/NM[fliC H7] (50% of which were sorbitol-fermenting; SF), O26:H11/NM[fliC H11], O55:NM[fliC H7], O111:NM[fliC H8], O145:H28[fliC H28], O172:NM[fliC H25], and Orough:NM[fliC H25]. O26:H11/NM[fliC H11] was the most common serotype associated with HUS (41% isolates). Five stx genotypes were identified, the most frequent being stx 2a (71.1% isolates). Most strains contained EHEC-hlyA encoding EHEC hemolysin, and a subset (all SF O157:NM and one O157:H7) harbored cdt-V encoding cytolethal distending toxin. espPα encoding serine protease EspPα was found in EHEC O157:H7, O26:H11/NM, and O145:H28, whereas O172:NM and Orough:NM strains contained espPγ. All isolates contained eae encoding adhesin intimin, which belonged to subtypes β (O26), γ (O55, O145, O157), γ2/θ (O111), and ε (O172, Orough). Loci encoding other adhesins (efa1, lpfA O26, lpfA O157OI-141, lpfA O157OI-154, iha) were usually associated with particular serotypes. Phylogenetic analysis demonstrated nine sequence types (STs) which correlated with serotypes. Of these, two STs (ST660 and ST1595) were not found in HUS-associated EHEC before. 
Conclusions/Significance EHEC strains, including O157:H7 and non-O157:H7, are frequent causes of D+ HUS in the Czech Republic. Identification of unusual EHEC serotypes/STs causing HUS calls for establishment of a European collection of HUS-associated EHEC, enabling the study of the properties and evolution of these important pathogens. PMID:24040117
Larmuseau, Maarten H D; Van Geystelen, Anneleen; Kayser, Manfred; van Oven, Mannis; Decorte, Ronny
2015-03-01
Currently, several different Y-chromosomal phylogenies and haplogroup nomenclatures are presented in scientific literature and at conferences demonstrating the present diversity in Y-chromosomal phylogenetic trees and Y-SNP sets used within forensic and anthropological research. This situation can be ascribed to the exponential growth of the number of Y-SNPs discovered due to mostly next-generation sequencing (NGS) studies. As Y-SNPs and their respective phylogenetic positions are important in forensics, such as for male lineage characterization and paternal bio-geographic ancestry inference, there is a need for forensic geneticists to know how to deal with these newly identified Y-SNPs and phylogenies, especially since these phylogenies are often created with other aims than to carry out forensic genetic research. Therefore, we give here an overview of four categories of currently used Y-chromosomal phylogenies and the associated Y-SNP sets in scientific research in the current NGS era. We compare these categories based on the construction method, their advantages and disadvantages, the disciplines wherein the phylogenetic tree can be used, and their specific relevance for forensic geneticists. Based on this overview, it is clear that an up-to-date reduced tree with a consensus Y-SNP set and a stable nomenclature will be the most appropriate reference resource for forensic research. Initiatives to reach such an international consensus are therefore highly recommended. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Dowie, Nicholas J; Grubisha, Lisa C; Burton, Brent A; Klooster, Matthew R; Miller, Steven L
2017-01-01
Rhizopogon species are ecologically significant ectomycorrhizal fungi in conifer ecosystems. The importance of this system merits the development and utilization of a more robust set of molecular markers specifically designed to evaluate their evolutionary ecology. Anonymous nuclear loci (ANL) were developed for R. subgenus Amylopogon. Members of this subgenus occur throughout the United States and are exclusive fungal symbionts associated with Pterospora andromedea, a threatened mycoheterotrophic plant endemic to disjunct eastern and western regions of North America. Candidate ANL were developed from 454 shotgun pyrosequencing and assessed for positive amplification across targeted species, sequencing success, and recovery of phylogenetically informative sites. Ten ANL were successfully developed and were subsequently used to sequence representative taxa, herbaria holotype and paratype specimens in R. subgenus Amylopogon. Phylogenetic reconstructions were performed on individual and concatenated data sets by Bayesian inference and maximum likelihood methods. Phylogenetic analyses of these 10 ANL were compared with a phylogeny traditionally constructed using the universal fungal barcode nuc rDNA ITS1-5.8S-ITS2 region (ITS). The resulting ANL phylogeny was consistent with most of the species designations delineated by ITS. However, the ANL phylogeny provided much greater phylogenetic resolution, yielding new evidence for cryptic species within previously defined species of R. subgenus Amylopogon. Additionally, the rooted ANL phylogeny provided an alternate topology to the ITS phylogeny, which inferred a novel set of evolutionary relationships not identified in prior phylogenetic studies.
Large-scale phylogenomic analysis resolves a backbone phylogeny in ferns.
Shen, Hui; Jin, Dongmei; Shu, Jiang-Ping; Zhou, Xi-Le; Lei, Ming; Wei, Ran; Shang, Hui; Wei, Hong-Jin; Zhang, Rui; Liu, Li; Gu, Yu-Feng; Zhang, Xian-Chun; Yan, Yue-Hong
2018-02-01
Ferns, which originated about 360 million years ago, are the sister group of seed plants. Despite the remarkable progress in our understanding of fern phylogeny, with conflicting molecular evidence and different morphological interpretations, relationships among major fern lineages remain controversial. With the aim to obtain a robust fern phylogeny, we carried out a large-scale phylogenomic analysis using high-quality transcriptome sequencing data, which covered 69 fern species from 38 families and 11 orders. Both coalescent-based and concatenation-based methods were applied to both nucleotide and amino acid sequences in species tree estimation. The resulting topologies are largely congruent with each other, except for the placement of Angiopteris fokiensis, Cheiropleuria bicuspis, Diplaziopsis brunoniana, Matteuccia struthiopteris, Elaphoglossum mcclurei, and Tectaria subpedata. Our result confirmed that Equisetales is sister to the rest of ferns, and Dennstaedtiaceae is sister to eupolypods. Moreover, our result strongly supported some relationships different from the current view of fern phylogeny, including that Marattiaceae may be sister to the monophyletic clade of Psilotaceae and Ophioglossaceae; that Gleicheniaceae and Hymenophyllaceae form a monophyletic clade sister to Dipteridaceae; and that Aspleniaceae is sister to the rest of the groups in eupolypods II. These results were interpreted with morphological traits, especially sporangia characters, and a new evolutionary route of sporangial annulus in ferns was suggested. This backbone phylogeny in ferns sets a foundation for further studies in biology and evolution in ferns, and therefore in plants. © The Authors 2017. Published by Oxford University Press.
Large-scale phylogenomic analysis resolves a backbone phylogeny in ferns
Shen, Hui; Jin, Dongmei; Shu, Jiang-Ping; Zhou, Xi-Le; Lei, Ming; Wei, Ran; Shang, Hui; Wei, Hong-Jin; Zhang, Rui; Liu, Li; Gu, Yu-Feng; Zhang, Xian-Chun; Yan, Yue-Hong
2018-01-01
Abstract Background Ferns, which originated about 360 million years ago, are the sister group of seed plants. Despite remarkable progress in our understanding of fern phylogeny, relationships among major fern lineages remain controversial because of conflicting molecular evidence and different morphological interpretations. Results With the aim of obtaining a robust fern phylogeny, we carried out a large-scale phylogenomic analysis using high-quality transcriptome sequencing data, which covered 69 fern species from 38 families and 11 orders. Both coalescent-based and concatenation-based methods were applied to both nucleotide and amino acid sequences in species tree estimation. The resulting topologies are largely congruent with each other, except for the placement of Angiopteris fokiensis, Cheiropleuria bicuspis, Diplaziopsis brunoniana, Matteuccia struthiopteris, Elaphoglossum mcclurei, and Tectaria subpedata. Conclusions Our results confirmed that Equisetales is sister to the rest of ferns, and Dennstaedtiaceae is sister to eupolypods. Moreover, our results strongly supported some relationships different from the current view of fern phylogeny, including that Marattiaceae may be sister to the monophyletic clade of Psilotaceae and Ophioglossaceae; that Gleicheniaceae and Hymenophyllaceae form a monophyletic clade sister to Dipteridaceae; and that Aspleniaceae is sister to the rest of the groups in eupolypods II. These results were interpreted with morphological traits, especially sporangia characters, and a new evolutionary route of sporangial annulus in ferns was suggested. This backbone phylogeny in ferns sets a foundation for further studies in biology and evolution in ferns, and therefore in plants. PMID:29186447
A fungal phylogeny based on 42 complete genomes derived from supertree and combined gene analysis
Fitzpatrick, David A; Logue, Mary E; Stajich, Jason E; Butler, Geraldine
2006-01-01
Background To date, most fungal phylogenies have been derived from single gene comparisons, or from concatenated alignments of a small number of genes. The increase in fungal genome sequencing presents an opportunity to reconstruct evolutionary events using entire genomes. As a tool for future comparative, phylogenomic and phylogenetic studies, we used both supertrees and concatenated alignments to infer relationships between 42 species of fungi for which complete genome sequences are available. Results A dataset of 345,829 genes was extracted from 42 publicly available fungal genomes. Supertree methods were employed to derive phylogenies from 4,805 single gene families. We found that the average consensus supertree method may suffer from long-branch attraction artifacts, while matrix representation with parsimony (MRP) appears to be immune to these. A genome phylogeny was also reconstructed from a concatenated alignment of 153 universally distributed orthologs. Our MRP supertree and concatenated phylogeny are highly congruent. Within the Ascomycota, the sub-phyla Pezizomycotina and Saccharomycotina were resolved. Both phylogenies infer that the Leotiomycetes are the closest sister group to the Sordariomycetes. There is some ambiguity regarding the placement of Stagonospora nodorum, the sole member of the class Dothideomycetes present in the dataset. Within the Saccharomycotina, a monophyletic clade containing organisms that translate CTG as serine instead of leucine is evident. There is also strong support for two groups within the CTG clade, one containing the fully sexual species Candida lusitaniae, Candida guilliermondii and Debaryomyces hansenii, and the second group containing Candida albicans, Candida dubliniensis, Candida tropicalis, Candida parapsilosis and Lodderomyces elongisporus. The second major clade within the Saccharomycotina contains species whose genomes have undergone a whole genome duplication (WGD), and their close relatives. 
We could not confidently resolve whether Candida glabrata or Saccharomyces castellii lies at the base of the WGD clade. Conclusion We have constructed robust phylogenies for fungi based on whole genome analysis. Overall, our phylogenies provide strong support for the classification of phyla, sub-phyla, classes and orders. We have resolved the relationship of the classes Leotiomycetes and Sordariomycetes, and have identified two classes within the CTG clade of the Saccharomycotina that may correlate with sexual status. PMID:17121679
Allnutt, T R; Roper, K; Henry, C
2008-01-23
A genetic marker system based on the S1 Short Interspersed Elements (SINEs) in the important commercial crop, oilseed rape (Brassica napus L.) has been developed. SINEs provided a successful multilocus, dominant marker system that was capable of clearly delineating winter- and spring-type crop varieties. Sixteen of 20 varieties tested showed unique profiles from the 17 polymorphic SINE markers generated. The 3' or 5' flanking regions of nine SINE markers were cloned, and DNA was sequenced. In addition, one putative pre-transposition SINE allele was cloned and sequenced. Two SINE flanking sequences were used to design real-time PCR assays. These quantitative SINE assays were applied to study the genetic structure of eight fields of oilseed rape crops. Studied fields were more genetically diverse than expected for the chosen loci (mean H_T = 0.23). The spatial distribution of SINE marker frequencies was highly structured in some fields, suggesting locations of volunteer impurities within the crop. In one case, the assay identified a mislabeling of the crop variety. SINE markers were a useful tool for crop genetics, phylogenetics, variety identification, and purity analysis. The use and further application of quantitative, real-time PCR markers are discussed.
Wang, Liyan; Ma, Lina; Liu, Yongan; Gao, Pengcheng; Li, Youquan; Li, Xuerui; Liu, Yongsheng
2016-10-01
Haemophilus parasuis is the etiological agent of Glässer's disease, which causes high morbidity and mortality in swine herds. Although H. parasuis strains can be classified into 15 serovars with the Kielstein-Rapp-Gabrielson serotyping scheme, a large number of isolates cannot be classified and have been designated 'nontypeable' strains. In this study, multilocus sequence typing (MLST) of H. parasuis was used to analyze 48 H. parasuis field strains isolated in China and two strains from Australia. Twenty-six new alleles and 29 new sequence types (STs) were detected, enriching the H. parasuis MLST databases. A BURST analysis indicated that H. parasuis lacks a stable population structure and is highly heterogeneous, and that there is no association between STs and geographic area. When a UPGMA dendrogram was constructed, two major clades, clade A and clade B, were defined. Animal experiments, in which guinea pigs were challenged intraperitoneally with the bacterial isolates, supported the hypothesis that the H. parasuis STs in clade A are generally avirulent or weakly virulent, whereas the STs in clade B tend to be virulent. Copyright © 2016 Elsevier B.V. All rights reserved.
Johnson, Jennifer K.; Arduino, Sonia M.; Stine, O. Colin; Johnson, Judith A.; Harris, Anthony D.
2007-01-01
For hospital epidemiologists, determining a system of typing that is discriminatory is essential for measuring the effectiveness of infection control measures. In situations in which the incidence of resistant Pseudomonas aeruginosa is increasing, the ability to discern whether it is due to patient-to-patient transmission versus an increase in patient endogenous strains is often made on the basis of molecular typing. The present study compared the discriminatory abilities of pulsed-field gel electrophoresis (PFGE) and multilocus sequence typing (MLST) for 90 P. aeruginosa isolates obtained from cultures of perirectal surveillance swabs from patients in an intensive care unit. PFGE identified 85 distinct types and 76 distinct groups when similarity cutoffs of 100% and 87%, respectively, were used. By comparison, MLST identified 60 sequence types that could be clustered into 11 clonal complexes and 32 singletons. By using the Simpson index of diversity (D), PFGE had a greater discriminatory ability than MLST for P. aeruginosa isolates (D values, 0.999 versus 0.975, respectively). Thus, while MLST was better for detecting genetic relatedness, we determined that PFGE was more discriminatory than MLST for determining genetic differences in P. aeruginosa. PMID:17881548
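The Simpson index of diversity used to compare the two typing methods can be illustrated with a short sketch. It assumes the Hunter-Gaston form of the index, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), which is the form commonly used for typing schemes; the abstract does not state which form was applied. The isolate labels below are hypothetical (90 isolates split into 80 singletons and 5 pairs, giving 85 types) and are not the study's data.

```python
from collections import Counter


def simpson_diversity(type_labels):
    """Simpson's index of diversity (Hunter-Gaston form, an assumption here).

    type_labels holds one type label per isolate. The result is the
    probability that two isolates drawn without replacement belong to
    different types.
    """
    counts = Counter(type_labels).values()
    n = sum(counts)
    if n < 2:
        raise ValueError("need at least two isolates")
    return 1.0 - sum(c * (c - 1) for c in counts) / (n * (n - 1))


# Hypothetical labels: 80 types seen once and 5 types seen twice.
labels = [f"T{i}" for i in range(80)] + [f"P{i}" for i in range(5) for _ in range(2)]
d = simpson_diversity(labels)
print(round(d, 3))
```

Under this assumed distribution the index rounds to the same three decimals as the PFGE value quoted in the abstract, but the real type frequencies may well have differed.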
Genotyping of Indian antigenic, vaccine, and field Brucella spp. using multilocus sequence typing.
Shome, Rajeswari; Krithiga, Natesan; Shankaranarayana, Padmashree B; Jegadesan, Sankarasubramanian; Udayakumar S, Vishnu; Shome, Bibek Ranjan; Saikia, Girin Kumar; Sharma, Narendra Kumar; Chauhan, Harshad; Chandel, Bharat Singh; Jeyaprakash, Rajendhran; Rahman, Habibur
2016-03-31
Brucellosis is one of the most important zoonotic diseases that affects multiple livestock species and causes great economic losses. The highly conserved genomes of Brucella, with > 90% homology among species, make it important to study the genetic diversity circulating in the country. A total of 26 Brucella spp. (4 reference strains and 22 field isolates) and 1 B. melitensis draft genome sequence from India (B. melitensis Bm IND1) were included for sequence typing. The field isolates were identified by biochemical tests and confirmed by both conventional and quantitative polymerase chain reaction (qPCR) targeting the bcsp31 Brucella genus-specific marker. Brucella speciation and biotyping were done by Bruce-ladder, probe qPCR, and AMOS PCRs, and genotyping was done by multilocus sequence typing (MLST). The MLST typing of 27 Brucella spp. revealed five distinct sequence types (STs); the B. abortus S99 reference strain and 21 B. abortus field isolates belonged to ST1. On the other hand, the vaccine strain B. abortus S19 was genotyped as ST5. Similarly, the B. melitensis 16M reference strain and one B. melitensis field isolate were grouped into ST7. Another B. melitensis field isolate belonged to ST8 (draft genome sequence from India), and only the B. suis 1330 reference strain was found to be ST14. The sequences revealed genetic similarity of the Indian strains to the global reference and field strains. The study highlights the usefulness of MLST for typing of field isolates and validation of reference strains used for diagnosis and vaccination against brucellosis.
Teaching the process of molecular phylogeny and systematics: a multi-part inquiry-based exercise.
Lents, Nathan H; Cifuentes, Oscar E; Carpi, Anthony
2010-01-01
Three approaches to molecular phylogenetics are demonstrated to biology students as they explore molecular data from Homo sapiens and four related primates. By analyzing DNA sequences, protein sequences, and chromosomal maps, students are repeatedly challenged to develop hypotheses regarding the ancestry of the five species. Although these exercises were designed to supplement and enhance classroom instruction on phylogeny, cladistics, and systematics in the context of a postsecondary majors-level introductory biology course, the activities themselves require very little prior student exposure to these topics. Thus, they are well suited for students in a wide range of educational levels, including a biology class at the secondary level. In implementing this exercise, we have observed measurable gains, both in student comprehension of molecular phylogeny and in their acceptance of modern evolutionary theory. By engaging students in modern phylogenetic activities, these students better understood how biologists are currently using molecular data to develop a more complete picture of the shared ancestry of all living things.
Rapid Detection & Identification of Bacillus Species using MALDI-TOF/TOF and Biomarker Database
2006-06-01
rRNA sequence analysis. Multilocus enzyme electrophoresis (MEE) and comparative DNA sequence analysis suggest that they may represent a single species...adaptation of the MEE method [63] but with greater discrimination [64]. All of these new PCR-based subtyping methods are certainly superior and more...Demirev, P.A., Lin, J.S., Pineda, F.J., and Fenselau, C. (2001). Bioinformatics and mass spectrometry for microorganism identification: proteome-wide
First isolation of Actinobacillus genomospecies 2 in Japan
MURAKAMI, Miyuki; SHIMONISHI, Yoshimasa; HOBO, Seiji; NIWA, Hidekazu; ITO, Hiroya
2015-01-01
We describe here the first isolation of Actinobacillus genomospecies 2 in Japan. The isolate was found in a septicemic foal and characterized by phenotypic and genetic analyses, with the latter consisting of 16S rDNA nucleotide sequence analysis plus multilocus sequence analysis using three housekeeping genes, recN, rpoA and thdF, that have been proposed for use as a genomic tool in place of DNA-DNA hybridization. PMID:26668165
Mel-36 – preliminary description of a new morel species
USDA-ARS's Scientific Manuscript database
A pilot survey of true morels (Morchella) of Newfoundland and Labrador (NL), employing phylogenetic analyses of multilocus DNA sequence data, resulted in the discovery of a novel species that is currently only known from NL and New Brunswick. This unnamed species was informally designated Morchella ...
Trichoderma asperellum reconsidered: two cryptic species
USDA-ARS's Scientific Manuscript database
Analysis of a world-wide collection of strains of Trichoderma asperellum using multilocus genealogies of four genomic regions (tef1, rpb2, act, ITS1, 2, 5.8s), sequence polymorphism-derived (SPD) markers, matrix-assisted laser desorption/ionisation–time of flight mass spectrometry (MALDI-TOF MS) of ...
Shajitha, P P; Dhanesh, N R; Ebin, P J; Laly, Joseph; Aneesha, Devassy; Reshma, John; Augustine, Jomy; Linu, Mathew
2016-12-01
Only a few Impatiens spp. from South India (one of the five centers of diversity for Impatiens species) were included in the published molecular phylogenies of the family Balsaminaceae. The present investigation is a novel attempt to reveal the phylogenetic association of Impatiens species of South India, by placing them in the global phylogeny of Impatiens based on a combined analysis of two chloroplast genes. Thirty species of genus Impatiens were collected from different locations of South India. Total genomic DNA was extracted from fresh leaves, and polymerase chain reaction was carried out using atpB-rbcL and trnL-F intergenic spacer-specific forward and reverse primers. Thirteen sequences of Impatiens species from three centers of diversity were obtained from GenBank for reconstructing the evolutionary relationships within the genus Impatiens. Bayesian inference analysis was carried out in MrBayes v.3.2.2. This analysis supported Southeast Asia as the ancestral place of origin of extant Impatiens species. Molecular phylogeny of South Indian Impatiens spp. based on combined chloroplast sequences showed the same association as that of morphological taxonomy. Sections Scapigerae, Tomentosae, Sub-Umbellatae, and Racemosae showed Southeast Asian relationship, while sections Annuae and Microsepalae showed African affinity.
Generalized Buneman Pruning for Inferring the Most Parsimonious Multi-state Phylogeny
NASA Astrophysics Data System (ADS)
Misra, Navodit; Blelloch, Guy; Ravi, R.; Schwartz, Russell
Accurate reconstruction of phylogenies remains a key challenge in evolutionary biology. Most biologically plausible formulations of the problem are formally NP-hard, with no known efficient solution. The standard in practice are fast heuristic methods that are empirically known to work very well in general, but can yield results arbitrarily far from optimal. Practical exact methods, which yield exponential worst-case running times but generally much better times in practice, provide an important alternative. We report progress in this direction by introducing a provably optimal method for the weighted multi-state maximum parsimony phylogeny problem. The method is based on generalizing the notion of the Buneman graph, a construction key to efficient exact methods for binary sequences, so as to apply to sequences with arbitrary finite numbers of states with arbitrary state transition weights. We implement an integer linear programming (ILP) method for the multi-state problem using this generalized Buneman graph and demonstrate that the resulting method is able to solve data sets that are intractable by prior exact methods in run times comparable with popular heuristics. Our work provides the first method for provably optimal maximum parsimony phylogeny inference that is practical for multi-state data sets of more than a few characters.
Kumar, S; Gadagkar, S R
2000-12-01
The neighbor-joining (NJ) method is widely used in reconstructing large phylogenies because of its computational speed and the high accuracy in phylogenetic inference as revealed in computer simulation studies. However, most computer simulation studies have quantified the overall performance of the NJ method in terms of the percentage of branches inferred correctly or the percentage of replications in which the correct tree is recovered. We have examined other aspects of its performance, such as the relative efficiency in correctly reconstructing shallow (close to the external branches of the tree) and deep branches in large phylogenies; the contribution of zero-length branches to topological errors in the inferred trees; and the influence of increasing the tree size (number of sequences), evolutionary rate, and sequence length on the efficiency of the NJ method. Results show that the correct reconstruction of deep branches is no more difficult than that of shallower branches. The presence of zero-length branches in realized trees contributes significantly to the overall error observed in the NJ tree, especially in large phylogenies or slowly evolving genes. Furthermore, the tree size does not influence the efficiency of NJ in reconstructing shallow and deep branches in our simulation study, in which the evolutionary process is assumed to be homogeneous in all lineages.
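For readers unfamiliar with the method under evaluation, the pair-selection step at the core of neighbor-joining can be sketched as follows. This is only the standard Q-criterion, Q(i, j) = (n - 2) d(i, j) - Σ_k d(i, k) - Σ_k d(j, k), applied once to a small distance matrix; it is not the simulation framework used in the study, and the five-taxon matrix is a hypothetical textbook-style example.

```python
from itertools import combinations


def nj_pick_pair(dist):
    """Return the pair of taxa minimizing the neighbor-joining Q-criterion.

    dist is a symmetric matrix (list of lists) of pairwise distances with
    zeros on the diagonal. Ties are resolved in favor of the first pair
    encountered.
    """
    n = len(dist)
    row_sums = [sum(row) for row in dist]
    best_pair, best_q = None, None
    for i, j in combinations(range(n), 2):
        q = (n - 2) * dist[i][j] - row_sums[i] - row_sums[j]
        if best_q is None or q < best_q:
            best_pair, best_q = (i, j), q
    return best_pair, best_q


# Hypothetical distances among five taxa (indices 0..4).
dist = [
    [0, 5, 9, 9, 8],
    [5, 0, 10, 10, 9],
    [9, 10, 0, 8, 7],
    [9, 10, 8, 0, 3],
    [8, 9, 7, 3, 0],
]
pair, q = nj_pick_pair(dist)
print(pair, q)
```

A full NJ run would join the selected pair into a new node, recompute distances to that node, and repeat until the tree is resolved; those steps are omitted here.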
Variance to mean ratio, R(t), for Poisson processes on phylogenetic trees.
Goldman, N
1994-09-01
The ratio of expected variance to mean, R(t), of numbers of DNA base substitutions for contemporary sequences related by a "star" phylogeny is widely seen as a measure of the adherence of the sequences' evolution to a Poisson process with a molecular clock, as predicted by the "neutral theory" of molecular evolution under certain conditions. A number of estimators of R(t) have been proposed, all predicted to have mean 1 and distributions based on the χ². Various genes have previously been analyzed and found to have values of R(t) far in excess of 1, calling into question important aspects of the neutral theory. In this paper, I use Monte Carlo simulation to show that the previously suggested means and distributions of estimators of R(t) are highly inaccurate. The analysis is applied to star phylogenies and to general phylogenetic trees, and well-known gene sequences are reanalyzed. For star phylogenies the results show that Kimura's estimators ("The Neutral Theory of Molecular Evolution," Cambridge Univ. Press, Cambridge, 1983) are unsatisfactory for statistical testing of R(t), but confirm the accuracy of Bulmer's correction factor (Genetics 123: 615-619, 1989). For all three nonstar phylogenies studied, attained values of all three estimators of R(t), although larger than 1, are within their true confidence limits under simple Poisson process models. This shows that lineage effects can be responsible for high estimates of R(t), restoring some limited confidence in the molecular clock and showing that the distinction between lineage and molecular clock effects is vital. (ABSTRACT TRUNCATED AT 250 WORDS)
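The quantity at issue, a variance-to-mean ratio of substitution counts, can be made concrete with a naive sketch. The function below simply divides the sample variance by the sample mean of per-lineage substitution counts; it is not Kimura's estimator, nor does it apply Bulmer's correction, and the counts are hypothetical values chosen only to show the arithmetic.

```python
from statistics import mean, variance


def dispersion_index(substitution_counts):
    """Naive variance-to-mean ratio of substitution counts.

    Under a simple Poisson process the expected value is close to 1;
    values far above 1 indicate overdispersion. Uses the sample
    (n - 1) variance.
    """
    m = mean(substitution_counts)
    if m == 0:
        raise ValueError("mean substitution count is zero")
    return variance(substitution_counts) / m


# Hypothetical substitution counts on four lineages of a star phylogeny.
counts = [4, 6, 5, 5]
r = dispersion_index(counts)
print(r)
```

With only four lineages the ratio is extremely noisy, which is in the spirit of the abstract's point that the sampling distribution of such estimators needs to be characterized by simulation before testing against 1.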
Karim, Md Robiul; Wang, Rongjun; Yu, Fuchang; Li, Tongyi; Dong, Haiju; Li, Dezhong; Zhang, Longxian; Li, Junqiang; Jian, Fuchun; Zhang, Sumei; Rume, Farzana Islam; Ning, Changshen; Xiao, Lihua
2015-03-01
Only a few studies based on single locus characterization have been conducted on the molecular epidemiology of Giardia duodenalis in nonhuman primates (NHPs). The present study was conducted to examine the occurrence and genotype identity of G. duodenalis in NHPs based on multi-locus analysis of the small-subunit ribosomal RNA (SSU rRNA), triose phosphate isomerase (tpi), glutamate dehydrogenase (gdh), and beta-giardin (bg) genes. Fecal specimens were collected from 496 animals of 36 NHP species kept in seven zoos in China and screened for G. duodenalis by tpi-based PCR. G. duodenalis was detected in 92 (18.6%) specimens from 18 NHP species, belonging to assemblage A (n=4) and B (n=88). In positive NHP species, the infection rates ranged from 4.8% to 100%. In tpi sequence analysis, the assemblage A included subtypes A1, A2 and one novel subtype. Multi-locus analysis of the tpi, gdh, and bg genes detected 11 (8 known and 3 new), 6 (3 known and 3 new) and 9 (2 known and 7 new) subtypes in 88, 47 and 35 isolates in assemblage B, respectively. Thirty-two assemblage B isolates with data at all three loci yielded 15 multi-locus genotypes (MLGs), including 2 known and 13 new MLGs. Phylogenetic analysis of concatenated sequences of assemblage B showed that MLGs found here were genetically different from those of humans, NHPs, rabbit and guinea pig in Italy and Sweden. It further indicated that assemblage B isolates in ring-tailed lemurs and squirrel monkeys might be genetically different from those in other NHPs. These data suggest that NHPs are mainly infected with G. duodenalis assemblage B and there might be geographical segregation and host-adaptation in assemblage B in NHPs. Copyright © 2014 Elsevier B.V. All rights reserved.
Wirshing, Herman H; Baker, Andrew C
2014-08-01
Molecular phylogenies of scleractinian corals often fail to agree with traditional phylogenies derived from morphological characters. These discrepancies are generally attributed to non-homologous or morphologically plastic characters used in taxonomic descriptions. Consequently, morphological convergence of coral skeletons among phylogenetically unrelated groups is considered to be the major evolutionary process confounding molecular and morphological hypotheses. A strategy that may help identify cases of convergence and/or diversification in coral morphology is to compare phylogenies of existing "neutral" genetic markers used to estimate genealogic phylogenetic history with phylogenies generated from non-neutral genes involved in calcification (biomineralization). We tested the hypothesis that differences among calcification gene phylogenies with respect to the "neutral" trees may represent convergent or divergent functional strategies among calcification gene proteins that may correlate to aspects of coral skeletal morphology. Partial sequences of two nuclear genes previously determined to be involved in the calcification process in corals, "Cnidaria-III" membrane-bound/secreted α-carbonic anhydrase (CIII-MBSα-CA) and bone morphogenic protein (BMP) 2/4, were PCR-amplified, cloned and sequenced from 31 scleractinian coral species in 26 genera and 9 families. For comparison, "neutral" gene phylogenies were generated from sequences from two protein-coding "non-calcification" genes, one nuclear (β-tubulin) and one mitochondrial (cytochrome b), from the same individuals. Cloned CIII-MBSα-CA sequences were found to be non-neutral, and phylogenetic analyses revealed CIII-MBSα-CAs to exhibit a complex evolutionary history with clones distributed between at least 2 putative gene copies. However, for several coral taxa only one gene copy was recovered. With CIII-MBSα-CA, several recovered clades grouped taxa that differed from the "non-calcification" loci. 
In some cases, these taxa shared aspects of their skeletal morphology (i.e., convergence or diversification relative to the "non-calcification" loci), but in other cases they did not. For example, the "non-calcification" loci recovered Atlantic and Pacific mussids as separate evolutionary lineages, whereas with CIII-MBSα-CA, clones of two species of Atlantic mussids (Isophyllia sinuosa and Mycetophyllia sp.) and two species of Pacific mussids (Acanthastrea echinata and Lobophyllia hemprichii) were united in a distinct clade (except for one individual of Mycetophyllia). However, this clade also contained other taxa which were not unambiguously correlated with morphological features. BMP2/4 also contained clones that likely represent different gene copies. However, many of the sequences showed no significant deviation from neutrality, and reconstructed phylogenies were similar to the "non-calcification" tree topologies with a few exceptions. Although individual calcification genes are unlikely to precisely explain the diverse morphological features exhibited by scleractinian corals, this study demonstrates an approach for identifying cases where morphological taxonomy may have been misled by convergent and/or divergent molecular evolutionary processes in corals. Studies such as this may help illuminate our understanding of the likely complex evolution of genes involved in the calcification process, and enhance our knowledge of the natural history and biodiversity within this central ecological group. Published by Elsevier Inc.
Quéméneur, Marianne; Heinrich-Salmeron, Audrey; Muller, Daniel; Lièvremont, Didier; Jauzein, Michel; Bertin, Philippe N; Garrido, Francis; Joulian, Catherine
2008-07-01
A new primer set was designed to specifically amplify ca. 1,100 bp of aoxB genes encoding the As(III) oxidase catalytic subunit from taxonomically diverse aerobic As(III)-oxidizing bacteria. Comparative analysis of AoxB protein sequences showed variable conservation levels and highlighted the conservation of essential amino acids and structural motifs. AoxB phylogeny of pure strains showed well-discriminated taxonomic groups and was similar to 16S rRNA phylogeny. Alphaproteobacteria-, Betaproteobacteria-, and Gammaproteobacteria-related sequences were retrieved from environmental surveys, demonstrating their prevalence in mesophilic As-contaminated soils. Our study underlines the usefulness of the aoxB gene as a functional marker of aerobic As(III) oxidizers.
Lanier, Hayley C; Knowles, L Lacey
2015-02-01
Coalescent-based methods for species-tree estimation are becoming a dominant approach for reconstructing species histories from multi-locus data, with most of the studies examining these methodologies focused on recently diverged species. However, deeper phylogenies, such as the datasets that comprise many Tree of Life (ToL) studies, also exhibit gene-tree discordance. This discord may also arise from the stochastic sorting of gene lineages during the speciation process (i.e., reflecting the random coalescence of gene lineages in ancestral populations). It remains unknown whether guidelines regarding methodologies and numbers of loci established by simulation studies at shallow tree depths translate into accurate species relationships for deeper phylogenetic histories. We address this knowledge gap and specifically identify the challenges and limitations of species-tree methods that account for coalescent variance for deeper phylogenies. Using simulated data with characteristics informed by empirical studies, we evaluate both the accuracy of estimated species trees and the characteristics associated with recalcitrant nodes, with a specific focus on whether coalescent variance is generally responsible for the lack of resolution. By determining the proportion of coalescent genealogies that support a particular node, we demonstrate that (1) species-tree methods account for coalescent variance at deep nodes and (2) mutational variance - not gene-tree discord arising from the coalescent - posed the primary challenge for accurate reconstruction across the tree. For example, many nodes were accurately resolved despite predicted discord from the random coalescence of gene lineages and nodes with poor support were distributed across a range of depths (i.e., they were not restricted to particularly recent divergences). 
Given their broad taxonomic scope and large sampling of taxa, deep level phylogenies pose several potential methodological complications including difficulties with MCMC convergence and estimation of requisite population genetic parameters for coalescent-based approaches. Despite these difficulties, the findings generally support the utility of species-tree analyses for the estimation of species relationships throughout the ToL. We discuss strategies for successful application of species-tree approaches to deep phylogenies. Copyright © 2014 Elsevier Inc. All rights reserved.
Breinholt, Jesse W; Earl, Chandra; Lemmon, Alan R; Lemmon, Emily Moriarty; Xiao, Lei; Kawahara, Akito Y
2018-01-01
The advent of next-generation sequencing technology has allowed for the collection of large portions of the genome for phylogenetic analysis. Hybrid enrichment and transcriptomics are two techniques that leverage next-generation sequencing and have shown much promise. However, methods for processing hybrid enrichment data are still limited. We developed a pipeline for anchored hybrid enrichment (AHE) read assembly, orthology determination, contamination screening, and data processing for sequences flanking the target "probe" region. We apply this approach to study the phylogeny of butterflies and moths (Lepidoptera), a megadiverse group of more than 157,000 described species with poorly understood deep-level phylogenetic relationships. We introduce a new, 855-locus AHE kit for Lepidoptera phylogenetics and compare resulting trees to those from transcriptomes. The enrichment kit was designed from existing genomes, transcriptomes, and expressed sequence tags and was used to capture sequence data from 54 species from 23 lepidopteran families. Phylogenies estimated from AHE data were largely congruent with trees generated from transcriptomes, with strong support for relationships at all but the deepest taxonomic levels. We combine AHE and transcriptomic data to generate a new Lepidoptera phylogeny, representing 76 exemplar species in 42 families. The tree provides robust support for many relationships, including those among the seven butterfly families. The addition of AHE data to an existing transcriptomic dataset lowers node support along the Lepidoptera backbone, but firmly places taxa with AHE data on the phylogeny. Combining taxa sequenced for AHE with existing transcriptomes and genomes resulted in a tree with strong support for (Calliduloidea + Gelechioidea + Thyridoidea) + (Papilionoidea + Pyraloidea + Macroheterocera). 
To examine the efficacy of AHE at a shallow taxonomic level, phylogenetic analyses were also conducted on a sister group representing a more recent divergence, the Saturniidae and Sphingidae. These analyses utilized sequences from the probe region and data flanking it, which nearly doubled the size of the dataset; resulting trees supported new phylogenetic relationships, especially within the Saturniidae and Sphingidae (e.g., Hemarina derived in the latter). We hope that our data processing pipeline, hybrid enrichment gene set, and approach of combining AHE data with transcriptomes will be useful for the broader systematics community. © The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Multi-locus phylogenetic analysis reveals the pattern and tempo of bony fish evolution
Broughton, Richard E.; Betancur-R., Ricardo; Li, Chenhong; Arratia, Gloria; Ortí, Guillermo
2013-01-01
Over half of all vertebrates are “fishes”, which exhibit enormous diversity in morphology, physiology, behavior, reproductive biology, and ecology. Investigation of fundamental areas of vertebrate biology depends critically on a robust phylogeny of fishes, yet evolutionary relationships among the major actinopterygian and sarcopterygian lineages have not been conclusively resolved. Although a consensus phylogeny of teleosts has been emerging recently, it has been based on analyses of various subsets of actinopterygian taxa, but not on a full sample of all bony fishes. Here we conducted a comprehensive phylogenetic study on a broad taxonomic sample of 61 actinopterygian and sarcopterygian lineages (with a chondrichthyan outgroup) using a molecular data set of 21 independent loci. These data yielded a resolved phylogenetic hypothesis for extant Osteichthyes, including 1) reciprocally monophyletic Sarcopterygii and Actinopterygii, as currently understood, with polypteriforms as the first diverging lineage within Actinopterygii; 2) a monophyletic group containing gars and bowfin (= Holostei) as sister group to teleosts; and 3) the earliest diverging lineage among teleosts being Elopomorpha, rather than Osteoglossomorpha. Relaxed-clock dating analysis employing a set of 24 newly applied fossil calibrations reveals divergence times that are more consistent with paleontological estimates than previous studies. Establishing a new phylogenetic pattern with accurate divergence dates for bony fishes illustrates several areas where the fossil record is incomplete and provides critical new insights on diversification of this important vertebrate group. PMID:23788273
Villarreal A, Juan Carlos; Crandall-Stotler, Barbara J; Hart, Michelle L; Long, David G; Forrest, Laura L
2016-03-01
We present a complete generic-level phylogeny of the complex thalloid liverworts, a lineage that includes the model system Marchantia polymorpha. The complex thalloids are remarkable for their slow rate of molecular evolution and for being the only extant plant lineage to differentiate gas exchange tissues in the gametophyte generation. We estimated the divergence times and analyzed the evolutionary trends of morphological traits, including air chambers, rhizoids and specialized reproductive structures. A multilocus dataset was analyzed using maximum likelihood and Bayesian approaches. Relative rates were estimated using local clocks. Our phylogeny cements the early branching in complex thalloids. Marchantia is supported in one of the earliest divergent lineages. The rate of evolution in organellar loci is slower than for other liverwort lineages, except for two annual lineages. Most genera diverged in the Cretaceous. Marchantia polymorpha diversified in the Late Miocene, giving a minimum age estimate for the evolution of its sex chromosomes. The complex thalloid ancestor, excluding Blasiales, is reconstructed as a plant with a carpocephalum, with filament-less air chambers opening via compound pores, and without pegged rhizoids. Our comprehensive study of the group provides a temporal framework for the analysis of the evolution of critical traits essential for plants during land colonization. © 2015 Royal Botanic Garden Edinburgh. New Phytologist © 2015 New Phytologist Trust.
Kundu, S; Jones, C G; Prys-Jones, R P; Groombridge, J J
2012-01-01
Parrots are among the most recognisable and widely distributed of all bird groups occupying major parts of the tropics. The evolution of the genera that are found in and around the Indian Ocean region is particularly interesting as they show a high degree of heterogeneity in distribution and levels of speciation. Here we present a molecular phylogenetic analysis of Indian Ocean parrots, identifying the possible geological and geographical factors that influenced their evolution. We hypothesise that the Indian Ocean islands acted as stepping stones in the radiation of the Old-World parrots, and that sea-level changes may have been an important determinant of current distributions and differences in speciation. A multi-locus phylogeny showing the evolutionary relationships among genera highlights the interesting position of the monotypic Psittrichas, which shares a common ancestor with the geographically distant Coracopsis. An extensive species-level molecular phylogeny indicates a complex pattern of radiation including evidence for colonisation of Africa, Asia and the Indian Ocean islands from Australasia via multiple routes, and of island populations 'seeding' continents. Moreover, comparison of estimated divergence dates and sea-level changes points to the latter as a factor in parrot speciation. This is the first study to include the extinct parrot taxa, Mascarinus mascarinus and Psittacula wardi which, respectively, appear closely related to Coracopsis nigra and Psittacula eupatria. Copyright © 2011 Elsevier Inc. All rights reserved.
Ibarra-Cerdeña, Carlos N; Zaldívar-Riverón, Alejandro; Peterson, A Townsend; Sánchez-Cordero, Víctor; Ramsey, Janine M
2014-10-01
The niche conservatism hypothesis states that related species diverge in niche characteristics at lower rates than expected, given their lineage divergence. Here we analyze whether niche conservatism is a common pattern among vector species (Hemiptera: Reduviidae: Triatominae) of Trypanosoma cruzi that inhabit North and Central America, a highly heterogeneous landmass in terms of environmental gradients. Mitochondrial and nuclear loci were used in a multi-locus phylogenetic framework to reconstruct phylogenetic relationships among species and estimate time of divergence of selected clades to draw biogeographic inferences. Then, we estimated similarity between the ecological niche of sister species and tested the niche conservatism hypothesis using our best estimate of phylogeny. Triatoma is not monophyletic. A primary clade with all North and Central American (NCA) triatomine species from the genera Triatoma, Dipetalogaster, and Panstrongylus, was consistently recovered. Nearctic species within the NCA clade (T. p. protracta, T. r. rubida) diverged during the Pliocene, whereas the Neotropical species (T. phyllosoma, T. longipennis, T. dimidiata complex) are estimated to have diverged more recently, during the Pleistocene. The hypothesis of niche conservatism could not be rejected for any of six sister species pairs. Niche similarity between sister species best fits a retention model. While this framework is used here to infer niche evolution, it has a direct impact on spatial vector dynamics driven by human population movements, expansion of transportation networks and climate change scenarios.
USDA-ARS's Scientific Manuscript database
Cryptococcus flavescens strain OH182.9_3C (3C) previously displayed significant biological control activity against Fusarium head blight, a globally important disease of wheat; however, the diversity within C. flavescens has not been previously characterized. Multilocus sequence typing was performed...
USDA-ARS's Scientific Manuscript database
Bacterial spot of tomato (BST) is a major constraint to tomato production in Ethiopia and many other countries leading to significant crop losses. In the present study, using pathogenicity tests, sensitivity to copper and streptomycin, and multilocus sequence analysis, a diverse group of Xanthomonas...
USDA-ARS's Scientific Manuscript database
Vibrio parahaemolyticus is a gram-negative bacterium that inhabits coastal and marine environments. Thermostable direct hemolysin (tdh), tdh-related hemolysin (trh) and the type III secretion system are considered the potential virulence factors of pathogenic V. parahaemolyticus. The frequency of str...
USDA-ARS's Scientific Manuscript database
Strains from a collection of 3,639 diverse Bacillus thuringiensis isolates were classified based on phenotypic profiles resulting from six biochemical tests, including production of amylase (T), lecithinase (L), urease (U), acid from sucrose (S) and salicin (A), and the hydrolysis of esculin (E). St...
USDA-ARS's Scientific Manuscript database
The objective of this study was to compare subtypes of Campylobacter jejuni and coli detected on three discrete selective Campylobacter plating media to determine if different media select for different subtypes. Fifty ceca and fifty carcasses (n=100, representing 50 flocks) were collected from the...
USDA-ARS's Scientific Manuscript database
Mycoplasma bovis is a primary agent of mastitis, pneumonia and arthritis in cattle and is the bacterium isolated most frequently from the polymicrobial syndrome known as bovine respiratory disease complex (BRDC). Recently, M. bovis has emerged as a significant health problem in bison, causing necro...
A multilocus sequence typing method and curated database for Mycoplasma bovis
USDA-ARS's Scientific Manuscript database
Mycoplasma bovis is a primary agent of mastitis, pneumonia and arthritis in cattle and is the bacterium isolated most frequently from the polymicrobial syndrome known as bovine respiratory disease complex (BRDC). Recently, M. bovis has emerged as a significant problem in bison, causing necrotic pha...
Campylobacter multi-locus sequence typing subtypes detected on chicken livers available at retail.
USDA-ARS's Scientific Manuscript database
Foodborne campylobacteriosis has been traced to undercooked chicken liver. It is not known what prevalence of Campylobacter to expect on fresh chicken livers available at retail. The objectives of this study were to measure prevalence of Campylobacter associated with chicken livers at retail and d...
Population sub-structuring among Trypanosoma evansi stocks.
Njiru, Z K; Constantine, C C
2007-10-01
To investigate the population genetic structure of Trypanosoma evansi from domesticated animals, we have analysed 112 stocks from camels, buffaloes, cattle and horses using the tandemly repeated coding sequence (MORF2) and minisatellite markers 292 and cysteine-rich acidic integral membrane protein (CRAM). We recorded a total of six alleles at the MORF2 locus, seven at 292 and 12 at the CRAM locus. Nei's genetic distance showed reduced allelic diversity between buffaloes and cattle stocks (1.2) as compared to the diversity between camels and buffaloes (3.75) and camels and cattle stock (1.69). The mean index of association (IA=0.92) significantly deviated from zero, and the average number of multilocus genotypes (G/N ratio) was 0.21. Twenty-four multilocus genotypes were defined from the combination of alleles at the three loci. The Kenyan sub-populations showed Fst=0.28 and analysis of molecular variance showed significant divergence (22.7%) between the Laikipia, Kulal and Galana regions. The regional and host distribution of multi-locus genotypes, the significant population differentiation and the high Nei's genetic distances suggest the existence of genetic sub-structuring within T. evansi stocks, while the few multi-locus genotypes and the deviation of the association index from zero indicate a lack of recombination. In conclusion, this study reveals that some genetic sub-structuring does occur within T. evansi, which has a clonal population structure.
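The G/N ratio reported above can be illustrated with a minimal sketch, assuming it is simply the number of distinct multilocus genotypes (G) divided by the number of stocks typed (N). The function name and the toy allele labels are hypothetical and are not taken from the study.

```python
from collections import Counter

def genotype_diversity(profiles):
    """Return (G, G/N) for a list of multilocus allele profiles.

    Each profile is a sequence of allele labels, one per locus.
    G is the number of distinct profiles and N the number of samples.
    """
    if not profiles:
        raise ValueError("at least one profile is required")
    counts = Counter(tuple(p) for p in profiles)
    g = len(counts)
    return g, g / len(profiles)

# Hypothetical toy data: (MORF2, 292, CRAM) allele labels for four stocks.
toy_profiles = [
    ("m1", "a1", "c1"),
    ("m1", "a1", "c1"),  # same genotype as the first stock
    ("m2", "a1", "c3"),
    ("m1", "a2", "c1"),
]
toy_g, toy_ratio = genotype_diversity(toy_profiles)

# Figures stated in the abstract: 24 genotypes among 112 stocks.
abstract_ratio = round(24 / 112, 2)
```

Under this reading, the abstract's own figures are consistent with each other: 24 genotypes among 112 stocks gives a ratio of about 0.21.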
Fonseca, Luiz Henrique M; Lohmann, Lúcia G
2018-06-01
Combining high-throughput sequencing data with amplicon sequences allows the reconstruction of robust phylogenies based on comprehensive sampling of characters and taxa. Here, we combine Next Generation Sequencing (NGS) and Sanger sequencing data to infer the phylogeny of the "Adenocalymma-Neojobertia" clade (Bignonieae, Bignoniaceae), a diverse lineage of Neotropical plants, using Maximum Likelihood and Bayesian approaches. We used NGS to obtain complete or nearly-complete plastomes of members of this clade, leading to a final dataset with 54 individuals, representing 44 members of the ingroup and 10 outgroups. In addition, we obtained Sanger sequences of two plastid markers (ndhF and rpl32-trnL) for 44 individuals (43 ingroup and 1 outgroup) and the nuclear PepC for 64 individuals (63 ingroup and 1 outgroup). Our final dataset includes 87 individuals of members of the "Adenocalymma-Neojobertia" clade, representing 66 species (ca. 90% of the diversity), plus 11 outgroups. Plastid and nuclear datasets recovered congruent topologies and were combined. The combined analysis recovered a monophyletic "Adenocalymma-Neojobertia" clade and a paraphyletic Adenocalymma that also contained a monophyletic Neojobertia plus Pleonotoma albiflora. Relationships are strongly supported in all analyses, with most lineages within the "Adenocalymma-Neojobertia" clade receiving maximum posterior probabilities. Ancestral character state reconstructions using Bayesian approaches identified six morphological synapomorphies of clades, namely prophyll type, petiole and petiolule articulation, tendril ramification, inflorescence ramification, calyx shape, and fruit wings. Other characters such as habit, calyx cupular trichomes, corolla color, and corolla shape evolved multiple times. These characters are putatively related to the clade's diversification and can be further explored in diversification studies. Copyright © 2018 Elsevier Inc. All rights reserved.
Homology and phylogeny and their automated inference
NASA Astrophysics Data System (ADS)
Fuellen, Georg
2008-06-01
The analysis of the ever-increasing amount of biological and biomedical data can be pushed forward by comparing the data within and among species. For example, an integrative analysis of data from the genome sequencing projects for various species traces the evolution of the genomes and identifies conserved and innovative parts. Here, I review the foundations and advantages of this “historical” approach and evaluate recent attempts at automating such analyses. Biological data is comparable if a common origin exists (homology), as is the case for members of a gene family originating via duplication of an ancestral gene. If the family has relatives in other species, we can assume that the ancestral gene was present in the ancestral species from which all the other species evolved. In particular, describing the relationships among the duplicated biological sequences found in the various species is often possible by a phylogeny, which is more informative than homology statements. Detecting and elaborating on common origins may answer how certain biological sequences developed, and predict what sequences are in a particular species and what their function is. Such knowledge transfer from sequences in one species to the homologous sequences of the other is based on the principle of ‘my closest relative looks and behaves like I do’, often referred to as ‘guilt by association’. To enable knowledge transfer on a large scale, several automated ‘phylogenomics pipelines’ have been developed in recent years, and seven of these will be described and compared. Overall, the examples in this review demonstrate that homology and phylogeny analyses, done on a large (and automated) scale, can give insights into function in biology and biomedicine.
Imhoff, Johannes F.; Rahn, Tanja; Künzel, Sven; Neulinger, Sven C.
2018-01-01
Two different photosystems for performing bacteriochlorophyll-mediated photosynthetic energy conversion are employed in different bacterial phyla. Those bacteria employing a photosystem II type of photosynthetic apparatus include the phototrophic purple bacteria (Proteobacteria), Gemmatimonas and Chloroflexus with their photosynthetic relatives. The proteins of the photosynthetic reaction center PufL and PufM are essential components and are common to all bacteria with a type-II photosynthetic apparatus, including the anaerobic as well as the aerobic phototrophic Proteobacteria. Therefore, PufL and PufM proteins and their genes are perfect tools to evaluate the phylogeny of the photosynthetic apparatus and to study the diversity of the bacteria employing this photosystem in nature. Almost complete pufLM gene sequences and the derived protein sequences from 152 type strains and 45 additional strains of phototrophic Proteobacteria employing photosystem II were compared. The results give interesting and comprehensive insights into the phylogeny of the photosynthetic apparatus and clearly define Chromatiales, Rhodobacterales, Sphingomonadales as major groups distinct from other Alphaproteobacteria, from Betaproteobacteria and from Caulobacterales (Brevundimonas subvibrioides). A special relationship exists between the PufLM sequences of those bacteria employing bacteriochlorophyll b instead of bacteriochlorophyll a. A clear phylogenetic association of aerobic phototrophic purple bacteria to anaerobic purple bacteria according to their PufLM sequences is demonstrated indicating multiple evolutionary lines from anaerobic to aerobic phototrophic purple bacteria. The impact of pufLM gene sequences for studies on the environmental diversity of phototrophic bacteria is discussed and the possibility of their identification on the species level in environmental samples is pointed out. PMID:29472894
Crampton-Platt, Alex; Timmermans, Martijn J.T.N.; Gimmel, Matthew L.; Kutty, Sujatha Narayanan; Cockerill, Timothy D.; Vun Khen, Chey; Vogler, Alfried P.
2015-01-01
In spite of the growth of molecular ecology, systematics and next-generation sequencing, the discovery and analysis of diversity is not currently integrated with building the tree-of-life. Tropical arthropod ecologists are well placed to accelerate this process if all specimens obtained through mass-trapping, many of which will be new species, could be incorporated routinely into phylogeny reconstruction. Here we test a shotgun sequencing approach, whereby mitochondrial genomes are assembled from complex ecological mixtures through mitochondrial metagenomics, and demonstrate how the approach overcomes many of the taxonomic impediments to the study of biodiversity. DNA from approximately 500 beetle specimens, originating from a single rainforest canopy fogging sample from Borneo, was pooled and shotgun sequenced, followed by de novo assembly of complete and partial mitogenomes for 175 species. The phylogenetic tree obtained from this local sample was highly similar to that from existing mitogenomes selected for global coverage of major lineages of Coleoptera. When all sequences were combined only minor topological changes were induced against this reference set, indicating an increasingly stable estimate of coleopteran phylogeny, while the ecological sample expanded the tip-level representation of several lineages. Robust trees generated from ecological samples now enable an evolutionary framework for ecology. Meanwhile, the inclusion of uncharacterized samples in the tree-of-life rapidly expands taxon and biogeographic representation of lineages without morphological identification. Mitogenomes from shotgun sequencing of unsorted environmental samples and their associated metadata, placed robustly into the phylogenetic tree, constitute novel DNA “superbarcodes” for testing hypotheses regarding global patterns of diversity. PMID:25957318
Colonisation with toxigenic Corynebacterium diphtheriae in a Scottish burns patient, June 2015.
Deshpande, Ashutosh; Inkster, Teresa; Hamilton, Kate; Litt, David; Fry, Norman; Kennedy, Iain T R; Shookhye-Dickson, Jacqueline; Hill, Robert L R
2015-01-01
On 12 June 2015, Corynebacterium diphtheriae was identified in a skin swab from a burns patient in Scotland. The isolate was confirmed to be genotypically and phenotypically toxigenic. Multilocus sequence typing of three patient isolates yielded sequence type ST 125. The patient was clinically well. We summarise findings of this case, and results of close contact identification and screening: 12 family and close contacts and 32 hospital staff have been found negative for C. diphtheriae.
Woksepp, Hanna; Ryberg, Anna; Berglind, Linda; Schön, Thomas; Söderman, Jan
2017-12-01
Enhanced precision of epidemiological typing in clinically suspected nosocomial outbreaks is crucial. Our aim was to investigate whether single nucleotide polymorphism (SNP) analysis and core genome (cg) multilocus sequence typing (MLST) of whole genome sequencing (WGS) data would more reliably identify a nosocomial outbreak, compared to earlier molecular typing methods. Sixteen isolates from a nosocomial outbreak of ESBL E. coli ST-131 in southeastern Sweden and three control strains were subjected to WGS. Sequences were explored by SNP analysis and cgMLST. cgMLST clearly differentiated between the outbreak isolates and the control isolates (>1400 differences). All clinically identified outbreak isolates showed close clustering (≤2 allele differences), except for two isolates (>50 allele differences). These data confirmed that the isolates with >50 differing genes did not belong to the nosocomial outbreak. The number of SNPs within the outbreak was ≤7, whereas the two discrepant isolates had >700 SNPs. Two of the ESBL E. coli ST-131 isolates did not belong to the clinically identified outbreak. Our results illustrate the power of WGS in terms of resolution, which may avoid overestimation of patients belonging to outbreaks as judged from epidemiological data and previously employed molecular methods with lower discriminatory ability. © 2017 APMIS. Published by John Wiley & Sons Ltd.
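The idea of grouping isolates by cgMLST allele differences can be sketched as follows. This is an illustrative assumption, not the authors' pipeline: profiles are lists of allele numbers per core-genome locus, missing calls are `None`, and isolates are joined by single linkage whenever they differ at no more than a chosen threshold. All names and the toy profiles are hypothetical.

```python
def allele_distance(p, q):
    """Count loci with different allele calls; loci missing in either profile are skipped."""
    return sum(
        1 for a, b in zip(p, q)
        if a is not None and b is not None and a != b
    )

def single_linkage_clusters(profiles, threshold):
    """Group isolates whose allele distance is <= threshold (single linkage).

    profiles: dict mapping isolate name -> list of allele calls.
    Returns a sorted list of sorted name lists.
    """
    names = list(profiles)
    parent = {n: n for n in names}

    def find(x):
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if allele_distance(profiles[a], profiles[b]) <= threshold:
                parent[find(a)] = find(b)

    clusters = {}
    for n in names:
        clusters.setdefault(find(n), []).append(n)
    return sorted(sorted(c) for c in clusters.values())

# Hypothetical six-locus profiles; real cgMLST schemes use thousands of loci.
toy_isolates = {
    "out1": [1, 1, 1, 1, 1, 1],
    "out2": [1, 1, 1, 1, 1, 2],
    "out3": [1, 1, 1, 1, 2, 2],
    "ctrl": [9, 8, 7, 6, 5, 4],
}
toy_clusters = single_linkage_clusters(toy_isolates, threshold=2)
```

The threshold is a free parameter in this sketch. The abstract reports outbreak isolates within 2 allele differences of each other and unrelated isolates at more than 50, so any cut-off between those values would separate them.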
Momeni, Stephanie S; Whiddon, Jennifer; Cheon, Kyounga; Moser, Stephen A; Childers, Noel K
2015-12-01
Studies using multilocus sequence typing (MLST) have demonstrated that Streptococcus mutans isolates are genetically diverse. Our laboratory previously demonstrated clonality of S. mutans using MLST but could not discount the possibility of sampling bias. In this study, the clonality of randomly selected S. mutans plaque isolates from African-American children was examined using MLST. Serotype and the presence of collagen-binding proteins (CBPs) encoded by cnm/cbm were also assessed. One-hundred S. mutans isolates were randomly selected for MLST analysis. Sequence analysis was performed and phylogenetic trees were generated using START2 and MEGA. Thirty-four sequence types were identified, of which 27 were unique to this population. Seventy-five per cent of the isolates clustered into 16 clonal groups. The serotypes observed were c (n = 84), e (n = 3), and k (n = 11). The prevalence of S. mutans isolates of serotype k was notably high, at 17.5%. All isolates were cnm/cbm negative. The clonality of S. mutans demonstrated in this study illustrates the importance of localized population studies and is consistent with transmission. The prevalence of serotype k, a recently proposed systemic pathogen, observed in this study, is higher than reported in most populations and is the first report of S. mutans serotype k in a United States population. © 2015 Eur J Oral Sci.
Álvarez-Pérez, Sergio; de Vega, Clara; Herrera, Carlos M.
2013-01-01
The genetic and evolutionary relationships among floral nectar-dwelling Pseudomonas ‘sensu stricto’ isolates associated with South African and Mediterranean plants were investigated by multilocus sequence analysis (MLSA) of four core housekeeping genes (rrs, gyrB, rpoB and rpoD). A total of 35 different sequence types were found for the 38 nectar bacterial isolates characterised. Phylogenetic analyses resulted in the identification of three main clades [nectar groups (NGs) 1, 2 and 3] of nectar pseudomonads, which were closely related to five intrageneric groups: Pseudomonas oryzihabitans (NG 1); P. fluorescens, P. lutea and P. syringae (NG 2); and P. rhizosphaerae (NG 3). Linkage disequilibrium analysis pointed to a mostly clonal population structure, even when the analysis was restricted to isolates from the same floristic region or belonging to the same NG. Nevertheless, signatures of recombination were observed for NG 3, which exclusively included isolates retrieved from the floral nectar of insect-pollinated Mediterranean plants. In contrast, the other two NGs comprised both South African and Mediterranean isolates. Analyses relating diversification to floristic region and pollinator type revealed that there has been more unique evolution of the nectar pseudomonads within the Mediterranean region than would be expected by chance. This is the first work analysing the sequence of multiple loci to reveal geno- and ecotypes of nectar bacteria. PMID:24116076
Impact of recent molecular phylogenetic studies on classification of ascomycete yeasts
USDA-ARS's Scientific Manuscript database
Analyses of concatenated gene sequences as well as whole genome sequences are resolving relationships among the ascomycete yeasts (Saccharomycotina), thus allowing classification of members of this subphylum to be based on phylogeny. In addition, changes implemented in the new Botanical Code [Intern...
Corradi, Nicolas; Hijri, Mohamed; Fumagalli, Luca; Sanders, Ian R
2004-11-01
The genes encoding alpha- and beta-tubulins have been widely sampled in most major fungal phyla and they are useful tools for fungal phylogeny. Here, we report the first isolation of alpha-tubulin sequences from arbuscular mycorrhizal fungi (AMF). In parallel, AMF beta-tubulins were sampled and analysed to identify the presence of paralogs of this gene. The AMF alpha-tubulin amino acid phylogeny was congruent with the results previously reported for AMF beta-tubulins and showed that AMF tubulins group together at a basal position in the fungal clade and showed high sequence similarities with members of the Chytridiomycota. This is in contrast with phylogenies for other regions of the AMF genome. The amount and nature of substitutions are consistent with an ancient divergence of both orthologs and paralogs of AMF tubulins. At the amino acid level, however, AMF tubulins have hardly evolved from those of the chytrids. This is remarkable given that these two groups are ancient and the monophyletic Glomeromycota probably diverged from basal fungal ancestors at least 500 million years ago. The specific primers we designed for the AMF tubulins, together with the high molecular variation we found among the AMF species we analysed, make AMF tubulin sequences potentially useful for AMF identification purposes.
Phylogenetic analysis of the alfalfa weevil complex (Coleoptera: Curculionidae) in North America.
Böttger, Jorge A Achata; Bundy, C Scott; Oesterle, Naomi; Hanson, Stephen F
2013-02-01
The Eastern, Western, and Egyptian strains of alfalfa weevil are pests introduced to North America on three separate occasions; they now share partially overlapping geographic ranges, covering most of the continental United States. Behavior, susceptibility to parasites, and subtle morphological differences separate the strains. The difficulty in differentiating among these strains morphologically has led to the application of molecular phylogeny approaches including restriction fragment-length polymorphism characterization and sequencing of mitochondrial genes. While valuable for strain identification, this approach cannot identify interstrain hybrids because mitochondrial markers are maternally inherited. The work reported here extends previous findings by comparing over 7 Kb of sequence from two mitochondrial and four nuclear loci to increase the resolution of molecular phylogeny for these weevils. The related clover leaf weevil, also an occasional pest of alfalfa, was included in the analysis because the molecular phylogeny of this weevil has not been examined to date. Analysis of nuclear loci indicates that the clover weevil is a distinct species. Furthermore, while the three alfalfa weevil strains are separable based on mitochondrial sequence data, they cannot be separated using nuclear loci, suggesting that they are all recently diverged members of the same species. These data refine the relationships among these strains and may find application in design of better control strategies.
Al-Atiyat, R M; Aljumaah, R S
2014-08-27
This study aimed to estimate evolutionary distances and to reconstruct phylogenetic trees between different Awassi sheep populations. Thirty-two sheep individuals from three different geographical areas of Jordan and the Kingdom of Saudi Arabia (KSA) were randomly sampled. DNA was extracted from the tissue samples and sequenced using the T7 promoter universal primer. Different phylogenetic trees were reconstructed from 0.64-kb DNA sequences using the MEGA software with the best general time reversible distance model. Three methods of distance estimation were then used. The maximum composite likelihood test was considered for reconstructing maximum likelihood, neighbor-joining and UPGMA trees. The maximum likelihood tree indicated three major clusters separated by cytosine (C) and thymine (T). The greatest distance was shown between the South sheep and North sheep. On the other hand, the KSA sheep as an outgroup showed shorter evolutionary distance to the North sheep population than to the others. The neighbor-joining and UPGMA trees showed quite reliable clusters of evolutionary differentiation of Jordan sheep populations from the Saudi population. The overall results support geographical information and ecological types of the sheep populations studied. Summing up, the resulting phylogenetic trees may contribute to the limited information about the genetic relatedness and phylogeny of Awassi sheep in nearby Arab countries.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broad panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.
Bunawan, Hamidun; Yen, Choong Chee; Yaakop, Salmah; Noor, Normah Mohd
2017-01-26
The chloroplastic trnL intron and the nuclear internal transcribed spacer (ITS) region were sequenced for 11 Nepenthes species recorded in Peninsular Malaysia to examine their phylogenetic relationship and to evaluate the usage of trnL intron and ITS sequences for phylogenetic reconstruction of this genus. Phylogeny reconstruction was carried out using neighbor-joining, maximum parsimony and Bayesian analyses. All the trees revealed two major clusters, a lowland group consisting of N. ampullaria, N. mirabilis, N. gracilis and N. rafflesiana, and another containing both intermediately distributed species (N. albomarginata and N. benstonei) and four highland species (N. sanguinea, N. macfarlanei, N. ramispina and N. alba). The trnL intron and ITS sequences provided phylogenetically informative characters for deriving a phylogeny of Nepenthes species in Peninsular Malaysia. To our knowledge, this is the first molecular phylogenetic study of Nepenthes species occurring along an altitudinal gradient in Peninsular Malaysia.
The rRNA evolution and procaryotic phylogeny
NASA Technical Reports Server (NTRS)
Fox, G. E.
1986-01-01
Studies of ribosomal RNA primary structure allow reconstruction of phylogenetic trees for prokaryotic organisms. Such studies reveal a major dichotomy among the bacteria that separates them into eubacteria and archaebacteria. Both groupings are further segmented into several major divisions. The results obtained from 5S rRNA sequences are essentially the same as those obtained with the 16S rRNA data. In the case of Gram-negative bacteria, the ribosomal RNA sequencing results can also be directly compared with hybridization studies and cytochrome c sequencing studies. There is again excellent agreement among the several methods. It seems likely then that the overall picture of microbial phylogeny that is emerging from the RNA sequence studies is a good approximation of the true history of these organisms. The RNA data allow examination of the evolutionary process in a semi-quantitative way. The secondary structures of these RNAs are largely established. As a result it is possible to recognize examples of local structural evolution. Evolutionary pathways accounting for these events can be proposed and their probability can be assessed.
Multi-locus phylogeny of Pleosporales: a taxonomic, ecological and evolutionary re-evaluation
Zhang, Y.; Schoch, C.L.; Fournier, J.; Crous, P.W.; de Gruyter, J.; Woudenberg, J.H.C.; Hirayama, K.; Tanaka, K.; Pointing, S.B.; Spatafora, J.W.; Hyde, K.D.
2009-01-01
Five loci, nucSSU, nucLSU rDNA, TEF1, RPB1 and RPB2, are used for analysing 129 pleosporalean taxa representing 59 genera and 15 families in the current classification of Pleosporales. The suborder Pleosporineae is emended to include four families, viz. Didymellaceae, Leptosphaeriaceae, Phaeosphaeriaceae and Pleosporaceae. In addition, two new families are introduced, i.e. Amniculicolaceae and Lentitheciaceae. Pleomassariaceae is treated as a synonym of Melanommataceae, and new circumscriptions of Lophiostomataceae s. str., Massarinaceae and Lophiotrema are proposed. Familial positions of Entodesmium and Setomelanomma in Phaeosphaeriaceae, Neophaeosphaeria in Leptosphaeriaceae, Leptosphaerulina, Macroventuria and Platychora in Didymellaceae, Pleomassaria in Melanommataceae and Bimuria, Didymocrea, Karstenula and Paraphaeosphaeria in Montagnulaceae are clarified. Both ecological and morphological characters show varying degrees of phylogenetic significance. Pleosporales is most likely derived from a saprobic ancestor with fissitunicate asci containing conspicuous ocular chambers and apical rings. Nutritional shifts in Pleosporales likely occurred from saprotrophic to hemibiotrophic or biotrophic. PMID:20169024
Kijima, T E; Innan, Hideki
2013-11-01
A population genetic simulation framework is developed to understand the behavior and molecular evolution of DNA sequences of transposable elements. Our model incorporates random transposition and excision of transposable element (TE) copies, two modes of selection against TEs, and degeneration of transpositional activity by point mutations. We first investigated the relationships between the behavior of the copy number of TEs and these parameters. Our results show that when selection is weak, the genome can maintain a relatively large number of TEs, but most of them are less active. In contrast, with strong selection, the genome can maintain only a limited number of TEs but the proportion of active copies is large. In such a case, there could be substantial fluctuations of the copy number over generations. We also explored how DNA sequences of TEs evolve through the simulations. In general, active copies form clusters around the original sequence, while less active copies have long branches specific to themselves, exhibiting a star-shaped phylogeny. It is demonstrated that the phylogeny of TE sequences could be informative to understand the dynamics of TE evolution.
Willerslev, Eske; Gilbert, M Thomas P; Binladen, Jonas; Ho, Simon YW; Campos, Paula F; Ratan, Aakrosh; Tomsho, Lynn P; da Fonseca, Rute R; Sher, Andrei; Kuznetsova, Tatanya V; Nowak-Kemp, Malgosia; Roth, Terri L; Miller, Webb; Schuster, Stephan C
2009-01-01
Background The scientific literature contains many examples where DNA sequence analyses have been used to provide definitive answers to phylogenetic problems that traditional (non-DNA based) approaches alone have failed to resolve. One notable example concerns the rhinoceroses, a group for which several contradictory phylogenies were proposed on the basis of morphology, then apparently resolved using mitochondrial DNA fragments. Results In this study we report the first complete mitochondrial genome sequences of the extinct ice-age woolly rhinoceros (Coelodonta antiquitatis), and the threatened Javan (Rhinoceros sondaicus), Sumatran (Dicerorhinus sumatrensis), and black (Diceros bicornis) rhinoceroses. In combination with the previously published mitochondrial genomes of the white (Ceratotherium simum) and Indian (Rhinoceros unicornis) rhinoceroses, this data set putatively enables reconstruction of the rhinoceros phylogeny. While the six species cluster into three strongly supported sister-pairings: (i) The black/white, (ii) the woolly/Sumatran, and (iii) the Javan/Indian, resolution of the higher-level relationships has no statistical support. The phylogenetic signal from individual genes is highly diffuse, with mixed topological support from different genes. Furthermore, the choice of outgroup (horse vs tapir) has considerable effect on reconstruction of the phylogeny. The lack of resolution is suggestive of a hard polytomy at the base of crown-group Rhinocerotidae, and this is supported by an investigation of the relative branch lengths. Conclusion Satisfactory resolution of the rhinoceros phylogeny may not be achievable without additional analyses of substantial amounts of nuclear DNA. This study provides a compelling demonstration that, in spite of substantial sequence length, there are significant limitations with single-locus phylogenetics. We expect further examples of this to appear as next-generation, large-scale sequencing of complete mitochondrial genomes becomes commonplace in evolutionary studies. "The human factor in classification is nowhere more evident than in dealing with this superfamily (Rhinocerotoidea)." G. G. Simpson (1945) PMID:19432984
Migration and persistence of human influenza A viruses, Vietnam, 2001-2008.
Le, Mai Quynh; Lam, Ha Minh; Cuong, Vuong Duc; Lam, Tommy Tsan-Yuk; Halpin, Rebecca A; Wentworth, David E; Hien, Nguyen Tran; Thanh, Le Thi; Phuong, Hoang Vu Mai; Horby, Peter; Boni, Maciej F
2013-11-01
Understanding global influenza migration and persistence is crucial for vaccine strain selection. Using 240 new human influenza A virus whole genomes collected in Vietnam during 2001-2008, we looked for persistence patterns and migratory connections between Vietnam and other countries. We found that viruses in Vietnam migrate to and from China, Hong Kong, Taiwan, Cambodia, Japan, South Korea, and the United States. We attempted to reduce geographic bias by generating phylogenies subsampled at the year and country levels. However, migration events in these phylogenies were still driven by the presence or absence of sequence data, indicating that an epidemiologic study design that controls for prevalence is required for robust migration analysis. With whole-genome data, most migration events are not detectable from the phylogeny of the hemagglutinin segment alone, although general migratory relationships between Vietnam and other countries are visible in the hemagglutinin phylogeny. It is possible that virus lineages in Vietnam persisted for >1 year.
DNA barcoding and phylogeny of Calidris and Tringa (Aves: Scolopacidae).
Huang, Zuhao; Tu, Feiyun
2017-07-01
The avian genera Calidris and Tringa are the largest of the widespread family Scolopacidae. The phylogeny of members of the two genera is still a matter of controversy. Mitochondrial cytochrome c oxidase subunit I (COI) can serve as a fast and accurate marker for the identification and phylogeny of animal species. In this study, we analyzed the COI barcodes of thirty-one species of the two genera. All the species had distinct COI sequences. Two hundred and twenty-one variable sites were identified. Kimura two-parameter distances were calculated between barcodes. Neighbor-joining and maximum likelihood methods were used to construct phylogenetic trees. All the species could be discriminated by their distinct clades in the phylogenetic trees. The phylogenetic trees grouped the species of Calidris and of Tringa into two separate monophyletic clades. COI data showed a well-supported phylogeny for Calidris and Tringa species.
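The Kimura two-parameter distance mentioned above can be written as d = -1/2 ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P and Q are the proportions of sites differing by a transition and by a transversion. The following is a minimal sketch of that calculation for two aligned barcodes, not the software used in the study; the function name `k2p_distance` and the choice to skip sites containing gaps or ambiguity codes are assumptions made for illustration.

```python
import math

PURINES = {"A", "G"}
VALID_BASES = {"A", "C", "G", "T"}


def k2p_distance(seq1, seq2):
    """Kimura two-parameter distance between two aligned DNA sequences.

    Sites where either sequence has a gap or an ambiguity code are skipped
    (an assumption of this sketch; other tools may handle them differently).
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")

    compared = transitions = transversions = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a not in VALID_BASES or b not in VALID_BASES:
            continue
        compared += 1
        if a == b:
            continue
        # Purine<->purine or pyrimidine<->pyrimidine changes are transitions.
        if (a in PURINES) == (b in PURINES):
            transitions += 1
        else:
            transversions += 1

    if compared == 0:
        raise ValueError("no comparable sites")

    p = transitions / compared
    q = transversions / compared
    term = (1 - 2 * p - q) * math.sqrt(max(1 - 2 * q, 0.0))
    if term <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(term)
```

Because the correction treats transitions and transversions separately, a single transition and a single transversion over the same number of sites give slightly different distances.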
Enterobacter muelleri sp. nov., isolated from the rhizosphere of Zea mays.
Kämpfer, Peter; McInroy, John A; Glaeser, Stefanie P
2015-11-01
A beige-pigmented, oxidase-negative bacterial strain (JM-458T), isolated from a rhizosphere sample, was studied using a polyphasic taxonomic approach. Cells of the isolate were rod-shaped and stained Gram-negative. A comparison of the 16S rRNA gene sequence of strain JM-458T with sequences of the type strains of closely related species of the genus Enterobacter showed that it shared highest sequence similarity with Enterobacter mori (98.7 %), Enterobacter hormaechei (98.3 %), Enterobacter cloacae subsp. dissolvens, Enterobacter ludwigii and Enterobacter asburiae (all 98.2 %). 16S rRNA gene sequence similarities to all other Enterobacter species were below 98 %. Multilocus sequence analysis based on concatenated partial rpoB, gyrB, infB and atpD gene sequences showed a clear distinction of strain JM-458T from its closest related type strains. The fatty acid profile of the strain consisted of C16 : 0, C17 : 0 cyclo, iso-C15 : 0 2-OH/C16 : 1ω7c and C18 : 1ω7c as major components. DNA-DNA hybridizations between strain JM-458T and the type strains of E. mori, E. hormaechei and E. ludwigii resulted in relatedness values of 29 % (reciprocal 25 %), 24 % (reciprocal 43 %) and 16 % (reciprocal 17 %), respectively. DNA-DNA hybridization results together with multilocus sequence analysis results and differential biochemical and chemotaxonomic properties showed that strain JM-458T represents a novel species of the genus Enterobacter, for which the name Enterobacter muelleri sp. nov. is proposed. The type strain is JM-458T ( = DSM 29346T = CIP 110826T = LMG 28480T = CCM 8546T).
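The concatenation step of a multilocus sequence analysis can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function name `concatenate_loci`, the strain labels, and the toy sequences are hypothetical, and padding a missing locus with gap characters is one possible convention among several. Only the four locus names come from the abstract.

```python
def concatenate_loci(alignments, locus_order):
    """Join per-locus alignments into one concatenated sequence per strain.

    alignments: dict mapping locus name -> {strain name: aligned sequence}.
    locus_order: list of locus names giving the concatenation order.
    A strain missing from a locus is padded with '-' for that locus.
    """
    strains = sorted({s for locus in locus_order for s in alignments[locus]})
    parts = {strain: [] for strain in strains}

    for locus in locus_order:
        seqs = alignments[locus]
        lengths = {len(seq) for seq in seqs.values()}
        if len(lengths) != 1:
            raise ValueError(f"locus {locus!r} is not aligned to a single length")
        width = lengths.pop()
        for strain in strains:
            parts[strain].append(seqs.get(strain, "-" * width))

    return {strain: "".join(chunks) for strain, chunks in parts.items()}


if __name__ == "__main__":
    # Toy data: sequences are invented and far shorter than real gene fragments.
    toy = {
        "rpoB": {"strain_1": "ATGC", "strain_2": "ATGT"},
        "gyrB": {"strain_1": "GGA", "strain_2": "GGC"},
        "infB": {"strain_1": "TTAC", "strain_2": "TTAC"},
        "atpD": {"strain_1": "CA"},  # strain_2 lacks this locus
    }
    for name, seq in concatenate_loci(toy, ["rpoB", "gyrB", "infB", "atpD"]).items():
        print(name, seq)
```

The concatenated matrix is what a tree-building program would then take as input; keeping the locus order explicit makes it possible to define per-gene partitions later.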
Bull, Carolee T; Clarke, Christopher R; Cai, Rongman; Vinatzer, Boris A; Jardini, Teresa M; Koike, Steven T
2011-07-01
Since 2002, severe leaf spotting on parsley (Petroselinum crispum) has occurred in Monterey County, CA. Either of two different pathovars of Pseudomonas syringae sensu lato were isolated from diseased leaves from eight distinct outbreaks and once from the same outbreak. Fragment analysis of DNA amplified between repetitive sequence polymerase chain reaction; 16S rDNA sequence analysis; and biochemical, physiological, and host range tests identified the pathogens as Pseudomonas syringae pv. apii and P. syringae pv. coriandricola. Koch's postulates were completed for the isolates from parsley, and host range tests with parsley isolates and pathotype strains demonstrated that P. syringae pv. apii and P. syringae pv. coriandricola cause leaf spot diseases on parsley, celery, and coriander or cilantro. In a multilocus sequence typing (MLST) approach, four housekeeping gene fragments were sequenced from 10 strains isolated from parsley and 56 pathotype strains of P. syringae. Allele sequences were uploaded to the Plant-Associated Microbes Database and a phylogenetic tree was built based on concatenated sequences. Tree topology directly corresponded to P. syringae genomospecies and P. syringae pv. apii was allocated appropriately to genomospecies 3. This is the first demonstration that MLST can accurately allocate new pathogens directly to P. syringae sensu lato genomospecies. According to MLST, P. syringae pv. coriandricola is a member of genomospecies 9, P. cannabina. In a blind test, both P. syringae pv. coriandricola and P. syringae pv. apii isolates from parsley were correctly identified to pathovar. In both cases, MLST described diversity within each pathovar that was previously unknown.
Medina, M; Collins, A G; Silberman, J D; Sogin, M L
2001-08-14
We studied the evolutionary relationships among basal metazoan lineages by using complete large subunit (LSU) and small subunit (SSU) ribosomal RNA sequences for 23 taxa. After identifying competing hypotheses, we performed maximum likelihood searches for trees conforming to each hypothesis. Kishino-Hasegawa tests were used to determine whether the data (LSU, SSU, and combined) reject any of the competing hypotheses. We also conducted unconstrained tree searches, compared the resulting topologies, and calculated bootstrap indices. Shimodaira-Hasegawa tests were applied to determine whether the data reject any of the topologies resulting from the constrained and unconstrained tree searches. LSU, SSU, and the combined data strongly contradict two assertions pertaining to sponge phylogeny. Hexactinellid sponges are not likely to be the basal lineage of a monophyletic Porifera or the sister group to all other animals. Instead, Hexactinellida and Demospongia form a well-supported clade of siliceous sponges, Silicea. It remains unclear, on the basis of these data alone, whether the calcarean sponges are more closely related to Silicea or to nonsponge animals. The SSU and combined data reject the hypothesis that Bilateria is more closely related to Ctenophora than it is to Cnidaria, whereas LSU data alone do not refute either hypothesis. LSU and SSU data agree in supporting the monophyly of Bilateria, Cnidaria, Ctenophora, and Metazoa. LSU sequence data reveal phylogenetic structure in a data set with limited taxon sampling. Continued accumulation of LSU sequences should increase our understanding of animal phylogeny.
Foster, Charles S P; Henwood, Murray J; Ho, Simon Y W
2018-05-25
Data sets comprising small numbers of genetic markers are not always able to resolve phylogenetic relationships. This has frequently been the case in molecular systematic studies of plants, with many analyses being based on sequence data from only two or three chloroplast genes. An example of this comes from the riceflowers Pimelea Banks & Sol. ex Gaertn. (Thymelaeaceae), a large genus of flowering plants predominantly distributed in Australia. Despite the considerable morphological variation in the genus, low sequence divergence in chloroplast markers has led to the phylogeny of Pimelea remaining largely uncertain. In this study, we resolve the backbone of the phylogeny of Pimelea in comprehensive Bayesian and maximum-likelihood analyses of plastome sequences from 41 taxa. However, some relationships received only moderate to poor support, and the Pimelea clade contained extremely short internal branches. By using topology-clustering analyses, we demonstrate that conflicting phylogenetic signals can be found across the trees estimated from individual chloroplast protein-coding genes. A relaxed-clock dating analysis reveals that Pimelea arose in the mid-Miocene, with most divergences within the genus occurring during a subsequent rapid diversification. Our new phylogenetic estimate offers better resolution and is more strongly supported than previous estimates, providing a platform for future taxonomic revisions of both Pimelea and the broader subfamily. Our study has demonstrated the substantial improvements in phylogenetic resolution that can be achieved using plastome-scale data sets in plant molecular systematics. Copyright © 2018 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Medina, Monica; Collins, Allen G.; Silberman, Jeffrey
2001-06-21
We studied the evolutionary relationships among basal metazoan lineages by using complete large subunit (LSU) and small subunit (SSU) ribosomal RNA sequences for 23 taxa. After identifying competing hypotheses, we performed maximum likelihood searches for trees conforming to each hypothesis. Kishino-Hasegawa tests were used to determine whether the data (LSU, SSU, and combined) reject any of the competing hypotheses. We also conducted unconstrained tree searches, compared the resulting topologies, and calculated bootstrap indices. Shimodaira-Hasegawa tests were applied to determine whether the data reject any of the topologies resulting from the constrained and unconstrained tree searches. LSU, SSU, and the combined data strongly contradict two assertions pertaining to sponge phylogeny. Hexactinellid sponges are not likely to be the basal lineage of a monophyletic Porifera or the sister group to all other animals. Instead, Hexactinellida and Demospongia form a well-supported clade of siliceous sponges, Silicea. It remains unclear, on the basis of these data alone, whether the calcarean sponges are more closely related to Silicea or to nonsponge animals. The SSU and combined data reject the hypothesis that Bilateria is more closely related to Ctenophora than it is to Cnidaria, whereas LSU data alone do not refute either hypothesis. LSU and SSU data agree in supporting the monophyly of Bilateria, Cnidaria, Ctenophora, and Metazoa. LSU sequence data reveal phylogenetic structure in a data set with limited taxon sampling. Continued accumulation of LSU sequences should increase our understanding of animal phylogeny.
Agatha, Sabine; Strüder-Kypke, Michaela C.
2010-01-01
The phylogeny within the order Choreotrichida is reconstructed using (i) morphologic, ontogenetic, and ultrastructural evidence for the cladistic approach and (ii) the small subunit ribosomal RNA (SSrRNA) gene sequences, including the new sequence of Rimostrombidium lacustris. The morphologic cladograms and the gene trees converge rather well for the Choreotrichida, demonstrating that hyaline and agglutinated loricae do not characterize distinct lineages, i.e., both lorica types can be associated with the most highly developed ciliary pattern. The position of Rimostrombidium lacustris within the family Strobilidiidae is corroborated by the genealogical analyses. The diagnosis of the genus Tintinnidium is improved, adding cytological features, and the genus is divided into two subgenera based on the structure of the somatic kineties. The diagnosis of the family Lohmanniellidae and the genus Lohmanniella are improved, and Rimostrombidium glacicolum Petz, Song and Wilbert, 1995 is affiliated. PMID:17166704
Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic
Yebra, Gonzalo; Hodcroft, Emma B.; Ragonnet-Cronin, Manon L.; Pillay, Deenan; Brown, Andrew J. Leigh; Fraser, Christophe; Kellam, Paul; de Oliveira, Tulio; Dennis, Ann; Hoppe, Anne; Kityo, Cissy; Frampton, Dan; Ssemwanga, Deogratius; Tanser, Frank; Keshani, Jagoda; Lingappa, Jairam; Herbeck, Joshua; Wawer, Maria; Essex, Max; Cohen, Myron S.; Paton, Nicholas; Ratmann, Oliver; Kaleebu, Pontiano; Hayes, Richard; Fidler, Sarah; Quinn, Thomas; Novitsky, Vladimir; Haywards, Andrew; Nastouli, Eleni; Morris, Steven; Clark, Duncan; Kozlakidis, Zisis
2016-01-01
HIV molecular epidemiology studies analyse viral pol gene sequences due to their availability, but whole genome sequencing allows other genes to be used. We aimed to determine what gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTR + Γ), and compared their topologies to the corresponding true tree’s using CompareTree. The accuracy of the trees was significantly proportional to the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences. PMID:28008945
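The sub-sampling design described above (four sampling depths, 100 replicates each) can be sketched as follows. This is a hedged illustration only: the function name `subsample_replicates`, the use of simple random sampling without replacement, and the seeding scheme are assumptions, and the abstract does not state how the replicates were actually drawn.

```python
import random


def subsample_replicates(sequence_ids, depths=(1.0, 0.6, 0.2, 0.05),
                         n_replicates=100, seed=0):
    """Draw replicate subsamples of sequence IDs at each sampling depth.

    Returns a dict mapping depth -> list of replicates, where each replicate
    is a sorted list of IDs sampled without replacement.
    """
    rng = random.Random(seed)  # fixed seed so the replicates are reproducible
    ids = list(sequence_ids)
    replicates = {}
    for depth in depths:
        k = max(1, round(len(ids) * depth))
        replicates[depth] = [sorted(rng.sample(ids, k))
                             for _ in range(n_replicates)]
    return replicates


if __name__ == "__main__":
    # Hypothetical IDs standing in for the 4662 simulated sequences.
    ids = [f"seq{i:04d}" for i in range(4662)]
    reps = subsample_replicates(ids, n_replicates=3)
    for depth, samples in reps.items():
        print(depth, len(samples), len(samples[0]))
```

Each replicate would then be paired with every gene combination before tree building, so that differences in accuracy can be attributed to sequence length or sampling depth rather than to which individuals happened to be drawn.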
Yebra, Gonzalo; Hodcroft, Emma B; Ragonnet-Cronin, Manon L; Pillay, Deenan; Brown, Andrew J Leigh
2016-12-23
HIV molecular epidemiology studies analyse viral pol gene sequences due to their availability, but whole genome sequencing allows other genes to be used. We aimed to determine what gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTR + Γ), and compared their topologies to the corresponding true tree's using CompareTree. The accuracy of the trees was significantly proportional to the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences.
Jang, Kuem Hee; Hwang, Ui Wook
2009-01-01
Background The phylogenetic position of Bryozoa is one of the most controversial issues in metazoan phylogeny. In an attempt to address this issue, the first bryozoan mitochondrial genome from Flustrellidra hispida (Gymnolaemata, Ctenostomata) was recently sequenced and characterized. Unfortunately, it has extensive gene translocation and extremely reduced size. In addition, the phylogenies obtained from the result were conflicting, so they failed to assign a reliable phylogenetic position to Bryozoa or to clarify lophophorate phylogeny. Thus, it is necessary to characterize further mitochondrial genomes from slowly-evolving bryozoans to obtain a more credible lophophorate phylogeny. Results The complete mitochondrial genome (15,433 bp) of Bugula neritina (Bryozoa, Gymnolaemata, Cheilostomata), one of the most widely distributed cheilostome bryozoans, is sequenced. This second bryozoan mitochondrial genome contains the set of 37 components generally observed in other metazoans, differing from that of F. hispida (Bryozoa, Gymnolaemata, Ctenostomata), which has only 36 components with loss of tRNAser(ucn) genes. The B. neritina mitochondrial genome possesses 27 multiple noncoding regions. The gene order is more similar to those of the two remaining lophophorate phyla (Brachiopoda and Phoronida) and a chiton Katharina tunicata than to that of F. hispida. Phylogenetic analyses based on the nucleotide sequences or amino acid residues of 12 protein-coding genes showed consistently that, within the Lophotrochozoa, the monophyly of the bryozoan class Gymnolaemata (B. neritina and F. hispida) was strongly supported and the bryozoan clade was grouped with brachiopods. Echiura appeared as a subtaxon of Annelida, and Entoprocta as a sister taxon of Phoronida. The clade of Bryozoa + Brachiopoda was clustered with either the clade of Annelida-Echiura or that of Phoronida + Entoprocta. Conclusion This study presents the complete mitochondrial genome of a cheilostome bryozoan, B. neritina. The phylogenetic analyses suggest a close relationship between Bryozoa and Brachiopoda within the Lophotrochozoa. However, the sister group of Bryozoa + Brachiopoda is still ambiguous, although it shows some affinity to Annelida-Echiura or Phoronida + Entoprocta. If the latter is a true phylogeny, lophophorate monophyly including Entoprocta is supported. Consequently, the present results imply that Brachiozoa (= Brachiopoda + Phoronida) and the recently-resurrected Bryozoa concept comprising Ectoprocta and Entoprocta may be refuted. PMID:19379522
New Insights into the Diversity of the Genus Faecalibacterium.
Benevides, Leandro; Burman, Sriti; Martin, Rebeca; Robert, Véronique; Thomas, Muriel; Miquel, Sylvie; Chain, Florian; Sokol, Harry; Bermudez-Humaran, Luis G; Morrison, Mark; Langella, Philippe; Azevedo, Vasco A; Chatel, Jean-Marc; Soares, Siomar
2017-01-01
Faecalibacterium prausnitzii is a commensal bacterium, ubiquitous in the gastrointestinal tracts of animals and humans. This species is a functionally important member of the microbiota and studies suggest it has an impact on the physiology and health of the host. F. prausnitzii is the only identified species in the genus Faecalibacterium, but a recent study clustered strains of this species in two different phylogroups. Here, we propose the existence of distinct species in this genus through the use of comparative genomics. Briefly, we performed analyses of 16S rRNA gene phylogeny, phylogenomics, whole genome Multi-Locus Sequence Typing (wgMLST), Average Nucleotide Identity (ANI), gene synteny, and pangenome to better elucidate the phylogenetic relationships among strains of Faecalibacterium. For this, we used 12 newly sequenced, assembled, and curated genomes of F. prausnitzii, which were isolated from feces of healthy volunteers from France and Australia, and combined these with published data from 5 strains downloaded from public databases. The phylogenetic analysis of the 16S rRNA sequences, together with the wgMLST profiles and a phylogenomic tree based on comparisons of genome similarity, all supported the clustering of Faecalibacterium strains in different genospecies. Additionally, the global analysis of gene synteny among all strains showed a highly fragmented profile, whereas the intra-cluster analyses revealed larger and more conserved collinear blocks. Finally, ANI analysis substantiated the presence of three distinct clusters (A, B, and C) composed of five, four, and four strains, respectively. The pangenome analysis of each cluster corroborated the classification of these clusters into three distinct species, each containing less variability than that found within the global pangenome of all strains. Here, we propose that comparison of pangenome subsets and their associated α values may be used as an alternative approach, together with ANI, in the in silico classification of new species. Altogether, our results provide evidence not only for the reconsideration of the phylogenetic and genomic relatedness among strains currently assigned to F. prausnitzii, but also the need for lineage (strain-based) differentiation of this taxon to better define how specific members might be associated with positive or negative host interactions.
The Evolution of SINEs and LINEs in the genus Chironomus (Diptera).
Papusheva, Ekaterina; Gruhl, Mary C; Berezikov, Eugene; Groudieva, Tatiana; Scherbik, Svetlana V; Martin, Jon; Blinov, Alexander; Bergtrom, Gerald
2004-03-01
Genomic DNA amplification from 51 species of the family Chironomidae shows that most contain relatives of NLRCth1 LINE and CTRT1 SINE retrotransposons first found in Chironomus thummi. More than 300 cloned PCR products were sequenced. The amplified region of the reverse transcriptase gene in the LINEs is intact and highly conserved, suggesting active elements. The SINEs are less conserved, consistent with minimal/no selection after transposition. A mitochondrial gene phylogeny resolves the Chironomus genus into six lineages (Guryev et al. 2001). LINE and SINE phylogenies resolve five of these lineages, indicating their monophyletic origin and vertical inheritance. However, both the LINE and the SINE tree topologies differ from the species phylogeny, resolving the elements into "clusters I-IV" and "cluster V" families. The data suggest a descent of all LINE and SINE subfamilies from two major families. Based on the species phylogeny, a few LINEs and a larger number of SINEs are cladistically misplaced. Most misbranch with LINEs or SINEs from species with the same families of elements. From sequence comparisons, cladistically misplaced LINEs and several misplaced SINEs arose by convergent base substitutions. More diverged SINEs result from early transposition and some are derived from multiple source SINEs in the same species. SINEs from two species (C. dorsalis, C. pallidivittatus), expected to belong to the clusters I-IV family, branch instead with cluster V family SINEs; apparently both families predate separation of cluster V from clusters I-IV species. Correlation of the distribution of active SINEs and LINEs, as well as similar 3' sequence motifs in CTRT1 and NLRCth1, suggests coevolving retrotransposon pairs in which CTRT1 transposition depends on enzymes active during NLRCth1 LINE mobility.
Chen, Meng-Yun; Liang, Dan; Zhang, Peng
2017-08-01
The interordinal relationships of Laurasiatherian mammals are currently one of the most controversial questions in mammalian phylogenetics. Previous studies mainly relied on coding sequences (CDS) and seldom used noncoding sequences. Here, by data mining public genome data, we compiled an intron data set of 3,638 genes (all introns from a protein-coding gene are considered as a gene) (19,055,073 bp) and a CDS data set of 10,259 genes (20,994,285 bp), covering all major lineages of Laurasiatheria (except Pholidota). We found that the intron data contained stronger and more congruent phylogenetic signals than the CDS data. In agreement with this observation, concatenation and species-tree analyses of the intron data set yielded well-resolved and identical phylogenies, whereas the CDS data set produced weakly supported and incongruent results. Further analyses showed that the phylogeny inferred from the intron data is highly robust to data subsampling and change in outgroup, but the CDS data produced unstable results under the same conditions. Interestingly, gene tree statistical results showed that the most frequently observed gene tree topologies for the CDS and intron data are identical, suggesting that the major phylogenetic signal within the CDS data is actually congruent with that within the intron data. Our final result of Laurasiatheria phylogeny is (Eulipotyphla,((Chiroptera, Perissodactyla),(Carnivora, Cetartiodactyla))), favoring a close relationship between Chiroptera and Perissodactyla. Our study 1) provides a well-supported phylogenetic framework for Laurasiatheria, representing a step towards ending the long-standing "hard" polytomy and 2) argues that intron within genome data is a promising data resource for resolving rapid radiation events across the tree of life. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Ron, Santiago R; Santos, Juan C; Cannatella, David C
2006-05-01
We present a phylogeny of the Neotropical genus Engystomops (= Physalaemus pustulosus species group) based on sequences of approximately 2.4 kb of mtDNA, (12S rRNA, valine-tRNA, and 16S rRNA) and propose a phylogenetic nomenclature. The phylogeny includes all described taxa and two unnamed species. All analyses indicate that Engystomops is monophyletic and contains two basal allopatric clades. Clade I (Edentulus) includes E. pustulosus and the Amazonian E. petersi + E. cf. freibergi. Clade II (Duovox) includes all species distributed in W Ecuador and NW Peru. Brevivox, a clade of small-sized species is strongly supported within Duovox. Populations of Engystomops pustulosus fall into two well-supported clades, each of which occupies two disjunct portions of the species range. Overall, our phylogeny is congruent with most previous hypotheses. This study is among the few published species-level phylogenies of Neotropical amphibians derived from molecular datasets. A review of the proportion of new species detected by similar studies suggests that the increasing use of molecular techniques will lead to the discovery of a vast number of species of Neotropical amphibians.
C. Mae Culumber; Steve R. Larson; Kevin B. Jensen; Thomas A. Jones
2011-01-01
Leymus is a genomically defined allopolyploid of the tribe Triticeae with two distinct subgenomes. Chloroplast DNA sequences of Eurasian and North American species are distinct and polyphyletic. However, phylogenies derived from chloroplast and nuclear DNA sequences are confounded by polyploidy and lack of polymorphism among many taxa. The AFLP technique can resolve...
Ned B. Klopfenstein; John W. Hanna; Amy L. Ross-Davis; Jane E. Stewart; Yuko Ota; Rosario Medel-Ortiz; Miguel Armando Lopez-Ramirez; Ruben Damian Elias-Roman; Dionicio Alvarado-Rosales; Mee-Sook Kim
2013-01-01
Armillaria plays diverse ecological roles in forests worldwide, which has inspired interest in understanding phylogenetic relationships within and among species of this genus. Previous rDNA sequence-based phylogenetic analyses of Armillaria have shown general relationships among widely divergent taxa, but rDNA sequences were not reliable for separating closely related...
Kyrillos, Alexandra; Arora, Gaurav; Murray, Bradley; Rosenwald, Anne G
2016-06-01
The bacterium Helicobacter pylori is associated with ulcers and the development of gastric cancer. Several genes, including cytotoxin-associated gene A (CagA) and vacuolating cytotoxin A (VacA), are associated with increased gastric cancer risk. Some strains of H. pylori also contain sequences related to bacteriophage phiHP33; however, the significance of these phage-related sequences remains unknown. We assessed the extent to which phiHP33-related sequences are present in 335 H. pylori strains using homology searches then mapped shared genes between phiHP33 and H. pylori strains onto an existing phylogeny. One hundred and twenty-one H. pylori strains contain phage orthologous sequences, and the presence of the phage-related sequences correlates with the presence of CagA and VacA. Mapping of the phage orthologs onto a phylogeny of H. pylori is consistent with the hypothesis that these genes were acquired by horizontal gene transfer. phiHP33 phage orthologous sequences might be of significance in understanding virulence of different H. pylori strains. © 2015 John Wiley & Sons Ltd.
Detection and characterization of Pasteuria 16S rRNA gene sequences from nematodes and soils.
Duan, Y P; Castro, H F; Hewlett, T E; White, J H; Ogram, A V
2003-01-01
Various bacterial species in the genus Pasteuria have great potential as biocontrol agents against plant-parasitic nematodes, although study of this important genus is hampered by the current inability to cultivate Pasteuria species outside their host. To aid in the study of this genus, an extensive 16S rRNA gene sequence phylogeny was constructed and this information was used to develop cultivation-independent methods for detection of Pasteuria in soils and nematodes. Thirty new clones of Pasteuria 16S rRNA genes were obtained directly from nematodes and soil samples. These were sequenced and used to construct an extensive phylogeny of this genus. These sequences were divided into two deeply branching clades within the low-G+C, Gram-positive division; some sequences appear to represent novel species within the genus Pasteuria. In addition, a surprising degree of 16S rRNA gene sequence diversity was observed within what had previously been designated a single strain of Pasteuria penetrans (P-20). PCR primers specific to Pasteuria 16S rRNA for detection of Pasteuria in soils were also designed and evaluated. Detection limits for soil DNA were 100-10,000 Pasteuria endospores per gram of soil.
Reimer, Aleisha; Verghese, Bindhu; Lok, Mei; Ziegler, Jennifer; Farber, Jeffrey; Pagotto, Franco; Graham, Morag; Nadon, Celine A.
2012-01-01
Human listeriosis outbreaks in Canada have been predominantly caused by serotype 1/2a isolates with highly similar pulsed-field gel electrophoresis (PFGE) patterns. Multilocus sequence typing (MLST) and multi-virulence-locus sequence typing (MVLST) each identified a diverse population of Listeria monocytogenes isolates, and within that, both methods had congruent subtypes that substantiated a predominant clone (clonal complex 8; virulence type 59; proposed epidemic clone 5 [ECV]) that has been causing human illness across Canada for more than 2 decades. PMID:22337989
High-resolution typing of Chlamydia trachomatis: epidemiological and clinical uses.
de Vries, Henry J C; Schim van der Loeff, Maarten F; Bruisten, Sylvia M
2015-02-01
A state-of-the-art overview of molecular Chlamydia trachomatis typing methods that are used for routine diagnostics and scientific studies. Molecular epidemiology uses high-resolution typing techniques such as multilocus sequence typing, multilocus variable number of tandem repeats analysis, and whole-genome sequencing to identify strains based on their DNA sequence. These data can be used for cluster, network and phylogenetic analyses, and are used to unveil transmission networks, risk groups, and evolutionary pathways. High-resolution typing of C. trachomatis strains is applied to monitor treatment efficacy and re-infections, and to study the recent emergence of lymphogranuloma venereum (LGV) amongst men who have sex with men in high-income countries. Chlamydia strain typing has clinical relevance in disease management, as LGV needs longer treatment than non-LGV C. trachomatis. It has also led to the discovery of a new variant Chlamydia strain in Sweden, which was not detected by some commercial C. trachomatis diagnostic platforms. After a brief history and comparison of the various Chlamydia typing methods, the applications of the current techniques are described and future endeavors to extend scientific understanding are formulated. High-resolution typing will likely help to further unravel the pathophysiological mechanisms behind the wide clinical spectrum of chlamydial disease.
Phillips, Anastasia; Sotomayor, Cristina; Wang, Qinning; Holmes, Nadine; Furlong, Catriona; Ward, Kate; Howard, Peter; Octavia, Sophie; Lan, Ruiting; Sintchenko, Vitali
2016-09-15
Salmonella Typhimurium (STM) is an important cause of foodborne outbreaks worldwide. Subtyping of STM remains critical to outbreak investigation, yet current techniques (e.g. multilocus variable number tandem repeat analysis, MLVA) may provide insufficient discrimination. Whole genome sequencing (WGS) offers potentially greater discriminatory power to support infectious disease surveillance. We performed WGS on 62 STM isolates of a single, endemic MLVA type associated with two epidemiologically independent, food-borne outbreaks along with sporadic cases in New South Wales, Australia, during 2014. Genomes of case and environmental isolates were sequenced using HiSeq (Illumina) and the genetic distance between them was assessed by single nucleotide polymorphism (SNP) analysis. SNP analysis was compared to the epidemiological context. The WGS analysis supported epidemiological evidence and genomes of within-outbreak isolates were nearly identical. Sporadic cases differed from outbreak cases by a small number of SNPs, although their close relationship to outbreak cases may represent an unidentified common food source that may warrant further public health follow up. Previously unrecognised mini-clusters were detected. WGS of STM can discriminate foodborne community outbreaks within a single endemic MLVA clone. Our findings support the translation of WGS into public health laboratory surveillance of salmonellosis.
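The SNP-based comparison described above can be illustrated with a minimal Python sketch that counts pairwise differences between aligned sequences. The isolate names, the toy sequences and the function names are all hypothetical; the abstract does not describe the study's actual pipeline beyond HiSeq sequencing and SNP analysis, so this is only one plausible way to compute such distances.

```python
from itertools import combinations
from typing import Dict, Tuple


def snp_distance(a: str, b: str) -> int:
    """Count aligned positions where two sequences differ.

    Positions where either sequence has a gap ('-') or an ambiguous
    base ('N') are skipped (an assumption made for this sketch).
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    ignore = {"N", "-"}
    return sum(
        1
        for x, y in zip(a.upper(), b.upper())
        if x != y and x not in ignore and y not in ignore
    )


def pairwise_snp_matrix(seqs: Dict[str, str]) -> Dict[Tuple[str, str], int]:
    """Return SNP distances for every unordered pair of isolates."""
    return {
        (i, j): snp_distance(seqs[i], seqs[j])
        for i, j in combinations(sorted(seqs), 2)
    }


# Invented toy alignment, not data from the study.
isolates = {
    "outbreak_A1": "ACGTACGTAC",
    "outbreak_A2": "ACGTACGTAC",
    "sporadic_1": "ACGTACGTTC",
    "outbreak_B1": "ACGAACGTTN",
}

if __name__ == "__main__":
    for pair, dist in pairwise_snp_matrix(isolates).items():
        print(pair, dist)
```

In this toy data the two isolates from the same hypothetical outbreak are identical, while the sporadic isolate differs by a single SNP, mirroring the pattern the abstract reports.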
Danet, Jean Luc; Balakishiyeva, Gulnara; Cimerman, Agnès; Sauvion, Nicolas; Marie-Jeanne, Véronique; Labonne, Gérard; Lavina, Amparo; Batlle, Assumpcio; Krizanac, Ivana; Skoric, Dijana; Ermacora, Paolo; Serçe, Cigdem Ulubas; Caglayan, Kadriye; Jarausch, Wolfgang; Foissac, Xavier
2011-02-01
The genetic diversity of three temperate fruit tree phytoplasmas, 'Candidatus Phytoplasma prunorum', 'Ca. P. mali' and 'Ca. P. pyri', has been established by multilocus sequence analysis. Among the four genetic loci used, the genes imp and aceF distinguished 30 and 24 genotypes, respectively, and showed the highest variability. Percentage of substitution for imp ranged from 50 to 68% according to species. Percentage of substitution varied between 9 and 12% for aceF, whereas it was between 5 and 6% for pnp and secY. In the case of 'Ca. P. prunorum', the three most prevalent aceF genotypes were detected in both plants and insect vectors, confirming that the prevalent isolates are propagated by insects. The four isolates known to be hypo-virulent had the same aceF sequence, indicating a possible monophyletic origin. A haplotype network reconstructed by eBURST revealed that, among the 34 haplotypes of 'Ca. P. prunorum', the four hypo-virulent isolates also grouped together in the same clade. Genotyping of some Spanish and Azerbaijani 'Ca. P. pyri' isolates showed that they shared some alleles with 'Ca. P. prunorum', supporting, for the first time to our knowledge, the existence of inter-species recombination between these two species.
Laukkanen-Ninios, Riikka; Didelot, Xavier; Jolley, Keith A.; Morelli, Giovanna; Sangal, Vartul; Kristo, Paula; Imori, Priscilla F. M.; Fukushima, Hiroshi; Siitonen, Anja; Tseneva, Galina; Voskressenskaya, Ekaterina; Falcao, Juliana P.; Korkeala, Hannu; Maiden, Martin C. J.; Mazzoni, Camila; Carniel, Elisabeth; Skurnik, Mikael; Achtman, Mark
2014-01-01
Summary Multilocus sequence analysis of 417 strains of Yersinia pseudotuberculosis revealed that it is a complex of four populations, three of which have been previously assigned species status [Y. pseudotuberculosis sensu stricto (s.s.), Yersinia pestis and Yersinia similis] and a fourth population, which we refer to as the Korean group, which may be in the process of speciation. We detected clear signs of recombination within Y. pseudotuberculosis s.s. as well as imports from Y. similis and the Korean group. The sources of genetic diversification within Y. pseudotuberculosis s.s. were approximately equally divided between recombination and mutation, whereas recombination has not yet been demonstrated in Y. pestis, which is also much more genetically monomorphic than is Y. pseudotuberculosis s.s. Most Y. pseudotuberculosis s.s. belong to a diffuse group of sequence types lacking clear population structure, although this species contains a melibiose-negative clade that is present globally in domesticated animals. Yersinia similis corresponds to the previously identified Y. pseudotuberculosis genetic type G4, which is probably not pathogenic because it lacks the virulence factors that are typical for Y. pseudotuberculosis s.s. In contrast, Y. pseudotuberculosis s.s., the Korean group and Y. pestis can all cause disease in humans. PMID:21951486
Zhi-Bin Wen; Ming-Li Zhang; Ge-Lin Zhu; Stewart C. Sanderson
2010-01-01
To reconstruct phylogeny and verify the monophyly of major subgroups, a total of 52 species representing almost all species of Salsoleae s.l. in China were sampled, with analysis based on three molecular markers (nrDNA ITS, cpDNA psbB-psbH and rbcL), using maximum parsimony, maximum likelihood, and Bayesian inference methods. Our molecular evidence provides strong...
2002-01-01
numerous animal clades, including arthropods (Giribet & Ribera, 1998, 2000). The mitochondrial cytochrome oxidase subunits I and II have proven useful as ... 16S and 28S, D2 rRNA. Insect Molecular Biology, 6, 273-284. Giribet, G. & Ribera, C. (1998) The position of arthropods in animal kingdom: a search ... for a reliable outgroup for internal arthropod phylogeny. Molecular Phylogenetics and Evolution, 9, 481-488. Giribet, G. & Ribera, C. (2000) A review
Feng, Jing; Jiang, Yujun; Li, Mingyu; Zhao, Siyu; Zhang, Yanming; Li, Xuesong; Wang, Hui; Lin, Guangen; Wang, Hao; Li, Tiejing; Man, Chaoxin
2018-05-25
Bacteria in the Lactobacillus casei group, including Lactobacillus casei (L. casei), Lactobacillus paracasei (L. paracasei), and Lactobacillus rhamnosus (L. rhamnosus), are important lactic acid bacteria in the production of fermented dairy products, and their nomenclatural status is controversial because of their close phylogenetic similarity. To probe the evolution and phylogeny of the L. casei group, 100 isolates of lactic acid bacteria originating from naturally fermented dairy products in Tibet, China, were subjected to multilocus sequence typing (MLST). The MLST scheme, based on analysis of the housekeeping genes fusA, ileS, lepA, leuS, pyrG, recA and recG, revealed that all the isolates belonged to a group containing the L. paracasei reference strains and were clearly different from the strains of L. casei and L. rhamnosus. Although nucleotide diversity (π) was low for the seven genes (ranging from 0.00341 for fusA to 0.01307 for recG), high genetic diversity, represented by 83 sequence types (STs) with a discriminatory index of 0.98, was detected. A network-like structure based on split decomposition analysis, together with the high values of the relative effect of recombination and mutation in the diversification of the lineages (r/m = 4.76) and the relative frequency of occurrence of recombination and mutation (ρ/θ = 2.62), indicated that intra-species recombination occurred frequently and that homologous recombination played a key role in generating genotypic diversity amongst L. paracasei strains in Tibet. The discovery of 51 new STs and the results of STRUCTURE analysis suggested that the L. casei group in Tibet had a distinct population structure in comparison to European isolates. Overall, this research might be the first report on the genetic diversity and population structure of Lactobacillus populations isolated from naturally fermented dairy products in Tibet based on an MLST scheme.
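The two diversity statistics quoted above can be sketched in a few lines of Python. The abstract does not say which formula was used for the discriminatory index, so this sketch assumes the commonly used Hunter-Gaston form of Simpson's index, D = 1 - Σ n_j(n_j - 1) / (N(N - 1)), and takes nucleotide diversity (π) as the mean number of pairwise differences per site. Function names and the toy inputs are hypothetical.

```python
from collections import Counter
from itertools import combinations
from typing import Sequence


def discriminatory_index(types: Sequence[str]) -> float:
    """Hunter-Gaston index: probability that two isolates drawn without
    replacement belong to different types (assumed formula, see lead-in)."""
    n = len(types)
    if n < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(types)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


def nucleotide_diversity(seqs: Sequence[str]) -> float:
    """Mean pairwise differences per site across aligned sequences."""
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be non-empty and equally long")
    pairs = list(combinations(seqs, 2))
    total_diffs = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs)
    return total_diffs / (len(pairs) * length)


if __name__ == "__main__":
    # Toy inputs only; these are not the Tibetan isolates.
    print(discriminatory_index(["ST1", "ST1", "ST2", "ST3"]))
    print(nucleotide_diversity(["AAAA", "AAAT", "AATT"]))
```

Under this formula a collection in which nearly every isolate has its own sequence type gives an index close to 1, which is consistent with 83 STs among 100 isolates yielding 0.98.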
Gómez, Fernando; Moreira, David; López-García, Purificación
2012-01-01
Dinophysoid dinoflagellates are usually considered a large monophyletic group. Large subunit and small subunit (SSU) rDNA phylogenies suggest a basal position for Amphisoleniaceae (Amphisolenia, Triposolenia) with respect to two sister groups, one containing most Phalacroma species plus Oxyphysis and the other Dinophysis, Ornithocercus, Histioneis, Citharistes and some Phalacroma species. We provide here new SSU rDNA sequences of Pseudophalacroma (pelagic) and Sinophysis (the only benthic dinophysoid genus). Molecular phylogenies support that they are very divergent with respect to the main clade of Dinophysales. Additional molecular markers of these two key genera are needed to elucidate the evolutionary relations among the dinophysoid dinoflagellates. © 2011 The Author(s) Journal of Eukaryotic Microbiology © 2011 International Society of Protistologists.
Freitas, Ana R.; Novais, Carla; Ruiz-Garbajosa, Patricia; Coque, Teresa M.; Peixe, Luísa
2009-01-01
The population structure of 56 Enterococcus faecium isolates selected from a collection of enterococci from humans, animals, and the environment in Portugal (1997 to 2007) was analyzed by multilocus sequence typing. We identified 41 sequence types clustering into CC17, CC5, CC9, CC22 and CC94, all clonal lineages comprising isolates from different hosts. Our findings highlight the role of community-associated hosts as reservoirs of enterococci able to cause human infections. PMID:19447948
USDA-ARS's Scientific Manuscript database
Since 2002, severe leaf spotting on parsley (Petroselinum crispum L.) has occurred in Monterey County, California. One of two different pathovars of Pseudomonas syringae sensu lato was isolated from diseased leaves from seven distinct outbreaks and twice from the same outbreak (2002 and 2009). Frag...
Typing of Lymphogranuloma Venereum Chlamydia trachomatis Strains
Christerson, Linus; de Vries, Henry J.C.; de Barbeyrac, Bertille; Gaydos, Charlotte A.; Henrich, Birgit; Hoffmann, Steen; Schachter, Julius; Thorvaldsen, Johannes; Vall-Mayans, Martí; Klint, Markus; Morré, Servaas A.
2010-01-01
We analyzed by multilocus sequence typing 77 lymphogranuloma venereum Chlamydia trachomatis strains from men who have sex with men in Europe and the United States. Specimens from an outbreak in 2003 in Europe were monoclonal. In contrast, several strains were in the United States in the 1980s, including a variant from Europe. PMID:21029543
Streptococcus agalactiae serotype Ib as an agent of meningitis in two adult nonpregnant women.
Martins, E R; Florindo, C; Martins, F; Aldir, I; Borrego, M J; Brum, L; Ramirez, M; Melo-Cristino, J
2007-11-01
Two temporally and geographically clustered cases of meningitis caused by Streptococcus agalactiae expressing the infrequent Ib serotype are reported. Characterization by pulsed-field gel electrophoresis and multilocus sequence typing revealed that the isolates were identical and represented the widely distributed ST10/ST8 lineage associated with serotype Ib.
USDA-ARS's Scientific Manuscript database
Recent work has shown that Fusarium species and genotypes most commonly associated with human infections, particularly of the cornea (mycotic keratitis), are the same as those most commonly isolated from plumbing systems. The species most dominant in plumbing biofilms is Fusarium keratoplasticum, a ...
USDA-ARS's Scientific Manuscript database
Foodborne campylobacteriosis has been traced to undercooked chicken liver. The objectives of this study were to measure prevalence of Campylobacter associated with chicken livers at retail and determine which subtypes are detected on the surface and inner tissue of livers. Fifteen packages of fres...
Olsen, Anne Berit; Gulla, Snorre; Steinum, Terje; Colquhoun, Duncan J; Nilsen, Hanne K; Duchaud, Eric
2017-06-01
Skin ulcer development in sea-reared salmonids, commonly associated with Tenacibaculum spp., is a significant fish welfare and economic problem in Norwegian aquaculture. A collection of 89 Tenacibaculum isolates was subjected to multilocus sequence analysis (MLSA). The isolates were retrieved from outbreaks of clinical disease in farms spread along the Norwegian coastline from seven different fish species over a period of 19 years. MLSA reveals considerable genetic diversity, but allows identification of four main clades. One clade encompasses isolates belonging to the species T. dicentrarchi, whereas three clades encompass bacteria that likely represent novel, as yet undescribed species. The study identified T. maritimum in lumpsucker and T. ovolyticum in halibut, and extended the host and geographic range for T. soleae, isolated from wrasse. The overall lack of clonality and host specificity, with some indication of geographical range restriction, argues for local epidemics involving multiple strains. The diversity of Tenacibaculum isolates from fish displaying ulcerative disease may complicate vaccine development. Copyright © 2017 Elsevier B.V. All rights reserved.
Diaz, Maureen H; Winchell, Jonas M
2016-01-01
Over the past decade there have been significant advancements in the methods used for detecting and characterizing Mycoplasma pneumoniae, a common cause of respiratory illness and community-acquired pneumonia worldwide. The repertoire of available molecular diagnostics has greatly expanded from nucleic acid amplification techniques (NAATs) that encompass a variety of chemistries used for detection, to more sophisticated characterizing methods such as multi-locus variable-number tandem-repeat analysis (MLVA), Multi-locus sequence typing (MLST), matrix-assisted laser desorption ionization-time-of-flight mass spectrometry (MALDI-TOF MS), single nucleotide polymorphism typing, and numerous macrolide susceptibility profiling methods, among others. These many molecular-based approaches have been developed and employed to continually increase the level of discrimination and characterization in order to better understand the epidemiology and biology of M. pneumoniae. This review will summarize recent molecular techniques and procedures and lend perspective to how each has enhanced the current understanding of this organism and will emphasize how Next Generation Sequencing may serve as a resource for researchers to gain a more comprehensive understanding of the genomic complexities of this insidious pathogen.
Quéméneur, Marianne; Heinrich-Salmeron, Audrey; Muller, Daniel; Lièvremont, Didier; Jauzein, Michel; Bertin, Philippe N.; Garrido, Francis; Joulian, Catherine
2008-01-01
A new primer set was designed to specifically amplify ca. 1,100 bp of aoxB genes encoding the As(III) oxidase catalytic subunit from taxonomically diverse aerobic As(III)-oxidizing bacteria. Comparative analysis of AoxB protein sequences showed variable conservation levels and highlighted the conservation of essential amino acids and structural motifs. AoxB phylogeny of pure strains showed well-discriminated taxonomic groups and was similar to 16S rRNA phylogeny. Alphaproteobacteria-, Betaproteobacteria-, and Gammaproteobacteria-related sequences were retrieved from environmental surveys, demonstrating their prevalence in mesophilic As-contaminated soils. Our study underlines the usefulness of the aoxB gene as a functional marker of aerobic As(III) oxidizers. PMID:18502920
2011-01-01
Background: The genus Pyrus belongs to the tribe Pyreae (the former subfamily Maloideae) of the family Rosaceae, and includes one of the most important commercial fruit crops, pear. The phylogeny of Pyrus has not been definitively reconstructed. In our previous efforts, the internal transcribed spacer region (ITS) revealed a poorly resolved phylogeny due to non-concerted evolution of nrDNA arrays. Therefore, introns of low copy nuclear genes (LCNG) are explored here for improved resolution. However, paralogs and lineage sorting are still two challenges for applying LCNGs in phylogenetic studies, and at least two independent nuclear loci should be compared. In this work the second intron of LEAFY and the alcohol dehydrogenase gene (Adh) were selected to investigate their molecular evolution and phylogenetic utility. Results: DNA sequence analyses revealed a complex ortholog and paralog structure of Adh genes in Pyrus and Malus, the pears and apples. Comparisons between sequences from RT-PCR and genomic PCR indicate that some Adh homologs are putatively nonfunctional. A partial region of Adh1 was sequenced for 18 Pyrus species and three subparalogs representing Adh1-1 were identified. These led to poorly resolved phylogenies due to low sequence divergence and the inclusion of putative recombinants. For the second intron of LEAFY, multiple inparalogs were discovered for both LFY1int2 and LFY2int2. LFY1int2 is inadequate for phylogenetic analysis due to lineage sorting of two inparalogs. LFY2int2-N, however, showed a relatively high sequence divergence and led to the best-resolved phylogeny. This study documents the coexistence of outparalogs and inparalogs, and lineage sorting of these paralogs and orthologous copies. It reveals putative recombinants that can lead to incorrect phylogenetic inferences, and presents an improved phylogenetic resolution of Pyrus using LFY2int2-N. Conclusions: Our study represents the first phylogenetic analyses based on LCNGs in Pyrus. Ancient and recent duplications lead to a complex structure of Adh outparalogs and inparalogs in Pyrus and Malus, resulting in neofunctionalization, nonfunctionalization and possible subfunctionalization. Among all investigated orthologs, LFY2int2-N is the best nuclear marker for phylogenetic reconstruction of Pyrus due to suitable sequence divergence and the absence of lineage sorting. PMID:21917170
Kobayashi, Nobumichi; Nagashima, Shigeo
2009-01-01
We carried out the first study of Enterococcus faecalis clinical isolates in Cuba by multilocus sequence typing linking the molecular typing data with the presence of virulence determinants and the antibiotic resistance genes. A total of 23 E. faecalis isolates recovered from several clinic sources and geographic areas of Cuba during a period between 2000 and 2005 were typed by multilocus sequence typing. Thirteen sequence types (STs) including five novel STs were identified, and the ST 64 (clonal complex [CC] 8), ST 6 (CC2), ST 21(CC21), and ST 16 (CC58) were found in more than one strain. Sixty-seven percent of STs corresponded to STs reported previously in Spain, Poland, and The Netherlands, and other STs (ST115, ST64, ST6, and ST40) were genetically close to those detected in the United States. Prevalence of both antimicrobial resistance genes [aac(6′)-aph(2″), aph(3′), ant(6), ant(3″)(9), aph(2″)-Id, aph(2″)-Ic, erm(B), erm(A), erm(C), mef(A), tet(M), and tet(L)] and virulence genes (agg, gelE, cylA, esp, ccf, and efaAfs) were examined by polymerase chain reaction. Aminoglycoside resistance genes aac(6′)-Ie-aph(2″)-Ia, aph(3′), ant(6), ant(3″)(9) were more frequently detected in ST6, ST16, ST23, ST64, and ST115. The multidrug resistance was distributed to all STs detected, except for ST117 and singleton ST225. The presence of cyl gene was specifically linked to the ST64 and ST16. Presence of the esp, gel, and agg genes was not specific to any particular ST. This research provided the first insight into the population structure of E. faecalis in Cuba, that is, most Cuban strains were related to European strains, whereas others to U.S. strains. The CC2, CC21, and CC8, three of the biggest CCs in the world, were evidently circulating in Cuba, associated with multidrug resistance and virulence traits. PMID:19857135
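Multilocus sequence typing, as used above, reduces each isolate to a tuple of allele numbers (one per housekeeping locus) and looks that profile up in a table of known sequence types. The Python sketch below illustrates the idea under stated assumptions: the locus names, allele numbers and ST labels are placeholders invented for the example, not entries from the real E. faecalis scheme, and the single-locus-variant check is shown only as the usual building block for grouping STs into clonal complexes.

```python
from typing import Dict, Optional, Sequence, Tuple

# Placeholder locus names (hypothetical, not the real scheme's loci).
LOCI: Tuple[str, ...] = (
    "locus1", "locus2", "locus3", "locus4", "locus5", "locus6", "locus7",
)

Profile = Tuple[int, ...]

# Invented profile table; a real one would come from an MLST database.
KNOWN_STS: Dict[Profile, str] = {
    (1, 1, 1, 1, 1, 1, 1): "ST-A",
    (1, 2, 1, 1, 1, 1, 1): "ST-B",
}


def to_profile(alleles: Dict[str, int], loci: Sequence[str] = LOCI) -> Profile:
    """Order an isolate's allele calls by the scheme's locus order."""
    return tuple(alleles[locus] for locus in loci)


def assign_st(alleles: Dict[str, int],
              table: Dict[Profile, str] = KNOWN_STS) -> Optional[str]:
    """Return the matching ST label, or None for a novel profile."""
    return table.get(to_profile(alleles))


def is_single_locus_variant(p: Profile, q: Profile) -> bool:
    """True if two profiles differ at exactly one locus."""
    if len(p) != len(q):
        raise ValueError("profiles must cover the same loci")
    return sum(a != b for a, b in zip(p, q)) == 1


if __name__ == "__main__":
    isolate = {locus: 1 for locus in LOCI}
    print(assign_st(isolate))           # known profile
    isolate["locus7"] = 9
    print(assign_st(isolate))           # novel profile -> None
```

A profile that returns `None` corresponds to what the abstract calls a novel ST; it would be submitted to the scheme's curators for a new number rather than assigned locally.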
Phylogenetic relationships among North American Alosa species (Clupeidae)
B.R. Bowen; B.R. Kreiser; P.F. Mickel; J.F. Schaefer; S.B. Adams
2008-01-01
A phylogeny of the six North American species in the genus Alosa, with representatives of three Eurasian species, was generated using mtDNA sequences. This was accomplished by obtaining sequences for three North American species and additional geographical sampling of the other three species. The subgenus Alosa, including the...
Karami, Nahid; Helldal, Lisa; Welinder-Olsson, Christina; Ahrén, Christina; Moore, Edward R B
2013-01-01
Extended-spectrum β-lactamase producing Escherichia coli (ESBL-E. coli) were isolated from infants hospitalized in a neonatal, post-surgery ward during a four-month-long nosocomial outbreak and six-month follow-up period. A multi-locus variable number tandem repeat analysis (MLVA), using 10 loci (GECM-10), for 'generic' (i.e., non-STEC) E. coli was applied for sub-species-level (i.e., sub-typing) delineation and characterization of the bacterial isolates. Ten distinct GECM-10 types were detected among 50 isolates, correlating with the types defined by pulsed-field gel electrophoresis (PFGE), which is recognized to be the 'gold-standard' method for clinical epidemiological analyses. Multi-locus sequence typing (MLST), multiplex PCR genotyping of bla CTX-M, bla TEM, bla OXA and bla SHV genes and antibiotic resistance profiling, as well as a PCR assay specific for detecting isolates of the pandemic O25b-ST131 strain, further characterized the outbreak isolates. Two clusters of isolates with distinct GECM-10 types (G06-04 and G07-02), corresponding to two major PFGE types and the MLST-based sequence types (STs) 131 and 1444, respectively, were confirmed to be responsible for the outbreak. The application of GECM-10 sub-typing provided reliable, rapid and cost-effective epidemiological characterizations of the ESBL-producing isolates from a nosocomial outbreak that correlated with and may be used to replace the laborious PFGE protocol for analyzing generic E. coli.
Pinus ponderosa: A checkered past obscured four species.
Willyard, Ann; Gernandt, David S; Potter, Kevin; Hipkins, Valerie; Marquardt, Paula; Mahalovich, Mary Frances; Langer, Stephen K; Telewski, Frank W; Cooper, Blake; Douglas, Connor; Finch, Kristen; Karemera, Hassani H; Lefler, Julia; Lea, Payton; Wofford, Austin
2017-01-01
Molecular genetic evidence can help delineate taxa in species complexes that lack diagnostic morphological characters. Pinus ponderosa (Pinaceae; subsection Ponderosae) is recognized as a problematic taxon: plastid phylogenies of exemplars were paraphyletic, and mitochondrial phylogeography suggested at least four subdivisions of P. ponderosa. These patterns have not been examined in the context of other Ponderosae species. We hypothesized that putative intraspecific subdivisions might each represent a separate taxon. We genotyped six highly variable plastid simple sequence repeats in 1903 individuals from 88 populations of P. ponderosa and related Ponderosae (P. arizonica, P. engelmannii, and P. jeffreyi). We used multilocus haplotype networks and discriminant analysis of principal components to test clustering of individuals into genetically and geographically meaningful taxonomic units. There are at least four distinct plastid clusters within P. ponderosa that roughly correspond to the geographic distribution of mitochondrial haplotypes. Some geographic regions have intermixed plastid lineages, and some mitochondrial and plastid boundaries do not coincide. Based on relative distances to other species of Ponderosae, these clusters diagnose four distinct taxa. Newly revealed geographic boundaries of four distinct taxa (P. benthamiana, P. brachyptera, P. scopulorum, and a narrowed concept of P. ponderosa) do not correspond completely with taxonomies. Further research is needed to understand their morphological and nuclear genetic makeup, but we suggest that resurrecting originally published species names would more appropriately reflect the taxonomy of this checkered classification than their current treatment as varieties of P. ponderosa. © 2017 Willyard et al. Published by the Botanical Society of America. This work is licensed under a Creative Commons public domain license (CC0 1.0).
Lujan, Nathan K; Armbruster, Jonathan W; Lovejoy, Nathan R; López-Fernández, Hernán
2015-01-01
The Neotropical catfish family Loricariidae is the fifth most species-rich vertebrate family on Earth, with over 800 valid species. The Hypostominae is its most species-rich, geographically widespread, and ecomorphologically diverse subfamily. Here, we provide a comprehensive molecular phylogenetic reappraisal of genus-level relationships in the Hypostominae based on our sequencing and analysis of two mitochondrial and three nuclear loci (4293bp total). Our most striking large-scale systematic discovery was that the tribe Hypostomini, which has traditionally been recognized as sister to tribe Ancistrini based on morphological data, was nested within Ancistrini. This required recognition of seven additional tribe-level clades: the Chaetostoma Clade, the Pseudancistrus Clade, the Lithoxus Clade, the 'Pseudancistrus' Clade, the Acanthicus Clade, the Hemiancistrus Clade, and the Peckoltia Clade. Results of our analysis, which included type- and non-type species for every valid genus in Hypostominae, support the reevaluation and restriction of several historically problematic genera, including Baryancistrus, Cordylancistrus, Hemiancistrus, and Peckoltia. Much of the deep lineage diversity in Hypostominae is restricted to Guiana Shield and northern Andean drainages, with three tribe-level clades still largely restricted to the Guiana Shield. Of the six geographically widespread clades, a paraphyletic assemblage of three contain lineages restricted to drainages west of the Andes Mountains, suggesting that early diversification of the Hypostominae predated the late Miocene surge in Andean uplift. Our results also highlight examples of trophic ecological diversification and convergence in the Loricariidae, including support for three independent origins of highly similar and globally unique morphological specializations for eating wood. Copyright © 2014 Elsevier Inc. All rights reserved.
Evidence for common horizontal transmission of Wolbachia among butterflies and moths.
Ahmed, Muhammad Z; Breinholt, Jesse W; Kawahara, Akito Y
2016-05-27
Wolbachia is one of the most widespread bacteria on Earth. Previous research on Wolbachia-host interactions indicates that the bacterium is typically transferred vertically, from mother to offspring, through the egg cytoplasm. Although horizontal transmission of Wolbachia from one species to another is reported to be common in arthropods, limited direct ecological evidence is available. In this study, we examined horizontal transmission of Wolbachia using a multilocus sequence typing (MLST) strain dataset and used Wolbachia and Lepidoptera genomes to search for evidence of lateral gene transfer (LGT) in Lepidoptera, one of the most diverse cosmopolitan insect orders. We constructed a phylogeny of arthropod-associated MLST Wolbachia strains and calibrated the age of Wolbachia strains associated with lepidopteran species. Our results reveal inter-specific, inter-generic, inter-familial, and inter-ordinal horizontal transmission of Wolbachia strains, without discernible geographic patterns. We found at least seven probable cases of horizontal transmission among 31 species within Lepidoptera and between Lepidoptera and other arthropod hosts. The divergence time analysis revealed that Wolbachia was introduced into Lepidoptera recently (22.6-4.7 mya, 95% HPD). Analysis of nine Lepidoptera genomes (Bombyx mori, Danaus plexippus, Heliconius melpomene, Manduca sexta, Melitaea cinxia, Papilio glaucus, P. polytes, P. xuthus and Plutella xylostella) yielded one possible instance of Wolbachia LGT. Our results provide evidence of a high incidence of identical and multiple strains of Wolbachia among butterflies and moths, adding Lepidoptera to the growing body of evidence for common horizontal transmission of Wolbachia. This study demonstrates interesting dynamics of this remarkable and influential microorganism.
2016-01-01
Abstract Background Metabarcoding is becoming a common tool used to assess and compare diversity of organisms in environmental samples. Identification of OTUs is one of the critical steps in the process, and several taxonomy assignment methods have been proposed to accomplish this task. This publication evaluates the quality of reference datasets, along with several alignment and phylogeny inference methods used in one of the taxonomy assignment methods, called the tree-based approach. This approach assigns anonymous OTUs to taxonomic categories based on the relative placements of OTUs and reference sequences on the cladogram and the support that these placements receive. New information In the tree-based taxonomy assignment approach, reliable identification of anonymous OTUs is based on their placement in monophyletic and highly supported clades together with identified reference taxa. It therefore requires a high-quality reference dataset. Resolution of phylogenetic trees is strongly affected by the presence of erroneous sequences as well as by the alignment and phylogeny inference methods used in the process. Two preparation steps are essential for the successful application of the tree-based taxonomy assignment approach. Curated collections of genetic information do include erroneous sequences. These sequences have a detrimental effect on the resolution of cladograms used in the tree-based approach. They must be identified and excluded from the reference dataset beforehand. Various combinations of multiple sequence alignment and phylogeny inference methods provide cladograms with different topology and bootstrap support. These combinations of methods need to be tested in order to determine the one that gives the highest resolution for the particular reference dataset. Completing the above-mentioned preparation steps is expected to decrease the number of unassigned OTUs and thus improve the results of the tree-based taxonomy assignment approach. PMID:27932919
Alignment methods: strategies, challenges, benchmarking, and comparative overview.
Löytynoja, Ari
2012-01-01
Comparative evolutionary analyses of molecular sequences are solely based on the identities and differences detected between homologous characters. Errors in this homology statement, that is errors in the alignment of the sequences, are likely to lead to errors in the downstream analyses. Sequence alignment and phylogenetic inference are tightly connected and many popular alignment programs use the phylogeny to divide the alignment problem into smaller tasks. They then neglect the phylogenetic tree, however, and produce alignments that are not evolutionarily meaningful. The use of phylogeny-aware methods reduces the error but the resulting alignments, with evolutionarily correct representation of homology, can challenge the existing practices and methods for viewing and visualising the sequences. The inter-dependency of alignment and phylogeny can be resolved by joint estimation of the two; methods based on statistical models allow for inferring the alignment parameters from the data and correctly take into account the uncertainty of the solution but remain computationally challenging. Widely used alignment methods are based on heuristic algorithms and unlikely to find globally optimal solutions. The whole concept of one correct alignment for the sequences is questionable, however, as there typically exist vast numbers of alternative, roughly equally good alignments that should also be considered. This uncertainty is hidden by many popular alignment programs and is rarely correctly taken into account in the downstream analyses. The quest for finding and improving the alignment solution is complicated by the lack of suitable measures of alignment goodness. The difficulty of comparing alternative solutions also affects benchmarks of alignment methods and the results strongly depend on the measure used. As the effects of alignment error cannot be predicted, comparing the alignments' performance in downstream analyses is recommended.
Mallatt, Jon; Craig, Catherine Waggoner; Yoder, Matthew J
2010-04-01
This study (1) uses nearly complete rRNA-gene sequences from across Metazoa (197 taxa) to reconstruct animal phylogeny; (2) presents a highly annotated, manual alignment of these sequences with special reference to rRNA features including paired sites (http://purl.oclc.org/NET/rRNA/Metazoan_alignment) and (3) tests, after eliminating as few disruptive, rogue sequences as possible, if a likelihood framework can recover the main metazoan clades. We found that systematic elimination of approximately 6% of the sequences, including the divergent or unstably placed sequences of cephalopods, arrowworm, symphylan and pauropod myriapods, and of myzostomid and nemertodermatid worms, led to a tree that supported Ecdysozoa, Lophotrochozoa, Protostomia, and Bilateria. Deuterostomia, however, was never recovered, because the rRNA of urochordates goes (nonsignificantly) near the base of the Bilateria. Counterintuitively, when we modeled the evolution of the paired sites, phylogenetic resolution was not increased over traditional tree-building models that assume all sites in rRNA evolve independently. The rRNA genes of non-bilaterians contain a higher % AT than do those of most bilaterians. The rRNA genes of Acoela and Myzostomida were found to be secondarily shortened, AT-enriched, and highly modified, throwing some doubt on the location of these worms at the base of Bilateria in the rRNA tree--especially myzostomids, which other evidence suggests are annelids instead. Other findings are marsupial-with-placental mammals, arrowworms in Ecdysozoa (well supported here but contradicted by morphology), and Placozoa as sister to Cnidaria. Finally, despite the difficulties, the rRNA-gene trees are in strong concordance with trees derived from multiple protein-coding genes in supporting the new animal phylogeny. (c) 2009 Elsevier Inc. All rights reserved.
Crampton-Platt, Alex; Timmermans, Martijn J T N; Gimmel, Matthew L; Kutty, Sujatha Narayanan; Cockerill, Timothy D; Vun Khen, Chey; Vogler, Alfried P
2015-09-01
In spite of the growth of molecular ecology, systematics and next-generation sequencing, the discovery and analysis of diversity is not currently integrated with building the tree-of-life. Tropical arthropod ecologists are well placed to accelerate this process if all specimens obtained through mass-trapping, many of which will be new species, could be incorporated routinely into phylogeny reconstruction. Here we test a shotgun sequencing approach, whereby mitochondrial genomes are assembled from complex ecological mixtures through mitochondrial metagenomics, and demonstrate how the approach overcomes many of the taxonomic impediments to the study of biodiversity. DNA from approximately 500 beetle specimens, originating from a single rainforest canopy fogging sample from Borneo, was pooled and shotgun sequenced, followed by de novo assembly of complete and partial mitogenomes for 175 species. The phylogenetic tree obtained from this local sample was highly similar to that from existing mitogenomes selected for global coverage of major lineages of Coleoptera. When all sequences were combined only minor topological changes were induced against this reference set, indicating an increasingly stable estimate of coleopteran phylogeny, while the ecological sample expanded the tip-level representation of several lineages. Robust trees generated from ecological samples now enable an evolutionary framework for ecology. Meanwhile, the inclusion of uncharacterized samples in the tree-of-life rapidly expands taxon and biogeographic representation of lineages without morphological identification. Mitogenomes from shotgun sequencing of unsorted environmental samples and their associated metadata, placed robustly into the phylogenetic tree, constitute novel DNA "superbarcodes" for testing hypotheses regarding global patterns of diversity. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Holovachov, Oleksandr
2016-01-01
Metabarcoding is becoming a common tool used to assess and compare diversity of organisms in environmental samples. Identification of OTUs is one of the critical steps in the process, and several taxonomy assignment methods have been proposed to accomplish this task. This publication evaluates the quality of reference datasets, along with several alignment and phylogeny inference methods used in one of the taxonomy assignment methods, called the tree-based approach. This approach assigns anonymous OTUs to taxonomic categories based on the relative placements of OTUs and reference sequences on the cladogram and the support that these placements receive. In the tree-based taxonomy assignment approach, reliable identification of anonymous OTUs is based on their placement in monophyletic and highly supported clades together with identified reference taxa. It therefore requires a high-quality reference dataset. Resolution of phylogenetic trees is strongly affected by the presence of erroneous sequences as well as by the alignment and phylogeny inference methods used in the process. Two preparation steps are essential for the successful application of the tree-based taxonomy assignment approach. Curated collections of genetic information do include erroneous sequences. These sequences have a detrimental effect on the resolution of cladograms used in the tree-based approach. They must be identified and excluded from the reference dataset beforehand. Various combinations of multiple sequence alignment and phylogeny inference methods provide cladograms with different topology and bootstrap support. These combinations of methods need to be tested in order to determine the one that gives the highest resolution for the particular reference dataset. Completing the above-mentioned preparation steps is expected to decrease the number of unassigned OTUs and thus improve the results of the tree-based taxonomy assignment approach.
DendroBLAST: approximate phylogenetic trees in the absence of multiple sequence alignments.
Kelly, Steven; Maini, Philip K
2013-01-01
The rapidly growing availability of genome information has created considerable demand for both fast and accurate phylogenetic inference algorithms. We present a novel method called DendroBLAST for reconstructing phylogenetic dendrograms/trees from protein sequences using BLAST. This method differs from other methods by incorporating a simple model of sequence evolution to test the effect of introducing sequence changes on the reliability of the bipartitions in the inferred tree. Using realistic simulated sequence data we demonstrate that this method produces phylogenetic trees that are more accurate than other commonly-used distance based methods though not as accurate as maximum likelihood methods from good quality multiple sequence alignments. In addition to tests on simulated data, we use DendroBLAST to generate input trees for a supertree reconstruction of the phylogeny of the Archaea. This independent analysis produces an approximate phylogeny of the Archaea that has both high precision and recall when compared to previously published analysis of the same dataset using conventional methods. Taken together these results demonstrate that approximate phylogenetic trees can be produced in the absence of multiple sequence alignments, and we propose that these trees will provide a platform for improving and informing downstream bioinformatic analysis. A web implementation of the DendroBLAST method is freely available for use at http://www.dendroblast.com/.
Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen).
Rambaut, Andrew; Lam, Tommy T; Max Carvalho, Luiz; Pybus, Oliver G
2016-01-01
Gene sequences sampled at different points in time can be used to infer molecular phylogenies on a natural timescale of months or years, provided that the sequences in question undergo measurable amounts of evolutionary change between sampling times. Data sets with this property are termed heterochronous and have become increasingly common in several fields of biology, most notably the molecular epidemiology of rapidly evolving viruses. Here we introduce the cross-platform software tool, TempEst (formerly known as Path-O-Gen), for the visualization and analysis of temporally sampled sequence data. Given a molecular phylogeny and the dates of sampling for each sequence, TempEst uses an interactive regression approach to explore the association between genetic divergence through time and sampling dates. TempEst can be used to (1) assess whether there is sufficient temporal signal in the data to proceed with phylogenetic molecular clock analysis, and (2) identify sequences whose genetic divergence and sampling date are incongruent. Examination of the latter can help identify data quality problems, including errors in data annotation, sample contamination, sequence recombination, or alignment error. We recommend that all users of the molecular clock models implemented in BEAST first check their data using TempEst prior to analysis.
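The regression idea described in the abstract above can be illustrated with a minimal sketch. It assumes that root-to-tip distances have already been read off a rooted tree; it is an ordinary least-squares fit of divergence against sampling date, not TempEst's own implementation, and the function name `root_to_tip_regression` is hypothetical.

```python
def root_to_tip_regression(dates, distances):
    """Least-squares fit of root-to-tip divergence against sampling date.

    dates:     sampling dates as decimal years (one per sequence)
    distances: root-to-tip genetic distances, in the same order
    Returns (slope, intercept, r_squared, residuals). The slope is a crude
    rate estimate, and -intercept / slope is the date at which the fitted
    line reaches zero divergence. Large residuals flag sequences whose
    divergence and sampling date look incongruent.
    """
    if len(dates) != len(distances) or len(dates) < 2:
        raise ValueError("need at least two (date, distance) pairs")
    n = len(dates)
    mean_t = sum(dates) / n
    mean_d = sum(distances) / n
    sxx = sum((t - mean_t) ** 2 for t in dates)
    if sxx == 0:
        raise ValueError("all sampling dates are identical: no temporal spread")
    sxy = sum((t - mean_t) * (d - mean_d) for t, d in zip(dates, distances))
    syy = sum((d - mean_d) ** 2 for d in distances)
    slope = sxy / sxx
    intercept = mean_d - slope * mean_t
    r_squared = (sxy * sxy) / (sxx * syy) if syy > 0 else 0.0
    residuals = [d - (intercept + slope * t) for t, d in zip(dates, distances)]
    return slope, intercept, r_squared, residuals
```

A low r_squared or a negative slope in such a fit would be a reason to doubt the temporal signal before running a molecular clock analysis.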
Multilocus inference of species trees and DNA barcoding.
Mallo, Diego; Posada, David
2016-09-05
The unprecedented amount of data resulting from next-generation sequencing has opened a new era in phylogenetic estimation. Although large datasets should, in theory, increase phylogenetic resolution, massive, multilocus datasets have uncovered a great deal of phylogenetic incongruence among different genomic regions, due both to stochastic error and to the action of different evolutionary processes such as incomplete lineage sorting, gene duplication and loss and horizontal gene transfer. This incongruence violates one of the fundamental assumptions of the DNA barcoding approach, which assumes that gene history and species history are identical. In this review, we explain some of the most important challenges we will have to face to reconstruct the history of species, and the advantages and disadvantages of different strategies for the phylogenetic analysis of multilocus data. In particular, we describe the evolutionary events that can generate species tree-gene tree discordance, compare the most popular methods for species tree reconstruction, highlight the challenges we need to face when using them and discuss their potential utility in barcoding. Current barcoding methods sacrifice a great amount of statistical power by only considering one locus, and a transition to multilocus barcodes would not only improve current barcoding methods, but also facilitate an eventual transition to species-tree-based barcoding strategies, which could better accommodate scenarios where the barcode gap is too small or nonexistent. This article is part of the themed issue 'From DNA barcodes to biomes'. © 2016 The Authors.
Molecular Epidemiology of Human Oral Chagas Disease Outbreaks in Colombia
Ramírez, Juan David; Montilla, Marleny; Cucunubá, Zulma M.; Floréz, Astrid Carolina; Zambrano, Pilar; Guhl, Felipe
2013-01-01
Background Trypanosoma cruzi, the causative agent of Chagas disease, displays significant genetic variability revealed by six Discrete Typing Units (TcI-TcVI). In this pathology, oral transmission represents an emerging epidemiological scenario where different outbreaks associated with food/beverage consumption have been reported in Argentina, Bolivia, Brazil, Ecuador and Venezuela. In Colombia, six human oral outbreaks have been reported, corroborating the importance of this transmission route. The molecular epidemiology of oral outbreaks is barely known, with TcI, TcII, TcIV and TcV genotypes having been incriminated. Methodology and Principal Findings High-throughput molecular characterization was conducted using MLMT (Multilocus Microsatellite Typing) and mtMLST (mitochondrial Multilocus Sequence Typing) strategies on 50 clones from ten isolates. The results showed the occurrence of TcI, TcIV and mixed infection of distinct TcI genotypes. Thus, a majority of specific mitochondrial haplotypes and allelic multilocus genotypes associated with the sylvatic cycle of transmission were detected in the dataset with the foreseen presence of mitochondrial haplotypes and allelic multilocus genotypes associated with the domestic cycle of transmission. Conclusions These findings suggest the incrimination of sylvatic genotypes in the oral outbreaks that occurred in Colombia. We observed patterns of super-infection and/or co-infection with a tailored association with the severe forms of myocarditis in the acute phase of the disease. The transmission dynamics of this infection route based on molecular epidemiology evidence was unraveled and the clinical and biological implications are discussed. PMID:23437405
Development and evaluation of a multi-locus sequence typing scheme for Mycoplasma synoviae.
Dijkman, R; Feberwee, A; Landman, W J M
2016-08-01
Reproducible molecular Mycoplasma synoviae typing techniques with sufficient discriminatory power may help to expand knowledge on its epidemiology and contribute to the improvement of control and eradication programmes of this mycoplasma species. The present study describes the development and validation of a novel multi-locus sequence typing (MLST) scheme for M. synoviae. Thirteen M. synoviae isolates originating from different poultry categories, farms and lesions, were subjected to whole genome sequencing. Their sequences were compared to that of M. synoviae reference strain MS53. A high number of single nucleotide polymorphisms (SNPs) indicating considerable genetic diversity were identified. SNPs were present in over 40 putative target genes for MLST of which five target genes were selected (nanA, uvrA, lepA, ruvB and ugpA) for the MLST scheme. This scheme was evaluated analysing 209 M. synoviae samples from different countries, categories of poultry, farms and lesions. Eleven clonal clusters and 76 different sequence types (STs) were obtained. Clustering occurred following geographical origin, supporting the hypothesis of regional population evolution. M. synoviae samples obtained from epidemiologically linked outbreaks often harboured the same ST. In contrast, multiple M. synoviae lineages were found in samples originating from swollen joints or oviducts from hens that produce eggs with eggshell apex abnormalities indicating that further research is needed to identify the genetic factors of M. synoviae that may explain its variations in tissue tropism and disease inducing potential. Furthermore, MLST proved to have a higher discriminatory power compared to variable lipoprotein and haemagglutinin A typing, which generated 50 different genotypes on the same database.
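To make the notion of a sequence type concrete, the sketch below shows how an allelic profile over the five loci named above could be mapped to an ST, and how two profiles can be compared locus by locus. The allele numbers, the in-memory lookup table, and the function names `assign_st` and `locus_differences` are illustrative assumptions and are not taken from the published scheme or its database.

```python
LOCI = ("nanA", "uvrA", "lepA", "ruvB", "ugpA")  # loci named in the abstract

def assign_st(profile, known_sts):
    """Return the ST for an allelic profile, registering a new ST if unseen.

    profile:   dict mapping each locus name to an integer allele number
    known_sts: dict mapping allele-number tuples to ST numbers (updated in place)
    """
    key = tuple(profile[locus] for locus in LOCI)
    if key not in known_sts:
        # Hypothetical numbering rule: next free integer.
        known_sts[key] = max(known_sts.values(), default=0) + 1
    return known_sts[key]

def locus_differences(profile_a, profile_b):
    """Count loci at which two allelic profiles carry different alleles."""
    return sum(profile_a[locus] != profile_b[locus] for locus in LOCI)
```

Profiles that differ at exactly one locus are single-locus variants of each other, which is the usual basis for grouping STs into clonal clusters.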
Desoubeaux, Guillaume; Debourgogne, Anne; Wiederhold, Nathan P; Zaffino, Marie; Sutton, Deanna; Burns, Rachel E; Frasca, Salvatore; Hyatt, Michael W; Cray, Carolyn
2018-07-01
Fusarium spp. are saprobic moulds that are responsible for severe opportunistic infections in humans and animals. However, we need epidemiological tools to reliably trace the circulation of such fungal strains within medical or veterinary facilities, to recognize environmental contaminations that might lead to infection and to improve our understanding of factors responsible for the onset of outbreaks. In this study, we used molecular genotyping to investigate clustered cases of Fusarium solani species complex (FSSC) infection that occurred in eight Sphyrnidae sharks under managed care at a public aquarium. Genetic relationships between fungal strains were determined by multi-locus sequence typing (MLST) analysis based on DNA sequencing at five loci, followed by comparison with sequences of 50 epidemiologically unrelated FSSC strains. Our genotyping approach revealed that F. keratoplasticum and F. solani haplotype 9x were most commonly isolated. In one case, the infection proved to be with another rare Hypocrealian opportunistic pathogen, Metarhizium robertsii. Twice, sharks proved to be infected with FSSC strains with the same MLST sequence type, supporting the hypothesis that common environmental populations of fungi existed for these sharks and suggesting the long-term persistence of the two clonal strains within the environment, perhaps in holding pools and life support systems of the aquarium. This study highlights how molecular tools like MLST can be used to investigate outbreaks of microbiological disease. This work reinforces the need for regular controls of water quality to reduce microbiological contamination due to waterborne microorganisms.
MULTILOCUS SEQUENCE TYPING OF BRUCELLA ISOLATES FROM THAILAND.
Chawjiraphan, Wireeya; Sonthayanon, Piengchan; Chanket, Phanita; Benjathummarak, Surachet; Kerdsin, Anusak; Kalambhaheti, Thareerat
2016-11-01
Although brucellosis outbreaks in Thailand are rare, they cause abortions and infertility in animals, resulting in significant economic loss. Because Brucella spp display > 90% DNA homology, multilocus sequence typing (MLST) was employed to categorize local Brucella isolates into sequence types (STs) and to determine their genetic relatedness. Brucella samples were isolated from vaginal secretion of cows and goats, and from blood cultures of infected individuals. Brucella species were determined by multiplex PCR of eight loci, in addition to MLST based on partial DNA sequences of nine house-keeping genes. MLST analysis of 36 isolates revealed 78 distinct novel allele types and 34 novel STs, while two isolates possessed the known ST8. Sequence alignments identified polymorphic sites in each allele, ranging from 2-6%, while overall genetic diversity was 3.6%. MLST analysis of the 36 Brucella isolates classified them into three species, namely, B. melitensis, B. abortus and B. suis, in agreement with multiplex PCR results. Genetic relatedness among ST members of B. melitensis and B. abortus determined by eBURST program revealed ST2 as founder of B. abortus isolates and ST8 the founder of B. melitensis isolates. ST 36, 41 and 50 of Thai Brucella isolates were identified as single locus variants of clonal cluster (CC) 8, while the majority of STs were diverse. The genetic diversity and relatedness identified using MLST revealed hitherto unexpected diversity among Thai Brucella isolates. Genetic classification of isolates could reveal the route of brucellosis transmission among humans and farm animals and also reveal their relationship with other isolates in the region and other parts of the world.
Didi, Jennifer; Lemée, Ludovic; Gibert, Laure; Pons, Jean-Louis
2014-01-01
Staphylococcus lugdunensis is an emergent virulent coagulase-negative staphylococcus responsible for severe infections similar to those caused by Staphylococcus aureus. To understand its potentially pathogenic capacity and have further detailed knowledge of the molecular traits of this organism, 93 isolates from various geographic origins were analyzed by multi-virulence-locus sequence typing (MVLST), targeting seven known or putative virulence-associated loci (atlLR2, atlLR3, hlb, isdJ, SLUG_09050, SLUG_16930, and vwbl). The polymorphisms of the putative virulence-associated loci were moderate and comparable to those of the housekeeping genes analyzed by multilocus sequence typing (MLST). However, the MVLST scheme generated 43 virulence types (VTs) compared to 20 sequence types (STs) based on MLST, indicating that MVLST was significantly more discriminating (Simpson's index [D], 0.943). No hypervirulent lineage or cluster specific to carriage strains was defined. The results of multilocus sequence analysis of known and putative virulence-associated loci are consistent with a clonal population structure for S. lugdunensis, suggesting a coevolution of these genes with housekeeping genes. Indeed, the nonsynonymous to synonymous evolutionary substitutions (dN/dS) ratio, the Tajima's D test, and Single-likelihood ancestor counting (SLAC) analysis suggest that all virulence-associated loci were under negative selection, even atlLR2 (AtlL protein) and SLUG_16930 (FbpA homologue), for which the dN/dS ratios were higher. In addition, this analysis of virulence-associated loci allowed us to propose a trilocus sequence typing scheme based on the intragenic regions of atlLR3, isdJ, and SLUG_16930, which is more discriminant than MLST for studying short-term epidemiology and further characterizing the lineages of the rare but highly pathogenic S. lugdunensis. PMID:25078912
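The discriminatory index reported above can be computed from a list of type assignments. The sketch below assumes the form of Simpson's index commonly used for typing schemes (the Hunter-Gaston formulation), D = 1 - sum_j n_j(n_j - 1) / (N(N - 1)), where n_j is the number of isolates of type j and N the total number of isolates. The abstract does not state which formulation the authors used, and the function name is hypothetical.

```python
from collections import Counter

def simpson_diversity(type_labels):
    """Simpson's index of diversity for a typing scheme.

    type_labels: one type label (e.g. an ST or VT) per isolate.
    Returns the probability that two isolates drawn at random without
    replacement belong to different types.
    """
    n_total = len(type_labels)
    if n_total < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(type_labels)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n_total * (n_total - 1))
```

Applying the same function to the MLST and MVLST assignments of one isolate collection gives two indices that can be compared directly.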
Gharout-Sait, Alima; Touati, Abdelaziz; Guillard, Thomas; Brasme, Lucien; de Champs, Christophe
2015-01-01
In this study, 922 consecutive non-duplicate clinical isolates of Enterobacteriaceae obtained from hospitalized and non-hospitalized patients at Bejaia, Algeria were analyzed for AmpC-type β-lactamases production. The ampC genes and their genetic environment were characterized using polymerase chain reaction (PCR) and sequencing. Plasmid incompatibility groups were determined by using PCR-based replicon typing. Phylogenetic grouping and multilocus sequence typing were determined for molecular typing of the plasmid-mediated AmpC (pAmpC) isolates. Of the isolates, 15 (1.6%) were identified as AmpC producers including 14 CMY-4-producing isolates and one DHA-1-producing Klebsiella pneumoniae. All AmpC-producing isolates co-expressed the broad-spectrum TEM-1 β-lactamase and three of them co-produced CTX-M and/or SHV-12 ESBL. Phylogenetic grouping and virulence genotyping of the E. coli isolates revealed that most of them belonged to groups D and B1. Multilocus sequence typing analysis of K. pneumoniae isolates identified four different sequence types (STs) with two new sequences: ST1617 and ST1618. Plasmid replicon typing indicates that blaCMY-4 gene was located on broad host range A/C plasmid, while LVPK replicon was associated with blaDHA-1. All isolates carrying blaCMY-4 displayed the transposon-like structures ISEcp1/ΔISEcp1-blaCMY-blc-sugE. Our study showed that CMY-4 was the main pAmpC in the Enterobacteriaceae isolates in Algeria. Copyright © 2015 Elsevier Editora Ltda. All rights reserved.
Katz, Lee S.; Griswold, Taylor; Williams-Newkirk, Amanda J.; Wagner, Darlene; Petkau, Aaron; Sieffert, Cameron; Van Domselaar, Gary; Deng, Xiangyu; Carleton, Heather A.
2017-01-01
Modern epidemiology of foodborne bacterial pathogens in industrialized countries relies increasingly on whole genome sequencing (WGS) techniques. As opposed to profiling techniques such as pulsed-field gel electrophoresis, WGS requires a variety of computational methods. Since 2013, United States agencies responsible for food safety including the CDC, FDA, and USDA, have been performing whole-genome sequencing (WGS) on all Listeria monocytogenes found in clinical, food, and environmental samples. Each year, more genomes of other foodborne pathogens such as Escherichia coli, Campylobacter jejuni, and Salmonella enterica are being sequenced. Comparing thousands of genomes across an entire species requires a fast method with coarse resolution; however, capturing the fine details of highly related isolates requires a computationally heavy and sophisticated algorithm. Most L. monocytogenes investigations employing WGS depend on being able to identify an outbreak clade whose inter-genomic distances are less than an empirically determined threshold. When the difference between a few single nucleotide polymorphisms (SNPs) can help distinguish between genomes that are likely outbreak-associated and those that are less likely to be associated, we require a fine-resolution method. To achieve this level of resolution, we have developed Lyve-SET, a high-quality SNP pipeline. We evaluated Lyve-SET by retrospectively investigating 12 outbreak data sets along with four other SNP pipelines that have been used in outbreak investigation or similar scenarios. To compare these pipelines, several distance and phylogeny-based comparison methods were applied, which collectively showed that multiple pipelines were able to identify most outbreak clusters and strains. 
Currently in the US PulseNet system, whole genome multi-locus sequence typing (wgMLST) is the preferred primary method for foodborne WGS cluster detection and outbreak investigation due to its ability to name standardized genomic profiles, its central database, and its ability to be run in a graphical user interface. However, creating a functional wgMLST scheme requires extended up-front development and subject-matter expertise. When a scheme does not exist or when the highest resolution is needed, SNP analysis is used. Using three Listeria outbreak data sets, we demonstrated the concordance between Lyve-SET SNP typing and wgMLST. Availability: Lyve-SET can be found at https://github.com/lskatz/Lyve-SET. PMID:28348549
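The idea of identifying an outbreak cluster by inter-genomic distances below a threshold can be sketched as follows. This is a toy illustration over pre-aligned SNP strings with single-linkage grouping. It is not how Lyve-SET or wgMLST works internally, and the function names and any threshold value passed in are assumptions.

```python
def snp_distance(seq_a, seq_b):
    """Count positions where two aligned sequences differ, ignoring
    positions where either one has a gap or an ambiguous base."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(
        1 for a, b in zip(seq_a, seq_b)
        if a != b and a in "ACGT" and b in "ACGT"
    )

def cluster_by_threshold(sequences, threshold):
    """Single-linkage clusters: isolates whose pairwise SNP distance is
    below `threshold` end up in the same cluster (directly or via a chain).

    sequences: dict mapping isolate name to aligned SNP string
    Returns a sorted list of sorted lists of isolate names.
    """
    names = sorted(sequences)
    parent = {name: name for name in names}

    def find(name):
        # Union-find root lookup with path halving.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if snp_distance(sequences[a], sequences[b]) < threshold:
                parent[find(a)] = find(b)

    clusters = {}
    for name in names:
        clusters.setdefault(find(name), []).append(name)
    return sorted(clusters.values())
```

In practice the threshold is determined empirically for each organism, as the abstract notes, so any fixed value used with this sketch is a placeholder.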
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leebens-Mack, Jim; Raubeson, Linda A.; Cui, Liying
2005-05-27
While there has been strong support for Amborella and Nymphaeales (water lilies) as branching from basal-most nodes in the angiosperm phylogeny, this hypothesis has recently been challenged by phylogenetic analyses of 61 protein-coding genes extracted from the chloroplast genome sequences of Amborella, Nymphaea and 12 other available land plant chloroplast genomes. These character-rich analyses placed the monocots, represented by three grasses (Poaceae), as sister to all other extant angiosperm lineages. We have extracted protein-coding regions from draft sequences for six additional chloroplast genomes to test whether this surprising result could be an artifact of long-branch attraction due to limited taxon sampling. The added taxa include three monocots (Acorus, Yucca and Typha), a water lily (Nuphar), a ranunculid (Ranunculus), and a gymnosperm (Ginkgo). Phylogenetic analyses of the expanded DNA and protein datasets together with microstructural characters (indels) provided unambiguous support for Amborella and the Nymphaeales as branching from the basal-most nodes in the angiosperm phylogeny. However, their relative positions proved to be dependent on method of analysis, with parsimony favoring Amborella as sister to all other angiosperms, and maximum likelihood and neighbor-joining methods favoring an Amborella + Nymphaeales clade as sister. The maximum likelihood phylogeny supported the latter hypothesis, but the likelihood for the former hypothesis was not significantly different. Parametric bootstrap analysis, single gene phylogenies, estimated divergence dates and conflicting indel characters all help to illuminate the nature of the conflict in resolution of the most basal nodes in the angiosperm phylogeny. Molecular dating analyses provided median age estimates of 161 mya for the most recent common ancestor of all extant angiosperms and 145 mya for the most recent common ancestor of monocots, magnoliids and eudicots.
Whereas long sequences reduce variance in branch lengths and molecular dating estimates, the impact of improved taxon sampling on the rooting of the angiosperm phylogeny together with the results of parametric bootstrap analyses demonstrate how long-branch attraction can mislead genome-scale phylogenetic analyses.
Urban, Julie M; Cryan, Jason R
2012-06-14
Members of the hemipteran suborder Auchenorrhyncha (commonly known as planthoppers, tree- and leafhoppers, spittlebugs, and cicadas) are unusual among insects known to harbor endosymbiotic bacteria in that they are associated with diverse assemblages of bacterial endosymbionts. Early light microscopic surveys of species representing the two major lineages of Auchenorrhyncha (the planthopper superfamily Fulgoroidea; and Cicadomorpha, comprising Membracoidea [tree- and leafhoppers], Cercopoidea [spittlebugs], and Cicadoidea [cicadas]), found that most examined species harbored at least two morphologically distinct bacterial endosymbionts, and some harbored as many as six. Recent investigations using molecular techniques have identified multiple obligate bacterial endosymbionts in Cicadomorpha; however, much less is known about endosymbionts of Fulgoroidea. In this study, we present the initial findings of an ongoing PCR-based survey (sequencing 16S rDNA) of planthopper-associated bacteria to document endosymbionts with a long-term history of codiversification with their fulgoroid hosts. Results of PCR surveys and phylogenetic analyses of 16S rDNA recovered a monophyletic clade of Betaproteobacteria associated with planthoppers; this clade included Vidania fulgoroideae, a recently described bacterium identified in exemplars of the planthopper family Cixiidae. We surveyed 77 planthopper species representing 18 fulgoroid families, and detected Vidania in 40 species (representing 13 families). Further, we detected the Sulcia endosymbiont (identified as an obligate endosymbiont of Auchenorrhyncha in previous studies) in 30 of the 40 species harboring Vidania. Concordance of the Vidania phylogeny with the phylogeny of the planthopper hosts (reconstructed based on sequence data from five genes generated from the same insect specimens from which the bacterial sequences were obtained) was supported by statistical tests of codiversification. 
Codiversification tests also supported concordance of the Sulcia phylogeny with the phylogeny of the planthopper hosts, as well as concordance of planthopper-associated Vidania and Sulcia phylogenies. Our results indicate that the Betaproteobacterium Vidania is an ancient endosymbiont that infected the common ancestor of Fulgoroidea at least 130 million years ago. Comparison of our findings with the early light-microscopic surveys conducted by Müller suggests that Vidania is Müller's x-symbiont, which he hypothesized to have codiversified with most lineages of planthoppers and with the Sulcia endosymbiont.
2012-01-01
Background Members of the hemipteran suborder Auchenorrhyncha (commonly known as planthoppers, tree- and leafhoppers, spittlebugs, and cicadas) are unusual among insects known to harbor endosymbiotic bacteria in that they are associated with diverse assemblages of bacterial endosymbionts. Early light microscopic surveys of species representing the two major lineages of Auchenorrhyncha (the planthopper superfamily Fulgoroidea; and Cicadomorpha, comprising Membracoidea [tree- and leafhoppers], Cercopoidea [spittlebugs], and Cicadoidea [cicadas]), found that most examined species harbored at least two morphologically distinct bacterial endosymbionts, and some harbored as many as six. Recent investigations using molecular techniques have identified multiple obligate bacterial endosymbionts in Cicadomorpha; however, much less is known about endosymbionts of Fulgoroidea. In this study, we present the initial findings of an ongoing PCR-based survey (sequencing 16S rDNA) of planthopper-associated bacteria to document endosymbionts with a long-term history of codiversification with their fulgoroid hosts. Results Results of PCR surveys and phylogenetic analyses of 16S rDNA recovered a monophyletic clade of Betaproteobacteria associated with planthoppers; this clade included Vidania fulgoroideae, a recently described bacterium identified in exemplars of the planthopper family Cixiidae. We surveyed 77 planthopper species representing 18 fulgoroid families, and detected Vidania in 40 species (representing 13 families). Further, we detected the Sulcia endosymbiont (identified as an obligate endosymbiont of Auchenorrhyncha in previous studies) in 30 of the 40 species harboring Vidania. Concordance of the Vidania phylogeny with the phylogeny of the planthopper hosts (reconstructed based on sequence data from five genes generated from the same insect specimens from which the bacterial sequences were obtained) was supported by statistical tests of codiversification. 
Codiversification tests also supported concordance of the Sulcia phylogeny with the phylogeny of the planthopper hosts, as well as concordance of planthopper-associated Vidania and Sulcia phylogenies. Conclusions Our results indicate that the Betaproteobacterium Vidania is an ancient endosymbiont that infected the common ancestor of Fulgoroidea at least 130 million years ago. Comparison of our findings with the early light-microscopic surveys conducted by Müller suggests that Vidania is Müller’s x-symbiont, which he hypothesized to have codiversified with most lineages of planthoppers and with the Sulcia endosymbiont. PMID:22697166
Ki, Jang-Seu
2010-05-01
Noctiluca scintillans (Macartney) Kofoid et Swezy, 1921 is an unarmoured heterotrophic dinoflagellate with a global distribution, and has been considered one of the ancestral taxa among dinoflagellates. Recently, 18S rDNA, actin, alpha-, beta-tubulin, and Hsp90-based phylogenies have shown the basal position of the noctilucids. However, the relationships of dinoflagellates in the basal lineages are still controversial. Although the nuclear rDNA (e.g. 18S, ITS-5.8S, and 28S) contains much genetic information, DNA sequences of N. scintillans rDNA molecules have so far been insufficiently characterized. Here the author sequenced a long-range nuclear rDNA, spanning from the 18S to the D5 region of the 28S rDNA, of N. scintillans. The present N. scintillans had a nearly identical genotype (>99.0% similarity) compared to other Noctiluca sequences from different geographic origins. Nucleotide divergence in the partial 28S rDNA was significantly higher (p<0.05) than in the 18S rDNA, demonstrating that the 28S rDNA is more variable. The 28S rDNA phylogeny of 17 selected dinoflagellates, two perkinsids, and two apicomplexans as outgroups showed that N. scintillans and Oxyrrhis marina formed a clade that diverged separately from core dinoflagellates. Copyright (c) 2009 Elsevier GmbH. All rights reserved.
Navarro, Aaron; Martínez-Murcia, Antonio
2018-04-19
The phylogenies derived from housekeeping gene sequence alignments, although mere evolutionary hypotheses, have increased our knowledge of Aeromonas genetic diversity, providing a robust species delineation framework invaluable for reliable, easy and fast species identification. Previous classifications of Aeromonas have been fully surpassed by the recently developed phylogenetic (natural) classification obtained from the analysis of so-called "molecular chronometers". Although ribosomal RNAs cannot split all known Aeromonas species, the conserved nature of 16S rRNA offers reliable alignments containing mosaics of sequence signatures which may serve as targets of genus-specific oligonucleotides for subsequent identification/detection tests in samples without culturing. In contrast, some housekeeping genes coding for proteins show a much better chronometric capacity to discriminate highly related strains. Although species and loci do not all evolve at exactly the same rate, published Aeromonas phylogenies were congruent with each other, indicating that phylogenetic markers are synchronized and that a concatenated multi-gene phylogeny may be "the mirror" of the entire genomic relationships. Thanks to MLPA approaches, new Aeromonas species and strains of rarely isolated species are discovered more frequently today; such approaches should consequently be promoted extensively for isolate screening and species identification. Accumulated data should nevertheless be carefully catalogued to maintain a reliable database. This article is protected by copyright. All rights reserved.
Boonkhot, Phacharaporn; Tadee, Pakpoom; Yamsakul, Panuwat; Pocharoen, Chairoj; Chokesajjawatee, Nipa; Patchanee, Prapas
2015-05-01
Pigs and pork products are well known as an important source of Salmonella, one of the major zoonotic foodborne pathogens. The emergence and spread of antimicrobial resistance is becoming a major public health concern worldwide. Integrons are genetic elements known to have a role in the acquisition and expression of genes conferring antibiotic resistance. This study focuses on the prevalence of class 1 integrons-carrying Salmonella, the genetic diversity of strains of those organisms obtained from swine production chains in Chiang Mai and Lamphun provinces, Thailand, using multilocus sequence typing (MLST) and comparison of genetic diversity of sequence types of Salmonella from this study with pulsotypes identified in previous study. In 175 Salmonella strains, the overall prevalence of class 1 integrons-carrying-Salmonella was 14%. The gene cassettes array pattern "dfrA12-orfF-aadA2" was the most frequently observed. Most of the antimicrobial resistance identified was not associated with related gene cassettes harbored by Salmonella. Six sequence types were generated from 30 randomly selected strains detected by MLST. Salmonella at the human-animal-environment interface was confirmed. Linkages both in the farm to slaughterhouse contamination route and the horizontal transmission of resistance genes were demonstrated. To reduce this problem, the use of antimicrobials in livestock should be controlled by veterinarians. Education and training of food handlers as well as promotion of safe methods of food consumption are important avenues for helping prevent foodborne illness.
Joseph, Susan; Forsythe, Stephen J.
2012-01-01
Cronobacter spp. (previously known as Enterobacter sakazakii) is a bacterial pathogen affecting all age groups, with particularly severe clinical complications in neonates and infants. One recognized route of infection is the consumption of contaminated infant formula. As a recently recognized bacterial pathogen of considerable importance and regulatory control, appropriate detection and identification schemes are required. The application of multilocus sequence typing (MLST) and analysis (MLSA) of the seven alleles atpD, fusA, glnS, gltB, gyrB, infB, and ppsA (concatenated length 3036 base pairs) has led to considerable advances in our understanding of the genus. This approach is supported by both the reliability of DNA sequencing over subjective phenotyping and the establishment of a MLST database which has open access and is also curated; http://www.pubMLST.org/cronobacter. MLST has been used to describe the diversity of the newly recognized genus, has been instrumental in the formal recognition of new Cronobacter species (C. universalis and C. condimenti), and has revealed the high clonality of strains and the association of clonal complex 4 with neonatal meningitis cases. Clearly the MLST approach has considerable benefits over the use of non-DNA sequence based methods of analysis for newly emergent bacterial pathogens. The application of MLST and MLSA has dramatically improved our understanding of this opportunistic bacterium, which can cause irreparable damage to a newborn baby’s brain, and has contributed to improved control measures to protect neonatal health. PMID:23189075
Arvand, Mardjan; Feil, Edward J.; Giladi, Michael; Boulouis, Henri-Jean; Viezens, Juliane
2007-01-01
Bartonella henselae is a zoonotic pathogen and the causative agent of cat scratch disease and a variety of other disease manifestations in humans. Previous investigations have suggested that a limited subset of B. henselae isolates may be associated with human disease. In the present study, 182 human and feline B. henselae isolates from Europe, North America and Australia were analysed by multi-locus sequence typing (MLST) to detect any associations between sequence type (ST), host species and geographical distribution of the isolates. A total of 14 sequence types were detected, but over 66% (16/24) of the isolates recovered from human disease corresponded to a single genotype, ST1, and this type was detected in all three continents. In contrast, 27.2% (43/158) of the feline isolates corresponded to ST7, but this ST was not recovered from humans and was restricted to Europe. The difference in host association of STs 1 (human) and 7 (feline) was statistically significant (P≤0.001). eBURST analysis assigned the 14 STs to three clonal lineages, which contained two or more STs, and a singleton comprising ST7. These groups were broadly consistent with a neighbour-joining tree, although splits decomposition analysis was indicative of a history of recombination. These data indicate that B. henselae lineages differ in their virulence properties for humans and contribute to a better understanding of the population structure of B. henselae. PMID:18094753
Xiao, Yinghua; Wagendorp, Arjen; Moezelaar, Roy; Abee, Tjakko
2012-01-01
Of 98 suspected food-borne Clostridium perfringens isolates obtained from a nationwide survey by the Food and Consumer Product Safety Authority in The Netherlands, 59 strains were identified as C. perfringens type A. Using PCR-based techniques, the cpe gene encoding enterotoxin was detected in eight isolates, showing a chromosomal location for seven isolates and a plasmid location for one isolate. Further characterization of these strains by using (GTG)5 fingerprint repetitive sequence-based PCR analysis distinguished C. perfringens from other sulfite-reducing clostridia but did not allow for differentiation between various types of C. perfringens strains. To characterize the C. perfringens strains further, multilocus sequence typing (MLST) analysis was performed on eight housekeeping genes of both enterotoxic and non-cpe isolates, and the data were combined with a previous global survey covering strains associated with food poisoning, gas gangrene, and isolates from food or healthy individuals. This revealed that the chromosomal cpe strains (food strains and isolates from food poisoning cases) belong to a distinct cluster that is significantly distant from all the other cpe plasmid-carrying and cpe-negative strains. These results suggest that different groups of C. perfringens have undergone niche specialization and that a distinct group of food isolates has specific core genome sequences. Such findings have epidemiological and evolutionary significance. Better understanding of the origin and reservoir of enterotoxic C. perfringens may allow for improved control of this organism in foods. PMID:22865060
Cooper, Vaughn S.; Hatcher, Philip J.; Verheyde, Bart; Carlier, Aurélien; Vandamme, Peter
2017-01-01
The natural environment serves as a reservoir of opportunistic pathogens. A well-established method for studying the epidemiology of such opportunists is multilocus sequence typing, which in many cases has defined strains predisposed to causing infection. Burkholderia multivorans is an important pathogen in people with cystic fibrosis (CF) and its epidemiology suggests that strains are acquired from non-human sources such as the natural environment. This raises the central question of whether the isolation source (CF or environment) or the multilocus sequence type (ST) of B. multivorans better predicts their genomic content and functionality. We identified four pairs of B. multivorans isolates, representing distinct STs and consisting of one CF and one environmental isolate each. All genomes were sequenced using the PacBio SMRT sequencing technology, which resulted in eight high-quality B. multivorans genome assemblies. The present study demonstrated that the genomic structure of the examined B. multivorans STs is highly conserved and that the B. multivorans genomic lineages are defined by their ST. Orthologous protein families were not uniformly distributed among chromosomes, with core orthologs being enriched on the primary chromosome and ST-specific orthologs being enriched on the second and third chromosome. The ST-specific orthologs were enriched in genes involved in defense mechanisms and secondary metabolism, corroborating the strain-specificity of these virulence characteristics. Finally, the same B. multivorans genomic lineages occur in both CF and environmental samples and on different continents, demonstrating their ubiquity and evolutionary persistence. PMID:28430818
Biogeography of Burkholderia pseudomallei in the Torres Strait Islands of Northern Australia
Baker, Anthony; Mayo, Mark; Owens, Leigh; Burgess, Graham; Norton, Robert; McBride, William John Hannan; Currie, Bart J.
2013-01-01
It has been hypothesized that biogeographical boundaries are a feature of Burkholderia pseudomallei ecology, and they impact the epidemiology of melioidosis on a global scale. This study examined the relatedness of B. pseudomallei sourced from islands in the Torres Strait of Northern Australia to determine if the geography of isolated island communities is a determinant of the organisms' dispersal. Environmental sampling on Badu Island in the Near Western Island cluster recovered a single clone. An additional 32 clinical isolates from the region were sourced. Isolates were characterized using multilocus sequence typing and a multiplex PCR targeting the flagellum gene cluster. Gene cluster analysis determined that 69% of the isolates from the region encoded the ancestral Burkholderia thailandensis-like flagellum and chemotaxis gene cluster, a proportion significantly lower than that reported from mainland Australia and consistent with observations of isolates from southern Papua New Guinea. A goodness-of-fit test indicated that there was geographic localization of sequence types throughout the archipelago, with the exception of Thursday Island, the economic and cultural hub of the region. Sequence types common to mainland Australia and Papua New Guinea were identified. These findings demonstrate for the first time an environmental reservoir for B. pseudomallei in the Torres Strait, and multilocus sequence typing suggests that the organism is not randomly distributed throughout this region and that seawater may provide a barrier to dispersal of the organism. Moreover, these findings support an anthropogenic dispersal hypothesis for the spread of B. pseudomallei throughout this region. PMID:23698533
Keller, Judith I; Shriver, W Gregory
2014-01-01
Campylobacter jejuni is responsible for the majority of bacterial foodborne gastroenteritis in the US, usually due to the consumption of undercooked poultry. Research on which avian species transmit the bacterium is limited, especially in the US. We sampled wild birds in three families (Anatidae, Scolopacidae, and Laridae) in eastern North America to determine the prevalence and specific strains of Campylobacter. The overall prevalence of Campylobacter spp. was 9.2% for all wild birds sampled (n = 781). Campylobacter jejuni was the most prevalent species (8.1%), while Campylobacter coli and Campylobacter lari prevalence estimates were low (1.4% and 0.3%, respectively). We used multilocus sequence typing PCR specific to C. jejuni to characterize clonal complexes and sequence types isolated from wild bird samples and detected 13 novel sequence types, along with a clonal complex previously only associated with human disease (ST-658). Wild birds share an increasing amount of habitat with humans as more landscapes become fragmented and developed for human needs. Wild birds are and will remain an important aspect of public health due to their ability to carry and disperse emerging zoonotic pathogens or their arthropod vectors. As basic information such as prevalence is limited or lacking from a majority of wild birds in the US, this study provides further insight into Campylobacter epidemiology, host preference, and strain characterization of C. jejuni.
Archaebacterial phylogeny: perspectives on the urkingdoms
NASA Technical Reports Server (NTRS)
Woese, C. R.; Olsen, G. J.
1986-01-01
Comparisons of complete 16S ribosomal RNA sequences have been used to confirm, refine and extend earlier concepts of archaebacterial phylogeny. The archaebacteria fall naturally into two major branches or divisions, I--the sulfur-dependent thermophilic archaebacteria, and II--the methanogenic archaebacteria and their relatives. Division I comprises a relatively closely related and phenotypically homogeneous collection of thermophilic sulfur-dependent species--encompassing the genera Sulfolobus, Thermoproteus, Pyrodictium and Desulfurococcus. The organisms of Division II, however, form a less compact grouping phylogenetically, and are also more diverse in phenotype. All three of the (major) methanogen groups are found in Division II, as are the extreme halophiles and two types of thermoacidophiles, Thermoplasma acidophilum and Thermococcus celer. This last species branches sufficiently deeply in the Division II line that it might be considered to represent a separate, third Division. However, both the extreme halophiles and Tp. acidophilum branch within the cluster of methanogens. The extreme halophiles are specifically related to the Methanomicrobiales, to the exclusion of both the Methanococcales and the Methanobacteriales. Tp. acidophilum is peripherally related to the halophile-Methanomicrobiales group. By 16S rRNA sequence measure the archaebacteria constitute a phylogenetically coherent grouping (clade), which excludes both the eubacteria and the eukaryotes--a conclusion that is supported by other sequence evidence as well. Alternative proposals for archaebacterial phylogeny, not based upon sequence evidence, are discussed and evaluated. In particular, proposals to rename (reclassify) various subgroups of the archaebacteria as new kingdoms are found wanting, for both their lack of proper experimental support and the taxonomic confusion they introduce.
Tamar, Karin; Carranza, Salvador; Sindaco, Roberto; Moravec, Jiří; Trape, Jean-François; Meiri, Shai
2016-10-01
Acanthodactylus lizards are among the most diverse and widespread diurnal reptiles in the arid regions spanning from North Africa across to western India. Acanthodactylus constitutes the most species-rich genus in the family Lacertidae, with over 40 recognized species inhabiting a wide variety of dry habitats. The genus has seldom undergone taxonomic revisions, and although there are a number of described species and species-groups, their boundaries, as well as their interspecific relationships, remain largely unresolved. We constructed a multilocus phylogeny, combining data from two mitochondrial (12S, cytb) and three nuclear (MC1R, ACM4, c-mos) markers for 302 individuals belonging to 36 known species, providing the first large-scale time-calibrated molecular phylogeny of the genus. We evaluated phylogenetic relationships between and within species-groups, and assessed Acanthodactylus biogeography across its known range. Acanthodactylus cladogenesis is estimated to have originated in Africa due to vicariance and dispersal events from the Oligocene onwards. Radiation started with the separation into three clades: the Western and scutellatus clades largely distributed in North Africa, and the Eastern clade occurring mostly in south-west Asia. Most Acanthodactylus species diverged during the Miocene, possibly as a result of regional geological instability and climatic changes. We support most of the current taxonomic classifications and phylogenetic relationships, and provide genetic validity for most species. We reveal a new distinct blanfordii species-group, suggest new phylogenetic positions (A. hardyi, A. masirae), and synonymize several species and subspecies (A. lineomaculatus, A. boskianus khattensis and A. b. nigeriensis) with their phylogenetically closely-related species. We recommend a thorough systematic revision of taxa, such as A. guineensis, A. grandis, A. dumerilii, A. senegalensis and the pardalis and erythrurus species-groups, which exhibit high levels of intraspecific variability and clear evidence of phylogenetic complexity. Copyright © 2016 Elsevier Inc. All rights reserved.
Cluster of Serogroup W135 Meningococci, Southeastern Florida, 2008–2009
Mejia-Echeverry, Alvaro; Fiorella, Paul; Leguen, Fermin; Livengood, John; Kay, Robyn; Hopkins, Richard
2010-01-01
Recently, 14 persons in southeastern Florida were identified with Neisseria meningitidis serogroup W135 invasive infections. All isolates tested had matching or near-matching pulsed-field gel electrophoresis patterns and belonged to the multilocus sequence type 11 clonal complex. The epidemiologic investigation suggested recent endemic transmission of this clonal complex in southeastern Florida. PMID:20031054
Clonal origins of Vibrio cholerae O1 El Tor strains, Papua New Guinea, 2009-2011.
Horwood, Paul F; Collins, Deirdre; Jonduo, Marinjho H; Rosewell, Alexander; Dutta, Samir R; Dagina, Rosheila; Ropa, Berry; Siba, Peter M; Greenhill, Andrew R
2011-11-01
We used multilocus sequence typing and variable number tandem repeat analysis to determine the clonal origins of Vibrio cholerae O1 El Tor strains from an outbreak of cholera that began in 2009 in Papua New Guinea. The epidemic is ongoing, and transmission risk is elevated within the Pacific region.
Legione, Alistair R; Amery-Gale, Jemima; Lynch, Michael; Haynes, Leesa; Gilkerson, James R; Sansom, Fiona M; Devlin, Joanne M
2016-04-28
We detected Chlamydia pecorum in two koalas (Phascolarctos cinereus) from a closed island population in Victoria, Australia, previously free of Chlamydia infection. The ompA and multilocus sequence type were most closely related to published isolates of livestock rather than koala origin, suggesting potential cross-species transmission of C. pecorum.
USDA-ARS's Scientific Manuscript database
The objective of this study was to assess genetic diversity and antimicrobial susceptibility of Campylobacter jejuni and coli recovered from broiler ceca at slaughter. Ceca from one broiler were collected from the evisceration line in a commercial processing plant, once or twice weekly for two year...
USDA-ARS's Scientific Manuscript database
The phylogeny of Amaryllidaceae tribe Hippeastreae was inferred using chloroplast (3’ycf1, ndhF, trnL-F) and nuclear (ITS rDNA) sequence data under maximum parsimony and maximum likelihood frameworks. Network analyses were applied to resolve conflicting signals among data sets and putative scenarios...
Matthew Parks; Richard Cronn; Aaron Liston
2009-01-01
We reconstruct the infrageneric phylogeny of Pinus from 37 nearly-complete chloroplast genomes (average 109 kilobases each of an approximately 120 kilobase genome) generated using multiplexed massively parallel sequencing. We found that 30/33 ingroup nodes resolved with > 95-percent bootstrap support; this is a substantial improvement relative...
Teaching the Process of Molecular Phylogeny and Systematics: A Multi-Part Inquiry-Based Exercise
ERIC Educational Resources Information Center
Lents, Nathan H.; Cifuentes, Oscar E.; Carpi, Anthony
2010-01-01
Three approaches to molecular phylogenetics are demonstrated to biology students as they explore molecular data from "Homo sapiens" and four related primates. By analyzing DNA sequences, protein sequences, and chromosomal maps, students are repeatedly challenged to develop hypotheses regarding the ancestry of the five species. Although…
USDA-ARS's Scientific Manuscript database
Fov isolates belonging to all known races, biotypes, and most of known genotypes were characterized by phylogenetic and VCG analysis. VCGs with multiple members were sequenced for at least two members, and the resulting sequences were always identical except for VCG01111 members. Vegetative compatib...
A Well-Resolved Phylogeny of the Trees of Puerto Rico Based on DNA Barcode Sequence Data
Muscarella, Robert; Uriarte, María; Erickson, David L.; Swenson, Nathan G.; Zimmerman, Jess K.; Kress, W. John
2014-01-01
Background The use of phylogenetic information in community ecology and conservation has grown in recent years. Two key issues for community phylogenetics studies, however, are (i) low terminal phylogenetic resolution and (ii) arbitrarily defined species pools. Methodology/principal findings We used three DNA barcodes (plastid DNA regions rbcL, matK, and trnH-psbA) to infer a phylogeny for 527 native and naturalized trees of Puerto Rico, representing the vast majority of the entire tree flora of the island (89%). We used a maximum likelihood (ML) approach with and without a constraint tree that enforced monophyly of recognized plant orders. Based on 50% consensus trees, the ML analyses improved phylogenetic resolution relative to a comparable phylogeny generated with Phylomatic (proportion of internal nodes resolved: constrained ML = 74%, unconstrained ML = 68%, Phylomatic = 52%). We quantified the phylogenetic composition of 15 protected forests in Puerto Rico using the constrained ML and Phylomatic phylogenies. We found some evidence that tree communities in areas of high water stress were relatively phylogenetically clustered. Reducing the scale at which the species pool was defined (from island to soil types) changed some of our results depending on which phylogeny (ML vs. Phylomatic) was used. Overall, the increased terminal resolution provided by the ML phylogeny revealed additional patterns that were not observed with a less-resolved phylogeny. Conclusions/significance With the DNA barcode phylogeny presented here (based on an island-wide species pool), we show that a more fully resolved phylogeny increases power to detect nonrandom patterns of community composition in several Puerto Rican tree communities. 
Especially if combined with additional information on species functional traits and geographic distributions, this phylogeny will (i) facilitate stronger inferences about the role of historical processes in governing the assembly and composition of Puerto Rican forests, (ii) provide insight into Caribbean biogeography, and (iii) aid in incorporating evolutionary history into conservation planning. PMID:25386879
A well-resolved phylogeny of the trees of Puerto Rico based on DNA barcode sequence data.
Muscarella, Robert; Uriarte, María; Erickson, David L; Swenson, Nathan G; Zimmerman, Jess K; Kress, W John
2014-01-01
The use of phylogenetic information in community ecology and conservation has grown in recent years. Two key issues for community phylogenetics studies, however, are (i) low terminal phylogenetic resolution and (ii) arbitrarily defined species pools. We used three DNA barcodes (plastid DNA regions rbcL, matK, and trnH-psbA) to infer a phylogeny for 527 native and naturalized trees of Puerto Rico, representing the vast majority of the entire tree flora of the island (89%). We used a maximum likelihood (ML) approach with and without a constraint tree that enforced monophyly of recognized plant orders. Based on 50% consensus trees, the ML analyses improved phylogenetic resolution relative to a comparable phylogeny generated with Phylomatic (proportion of internal nodes resolved: constrained ML = 74%, unconstrained ML = 68%, Phylomatic = 52%). We quantified the phylogenetic composition of 15 protected forests in Puerto Rico using the constrained ML and Phylomatic phylogenies. We found some evidence that tree communities in areas of high water stress were relatively phylogenetically clustered. Reducing the scale at which the species pool was defined (from island to soil types) changed some of our results depending on which phylogeny (ML vs. Phylomatic) was used. Overall, the increased terminal resolution provided by the ML phylogeny revealed additional patterns that were not observed with a less-resolved phylogeny. With the DNA barcode phylogeny presented here (based on an island-wide species pool), we show that a more fully resolved phylogeny increases power to detect nonrandom patterns of community composition in several Puerto Rican tree communities. 
Especially if combined with additional information on species functional traits and geographic distributions, this phylogeny will (i) facilitate stronger inferences about the role of historical processes in governing the assembly and composition of Puerto Rican forests, (ii) provide insight into Caribbean biogeography, and (iii) aid in incorporating evolutionary history into conservation planning.
Characterization of genetic variability of Venezuelan equine encephalitis viruses
Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.; ...
2016-04-07
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broad panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.
de Oliveira Ceita, Geruza; Vilas-Boas, Laurival Antônio; Castilho, Marcelo Santos; Carazzolle, Marcelo Falsarella; Pirovani, Carlos Priminho; Selbach-Schnadelbach, Alessandra; Gramacho, Karina Peres; Ramos, Pablo Ivan Pereira; Barbosa, Luciana Veiga; Pereira, Gonçalo Amarante Guimarães; Góes-Neto, Aristóteles
2014-10-01
The phytopathogenic fungus Moniliophthora perniciosa (Stahel) Aime & Philips-Mora, causal agent of witches' broom disease of cocoa, causes countless damage to cocoa production in Brazil. Molecular studies have attempted to identify genes that play important roles in fungal survival and virulence. In this study, sequences deposited in the M. perniciosa Genome Sequencing Project database were analyzed to identify potential biological targets. For the first time, the ergosterol biosynthetic pathway in M. perniciosa was studied and the lanosterol 14α-demethylase gene (ERG11) that encodes the main enzyme of this pathway and is a target for fungicides was cloned, characterized molecularly and its phylogeny analyzed. ERG11 genomic DNA and cDNA were characterized and sequence analysis of the ERG11 protein identified highly conserved domains typical of this enzyme, such as SRS1, SRS4, EXXR and the heme-binding region (HBR). Comparison of the protein sequences and phylogenetic analysis revealed that the M. perniciosa enzyme was most closely related to that of Coprinopsis cinerea.
de Oliveira Ceita, Geruza; Vilas-Boas, Laurival Antônio; Castilho, Marcelo Santos; Carazzolle, Marcelo Falsarella; Pirovani, Carlos Priminho; Selbach-Schnadelbach, Alessandra; Gramacho, Karina Peres; Ramos, Pablo Ivan Pereira; Barbosa, Luciana Veiga; Pereira, Gonçalo Amarante Guimarães; Góes-Neto, Aristóteles
2014-01-01
The phytopathogenic fungus Moniliophthora perniciosa (Stahel) Aime & Philips-Mora, causal agent of witches’ broom disease of cocoa, causes countless damage to cocoa production in Brazil. Molecular studies have attempted to identify genes that play important roles in fungal survival and virulence. In this study, sequences deposited in the M. perniciosa Genome Sequencing Project database were analyzed to identify potential biological targets. For the first time, the ergosterol biosynthetic pathway in M. perniciosa was studied and the lanosterol 14α-demethylase gene (ERG11) that encodes the main enzyme of this pathway and is a target for fungicides was cloned, characterized molecularly and its phylogeny analyzed. ERG11 genomic DNA and cDNA were characterized and sequence analysis of the ERG11 protein identified highly conserved domains typical of this enzyme, such as SRS1, SRS4, EXXR and the heme-binding region (HBR). Comparison of the protein sequences and phylogenetic analysis revealed that the M. perniciosa enzyme was most closely related to that of Coprinopsis cinerea. PMID:25505843
Systematic Error in Seed Plant Phylogenomics
Zhong, Bojian; Deusch, Oliver; Goremykin, Vadim V.; Penny, David; Biggs, Patrick J.; Atherton, Robin A.; Nikiforova, Svetlana V.; Lockhart, Peter James
2011-01-01
Resolving the closest relatives of Gnetales has been an enigmatic problem in seed plant phylogeny. The problem is known to be difficult because of the extent of divergence between this diverse group of gymnosperms and their closest phylogenetic relatives. Here, we investigate the evolutionary properties of conifer chloroplast DNA sequences. To improve taxon sampling of Cupressophyta (non-Pinaceae conifers), we report sequences from three new chloroplast (cp) genomes of Southern Hemisphere conifers. We have applied a site pattern sorting criterion to study compositional heterogeneity, heterotachy, and the fit of conifer chloroplast genome sequences to a general time reversible + G substitution model. We show that non-time reversible properties of aligned sequence positions in the chloroplast genomes of Gnetales mislead phylogenetic reconstruction of these seed plants. When 2,250 of the most varied sites in our concatenated alignment are excluded, phylogenetic analyses favor a close evolutionary relationship between the Gnetales and Pinaceae—the Gnepine hypothesis. Our analytical protocol provides a useful approach for evaluating the robustness of phylogenomic inferences. Our findings highlight the importance of goodness of fit between substitution model and data for understanding seed plant phylogeny. PMID:22016337
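The idea of excluding the most varied alignment sites before re-running a phylogenetic analysis can be illustrated with a minimal Python sketch. It is not the authors' site pattern sorting criterion, which the abstract does not specify; per-column Shannon entropy is used here as a stand-in measure of variability, and the function names and the toy alignment are hypothetical.

```python
import math
from collections import Counter

def column_entropy(column):
    """Shannon entropy (bits) of one alignment column, ignoring gaps and ambiguity codes."""
    states = [c for c in column.upper() if c in "ACGT"]
    if not states:
        return 0.0
    counts = Counter(states)
    total = len(states)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())

def strip_most_variable(alignment, n_remove):
    """Drop the n_remove highest-entropy columns from a dict of aligned sequences.

    Entropy is an assumed proxy for site variability, not the published criterion.
    """
    names = list(alignment)
    length = len(alignment[names[0]])
    columns = ["".join(alignment[name][i] for name in names) for i in range(length)]
    # Rank column indices from most to least variable; ties keep left-to-right order.
    ranked = sorted(range(length), key=lambda i: -column_entropy(columns[i]))
    dropped = set(ranked[:n_remove])
    return {
        name: "".join(ch for i, ch in enumerate(seq) if i not in dropped)
        for name, seq in alignment.items()
    }

# Hypothetical four-taxon alignment; the fourth column is the most varied.
toy = {
    "taxon_a": "ACGTAC",
    "taxon_b": "ACGAAC",
    "taxon_c": "ACCTAC",
    "taxon_d": "ATGCAC",
}
stripped = strip_most_variable(toy, 1)
print(stripped)
```

In practice the stripped alignment would be passed to a tree-building program, and the analysis repeated for increasing values of `n_remove` to see whether the inferred topology is stable.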
Carter, Stuart D.; Birtles, Richard J.; Brown, Jennifer M.; Hart, C. Anthony; Evans, Nicholas J.
2016-01-01
ABSTRACT Treponema species are implicated in many diseases of humans and animals. Digital dermatitis (DD) treponemes are reported to cause severe lesions in cattle, sheep, pigs, goats, and wild elk, causing substantial global animal welfare issues and economic losses. The fastidiousness of these spirochetes has previously precluded studies investigating within-phylogroup genetic diversity. An archive of treponemes that we isolated enabled multilocus sequence typing to quantify the diversity and population structure of DD treponemes. Isolates (n = 121) were obtained from different animal hosts in nine countries on three continents. The analyses herein of currently isolated DD treponemes at seven housekeeping gene loci confirm the classification of the three previously designated phylogroups: the Treponema medium, Treponema phagedenis, and Treponema pedis phylogroups. Sequence analysis of seven DD treponeme housekeeping genes revealed a generally low level of diversity among the strains within each phylogroup, removing the need for the previously used “-like” suffix. Surprisingly, all isolates within each phylogroup clustered together, regardless of host or geographic origin, suggesting that the same sequence types (STs) can infect different animals. Some STs were derived from multiple animals from the same farm, highlighting probable within-farm transmissions. Several STs infected multiple hosts from similar geographic regions, identifying probable frequent between-host transmissions. Interestingly, T. pedis appears to be evolving more quickly than the T. medium or T. phagedenis DD treponeme phylogroup, by forming two unique ST complexes. The lack of phylogenetic discrimination between treponemes isolated from different hosts or geographic regions substantially contrasts with the data for other clinically relevant spirochetes. 
IMPORTANCE The recent expansion of the host range of digital dermatitis (DD) treponemes from cattle to sheep, goats, pigs, and wild elk, coupled with the high level of 16S rRNA gene sequence similarity across hosts and with human treponemes, suggests that the same bacterial species can cause disease in multiple different hosts. This multilocus sequence typing (MLST) study further demonstrates that these bacteria isolated from different hosts are indeed very similar, raising the potential for cross-species transmission. The study also shows that infection spread occurs frequently, both locally and globally, suggesting transmission by routes other than animal-animal transmission alone. These results indicate that on-farm biosecurity is important for controlling disease spread in domesticated species. Continued surveillance and vigilance are important for ascertaining the evolution and tracking any further host range expansion of these important pathogens. PMID:27208135
Clegg, Simon R; Carter, Stuart D; Birtles, Richard J; Brown, Jennifer M; Hart, C Anthony; Evans, Nicholas J
2016-08-01
Treponema species are implicated in many diseases of humans and animals. Digital dermatitis (DD) treponemes are reported to cause severe lesions in cattle, sheep, pigs, goats, and wild elk, causing substantial global animal welfare issues and economic losses. The fastidiousness of these spirochetes has previously precluded studies investigating within-phylogroup genetic diversity. An archive of treponemes that we isolated enabled multilocus sequence typing to quantify the diversity and population structure of DD treponemes. Isolates (n = 121) were obtained from different animal hosts in nine countries on three continents. The analyses herein of currently isolated DD treponemes at seven housekeeping gene loci confirm the classification of the three previously designated phylogroups: the Treponema medium, Treponema phagedenis, and Treponema pedis phylogroups. Sequence analysis of seven DD treponeme housekeeping genes revealed a generally low level of diversity among the strains within each phylogroup, removing the need for the previously used "-like" suffix. Surprisingly, all isolates within each phylogroup clustered together, regardless of host or geographic origin, suggesting that the same sequence types (STs) can infect different animals. Some STs were derived from multiple animals from the same farm, highlighting probable within-farm transmissions. Several STs infected multiple hosts from similar geographic regions, identifying probable frequent between-host transmissions. Interestingly, T. pedis appears to be evolving more quickly than the T. medium or T. phagedenis DD treponeme phylogroup, by forming two unique ST complexes. The lack of phylogenetic discrimination between treponemes isolated from different hosts or geographic regions substantially contrasts with the data for other clinically relevant spirochetes. 
The recent expansion of the host range of digital dermatitis (DD) treponemes from cattle to sheep, goats, pigs, and wild elk, coupled with the high level of 16S rRNA gene sequence similarity across hosts and with human treponemes, suggests that the same bacterial species can cause disease in multiple different hosts. This multilocus sequence typing (MLST) study further demonstrates that these bacteria isolated from different hosts are indeed very similar, raising the potential for cross-species transmission. The study also shows that infection spread occurs frequently, both locally and globally, suggesting transmission by routes other than animal-animal transmission alone. These results indicate that on-farm biosecurity is important for controlling disease spread in domesticated species. Continued surveillance and vigilance are important for ascertaining the evolution and tracking any further host range expansion of these important pathogens. Copyright © 2016 Clegg et al.
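The core bookkeeping step of multilocus sequence typing, grouping isolates by their allele profile across housekeeping loci, can be sketched as follows. This is a simplified illustration rather than the study's pipeline: the isolate names, host labels and allele numbers are invented, and numbering STs in order of first appearance is an arbitrary convention assumed for the sketch.

```python
def assign_sequence_types(profiles):
    """Map each isolate to a sequence type (ST) number.

    profiles: dict of isolate name -> tuple of allele numbers, one per locus.
    Isolates with identical allele profiles share an ST. STs are numbered in
    order of first appearance (an assumption made for this sketch only).
    """
    st_of_profile = {}
    st_of_isolate = {}
    for isolate, profile in profiles.items():
        key = tuple(profile)
        if key not in st_of_profile:
            st_of_profile[key] = len(st_of_profile) + 1
        st_of_isolate[isolate] = st_of_profile[key]
    return st_of_isolate

# Hypothetical allele profiles at seven housekeeping loci.
profiles = {
    "cattle_farm1_a": (1, 1, 2, 1, 3, 1, 1),
    "sheep_farm1_b": (1, 1, 2, 1, 3, 1, 1),  # same profile as the cattle isolate
    "elk_wild_c": (1, 2, 2, 1, 3, 1, 1),     # differs at the second locus
}
sts = assign_sequence_types(profiles)
print(sts)
```

Two isolates from different hosts receiving the same ST is the kind of pattern the abstract interprets as evidence that one sequence type can infect different animals.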
Bhattacharjee, Kaushik; Banerjee, Subhro; Joshi, Santa Ram
2012-01-01
Actinomycetes were isolated from soil samples collected along an altitudinal gradient in North-East India and characterized for computational RNomics-based phylogeny. A total of 52 diverse Streptomyces isolates were recovered on four different media, and six of these were selected on the basis of cultural characteristics and microscopic and biochemical studies. Sequencing of the 16S rDNA of the selected isolates assigned them to six different species of Streptomyces. Molecular morphometric and physico-kinetic analyses of the 16S rRNA sequences were performed to predict the diversity of the genus. The computational RNomics study revealed the significance of structural RNA-based phylogenetic analysis in a relatively diverse group of Streptomyces. PMID:22829729
Ibarra-Cerdeña, Carlos N.; Zaldívar-Riverón, Alejandro; Peterson, A. Townsend; Sánchez-Cordero, Víctor; Ramsey, Janine M.
2014-01-01
The niche conservatism hypothesis states that related species diverge in niche characteristics at lower rates than expected, given their lineage divergence. Here we analyze whether niche conservatism is a common pattern among vector species (Hemiptera: Reduviidae: Triatominae) of Trypanosoma cruzi that inhabit North and Central America, a highly heterogeneous landmass in terms of environmental gradients. Mitochondrial and nuclear loci were used in a multi-locus phylogenetic framework to reconstruct phylogenetic relationships among species and estimate time of divergence of selected clades to draw biogeographic inferences. Then, we estimated similarity between the ecological niche of sister species and tested the niche conservatism hypothesis using our best estimate of phylogeny. Triatoma is not monophyletic. A primary clade with all North and Central American (NCA) triatomine species from the genera Triatoma, Dipetalogaster, and Panstrongylus, was consistently recovered. Nearctic species within the NCA clade (T. p. protracta, T. r. rubida) diverged during the Pliocene, whereas the Neotropical species (T. phyllosoma, T. longipennis, T. dimidiata complex) are estimated to have diverged more recently, during the Pleistocene. The hypothesis of niche conservatism could not be rejected for any of six sister species pairs. Niche similarity between sister species best fits a retention model. While this framework is used here to infer niche evolution, it has a direct impact on spatial vector dynamics driven by human population movements, expansion of transportation networks and climate change scenarios. PMID:25356550
2009-01-01
Background Bacterial genomes are mosaic structures composed of genes present in every strain of the same species (core genome), and genes present in some but not all strains of a species (accessory genome). The aim of this study was to compare the genetic diversity of core and accessory genes of a Salmonella enterica subspecies enterica serovar Typhimurium (Typhimurium) population isolated from food-animal and human sources in four regions of Mexico. Multilocus sequence typing (MLST) and macrorestriction fingerprints by pulsed-field gel electrophoresis (PFGE) were used to address the core genetic variation, and genes involved in pathogenesis and antibiotic resistance were selected to evaluate the accessory genome. Results We found a low genetic diversity for both housekeeping and accessory genes. Sequence type 19 (ST19) was supported as the founder genotype of STs 213, 302 and 429. We found a temporal pattern in which the derived ST213 is replacing the founder ST19 in the four geographic regions analyzed and a geographic trend in the number of resistance determinants. The distribution of the accessory genes was not random among chromosomal genotypes. We detected strong associations among the different accessory genes and the multilocus chromosomal genotypes (STs). First, the Salmonella virulence plasmid (pSTV) was found mostly in ST19 isolates. Second, the plasmid-borne betalactamase cmy-2 was found only in ST213 isolates. Third, the most abundant integron, IP-1 (dfrA12, orfF and aadA2), was found only in ST213 isolates. Fourth, the Salmonella genomic island (SGI1) was found mainly in a subgroup of ST19 isolates carrying pSTV. The mapping of accessory genes and multilocus genotypes on the dendrogram derived from macrorestriction fingerprints allowed the establishment of genetic subgroups within the population.
Conclusion Despite the low levels of genetic diversity of core and accessory genes, the non-random distribution of the accessory genes across chromosomal backgrounds allowed us to discover genetic subgroups within the population. This study provides information about the importance of the accessory genome in generating genetic variability within a bacterial population. PMID:19573249
Taxonomic and phytogeographic implications from ITS phylogeny in Berberis (Berberidaceae).
Kim, Young-Dong; Kim, Sung-Hee; Landrum, Leslie R
2004-06-01
A phylogeny based on the internal transcribed spacer (ITS) sequences from 79 taxa representing much of the diversity of Berberis L. (four major groups and 22 sections) was constructed for the first time. The phylogeny was basically congruent with the previous classification schemes at higher taxonomic levels, such as groups and subgroups. A notable exception is the non-monophyly of the group Occidentales of compound-leaved Berberis (previously separated as Mahonia). At lower levels, however, most of the previous sections and subsections were not evident, especially in simple-leaved Berberis. A possible relationship between section Horridae (group Occidentales) and the simple-leaved Berberis clade implies paraphyly of the compound-leaved Berberis. A well-known South America-Old World (mainly Asia) disjunctive distribution pattern of the simple-leaved Berberis is explained by a vicariance event occurring in the Cretaceous period. The ITS phylogeny also suggests that a possible connection between the Asian and South American groups through the North American species (Berberis canadensis or B. fendleri) is highly unlikely.
SNV-PPILP: refined SNV calling for tumor data using perfect phylogenies and ILP.
van Rens, Karen E; Mäkinen, Veli; Tomescu, Alexandru I
2015-04-01
Recent studies sequenced tumor samples from the same progenitor at different development stages and showed that by taking into account the phylogeny of this development, single-nucleotide variant (SNV) calling can be improved. Accurate SNV calls can better reveal early-stage tumors, identify mechanisms of cancer progression or help in drug targeting. We present SNV-PPILP, a fast and easy to use tool for refining GATK's Unified Genotyper SNV calls, for multiple samples assumed to form a phylogeny. We tested SNV-PPILP on simulated data, with a varying number of samples, SNVs, read coverage and violations of the perfect phylogeny assumption. We always match or improve the accuracy of GATK, with a significant improvement on low read coverage. SNV-PPILP, available at cs.helsinki.fi/gsa/snv-ppilp/, is written in Python and requires the free ILP solver lp_solve. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
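The perfect phylogeny assumption mentioned above has a simple combinatorial test when the ancestral (normal) state is taken to be the absence of every variant: the sets of samples carrying any two variants must be either disjoint or nested. The Python sketch below checks that condition on a binary sample-by-variant matrix. It only illustrates the assumption and is not SNV-PPILP's ILP formulation; the function name and the example matrices are hypothetical.

```python
from itertools import combinations

def admits_perfect_phylogeny(matrix):
    """Check whether a binary matrix fits a rooted perfect phylogeny.

    matrix[s][v] is 1 if sample s carries variant v, else 0. With an
    all-zero root, a perfect phylogeny exists exactly when, for every pair
    of variants, the sets of carrier samples are disjoint or nested.
    """
    n_variants = len(matrix[0]) if matrix else 0
    carriers = [
        {s for s, row in enumerate(matrix) if row[v]} for v in range(n_variants)
    ]
    for a, b in combinations(carriers, 2):
        overlapping = bool(a & b)
        nested = a <= b or b <= a
        if overlapping and not nested:
            return False
    return True

# Variant 1 occurs only within the samples that carry variant 0: compatible.
compatible = [[1, 0], [1, 1], [0, 0]]
# The two variants overlap in one sample but neither contains the other: conflict.
conflicting = [[1, 0], [1, 1], [0, 1]]
print(admits_perfect_phylogeny(compatible), admits_perfect_phylogeny(conflicting))
```

A conflict of the second kind is what a refinement tool would have to resolve, for example by flipping a low-confidence call, before the samples can be placed on a single tree.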
Migration and Persistence of Human Influenza A Viruses, Vietnam, 2001–2008
Le, Mai Quynh; Lam, Ha Minh; Cuong, Vuong Duc; Lam, Tommy Tsan-Yuk; Halpin, Rebecca A; Wentworth, David E; Hien, Nguyen Tran; Thanh, Le Thi; Phuong, Hoang Vu Mai; Horby, Peter
2013-01-01
Understanding global influenza migration and persistence is crucial for vaccine strain selection. Using 240 new human influenza A virus whole genomes collected in Vietnam during 2001–2008, we looked for persistence patterns and migratory connections between Vietnam and other countries. We found that viruses in Vietnam migrate to and from China, Hong Kong, Taiwan, Cambodia, Japan, South Korea, and the United States. We attempted to reduce geographic bias by generating phylogenies subsampled at the year and country levels. However, migration events in these phylogenies were still driven by the presence or absence of sequence data, indicating that an epidemiologic study design that controls for prevalence is required for robust migration analysis. With whole-genome data, most migration events are not detectable from the phylogeny of the hemagglutinin segment alone, although general migratory relationships between Vietnam and other countries are visible in the hemagglutinin phylogeny. It is possible that virus lineages in Vietnam persisted for >1 year. PMID:24188643
Wang, Xiao-Jing; Wang, Xiao-Xing; Wang, Ya-Jun; Wang, Xi-Zhong; He, Guang-Xin; Chen, Hong-Wei; Fei, Li-Song
2002-09-01
Activin, a member of the transforming growth factor-beta (TGF beta) superfamily of proteins and receptors, is known to have broad-ranging effects in animals. The mature peptide of the beta A subunit, one of the most highly conserved sequences, can elevate the basal secretion of follicle-stimulating hormone (FSH) in the pituitary, and FSH is pivotal to reproduction. Reproductive failure is one of the main factors driving the giant panda toward extinction. The sequence encoding the mature peptide of the Activin beta A subunit gene was successfully amplified from genomic DNA of the giant panda, red panda and Malayan sun bear by polymerase chain reaction (PCR) with a pair of degenerate primers. The PCR products were cloned into the vector pBlueScript+ in Escherichia coli. Sequence analysis shows that this gene segment has the same length (359 bp) in all three species and contains no intron. The sequence encodes a peptide of 119 amino acid residues. Homology comparison demonstrates 93.9% DNA homology and 99% amino acid homology among the three species. Both the GenBank BLAST search results and the restriction enzyme maps reveal that the sequences encoding the Activin beta A subunit mature peptide are highly conserved among species during evolution. Phylogenetic analysis was performed with the PHYLIP software package, and three different methods yielded a consistent phylogenetic tree. The results accord with the view that the giant panda is more closely related to the Malayan sun bear than to the red panda. The giant panda should be grouped into the bear family (Ursidae) with the Malayan sun bear, whereas the red panda would be better placed in a family of its own (the red panda family) because of the great differences between the red panda and the bears (Ursidae).
Yi, Zhenzhen; Song, Weibo; Clamp, John C; Chen, Zigui; Gao, Shan; Zhang, Qianqian
2009-03-01
Comprehensive molecular analyses of phylogenetic relationships within euplotid ciliates are relatively rare, and the relationships among some families remain questionable. We performed phylogenetic analyses of the order Euplotida based on new sequences of the gene coding for small-subunit RNA (SSrRNA) from a variety of taxa across the entire order as well as sequences from some of these taxa of other genes (ITS1-5.8S-ITS2 region and histone H4) that have not been included in previous analyses. Phylogenetic trees based on SSrRNA gene sequences constructed with four different methods had a consistent branching pattern that included the following features: (1) the "typical" euplotids comprised a paraphyletic assemblage composed of two divergent clades (family Uronychiidae and families Euplotidae-Certesiidae-Aspidiscidae-Gastrocirrhidae), (2) in the family Uronychiidae, the genera Uronychia and Paradiophrys formed a clearly outlined, well-supported clade that seemed to be rather divergent from Diophrys and Diophryopsis, suggesting that the Diophrys-complex may have had a longer and more separate evolutionary history than previously supposed, (3) inclusion of 12 new SSrRNA sequences in analyses of Euplotidae revealed two new clades of species within the family and cast additional doubt on the present classification of genera within the family, and (4) the intraspecific divergence among five species of Aspidisca was far greater than those of closely related genera. The ITS1-5.8S-ITS2 coding regions and partial histone H4 genes of six morphospecies in the Diophrys-complex were sequenced along with their SSrRNA genes and used to compare phylogenies constructed from single data sets to those constructed from combined sets. Results indicated that combined analyses could be used to construct more reliable, less ambiguous phylogenies of complex groups like the order Euplotida, because they provide a greater amount and diversity of information.
A six-gene phylogeny provides new insights into choanoflagellate evolution.
Carr, Martin; Richter, Daniel J; Fozouni, Parinaz; Smith, Timothy J; Jeuck, Alexandra; Leadbeater, Barry S C; Nitsche, Frank
2017-02-01
Recent studies have shown that molecular phylogenies of the choanoflagellates (Class Choanoflagellatea) are in disagreement with their traditional taxonomy, based on morphology, and that Choanoflagellatea requires considerable taxonomic revision. Furthermore, phylogenies suggest that the morphological and ecological evolution of the group is more complex than has previously been recognized. Here we address the taxonomy of the major choanoflagellate order Craspedida, by erecting four new genera. The new genera are shown to be morphologically, ecologically and phylogenetically distinct from other choanoflagellate taxa. Furthermore, we name five novel craspedid species, as well as formally describe ten species that have been shown to be either misidentified or require taxonomic revision. Our revised phylogeny, including 18 new species and sequence data for two additional genes, provides insights into the morphological and ecological evolution of the choanoflagellates. We examine the distribution within choanoflagellates of these two additional genes, EF-1A and EFL, closely related translation GTPases which are required for protein synthesis. Mapping the presence and absence of these genes onto the phylogeny highlights multiple events of gene loss within the choanoflagellates. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Mitogenomic perspectives on the origin and phylogeny of living amphibians.
Zhang, Peng; Zhou, Hui; Chen, Yue-Qin; Liu, Yi-Fei; Qu, Liang-Hu
2005-06-01
Establishing the relationships among modern amphibians (lissamphibians) and their ancient relatives is necessary for our understanding of early tetrapod evolution. However, the phylogeny is still intractable because of the highly specialized anatomy and poor fossil record of lissamphibians. Paleobiologists are still not sure whether lissamphibians are monophyletic or polyphyletic, and which ancient group (temnospondyls or lepospondyls) is most closely related to them. In an attempt to address these problems, eight mitochondrial genomes of living amphibians were determined and compared with previously published amphibian sequences. A comprehensive molecular phylogenetic analysis of nucleotide sequences yields a highly resolved tree congruent with the traditional hypotheses (Batrachia). By using a molecular clock-independent approach for inferring dating information from molecular phylogenies, we present here the first molecular timescale for lissamphibian evolution, which suggests that lissamphibians first emerged about 330 million years ago. By observing the fit between molecular and fossil times, we suggest that the temnospondyl-origin hypothesis for lissamphibians is more credible than other hypotheses. Moreover, under this timescale, the potential geographic origins of the main living amphibian groups are discussed: (i) advanced frogs (neobatrachians) may have an Africa-India origin; (ii) salamanders may have originated in east Asia; (iii) the tropical forest of Triassic Pangaea may be the place of origin of the ancient caecilians. An accurate phylogeny with divergence times can also help direct the search for "missing" fossils, and can benefit comparative studies of amphibian evolution.
2014-01-01
Background Next-generation sequencing has provided a wealth of plastid genome sequence data from an increasingly diverse set of green plants (Viridiplantae). Although these data have helped resolve the phylogeny of numerous clades (e.g., green algae, angiosperms, and gymnosperms), their utility for inferring relationships across all green plants is uncertain. Viridiplantae originated 700-1500 million years ago and may comprise as many as 500,000 species. This clade represents a major source of photosynthetic carbon and contains an immense diversity of life forms, including some of the smallest and largest eukaryotes. Here we explore the limits and challenges of inferring a comprehensive green plant phylogeny from available complete or nearly complete plastid genome sequence data. Results We assembled protein-coding sequence data for 78 genes from 360 diverse green plant taxa with complete or nearly complete plastid genome sequences available from GenBank. Phylogenetic analyses of the plastid data recovered well-supported backbone relationships and strong support for relationships that were not observed in previous analyses of major subclades within Viridiplantae. However, there also is evidence of systematic error in some analyses. In several instances we obtained strongly supported but conflicting topologies from analyses of nucleotides versus amino acid characters, and the considerable variation in GC content among lineages and within single genomes affected the phylogenetic placement of several taxa. Conclusions Analyses of the plastid sequence data recovered a strongly supported framework of relationships for green plants. 
This framework includes: i) the placement of Zygnematophyceae as sister to land plants (Embryophyta), ii) a clade of extant gymnosperms (Acrogymnospermae) with cycads + Ginkgo sister to remaining extant gymnosperms and with gnetophytes (Gnetophyta) sister to non-Pinaceae conifers (Gnecup trees), and iii) within the monilophyte clade (Monilophyta), Equisetales + Psilotales are sister to Marattiales + leptosporangiate ferns. Our analyses also highlight the challenges of using plastid genome sequences in deep-level phylogenomic analyses, and we provide suggestions for future analyses that will likely incorporate plastid genome sequence data for thousands of species. We particularly emphasize the importance of exploring the effects of different partitioning and character coding strategies. PMID:24533922
NASA Technical Reports Server (NTRS)
Woese, C. R.; Achenbach, L.; Rouviere, P.; Mandelco, L.
1991-01-01
A major and too little recognized source of artifact in phylogenetic analysis of molecular sequence data is compositional difference among sequences. The problem becomes particularly acute when alignments contain ribosomal RNAs from both mesophilic and thermophilic species. Among prokaryotes the latter are considerably higher in G + C content than the former, which often results in artificial clustering of thermophilic lineages and their being placed artificially deep in phylogenetic trees. In this communication we review archaeal phylogeny in the light of this consideration, focusing in particular on the phylogenetic position of the sulfate reducing species Archaeoglobus fulgidus, using both 16S rRNA and 23S rRNA sequences. The analysis shows clearly that the previously reported deep branching of the A. fulgidus lineage (very near the base of the euryarchaeal side of the archaeal tree) is incorrect, and that the lineage actually groups with a previously recognized unit that comprises the Methanomicrobiales and extreme halophiles.
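The compositional difference described above can be screened for before tree building by comparing G + C content across the aligned sequences. The following Python sketch shows one way to compute it; the function name and the two toy rRNA fragments are hypothetical and are not taken from the report.

```python
def gc_fraction(seq):
    """Fraction of G + C among unambiguous bases in a DNA or RNA sequence.

    Gaps and ambiguity codes are ignored; U is treated as T so that rRNA
    sequences can be passed in directly.
    """
    seq = seq.upper().replace("U", "T")
    counted = [c for c in seq if c in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no unambiguous bases")
    return sum(c in "GC" for c in counted) / len(counted)

# Hypothetical fragments: one G + C rich, one A + U rich.
fragments = {
    "thermophile_like": "GGCCGCGGAU",
    "mesophile_like": "AAUUGCAUUA",
}
for name, seq in fragments.items():
    print(name, round(gc_fraction(seq), 2))
```

A wide spread of values across taxa in a real alignment would be a warning that composition, rather than shared ancestry, could be driving some groupings.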
Genomic sequencing of deer tick virus and phylogeny of powassan-related viruses of North America.
Kuno, G; Artsob, H; Karabatsos, N; Tsuchiya, K R; Chang, G J
2001-11-01
Powassan (POW) virus is responsible for central nervous system infection in humans in North America and the eastern parts of Russia. Recently, a new flavivirus related to POW virus, deer tick (DT) virus, was isolated in the United States, but neither its pathogenic potential in humans nor its taxonomic relationship with POW virus has been elucidated. In this study, we obtained the near-full-length genomic sequence of the DT virus and complete sequences of 3 genomic regions from 15 POW-related virus strains. The phylogeny revealed 2 lineages, one containing the prototype POW virus and the other containing DT virus. Both lineages can cause central nervous system infection in humans. By use of the combination of molecular definition of virus species within the genus Flavivirus and serological distinction in a 2-way cross-neutralization test, the lineage of DT virus is classified as a distinct genotype of POW virus.
Markov-modulated Markov chains and the covarion process of molecular evolution.
Galtier, N; Jean-Marie, A
2004-01-01
The covarion (or site specific rate variation, SSRV) process of biological sequence evolution is a process by which the evolutionary rate of a nucleotide/amino acid/codon position can change in time. In this paper, we introduce time-continuous, space-discrete, Markov-modulated Markov chains as a model for representing SSRV processes, generalizing existing theory to any model of rate change. We propose a fast algorithm for diagonalizing the generator matrix of relevant Markov-modulated Markov processes. This algorithm makes phylogeny likelihood calculation tractable even for a large number of rate classes and a large number of states, so that SSRV models become applicable to amino acid or codon sequence datasets. Using this algorithm, we investigate the accuracy of the discrete approximation to the Gamma distribution of evolutionary rates, widely used in molecular phylogeny. We show that a relatively large number of classes is required to achieve accurate approximation of the exact likelihood when the number of analyzed sequences exceeds 20, both under the SSRV and among site rate variation (ASRV) models.
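A common way to write down a Markov-modulated Markov chain of this kind is to pair each character state with a rate class and combine two generators: a substitution generator Q, scaled by the rate of the current class, and a switching generator S that moves a site between classes. The Python sketch below builds that combined generator, diag(rates) ⊗ Q + S ⊗ I, with plain lists. It illustrates the model structure only and is not the fast diagonalization algorithm proposed in the paper; the function name and the two-state, two-class example values are assumptions.

```python
def modulated_generator(Q, rates, S):
    """Build the generator of a Markov-modulated substitution process.

    Q:     n x n substitution generator (rows sum to zero).
    rates: k rate multipliers, one per rate class.
    S:     k x k generator for switching between rate classes.
    The combined state (class i, character a) has index i * n + a.
    """
    n, k = len(Q), len(rates)
    size = k * n
    G = [[0.0] * size for _ in range(size)]
    for i in range(k):
        for a in range(n):
            row = i * n + a
            # Substitutions within class i, scaled by that class's rate.
            for b in range(n):
                G[row][i * n + b] += rates[i] * Q[a][b]
            # Switches between rate classes, keeping the character unchanged.
            for j in range(k):
                G[row][j * n + a] += S[i][j]
    return G

# Hypothetical example: two character states, an "off" class and an "on" class.
Q = [[-1.0, 1.0], [1.0, -1.0]]
rates = [0.0, 2.0]
S = [[-0.5, 0.5], [0.5, -0.5]]
G = modulated_generator(Q, rates, S)
for row in G:
    print(row)
```

Each row of the result sums to zero, as a generator must, and with one class rate set to zero the example reduces to the classic on/off covarion picture.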
Investigation of the Evolutionary Development of the Genus Bifidobacterium by Comparative Genomics
Lugli, Gabriele Andrea; Milani, Christian; Turroni, Francesca; Duranti, Sabrina; Ferrario, Chiara; Viappiani, Alice; Mancabelli, Leonardo; Mangifesta, Marta; Taminiau, Bernard; Delcenserie, Véronique; van Sinderen, Douwe
2014-01-01
The Bifidobacterium genus currently encompasses 48 recognized taxa, which have been isolated from different ecosystems. However, the current phylogeny of bifidobacteria is hampered by the relative paucity of genotypic data. Here, we reassessed the taxonomy of this bacterial genus using genome-based approaches, which demonstrated that the previous taxonomic view of bifidobacteria contained several inconsistencies. In particular, high levels of genetic relatedness were shown to exist between particular Bifidobacterium taxa which would not justify their status as separate species. The results presented here are based on average nucleotide identity analysis involving the genome sequences for each type strain of the 48 bifidobacterial taxa, as well as phylogenetic comparative analysis of the predicted core genome of the Bifidobacterium genus. The results of this study demonstrate that the availability of complete genome sequences allows the reconstruction of a more robust bifidobacterial phylogeny than that obtained from a single gene-based sequence comparison, thus discouraging the assignment of a new or separate bifidobacterial taxon without such a genome-based validation. PMID:25107967
Kochzius, Marc; Söller, Rainer; Khalaf, Maroof A; Blohm, Dietmar
2003-09-01
This study investigates the molecular phylogeny of seven lionfishes of the genera Dendrochirus and Pterois. MP, ML, and NJ phylogenetic analyses based on 964 bp of partial mitochondrial DNA sequences (cytochrome b and 16S rDNA) revealed two main clades: (1) "Pterois" clade (Pterois miles and Pterois volitans), and (2) "Pteropterus-Dendrochirus" clade (remainder of the sampled species). The position of Dendrochirus brachypterus either basal to the main clades or in the "Pteropterus-Dendrochirus" clade cannot be resolved. However, the molecular phylogeny did not support the current separation of the genera Pterois and Dendrochirus. The siblings P. miles and P. volitans are clearly separated and our results support the proposed allopatric or parapatric distribution in the Indian and Pacific Ocean. However, the present analysis cannot reveal if P. miles and P. volitans are separate species or two populations of a single species, because the observed separation in different clades can be explained either by speciation or by lineage sorting. Molecular clock estimates for the siblings P. miles and P. volitans suggest a divergence time of 2.4-8.3 mya, which coincides with geological events that created vicariance between populations of the Indian and Pacific Ocean.
2013-01-01
Background Phylogeny estimation from aligned haplotype sequences has attracted more and more attention in recent years due to its importance in the analysis of many fine-scale genetic data. Its application fields range from medical research, to drug discovery, to epidemiology, to population dynamics. The literature on molecular phylogenetics proposes a number of criteria for selecting a phylogeny from among plausible alternatives. Usually, such criteria can be expressed by means of objective functions, and the phylogenies that optimize them are referred to as optimal. One of the most important estimation criteria is parsimony, which states that the optimal phylogeny T∗ for a set H of n haplotype sequences over a common set of variable loci is the one that satisfies the following requirements: (i) it has the shortest length and (ii) it is such that, for each pair of distinct haplotypes hi,hj∈H, the sum of the edge weights belonging to the path from hi to hj in T∗ is not smaller than the observed number of changes between hi and hj. Finding the most parsimonious phylogeny for H involves solving an optimization problem, called the Most Parsimonious Phylogeny Estimation Problem (MPPEP), which is NP-hard in many of its versions. Results In this article we investigate a recent version of the MPPEP that arises when input data consist of single nucleotide polymorphism haplotypes extracted from a population of individuals on a common genomic region. Specifically, we explore the prospects for improving on the implicit enumeration strategy used in previous work using a novel problem formulation and a series of strengthening valid inequalities and preliminary symmetry breaking constraints to more precisely bound the solution space and accelerate implicit enumeration of possible optimal phylogenies. We present the basic formulation and then introduce a series of provable valid constraints to reduce the solution space.
We then prove that these constraints can often lead to significant reductions in the gap between the optimal solution and its non-integral linear programming bound relative to the prior art as well as often substantially faster processing of moderately hard problem instances. Conclusion We provide an indication of the conditions under which such an optimal enumeration approach is likely to be feasible, suggesting that these strategies are usable for relatively large numbers of taxa, although with stricter limits on numbers of variable sites. The work thus provides methodology suitable for provably optimal solution of some harder instances that resist all prior approaches. PMID:23343437
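The two parsimony requirements stated in this abstract, (i) shortest total length and (ii) every path between two haplotypes at least as long as their observed number of differences, can be checked for a candidate tree with a short Python sketch. It only evaluates a given tree; it is not the integer programming formulation of the article. The haplotype strings, node names, edge weights, and the function name check_tree are all made up for illustration.

```python
from itertools import combinations


def hamming(a, b):
    """Observed number of changes between two aligned haplotypes."""
    return sum(x != y for x, y in zip(a, b))


def path_length(adj, src, dst):
    """Sum of edge weights on the unique src-dst path of a tree."""
    stack = [(src, None, 0)]
    while stack:
        node, parent, dist = stack.pop()
        if node == dst:
            return dist
        for nxt, weight in adj[node]:
            if nxt != parent:
                stack.append((nxt, node, dist + weight))
    raise ValueError("nodes are not connected")


def check_tree(edges, haplotypes):
    """Return (tree length, whether requirement (ii) holds)."""
    adj = {}
    for u, v, w in edges:
        adj.setdefault(u, []).append((v, w))
        adj.setdefault(v, []).append((u, w))
    length = sum(w for _, _, w in edges)
    feasible = all(
        path_length(adj, a, b) >= hamming(haplotypes[a], haplotypes[b])
        for a, b in combinations(haplotypes, 2)
    )
    return length, feasible


# Hypothetical toy data: three haplotypes over three variable sites.
haps = {"h1": "000", "h2": "100", "h3": "011"}
good_tree = [("h1", "h2", 1), ("h1", "h3", 2)]
short_tree = [("h1", "h2", 1), ("h1", "h3", 1)]  # shorter, but violates (ii)

print(check_tree(good_tree, haps))
print(check_tree(short_tree, haps))
```

Among all trees that pass the feasibility check, the most parsimonious one is the tree with the smallest length; enumerating those trees efficiently is the hard part that the article addresses.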
O'Hara, F. Patrick; Suaya, Jose A.; Ray, G. Thomas; Baxter, Roger; Brown, Megan L.; Mera, Robertino M.; Close, Nicole M.; Thomas, Elizabeth
2016-01-01
A number of molecular typing methods have been developed for characterization of Staphylococcus aureus isolates. The utility of these systems depends on the nature of the investigation for which they are used. We compared two commonly used methods of molecular typing, multilocus sequence typing (MLST) (and its clustering algorithm, Based Upon Related Sequence Type [BURST]) with the staphylococcal protein A (spa) typing (and its clustering algorithm, Based Upon Repeat Pattern [BURP]), to assess the utility of these methods for macroepidemiology and evolutionary studies of S. aureus in the United States. We typed a total of 366 clinical isolates of S. aureus by these methods and evaluated indices of diversity and concordance values. Our results show that, when combined with the BURP clustering algorithm to delineate clonal lineages, spa typing produces results that are highly comparable with those produced by MLST/BURST. Therefore, spa typing is appropriate for use in macroepidemiology and evolutionary studies and, given its lower implementation cost, this method appears to be more efficient. The findings are robust and are consistent across different settings, patient ages, and specimen sources. Our results also support a model in which the methicillin-resistant S. aureus (MRSA) population in the United States comprises two major lineages (USA300 and USA100), which each consist of closely related variants. PMID:26669861
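The abstract does not state which diversity index was evaluated. A common choice when comparing typing methods is Simpson's index of diversity, the probability that two isolates drawn without replacement belong to different types. The Python sketch below shows that calculation under this assumption; the type labels are invented toy data, not results from the study.

```python
from collections import Counter


def simpson_diversity(type_labels):
    """Simpson's index of diversity for a list of type assignments.

    D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)),
    where n_j is the number of isolates of type j and N the total.
    """
    n = len(type_labels)
    if n < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(type_labels)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n * (n - 1))


# Hypothetical assignments of six isolates to three types.
labels = ["A", "A", "A", "B", "B", "C"]
print(round(simpson_diversity(labels), 3))
```

A value near 1 means the method splits the isolates into many small groups; a value near 0 means most isolates share one type.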
O'Hara, F Patrick; Suaya, Jose A; Ray, G Thomas; Baxter, Roger; Brown, Megan L; Mera, Robertino M; Close, Nicole M; Thomas, Elizabeth; Amrine-Madsen, Heather
2016-01-01
A number of molecular typing methods have been developed for characterization of Staphylococcus aureus isolates. The utility of these systems depends on the nature of the investigation for which they are used. We compared two commonly used methods of molecular typing, multilocus sequence typing (MLST) (and its clustering algorithm, Based Upon Related Sequence Type [BURST]) with the staphylococcal protein A (spa) typing (and its clustering algorithm, Based Upon Repeat Pattern [BURP]), to assess the utility of these methods for macroepidemiology and evolutionary studies of S. aureus in the United States. We typed a total of 366 clinical isolates of S. aureus by these methods and evaluated indices of diversity and concordance values. Our results show that, when combined with the BURP clustering algorithm to delineate clonal lineages, spa typing produces results that are highly comparable with those produced by MLST/BURST. Therefore, spa typing is appropriate for use in macroepidemiology and evolutionary studies and, given its lower implementation cost, this method appears to be more efficient. The findings are robust and are consistent across different settings, patient ages, and specimen sources. Our results also support a model in which the methicillin-resistant S. aureus (MRSA) population in the United States comprises two major lineages (USA300 and USA100), which each consist of closely related variants.
Nakano, V; Ignacio, A; Llanco, L; Bueris, V; Sircili, M P; Avila-Campos, M J
2017-04-01
Clostridium perfringens is an anaerobic bacterium ubiquitous in various environments, especially in soil and the gastrointestinal tract of healthy humans and animals. In this study, a multilocus sequence typing protocol was used to investigate genotypic relationships among 40 C. perfringens strains isolated from humans and broiler chickens with necrotic enteritis [NE]. The results indicated a few clonal populations, mainly observed in human strains, with 32.5% of all strains associated with one of three clonal complexes and 30 sequence types. The CC-1 cluster showed an interesting and unexpected result because it contained seven strains [six from animals and one of human origin]. Detection assays for toxin genes tpeL and netB were also performed. The netB gene was only observed in 7.5% of the strains from healthy humans. The toxin gene tpeL was detected in 22.5% of the C. perfringens strains isolated from three individuals and in six broilers with NE. Our study describes the role of some C. perfringens strains of human origin acting as reservoirs of virulence genes and sources of infection. In addition, the strains of human and animal origin were found to be genetically distinct but phylogenetically close, and the human strains showed more diversity than the animal strains. Copyright © 2017 Elsevier Ltd. All rights reserved.
Rodriguez, C; Taminiau, B; Brévers, B; Avesani, V; Van Broeck, J; Leroux, A A; Amory, H; Delmée, M; Daube, G
2014-08-06
Clostridium difficile has been identified as a significant agent of diarrhoea and enterocolitis in both foals and adult horses. Hospitalization, antibiotic therapy or changes in diet may contribute to the development of C. difficile infection. Horses admitted to a care unit are therefore at greater risk of being colonized. The aim of this study was to investigate the carriage of C. difficile in hospitalized horses and the possible influence of some risk factors in colonization. During a seven-month period, faecal samples and data relating to the clinical history of horses admitted to a veterinary teaching hospital were collected. C. difficile isolates were characterized through toxin profiles, cytotoxicity activity, PCR-ribotyping, antimicrobial resistance and multilocus sequence typing (MLST). Ten isolates were obtained with a total of seven different PCR-ribotypes, including PCR-ribotype 014. Five of them were identified as toxinogenic. A high resistance to gentamicin, clindamycin and ceftiofur was found. MLST revealed four different sequence types (ST), which included ST11, ST26, ST2 and ST15, and phylogenetic analysis showed that most of the isolates clustered in the same lineage. Clinical history suggests that horses frequently harbour toxigenic and non-toxigenic C. difficile and that in most cases they are colonized regardless of the reason for hospitalization; the development of diarrhoea is more unusual. Copyright © 2014 Elsevier B.V. All rights reserved.
Methicillin-resistant Staphylococcus aureus from dental school clinic surfaces and students.
Roberts, Marilyn C; Soge, Olusegun O; Horst, Jeremy A; Ly, Kiet A; Milgrom, Peter
2011-10-01
Methicillin-resistant Staphylococcus aureus (MRSA) isolated from frequently touched dental school clinic surfaces were compared with MRSA isolated from nasal cultures of dental students. Sixty-one dental students and 95 environmental surfaces from 7 clinics were sampled using SANICULT (Starplex Scientific Inc, Etobicoke, Ontario, Canada) swabs. Antimicrobial susceptibility testing was performed, and pulsed-field gel electrophoresis analysis, the mecA gene, multilocus sequence type, and SCCmec type were determined by polymerase chain reaction and sequencing. Thirteen (21%) dental students and 8 (8.4%) surfaces were MRSA positive. Three MRSA strains were SCCmec type IV, whereas 3 were nontypeable isolates and Panton-Valentine leukocidin positive (PVL+), and none were USA300. One surface and 1 student isolate shared the same multilocus sequence type ST 8 and were 75% related. Two groups of students carried the same MRSA strains. The MRSA-positive samples were from 4 of 7 dental clinics. In addition, 21% of the dental students carried MRSA, which is > 10 times higher than the general public and twice as frequent as in other university students. This is the first study to characterize MRSA from dental clinic surfaces and dental students and suggests that both may be reservoirs for MRSA. Further studies are needed to verify this premise. Copyright © 2011 Association for Professionals in Infection Control and Epidemiology, Inc. Published by Mosby, Inc. All rights reserved.
Wang, Ping; Tong, Jing-jing; Ma, Xiu-hua; Song, Feng-li; Fan, Ling; Guo, Cui-mei; Shi, Wei; Yu, Sang-jie; Yao, Kai-hu; Yang, Yong-hong
2015-01-01
To investigate the serotypes, antibiotic susceptibilities, and multi-locus sequence type (MLST) profiles of Streptococcus agalactiae (S. agalactiae) in Beijing to provide references for the prevention and treatment of S. agalactiae infections. All isolates were identified using the CAMP test and the latex-agglutination assay and serotyped using a Strep-B-Latex kit, after which they were assessed for antibiotic susceptibility, macrolide-resistance genes, and MLST profiles. In total, 56 S. agalactiae isolates were identified in 863 pregnant women (6.5%). Serotypes Ia, Ib, II, III, and V were identified, among which types III (32.1%), Ia (17.9%), Ib (16.1%), and V (14.3%) were the predominant serotypes. All isolates were susceptible to penicillin and ceftriaxone. The nonsusceptibility rates measured for erythromycin, clarithromycin, azithromycin, telithromycin, clindamycin, tetracycline, and levofloxacin were 85.7%, 92.9%, 98.2%, 30.4%, 73.2%, 91%, and 39.3%, respectively. We identified 14 sequence types (STs) for the 56 isolates, among which ST19 (30.4%) was predominant. The rate of fluoroquinolone resistance was higher in serotype III than in the other serotypes. Among the 44 erythromycin-resistant isolates, 32 (72.7%) carried ermB. S. agalactiae isolates of the serotypes Ia, Ib, III, and V are common in Beijing. Among the S. agalactiae isolates, the macrolide and clindamycin resistance rates are extremely high. Most of the erythromycin-resistant isolates carry ermB.
Coagulase-Negative Staphylococci in Human Milk From Mothers of Preterm Compared With Term Neonates.
Soeorg, Hiie; Metsvaht, Tuuli; Eelmäe, Imbi; Metsvaht, Hanna Kadri; Treumuth, Sirli; Merila, Mirjam; Ilmoja, Mari-Liis; Lutsar, Irja
2017-05-01
Human milk is the preferred nutrition for neonates and a source of bacteria. Research aim: The authors aimed to characterize the molecular epidemiology and genetic content of staphylococci in the human milk of mothers of preterm and term neonates. Staphylococci were isolated once per week in the 1st month postpartum from the human milk of mothers of 20 healthy term and 49 preterm neonates hospitalized in the neonatal intensive care unit. Multilocus variable-number tandem-repeats analysis and multilocus sequence typing were used. The presence of the mecA gene, icaA gene of the ica-operon, IS 256, and ACME genetic elements was determined by PCR. The human milk of mothers of preterm compared with term neonates had higher counts of staphylococci but lower species diversity. The human milk of mothers of preterm compared with term neonates more often contained Staphylococcus epidermidis mecA (32.7% vs. 2.6%), icaA (18.8% vs. 6%), IS 256 (7.9% vs. 0.9%), and ACME (15.4% vs. 5.1%), as well as Staphylococcus haemolyticus mecA (90.5% vs. 10%) and IS 256 (61.9% vs. 10%). The overall distribution of multilocus variable-number tandem-repeats analysis (MLVA) types and sequence types was similar between the human milk of mothers of preterm and term neonates, but a few mecA-IS 256-positive MLVA types colonized only mothers of preterm neonates. Maternal hospitalization within 1 month postpartum and the use of an arterial catheter or antibacterial treatment in the neonate increased the odds of harboring mecA-positive staphylococci in human milk. Limiting exposure of mothers of preterm neonates to the hospital could prevent human milk colonization with more pathogenic staphylococci.
USDA-ARS's Scientific Manuscript database
Detection, identification, and classification of yeasts have undergone a major transformation in the last decade and a half following application of gene sequence analyses and genome comparisons. Development of a database (barcode) of easily determined DNA sequences from domains 1 and 2 (D1/D2) of t...
Phylogenetic Placement of Exact Amplicon Sequences Improves Associations with Clinical Information
McDonald, Daniel; Gonzalez, Antonio; Navas-Molina, Jose A.; Jiang, Lingjing; Xu, Zhenjiang Zech; Winker, Kevin; Kado, Deborah M.; Orwoll, Eric; Manary, Mark; Mirarab, Siavash
2018-01-01
ABSTRACT Recent algorithmic advances in amplicon-based microbiome studies enable the inference of exact amplicon sequence fragments. These new methods enable the investigation of sub-operational taxonomic units (sOTU) by removing erroneous sequences. However, short (e.g., 150-nucleotide [nt]) DNA sequence fragments do not contain sufficient phylogenetic signal to reproduce a reasonable tree, introducing a barrier in the utilization of critical phylogenetically aware metrics such as Faith’s PD or UniFrac. Although fragment insertion methods do exist, those methods have not been tested for sOTUs from high-throughput amplicon studies in insertions against a broad reference phylogeny. We benchmarked the SATé-enabled phylogenetic placement (SEPP) technique explicitly against 16S V4 sequence fragments and showed that it outperforms the conceptually problematic but often-used practice of reconstructing de novo phylogenies. In addition, we provide a BSD-licensed QIIME2 plugin (https://github.com/biocore/q2-fragment-insertion) for SEPP and integration into the microbial study management platform QIITA. IMPORTANCE The move from OTU-based to sOTU-based analysis, while providing additional resolution, also introduces computational challenges. We demonstrate that one popular method of dealing with sOTUs (building a de novo tree from the short sequences) can provide incorrect results in human gut metagenomic studies and show that phylogenetic placement of the new sequences with SEPP resolves this problem while also yielding other benefits over existing methods. PMID:29719869
Molecular Phylogeny of the Animal Kingdom.
ERIC Educational Resources Information Center
Field, Katharine G.; And Others
1988-01-01
A rapid sequencing method for ribosomal RNA was applied to the resolution of evolutionary relationships among Metazoa. Describes the four groups (chordates, echinoderms, arthropods, and eucoelomate protostomes) that radiated from the coelomates. (TW)
Clonal Origins of Vibrio cholerae O1 El Tor Strains, Papua New Guinea, 2009–2011
Collins, Deirdre; Jonduo, Marinjho H.; Rosewell, Alexander; Dutta, Samir R.; Dagina, Rosheila; Ropa, Berry; Siba, Peter M.; Greenhill, Andrew R.
2011-01-01
We used multilocus sequence typing and variable number tandem repeat analysis to determine the clonal origins of Vibrio cholerae O1 El Tor strains from an outbreak of cholera that began in 2009 in Papua New Guinea. The epidemic is ongoing, and transmission risk is elevated within the Pacific region. PMID:22099099
Ellis, Crystal N.; Schuster, Brian M.; Striplin, Megan J.; Jones, Stephen H.; Whistler, Cheryl A.
2012-01-01
Risk of gastric infection with Vibrio parahaemolyticus increases with favorable environmental conditions and population shifts that increase prevalence of infective strains. Genetic analysis of New Hampshire strains revealed a unique population with some isolates similar to outbreak-causing strains and high-level diversity that increased as waters warmed. PMID:22407686
Ellis, Crystal N; Schuster, Brian M; Striplin, Megan J; Jones, Stephen H; Whistler, Cheryl A; Cooper, Vaughn S
2012-05-01
Risk of gastric infection with Vibrio parahaemolyticus increases with favorable environmental conditions and population shifts that increase prevalence of infective strains. Genetic analysis of New Hampshire strains revealed a unique population with some isolates similar to outbreak-causing strains and high-level diversity that increased as waters warmed.
USDA-ARS's Scientific Manuscript database
A growing interest in the biological control of locusts and grasshoppers (Acrididae) has led to the development of biopesticides based on naturally occurring pathogens which offers an environmentally safe alternative to chemical pesticides. However, the fungal strains which are being sought for biop...
A critical re-evaluation of multilocus sequence typing (MLST) efforts in Wolbachia.
Bleidorn, Christoph; Gerth, Michael
2018-01-01
Wolbachia (Alphaproteobacteria, Rickettsiales) is the most common, and arguably one of the most important inherited symbionts. Molecular differentiation of Wolbachia strains is routinely performed with a set of five multilocus sequence typing (MLST) markers. However, since its inception in 2006, the performance of MLST in Wolbachia strain typing has not been assessed objectively. Here, we evaluate the properties of Wolbachia MLST markers and compare them to 252 other single copy loci present in the genome of most Wolbachia strains. Specifically, we investigated how well MLST performs at strain differentiation, at reflecting genetic diversity of strains, and as phylogenetic marker. We find that MLST loci are outperformed by other loci at all tasks they are currently employed for, and thus that they do not reflect the properties of a Wolbachia strain very well. We argue that whole genome typing approaches should be used for Wolbachia typing in the future. Alternatively, if few loci approaches are necessary, we provide a characterisation of 252 single copy loci for a number of criteria, which may assist in designing specific typing systems or phylogenetic studies. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Serra, Rita; Peterson, Stephen; Venâncio, Armando
2008-04-01
Despite several studies reporting Penicillium as one of the most frequent fungal genera in cork planks, the isolates were rarely identified to species level. We conducted a detailed study to identify Penicillium species from the field to the factory environment prior to and after boiling the cork planks. A total of 84 samples were analyzed. Of the 486 Penicillium isolates phenotypically identified, 32 representative or unusual strains were selected for identification by multilocus DNA sequence type. Cork proved to be a rich source of Penicillium biodiversity. A total of 30 taxa were recognized from cork including rarely seen species and 6 phylogenetically unique groups. Spores of some species lodged deep in cork can survive the boiling process. P. glabrum, P. glandicola and P. toxicarium, species with high CFU numbers in the field, are still frequently present in cork after boiling. Other species are killed by the boiling treatment and replaced by Penicillium species originating from the factory environment. Species known to contribute to cork taint were isolated at all stages. Good manufacturing practices are necessary at all stages in the preparation of cork planks to minimize the load of Penicillium species that produce cork taint.
Leavitt, Dean H; Starrett, James; Westphal, Michael F; Hedin, Marshal
2015-10-01
We use mitochondrial and multi-locus nuclear DNA sequence data to infer both species boundaries and species relationships within California nemesiid spiders. Higher-level phylogenetic data show that the California radiation is monophyletic and distantly related to European members of the genus Brachythele. As such, we consider all California nemesiid taxa to belong to the genus Calisoga Chamberlin, 1937. Rather than find support for one or two taxa as previously hypothesized, genetic data reveal Calisoga to be a species-rich radiation of spiders, including perhaps dozens of species. This conclusion is supported by multiple mitochondrial barcoding analyses, and also independent analyses of nuclear data that reveal general genealogical congruence. We discovered three instances of sympatry, and genetic data indicate reproductive isolation when in sympatry. An examination of female reproductive morphology does not reveal species-specific characters, and observed male morphological differences for a subset of putative species are subtle. Our coalescent species tree analysis of putative species lays the groundwork for future research on the taxonomy and biogeographic history of this remarkable endemic radiation. Copyright © 2015 Elsevier Inc. All rights reserved.
Tatay-Dualde, Juan; Prats-van der Ham, Miranda; Paterna, Ana; Sánchez, Antonio; Corrales, Juan Carlos; Contreras, Antonio; Tola, Sebastiana; Gómez-Martin, Ángel
2017-01-01
Mycoplasma capricolum subsp. capricolum is one of the causative agents of contagious agalactia (CA). Nevertheless, there is still a lack of information about its antimicrobial susceptibility and genetic characteristics. Therefore, the aim of this work was to study the antimicrobial and genetic variability of different Mycoplasma capricolum subsp. capricolum field isolates. For this purpose, the growth inhibition effect of 18 antimicrobials and a multilocus sequence typing (MLST) scheme based on five housekeeping genes (fusA, glpQ, gyrB, lepA and rpoB) were performed on 32 selected field isolates from Italy and Spain. The results showed a wide range of growth inhibitory effects for almost all the antimicrobials studied. Macrolides presented lower efficacy inhibiting Mcc growth than in previous works performed on other CA-causative mycoplasmas. Erythromycin was not able to inhibit the growth of any of the studied strains, contrary to doxycycline, which inhibited the growth of all of them from low concentrations. On the other hand, the study of the concatenated genes revealed a high genetic variability among the different Mcc isolates. Hence, these genetic variations were greater than the ones reported in prior works on other mycoplasma species. PMID:28346546
Lee, Mellesia F; Cadogan, Paul; Eytle, Sarah; Copeland, Sonia; Walochnik, Julia; Lindo, John F
2017-01-01
Giardia spp. are the causative agents of intestinal infections in a wide variety of mammals including humans and companion animals. Dogs may be reservoirs of zoonotic Giardia spp.; however, the potential for transmission between dogs and humans in Jamaica has not been studied. Conventional PCR was used to screen 285 human and 225 dog stool samples for Giardia targeting the SSU rDNA gene followed by multilocus sequencing of the triosephosphate isomerase (tpi), glutamate dehydrogenase (gdh), and β-giardin (bg) genes. Prevalence of human infections based on PCR was 6.7 % (19/285) and canine infections 19.6 % (44/225). Nested PCR conducted on all 63 positive samples revealed the exclusive presence of assemblage A in both humans and dogs. Sub-assemblage A-II was responsible for 79.0 % (15/19) and 70.5 % (31/44) of the infections in humans and dogs, respectively, while sub-assemblage A-I was identified at a rate of 15.8 % (3/19) and 29.5 % (13/44) in humans and dogs, respectively. The predominance of a single circulating assemblage among both humans and dogs in Jamaica suggests possible zoonotic transmission of Giardia infections.
Genotypic analysis of Mucor from the platypus in Australia.
Connolly, J H; Stodart, B J; Ash, G J
2010-01-01
Mucor amphibiorum is the only pathogen known to cause significant morbidity and mortality in the free-living platypus (Ornithorhynchus anatinus) in Tasmania. Infection has also been reported in free-ranging cane toads (Bufo marinus) and green tree frogs (Litoria caerulea) from mainland Australia but has not been confirmed in platypuses from the mainland. To date, there has been little genotyping specifically conducted on M. amphibiorum. A collection of 21 Mucor isolates representing isolates from the platypus, frogs and toads, and environmental samples were obtained for genotypic analysis. Internal transcribed spacer (ITS) region sequencing and GenBank comparison confirmed the identity of most of the isolates. Representative isolates from infected platypuses formed a clade containing the reference isolates of M. amphibiorum from the Centraal Bureau voor Schimmelcultures repository. The M. amphibiorum isolates showed a close sequence identity with Mucor indicus and consisted of two haplotypes, differentiated by single nucleotide polymorphisms within the ITS1 and ITS2 regions. With the exception of isolate 96-4049, all isolates from platypuses were in one haplotype. Multilocus fingerprinting via the use of intersimple sequence repeats polymerase chain reaction identified 19 genotypes. Two major clusters were evident: 1) M. amphibiorum and Mucor racemosus; and 2) Mucor circinelloides, Mucor ramosissimus, and Mucor fragilis. Seven M. amphibiorum isolates from platypuses were present in two subclusters, with isolate 96-4053 appearing genetically distinct from all other isolates. Isolates classified as M. circinelloides by sequence analysis formed a separate subcluster, distinct from other Mucor spp. The combination of sequencing and multilocus fingerprinting has the potential to provide the tools for rapid identification of M. amphibiorum. 
Data presented on the diversity of the pathogen and further work in linking genetic diversity to functional diversity will provide critical information for its management in Tasmanian river systems.
Bletz, Stefan; Janezic, Sandra; Harmsen, Dag; Rupnik, Maja; Mellmann, Alexander
2018-06-01
Clostridium difficile, recently renamed Clostridioides difficile, is the most common cause of antibiotic-associated nosocomial gastrointestinal infections worldwide. To differentiate endogenous infections and transmission events, highly discriminatory subtyping is necessary. Today, methods based on whole-genome sequencing data are increasingly used to subtype bacterial pathogens; however, frequently a standardized methodology and typing nomenclature are missing. Here we report a core genome multilocus sequence typing (cgMLST) approach developed for C. difficile. Initially, we determined the breadth of the C. difficile population based on all available MLST sequence types with Bayesian inference (BAPS). The resulting BAPS partitions were used in combination with C. difficile clade information to select representative isolates that were subsequently used to define cgMLST target genes. Finally, we evaluated the novel cgMLST scheme with genomes from 3,025 isolates. BAPS grouping (n = 6 groups) together with the clade information led to a total of 11 representative isolates that were included for cgMLST definition and resulted in 2,270 cgMLST genes that were present in all isolates. Overall, 2,184 to 2,268 cgMLST targets were detected in the genome sequences of 70 outbreak-associated and reference strains, and on average 99.3% cgMLST targets (1,116 to 2,270 targets) were present in 2,954 genomes downloaded from the NCBI database, underlining the representativeness of the cgMLST scheme. Moreover, reanalyses of different cluster scenarios with cgMLST were concordant with published single nucleotide variant analyses. In conclusion, the novel cgMLST is representative for the whole C. difficile population, is highly discriminatory in outbreak situations, and provides a unique nomenclature facilitating interlaboratory exchange. Copyright © 2018 American Society for Microbiology.
Core Genome Multilocus Sequence Typing Scheme for High-Resolution Typing of Enterococcus faecium
de Been, Mark; Pinholt, Mette; Top, Janetta; Bletz, Stefan; van Schaik, Willem; Brouwer, Ellen; Rogers, Malbert; Kraat, Yvette; Bonten, Marc; Corander, Jukka; Westh, Henrik; Harmsen, Dag
2015-01-01
Enterococcus faecium, a common inhabitant of the human gut, has emerged in the last 2 decades as an important multidrug-resistant nosocomial pathogen. Since the start of the 21st century, multilocus sequence typing (MLST) has been used to study the molecular epidemiology of E. faecium. However, due to the use of a small number of genes, the resolution of MLST is limited. Whole-genome sequencing (WGS) now allows for high-resolution tracing of outbreaks, but current WGS-based approaches lack standardization, rendering them less suitable for interlaboratory prospective surveillance. To overcome this limitation, we developed a core genome MLST (cgMLST) scheme for E. faecium. cgMLST transfers genome-wide single nucleotide polymorphism (SNP) diversity into a standardized and portable allele numbering system that is far less computationally intensive than SNP-based analysis of WGS data. The E. faecium cgMLST scheme was built using 40 genome sequences that represented the diversity of the species. The scheme consists of 1,423 cgMLST target genes. To test the performance of the scheme, we performed WGS analysis of 103 outbreak isolates from five different hospitals in the Netherlands, Denmark, and Germany. The cgMLST scheme performed well in distinguishing between epidemiologically related and unrelated isolates, even between those that had the same sequence type (ST), which denotes the higher discriminatory power of this cgMLST scheme over that of conventional MLST. We also show that in terms of resolution, the performance of the E. faecium cgMLST scheme is equivalent to that of an SNP-based approach. In conclusion, the cgMLST scheme developed in this study facilitates rapid, standardized, and high-resolution tracing of E. faecium outbreaks. PMID:26400782
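The allele numbering idea described in this abstract can be illustrated with a small sketch. It assumes each isolate is reduced to a profile mapping a locus name to an allele number, and it counts the loci at which two profiles differ. The locus names, allele numbers, and the rule of skipping loci that are missing in either isolate are illustrative assumptions, not details taken from the published scheme.

```python
from typing import Dict, Optional

# A profile maps a locus name to an allele number; None marks a locus
# that was not detected in that genome (hypothetical convention).
Profile = Dict[str, Optional[int]]


def allele_distance(a: Profile, b: Profile) -> int:
    """Count loci with different allele numbers, skipping loci missing in either profile."""
    distance = 0
    for locus, allele_a in a.items():
        allele_b = b.get(locus)
        if allele_a is None or allele_b is None:
            continue  # not comparable, so it does not add to the distance
        if allele_a != allele_b:
            distance += 1
    return distance


# Made-up profiles over four hypothetical loci.
isolate_1: Profile = {"locus_0001": 1, "locus_0002": 4, "locus_0003": 2, "locus_0004": None}
isolate_2: Profile = {"locus_0001": 1, "locus_0002": 5, "locus_0003": 2, "locus_0004": 7}

print(allele_distance(isolate_1, isolate_2))  # 1
```

Comparing integers per locus, rather than aligning whole genomes, is what makes such a comparison cheap and portable between laboratories that share the same allele nomenclature.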
Parker, Jennifer K.; Havird, Justin C.
2012-01-01
Isolates of the plant pathogen Xylella fastidiosa are genetically very similar, but studies on their biological traits have indicated differences in virulence and infection symptomatology. Taxonomic analyses have identified several subspecies, and phylogenetic analyses of housekeeping genes have shown broad host-based genetic differences; however, results are still inconclusive for genetic differentiation of isolates within subspecies. This study employs multilocus sequence analysis of environmentally mediated genes (MLSA-E; genes influenced by environmental factors) to investigate X. fastidiosa relationships and differentiate isolates with low genetic variability. Potential environmentally mediated genes, including host colonization and survival genes related to infection establishment, were identified a priori. The ratio of the rate of nonsynonymous substitutions to the rate of synonymous substitutions (dN/dS) was calculated to select genes that may be under increased positive selection compared to previously studied housekeeping genes. Nine genes were sequenced from 54 X. fastidiosa isolates infecting different host plants across the United States. Results of maximum likelihood (ML) and Bayesian phylogenetic (BP) analyses are in agreement with known X. fastidiosa subspecies clades but show novel within-subspecies differentiation, including geographic differentiation, and provide additional information regarding host-based isolate variation and specificity. dN/dS ratios of environmentally mediated genes, though <1 due to high sequence similarity, are significantly greater than housekeeping gene dN/dS ratios and correlate with increased sequence variability. MLSA-E can more precisely resolve relationships between closely related bacterial strains with low genetic variability, such as X. fastidiosa isolates. Discovering the genetic relationships between X. fastidiosa isolates will provide new insights into the epidemiology of populations of X. fastidiosa, allowing improved disease management in economically important crops. PMID:22194287
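As a rough illustration of the dN/dS ratio mentioned in this abstract, the sketch below computes it from already counted substitutions and sites. The abstract does not say which estimator the authors used; the Jukes-Cantor correction of proportions applied here is only one common choice and is an assumption, as are the function names and the example counts.

```python
import math


def jukes_cantor(p: float) -> float:
    """Convert a proportion of differing sites into a Jukes-Cantor corrected distance."""
    if not 0.0 <= p < 0.75:
        raise ValueError("proportion must be in [0, 0.75) for the correction to be defined")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def dn_ds(nonsyn_diffs: float, nonsyn_sites: float,
          syn_diffs: float, syn_sites: float) -> float:
    """Ratio of nonsynonymous to synonymous substitution rates from counted differences."""
    dn = jukes_cantor(nonsyn_diffs / nonsyn_sites)
    ds = jukes_cantor(syn_diffs / syn_sites)
    if ds == 0.0:
        raise ValueError("dS is zero, so the ratio is undefined")
    return dn / ds


# Made-up counts for one hypothetical gene pair comparison.
ratio = dn_ds(nonsyn_diffs=3, nonsyn_sites=700, syn_diffs=6, syn_sites=300)
print(round(ratio, 3))
```

A ratio below 1, as in this invented example, means synonymous changes accumulate faster per site than nonsynonymous ones, which is the situation the abstract reports for both gene sets.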
Parker, Jennifer K; Havird, Justin C; De La Fuente, Leonardo
2012-03-01
Isolates of the plant pathogen Xylella fastidiosa are genetically very similar, but studies on their biological traits have indicated differences in virulence and infection symptomatology. Taxonomic analyses have identified several subspecies, and phylogenetic analyses of housekeeping genes have shown broad host-based genetic differences; however, results are still inconclusive for genetic differentiation of isolates within subspecies. This study employs multilocus sequence analysis of environmentally mediated genes (MLSA-E; genes influenced by environmental factors) to investigate X. fastidiosa relationships and differentiate isolates with low genetic variability. Potential environmentally mediated genes, including host colonization and survival genes related to infection establishment, were identified a priori. The ratio of the rate of nonsynonymous substitutions to the rate of synonymous substitutions (dN/dS) was calculated to select genes that may be under increased positive selection compared to previously studied housekeeping genes. Nine genes were sequenced from 54 X. fastidiosa isolates infecting different host plants across the United States. Results of maximum likelihood (ML) and Bayesian phylogenetic (BP) analyses are in agreement with known X. fastidiosa subspecies clades but show novel within-subspecies differentiation, including geographic differentiation, and provide additional information regarding host-based isolate variation and specificity. dN/dS ratios of environmentally mediated genes, though <1 due to high sequence similarity, are significantly greater than housekeeping gene dN/dS ratios and correlate with increased sequence variability. MLSA-E can more precisely resolve relationships between closely related bacterial strains with low genetic variability, such as X. fastidiosa isolates. Discovering the genetic relationships between X. fastidiosa isolates will provide new insights into the epidemiology of populations of X. fastidiosa, allowing improved disease management in economically important crops.
Wagner, Isaac D.; Varghese, Litty B.; Hemme, Christopher L.; Wiegel, Juergen
2013-01-01
Thermal environments have island-like characteristics and provide a unique opportunity to study population structure and diversity patterns of microbial taxa inhabiting these sites. Strains having ≥98% 16S rRNA gene sequence similarity to the obligately anaerobic Firmicutes Thermoanaerobacter uzonensis were isolated from seven geothermal springs, separated by up to 1600 m, within the Uzon Caldera (Kamchatka, Russian Far East). The intraspecies variation and spatial patterns of diversity for this taxon were assessed by multilocus sequence analysis (MLSA) of 106 strains. Analysis of eight protein-coding loci (gyrB, lepA, leuS, pyrG, recA, recG, rplB, and rpoB) revealed that all loci were polymorphic and that nucleotide substitutions were mostly synonymous. There were 148 variable nucleotide sites across 8003 bp concatenates of the protein-coding loci. While pairwise FST values indicated a small but significant level of genetic differentiation between most subpopulations, there was a negligible relationship between genetic divergence and spatial separation. Strains with the same allelic profile were only isolated from the same hot spring, occasionally from consecutive years, and single locus variant (SLV) sequence types were usually derived from the same spring. While recombination occurred, there was an “epidemic” population structure in which a particular T. uzonensis sequence type rose in frequency relative to the rest of the population. These results demonstrate spatial diversity patterns for an anaerobic bacterial species in a relatively small geographic location and reinforce the view that terrestrial geothermal springs are excellent places to look for biogeographic diversity patterns regardless of the involved distances. PMID:23801987
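The notion of a single locus variant (SLV) used in this abstract can be sketched as follows: two sequence types are SLVs of each other when their allelic profiles differ at exactly one locus. The eight locus names come from the abstract; the sequence type labels and allele numbers are invented for illustration.

```python
from itertools import combinations
from typing import Dict, List, Tuple

# Locus order for each profile tuple (names taken from the abstract).
LOCI = ("gyrB", "lepA", "leuS", "pyrG", "recA", "recG", "rplB", "rpoB")


def locus_differences(p: Tuple[int, ...], q: Tuple[int, ...]) -> int:
    """Number of loci at which two allelic profiles carry different alleles."""
    return sum(1 for x, y in zip(p, q) if x != y)


def single_locus_variants(profiles: Dict[str, Tuple[int, ...]]) -> List[Tuple[str, str]]:
    """Return all pairs of sequence types that differ at exactly one locus."""
    return [
        (st_a, st_b)
        for (st_a, prof_a), (st_b, prof_b) in combinations(profiles.items(), 2)
        if locus_differences(prof_a, prof_b) == 1
    ]


# Hypothetical sequence types with made-up allele numbers.
profiles = {
    "ST1": (1, 1, 1, 1, 1, 1, 1, 1),
    "ST2": (1, 1, 2, 1, 1, 1, 1, 1),  # differs from ST1 only at leuS
    "ST3": (2, 1, 2, 1, 1, 3, 1, 1),
}

print(single_locus_variants(profiles))  # [('ST1', 'ST2')]
```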
Rademaker, Jan L. W.; Herbet, Hélène; Starrenburg, Marjo J. C.; Naser, Sabri M.; Gevers, Dirk; Kelly, William J.; Hugenholtz, Jeroen; Swings, Jean; van Hylckama Vlieg, Johan E. T.
2007-01-01
The diversity of a collection of 102 lactococcus isolates including 91 Lactococcus lactis isolates of dairy and nondairy origin was explored using partial small subunit rRNA gene sequence analysis and limited phenotypic analyses. A subset of 89 strains of L. lactis subsp. cremoris and L. lactis subsp. lactis isolates was further analyzed by (GTG)5-PCR fingerprinting and a novel multilocus sequence analysis (MLSA) scheme. Two major genomic lineages within L. lactis were found. The L. lactis subsp. cremoris type-strain-like genotype lineage included both L. lactis subsp. cremoris and L. lactis subsp. lactis isolates. The other major lineage, with a L. lactis subsp. lactis type-strain-like genotype, comprised L. lactis subsp. lactis isolates only. A novel third genomic lineage represented two L. lactis subsp. lactis isolates of nondairy origin. The genomic lineages deviate from the subspecific classification of L. lactis that is based on a few phenotypic traits only. MLSA of six partial genes (atpA, encoding ATP synthase alpha subunit; pheS, encoding phenylalanine tRNA synthetase; rpoA, encoding RNA polymerase alpha chain; bcaT, encoding branched chain amino acid aminotransferase; pepN, encoding aminopeptidase N; and pepX, encoding X-prolyl dipeptidyl peptidase) revealed 363 polymorphic sites (total length, 1,970 bases) among 89 L. lactis subsp. cremoris and L. lactis subsp. lactis isolates with unique sequence types for most isolates. This allowed high-resolution cluster analysis in which dairy isolates form subclusters of limited diversity within the genomic lineages. The pheS DNA sequence analysis yielded two genetic groups dissimilar to the other genotyping analysis-based lineages, indicating a disparate acquisition route for this gene. PMID:17890345
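Counting polymorphic sites, as reported in this abstract (363 sites over 1,970 bases), amounts to finding the alignment columns that contain more than one nucleotide. The sketch below assumes equal-length, already aligned sequences and ignores gap and ambiguity characters; the toy sequences are invented, and only the final percentage uses figures from the abstract.

```python
from typing import List, Sequence


def polymorphic_sites(alignment: Sequence[str]) -> List[int]:
    """Return 0-based column indices where the aligned sequences disagree."""
    if len({len(seq) for seq in alignment}) != 1:
        raise ValueError("all sequences must be present and have the same aligned length")
    ignored = {"-", "N"}  # gaps and ambiguous calls do not count as variants
    return [
        index
        for index, column in enumerate(zip(*alignment))
        if len(set(column) - ignored) > 1
    ]


# Toy alignment of three made-up sequences.
toy_alignment = ["ATGCA", "ATGTA", "ACGTA"]
print(polymorphic_sites(toy_alignment))  # [1, 3]

# Share of polymorphic sites using the figures quoted in the abstract.
print(round(363 / 1970 * 100, 1))  # 18.4
```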
Rademaker, Jan L W; Herbet, Hélène; Starrenburg, Marjo J C; Naser, Sabri M; Gevers, Dirk; Kelly, William J; Hugenholtz, Jeroen; Swings, Jean; van Hylckama Vlieg, Johan E T
2007-11-01
The diversity of a collection of 102 lactococcus isolates including 91 Lactococcus lactis isolates of dairy and nondairy origin was explored using partial small subunit rRNA gene sequence analysis and limited phenotypic analyses. A subset of 89 strains of L. lactis subsp. cremoris and L. lactis subsp. lactis isolates was further analyzed by (GTG)5-PCR fingerprinting and a novel multilocus sequence analysis (MLSA) scheme. Two major genomic lineages within L. lactis were found. The L. lactis subsp. cremoris type-strain-like genotype lineage included both L. lactis subsp. cremoris and L. lactis subsp. lactis isolates. The other major lineage, with a L. lactis subsp. lactis type-strain-like genotype, comprised L. lactis subsp. lactis isolates only. A novel third genomic lineage represented two L. lactis subsp. lactis isolates of nondairy origin. The genomic lineages deviate from the subspecific classification of L. lactis that is based on a few phenotypic traits only. MLSA of six partial genes (atpA, encoding ATP synthase alpha subunit; pheS, encoding phenylalanine tRNA synthetase; rpoA, encoding RNA polymerase alpha chain; bcaT, encoding branched chain amino acid aminotransferase; pepN, encoding aminopeptidase N; and pepX, encoding X-prolyl dipeptidyl peptidase) revealed 363 polymorphic sites (total length, 1,970 bases) among 89 L. lactis subsp. cremoris and L. lactis subsp. lactis isolates with unique sequence types for most isolates. This allowed high-resolution cluster analysis in which dairy isolates form subclusters of limited diversity within the genomic lineages. The pheS DNA sequence analysis yielded two genetic groups dissimilar to the other genotyping analysis-based lineages, indicating a disparate acquisition route for this gene.
Core Genome Multilocus Sequence Typing Scheme for High-Resolution Typing of Enterococcus faecium.
de Been, Mark; Pinholt, Mette; Top, Janetta; Bletz, Stefan; Mellmann, Alexander; van Schaik, Willem; Brouwer, Ellen; Rogers, Malbert; Kraat, Yvette; Bonten, Marc; Corander, Jukka; Westh, Henrik; Harmsen, Dag; Willems, Rob J L
2015-12-01
Enterococcus faecium, a common inhabitant of the human gut, has emerged in the last 2 decades as an important multidrug-resistant nosocomial pathogen. Since the start of the 21st century, multilocus sequence typing (MLST) has been used to study the molecular epidemiology of E. faecium. However, due to the use of a small number of genes, the resolution of MLST is limited. Whole-genome sequencing (WGS) now allows for high-resolution tracing of outbreaks, but current WGS-based approaches lack standardization, rendering them less suitable for interlaboratory prospective surveillance. To overcome this limitation, we developed a core genome MLST (cgMLST) scheme for E. faecium. cgMLST transfers genome-wide single nucleotide polymorphism (SNP) diversity into a standardized and portable allele numbering system that is far less computationally intensive than SNP-based analysis of WGS data. The E. faecium cgMLST scheme was built using 40 genome sequences that represented the diversity of the species. The scheme consists of 1,423 cgMLST target genes. To test the performance of the scheme, we performed WGS analysis of 103 outbreak isolates from five different hospitals in the Netherlands, Denmark, and Germany. The cgMLST scheme performed well in distinguishing between epidemiologically related and unrelated isolates, even between those that had the same sequence type (ST), which denotes the higher discriminatory power of this cgMLST scheme over that of conventional MLST. We also show that in terms of resolution, the performance of the E. faecium cgMLST scheme is equivalent to that of an SNP-based approach. In conclusion, the cgMLST scheme developed in this study facilitates rapid, standardized, and high-resolution tracing of E. faecium outbreaks.