Testing the new animal phylogeny: a phylum level molecular analysis of the animal kingdom.
Bourlat, Sarah J; Nielsen, Claus; Economou, Andrew D; Telford, Maximilian J
2008-10-01
The new animal phylogeny inferred from ribosomal genes some years ago has prompted a number of radical rearrangements of the traditional, morphology based metazoan tree. The two main bilaterian clades, Deuterostomia and Protostomia, find strong support, but the protostomes consist of two sister groups, Ecdysozoa and Lophotrochozoa, not seen in morphology based trees. Although widely accepted, not all recent molecular phylogenetic analyses have supported the tripartite structure of the new animal phylogeny. Furthermore, even if the small ribosomal subunit (SSU) based phylogeny is correct, there is a frustrating lack of resolution of relationships between the phyla that make up the three clades of this tree. To address this issue, we have assembled a dataset including a large number of aligned sequence positions as well as a broad sampling of metazoan phyla. Our dataset consists of sequence data from ribosomal and mitochondrial genes combined with new data from protein coding genes (5139 amino acid and 3524 nucleotide positions in total) from 37 representative taxa sampled across the Metazoa. Our data show strong support for the basic structure of the new animal phylogeny as well as for the Mandibulata including Myriapoda. We also provide some resolution within the Lophotrochozoa, where we confirm support for a monophyletic clade of Echiura, Sipuncula and Annelida and surprising evidence of a close relationship between Brachiopoda and Nemertea.
Keremane, Manjunath L.; Lee, Richard F.; Maureira-Butler, Ivan J.; Roose, Mikeal L.
2013-01-01
Background Genus Citrus (Rutaceae) comprises many important cultivated species that generally hybridize easily. Phylogenetic study of a group showing extensive hybridization is challenging. Since the genus Citrus has diverged recently (4–12 Ma), incomplete lineage sorting of ancestral polymorphisms is also likely to cause discrepancies among genes in phylogenetic inferences. Incongruence of gene trees is observed and it is essential to unravel the processes that cause inconsistencies in order to understand the phylogenetic relationships among the species. Methodology and Principal Findings (1) We generated phylogenetic trees using haplotype sequences of six low copy nuclear genes. (2) Published simple sequence repeat data were re-analyzed to study population structure and the results were compared with the phylogenetic trees constructed using sequence data and coalescence simulations. (3) To distinguish between hybridization and incomplete lineage sorting, we developed and utilized a coalescence simulation approach. In other studies, species trees have been inferred despite the possibility of hybridization having occurred and used to generate null distributions of the effect of lineage sorting alone (by coalescent simulation). Since this is problematic, we instead generate these distributions directly from observed gene trees. Of the six trees generated, we used the most resolved three to detect hybrids. We found that 11 of 33 samples appear to be affected by historical hybridization. Analysis of the remaining three genes supported the conclusions from the hybrid detection test. Conclusions We have identified or confirmed probable hybrid origins for several Citrus cultivars using three different approaches–gene phylogenies, population structure analysis and coalescence simulation. Hybridization and incomplete lineage sorting were identified primarily based on differences among gene phylogenies with reference to null expectations via coalescence simulations. We conclude that identifying hybridization as a frequent cause of incongruence among gene trees is critical to correctly infer the phylogeny among species of Citrus. PMID:23874615
Jones, Christopher M; Stres, Blaz; Rosenquist, Magnus; Hallin, Sara
2008-09-01
Denitrification is a facultative respiratory pathway in which nitrite (NO2(-)), nitric oxide (NO), and nitrous oxide (N2O) are successively reduced to nitrogen gas (N(2)), effectively closing the nitrogen cycle. The ability to denitrify is widely dispersed among prokaryotes, and this polyphyletic distribution has raised the possibility of horizontal gene transfer (HGT) having a substantial role in the evolution of denitrification. Comparisons of 16S rRNA and denitrification gene phylogenies in recent studies support this possibility; however, these results remain speculative as they are based on visual comparisons of phylogenies from partial sequences. We reanalyzed publicly available nirS, nirK, norB, and nosZ partial sequences using Bayesian and maximum likelihood phylogenetic inference. Concomitant analysis of denitrification genes with 16S rRNA sequences from the same organisms showed substantial differences between the trees, which were supported by examining the posterior probability of monophyletic constraints at different taxonomic levels. Although these differences suggest HGT of denitrification genes, the presence of structural variants for nirK, norB, and nosZ makes it difficult to determine HGT from other evolutionary events. Additional analysis using phylogenetic networks and likelihood ratio tests of phylogenies based on full-length sequences retrieved from genomes also revealed significant differences in tree topologies among denitrification and 16S rRNA gene phylogenies, with the exception of the nosZ gene phylogeny within the data set of the nirK-harboring genomes. However, inspection of codon usage and G + C content plots from complete genomes gave no evidence for recent HGT. Instead, the close proximity of denitrification gene copies in the genomes of several denitrifying bacteria suggests duplication. Although HGT cannot be ruled out as a factor in the evolution of denitrification genes, our analysis suggests that other phenomena, such gene duplication/divergence and lineage sorting, may have differently influenced the evolution of each denitrification gene.
Kasey K. Pham; Andrew L. Hipp; Paul S. Manos; Richard C. Cronn
2017-01-01
Owing to high rates of introgressive hybridization, the plastid genome is poorly suited to fine-scale DNA barcoding and phylogenetic studies of the oak genus (Quercus, Fagaceae). At the tips of the oak plastome phylogeny, recent gene migration and reticulation generally cause topology to reflect geographic structure, while deeper branches reflect...
Probabilistic modeling of the evolution of gene synteny within reconciled phylogenies
2015-01-01
Background Most models of genome evolution concern either genetic sequences, gene content or gene order. They sometimes integrate two of the three levels, but rarely the three of them. Probabilistic models of gene order evolution usually have to assume constant gene content or adopt a presence/absence coding of gene neighborhoods which is blind to complex events modifying gene content. Results We propose a probabilistic evolutionary model for gene neighborhoods, allowing genes to be inserted, duplicated or lost. It uses reconciled phylogenies, which integrate sequence and gene content evolution. We are then able to optimize parameters such as phylogeny branch lengths, or probabilistic laws depicting the diversity of susceptibility of syntenic regions to rearrangements. We reconstruct a structure for ancestral genomes by optimizing a likelihood, keeping track of all evolutionary events at the level of gene content and gene synteny. Ancestral syntenies are associated with a probability of presence. We implemented the model with the restriction that at most one gene duplication separates two gene speciations in reconciled gene trees. We reconstruct ancestral syntenies on a set of 12 drosophila genomes, and compare the evolutionary rates along the branches and along the sites. We compare with a parsimony method and find a significant number of results not supported by the posterior probability. The model is implemented in the Bio++ library. It thus benefits from and enriches the classical models and methods for molecular evolution. PMID:26452018
Zi-xiang Yang; Xiao-ming Chen; Nathan P. Havill; Ying Feng; Hang Chen
2010-01-01
Rhus gall aphids (Fordinae : Melaphidini) have a disjunct distribution in East Asia and North America and have specific host plant relationships. Some of them are of economic importance and all species form sealed galls which show great variation in shape, size, structure, and galling-site. We present a phylogeny incorporating ten species and four...
Quéméneur, Marianne; Heinrich-Salmeron, Audrey; Muller, Daniel; Lièvremont, Didier; Jauzein, Michel; Bertin, Philippe N; Garrido, Francis; Joulian, Catherine
2008-07-01
A new primer set was designed to specifically amplify ca. 1,100 bp of aoxB genes encoding the As(III) oxidase catalytic subunit from taxonomically diverse aerobic As(III)-oxidizing bacteria. Comparative analysis of AoxB protein sequences showed variable conservation levels and highlighted the conservation of essential amino acids and structural motifs. AoxB phylogeny of pure strains showed well-discriminated taxonomic groups and was similar to 16S rRNA phylogeny. Alphaproteobacteria-, Betaproteobacteria-, and Gammaproteobacteria-related sequences were retrieved from environmental surveys, demonstrating their prevalence in mesophilic As-contaminated soils. Our study underlines the usefulness of the aoxB gene as a functional marker of aerobic As(III) oxidizers.
Pathgroups, a dynamic data structure for genome reconstruction problems.
Zheng, Chunfang
2010-07-01
Ancestral gene order reconstruction problems, including the median problem, quartet construction, small phylogeny, guided genome halving and genome aliquoting, are NP hard. Available heuristics dedicated to each of these problems are computationally costly for even small instances. We present a data structure enabling rapid heuristic solution to all these ancestral genome reconstruction problems. A generic greedy algorithm with look-ahead based on an automatically generated priority system suffices for all the problems using this data structure. The efficiency of the algorithm is due to fast updating of the structure during run time and to the simplicity of the priority scheme. We illustrate with the first rapid algorithm for quartet construction and apply this to a set of yeast genomes to corroborate a recent gene sequence-based phylogeny. http://albuquerque.bioinformatics.uottawa.ca/pathgroup/Quartet.html chunfang313@gmail.com Supplementary data are available at Bioinformatics online.
Pilot, M; Dahlheim, M E; Hoelzel, A R
2010-01-01
In social species, breeding system and gregarious behavior are key factors influencing the evolution of large-scale population genetic structure. The killer whale is a highly social apex predator showing genetic differentiation in sympatry between populations of foraging specialists (ecotypes), and low levels of genetic diversity overall. Our comparative assessments of kinship, parentage and dispersal reveal high levels of kinship within local populations and ongoing male-mediated gene flow among them, including among ecotypes that are maximally divergent within the mtDNA phylogeny. Dispersal from natal populations was rare, implying that gene flow occurs without dispersal, as a result of reproduction during temporary interactions. Discordance between nuclear and mitochondrial phylogenies was consistent with earlier studies suggesting a stochastic basis for the magnitude of mtDNA differentiation between matrilines. Taken together our results show how the killer whale breeding system, coupled with social, dispersal and foraging behaviour, contributes to the evolution of population genetic structure.
Li, Fupeng; Hao, Chaoyun; Yan, Lin; Wu, Baoduo; Qin, Xiaowei; Lai, Jianxiong; Song, Yinghui
2015-09-01
In higher plants, sucrose synthase (Sus, EC 2.4.1.13) is widely considered as a key enzyme involved in sucrose metabolism. Although, several paralogous genes encoding different isozymes of Sus have been identified and characterized in multiple plant genomes, to date detailed information about the Sus genes is lacking for cacao. This study reports the identification of six novel Sus genes from economically important cacao tree. Analyses of the gene structure and phylogeny of the Sus genes demonstrated evolutionary conservation in the Sus family across cacao and other plant species. The expression of cacao Sus genes was investigated via real-time PCR in various tissues, different developmental phases of leaf, flower bud and pod. The Sus genes exhibited distinct but partially redundant expression profiles in cacao, with TcSus1, TcSus5 and TcSus6, being the predominant genes in the bark with phloem, TcSus2 predominantly expressing in the seed during the stereotype stage. TcSus3 and TcSus4 were significantly detected more in the pod husk and seed coat along the pod development, and showed development dependent expression profiles in the cacao pod. These results provide new insights into the evolution, and basic information that will assist in elucidating the functions of cacao Sus gene family.
Quéméneur, Marianne; Heinrich-Salmeron, Audrey; Muller, Daniel; Lièvremont, Didier; Jauzein, Michel; Bertin, Philippe N.; Garrido, Francis; Joulian, Catherine
2008-01-01
A new primer set was designed to specifically amplify ca. 1,100 bp of aoxB genes encoding the As(III) oxidase catalytic subunit from taxonomically diverse aerobic As(III)-oxidizing bacteria. Comparative analysis of AoxB protein sequences showed variable conservation levels and highlighted the conservation of essential amino acids and structural motifs. AoxB phylogeny of pure strains showed well-discriminated taxonomic groups and was similar to 16S rRNA phylogeny. Alphaproteobacteria-, Betaproteobacteria-, and Gammaproteobacteria-related sequences were retrieved from environmental surveys, demonstrating their prevalence in mesophilic As-contaminated soils. Our study underlines the usefulness of the aoxB gene as a functional marker of aerobic As(III) oxidizers. PMID:18502920
Intron-loss evolution of hatching enzyme genes in Teleostei
2010-01-01
Background Hatching enzyme, belonging to the astacin metallo-protease family, digests egg envelope at embryo hatching. Orthologous genes of the enzyme are found in all vertebrate genomes. Recently, we found that exon-intron structures of the genes were conserved among tetrapods, while the genes of teleosts frequently lost their introns. Occurrence of such intron losses in teleostean hatching enzyme genes is an uncommon evolutionary event, as most eukaryotic genes are generally known to be interrupted by introns and the intron insertion sites are conserved from species to species. Here, we report on extensive studies of the exon-intron structures of teleostean hatching enzyme genes for insight into how and why introns were lost during evolution. Results We investigated the evolutionary pathway of intron-losses in hatching enzyme genes of 27 species of Teleostei. Hatching enzyme genes of basal teleosts are of only one type, which conserves the 9-exon-8-intron structure of an assumed ancestor. On the other hand, otocephalans and euteleosts possess two types of hatching enzyme genes, suggesting a gene duplication event in the common ancestor of otocephalans and euteleosts. The duplicated genes were classified into two clades, clades I and II, based on phylogenetic analysis. In otocephalans and euteleosts, clade I genes developed a phylogeny-specific structure, such as an 8-exon-7-intron, 5-exon-4-intron, 4-exon-3-intron or intron-less structure. In contrast to the clade I genes, the structures of clade II genes were relatively stable in their configuration, and were similar to that of the ancestral genes. Expression analyses revealed that hatching enzyme genes were high-expression genes, when compared to that of housekeeping genes. When expression levels were compared between clade I and II genes, clade I genes tends to be expressed more highly than clade II genes. Conclusions Hatching enzyme genes evolved to lose their introns, and the intron-loss events occurred at the specific points of teleostean phylogeny. We propose that the high-expression hatching enzyme genes frequently lost their introns during the evolution of teleosts, while the low-expression genes maintained the exon-intron structure of the ancestral gene. PMID:20796321
Lateral Gene Transfer from the Dead
Szöllősi, Gergely J.; Tannier, Eric; Lartillot, Nicolas; Daubin, Vincent
2013-01-01
In phylogenetic studies, the evolution of molecular sequences is assumed to have taken place along the phylogeny traced by the ancestors of extant species. In the presence of lateral gene transfer, however, this may not be the case, because the species lineage from which a gene was transferred may have gone extinct or not have been sampled. Because it is not feasible to specify or reconstruct the complete phylogeny of all species, we must describe the evolution of genes outside the represented phylogeny by modeling the speciation dynamics that gave rise to the complete phylogeny. We demonstrate that if the number of sampled species is small compared with the total number of existing species, the overwhelming majority of gene transfers involve speciation to and evolution along extinct or unsampled lineages. We show that the evolution of genes along extinct or unsampled lineages can to good approximation be treated as those of independently evolving lineages described by a few global parameters. Using this result, we derive an algorithm to calculate the probability of a gene tree and recover the maximum-likelihood reconciliation given the phylogeny of the sampled species. Examining 473 near-universal gene families from 36 cyanobacteria, we find that nearly a third of transfer events (28%) appear to have topological signatures of evolution along extinct species, but only approximately 6% of transfers trace their ancestry to before the common ancestor of the sampled cyanobacteria. [Gene tree reconciliation; lateral gene transfer; macroevolution; phylogeny.] PMID:23355531
Szöllősi, Gergely J.; Boussau, Bastien; Abby, Sophie S.; Tannier, Eric; Daubin, Vincent
2012-01-01
The timing of the evolution of microbial life has largely remained elusive due to the scarcity of prokaryotic fossil record and the confounding effects of the exchange of genes among possibly distant species. The history of gene transfer events, however, is not a series of individual oddities; it records which lineages were concurrent and thus provides information on the timing of species diversification. Here, we use a probabilistic model of genome evolution that accounts for differences between gene phylogenies and the species tree as series of duplication, transfer, and loss events to reconstruct chronologically ordered species phylogenies. Using simulations we show that we can robustly recover accurate chronologically ordered species phylogenies in the presence of gene tree reconstruction errors and realistic rates of duplication, transfer, and loss. Using genomic data we demonstrate that we can infer rooted species phylogenies using homologous gene families from complete genomes of 10 bacterial and archaeal groups. Focusing on cyanobacteria, distinguished among prokaryotes by a relative abundance of fossils, we infer the maximum likelihood chronologically ordered species phylogeny based on 36 genomes with 8,332 homologous gene families. We find the order of speciation events to be in full agreement with the fossil record and the inferred phylogeny of cyanobacteria to be consistent with the phylogeny recovered from established phylogenomics methods. Our results demonstrate that lateral gene transfers, detected by probabilistic models of genome evolution, can be used as a source of information on the timing of evolution, providing a valuable complement to the limited prokaryotic fossil record. PMID:23043116
Yu, Xianxian; Duan, Xiaoshan; Zhang, Rui; Fu, Xuehao; Ye, Lingling; Kong, Hongzhi; Xu, Guixia; Shan, Hongyan
2016-01-01
AP1/FUL, SEP, AGL6, and FLC subfamily genes play important roles in flower development. The phylogenetic relationships among them, however, have been controversial, which impedes our understanding of the origin and functional divergence of these genes. One possible reason for the controversy may be the problems caused by changes in the exon-intron structure of genes, which, according to recent studies, may generate non-homologous sites and hamper the homology-based sequence alignment. In this study, we first performed exon-by-exon alignments of these and three outgroup subfamilies (SOC1, AG, and STK). Phylogenetic trees reconstructed based on these matrices show improved resolution and better congruence with species phylogeny. In the context of these phylogenies, we traced evolutionary changes of exon-intron structures in each subfamily. We found that structural changes have occurred frequently following gene duplication and speciation events. Notably, exons 7 and 8 (if present) suffered more structural changes than others. With the knowledge of exon-intron structural changes, we generated more reasonable alignments containing all the focal subfamilies. The resulting trees showed that the SEP subfamily is sister to the monophyletic group formed by AP1/FUL and FLC subfamily genes and that the AGL6 subfamily forms a sister group to the three abovementioned subfamilies. Based on this topology, we inferred the evolutionary history of exon-intron structural changes among different subfamilies. Particularly, we found that the eighth exon originated before the divergence of AP1/FUL, FLC, SEP, and AGL6 subfamilies and degenerated in the ancestral FLC-like gene. These results provide new insights into the origin and evolution of the AP1/FUL, FLC, SEP, and AGL6 subfamilies. PMID:27200066
Ritz, C M; Reiker, J; Charles, G; Hoxey, P; Hunt, D; Lowry, M; Stuppy, W; Taylor, N
2012-11-01
The cacti of tribe Tephrocacteae (Cactaceae-Opuntioideae) are adapted to diverse climatic conditions over a wide area of the southern Andes and adjacent lowlands. They exhibit a range of life forms from geophytes and cushion-plants to dwarf shrubs, shrubs or small trees. To confirm or challenge previous morphology-based classifications and molecular phylogenies, we sampled DNA sequences from the chloroplast trnK/matK region and the nuclear low copy gene phyC and compared the resulting phylogenies with previous data gathered from nuclear ribosomal DNA sequences. The here presented chloroplast and nuclear low copy gene phylogenies were mutually congruent and broadly coincident with the classification based on gross morphology and seed micro-morphology and anatomy. Reconstruction of hypothetical ancestral character states suggested that geophytes and cushion-forming species probably evolved several times from dwarf shrubby precursors. We also traced an increase of embryo size at the expense of the nucellus-derived storage tissue during the evolution of the Tephrocacteae, which is thought to be an evolutionary advantage because nutrients are then more rapidly accessible for the germinating embryo. In contrast to these highly concordant phylogenies, nuclear ribosomal DNA data sampled by a previous study yielded conflicting phylogenetic signals. Secondary structure predictions of ribosomal transcribed spacers suggested that this phylogeny is strongly influenced by the inclusion of paralogous sequence probably arisen by genome duplication during the evolution of this plant group. Copyright © 2012 Elsevier Inc. All rights reserved.
Liu, Chaoyang; Wang, Xia; Xu, Yuantao; Deng, Xiuxin; Xu, Qiang
2014-10-01
MYB transcription factor represents one of the largest gene families in plant genomes. Sweet orange (Citrus sinensis) is one of the most important fruit crops worldwide, and recently the genome has been sequenced. This provides an opportunity to investigate the organization and evolutionary characteristics of sweet orange MYB genes from whole genome view. In the present study, we identified 100 R2R3-MYB genes in the sweet orange genome. A comprehensive analysis of this gene family was performed, including the phylogeny, gene structure, chromosomal localization and expression pattern analyses. The 100 genes were divided into 29 subfamilies based on the sequence similarity and phylogeny, and the classification was also well supported by the highly conserved exon/intron structures and motif composition. The phylogenomic comparison of MYB gene family among sweet orange and related plant species, Arabidopsis, cacao and papaya suggested the existence of functional divergence during evolution. Expression profiling indicated that sweet orange R2R3-MYB genes exhibited distinct temporal and spatial expression patterns. Our analysis suggested that the sweet orange MYB genes may play important roles in different plant biological processes, some of which may be potentially involved in citrus fruit quality. These results will be useful for future functional analysis of the MYB gene family in sweet orange.
Plastid genome structure and loss of photosynthetic ability in the parasitic genus Cuscuta.
Revill, Meredith J W; Stanley, Susan; Hibberd, Julian M
2005-09-01
The genus Cuscuta (dodder) is composed of parasitic plants, some species of which appear to be losing the ability to photosynthesize. A molecular phylogeny was constructed using 15 species of Cuscuta in order to assess whether changes in photosynthetic ability and alterations in structure of the plastid genome relate to phylogenetic position within the genus. The molecular phylogeny provides evidence for four major clades within Cuscuta. Although DNA blot analysis showed that Cuscuta species have smaller plastid genomes than tobacco, and that plastome size varied significantly even within one Cuscuta clade, dot blot analysis indicated that the dodders possess homologous sequence to 101 genes from the tobacco plastome. Evidence is provided for significant rates of DNA transfer from plastid to nucleus in Cuscuta. Size and structure of Cuscuta plastid genomes, as well as photosynthetic ability, appear to vary independently of position within the phylogeny, thus supporting the hypothesis that within Cuscuta photosynthetic ability and organization of the plastid genome are changing in an unco-ordinated manner.
Zhang, Yan-Cong; Lin, Kui
2015-01-01
Overlapping genes (OGs) represent one type of widespread genomic feature in bacterial genomes and have been used as rare genomic markers in phylogeny inference of closely related bacterial species. However, the inference may experience a decrease in performance for phylogenomic analysis of too closely or too distantly related genomes. Another drawback of OGs as phylogenetic markers is that they usually take little account of the effects of genomic rearrangement on the similarity estimation, such as intra-chromosome/genome translocations, horizontal gene transfer, and gene losses. To explore such effects on the accuracy of phylogeny reconstruction, we combine phylogenetic signals of OGs with collinear genomic regions, here called locally collinear blocks (LCBs). By putting these together, we refine our previous metric of pairwise similarity between two closely related bacterial genomes. As a case study, we used this new method to reconstruct the phylogenies of 88 Enterobacteriale genomes of the class Gammaproteobacteria. Our results demonstrated that the topological accuracy of the inferred phylogeny was improved when both OGs and LCBs were simultaneously considered, suggesting that combining these two phylogenetic markers may reduce, to some extent, the influence of gene loss on phylogeny inference. Such phylogenomic studies, we believe, will help us to explore a more effective approach to increasing the robustness of phylogeny reconstruction of closely related bacterial organisms. PMID:26715828
RPS8—a New Informative DNA Marker for Phylogeny of Babesia and Theileria Parasites in China
Tian, Zhan-Cheng; Liu, Guang-Yuan; Yin, Hong; Luo, Jian-Xun; Guan, Gui-Quan; Luo, Jin; Xie, Jun-Ren; Shen, Hui; Tian, Mei-Yuan; Zheng, Jin-feng; Yuan, Xiao-song; Wang, Fang-fang
2013-01-01
Piroplasmosis is a serious debilitating and sometimes fatal disease. Phylogenetic relationships within piroplasmida are complex and remain unclear. We compared the intron–exon structure and DNA sequences of the RPS8 gene from Babesia and Theileria spp. isolates in China. Similar to 18S rDNA, the 40S ribosomal protein S8 gene, RPS8, including both coding and non-coding regions is a useful and novel genetic marker for defining species boundaries and for inferring phylogenies because it tends to have little intra-specific variation but considerable inter-specific difference. However, more samples are needed to verify the usefulness of the RPS8 (coding and non-coding regions) gene as a marker for the phylogenetic position and detection of most Babesia and Theileria species, particularly for some closely related species. PMID:24244571
A Molecular Phylogeny of Hemiptera Inferred from Mitochondrial Genome Sequences
Song, Nan; Liang, Ai-Ping; Bu, Cui-Ping
2012-01-01
Classically, Hemiptera is comprised of two suborders: Homoptera and Heteroptera. Homoptera includes Cicadomorpha, Fulgoromorpha and Sternorrhyncha. However, according to previous molecular phylogenetic studies based on 18S rDNA, Fulgoromorpha has a closer relationship to Heteroptera than to other hemipterans, leaving Homoptera as paraphyletic. Therefore, the position of Fulgoromorpha is important for studying phylogenetic structure of Hemiptera. We inferred the evolutionary affiliations of twenty-five superfamilies of Hemiptera using mitochondrial protein-coding genes and rRNAs. We sequenced three mitogenomes, from Pyrops candelaria, Lycorma delicatula and Ricania marginalis, representing two additional families in Fulgoromorpha. Pyrops and Lycorma are representatives of an additional major family Fulgoridae in Fulgoromorpha, whereas Ricania is a second representative of the highly derived clade Ricaniidae. The organization and size of these mitogenomes are similar to those of the sequenced fulgoroid species. Our consensus phylogeny of Hemiptera largely supported the relationships (((Fulgoromorpha,Sternorrhyncha),Cicadomorpha),Heteroptera), and thus supported the classic phylogeny of Hemiptera. Selection of optimal evolutionary models (exclusion and inclusion of two rRNA genes or of third codon positions of protein-coding genes) demonstrated that rapidly evolving and saturated sites should be removed from the analyses. PMID:23144967
Bacterial phylogeny structures soil resistomes across habitats
NASA Astrophysics Data System (ADS)
Forsberg, Kevin J.; Patel, Sanket; Gibson, Molly K.; Lauber, Christian L.; Knight, Rob; Fierer, Noah; Dantas, Gautam
2014-05-01
Ancient and diverse antibiotic resistance genes (ARGs) have previously been identified from soil, including genes identical to those in human pathogens. Despite the apparent overlap between soil and clinical resistomes, factors influencing ARG composition in soil and their movement between genomes and habitats remain largely unknown. General metagenome functions often correlate with the underlying structure of bacterial communities. However, ARGs are proposed to be highly mobile, prompting speculation that resistomes may not correlate with phylogenetic signatures or ecological divisions. To investigate these relationships, we performed functional metagenomic selections for resistance to 18 antibiotics from 18 agricultural and grassland soils. The 2,895 ARGs we discovered were mostly new, and represent all major resistance mechanisms. We demonstrate that distinct soil types harbour distinct resistomes, and that the addition of nitrogen fertilizer strongly influenced soil ARG content. Resistome composition also correlated with microbial phylogenetic and taxonomic structure, both across and within soil types. Consistent with this strong correlation, mobility elements (genes responsible for horizontal gene transfer between bacteria such as transposases and integrases) syntenic with ARGs were rare in soil by comparison with sequenced pathogens, suggesting that ARGs may not transfer between soil bacteria as readily as is observed between human pathogens. Together, our results indicate that bacterial community composition is the primary determinant of soil ARG content, challenging previous hypotheses that horizontal gene transfer effectively decouples resistomes from phylogeny.
Li, S.-F.; Xu, J.-W.; Yang, Q.-L.; Wang, C.H.; Chen, Q.; Chapman, D.C.; Lu, G.
2009-01-01
Based upon morphological characters, Silver carp Hypophthalmichthys molitrix and bighead carp Hypophthalmichthys nobilis (or Aristichthys nobilis) have been classified into either the same genus or two distinct genera. Consequently, the taxonomic relationship of the two species at the generic level remains equivocal. This issue is addressed by sequencing complete mitochondrial genomes of H. molitrix and H. nobilis, comparing their mitogenome organization, structure and sequence similarity, and conducting a comprehensive phylogenetic analysis of cyprinid species. As with other cyprinid fishes, the mitogenomes of the two species were structurally conserved, containing 37 genes including 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA (tRNAs) genes and a putative control region (D-loop). Sequence similarity between the two mitogenomes varied in different genes or regions, being highest in the tRNA genes (98??8%), lowest in the control region (89??4%) and intermediate in the protein-coding genes (94??2%). Analyses of the sequence comparison and phylogeny using concatenated protein sequences support the view that the two species belong to the genus Hypophthalmichthys. Further studies using nuclear markers and involving more closely related species, and the systematic combination of traditional biology and molecular biology are needed in order to confirm this conclusion. ?? 2009 The Fisheries Society of the British Isles.
Agatha, Sabine; Strüder-Kypke, Michaela C.
2010-01-01
The phylogeny within the order Choreotrichida is reconstructed using (i) morphologic, ontogenetic, and ultrastructural evidence for the cladistic approach and (ii) the small subunit ribosomal RNA (SSrRNA) gene sequences, including the new sequence of Rimostrombidium lacustris. The morphologic cladograms and the gene trees converge rather well for the Choreotrichida, demonstrating that hyaline and agglutinated loricae do not characterize distinct lineages, i.e., both lorica types can be associated with the most highly developed ciliary pattern. The position of Rimostrombidium lacustris within the family Strobilidiidae is corroborated by the genealogical analyses. The diagnosis of the genus Tintinnidium is improved, adding cytological features, and the genus is divided into two subgenera based on the structure of the somatic kineties. The diagnosis of the family Lohmanniellidae and the genus Lohmanniella are improved, and Rimostrombidium glacicolum Petz, Song and Wilbert, 1995 is affiliated. PMID:17166704
Roettger, Mayo; Martin, William; Dagan, Tal
2009-09-01
Among the methods currently used in phylogenomic practice to detect the presence of lateral gene transfer (LGT), one of the most frequently employed is the comparison of gene tree topologies for different genes. In cases where the phylogenies for different genes are incompatible, or discordant, for well-supported branches there are three simple interpretations for the result: 1) gene duplications (paralogy) followed by many independent gene losses have occurred, 2) LGT has occurred, or 3) the phylogeny is well supported but for reasons unknown is nonetheless incorrect. Here, we focus on the third possibility by examining the properties of 22,437 published multiple sequence alignments, the Bayesian maximum likelihood trees for which either do or do not suggest the occurrence of LGT by the criterion of discordant branches. The alignments that produce discordant phylogenies differ significantly in several salient alignment properties from those that do not. Using a support vector machine, we were able to predict the inference of discordant tree topologies with up to 80% accuracy from alignment properties alone.
A six-gene phylogeny provides new insights into choanoflagellate evolution.
Carr, Martin; Richter, Daniel J; Fozouni, Parinaz; Smith, Timothy J; Jeuck, Alexandra; Leadbeater, Barry S C; Nitsche, Frank
2017-02-01
Recent studies have shown that molecular phylogenies of the choanoflagellates (Class Choanoflagellatea) are in disagreement with their traditional taxonomy, based on morphology, and that Choanoflagellatea requires considerable taxonomic revision. Furthermore, phylogenies suggest that the morphological and ecological evolution of the group is more complex than has previously been recognized. Here we address the taxonomy of the major choanoflagellate order Craspedida, by erecting four new genera. The new genera are shown to be morphologically, ecologically and phylogenetically distinct from other choanoflagellate taxa. Furthermore, we name five novel craspedid species, as well as formally describe ten species that have been shown to be either misidentified or require taxonomic revision. Our revised phylogeny, including 18 new species and sequence data for two additional genes, provides insights into the morphological and ecological evolution of the choanoflagellates. We examine the distribution within choanoflagellates of these two additional genes, EF-1A and EFL, closely related translation GTPases which are required for protein synthesis. Mapping the presence and absence of these genes onto the phylogeny highlights multiple events of gene loss within the choanoflagellates. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Smith, Adam R.; Proffitt, Melissa R.; Ho, Winnie W.; Mullaney, Claire B.; Maldonado-Ocampo, Javier A.; Lovejoy, Nathan R.; Alves-Gomes, José A.; Smith, G. Troy
2018-01-01
The electric communication signals of weakly electric ghost knifefishes (Gymnotiformes: Apteronotidae) provide a valuable model system for understanding the evolution and physiology of behavior. Apteronotids produce continuous wave-type electric organ discharges (EODs) that are used for electrolocation and communication. The frequency and waveform of EODs, as well as the structure of transient EOD modulations (chirps), vary substantially across species. Understanding how these signals have evolved, however, has been hampered by the lack of a well-supported phylogeny for this family. We constructed a molecular phylogeny for the Apteronotidae by using sequence data from three genes (cytochrome c oxidase subunit 1, recombination activating gene 2, and cytochrome oxidase B) in 32 species representing 13 apteronotid genera. This phylogeny and an extensive database of apteronotid signals allowed us to examine signal evolution by using ancestral state reconstruction (ASR) and phylogenetic generalized least squares (PGLS) models. Our molecular phylogeny largely agrees with another recent sequence-based phylogeny and identified five robust apteronotid clades: (i) Sternarchorhamphus + Orthosternarchus, (ii) Adontosternarchus, (iii) Apteronotus + Parapteronotus, (iv) Sternarchorhynchus, and (v) a large clade including Porotergus, ‘Apteronotus’, Compsaraia, Sternarchogiton, Sternarchella, and Magosternarchus. We analyzed novel chirp recordings from two apteronotid species (Orthosternarchus tamandua and Sternarchorhynchus mormyrus), and combined data from these species with that from previously recorded species in our phylogenetic analyses. Some signal parameters in O. tamandua were plesiomorphic (e.g., low frequency EODs and chirps with little frequency modulation that nevertheless interrupt the EOD), suggesting that ultra-high frequency EODs and ‘‘big” chirps evolved after apteronotids diverged from other gymnotiforms. In contrast to previous studies, our PGLS analyses using the new phylogeny indicated the presence of phylogenetic signals in the relationships between some EOD and chirp parameters. The ASR demonstrated that most EOD and chirp parameters are evolutionarily labile and have often diversified even among closely related species. PMID:27769924
Smith, Adam R; Proffitt, Melissa R; Ho, Winnie W; Mullaney, Claire B; Maldonado-Ocampo, Javier A; Lovejoy, Nathan R; Alves-Gomes, José A; Smith, G Troy
2016-10-01
The electric communication signals of weakly electric ghost knifefishes (Gymnotiformes: Apteronotidae) provide a valuable model system for understanding the evolution and physiology of behavior. Apteronotids produce continuous wave-type electric organ discharges (EODs) that are used for electrolocation and communication. The frequency and waveform of EODs, as well as the structure of transient EOD modulations (chirps), vary substantially across species. Understanding how these signals have evolved, however, has been hampered by the lack of a well-supported phylogeny for this family. We constructed a molecular phylogeny for the Apteronotidae by using sequence data from three genes (cytochrome c oxidase subunit 1, recombination activating gene 2, and cytochrome oxidase B) in 32 species representing 13 apteronotid genera. This phylogeny and an extensive database of apteronotid signals allowed us to examine signal evolution by using ancestral state reconstruction (ASR) and phylogenetic generalized least squares (PGLS) models. Our molecular phylogeny largely agrees with another recent sequence-based phylogeny and identified five robust apteronotid clades: (i) Sternarchorhamphus+Orthosternarchus, (ii) Adontosternarchus, (iii) Apteronotus+Parapteronotus, (iv) Sternarchorhynchus, and (v) a large clade including Porotergus, 'Apteronotus', Compsaraia, Sternarchogiton, Sternarchella, and Magosternarchus. We analyzed novel chirp recordings from two apteronotid species (Orthosternarchus tamandua and Sternarchorhynchus mormyrus), and combined data from these species with that from previously recorded species in our phylogenetic analyses. Some signal parameters in O. tamandua were plesiomorphic (e.g., low frequency EODs and chirps with little frequency modulation that nevertheless interrupt the EOD), suggesting that ultra-high frequency EODs and "big" chirps evolved after apteronotids diverged from other gymnotiforms. In contrast to previous studies, our PGLS analyses using the new phylogeny indicated the presence of phylogenetic signals in the relationships between some EOD and chirp parameters. The ASR demonstrated that most EOD and chirp parameters are evolutionarily labile and have often diversified even among closely related species. Published by Elsevier Ltd.
Principles of cophylogenetic maps
NASA Astrophysics Data System (ADS)
Charleston, Michael A.
Cophylogeny is the study of the relationships between phylogenies of ecologically related groups (taxa, geographical areas, genes etc.), where one, the "host" phylogeny, is independent and the other, the "associate" phylogeny, is hypothesized to be dependent to some degree on the host. Given two such phylogenies our aim is to estimate the past associations between the host and associate taxa. This chapter describes cophylogeny and discusses some of its basic pri nciples. The necessary properties of any cophylogenetic method are described. Charleston [5] created a graph which contains all the potential solutions to a given cophylogenetic problem. The vertices of this graph are associations, either observed or hypothetical, between "host" and associated taxonomic units, and the arcs correspond to the associate phylogeny. A new and more general method of constructing the Jungle is presented, which will correctly account for reticulate host and/or parasite phylogenies. Keywords: cophylogeny, coevolution, gene tree/species tree, host/parasite coevolution, host switch, horizontal transfer, biogeography.
Huang, Jie; Chen, Zigui; Song, Weibo; Berger, Helmut
2014-01-01
Classifications of the Urostyloidea were mainly based on morphology and morphogenesis. Since molecular phylogeny largely focused on limited sampling using mostly the one-gene information, the incongruence between morphological data and gene sequences have risen. In this work, the three-gene data (SSU-rDNA, ITS1-5.8S-ITS2 and LSU-rDNA) comprising 12 genera in the “core urostyloids” are sequenced, and the phylogenies based on these different markers are compared using maximum-likelihood and Bayesian algorithms and tested by unconstrained and constrained analyses. The molecular phylogeny supports the following conclusions: (1) the monophyly of the core group of Urostyloidea is well supported while the whole Urostyloidea is not monophyletic; (2) Thigmokeronopsis and Apokeronopsis are clearly separated from the pseudokeronopsids in analyses of all three gene markers, supporting their exclusion from the Pseudokeronopsidae and the inclusion in the Urostylidae; (3) Diaxonella and Apobakuella should be assigned to the Urostylidae; (4) Bergeriella, Monocoronella and Neourostylopsis flavicana share a most recent common ancestor; (5) all molecular trees support the transfer of Metaurostylopsis flavicana to the recently proposed genus Neourostylopsis; (6) all molecular phylogenies fail to separate the morphologically well-defined genera Uroleptopsis and Pseudokeronopsis; and (7) Arcuseries gen. nov. containing three distinctly deviating Anteholosticha species is established. PMID:24140978
A novel molecular marker for the study of Neotropical cichlid phylogeny.
Fabrin, T M C; Gasques, L S; Prioli, S M A P; Prioli, A J
2015-12-22
The use of molecular markers has contributed to phylogeny and to the reconstruction of species' evolutionary history. Each region of the genome has different evolution rates, which may or may not identify phylogenetic signal at different levels. Therefore, it is important to assess new molecular markers that can be used for phylogenetic reconstruction. Regions that may be associated with species characteristics and are subject to selective pressure, such as opsin genes, which encode proteins related to the visual system and are widely expressed by Cichlidae family members, are interesting. Our aim was to identify a new nuclear molecular marker that could establish the phylogeny of Neotropical cichlids and is potentially correlated with the visual system. We used Bayesian inference and maximum likelihood analysis to support the use of the nuclear opsin LWS gene in the phylogeny of eight Neotropical cichlid species. Their use concatenated to the mitochondrial gene COI was also tested. The LWS gene fragment comprised the exon 2-4 region, including the introns. The LWS gene provided good support for both analyses up to the genus level, distinguishing the studied species, and when concatenated to the COI gene, there was a good support up to the species level. Another benefit of utilizing this region, is that some polymorphisms are associated with changes in spectral properties of the LWS opsin protein, which constitutes the visual pigment that absorbs red light. Thus, utilization of this gene as a molecular marker to study the phylogeny of Neotropical cichlids is promising.
Saand, Mumtaz Ali; Xu, You-Ping; Munyampundu, Jean-Pierre; Li, Wen; Zhang, Xuan-Rui; Cai, Xin-Zhong
2015-01-01
Cyclic nucleotide-gated ion channels (CNGCs) are calcium-permeable channels that are involved in various biological functions. Nevertheless, phylogeny and function of plant CNGCs are not well understood. In this study, 333 CNGC genes from 15 plant species were identified using comprehensive bioinformatics approaches. Extensive bioinformatics analyses demonstrated that CNGCs of Group IVa were distinct to those of other groups in gene structure and amino acid sequence of cyclic nucleotide-binding domain. A CNGC-specific motif that recognizes all identified plant CNGCs was generated. Phylogenetic analysis indicated that CNGC proteins of flowering plant species formed five groups. However, CNGCs of the non-vascular plant Physcomitrella patens clustered only in two groups (IVa and IVb), while those of the vascular non-flowering plant Selaginella moellendorffii gathered in four (IVa, IVb, I and II). These data suggest that Group IV CNGCs are most ancient and Group III CNGCs are most recently evolved in flowering plants. Furthermore, silencing analyses revealed that a set of CNGC genes might be involved in disease resistance and abiotic stress responses in tomato and function of SlCNGCs does not correlate with the group that they are belonging to. Our results indicate that Group IVa CNGCs are structurally but not functionally unique among plant CNGCs. PMID:26546226
Kang, Hahk-Soo
2017-02-01
Genomics-based methods are now commonplace in natural products research. A phylogeny-guided mining approach provides a means to quickly screen a large number of microbial genomes or metagenomes in search of new biosynthetic gene clusters of interest. In this approach, biosynthetic genes serve as molecular markers, and phylogenetic trees built with known and unknown marker gene sequences are used to quickly prioritize biosynthetic gene clusters for their metabolites characterization. An increase in the use of this approach has been observed for the last couple of years along with the emergence of low cost sequencing technologies. The aim of this review is to discuss the basic concept of a phylogeny-guided mining approach, and also to provide examples in which this approach was successfully applied to discover new natural products from microbial genomes and metagenomes. I believe that the phylogeny-guided mining approach will continue to play an important role in genomics-based natural products research.
Hughes, Lily C; Ortí, Guillermo; Huang, Yu; Sun, Ying; Baldwin, Carole C; Thompson, Andrew W; Arcila, Dahiana; Betancur-R, Ricardo; Li, Chenhong; Becker, Leandro; Bellora, Nicolás; Zhao, Xiaomeng; Li, Xiaofeng; Wang, Min; Fang, Chao; Xie, Bing; Zhou, Zhuocheng; Huang, Hai; Chen, Songlin; Venkatesh, Byrappa; Shi, Qiong
2018-05-14
Our understanding of phylogenetic relationships among bony fishes has been transformed by analysis of a small number of genes, but uncertainty remains around critical nodes. Genome-scale inferences so far have sampled a limited number of taxa and genes. Here we leveraged 144 genomes and 159 transcriptomes to investigate fish evolution with an unparalleled scale of data: >0.5 Mb from 1,105 orthologous exon sequences from 303 species, representing 66 out of 72 ray-finned fish orders. We apply phylogenetic tests designed to trace the effect of whole-genome duplication events on gene trees and find paralogy-free loci using a bioinformatics approach. Genome-wide data support the structure of the fish phylogeny, and hypothesis-testing procedures appropriate for phylogenomic datasets using explicit gene genealogy interrogation settle some long-standing uncertainties, such as the branching order at the base of the teleosts and among early euteleosts, and the sister lineage to the acanthomorph and percomorph radiations. Comprehensive fossil calibrations date the origin of all major fish lineages before the end of the Cretaceous.
Mallatt, Jon; Craig, Catherine Waggoner; Yoder, Matthew J
2010-04-01
This study (1) uses nearly complete rRNA-gene sequences from across Metazoa (197 taxa) to reconstruct animal phylogeny; (2) presents a highly annotated, manual alignment of these sequences with special reference to rRNA features including paired sites (http://purl.oclc.org/NET/rRNA/Metazoan_alignment) and (3) tests, after eliminating as few disruptive, rogue sequences as possible, if a likelihood framework can recover the main metazoan clades. We found that systematic elimination of approximately 6% of the sequences, including the divergent or unstably placed sequences of cephalopods, arrowworm, symphylan and pauropod myriapods, and of myzostomid and nemertodermatid worms, led to a tree that supported Ecdysozoa, Lophotrochozoa, Protostomia, and Bilateria. Deuterostomia, however, was never recovered, because the rRNA of urochordates goes (nonsignificantly) near the base of the Bilateria. Counterintuitively, when we modeled the evolution of the paired sites, phylogenetic resolution was not increased over traditional tree-building models that assume all sites in rRNA evolve independently. The rRNA genes of non-bilaterians contain a higher % AT than do those of most bilaterians. The rRNA genes of Acoela and Myzostomida were found to be secondarily shortened, AT-enriched, and highly modified, throwing some doubt on the location of these worms at the base of Bilateria in the rRNA tree--especially myzostomids, which other evidence suggests are annelids instead. Other findings are marsupial-with-placental mammals, arrowworms in Ecdysozoa (well supported here but contradicted by morphology), and Placozoa as sister to Cnidaria. Finally, despite the difficulties, the rRNA-gene trees are in strong concordance with trees derived from multiple protein-coding genes in supporting the new animal phylogeny. (c) 2009 Elsevier Inc. All rights reserved.
Swain, Timothy D
2018-01-01
The recent rapid proliferation of novel taxon identification in the Zoanthidea has been accompanied by a parallel propagation of gene trees as a tool of species discovery, but not a corresponding increase in our understanding of phylogeny. This disparity is caused by the trade-off between the capabilities of automated DNA sequence alignment and data content of genes applied to phylogenetic inference in this group. Conserved genes or segments are easily aligned across the order, but produce poorly resolved trees; hypervariable genes or segments contain the evolutionary signal necessary for resolution and robust support, but sequence alignment is daunting. Staggered alignments are a form of phylogeny-informed sequence alignment composed of a mosaic of local and universal regions that allow phylogenetic inference to be applied to all nucleotides from both hypervariable and conserved gene segments. Comparisons between species tree phylogenies inferred from all data (staggered alignment) and hypervariable-excluded data (standard alignment) demonstrate improved confidence and greater topological agreement with other sources of data for the complete-data tree. This novel phylogeny is the most comprehensive to date (in terms of taxa and data) and can serve as an expandable tool for evolutionary hypothesis testing in the Zoanthidea. Spanish language abstract available in Text S1. Translation by L. O. Swain, DePaul University, Chicago, Illinois, 60604, USA. Copyright © 2017 Elsevier Inc. All rights reserved.
Huang, Jie; Chen, Zigui; Song, Weibo; Berger, Helmut
2014-01-01
Classifications of the Urostyloidea were mainly based on morphology and morphogenesis. Since molecular phylogeny largely focused on limited sampling using mostly the one-gene information, the incongruence between morphological data and gene sequences have risen. In this work, the three-gene data (SSU-rDNA, ITS1-5.8S-ITS2 and LSU-rDNA) comprising 12 genera in the "core urostyloids" are sequenced, and the phylogenies based on these different markers are compared using maximum-likelihood and Bayesian algorithms and tested by unconstrained and constrained analyses. The molecular phylogeny supports the following conclusions: (1) the monophyly of the core group of Urostyloidea is well supported while the whole Urostyloidea is not monophyletic; (2) Thigmokeronopsis and Apokeronopsis are clearly separated from the pseudokeronopsids in analyses of all three gene markers, supporting their exclusion from the Pseudokeronopsidae and the inclusion in the Urostylidae; (3) Diaxonella and Apobakuella should be assigned to the Urostylidae; (4) Bergeriella, Monocoronella and Neourostylopsis flavicana share a most recent common ancestor; (5) all molecular trees support the transfer of Metaurostylopsis flavicana to the recently proposed genus Neourostylopsis; (6) all molecular phylogenies fail to separate the morphologically well-defined genera Uroleptopsis and Pseudokeronopsis; and (7) Arcuseries gen. nov. containing three distinctly deviating Anteholosticha species is established. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
Stadler, Tanja; Degnan, James H.; Rosenberg, Noah A.
2016-01-01
Classic null models for speciation and extinction give rise to phylogenies that differ in distribution from empirical phylogenies. In particular, empirical phylogenies are less balanced and have branching times closer to the root compared to phylogenies predicted by common null models. This difference might be due to null models of the speciation and extinction process being too simplistic, or due to the empirical datasets not being representative of random phylogenies. A third possibility arises because phylogenetic reconstruction methods often infer gene trees rather than species trees, producing an incongruity between models that predict species tree patterns and empirical analyses that consider gene trees. We investigate the extent to which the difference between gene trees and species trees under a combined birth–death and multispecies coalescent model can explain the difference in empirical trees and birth–death species trees. We simulate gene trees embedded in simulated species trees and investigate their difference with respect to tree balance and branching times. We observe that the gene trees are less balanced and typically have branching times closer to the root than the species trees. Empirical trees from TreeBase are also less balanced than our simulated species trees, and model gene trees can explain an imbalance increase of up to 8% compared to species trees. However, we see a much larger imbalance increase in empirical trees, about 100%, meaning that additional features must also be causing imbalance in empirical trees. This simulation study highlights the necessity of revisiting the assumptions made in phylogenetic analyses, as these assumptions, such as equating the gene tree with the species tree, might lead to a biased conclusion. PMID:26968785
Yong, Hoi-Sen; Song, Sze-Looi; Lim, Phaik-Eem; Chan, Kok-Gan; Chow, Wan-Loo; Eamsobhana, Praphathip
2015-01-01
The whole mitochondrial genome of the pest fruit fly Bactrocera arecae was obtained from next-generation sequencing of genomic DNA. It had a total length of 15,900 bp, consisting of 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a non-coding region (A + T-rich control region). The control region (952 bp) was flanked by rrnS and trnI genes. The start codons included 6 ATG, 3 ATT and 1 each of ATA, ATC, GTG and TCG. Eight TAA, two TAG, one incomplete TA and two incomplete T stop codons were represented in the protein-coding genes. The cloverleaf structure for trnS1 lacked the D-loop, and that of trnN and trnF lacked the TΨC-loop. Molecular phylogeny based on 13 protein-coding genes was concordant with 37 mitochondrial genes, with B. arecae having closest genetic affinity to B. tryoni. The subgenus Bactrocera of Dacini tribe and the Dacinae subfamily (Dacini and Ceratitidini tribes) were monophyletic. The whole mitogenome of B. arecae will serve as a useful dataset for studying the genetics, systematics and phylogenetic relationships of the many species of Bactrocera genus in particular, and tephritid fruit flies in general. PMID:26472633
A fungal phylogeny based on 42 complete genomes derived from supertree and combined gene analysis
Fitzpatrick, David A; Logue, Mary E; Stajich, Jason E; Butler, Geraldine
2006-01-01
Background To date, most fungal phylogenies have been derived from single gene comparisons, or from concatenated alignments of a small number of genes. The increase in fungal genome sequencing presents an opportunity to reconstruct evolutionary events using entire genomes. As a tool for future comparative, phylogenomic and phylogenetic studies, we used both supertrees and concatenated alignments to infer relationships between 42 species of fungi for which complete genome sequences are available. Results A dataset of 345,829 genes was extracted from 42 publicly available fungal genomes. Supertree methods were employed to derive phylogenies from 4,805 single gene families. We found that the average consensus supertree method may suffer from long-branch attraction artifacts, while matrix representation with parsimony (MRP) appears to be immune from these. A genome phylogeny was also reconstructed from a concatenated alignment of 153 universally distributed orthologs. Our MRP supertree and concatenated phylogeny are highly congruent. Within the Ascomycota, the sub-phyla Pezizomycotina and Saccharomycotina were resolved. Both phylogenies infer that the Leotiomycetes are the closest sister group to the Sordariomycetes. There is some ambiguity regarding the placement of Stagonospora nodurum, the sole member of the class Dothideomycetes present in the dataset. Within the Saccharomycotina, a monophyletic clade containing organisms that translate CTG as serine instead of leucine is evident. There is also strong support for two groups within the CTG clade, one containing the fully sexual species Candida lusitaniae, Candida guilliermondii and Debaryomyces hansenii, and the second group containing Candida albicans, Candida dubliniensis, Candida tropicalis, Candida parapsilosis and Lodderomyces elongisporus. The second major clade within the Saccharomycotina contains species whose genomes have undergone a whole genome duplication (WGD), and their close relatives. We could not confidently resolve whether Candida glabrata or Saccharomyces castellii lies at the base of the WGD clade. Conclusion We have constructed robust phylogenies for fungi based on whole genome analysis. Overall, our phylogenies provide strong support for the classification of phyla, sub-phyla, classes and orders. We have resolved the relationship of the classes Leotiomyctes and Sordariomycetes, and have identified two classes within the CTG clade of the Saccharomycotina that may correlate with sexual status. PMID:17121679
Xiao, P; Niu, L L; Zhao, Q J; Chen, X Y; Wang, L J; Li, L; Zhang, H P; Guo, J Z; Xu, H Y; Zhong, T
2017-11-16
The origins and phylogeny of different sheep breeds has been widely studied using polymorphisms within the mitochondrial hypervariable region. However, little is known about the mitochondrial DNA (mtDNA) content and phylogeny based on mtDNA protein-coding genes. In this study, we assessed the phylogeny and copy number of the mtDNA in eight indigenous (population size, n=184) and three introduced (n=66) sheep breeds in China based on five mitochondrial coding genes (COX1, COX2, ATP8, ATP6 and COX3). The mean haplotype and nucleotide diversities were 0.944 and 0.00322, respectively. We identified a correlation between the lineages distribution and the genetic distance, whereby Valley-type Tibetan sheep had a closer genetic relationship with introduced breeds (Dorper, Poll Dorset and Suffolk) than with other indigenous breeds. Similarly, the Median-joining profile of haplotypes revealed the distribution of clusters according to genetic differences. Moreover, copy number analysis based on the five mitochondrial coding genes was affected by the genetic distance combining with genetic phylogeny; we also identified obvious non-synonymous mutations in ATP6 between the different levels of copy number expressions. These results imply that differences in mitogenomic compositions resulting from geographical separation lead to differences in mitochondrial function.
Mondal, Sunil Kanti; Kundu, Sudip; Das, Rabindranath; Roy, Sujit
2016-08-01
Bacteria and archaea have evolved with the ability to fix atmospheric dinitrogen in the form of ammonia, catalyzed by the nitrogenase enzyme complex which comprises three structural genes nifK, nifD and nifH. The nifK and nifD encodes for the beta and alpha subunits, respectively, of component 1, while nifH encodes for component 2 of nitrogenase. Phylogeny based on nifDHK have indicated that Cyanobacteria is closer to Proteobacteria alpha and gamma but not supported by the tree based on 16SrRNA. The evolutionary ancestor for the different trees was also different. The GC1 and GC2% analysis showed more consistency than GC3% which appeared to below for Firmicutes, Cyanobacteria and Euarchaeota while highest in Proteobacteria beta and clearly showed the proportional effect on the codon usage with a few exceptions. Few genes from Firmicutes, Euryarchaeota, Proteobacteria alpha and delta were found under mutational pressure. These nif genes with low and high GC3% from different classes of organisms showed similar expected number of codons. Distribution of the genes and codons, based on codon usage demonstrated opposite pattern for different orientation of mirror plane when compared with each other. Overall our results provide a comprehensive analysis on the evolutionary relationship of the three structural nif genes, nifK, nifD and nifH, respectively, in the context of codon usage bias, GC content relationship and amino acid composition of the encoded proteins and exploration of crucial statistical method for the analysis of positive data with non-constant variance to identify the shape factors of codon adaptation index.
Phylogenomic Reconstruction of the Oomycete Phylogeny Derived from 37 Genomes
McCarthy, Charley G. P.
2017-01-01
ABSTRACT The oomycetes are a class of microscopic, filamentous eukaryotes within the Stramenopiles-Alveolata-Rhizaria (SAR) supergroup which includes ecologically significant animal and plant pathogens, most infamously the causative agent of potato blight Phytophthora infestans. Single-gene and concatenated phylogenetic studies both of individual oomycete genera and of members of the larger class have resulted in conflicting conclusions concerning species phylogenies within the oomycetes, particularly for the large Phytophthora genus. Genome-scale phylogenetic studies have successfully resolved many eukaryotic relationships by using supertree methods, which combine large numbers of potentially disparate trees to determine evolutionary relationships that cannot be inferred from individual phylogenies alone. With a sufficient amount of genomic data now available, we have undertaken the first whole-genome phylogenetic analysis of the oomycetes using data from 37 oomycete species and 6 SAR species. In our analysis, we used established supertree methods to generate phylogenies from 8,355 homologous oomycete and SAR gene families and have complemented those analyses with both phylogenomic network and concatenated supermatrix analyses. Our results show that a genome-scale approach to oomycete phylogeny resolves oomycete classes and individual clades within the problematic Phytophthora genus. Support for the resolution of the inferred relationships between individual Phytophthora clades varies depending on the methodology used. Our analysis represents an important first step in large-scale phylogenomic analysis of the oomycetes. IMPORTANCE The oomycetes are a class of eukaryotes and include ecologically significant animal and plant pathogens. Single-gene and multigene phylogenetic studies of individual oomycete genera and of members of the larger classes have resulted in conflicting conclusions concerning interspecies relationships among these species, particularly for the Phytophthora genus. The onset of next-generation sequencing techniques now means that a wealth of oomycete genomic data is available. For the first time, we have used genome-scale phylogenetic methods to resolve oomycete phylogenetic relationships. We used supertree methods to generate single-gene and multigene species phylogenies. Overall, our supertree analyses utilized phylogenetic data from 8,355 oomycete gene families. We have also complemented our analyses with superalignment phylogenies derived from 131 single-copy ubiquitous gene families. Our results show that a genome-scale approach to oomycete phylogeny resolves oomycete classes and clades. Our analysis represents an important first step in large-scale phylogenomic analysis of the oomycetes. PMID:28435885
Chassain, Benoît; Lemée, Ludovic; Didi, Jennifer; Thiberge, Jean-Michel; Brisse, Sylvain; Pons, Jean-Louis
2012-01-01
Staphylococcus lugdunensis is recognized as one of the major pathogenic species within the genus Staphylococcus, even though it belongs to the coagulase-negative group. A multilocus sequence typing (MLST) scheme was developed to study the genetic relationships and population structure of 87 S. lugdunensis isolates from various clinical and geographic sources by DNA sequence analysis of seven housekeeping genes (aroE, dat, ddl, gmk, ldh, recA, and yqiL). The number of alleles ranged from four (gmk and ldh) to nine (yqiL). Allelic profiles allowed the definition of 20 different sequence types (STs) and five clonal complexes. The 20 STs lacked correlation with geographic source. Isolates recovered from hematogenic infections (blood or osteoarticular isolates) or from skin and soft tissue infections did not cluster in separate lineages. Penicillin-resistant isolates clustered mainly in one clonal complex, unlike glycopeptide-tolerant isolates, which did not constitute a distinct subpopulation within S. lugdunensis. Phylogenies from the sequences of the seven individual housekeeping genes were congruent, indicating a predominantly mutational evolution of these genes. Quantitative analysis of the linkages between alleles from the seven loci revealed a significant linkage disequilibrium, thus confirming a clonal population structure for S. lugdunensis. This first MLST scheme for S. lugdunensis provides a new tool for investigating the macroepidemiology and phylogeny of this unusually virulent coagulase-negative Staphylococcus. PMID:22785196
Bacterial phylogeny structures soil resistomes across habitats
Forsberg, Kevin J.; Patel, Sanket; Gibson, Molly K.; Lauber, Christian L.; Knight, Rob; Fierer, Noah; Dantas, Gautam
2014-01-01
Summary Ancient and diverse antibiotic resistance genes (ARGs) have previously been identified from soil1–3, including genes identical to those in human pathogens4. Despite the apparent overlap between soil and clinical resistomes4–6, factors influencing ARG composition in soil and their movement between genomes and habitats remain largely unknown3. General metagenome functions often correlate with the underlying structure of bacterial communities7–12. However, ARGs are hypothesized to be highly mobile4,5,13, prompting speculation that resistomes may not correlate with phylogenetic signatures or ecological divisions13,14. To investigate these relationships, we performed functional metagenomic selections for resistance to 18 antibiotics from 18 agricultural and grassland soils. The 2895 ARGs we discovered were predominantly novel, and represent all major resistance mechanisms15. We demonstrate that distinct soil types harbor distinct resistomes, and that nitrogen fertilizer amendments strongly influenced soil ARG content. Resistome composition also correlated with microbial phylogenetic and taxonomic structure, both across and within soil types. Consistent with this strong correlation, mobility elements syntenic with ARGs were rare in soil compared to sequenced pathogens, suggesting that ARGs in the soil may not transfer between bacteria as readily as is observed in the clinic. Together, our results indicate that bacterial community composition is the primary determinant of soil ARG content, challenging previous hypotheses that horizontal gene transfer effectively decouples resistomes from phylogeny13,14. PMID:24847883
An Evolutionarily Structured Universe of Protein Architecture
Caetano-Anollés, Gustavo; Caetano-Anollés, Derek
2003-01-01
Protein structural diversity encompasses a finite set of architectural designs. Embedded in these topologies are evolutionary histories that we here uncover using cladistic principles and measurements of protein-fold usage and sharing. The reconstructed phylogenies are inherently rooted and depict histories of protein and proteome diversification. Proteome phylogenies showed two monophyletic sister-groups delimiting Bacteria and Archaea, and a topology rooted in Eucarya. This suggests three dramatic evolutionary events and a common ancestor with a eukaryotic-like, gene-rich, and relatively modern organization. Conversely, a general phylogeny of protein architectures showed that structural classes of globular proteins appeared early in evolution and in defined order, the α/β class being the first. Although most ancestral folds shared a common architecture of barrels or interleaved β-sheets and α-helices, many were clearly derived, such as polyhedral folds in the all-α class and β-sandwiches, β-propellers, and β-prisms in all-β proteins. We also describe transformation pathways of architectures that are prevalently used in nature. For example, β-barrels with increased curl and stagger were favored evolutionary outcomes in the all-β class. Interestingly, we found cases where structural change followed the α-to-β tendency uncovered in the tree of architectures. Lastly, we traced the total number of enzymatic functions associated with folds in the trees and show that there is a general link between structure and enzymatic function. PMID:12840035
Wallace, Andre G; Detweiler, Don; Schaeffer, Stephen W
2011-08-01
The third chromosome of Drosophila pseudoobscura is polymorphic for numerous gene arrangements that form classical clines in North America. The polytene salivary chromosomes isolated from natural populations revealed changes in gene order that allowed the different gene arrangements to be linked together by paracentric inversions representing one of the first cases where genetic data were used to construct a phylogeny. Although the inversion phylogeny can be used to determine the relationships among the gene arrangements, the cytogenetic data are unable to infer the ancestral arrangement or the age of the different chromosome types. These are both important properties if one is to infer the evolutionary forces responsible for the spread and maintenance of the chromosomes. Here, we employ the nucleotide sequences of 18 regions distributed across the third chromosome in 80-100 D. pseudoobscura strains to test whether five gene arrangements are of unique or multiple origin, what the ancestral arrangement was, and what are the ages of the different arrangements. Each strain carried one of six commonly found gene arrangements and the sequences were used to infer their evolutionary relationships. Breakpoint regions in the center of the chromosome supported monophyly of the gene arrangements, whereas regions at the ends of the chromosome gave phylogenies that provided less support for monophyly of the chromosomes either because the individual markers did not have enough phylogenetically informative sites or genetic exchange scrambled information among the gene arrangements. A data set where the genetic markers were concatenated strongly supported a unique origin of the different gene arrangements. The inversion polymorphism of D. pseudoobscura is estimated to be about a million years old. We have also shown that the generated phylogeny is consistent with the cytological phylogeny of this species. In addition, the data presented here support hypothetical as the ancestral arrangement. One of the youngest arrangements, Arrowhead, has one of the highest population frequencies suggesting that selection has been responsible for its rapid increase.
Naushad, Sohail; Barkema, Herman W.; Luby, Christopher; Condas, Larissa A. Z.; Nobrega, Diego B.; Carson, Domonique A.; De Buck, Jeroen
2016-01-01
Non-aureus staphylococci (NAS), a heterogeneous group of a large number of species and subspecies, are the most frequently isolated pathogens from intramammary infections in dairy cattle. Phylogenetic relationships among bovine NAS species are controversial and have mostly been determined based on single-gene trees. Herein, we analyzed phylogeny of bovine NAS species using whole-genome sequencing (WGS) of 441 distinct isolates. In addition, evolutionary relationships among bovine NAS were estimated from multilocus data of 16S rRNA, hsp60, rpoB, sodA, and tuf genes and sequences from these and numerous other single genes/proteins. All phylogenies were created with FastTree, Maximum-Likelihood, Maximum-Parsimony, and Neighbor-Joining methods. Regardless of methodology, WGS-trees clearly separated bovine NAS species into five monophyletic coherent clades. Furthermore, there were consistent interspecies relationships within clades in all WGS phylogenetic reconstructions. Except for the Maximum-Parsimony tree, multilocus data analysis similarly produced five clades. There were large variations in determining clades and interspecies relationships in single gene/protein trees, under different methods of tree constructions, highlighting limitations of using single genes for determining bovine NAS phylogeny. However, based on WGS data, we established a robust phylogeny of bovine NAS species, unaffected by method or model of evolutionary reconstructions. Therefore, it is now possible to determine associations between phylogeny and many biological traits, such as virulence, antimicrobial resistance, environmental niche, geographical distribution, and host specificity. PMID:28066335
Liu, Yajuan J; Hodson, Matthew C; Hall, Benjamin D
2006-09-29
At present, there is not a widely accepted consensus view regarding the phylogenetic structure of kingdom Fungi although two major phyla, Ascomycota and Basidiomycota, are clearly delineated. Regarding the lower fungi, Zygomycota and Chytridiomycota, a variety of proposals have been advanced. Microsporidia may or may not be fungi; the Glomales (vesicular-arbuscular mycorrhizal fungi) may or may not constitute a fifth fungal phylum, and the loss of the flagellum may have occurred either once or multiple times during fungal evolution. All of these issues are capable of being resolved by a molecular phylogenetic analysis which achieves strong statistical support for major branches. To date, no fungal phylogeny based upon molecular characters has satisfied this criterion. Using the translated amino acid sequences of the RPB1 and RPB2 genes, we have inferred a fungal phylogeny that consists largely of well-supported monophyletic phyla. Our major results, each with significant statistical support, are: (1) Microsporidia are sister to kingdom Fungi and are not members of Zygomycota; that is, Microsporidia and fungi originated from a common ancestor. (2) Chytridiomycota, the only fungal phylum having a developmental stage with a flagellum, is paraphyletic and is the basal lineage. (3) Zygomycota is monophyletic based upon sampling of Trichomycetes, Zygomycetes, and Glomales. (4) Zygomycota, Basidiomycota, and Ascomycota form a monophyletic group separate from Chytridiomycota. (5) Basidiomycota and Ascomycota are monophyletic sister groups. In general, this paper highlights the evolutionary position and significance of the lower fungi (Zygomycota and Chytridiomycota). Our results suggest that loss of the flagellum happened only once during early stages of fungal evolution; consequently, the majority of fungi, unlike plants and animals, are nonflagellated. The phylogeny we infer from gene sequences is the first one that is congruent with the widely accepted morphology-based classification of Fungi. We find that, contrary to what has been published elsewhere, the four morphologically defined phyla (Ascomycota, Basidiomycota, Zygomycota and Chytridiomycota) do not overlap with one another. Microsporidia are not included within kingdom Fungi; rather they are a sister-group to the Fungi. Our study demonstrates the applicability of protein sequences from large, slowly-evolving genes to the derivation of well-resolved and highly supported phylogenies across long evolutionary distances.
Classification and phylogeny of the cyanobiont Anabaena azollae Strasburger: an answered question?
Pereira, Ana L; Vasconcelos, Vitor
2014-06-01
The symbiosis Azolla-Anabaena azollae, with a worldwide distribution in pantropical and temperate regions, is one of the most studied, because of its potential application as a biofertilizer, especially in rice fields, but also as an animal food and in phytoremediation. The cyanobiont is a filamentous, heterocystic cyanobacterium that inhabits the foliar cavities of the pteridophyte and the indusium on the megasporocarp (female reproductive structure). The classification and phylogeny of the cyanobiont is very controversial: from its morphology, it has been named Nostoc azollae, Anabaena azollae, Anabaena variabilis status azollae and recently Trichormus azollae, but, from its 16S rRNA gene sequence, it has been assigned to Nostoc and/or Anabaena, and from its phycocyanin gene sequence, it has been assigned as non-Nostoc and non-Anabaena. The literature also points to a possible co-evolution between the cyanobiont and the Azolla host, since dendrograms and phylogenetic trees of fatty acids, short tandemly repeated repetitive (STRR) analysis and restriction fragment length polymorphism (RFLP) analysis of nif genes and the 16S rRNA gene give a two-cluster association that matches the two-section ranking of the host (Azolla). Another controversy surrounds the possible existence of more than one genus or more than one species strain. The use of freshly isolated or cultured cyanobionts is an additional problem, since their morphology and protein profiles are different. This review gives an overview of how morphological, chemical and genetic analyses influence the classification and phylogeny of the cyanobiont and future research. © 2014 IUMS.
Genomic Data Quality Impacts Automated Detection of Lateral Gene Transfer in Fungi
Dupont, Pierre-Yves; Cox, Murray P.
2017-01-01
Lateral gene transfer (LGT, also known as horizontal gene transfer), an atypical mechanism of transferring genes between species, has almost become the default explanation for genes that display an unexpected composition or phylogeny. Numerous methods of detecting LGT events all rely on two fundamental strategies: primary structure composition or gene tree/species tree comparisons. Discouragingly, the results of these different approaches rarely coincide. With the wealth of genome data now available, detection of laterally transferred genes is increasingly being attempted in large uncurated eukaryotic datasets. However, detection methods depend greatly on the quality of the underlying genomic data, which are typically complex for eukaryotes. Furthermore, given the automated nature of genomic data collection, it is typically impractical to manually verify all protein or gene models, orthology predictions, and multiple sequence alignments, requiring researchers to accept a substantial margin of error in their datasets. Using a test case comprising plant-associated genomes across the fungal kingdom, this study reveals that composition- and phylogeny-based methods have little statistical power to detect laterally transferred genes. In particular, phylogenetic methods reveal extreme levels of topological variation in fungal gene trees, the vast majority of which show departures from the canonical species tree. Therefore, it is inherently challenging to detect LGT events in typical eukaryotic genomes. This finding is in striking contrast to the large number of claims for laterally transferred genes in eukaryotic species that routinely appear in the literature, and questions how many of these proposed examples are statistically well supported. PMID:28235827
USDA-ARS?s Scientific Manuscript database
Despite a recent new classification, a stable tree of life for the cycads has been elusive, particularly regarding resolution of Bowenia, Stangeria and Dioon. In this study we apply five single copy nuclear genes (SCNGs) to the phylogeny of the order Cycadales. We specifically aim to evaluate seve...
An evolutionary scenario for the origin of flowers.
Frohlich, Michael W
2003-07-01
The Mostly Male theory is the first to use evidence from gene phylogenies, genetics, modern plant morphology and fossils to explain the evolutionary origin of flowers. It proposes that flower organization derives more from the male structures of ancestral gymnosperms than from female structures. The theory arose from a hypothesis-based study. Such studies are the most likely to generate testable evolutionary scenarios, which should be the ultimate goal of evo-devo.
Chen, Meng-Yun; Liang, Dan; Zhang, Peng
2015-11-01
Incongruence between different phylogenomic analyses is the main challenge faced by phylogeneticists in the genomic era. To reduce incongruence, phylogenomic studies normally adopt some data filtering approaches, such as reducing missing data or using slowly evolving genes, to improve the signal quality of data. Here, we assembled a phylogenomic data set of 58 jawed vertebrate taxa and 4682 genes to investigate the backbone phylogeny of jawed vertebrates under both concatenation and coalescent-based frameworks. To evaluate the efficiency of extracting phylogenetic signals among different data filtering methods, we chose six highly intractable internodes within the backbone phylogeny of jawed vertebrates as our test questions. We found that our phylogenomic data set exhibits substantial conflicting signal among genes for these questions. Our analyses showed that non-specific data sets that are generated without bias toward specific questions are not sufficient to produce consistent results when there are several difficult nodes within a phylogeny. Moreover, phylogenetic accuracy based on non-specific data is considerably influenced by the size of data and the choice of tree inference methods. To address such incongruences, we selected genes that resolve a given internode but not the entire phylogeny. Notably, not only can this strategy yield correct relationships for the question, but it also reduces inconsistency associated with data sizes and inference methods. Our study highlights the importance of gene selection in phylogenomic analyses, suggesting that simply using a large amount of data cannot guarantee correct results. Constructing question-specific data sets may be more powerful for resolving problematic nodes. © The Author(s) 2015. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
USDA-ARS?s Scientific Manuscript database
Premise of study: Our purposes were to (1) obtain a well-resolved plastid counterpart to the 94 gene nuclear ortholog gene phylogeny of Arbizu et al. (2014, Amer. J. Bot. 101:1666-1685; and Syst. Bot., in press), and (2) to investigate various classes and numbers of plastid markers necessary for a c...
Wallis, Graham P; Cameron-Christie, Sophia R; Kennedy, Hannah L; Palmer, Gemma; Sanders, Tessa R; Winter, David J
2017-06-01
Classification, phylogeography and the testing of evolutionary hypotheses rely on correct estimation of species phylogeny. Early molecular phylogenies often relied on mtDNA alone, which acts as a single linkage group with one history. Over the last decade, the use of multiple nuclear sequences has often revealed conflict among gene trees. This observation can be attributed to hybridization, lineage sorting, paralogy or selection. Here, we use 54 groups of fishes from 48 studies to estimate the degree of concordance between mitochondrial and nuclear gene trees in two ecological grades of fishes: marine and freshwater. We test the hypothesis that freshwater fish phylogenies should, on average, show more discordance because of their higher propensity for hybridization in the past. In keeping with this idea, concordance between mitochondrial and nuclear gene trees (as measured by proportion of components shared) is on average 50% higher in marine fishes. We discuss why this difference almost certainly results from introgression caused by greater historical hybridization among lineages in freshwater groups, and further emphasize the need to use multiple nuclear genes, and identify conflict among them, in estimation of species phylogeny. © 2017 John Wiley & Sons Ltd.
Cavalier-Smith, Thomas
2015-04-01
Contradictory and confusing results can arise if sequenced 'monoprotist' samples really contain DNA of very different species. Eukaryote-wide phylogenetic analyses using five genes from the amoeboflagellate culture ATCC 50646 previously implied it was an undescribed percolozoan related to percolatean flagellates (Stephanopogon, Percolomonas). Contrastingly, three phylogenetic analyses of 18S rRNA alone, did not place it within Percolozoa, but as an isolated deep-branching excavate. I resolve that contradiction by sequence phylogenies for all five genes individually, using up to 652 taxa. Its 18S rRNA sequence (GQ377652) is near-identical to one from stained-glass windows, somewhat more distant from one from cooling-tower water, all three related to terrestrial actinocephalid gregarines Hoplorhynchus and Pyxinia. All four protein-gene sequences (Hsp90; α-tubulin; β-tubulin; actin) are from an amoeboflagellate heterolobosean percolozoan, not especially deeply branching. Contrary to previous conclusions from trees combining protein and rRNA sequences or rDNA trees including Eozoa only, this culture does not represent a major novel deep-branching eukaryote lineage distinct from Heterolobosea, and thus lacks special significance for deep eukaryote phylogeny, though the rDNA sequence is important for gregarine phylogeny. α-Tubulin trees for over 250 eukaryotes refute earlier suggestions of lateral gene transfer within eukaryotes, being largely congruent with morphology and other gene trees. Copyright © 2015. Published by Elsevier GmbH.
2010-01-01
Background The subclass Enoplia (Phylum Nematoda) is purported to be the earliest branching clade amongst all nematode taxa, yet the deep phylogeny of this important lineage remains elusive. Free-living marine species within the order Enoplida play prominent roles in marine ecosystems, but previous molecular phylogenies have provided only the briefest evolutionary insights; this study aimed to firmly resolve internal relationships within the hyper-diverse but poorly understood Enoplida. In addition, we revisited the molecular framework of the Nematoda using a rigorous phylogenetic approach in order to investigate patterns of early splits amongst the oldest lineages (Dorylaimia and Enoplia). Results Morphological identifications, nuclear gene sequences (18S and 28S rRNA), and mitochondrial gene sequences (cox1) were obtained from marine Enoplid specimens representing 37 genera. The 18S gene was used to resolve deep splits within the Enoplia and evaluate the branching order of major clades in the nematode tree; multiple phylogenetic methods and rigorous empirical tests were carried out to assess tree topologies under different parameters and combinations of taxa. Significantly increased taxon sampling within the Enoplida resulted in a well-supported, robust phylogenetic topology of this group, although the placement of certain clades was not fully resolved. Our analysis could not unequivocally confirm the earliest splits in the nematode tree, and outgroup choice significantly affected the observed branching order of the Dorylaimia and Enoplia. Both 28S and cox1 were too variable to infer deep phylogeny, but provided additional insight at lower taxonomic levels. Conclusions Analysis of internal relationships reveals that the Enoplia is split into two main clades, with groups consisting of terrestrial (Triplonchida) and primarily marine fauna (Enoplida). Five independent lineages were recovered within the Enoplida, containing a mixture of marine and terrestrial species; clade structure suggests that habitat transitions have occurred at least four times within this group. Unfortunately, we were unable to obtain a consistent or well-supported topology amongst early-branching nematode lineages. It appears unlikely that single-gene phylogenies using the conserved 18S gene will be useful for confirming the branching order at the base of the nematode tree-future efforts will require multi-gene analyses or phylogenomic methods. PMID:21073704
The Tree of Life and a New Classification of Bony Fishes
Betancur-R., Ricardo; Broughton, Richard E.; Wiley, Edward O.; Carpenter, Kent; López, J. Andrés; Li, Chenhong; Holcroft, Nancy I.; Arcila, Dahiana; Sanciangco, Millicent; Cureton II, James C; Zhang, Feifei; Buser, Thaddaeus; Campbell, Matthew A.; Ballesteros, Jesus A; Roa-Varon, Adela; Willis, Stuart; Borden, W. Calvin; Rowley, Thaine; Reneau, Paulette C.; Hough, Daniel J.; Lu, Guoqing; Grande, Terry; Arratia, Gloria; Ortí, Guillermo
2013-01-01
The tree of life of fishes is in a state of flux because we still lack a comprehensive phylogeny that includes all major groups. The situation is most critical for a large clade of spiny-finned fishes, traditionally referred to as percomorphs, whose uncertain relationships have plagued ichthyologists for over a century. Most of what we know about the higher-level relationships among fish lineages has been based on morphology, but rapid influx of molecular studies is changing many established systematic concepts. We report a comprehensive molecular phylogeny for bony fishes that includes representatives of all major lineages. DNA sequence data for 21 molecular markers (one mitochondrial and 20 nuclear genes) were collected for 1410 bony fish taxa, plus four tetrapod species and two chondrichthyan outgroups (total 1416 terminals). Bony fish diversity is represented by 1093 genera, 369 families, and all traditionally recognized orders. The maximum likelihood tree provides unprecedented resolution and high bootstrap support for most backbone nodes, defining for the first time a global phylogeny of fishes. The general structure of the tree is in agreement with expectations from previous morphological and molecular studies, but significant new clades arise. Most interestingly, the high degree of uncertainty among percomorphs is now resolved into nine well-supported supraordinal groups. The order Perciformes, considered by many a polyphyletic taxonomic waste basket, is defined for the first time as a monophyletic group in the global phylogeny. A new classification that reflects our phylogenetic hypothesis is proposed to facilitate communication about the newly found structure of the tree of life of fishes. Finally, the molecular phylogeny is calibrated using 60 fossil constraints to produce a comprehensive time tree. The new time-calibrated phylogeny will provide the basis for and stimulate new comparative studies to better understand the evolution of the amazing diversity of fishes. PMID:23653398
Caufield, Page W; Saxena, Deepak; Fitch, David; Li, Yihong
2007-02-01
There are suggestions that the phylogeny of Streptococcus mutans, a member of the human indigenous biota that is transmitted mostly mother to child, might parallel the evolutionary history of its human host. The relatedness and phylogeny of plasmid-containing strains of S. mutans were examined based on chromosomal DNA fingerprints (CDF), a hypervariable region (HVR) of a 5.6-kb plasmid, the rRNA gene intergenic spacer region (IGSR), serotypes, and the genotypes of mutacin I and II. Plasmid-containing strains were studied because their genetic diversity was twice as great as that of plasmid-free strains. The CDF of S. mutans from unrelated human hosts were unique, except those from Caucasians, which were essentially identical. The evolutionary history of the IGSR, with or without the serotype and mutacin characters, clearly delineated an Asian clade. Also, a continuous association with mutacin II could be reconstructed through an evolutionary lineage with the IGSR, but not for serotype e. DNA sequences from the HVR of the plasmid produced a well-resolved phylogeny that differed from the chromosomal phylogeny, indicating that the horizontal transfer of the plasmid may have occurred multiple times. The plasmid phylogeny was more congruent with serotype e than with mutacin II evolution, suggesting a possible functional correlation. Thus, the history of this three-tiered relationship between human, bacterium, and plasmid supported both coevolution and independent evolution.
Molecular evolution of the crustacean hyperglycemic hormone family in ecdysozoans
2010-01-01
Background Crustacean Hyperglycemic Hormone (CHH) family peptides are neurohormones known to regulate several important functions in decapod crustaceans such as ionic and energetic metabolism, molting and reproduction. The structural conservation of these peptides, together with the variety of functions they display, led us to investigate their evolutionary history. CHH family peptides exist in insects (Ion Transport Peptides) and may be present in all ecdysozoans as well. In order to extend the evolutionary study to the entire family, CHH family peptides were thus searched in taxa outside decapods, where they have been, to date, poorly investigated. Results CHH family peptides were characterized by molecular cloning in a branchiopod crustacean, Daphnia magna, and in a collembolan, Folsomia candida. Genes encoding such peptides were also rebuilt in silico from genomic sequences of another branchiopod, a chelicerate and two nematodes. These sequences were included in updated datasets to build phylogenies of the CHH family in pancrustaceans. These phylogenies suggest that peptides found in Branchiopoda and Collembola are more closely related to insect ITPs than to crustacean CHHs. Datasets were also used to support a phylogenetic hypothesis about pancrustacean relationships, which, in addition to gene structures, allowed us to propose two evolutionary scenarios of this multigenic family in ecdysozoans. Conclusions Evolutionary scenarios suggest that CHH family genes of ecdysozoans originate from an ancestral two-exon gene, and genes of arthropods from a three-exon one. In malacostracans, the evolution of the CHH family has involved several duplication, insertion or deletion events, leading to neuropeptides with a wide variety of functions, as observed in decapods. This family could thus constitute a promising model to investigate the links between gene duplications and functional divergence. PMID:20184761
Wirshing, Herman H; Baker, Andrew C
2014-08-01
Molecular phylogenies of scleractinian corals often fail to agree with traditional phylogenies derived from morphological characters. These discrepancies are generally attributed to non-homologous or morphologically plastic characters used in taxonomic descriptions. Consequently, morphological convergence of coral skeletons among phylogenetically unrelated groups is considered to be the major evolutionary process confounding molecular and morphological hypotheses. A strategy that may help identify cases of convergence and/or diversification in coral morphology is to compare phylogenies of existing "neutral" genetic markers used to estimate genealogic phylogenetic history with phylogenies generated from non-neutral genes involved in calcification (biomineralization). We tested the hypothesis that differences among calcification gene phylogenies with respect to the "neutral" trees may represent convergent or divergent functional strategies among calcification gene proteins that may correlate to aspects of coral skeletal morphology. Partial sequences of two nuclear genes previously determined to be involved in the calcification process in corals, "Cnidaria-III" membrane-bound/secreted α-carbonic anhydrase (CIII-MBSα-CA) and bone morphogenic protein (BMP) 2/4, were PCR-amplified, cloned and sequenced from 31 scleractinian coral species in 26 genera and 9 families. For comparison, "neutral" gene phylogenies were generated from sequences from two protein-coding "non-calcification" genes, one nuclear (β-tubulin) and one mitochondrial (cytochrome b), from the same individuals. Cloned CIII-MBSα-CA sequences were found to be non-neutral, and phylogenetic analyses revealed CIII-MBSα-CAs to exhibit a complex evolutionary history with clones distributed between at least 2 putative gene copies. However, for several coral taxa only one gene copy was recovered. With CIII-MBSα-CA, several recovered clades grouped taxa that differed from the "non-calcification" loci. In some cases, these taxa shared aspects of their skeletal morphology (i.e., convergence or diversification relative to the "non-calcification" loci), but in other cases they did not. For example, the "non-calcification" loci recovered Atlantic and Pacific mussids as separate evolutionary lineages, whereas with CIII-MBSα-CA, clones of two species of Atlantic mussids (Isophyllia sinuosa and Mycetophyllia sp.) and two species of Pacific mussids (Acanthastrea echinata and Lobophyllia hemprichii) were united in a distinct clade (except for one individual of Mycetophyllia). However, this clade also contained other taxa which were not unambiguously correlated with morphological features. BMP2/4 also contained clones that likely represent different gene copies. However, many of the sequences showed no significant deviation from neutrality, and reconstructed phylogenies were similar to the "non-calcification" tree topologies with a few exceptions. Although individual calcification genes are unlikely to precisely explain the diverse morphological features exhibited by scleractinian corals, this study demonstrates an approach for identifying cases where morphological taxonomy may have been misled by convergent and/or divergent molecular evolutionary processes in corals. Studies such as this may help illuminate our understanding of the likely complex evolution of genes involved in the calcification process, and enhance our knowledge of the natural history and biodiversity within this central ecological group. Published by Elsevier Inc.
Kreipe, Victoria; Corral-Hernández, Elena; Scheu, Stefan; Schaefer, Ina; Maraun, Mark
2015-06-01
Species of the genus Steganacarus are soil-living oribatid mites (Acari, Phthiracaridae) with a ptychoid body. The phylogeny and species status of the species of Steganacarus are not resolved, some authors group all ten German species of Steganacarus within the genus Steganacarus whereas others split them into three subgenera, Steganacarus, Tropacarus and Atropacarus. Additionally, two species, S. magnus and T. carinatus, comprise morphotypes of questionable species status. We investigated the phylogeny and species status of ten European Steganacarus species, i.e. S. applicatus, S. herculeanus, S. magnus forma magna, S. magnus forma anomala, S. spinosus, Tropacarus brevipilus, T. carinatus forma carinata, T. carinatus forma pulcherrima, Atropacarus striculus and Rhacaplacarus ortizi. We used two molecular markers, a 251 bp fragment of the nuclear gene 28S rDNA (D3) and a 477 bp fragment of the mitochondrial COI region. The phylogeny based on a combined analysis of D3 and COI separated four subgenera (Steganacarus, Tropacarus and Atropacarus, Rhacaplacarus) indicating that they form monophyletic groups. The COI region separated all ten species of the genus Steganacarus and showed variation within some species often correlating with the geographic origin of the species. Resolution of the more conserved D3 region was limited, indicating that radiation events are rather recent. Overall, our results indicate that both genes alone cannot be used for phylogeny and barcoding since variation is too low in D3 and too high in COI. However, when used in combination these genes provide reliable insight into the phylogeny, radiation and species status of taxa of the genus Steganacarus.
Singh, Reema; Schilde, Christina; Schaap, Pauline
2016-11-17
Dictyostelia are a well-studied group of organisms with colonial multicellularity, which are members of the mostly unicellular Amoebozoa. A phylogeny based on SSU rDNA data subdivided all Dictyostelia into four major groups, but left the position of the root and of six group-intermediate taxa unresolved. Recent phylogenies inferred from 30 or 213 proteins from sequenced genomes, positioned the root between two branches, each containing two major groups, but lacked data to position the group-intermediate taxa. Since the positions of these early diverging taxa are crucial for understanding the evolution of phenotypic complexity in Dictyostelia, we sequenced six representative genomes of early diverging taxa. We retrieved orthologs of 47 housekeeping proteins with an average size of 890 amino acids from six newly sequenced and eight published genomes of Dictyostelia and unicellular Amoebozoa and inferred phylogenies from single and concatenated protein sequence alignments. Concatenated alignments of all 47 proteins, and four out of five subsets of nine concatenated proteins all produced the same consensus phylogeny with 100% statistical support. Trees inferred from just two out of the 47 proteins, individually reproduced the consensus phylogeny, highlighting that single gene phylogenies will rarely reflect correct species relationships. However, sets of two or three concatenated proteins again reproduced the consensus phylogeny, indicating that a small selection of genes suffices for low cost classification of as yet unincorporated or newly discovered dictyostelid and amoebozoan taxa by gene amplification. The multi-locus consensus phylogeny shows that groups 1 and 2 are sister clades in branch I, with the group-intermediate taxon D. polycarpum positioned as outgroup to group 2. Branch II consists of groups 3 and 4, with the group-intermediate taxon Polysphondylium violaceum positioned as sister to group 4, and the group-intermediate taxon Dictyostelium polycephalum branching at the base of that whole clade. Given the data, the approximately unbiased test rejects all alternative topologies favoured by SSU rDNA and individual proteins with high statistical support. The test also rejects monophyletic origins for the genera Acytostelium, Polysphondylium and Dictyostelium. The current position of Acytostelium ellipticum in the consensus phylogeny indicates that somatic cells were lost twice in Dictyostelia.
Phylogenomic Analysis and Dynamic Evolution of Chloroplast Genomes in Salicaceae
Huang, Yuan; Wang, Jun; Yang, Yongping; Fan, Chuanzhu; Chen, Jiahui
2017-01-01
Chloroplast genomes of plants are highly conserved in both gene order and gene content. Analysis of the whole chloroplast genome is known to provide much more informative DNA sites and thus generates high resolution for plant phylogenies. Here, we report the complete chloroplast genomes of three Salix species in family Salicaceae. Phylogeny of Salicaceae inferred from complete chloroplast genomes is generally consistent with previous studies but resolved with higher statistical support. Incongruences of phylogeny, however, are observed in genus Populus, which most likely results from homoplasy. By comparing three Salix chloroplast genomes with the published chloroplast genomes of other Salicaceae species, we demonstrate that the synteny and length of chloroplast genomes in Salicaceae are highly conserved but experienced dynamic evolution among species. We identify seven positively selected chloroplast genes in Salicaceae, which might be related to the adaptive evolution of Salicaceae species. Comparative chloroplast genome analysis within the family also indicates that some chloroplast genes are lost or became pseudogenes, infer that the chloroplast genes horizontally transferred to the nucleus genome. Based on the complete nucleus genome sequences from two Salicaceae species, we remarkably identify that the entire chloroplast genome is indeed transferred and integrated to the nucleus genome in the individual of the reference genome of P. trichocarpa at least once. This observation, along with presence of the large nuclear plastid DNA (NUPTs) and NUPTs-containing multiple chloroplast genes in their original order in the chloroplast genome, favors the DNA-mediated hypothesis of organelle to nucleus DNA transfer. Overall, the phylogenomic analysis using chloroplast complete genomes clearly elucidates the phylogeny of Salicaceae. The identification of positively selected chloroplast genes and dynamic chloroplast-to-nucleus gene transfers in Salicaceae provide resources to better understand the successful adaptation of Salicaceae species. PMID:28676809
Fifteen new earthworm mitogenomes shed new light on phylogeny within the Pheretima complex
Zhang, Liangliang; Sechi, Pierfrancesco; Yuan, Minglong; Jiang, Jibao; Dong, Yan; Qiu, Jiangping
2016-01-01
The Pheretima complex within the Megascolecidae family is a major earthworm group. Recently, the systematic status of the Pheretima complex based on morphology was challenged by molecular studies. In this study, we carry out the first comparative mitogenomic study in oligochaetes. The mitogenomes of 15 earthworm species were sequenced and compared with other 9 available earthworm mitogenomes, with the main aim to explore their phylogenetic relationships and test different analytical approaches on phylogeny reconstruction. The general earthworm mitogenomic features revealed to be conservative: all genes encoded on the same strand, all the protein coding loci shared the same initiation codon (ATG), and tRNA genes showed conserved structures. The Drawida japonica mitogenome displayed the highest A + T content, reversed AT/GC-skews and the highest genetic diversity. Genetic distances among protein coding genes displayed their maximum and minimum interspecific values in the ATP8 and CO1 genes, respectively. The 22 tRNAs showed variable substitution patterns between the considered earthworm mitogenomes. The inclusion of rRNAs positively increased phylogenetic support. Furthermore, we tested different trimming tools for alignment improvement. Our analyses rejected reciprocal monophyly among Amynthas and Metaphire and indicated that the two genera should be systematically classified into one. PMID:26833286
2011-01-01
Background The genus Pyrus belongs to the tribe Pyreae (the former subfamily Maloideae) of the family Rosaceae, and includes one of the most important commercial fruit crops, pear. The phylogeny of Pyrus has not been definitively reconstructed. In our previous efforts, the internal transcribed spacer region (ITS) revealed a poorly resolved phylogeny due to non-concerted evolution of nrDNA arrays. Therefore, introns of low copy nuclear genes (LCNG) are explored here for improved resolution. However, paralogs and lineage sorting are still two challenges for applying LCNGs in phylogenetic studies, and at least two independent nuclear loci should be compared. In this work the second intron of LEAFY and the alcohol dehydrogenase gene (Adh) were selected to investigate their molecular evolution and phylogenetic utility. Results DNA sequence analyses revealed a complex ortholog and paralog structure of Adh genes in Pyrus and Malus, the pears and apples. Comparisons between sequences from RT-PCR and genomic PCR indicate that some Adh homologs are putatively nonfunctional. A partial region of Adh1 was sequenced for 18 Pyrus species and three subparalogs representing Adh1-1 were identified. These led to poorly resolved phylogenies due to low sequence divergence and the inclusion of putative recombinants. For the second intron of LEAFY, multiple inparalogs were discovered for both LFY1int2 and LFY2int2. LFY1int2 is inadequate for phylogenetic analysis due to lineage sorting of two inparalogs. LFY2int2-N, however, showed a relatively high sequence divergence and led to the best-resolved phylogeny. This study documents the coexistence of outparalogs and inparalogs, and lineage sorting of these paralogs and orthologous copies. It reveals putative recombinants that can lead to incorrect phylogenetic inferences, and presents an improved phylogenetic resolution of Pyrus using LFY2int2-N. Conclusions Our study represents the first phylogenetic analyses based on LCNGs in Pyrus. Ancient and recent duplications lead to a complex structure of Adh outparalogs and inparalogs in Pyrus and Malus, resulting in neofunctionalization, nonfunctionalization and possible subfunctionalization. Among all investigated orthologs, LFY2int2-N is the best nuclear marker for phylogenetic reconstruction of Pyrus due to suitable sequence divergence and the absence of lineage sorting. PMID:21917170
Phylogeny with introgression in Habronattus jumping spiders (Araneae: Salticidae).
Leduc-Robert, Geneviève; Maddison, Wayne P
2018-02-22
Habronattus is a diverse clade of jumping spiders with complex courtship displays and repeated evolution of Y chromosomes. A well-resolved species phylogeny would provide an important framework to study these traits, but has not yet been achieved, in part because the few genes available in past studies gave conflicting signals. Such discordant gene trees could be the result of incomplete lineage sorting (ILS) in recently diverged parts of the phylogeny, but there are indications that introgression could be a source of conflict. To infer Habronattus phylogeny and investigate the cause of gene tree discordance, we assembled transcriptomes for 34 Habronattus species and 2 outgroups. The concatenated 2.41 Mb of nuclear data (1877 loci) resolved phylogeny by Maximum Likelihood (ML) with high bootstrap support (95-100%) at most nodes, with some uncertainty surrounding the relationships of H. icenoglei, H. cambridgei, H. oregonensis, and Pellenes canadensis. Species tree analyses by ASTRAL and SVDQuartets gave almost completely congruent results. Several nodes in the ML phylogeny from 12.33 kb of mitochondrial data are incongruent with the nuclear phylogeny and indicate possible mitochondrial introgression: the internal relationships of the americanus and the coecatus groups, the relationship between the altanus, decorus, banksi, and americanus group, and between H. clypeatus and the coecatus group. To determine the relative contributions of ILS and introgression, we analyzed gene tree discordance for nuclear loci longer than 1 kb using Bayesian Concordance Analysis (BCA) for the americanus group (679 loci) and the VCCR clade (viridipes/clypeatus/coecatus/roberti groups) (517 loci) and found signals of introgression in both. Finally, we tested specifically for introgression in the concatenated nuclear matrix with Patterson's D statistics and D FOIL . We found nuclear introgression resulting in substantial admixture between americanus group species, between H. roberti and the clypeatus group, and between the clypeatus and coecatus groups. Our results indicate that the phylogenetic history of Habronattus is predominantly a diverging tree, but that hybridization may have been common between phylogenetically distant species, especially in subgroups with complex courtship displays.
Kelley, Scott T; Cassirer, E Frances; Weiser, Glen C; Safaee, Shirin
2007-01-01
Wild and domestic animal populations are known to be sources and reservoirs of emerging diseases. There is also a growing recognition that horizontal genetic transfer (HGT) plays an important role in bacterial pathogenesis. We used molecular phylogenetic methods to assess diversity and cross-transmission rates of Pasteurellaceae bacteria in populations of bighorn sheep, Dall's sheep, domestic sheep and domestic goats. Members of the Pasteurellaceae cause an array of deadly illnesses including bacterial pneumonia known as "pasteurellosis", a particularly devastating disease for bighorn sheep. A phylogenetic analysis of a combined dataset of two RNA genes (16S ribosomal RNA and RNAse P RNA) revealed remarkable evolutionary diversity among Pasteurella trehalosi and Mannheimia (Pasteurella) haemolytica bacteria isolated from sheep and goats. Several phylotypes appeared to associate with particular host species, though we found numerous instances of apparent cross-transmission among species and populations. Statistical analyses revealed that host species, geographic locale and biovariant classification, but not virulence, correlated strongly with Pasteurellaceae phylogeny. Sheep host species correlated with P. trehalosi isolates phylogeny (PTP test; P=0.002), but not with the phylogeny of M. haemolytica isolates, suggesting that P. trehalosi bacteria may be more host specific. With regards to populations within species, we also discovered a strong correlation between geographic locale and isolate phylogeny in the Rocky Mountain bighorn sheep (PTP test; P=0.001). We also investigated the potential for HGT of the leukotoxin A (lktA) gene, which produces a toxin that plays an integral role in causing disease. Comparative analysis of the combined RNA gene phylogeny and the lktA phylogenies revealed considerable incongruence between the phylogenies, suggestive of HGT. Furthermore, we found identical lktA alleles in unrelated bacterial species, some of which had been isolated from sheep in distantly removed populations. For example, lktA sequences from P. trehalosi isolated from remote Alaskan Dall's sheep were 100% identical over a 900-nucleotide stretch to sequences determined from M. haemolytica isolated from domestic sheep in the UK. This extremely high degree of sequence similarity of lktA sequences among distinct bacterial species suggests that HGT has played a role in the evolution of lktA in wild hosts.
Vd’ačný, Peter; Bourland, William A.; Orsi, William; Epstein, Slava S.; Foissner, Wilhelm
2012-01-01
The class Litostomatea is a highly diverse ciliate taxon comprising hundreds of free-living and endocommensal species. However, their traditional morphology-based classification conflicts with 18S rRNA gene phylogenies indicating (1) a deep bifurcation of the Litostomatea into Rhynchostomatia and Haptoria + Trichostomatia, and (2) body polarization and simplification of the oral apparatus as main evolutionary trends in the Litostomatea. To test whether 18S rRNA molecules provide a suitable proxy for litostomatean evolutionary history, we used eighteen new ITS1-5.8S rRNA-ITS2 region sequences from various free-living litostomatean orders. These single- and multiple-locus analyses are in agreement with previous 18S rRNA gene phylogenies, supporting that both 18S rRNA gene and ITS region sequences are effective tools for resolving phylogenetic relationships among the litostomateans. Despite insertions, deletions and mutational saturations in the ITS region, the present study shows that ITS1 and ITS2 molecules can be used to infer phylogenetic relationships not only at species level but also at higher taxonomic ranks when their secondary structure information is utilized to aid alignment. PMID:22789763
Vd'ačný, Peter; Bourland, William A; Orsi, William; Epstein, Slava S; Foissner, Wilhelm
2012-11-01
The class Litostomatea is a highly diverse ciliate taxon comprising hundreds of free-living and endocommensal species. However, their traditional morphology-based classification conflicts with 18S rRNA gene phylogenies indicating (1) a deep bifurcation of the Litostomatea into Rhynchostomatia and Haptoria+Trichostomatia, and (2) body polarization and simplification of the oral apparatus as main evolutionary trends in the Litostomatea. To test whether 18S rRNA molecules provide a suitable proxy for litostomatean evolutionary history, we used eighteen new ITS1-5.8S rRNA-ITS2 region sequences from various free-living litostomatean orders. These single- and multiple-locus analyses are in agreement with previous 18S rRNA gene phylogenies, supporting that both 18S rRNA gene and ITS region sequences are effective tools for resolving phylogenetic relationships among the litostomateans. Despite insertions, deletions and mutational saturations in the ITS region, the present study shows that ITS1 and ITS2 molecules can be used to infer phylogenetic relationships not only at species level but also at higher taxonomic ranks when their secondary structure information is utilized to aid alignment. Copyright © 2012 Elsevier Inc. All rights reserved.
Diao, Weiping; Snyder, John C.; Liu, Jinbing; Pan, Baogui; Guo, Guangjun; Ge, Wei; Dawood, Mohammad Hasan Salman Ali
2018-01-01
The NAM, ATAF1/2, and CUC2 (NAC) transcription factors form a large plant-specific gene family, which is involved in the regulation of tissue development in response to biotic and abiotic stress. To date, there have been no comprehensive studies investigating chromosomal location, gene structure, gene phylogeny, conserved motifs, or gene expression of NAC in pepper (Capsicum annuum L.). The recent release of the complete genome sequence of pepper allowed us to perform a genome-wide investigation of Capsicum annuum L. NAC (CaNAC) proteins. In the present study, a comprehensive analysis of the CaNAC gene family in pepper was performed, and a total of 104 CaNAC genes were identified. Genome mapping analysis revealed that CaNAC genes were enriched on four chromosomes (chromosomes 1, 2, 3, and 6). In addition, phylogenetic analysis of the NAC domains from pepper, potato, Arabidopsis, and rice showed that CaNAC genes could be clustered into three groups (I, II, and III). Group III, which contained 24 CaNAC genes, was exclusive to the Solanaceae plant family. Gene structure and protein motif analyses showed that these genes were relatively conserved within each subgroup. The number of introns in CaNAC genes varied from 0 to 8, with 83 (78.9%) of CaNAC genes containing two or less introns. Promoter analysis confirmed that CaNAC genes are involved in pepper growth, development, and biotic or abiotic stress responses. Further, the expression of 22 selected CaNAC genes in response to seven different biotic and abiotic stresses [salt, heat shock, drought, Phytophthora capsici, abscisic acid, salicylic acid (SA), and methyl jasmonate (MeJA)] was evaluated by quantitative RT-PCR to determine their stress-related expression patterns. Several putative stress-responsive CaNAC genes, including CaNAC72 and CaNAC27, which are orthologs of the known stress-responsive Arabidopsis gene ANAC055 and potato gene StNAC30, respectively, were highly regulated by treatment with different types of stress. Our results also showed that CaNAC36 plays an important role in the interaction network, interacting with 48 genes. Most of these genes are in the mitogen-activated protein kinase (MAPK) family. Taken together, our results provide a platform for further studies to identify the biological functions of CaNAC genes. PMID:29596349
Interconnected microbiomes and resistomes in low-income human habitats
Pehrsson, Erica C.; Tsukayama, Pablo; Patel, Sanket; Mejía-Bautista, Melissa; Sosa-Soto, Giordano; Navarrete, Karla M.; Calderon, Maritza; Cabrera, Lilia; Hoyos-Arango, William; Bertoli, M. Teresita; Berg, Douglas E.; Gilman, Robert H.; Dantas, Gautam
2016-01-01
Summary Antibiotic-resistant infections annually claim hundreds of thousands of lives worldwide. This problem is exacerbated by resistance gene exchange between pathogens and benign microbes from diverse habitats. Mapping resistance gene dissemination between humans and their environment is a public health priority. We characterized the bacterial community structure and resistance exchange networks of hundreds of interconnected human fecal and environmental samples from two low-income Latin American communities. We found that resistomes across habitats are generally structured by bacterial phylogeny along ecological gradients, but identified key resistance genes that cross habitat boundaries and determined their association with mobile genetic elements. We also assessed the effectiveness of widely-used excreta management strategies in reducing fecal bacteria and resistance genes in these settings representative of low- and middle-income countries. Our results lay the foundation for quantitative risk assessment and surveillance of resistance dissemination across interconnected habitats in settings representing over two-thirds of the world’s population. PMID:27172044
Ludeña, Bertha; Chabrillange, Nathalie; Aberlenc-Bertossi, Frédérique; Adam, Hélène; Tregear, James W.; Pintaud, Jean-Christophe
2011-01-01
Background and Aims Molecular phylogenetic studies of palms (Arecaceae) have not yet provided a fully resolved phylogeny of the family. There is a need to increase the current set of markers to resolve difficult groups such as the Neotropical subtribe Bactridinae (Arecoideae: Cocoseae). We propose the use of two single-copy nuclear genes as valuable tools for palm phylogenetics. Methods New primers were developed for the amplification of the AGAMOUS 1 (AG1) and PHYTOCHROME B (PHYB) genes. For the AGAMOUS gene, the paralogue 1 of Elaeis guineensis (EgAG1) was targeted. The region amplified contained coding sequences between the MIKC K and C MADS-box domains. For the PHYB gene, exon 1 (partial sequence) was first amplified in palm species using published degenerate primers for Poaceae, and then specific palm primers were designed. The two gene portions were sequenced in 22 species of palms representing all genera of Bactridinae, with emphasis on Astrocaryum and Hexopetion, the status of the latter genus still being debated. Key Results The new primers designed allow consistent amplification and high-quality sequencing within the palm family. The two loci studied produced more variability than chloroplast loci and equally or less variability than PRK, RPBII and ITS nuclear markers. The phylogenetic structure obtained with AG1 and PHYB genes provides new insights into intergeneric relationships within the Bactridinae and the intrageneric structure of Astrocaryum. The Hexopetion clade was recovered as monophyletic with both markers and was weakly supported as sister to Astrocaryum sensu stricto in the combined analysis. The rare Astrocaryum minus formed a species complex with Astrocaryum gynacanthum. Moreover, both AG1 and PHYB contain a microsatellite that could have further uses in species delimitation and population genetics. Conclusions AG1 and PHYB provide additional phylogenetic information within the palm family, and should prove useful in combination with other genes to improve the resolution of palm phylogenies. PMID:21828068
Yu, Li; Li, Yi-Wei; Ryder, Oliver A; Zhang, Ya-Ping
2007-10-24
Despite the small number of ursid species, bear phylogeny has long been a focus of study due to their conservation value, as all bear genera have been classified as endangered at either the species or subspecies level. The Ursidae family represents a typical example of rapid evolutionary radiation. Previous analyses with a single mitochondrial (mt) gene or a small number of mt genes either provide weak support or a large unresolved polytomy for ursids. We revisit the contentious relationships within Ursidae by analyzing complete mt genome sequences and evaluating the performance of both entire mt genomes and constituent mtDNA genes in recovering a phylogeny of extremely recent speciation events. This mitochondrial genome-based phylogeny provides strong evidence that the spectacled bear diverged first, while within the genus Ursus, the sloth bear is the sister taxon of all the other five ursines. The latter group is divided into the brown bear/polar bear and the two black bears/sun bear assemblages. These findings resolve the previous conflicts between trees using partial mt genes. The ability of different categories of mt protein coding genes to recover the correct phylogeny is concordant with previous analyses for taxa with deep divergence times. This study provides a robust Ursidae phylogenetic framework for future validation by additional independent evidence, and also has significant implications for assisting in the resolution of other similarly difficult phylogenetic investigations. Identification of base composition bias and utilization of the combined data of whole mitochondrial genome sequences has allowed recovery of a strongly supported phylogeny that is upheld when using multiple alternative outgroups for the Ursidae, a mammalian family that underwent a rapid radiation since the mid- to late Pliocene. It remains to be seen if the reliability of mt genome analysis will hold up in studies of other difficult phylogenetic issues. Although the whole mitochondrial DNA sequence based phylogeny is robust, it remains in conflict with phylogenetic relationships suggested by analysis of limited nuclear-encoded data, a situation that will require gathering more nuclear DNA sequence information.
Yu, Li; Li, Yi-Wei; Ryder, Oliver A; Zhang, Ya-Ping
2007-01-01
Background Despite the small number of ursid species, bear phylogeny has long been a focus of study due to their conservation value, as all bear genera have been classified as endangered at either the species or subspecies level. The Ursidae family represents a typical example of rapid evolutionary radiation. Previous analyses with a single mitochondrial (mt) gene or a small number of mt genes either provide weak support or a large unresolved polytomy for ursids. We revisit the contentious relationships within Ursidae by analyzing complete mt genome sequences and evaluating the performance of both entire mt genomes and constituent mtDNA genes in recovering a phylogeny of extremely recent speciation events. Results This mitochondrial genome-based phylogeny provides strong evidence that the spectacled bear diverged first, while within the genus Ursus, the sloth bear is the sister taxon of all the other five ursines. The latter group is divided into the brown bear/polar bear and the two black bears/sun bear assemblages. These findings resolve the previous conflicts between trees using partial mt genes. The ability of different categories of mt protein coding genes to recover the correct phylogeny is concordant with previous analyses for taxa with deep divergence times. This study provides a robust Ursidae phylogenetic framework for future validation by additional independent evidence, and also has significant implications for assisting in the resolution of other similarly difficult phylogenetic investigations. Conclusion Identification of base composition bias and utilization of the combined data of whole mitochondrial genome sequences has allowed recovery of a strongly supported phylogeny that is upheld when using multiple alternative outgroups for the Ursidae, a mammalian family that underwent a rapid radiation since the mid- to late Pliocene. It remains to be seen if the reliability of mt genome analysis will hold up in studies of other difficult phylogenetic issues. Although the whole mitochondrial DNA sequence based phylogeny is robust, it remains in conflict with phylogenetic relationships suggested by analysis of limited nuclear-encoded data, a situation that will require gathering more nuclear DNA sequence information. PMID:17956639
Phylogeny, rate variation, and genome size evolution of Pelargonium (Geraniaceae).
Weng, Mao-Lun; Ruhlman, Tracey A; Gibby, Mary; Jansen, Robert K
2012-09-01
The phylogeny of 58 Pelargonium species was estimated using five plastid markers (rbcL, matK, ndhF, rpoC1, trnL-F) and one mitochondrial gene (nad5). The results confirmed the monophyly of three major clades and four subclades within Pelargonium but also indicate the need to revise some sectional classifications. This phylogeny was used to examine karyotype evolution in the genus: plotting chromosome sizes, numbers and 2C-values indicates that genome size is significantly correlated with chromosome size but not number. Accelerated rates of nucleotide substitution have been previously detected in both plastid and mitochondrial genes in Pelargonium, but sparse taxon sampling did not enable identification of the phylogenetic distribution of these elevated rates. Using the multigene phylogeny as a constraint, we investigated lineage- and locus-specific heterogeneity of substitution rates in Pelargonium for an expanded number of taxa and demonstrated that both plastid and mitochondrial genes have had accelerated substitution rates but with markedly disparate patterns. In the plastid, the exons of rpoC1 have significantly accelerated substitution rates compared to its intron and the acceleration was mainly due to nonsynonymous substitutions. In contrast, the mitochondrial gene, nad5, experienced substantial acceleration of synonymous substitution rates in three internal branches of Pelargonium, but this acceleration ceased in all terminal branches. Several lineages also have dN/dS ratios significantly greater than one for rpoC1, indicating that positive selection is acting on this gene, whereas the accelerated synonymous substitutions in the mitochondrial gene are the result of elevated mutation rates. Published by Elsevier Inc.
Lin, Feng-Jiau; Liu, Yuan; Sha, Zhongli; Tsang, Ling Ming; Chu, Ka Hou; Chan, Tin-Yam; Liu, Ruiyu; Cui, Zhaoxia
2012-11-16
The evolutionary history and relationships of the mud shrimps (Crustacea: Decapoda: Gebiidea and Axiidea) are contentious, with previous attempts revealing mixed results. The mud shrimps were once classified in the infraorder Thalassinidea. Recent molecular phylogenetic analyses, however, suggest separation of the group into two individual infraorders, Gebiidea and Axiidea. Mitochondrial (mt) genome sequence and structure can be especially powerful in resolving higher systematic relationships that may offer new insights into the phylogeny of the mud shrimps and the other decapod infraorders, and test the hypothesis of dividing the mud shrimps into two infraorders. We present the complete mitochondrial genome sequences of five mud shrimps, Austinogebia edulis, Upogebia major, Thalassina kelanang (Gebiidea), Nihonotrypaea thermophilus and Neaxius glyptocercus (Axiidea). All five genomes encode a standard set of 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and a putative control region. Except for T. kelanang, mud shrimp mitochondrial genomes exhibited rearrangements and novel patterns compared to the pancrustacean ground pattern. Each of the two Gebiidea species (A. edulis and U. major) and two Axiidea species (N. glyptocercus and N. thermophiles) share unique gene order specific to their infraorders and analyses further suggest these two derived gene orders have evolved independently. Phylogenetic analyses based on the concatenated nucleotide and amino acid sequences of 13 protein-coding genes indicate the possible polyphyly of mud shrimps, supporting the division of the group into two infraorders. However, the infraordinal relationships among the Gebiidea and Axiidea, and other reptants are poorly resolved. The inclusion of mt genome from more taxa, in particular the reptant infraorders Polychelida and Glypheidea is required in further analysis. Phylogenetic analyses on the mt genome sequences and the distinct gene orders provide further evidences for the divergence between the two mud shrimp infraorders, Gebiidea and Axiidea, corroborating previous molecular phylogeny and justifying their infraordinal status. Mitochondrial genome sequences appear to be promising markers for resolving phylogenetic issues concerning decapod crustaceans that warrant further investigations and our present study has also provided further information concerning the mt genome evolution of the Decapoda.
2012-01-01
Background The evolutionary history and relationships of the mud shrimps (Crustacea: Decapoda: Gebiidea and Axiidea) are contentious, with previous attempts revealing mixed results. The mud shrimps were once classified in the infraorder Thalassinidea. Recent molecular phylogenetic analyses, however, suggest separation of the group into two individual infraorders, Gebiidea and Axiidea. Mitochondrial (mt) genome sequence and structure can be especially powerful in resolving higher systematic relationships that may offer new insights into the phylogeny of the mud shrimps and the other decapod infraorders, and test the hypothesis of dividing the mud shrimps into two infraorders. Results We present the complete mitochondrial genome sequences of five mud shrimps, Austinogebia edulis, Upogebia major, Thalassina kelanang (Gebiidea), Nihonotrypaea thermophilus and Neaxius glyptocercus (Axiidea). All five genomes encode a standard set of 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and a putative control region. Except for T. kelanang, mud shrimp mitochondrial genomes exhibited rearrangements and novel patterns compared to the pancrustacean ground pattern. Each of the two Gebiidea species (A. edulis and U. major) and two Axiidea species (N. glyptocercus and N. thermophiles) share unique gene order specific to their infraorders and analyses further suggest these two derived gene orders have evolved independently. Phylogenetic analyses based on the concatenated nucleotide and amino acid sequences of 13 protein-coding genes indicate the possible polyphyly of mud shrimps, supporting the division of the group into two infraorders. However, the infraordinal relationships among the Gebiidea and Axiidea, and other reptants are poorly resolved. The inclusion of mt genome from more taxa, in particular the reptant infraorders Polychelida and Glypheidea is required in further analysis. Conclusions Phylogenetic analyses on the mt genome sequences and the distinct gene orders provide further evidences for the divergence between the two mud shrimp infraorders, Gebiidea and Axiidea, corroborating previous molecular phylogeny and justifying their infraordinal status. Mitochondrial genome sequences appear to be promising markers for resolving phylogenetic issues concerning decapod crustaceans that warrant further investigations and our present study has also provided further information concerning the mt genome evolution of the Decapoda. PMID:23153176
Phylogeny of the Paracalanidae Giesbrecht, 1888 (Crustacea: Copepoda: Calanoida).
Cornils, Astrid; Blanco-Bercial, Leocadio
2013-12-01
The Paracalanidae are ecologically-important marine planktonic copepods that occur in the epipelagic zone in temperate and tropical waters. They are often the dominant taxon - in terms of biomass and abundance - in continental shelf regions. As primary consumers, they form a vital link in the pelagic food web between primary producers and higher trophic levels. Despite the ecological importance of the taxon, evolutionary and systematic relationships within the family remain largely unknown. A multigene phylogeny including 24 species, including representatives for all seven genera, was determined based on two nuclear genes, small-subunit (18S) ribosomal RNA and Histone 3 (H3) and one mitochondrial gene, cytochrome c oxidase subunit I (COI). The molecular phylogeny was well supported by Maximum likelihood and Bayesian inference analysis; all genera were found to be monophyletic, except for Paracalanus, which was separated into two distinct clades: the Paracalanus aculeatus group and Paracalanus parvus group. The molecular phylogeny also confirmed previous findings that Mecynocera and Calocalanus are genera of the family Paracalanidae. For comparison, a morphological phylogeny was created for 35 paracalanid species based on 54 morphological characters derived from published descriptions. The morphological phylogeny did not resolve all genera as monophyletic and bootstrap support was not strong. Molecular and morphological phylogenies were not congruent in the positioning of Bestiolina and the Paracalanus species groups, possibly due to the lack of sufficient phylogenetically-informative morphological characters. Copyright © 2013 Elsevier Inc. All rights reserved.
Cao, Jin-Jun; Li, Wei-Hai
2018-01-01
Stoneflies comprise an ancient group of insects, but the phylogenetic position of Plecoptera and phylogenetic relations within Plecoptera have long been controversial, and more molecular data is required to reconstruct precise phylogeny. Herein, we present the complete mitogenome of a stonefly, Suwallia teleckojensis, which is 16146 bp in length and consists of 13 protein-coding genes (PCGs), 2 ribosomal RNAs (rRNAs), 22 transfer RNAs (tRNAs) and a control region (CR). Most PCGs initiate with the standard start codon ATN. However, ND5 and ND1 started with GTG and TTG. Typical termination codons TAA and TAG were found in eleven PCGs, and the remaining two PCGs (COII and ND5) have incomplete termination codons. All transfer RNA genes (tRNAs) have the classic cloverleaf secondary structures, with the exception of tRNASer(AGN), which lacks the dihydrouridine (DHU) arm. Secondary structures of the two ribosomal RNAs were shown referring to previous models. A large tandem repeat region, two potential stem-loop (SL) structures, Poly N structure (2 poly-A, 1 poly-T and 1 poly-C), and four conserved sequence blocks (CSBs) were detected in the control region. Finally, both maximum likelihood (ML) and Bayesian inference (BI) analyses suggested that the Capniidae was monophyletic, and the other five stonefly families form a monophyletic group. In this study, S. teleckojensis was closely related to Sweltsa longistyla, and Chloroperlidae and Perlidae were herein supported to be a sister group. PMID:29495588
Wang, Ying; Cao, Jin-Jun; Li, Wei-Hai
2018-02-28
Stoneflies comprise an ancient group of insects, but the phylogenetic position of Plecoptera and phylogenetic relations within Plecoptera have long been controversial, and more molecular data is required to reconstruct precise phylogeny. Herein, we present the complete mitogenome of a stonefly, Suwallia teleckojensis , which is 16146 bp in length and consists of 13 protein-coding genes (PCGs), 2 ribosomal RNAs (rRNAs), 22 transfer RNAs (tRNAs) and a control region (CR). Most PCGs initiate with the standard start codon ATN. However, ND5 and ND1 started with GTG and TTG. Typical termination codons TAA and TAG were found in eleven PCGs, and the remaining two PCGs ( COII and ND5 ) have incomplete termination codons. All transfer RNA genes (tRNAs) have the classic cloverleaf secondary structures, with the exception of tRNA Ser(AGN) , which lacks the dihydrouridine (DHU) arm. Secondary structures of the two ribosomal RNAs were shown referring to previous models. A large tandem repeat region, two potential stem-loop (SL) structures, Poly N structure (2 poly-A, 1 poly-T and 1 poly-C), and four conserved sequence blocks (CSBs) were detected in the control region. Finally, both maximum likelihood (ML) and Bayesian inference (BI) analyses suggested that the Capniidae was monophyletic, and the other five stonefly families form a monophyletic group. In this study, S. teleckojensis was closely related to Sweltsa longistyla , and Chloroperlidae and Perlidae were herein supported to be a sister group.
Xu, Xuming; Zhang, Samuel Shao-Min; Barnstable, Colin J; Tombran-Tink, Joyce
2006-01-01
Background Pigment epithelium derived factor (PEDF), a member of the serpin family, regulates cell proliferation, promotes survival of neurons, and blocks growth of new blood vessels in mammals. Defining the molecular phylogeny of PEDF by bioinformatic analysis is one approach to understanding the link between its gene structure and its function in these biological processes. Results From a comprehensive search of available DNA databases we identified a single PEDF gene in all vertebrate species examined. These included four mammalian and six non-mammalian vertebrate species in which PEDF had not previously been described. A five gene cluster around PEDF was found in an approximate 100 kb region in mammals, birds, and amphibians. In ray-finned fish these genes are scattered over three chromosomes although only one PEDF gene was consistently found. The PEDF gene is absent in invertebrates including Drosophila melanogaster (D. melanogaster), Caenorhabditis elegans (C. elegans), and sea squirt (C. intestinalis). The PEDF gene is transcribed in all vertebrate phyla, suggesting it is biologically active throughout vertebrate evolution. The multiple actions of PEDF are likely conserved in evolution since it has the same gene structure across phyla, although the size of the gene ranges from 48.3 kb in X. tropicalis to 2.9 kb in fugu, with human PEDF at a size of 15.6 kb. A strong similarity in the proximal 200 bp of the PEDF promoter in mammals suggests the existence of a possible regulatory region across phyla. Using a non-synonymous/synonymous substitution rate ratio we show that mammalian and fish PEDFs have similar ratios of <0.13, reflecting a strong purifying selection of PEDF gene. A large number of repetitive transposable elements of the SINE and LINE class were found with random distribution in both the promoter and introns of mammalian PEDF. Conclusion The PEDF gene first appears in vertebrates and our studies suggest that the regulation and biological actions of this gene are preserved across vertebrates. This comprehensive analysis of the PEDF gene across phyla provides new information that will aid further characterization of common functional motifs of this serpin in biological processes. PMID:17020603
2009-01-01
Background Tunicates represent a key metazoan group as the sister-group of vertebrates within chordates. The six complete mitochondrial genomes available so far for tunicates have revealed distinctive features. Extensive gene rearrangements and particularly high evolutionary rates have been evidenced with regard to other chordates. This peculiar evolutionary dynamics has hampered the reconstruction of tunicate phylogenetic relationships within chordates based on mitogenomic data. Results In order to further understand the atypical evolutionary dynamics of the mitochondrial genome of tunicates, we determined the complete sequence of the solitary ascidian Herdmania momus. This genome from a stolidobranch ascidian presents the typical tunicate gene content with 13 protein-coding genes, 2 rRNAs and 24 tRNAs which are all encoded on the same strand. However, it also presents a novel gene arrangement, highlighting the extreme plasticity of gene order observed in tunicate mitochondrial genomes. Probabilistic phylogenetic inferences were conducted on the concatenation of the 13 mitochondrial protein-coding genes from representatives of major metazoan phyla. We show that whereas standard homogeneous amino acid models support an artefactual sister position of tunicates relative to all other bilaterians, the CAT and CAT+BP site- and time-heterogeneous mixture models place tunicates as the sister-group of vertebrates within monophyletic chordates. Moreover, the reference phylogeny indicates that tunicate mitochondrial genomes have experienced a drastic acceleration in their evolutionary rate that equally affects protein-coding and ribosomal-RNA genes. Conclusion This is the first mitogenomic study supporting the new chordate phylogeny revealed by recent phylogenomic analyses. It illustrates the beneficial effects of an increased taxon sampling coupled with the use of more realistic amino acid substitution models for the reconstruction of animal phylogeny. PMID:19922605
Singh, Tiratha Raj; Tsagkogeorga, Georgia; Delsuc, Frédéric; Blanquart, Samuel; Shenkar, Noa; Loya, Yossi; Douzery, Emmanuel Jp; Huchon, Dorothée
2009-11-17
Tunicates represent a key metazoan group as the sister-group of vertebrates within chordates. The six complete mitochondrial genomes available so far for tunicates have revealed distinctive features. Extensive gene rearrangements and particularly high evolutionary rates have been evidenced with regard to other chordates. This peculiar evolutionary dynamics has hampered the reconstruction of tunicate phylogenetic relationships within chordates based on mitogenomic data. In order to further understand the atypical evolutionary dynamics of the mitochondrial genome of tunicates, we determined the complete sequence of the solitary ascidian Herdmania momus. This genome from a stolidobranch ascidian presents the typical tunicate gene content with 13 protein-coding genes, 2 rRNAs and 24 tRNAs which are all encoded on the same strand. However, it also presents a novel gene arrangement, highlighting the extreme plasticity of gene order observed in tunicate mitochondrial genomes. Probabilistic phylogenetic inferences were conducted on the concatenation of the 13 mitochondrial protein-coding genes from representatives of major metazoan phyla. We show that whereas standard homogeneous amino acid models support an artefactual sister position of tunicates relative to all other bilaterians, the CAT and CAT+BP site- and time-heterogeneous mixture models place tunicates as the sister-group of vertebrates within monophyletic chordates. Moreover, the reference phylogeny indicates that tunicate mitochondrial genomes have experienced a drastic acceleration in their evolutionary rate that equally affects protein-coding and ribosomal-RNA genes. This is the first mitogenomic study supporting the new chordate phylogeny revealed by recent phylogenomic analyses. It illustrates the beneficial effects of an increased taxon sampling coupled with the use of more realistic amino acid substitution models for the reconstruction of animal phylogeny.
Munds, Rachel A; Titus, Chelsea L; Eggert, Lori S; Blomquist, Gregory E
2018-05-25
Extensive phylogenetic studies have found robust phylogenies are modeled by using a multi-gene approach and sampling from the majority of the taxa of interest. Yet, molecular studies focused on the lorises, a cryptic primate family, have often relied on one gene, or just mitochondrial DNA, and many were unable to include all four genera in the analyses, resulting in inconclusive phylogenies. Past phylogenetic loris studies resulted in lorises being monophyletic, paraphyletic, or an unresolvable trichotomy with the closely related galagos. The purpose of our study is to improve our understanding of loris phylogeny and evolutionary history by using a multi-gene approach. We used the mitochondrial genes cytochrome b, and cytochrome c oxidase subunit 1, along with a nuclear intron (recombination activating gene 2) and nuclear exon (the melanocortin 1 receptor). Maximum Likelihood and Bayesian phylogenetic analyses were conducted based on data from each locus, as well as on the concatenated sequences. The robust, concatenated results found lorises to be a monophyletic family (Lorisidae) (PP ≥ 0.99) with two distinct subfamilies: the African Perodictinae (PP ≥ 0.99) and the Asian Lorisinae (PP ≥ 0.99). Additionally, from these analyses all four genera were all recovered as monophyletic (PP ≥ 0.99). Some of our single-gene analyses recovered monophyly, but many had discordances, with some showing paraphyly or a deep-trichotomy. Bayesian partitioned analyses inferred the most recent common ancestors of lorises emerged ∼42 ± 6 million years ago (mya), the Asian Lorisinae separated ∼30 ± 9 mya, and Perodictinae arose ∼26 ± 10 mya. These times fit well with known historical tectonic shifts of the area, as well as with the sparse loris fossil record. Additionally, our results agree with previous multi-gene studies on Lorisidae which found lorises to be monophyletic and arising ∼40 mya (Perelman et al., 2011; Pozzi et al., 2014). By taking a multi-gene approach, we were able to recover a well-supported, monophyletic loris phylogeny and inferred the evolutionary history of this cryptic family. Copyright © 2018 Elsevier Inc. All rights reserved.
Salas-Leiva, Dayana E; Meerow, Alan W; Calonje, Michael; Griffith, M Patrick; Francisco-Ortega, Javier; Nakamura, Kyoko; Stevenson, Dennis W; Lewis, Carl E; Namoff, Sandra
2013-11-01
Despite a recent new classification, a stable phylogeny for the cycads has been elusive, particularly regarding resolution of Bowenia, Stangeria and Dioon. In this study, five single-copy nuclear genes (SCNGs) are applied to the phylogeny of the order Cycadales. The specific aim is to evaluate several gene tree-species tree reconciliation approaches for developing an accurate phylogeny of the order, to contrast them with concatenated parsimony analysis and to resolve the erstwhile problematic phylogenetic position of these three genera. DNA sequences of five SCNGs were obtained for 20 cycad species representing all ten genera of Cycadales. These were analysed with parsimony, maximum likelihood (ML) and three Bayesian methods of gene tree-species tree reconciliation, using Cycas as the outgroup. A calibrated date estimation was developed with Bayesian methods, and biogeographic analysis was also conducted. Concatenated parsimony, ML and three species tree inference methods resolve exactly the same tree topology with high support at most nodes. Dioon and Bowenia are the first and second branches of Cycadales after Cycas, respectively, followed by an encephalartoid clade (Macrozamia-Lepidozamia-Encephalartos), which is sister to a zamioid clade, of which Ceratozamia is the first branch, and in which Stangeria is sister to Microcycas and Zamia. A single, well-supported phylogenetic hypothesis of the generic relationships of the Cycadales is presented. However, massive extinction events inferred from the fossil record that eliminated broader ancestral distributions within Zamiaceae compromise accurate optimization of ancestral biogeographical areas for that hypothesis. While major lineages of Cycadales are ancient, crown ages of all modern genera are no older than 12 million years, supporting a recent hypothesis of mostly Miocene radiations. This phylogeny can contribute to an accurate infrafamilial classification of Zamiaceae.
Unequal rates of Y chromosome gene divergence during speciation of the family Ursidae.
Nakagome, Shigeki; Pecon-Slattery, Jill; Masuda, Ryuichi
2008-07-01
Evolution of the bear family Ursidae is well investigated in terms of morphological, paleontological, and genetic features. However, several phylogenetic ambiguities occur within the subfamily Ursinae (the family Ursidae excluding the giant panda and spectacled bear), which may correlate with behavioral traits of female philopatry and male-biased dispersal which form the basis of the observed matriarchal population structure in these species. In the process of bear evolution, we investigate the premise that such behavioral traits may be reflected in patterns of variation among genes with different modes of inheritance: matrilineal mitochondrial DNA (mtDNA), patrilineal Y chromosome, biparentally inherited autosomes, and the X chromosome. In the present study, we sequenced 3 Y-linked genes (3,453 bp) and 4 X-linked genes (4,960 bp) and reanalyzed previously published sequences from autosome genes (2,347 bp) in ursid species to investigate differences in evolutionary rates associated with patterns of inheritance. The results describe topological incongruence between sex-linked genes and autosome genes and between nuclear DNA and mtDNA. In more ancestral branches within the bear phylogeny, Y-linked genes evolved faster than autosome and X-linked genes, consistent with expectations based on male-driven evolution. However, this pattern changes among branches leading to each species within the lineage of Ursinae whereby the evolutionary rates of Y-linked genes have fewer than expected substitutions. This inconsistency between more recent nodes of the bear phylogeny with more ancestral nodes may reflect the influences of sex-biased dispersal as well as molecular evolutionary characteristics of the Y chromosome, and stochastic events in species natural history, and phylogeography unique to ursine bears.
Insights into the phylogeny or arylamine N-acetyltransferases in fungi.
Martins, Marta; Dairou, Julien; Rodrigues-Lima, Fernando; Dupret, Jean-Marie; Silar, Philippe
2010-08-01
Previous studies have shown that Eumycetes fungi can acylate arylamine thanks to arylamine N-acetyltransferases, xenobiotic-metabolizing enzymes also found in animals and bacteria. In this article, we present the results of mining 96 available fungal genome sequences for arylamine N-acetyltransferase genes and propose their phylogeny. The filamentous Pezizomycotina are shown to possess many putative N-acetyltransferases, whilst these are often lacking in other fungal groups. The evolution of the N-acetyltransferases is best explained by the presence of at least one gene in the opisthokont ancestor of the fungi and animal kingdoms, followed by recurrent gene losses and gene duplications. A possible horizontal gene transfer event may have occurred from bacteria to the basidiomycetous yeast Malassezia globosa.
Xu, Kai Wei; Zou, Lan; Penttinen, Petri; Wang, Ke; Heng, Nan Nan; Zhang, Xiao Ping; Chen, Qiang; Zhao, Ke; Chen, Yuan Xue
2015-10-01
A total of 54 rhizobial strains were isolated from faba bean root nodules in 21 counties of Sichuan hilly areas in China, and their symbiotic effectiveness, genetic diversity and phylogeny were assessed. Only six strains increased the shoot dry mass of the host plant significantly (P ≤ 0.05). Based on the cluster analysis of combined 16S rDNA and intergenic spacer region (IGS) PCR-RFLP, the strains were divided into 31 genotypes in 11 groups, indicating a high degree of genetic diversity among the strains. The sequence analysis of three housekeeping genes (atpD, glnII and recA) and 16S rDNA indicated that the strains represented two R. leguminosarum, two Rhizobium spp., R. mesosinicum, Agrobacterium sp. and A. tumefaciens. The strains representing four Rhizobium species were divided into two distinct nodC and nifH genotypes. However, the phylogeny of housekeeping genes and symbiotic genes was not congruent, implying that the strains had been shaped by vertical evolution of the housekeeping genes and lateral evolution of the symbiotic genes. Copyright © 2015 Elsevier GmbH. All rights reserved.
New Perspectives on Ebola Virus Evolution.
Brown, Celeste J; Quates, Caleb J; Mirabzadeh, Christopher A; Miller, Craig R; Wichman, Holly A; Miura, Tanya A; Ytreberg, F Marty
2016-01-01
Since the recent devastating outbreak of Ebola virus disease in western Africa, there has been significant effort to understand the evolution of the deadly virus that caused the outbreak. There has been a considerable investment in sequencing Ebola virus (EBOV) isolates, and the results paint an important picture of how the virus has spread in western Africa. EBOV evolution cannot be understood outside the context of previous outbreaks, however. We have focused this study on the evolution of the EBOV glycoprotein gene (GP) because one of its products, the spike glycoprotein (GP1,2), is central to the host immune response and because it contains a large amount of the phylogenetic signal for this virus. We inferred the maximum likelihood phylogeny of 96 nonredundant GP gene sequences representing each of the outbreaks since 1976 up to the end of 2014. We tested for positive selection and considered the placement of adaptive amino acid substitutions along the phylogeny and within the protein structure of GP1,2. We conclude that: 1) the common practice of rooting the phylogeny of EBOV between the first known outbreak in 1976 and the next outbreak in 1995 provides a misleading view of EBOV evolution that ignores the fact that there is a non-human EBOV host between outbreaks; 2) the N-terminus of GP1 may be constrained from evolving in response to the host immune system by the highly expressed, secreted glycoprotein, which is encoded by the same region of the GP gene; 3) although the mucin-like domain of GP1 is essential for EBOV in vivo, it evolves rapidly without losing its twin functions: providing O-linked glycosylation sites and a flexible surface.
Urantowka, Adam Dawid; Kroczak, Aleksandra; Mackiewicz, Paweł
2017-07-14
Conures are a morphologically diverse group of Neotropical parrots classified as members of the tribe Arini, which has recently been subjected to a taxonomic revision. The previously broadly defined Aratinga genus of this tribe has been split into the 'true' Aratinga and three additional genera, Eupsittula, Psittacara and Thectocercus. Popular markers used in the reconstruction of the parrots' phylogenies derive from mitochondrial DNA. However, current phylogenetic analyses seem to indicate conflicting relationships between Aratinga and other conures, and also among other Arini members. Therefore, it is not clear if the mtDNA phylogenies can reliably define the species tree. The inconsistencies may result from the variable evolution rate of the markers used or their weak phylogenetic signal. To resolve these controversies and to assess to what extent the phylogenetic relationships in the tribe Arini can be inferred from mitochondrial genomes, we compared representative Arini mitogenomes as well as examined the usefulness of the individual mitochondrial markers and the efficiency of various phylogenetic methods. Single molecular markers produced inconsistent tree topologies, while different methods offered various topologies even for the same marker. A significant disagreement in these tree topologies occurred for cytb, nd2 and nd6 genes, which are commonly used in parrot phylogenies. The strongest phylogenetic signal was found in the control region and RNA genes. However, these markers cannot be used alone in inferring Arini phylogenies because they do not provide fully resolved trees. The most reliable phylogeny of the parrots under study is obtained only on the concatenated set of all mitochondrial markers. The analyses established significantly resolved relationships within the former Aratinga representatives and the main genera of the tribe Arini. Such mtDNA phylogeny can be in agreement with the species tree, owing to its match with synapomorphic features in plumage colouration. Phylogenetic relationships inferred from single mitochondrial markers can be incorrect and contradictory. Therefore, such phylogenies should be considered with caution. Reliable results can be produced by concatenated sets of all or at least the majority of mitochondrial genes and the control region. The results advance a new view on the relationships among the main genera of Arini and resolve the inconsistencies between the taxa that were previously classified as the broadly defined genus Aratinga. Although gene and species trees do not always have to be consistent, the mtDNA phylogenies for Arini can reflect the species tree.
Kosushkin, S A; Borodulina, O R; Solov'eva, E N; Grechko, V V
2008-01-01
We have isolated and characterised sequences of a SINE family specific for squamate reptiles from a genome of lacertid lizard that we called Squam1. Copies are 360-390 bp in length and share a significant similarity with tRNA gene sequence on its 5'-end. This family was also detected by us in DNA of representatives of varanids, iguanids (anolis), gekkonids, and snakes. No signs of it were found in DNA of mammals, birds, amphibians, and crocodiles. Detailed analysis of primary structure of the retroposons obtained by us from genomic libraries or GenBank sequences was carried out. Most taxa possess 2-3 subfamilies of the SINE in their genomes with specific diagnostic features in their primary structure. Individual variability of copies in different families is about 85% and is just slightly lower on the genera level. Comparison of consensus sequences on family level reveals a high degree of structural similarity with a number of specific apomorphic features which makes it a useful marker of phylogeny for this group of reptiles. Snakes do not show specific affinity to varanids when compared to other lizards, as it was suggested earlier.
Ghost-tree: creating hybrid-gene phylogenetic trees for diversity analyses.
Fouquier, Jennifer; Rideout, Jai Ram; Bolyen, Evan; Chase, John; Shiffer, Arron; McDonald, Daniel; Knight, Rob; Caporaso, J Gregory; Kelley, Scott T
2016-02-24
Fungi play critical roles in many ecosystems, cause serious diseases in plants and animals, and pose significant threats to human health and structural integrity problems in built environments. While most fungal diversity remains unknown, the development of PCR primers for the internal transcribed spacer (ITS) combined with next-generation sequencing has substantially improved our ability to profile fungal microbial diversity. Although the high sequence variability in the ITS region facilitates more accurate species identification, it also makes multiple sequence alignment and phylogenetic analysis unreliable across evolutionarily distant fungi because the sequences are hard to align accurately. To address this issue, we created ghost-tree, a bioinformatics tool that integrates sequence data from two genetic markers into a single phylogenetic tree that can be used for diversity analyses. Our approach starts with a "foundation" phylogeny based on one genetic marker whose sequences can be aligned across organisms spanning divergent taxonomic groups (e.g., fungal families). Then, "extension" phylogenies are built for more closely related organisms (e.g., fungal species or strains) using a second more rapidly evolving genetic marker. These smaller phylogenies are then grafted onto the foundation tree by mapping taxonomic names such that each corresponding foundation-tree tip would branch into its new "extension tree" child. We applied ghost-tree to graft fungal extension phylogenies derived from ITS sequences onto a foundation phylogeny derived from fungal 18S sequences. Our analysis of simulated and real fungal ITS data sets found that phylogenetic distances between fungal communities computed using ghost-tree phylogenies explained significantly more variance than non-phylogenetic distances. The phylogenetic metrics also improved our ability to distinguish small differences (effect sizes) between microbial communities, though results were similar to non-phylogenetic methods for larger effect sizes. The Silva/UNITE-based ghost tree presented here can be easily integrated into existing fungal analysis pipelines to enhance the resolution of fungal community differences and improve understanding of these communities in built environments. The ghost-tree software package can also be used to develop phylogenetic trees for other marker gene sets that afford different taxonomic resolution, or for bridging genome trees with amplicon trees. ghost-tree is pip-installable. All source code, documentation, and test code are available under the BSD license at https://github.com/JTFouquier/ghost-tree .
Yuan, Ming-Long; Zhang, Qi-Lin; Zhang, Li; Jia, Cheng-Lin; Li, Xiao-Peng; Yang, Xing-Zhuo; Feng, Run-Qiu
2018-05-01
Grassland caterpillars (Lepidoptera: Lymantriinae: Gynaephora) are the most important pests in alpine meadows of the Tibetan Plateau (TP) and have well adapted to high-altitude environments. To further understand the evolutionary history and their adaptation to the TP, we newly determined seven complete TP Gynaephora mitogenomes. Compared to single genes, whole mitogenomes provided the best phylogenetic signals and obtained robust results, supporting the monophyly of the TP Gynaephora species and a phylogeny of Arctiinae + (Aganainae + Lymantriinae). Incongruent phylogenetic signals were found among single mitochondrial genes, none of which recovered the same phylogeny as the whole mitogenome. We identified six best-performing single genes using Shimodaira-Hasegawa tests and found that the combinations of rrnS and either cox1 or cox3 generated the same phylogeny as the whole mitogenome, indicating the phylogenetic potential of these three genes for future evolutionary studies of Gynaephora. The TP Gynaephora species were estimated to radiate on the TP during the Pliocene and Quaternary, supporting an association of the diversification and speciation of the TP Gynaephora species with the TP uplifts and associated climate changes during this time. Selection analyses revealed accelerated evolutionary rates of the mitochondrial protein-coding genes in the TP Gynaephora species, suggesting that they accumulated more nonsynonymous substitutions that may benefit their adaptation to high altitudes. Furthermore, signals of positive selection were detected in nad5 of two Gynaephora species with the highest altitude-distributions, indicating that this gene may contribute to Gynaephora's adaptation to divergent altitudes. This study adds to the understanding of the TP Gynaephora evolutionary relationships and suggests a link between mitogenome evolution and ecological adaptation to high-altitude environments in grassland caterpillars. Copyright © 2018 Elsevier Inc. All rights reserved.
Chen, Meng-Yun; Liang, Dan; Zhang, Peng
2017-08-01
The interordinal relationships of Laurasiatherian mammals are currently one of the most controversial questions in mammalian phylogenetics. Previous studies mainly relied on coding sequences (CDS) and seldom used noncoding sequences. Here, by data mining public genome data, we compiled an intron data set of 3,638 genes (all introns from a protein-coding gene are considered as a gene) (19,055,073 bp) and a CDS data set of 10,259 genes (20,994,285 bp), covering all major lineages of Laurasiatheria (except Pholidota). We found that the intron data contained stronger and more congruent phylogenetic signals than the CDS data. In agreement with this observation, concatenation and species-tree analyses of the intron data set yielded well-resolved and identical phylogenies, whereas the CDS data set produced weakly supported and incongruent results. Further analyses showed that the phylogeny inferred from the intron data is highly robust to data subsampling and change in outgroup, but the CDS data produced unstable results under the same conditions. Interestingly, gene tree statistical results showed that the most frequently observed gene tree topologies for the CDS and intron data are identical, suggesting that the major phylogenetic signal within the CDS data is actually congruent with that within the intron data. Our final result of Laurasiatheria phylogeny is (Eulipotyphla,((Chiroptera, Perissodactyla),(Carnivora, Cetartiodactyla))), favoring a close relationship between Chiroptera and Perissodactyla. Our study 1) provides a well-supported phylogenetic framework for Laurasiatheria, representing a step towards ending the long-standing "hard" polytomy and 2) argues that intron within genome data is a promising data resource for resolving rapid radiation events across the tree of life. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Genome-wide characterization of the Pectate Lyase-like (PLL) genes in Brassica rapa.
Jiang, Jingjing; Yao, Lina; Miao, Ying; Cao, Jiashu
2013-11-01
Pectate lyases (PL) depolymerize demethylated pectin (pectate, EC 4.2.2.2) by catalyzing the eliminative cleavage of α-1,4-glycosidic linked galacturonan. Pectate Lyase-like (PLL) genes are one of the largest and most complex families in plants. However, studies on the phylogeny, gene structure, and expression of PLL genes are limited. To understand the potential functions of PLL genes in plants, we characterized their intron-exon structure, phylogenetic relationships, and protein structures, and measured their expression patterns in various tissues, specifically the reproductive tissues in Brassica rapa. Sequence alignments revealed two characteristic motifs in PLL genes. The chromosome location analysis indicated that 18 of the 46 PLL genes were located in the least fractionated sub-genome (LF) of B. rapa, while 16 were located in the medium fractionated sub-genome (MF1) and 12 in the more fractionated sub-genome (MF2). Quantitative RT-PCR analysis showed that BrPLL genes were expressed in various tissues, with most of them being expressed in flowers. Detailed qRT-PCR analysis identified 11 pollen specific PLL genes and several other genes with unique spatial expression patterns. In addition, some duplicated genes showed similar expression patterns. The phylogenetic analysis identified three PLL gene subfamilies in plants, among which subfamily II might have evolved from gene neofunctionalization or subfunctionalization. Therefore, this study opens the possibility for exploring the roles of PLL genes during plant development.
Wright, Jeremy J; David, Solomon R; Near, Thomas J
2012-06-01
Extant gars represent the remaining members of a formerly diverse assemblage of ancient ray-finned fishes and have been the subject of multiple phylogenetic analyses using morphological data. Here, we present the first hypothesis of phylogenetic relationships among living gar species based on molecular data, through the examination of gene tree heterogeneity and coalescent species tree analyses of a portion of one mitochondrial (COI) and seven nuclear (ENC1, myh6, plagl2, S7 ribosomal protein intron 1, sreb2, tbr1, and zic1) genes. Individual gene trees displayed varying degrees of resolution with regards to species-level relationships, and the gene trees inferred from COI and the S7 intron were the only two that were completely resolved. Coalescent species tree analyses of nuclear genes resulted in a well-resolved and strongly supported phylogenetic tree of living gar species, for which Bayesian posterior node support was further improved by the inclusion of the mitochondrial gene. Species-level relationships among gars inferred from our molecular data set were highly congruent with previously published morphological phylogenies, with the exception of the placement of two species, Lepisosteus osseus and L. platostomus. Re-examination of the character coding used by previous authors provided partial resolution of this topological discordance, resulting in broad concordance in the phylogenies inferred from individual genes, the coalescent species tree analysis, and morphology. The completely resolved phylogeny inferred from the molecular data set with strong Bayesian posterior support at all nodes provided insights into the potential for introgressive hybridization and patterns of allopatric speciation in the evolutionary history of living gars, as well as a solid foundation for future examinations of functional diversification and evolutionary stasis in a "living fossil" lineage. Copyright © 2012 Elsevier Inc. All rights reserved.
Optimization of Multilocus Sequence Analysis for Identification of Species in the Genus Vibrio
Gabriel, Michael W.; Matsui, George Y.; Friedman, Robert
2014-01-01
Multilocus sequence analysis (MLSA) is an important method for identification of taxa that are not well differentiated by 16S rRNA gene sequences alone. In this procedure, concatenated sequences of selected genes are constructed and then analyzed. The effects that the number and the order of genes used in MLSA have on reconstruction of phylogenetic relationships were examined. The recA, rpoA, gapA, 16S rRNA gene, gyrB, and ftsZ sequences from 56 species of the genus Vibrio were used to construct molecular phylogenies, and these were evaluated individually and using various gene combinations. Phylogenies from two-gene sequences employing recA and rpoA in both possible gene orders were different. The addition of the gapA gene sequence, producing all six possible concatenated sequences, reduced the differences in phylogenies to degrees of statistical (bootstrap) support for some nodes. The overall statistical support for the phylogenetic tree, assayed on the basis of a reliability score (calculated from the number of nodes having bootstrap values of ≥80 divided by the total number of nodes) increased with increasing numbers of genes used, up to a maximum of four. No further improvement was observed from addition of the fifth gene sequence (ftsZ), and addition of the sixth gene (gyrB) resulted in lower proportions of strongly supported nodes. Reductions in the numbers of strongly supported nodes were also observed when maximum parsimony was employed for tree construction. Use of a small number of gene sequences in MLSA resulted in accurate identification of Vibrio species. PMID:24951781
Milla, Liz; van Nieukerken, Erik J; Vijverberg, Ruben; Doorenweerd, Camiel; Wilcox, Stephen A; Halsey, Mike; Young, David A; Jones, Therésa M; Kallies, Axel; Hilton, Douglas J
2018-03-01
Heliozelidae are a widespread, evolutionarily early diverging family of small, day-flying monotrysian moths, for which a comprehensive phylogeny is lacking. We generated the first molecular phylogeny of the family using DNA sequences of two mitochondrial genes (COI and COII) and two nuclear genes (H3 and 28S) from 130 Heliozelidae specimens, including eight of the twelve known genera: Antispila, Antispilina, Coptodisca, Heliozela, Holocacista, Hoplophanes, Pseliastis, and Tyriozela. Our results provide strong support for five major Heliozelidae clades: (i) a large widespread clade containing the leaf-mining genera Antispilina, Coptodisca and Holocacista and some species of Antispila, (ii) a clade containing most of the described Antispila, (iii) a clade containing the leaf-mining genus Heliozela and the monotypic genus Tyriozela, (iv) an Australian clade containing Pseliastis and (v) an Australian clade containing Hoplophanes. Each clade includes several new species and potentially new genera. Collectively, our data uncover a rich and undescribed diversity that appears to be especially prevalent in Australia. Our work highlights the need for a major taxonomic revision of the family and for generating a robust molecular phylogeny using multi-gene approaches in order to resolve the relationships among clades. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Phylogenetic relationships of Hemiptera inferred from mitochondrial and nuclear genes.
Song, Nan; Li, Hu; Cai, Wanzhi; Yan, Fengming; Wang, Jianyun; Song, Fan
2016-11-01
Here, we reconstructed the Hemiptera phylogeny based on the expanded mitochondrial protein-coding genes and the nuclear 18S rRNA gene, separately. The differential rates of change across lineages may associate with long-branch attraction (LBA) effect and result in conflicting estimates of phylogeny from different types of data. To reduce the potential effects of systematic biases on inferences of topology, various data coding schemes, site removal method, and different algorithms were utilized in phylogenetic reconstruction. We show that the outgroups Phthiraptera, Thysanoptera, and the ingroup Sternorrhyncha share similar base composition, and exhibit "long branches" relative to other hemipterans. Thus, the long-branch attraction between these groups is suspected to cause the failure of recovering Hemiptera under the homogeneous model. In contrast, a monophyletic Hemiptera is supported when heterogeneous model is utilized in the analysis. Although higher level phylogenetic relationships within Hemiptera remain to be answered, consensus between analyses is beginning to converge on a stable phylogeny.
Gene order in rosid phylogeny, inferred from pairwise syntenies among extant genomes
2012-01-01
Background Ancestral gene order reconstruction for flowering plants has lagged behind developments in yeasts, insects and higher animals, because of the recency of widespread plant genome sequencing, sequencers' embargoes on public data use, paralogies due to whole genome duplication (WGD) and fractionation of undeleted duplicates, extensive paralogy from other sources, and the computational cost of existing methods. Results We address these problems, using the gene order of four core eudicot genomes (cacao, castor bean, papaya and grapevine) that have escaped any recent WGD events, and two others (poplar and cucumber) that descend from independent WGDs, in inferring the ancestral gene order of the rosid clade and those of its main subgroups, the fabids and malvids. We improve and adapt techniques including the OMG method for extracting large, paralogy-free, multiple orthologies from conflated pairwise synteny data among the six genomes and the PATHGROUPS approach for ancestral gene order reconstruction in a given phylogeny, where some genomes may be descendants of WGD events. We use the gene order evidence to evaluate the hypothesis that the order Malpighiales belongs to the malvids rather than as traditionally assigned to the fabids. Conclusions Gene orders of ancestral eudicot species, involving 10,000 or more genes can be reconstructed in an efficient, parsimonious and consistent way, despite paralogies due to WGD and other processes. Pairwise genomic syntenies provide appropriate input to a parameter-free procedure of multiple ortholog identification followed by gene-order reconstruction in solving instances of the "small phylogeny" problem. PMID:22759433
Vibrio chromosomes share common history.
Kirkup, Benjamin C; Chang, LeeAnn; Chang, Sarah; Gevers, Dirk; Polz, Martin F
2010-05-10
While most gamma proteobacteria have a single circular chromosome, Vibrionales have two circular chromosomes. Horizontal gene transfer is common among Vibrios, and in light of this genetic mobility, it is an open question to what extent the two chromosomes themselves share a common history since their formation. Single copy genes from each chromosome (142 genes from chromosome I and 42 genes from chromosome II) were identified from 19 sequenced Vibrionales genomes and their phylogenetic comparison suggests consistent phylogenies for each chromosome. Additionally, study of the gene organization and phylogeny of the respective origins of replication confirmed the shared history. Thus, while elements within the chromosomes may have experienced significant genetic mobility, the backbones share a common history. This allows conclusions based on multilocus sequence analysis (MLSA) for one chromosome to be applied equally to both chromosomes.
Singh, Prashant; Singh, Satya Shila; Elster, Josef; Mishra, Arun Kumar
2013-06-01
In order to assess phylogeny, population genetics, and approximation of future course of cyanobacterial evolution based on nifH gene sequences, 41 heterocystous cyanobacterial strains collected from all over India have been used in the present study. NifH gene sequence analysis data confirm that the heterocystous cyanobacteria are monophyletic while the stigonematales show polyphyletic origin with grave intermixing. Further, analysis of nifH gene sequence data using intricate mathematical extrapolations revealed that the nucleotide diversity and recombination frequency is much greater in Nostocales than the Stigonematales. Similarly, DNA divergence studies showed significant values of divergence with greater gene conversion tracts in the unbranched (Nostocales) than the branched (Stigonematales) strains. Our data strongly support the origin of true branching cyanobacterial strains from the unbranched strains.
Visualizing phylogenetic tree landscapes.
Wilgenbusch, James C; Huang, Wen; Gallivan, Kyle A
2017-02-02
Genomic-scale sequence alignments are increasingly used to infer phylogenies in order to better understand the processes and patterns of evolution. Different partitions within these new alignments (e.g., genes, codon positions, and structural features) often favor hundreds if not thousands of competing phylogenies. Summarizing and comparing phylogenies obtained from multi-source data sets using current consensus tree methods discards valuable information and can disguise potential methodological problems. Discovery of efficient and accurate dimensionality reduction methods used to display at once in 2- or 3- dimensions the relationship among these competing phylogenies will help practitioners diagnose the limits of current evolutionary models and potential problems with phylogenetic reconstruction methods when analyzing large multi-source data sets. We introduce several dimensionality reduction methods to visualize in 2- and 3-dimensions the relationship among competing phylogenies obtained from gene partitions found in three mid- to large-size mitochondrial genome alignments. We test the performance of these dimensionality reduction methods by applying several goodness-of-fit measures. The intrinsic dimensionality of each data set is also estimated to determine whether projections in 2- and 3-dimensions can be expected to reveal meaningful relationships among trees from different data partitions. Several new approaches to aid in the comparison of different phylogenetic landscapes are presented. Curvilinear Components Analysis (CCA) and a stochastic gradient decent (SGD) optimization method give the best representation of the original tree-to-tree distance matrix for each of the three- mitochondrial genome alignments and greatly outperformed the method currently used to visualize tree landscapes. The CCA + SGD method converged at least as fast as previously applied methods for visualizing tree landscapes. We demonstrate for all three mtDNA alignments that 3D projections significantly increase the fit between the tree-to-tree distances and can facilitate the interpretation of the relationship among phylogenetic trees. We demonstrate that the choice of dimensionality reduction method can significantly influence the spatial relationship among a large set of competing phylogenetic trees. We highlight the importance of selecting a dimensionality reduction method to visualize large multi-locus phylogenetic landscapes and demonstrate that 3D projections of mitochondrial tree landscapes better capture the relationship among the trees being compared.
Zhao, Rui-lin; Desjardin, Dennis E.; Soytong, Kasem; Hyde, Kevin D.
2008-01-01
We present an overview of previous research results on the molecular phylogenetic analyses in Agaricales and its higher ranks (Agaricomycetes/Agaricomycotina/Basidiomycota) along with the most recent treatments of taxonomic systems in these taxa. Establishing phylogenetic hypotheses using DNA sequences, from which an understanding of the natural evolutionary relationships amongst clades may be derived, requires a robust dataset. It has been recognized that single-gene phylogenies may not truly represent organismal phylogenies, but the concordant phylogenetic genealogies from multiple-gene datasets can resolve this problem. The genes commonly used in mushroom phylogenetic research are summarized. PMID:18837104
Molecular Analysis of the Nitrate-Reducing Community from Unplanted and Maize-Planted Soils
Philippot, Laurent; Piutti, Séverine; Martin-Laurent, Fabrice; Hallet, Stéphanie; Germon, Jean Claude
2002-01-01
Microorganisms that use nitrate as an alternative terminal electron acceptor play an important role in the global nitrogen cycle. The diversity of the nitrate-reducing community in soil and the influence of the maize roots on the structure of this community were studied. The narG gene encoding the membrane bound nitrate reductase was selected as a functional marker for the nitrate-reducing community. The use of narG is of special interest because the phylogeny of the narG gene closely reflects the 16S ribosomal DNA phylogeny. Therefore, targeting the narG gene provided for the first time a unique insight into the taxonomic composition of the nitrate-reducing community in planted and unplanted soils. The PCR-amplified narG fragments were cloned and analyzed by restriction fragment length polymorphism (RFLP). In all, 60 RFLP types represented by two or more clones were identified in addition to the 58 RFLP types represented by only one clone. At least one clone belonging to each RFLP type was then sequenced. Several of the obtained sequences were not related to the narG genes from cultivated bacteria, suggesting the existence of unidentified nitrate-reducing bacteria in the studied soil. However, environmental sequences were also related to NarG from many bacterial divisions, i.e., Actinobacteria and α, β, and γ Proteobacteria. The presence of the plant roots resulted in a shift in the structure of the nitrate-reducing community between the unplanted and planted soils. Sequencing of RFLP types dominant in the rhizosphere or present only in the rhizosphere revealed that they are related to NarG from the Actinobacteria in an astonishingly high proportion. PMID:12450836
The Extent of Genome Flux and Its Role in the Differentiation of Bacterial Lineages
Nowell, Reuben W.; Green, Sarah; Laue, Bridget E.; Sharp, Paul M.
2014-01-01
Horizontal gene transfer (HGT) and gene loss are key processes in bacterial evolution. However, the role of gene gain and loss in the emergence and maintenance of ecologically differentiated bacterial populations remains an open question. Here, we use whole-genome sequence data to quantify gene gain and loss for 27 lineages of the plant-associated bacterium Pseudomonas syringae. We apply an extensive error-control procedure that accounts for errors in draft genome data and greatly improves the accuracy of patterns of gene occurrence among these genomes. We demonstrate a history of extensive genome fluctuation for this species and show that individual lineages could have acquired thousands of genes in the same period in which a 1% amino acid divergence accrues in the core genome. Elucidating the dynamics of genome fluctuation reveals the rapid turnover of gained genes, such that the majority of recently gained genes are quickly lost. Despite high observed rates of fluctuation, a phylogeny inferred from patterns of gene occurrence is similar to a phylogeny based on amino acid replacements within the core genome. Furthermore, the core genome phylogeny suggests that P. syringae should be considered a number of distinct species, with levels of divergence at least equivalent to those between recognized bacterial species. Gained genes are transferred from a variety of sources, reflecting the depth and diversity of the potential gene pool available via HGT. Overall, our results provide further insights into the evolutionary dynamics of genome fluctuation and implicate HGT as a major factor contributing to the diversification of P. syringae lineages. PMID:24923323
Phylogeny of mycoplasmalike organisms (phytoplasmas): a basis for their classification.
Gundersen, D E; Lee, I M; Rehner, S A; Davis, R E; Kingsbury, D T
1994-01-01
A global phylogenetic analysis using parsimony of 16S rRNA gene sequences from 46 mollicutes, 19 mycoplasmalike organisms (MLOs) (new trivial name, phytoplasmas), and several related bacteria placed the MLOs definitively among the members of the class Mollicutes and revealed that MLOs form a large discrete monophyletic clade, paraphyletic to the Acholeplasma species, within the Anaeroplasma clade. Within the MLO clade resolved in the global mollicutes phylogeny and a comprehensive MLO phylogeny derived by parsimony analyses of 16S rRNA gene sequences from 30 diverse MLOs representative of nearly all known distinct MLO groups, five major phylogenetic groups with a total of 11 distinct subclades (monophyletic groups or taxa) could be recognized. These MLO subclades (roman numerals) and designated type strains were as follows: i, Maryland aster yellows AY1; ii, apple proliferation AP-A; iii, peanut witches'-broom PnWB; iv, Canada peach X CX; v, rice yellow dwarf RYD; vi, pigeon pea witches'-broom PPWB; vii, palm lethal yellowing LY; viii, ash yellows AshY; ix, clover proliferation CP; x, elm yellows EY; and xi, loofah witches'-broom LfWB. The designations of subclades and their phylogenetic positions within the MLO clade were supported by a congruent phylogeny derived by parsimony analyses of ribosomal protein L22 gene sequences from most representative MLOs. On the basis of the phylogenies inferred in the present study, we propose that MLOs should be represented taxonomically at the minimal level of genus and that each phylogenetically distinct MLO subclade identified should represent at least a distinct species under this new genus. Images PMID:8071198
Thirugnanasambantham, Krishnaraj; Saravanan, Subramanian; Karikalan, Kulandaivelu; Bharanidharan, Rajaraman; Lalitha, Perumal; Ilango, S; HairulIslam, Villianur Ibrahim
2015-10-01
Momordica charantia (bitter gourd, bitter melon) is a monoecious Cucurbitaceae with anti-oxidant, anti-microbial, anti-viral and anti-diabetic potential. Molecular studies on this economically valuable plant are very essential to understand its phylogeny and evolution. MicroRNAs (miRNAs) are conserved, small, non-coding RNA with ability to regulate gene expression by bind the 3' UTR region of target mRNA and are evolved at different rates in different plant species. In this study we have utilized homology based computational approach and identified 27 mature miRNAs for the first time from this bio-medically important plant. The phylogenetic tree developed from binary data derived from the data on presence/absence of the identified miRNAs were noticed to be uncertain and biased. Most of the identified miRNAs were highly conserved among the plant species and sequence based phylogeny analysis of miRNAs resolved the above difficulties in phylogeny approach using miRNA. Predicted gene targets of the identified miRNAs revealed their importance in regulation of plant developmental process. Reported miRNAs held sequence conservation in mature miRNAs and the detailed phylogeny analysis of pre-miRNA sequences revealed genus specific segregation of clusters. Copyright © 2015 Elsevier Ltd. All rights reserved.
Shang, Haihong; Li, Wei; Zou, Changsong; Yuan, Youlu
2013-07-01
NAC domain proteins are plant-specific transcription factors known to play diverse roles in various plant developmental processes. In the present study, we performed the first comprehensive study of the NAC gene family in Gossypium raimondii Ulbr., incorporating phylogenetic, chromosomal location, gene structure, conserved motif, and expression profiling analyses. We identified 145 NAC transcription factor (NAC-TF) genes that were phylogenetically clustered into 18 distinct subfamilies. Of these, 127 NAC-TF genes were distributed across the 13 chromosomes, 80 (55%) were preferentially retained duplicates located in both duplicated regions and six were located in triplicated chromosomal regions. The majority of NAC-TF genes showed temporal-, spatial-, and tissue-specific expression patterns based on transcriptomic and qRT-PCR analyses. However, the expression patterns of several duplicate genes were partially redundant, suggesting the occurrence of sub-functionalization during their evolution. Based on their genomic organization, we concluded that genomic duplications contributed significantly to the expansion of the NAC-TF gene family in G. raimondii. Comprehensive analysis of their expression profiles could provide novel insights into the functional divergence among members of the NAC gene family in G. raimondii. © 2013 Institute of Botany, Chinese Academy of Sciences.
Molecular phylogeny, morphology, pigment chemistry and ecology in Hygrophoraceae (Agaricales)
D. Jean Lodge; Mahajabeen Padamsee; P. Brandon Matheny; M. Catherine Aime; Sharon A. Cantrell; David Boertmann; Alexander Kovalenko; Alfredo Vizzini; Bryn T.M. Dentinger; Paul M. Kirk; A. Martin Ainsworth; Jean-Marc Moncalvo; Rytas Vilgalys; Ellen Larsson; Robert Lucking; Gareth W. Griffith; Matthew E. Smith; Lorilei L. Norvell; Dennis E. Desjardin; Scott A. Redhead; Clark L. Ovrebo; Edgar B. Lickey; Enrico Ercole; Karen W. Hughes; Regis Courtecuisse; Anthony Young; Manfred Binder; Andrew M. Minnis; Daniel L. Lindner; Beatriz Ortiz-Santana; John Haight; Thomas Laessoe; Timothy J. Baroni; Jozsef Geml; Tsutomu Hattori
2013-01-01
Molecular phylogenies using 1â4 gene regions and information on ecology, morphology and pigment chemistry were used in a partial revision of the agaric family Hygrophoraceae. The phylogenetically supported genera we recognize here in the Hygrophoraceae based on these and previous analyses are: Acantholichen, Ampulloclitocybe, Arrhenia, Cantharellula, Cantharocybe,...
SICLE: a high-throughput tool for extracting evolutionary relationships from phylogenetic trees.
DeBlasio, Dan F; Wisecaver, Jennifer H
2016-01-01
We present the phylogeny analysis software SICLE (Sister Clade Extractor), an easy-to-use, high-throughput tool to describe the nearest neighbors to a node of interest in a phylogenetic tree as well as the support value for the relationship. The application is a command line utility that can be embedded into a phylogenetic analysis pipeline or can be used as a subroutine within another C++ program. As a test case, we applied this new tool to the published phylome of Salinibacter ruber, a species of halophilic Bacteriodetes, identifying 13 unique sister relationships to S. ruber across the 4,589 gene phylogenies. S. ruber grouped with bacteria, most often other Bacteriodetes, in the majority of phylogenies, but 91 phylogenies showed a branch-supported sister association between S. ruber and Archaea, an evolutionarily intriguing relationship indicative of horizontal gene transfer. This test case demonstrates how SICLE makes it possible to summarize the phylogenetic information produced by automated phylogenetic pipelines to rapidly identify and quantify the possible evolutionary relationships that merit further investigation. SICLE is available for free for noncommercial use at http://eebweb.arizona.edu/sicle/.
Chidebe, Ifeoma N.
2017-01-01
ABSTRACT Cowpea derives most of its N nutrition from biological nitrogen fixation (BNF) via symbiotic bacteroids in root nodules. In Sub-Saharan Africa, the diversity and biogeographic distribution of bacterial microsymbionts nodulating cowpea and other indigenous legumes are not well understood, though needed for increased legume production. The aim of this study was to describe the distribution and phylogenies of rhizobia at different agroecological regions of Mozambique using PCR of the BOX element (BOX-PCR), restriction fragment length polymorphism of the internal transcribed spacer (ITS-RFLP), and sequence analysis of ribosomal, symbiotic, and housekeeping genes. A total of 122 microsymbionts isolated from two cowpea varieties (IT-1263 and IT-18) grouped into 17 clades within the BOX-PCR dendrogram. The PCR-ITS analysis yielded 17 ITS types for the bacterial isolates, while ITS-RFLP analysis placed all test isolates in six distinct clusters (I to VI). BLASTn sequence analysis of 16S rRNA and four housekeeping genes (glnII, gyrB, recA, and rpoB) showed their alignment with Rhizobium and Bradyrhizobium species. The results revealed a group of highly diverse and adapted cowpea-nodulating microsymbionts which included Bradyrhizobium pachyrhizi, Bradyrhizobium arachidis, Bradyrhizobium yuanmingense, and a novel Bradyrhizobium sp., as well as Rhizobium tropici, Rhizobium pusense, and Neorhizobium galegae in Mozambican soils. Discordances observed in single-gene phylogenies could be attributed to horizontal gene transfer and/or subsequent recombinations of the genes. Natural deletion of 60 bp of the gyrB region was observed in isolate TUTVU7; however, this deletion effect on DNA gyrase function still needs to be confirmed. The inconsistency of nifH with core gene phylogenies suggested differences in the evolutionary history of both chromosomal and symbiotic genes. IMPORTANCE A diverse group of both Bradyrhizobium and Rhizobium species responsible for cowpea nodulation in Mozambique was found in this study. Future studies could prove useful in evaluating these bacterial isolates for symbiotic efficiency and strain competitiveness in Mozambican soils. PMID:29101189
Industrial applications of high-performance computing for phylogeny reconstruction
NASA Astrophysics Data System (ADS)
Bader, David A.; Moret, Bernard M.; Vawter, Lisa
2001-07-01
Phylogenies (that is, tree-of-life relationships) derived from gene order data may prove crucial in answering some fundamental open questions in biomolecular evolution. Real-world interest is strong in determining these relationships. For example, pharmaceutical companies may use phylogeny reconstruction in drug discovery for discovering synthetic pathways unique to organisms that they wish to target. Health organizations study the phylogenies of organisms such as HIV in order to understand their epidemiologies and to aid in predicting the behaviors of future outbreaks. And governments are interested in aiding the production of such foodstuffs as rice, wheat and potatoes via genetics through understanding of the phylogenetic distribution of genetic variation in wild populations. Yet few techniques are available for difficult phylogenetic reconstruction problems. Appropriate tools for analysis of such data may aid in resolving some of the phylogenetic problems that have been analyzed without much resolution for decades. With the rapid accumulation of whole genome sequences for a wide diversity of taxa, especially microbial taxa, phylogenetic reconstruction based on changes in gene order and gene content is showing promise, particularly for resolving deep (i.e., ancient) branch splits. However, reconstruction from gene-order data is even more computationally expensive than reconstruction from sequence data, particularly in groups with large numbers of genes and highly-rearranged genomes. We have developed a software suite, GRAPPA, that extends the breakpoint analysis (BPAnalysis) method of Sankoff and Blanchette while running much faster: in a recent analysis of chloroplast genome data for species of Campanulaceae on a 512-processor Linux supercluster with Myrinet, we achieved a one-million-fold speedup over BPAnalysis. GRAPPA can use either breakpoint or inversion distance (computed exactly) for its computation and runs on single-processor machines as well as parallel and high-performance computers.
Sagova-Mareckova, Marketa; Ulanova, Dana; Sanderova, Petra; Omelka, Marek; Kamenik, Zdenek; Olsovska, Jana; Kopecky, Jan
2015-04-01
Distribution and evolutionary history of resistance genes in environmental actinobacteria provide information on intensity of antibiosis and evolution of specific secondary metabolic pathways at a given site. To this day, actinobacteria producing biologically active compounds were isolated mostly from soil but only a limited range of soil environments were commonly sampled. Consequently, soil remains an unexplored environment in search for novel producers and related evolutionary questions. Ninety actinobacteria strains isolated at contrasting soil sites were characterized phylogenetically by 16S rRNA gene, for presence of erm and ABC transporter resistance genes and antibiotic production. An analogous analysis was performed in silico with 246 and 31 strains from Integrated Microbial Genomes (JGI_IMG) database selected by the presence of ABC transporter genes and erm genes, respectively. In the isolates, distances of erm gene sequences were significantly correlated to phylogenetic distances based on 16S rRNA genes, while ABC transporter gene distances were not. The phylogenetic distance of isolates was significantly correlated to soil pH and organic matter content of isolation sites. In the analysis of JGI_IMG datasets the correlation between phylogeny of resistance genes and the strain phylogeny based on 16S rRNA genes or five housekeeping genes was observed for both the erm genes and ABC transporter genes in both actinobacteria and streptomycetes. However, in the analysis of sequences from genomes where both resistance genes occurred together the correlation was observed for both ABC transporter and erm genes in actinobacteria but in streptomycetes only in the erm gene. The type of erm resistance gene sequences was influenced by linkage to 16S rRNA gene sequences and site characteristics. The phylogeny of ABC transporter gene was correlated to 16S rRNA genes mainly above the genus level. The results support the concept of new specific secondary metabolite scaffolds occurring more likely in taxonomically distant producers but suggest that the antibiotic selection of gene pools is also influenced by site conditions.
Salas-Leiva, Dayana E.; Meerow, Alan W.; Calonje, Michael; Griffith, M. Patrick; Francisco-Ortega, Javier; Nakamura, Kyoko; Stevenson, Dennis W.; Lewis, Carl E.; Namoff, Sandra
2013-01-01
Background and aims Despite a recent new classification, a stable phylogeny for the cycads has been elusive, particularly regarding resolution of Bowenia, Stangeria and Dioon. In this study, five single-copy nuclear genes (SCNGs) are applied to the phylogeny of the order Cycadales. The specific aim is to evaluate several gene tree–species tree reconciliation approaches for developing an accurate phylogeny of the order, to contrast them with concatenated parsimony analysis and to resolve the erstwhile problematic phylogenetic position of these three genera. Methods DNA sequences of five SCNGs were obtained for 20 cycad species representing all ten genera of Cycadales. These were analysed with parsimony, maximum likelihood (ML) and three Bayesian methods of gene tree–species tree reconciliation, using Cycas as the outgroup. A calibrated date estimation was developed with Bayesian methods, and biogeographic analysis was also conducted. Key Results Concatenated parsimony, ML and three species tree inference methods resolve exactly the same tree topology with high support at most nodes. Dioon and Bowenia are the first and second branches of Cycadales after Cycas, respectively, followed by an encephalartoid clade (Macrozamia–Lepidozamia–Encephalartos), which is sister to a zamioid clade, of which Ceratozamia is the first branch, and in which Stangeria is sister to Microcycas and Zamia. Conclusions A single, well-supported phylogenetic hypothesis of the generic relationships of the Cycadales is presented. However, massive extinction events inferred from the fossil record that eliminated broader ancestral distributions within Zamiaceae compromise accurate optimization of ancestral biogeographical areas for that hypothesis. While major lineages of Cycadales are ancient, crown ages of all modern genera are no older than 12 million years, supporting a recent hypothesis of mostly Miocene radiations. This phylogeny can contribute to an accurate infrafamilial classification of Zamiaceae. PMID:23997230
Phylogeny of flowering plants by the chloroplast genome sequences: in search of a "lucky gene".
Logacheva, M D; Penin, A A; Samigullin, T H; Vallejo-Roman, C M; Antonov, A S
2007-12-01
One of the most complicated remaining problems of molecular-phylogenetic analysis is choosing an appropriate genome region. In an ideal case, such a region should have two specific properties: (i) results of analysis using this region should be similar to the results of multigene analysis using the maximal number of regions; (ii) this region should be arranged compactly and be significantly shorter than the multigene set. The second condition is necessary to facilitate sequencing and extension of taxons under analysis, the number of which is also crucial for molecular phylogenetic analysis. Such regions have been revealed for some groups of animals and have been designated as "lucky genes". We have carried out a computational experiment on analysis of 41 complete chloroplast genomes of flowering plants aimed at searching for a "lucky gene" for reconstruction of their phylogeny. It is shown that the phylogenetic tree inferred from a combination of translated nucleotide sequences of genes encoding subunits of plastid RNA polymerase is closest to the tree constructed using all protein coding sites of the chloroplast genome. The only node for which a contradiction is observed is unstable according to the different type analyses. For all the other genes or their combinations, the coincidence is significantly worse. The RNA polymerase genes are compactly arranged in the genome and are fourfold shorter than the total length of protein coding genes used for phylogenetic analysis. The combination of all necessary features makes this group of genes main candidates for the role of "lucky gene" in studying phylogeny of flowering plants.
The Bacterial Mobile Resistome Transfer Network Connecting the Animal and Human Microbiomes
Hu, Yongfei; Yang, Xi; Li, Jing; Lv, Na; Liu, Fei; Wu, Jun; Lin, Ivan Y. C.; Wu, Na; Gao, George F.
2016-01-01
ABSTRACT Horizontally acquired antibiotic resistance genes (ARGs) in bacteria are highly mobile and have been ranked as principal risk resistance determinants. However, the transfer network of the mobile resistome and the forces driving mobile ARG transfer are largely unknown. Here, we present the whole profile of the mobile resistome in 23,425 bacterial genomes and explore the effects of phylogeny and ecology on the recent transfer (≥99% nucleotide identity) of mobile ARGs. We found that mobile ARGs are mainly present in four bacterial phyla and are significantly enriched in Proteobacteria. The recent mobile ARG transfer network, which comprises 703 bacterial species and 16,859 species pairs, is shaped by the bacterial phylogeny, while an ecological barrier also exists, especially when interrogating bacteria colonizing different human body sites. Phylogeny is still a driving force for the transfer of mobile ARGs between farm animals and the human gut, and, interestingly, the mobile ARGs that are shared between the human and animal gut microbiomes are also harbored by diverse human pathogens. Taking these results together, we suggest that phylogeny and ecology are complementary in shaping the bacterial mobile resistome and exert synergistic effects on the development of antibiotic resistance in human pathogens. IMPORTANCE The development of antibiotic resistance threatens our modern medical achievements. The dissemination of antibiotic resistance can be largely attributed to the transfer of bacterial mobile antibiotic resistance genes (ARGs). Revealing the transfer network of these genes in bacteria and the forces driving the gene flow is of great importance for controlling and predicting the emergence of antibiotic resistance in the clinic. Here, by analyzing tens of thousands of bacterial genomes and millions of human and animal gut bacterial genes, we reveal that the transfer of mobile ARGs is mainly controlled by bacterial phylogeny but under ecological constraints. We also found that dozens of ARGs are transferred between the human and animal gut and human pathogens. This work demonstrates the whole profile of mobile ARGs and their transfer network in bacteria and provides further insight into the evolution and spread of antibiotic resistance in nature. PMID:27613679
The Bacterial Mobile Resistome Transfer Network Connecting the Animal and Human Microbiomes.
Hu, Yongfei; Yang, Xi; Li, Jing; Lv, Na; Liu, Fei; Wu, Jun; Lin, Ivan Y C; Wu, Na; Weimer, Bart C; Gao, George F; Liu, Yulan; Zhu, Baoli
2016-11-15
Horizontally acquired antibiotic resistance genes (ARGs) in bacteria are highly mobile and have been ranked as principal risk resistance determinants. However, the transfer network of the mobile resistome and the forces driving mobile ARG transfer are largely unknown. Here, we present the whole profile of the mobile resistome in 23,425 bacterial genomes and explore the effects of phylogeny and ecology on the recent transfer (≥99% nucleotide identity) of mobile ARGs. We found that mobile ARGs are mainly present in four bacterial phyla and are significantly enriched in Proteobacteria The recent mobile ARG transfer network, which comprises 703 bacterial species and 16,859 species pairs, is shaped by the bacterial phylogeny, while an ecological barrier also exists, especially when interrogating bacteria colonizing different human body sites. Phylogeny is still a driving force for the transfer of mobile ARGs between farm animals and the human gut, and, interestingly, the mobile ARGs that are shared between the human and animal gut microbiomes are also harbored by diverse human pathogens. Taking these results together, we suggest that phylogeny and ecology are complementary in shaping the bacterial mobile resistome and exert synergistic effects on the development of antibiotic resistance in human pathogens. The development of antibiotic resistance threatens our modern medical achievements. The dissemination of antibiotic resistance can be largely attributed to the transfer of bacterial mobile antibiotic resistance genes (ARGs). Revealing the transfer network of these genes in bacteria and the forces driving the gene flow is of great importance for controlling and predicting the emergence of antibiotic resistance in the clinic. Here, by analyzing tens of thousands of bacterial genomes and millions of human and animal gut bacterial genes, we reveal that the transfer of mobile ARGs is mainly controlled by bacterial phylogeny but under ecological constraints. We also found that dozens of ARGs are transferred between the human and animal gut and human pathogens. This work demonstrates the whole profile of mobile ARGs and their transfer network in bacteria and provides further insight into the evolution and spread of antibiotic resistance in nature. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Phylogeny and temporal diversification of darters (Percidae: Etheostomatinae).
Near, Thomas J; Bossu, Christen M; Bradburd, Gideon S; Carlson, Rose L; Harrington, Richard C; Hollingsworth, Phillip R; Keck, Benjamin P; Etnier, David A
2011-10-01
Discussions aimed at resolution of the Tree of Life are most often focused on the interrelationships of major organismal lineages. In this study, we focus on the resolution of some of the most apical branches in the Tree of Life through exploration of the phylogenetic relationships of darters, a species-rich clade of North American freshwater fishes. With a near-complete taxon sampling of close to 250 species, we aim to investigate strategies for efficient multilocus data sampling and the estimation of divergence times using relaxed-clock methods when a clade lacks a fossil record. Our phylogenetic data set comprises a single mitochondrial DNA (mtDNA) gene and two nuclear genes sampled from 245 of the 248 darter species. This dense sampling allows us to determine if a modest amount of nuclear DNA sequence data can resolve relationships among closely related animal species. Darters lack a fossil record to provide age calibration priors in relaxed-clock analyses. Therefore, we use a near-complete species-sampled phylogeny of the perciform clade Centrarchidae, which has a rich fossil record, to assess two distinct strategies of external calibration in relaxed-clock divergence time estimates of darters: using ages inferred from the fossil record and molecular evolutionary rate estimates. Comparison of Bayesian phylogenies inferred from mtDNA and nuclear genes reveals that heterospecific mtDNA is present in approximately 12.5% of all darter species. We identify three patterns of mtDNA introgression in darters: proximal mtDNA transfer, which involves the transfer of mtDNA among extant and sympatric darter species, indeterminate introgression, which involves the transfer of mtDNA from a lineage that cannot be confidently identified because the introgressed haplotypes are not clearly referable to mtDNA haplotypes in any recognized species, and deep introgression, which is characterized by species diversification within a recipient clade subsequent to the transfer of heterospecific mtDNA. The results of our analyses indicate that DNA sequences sampled from single-copy nuclear genes can provide appreciable phylogenetic resolution for closely related animal species. A well-resolved near-complete species-sampled phylogeny of darters was estimated with Bayesian methods using a concatenated mtDNA and nuclear gene data set with all identified heterospecific mtDNA haplotypes treated as missing data. The relaxed-clock analyses resulted in very similar posterior age estimates across the three sampled genes and methods of calibration and therefore offer a viable strategy for estimating divergence times for clades that lack a fossil record. In addition, an informative rank-free clade-based classification of darters that preserves the rich history of nomenclature in the group and provides formal taxonomic communication of darter clades was constructed using the mtDNA and nuclear gene phylogeny. On the whole, the appeal of mtDNA for phylogeny inference among closely related animal species is diminished by the observations of extensive mtDNA introgression and by finding appreciable phylogenetic signal in a modest sampling of nuclear genes in our phylogenetic analyses of darters.
Podsiadlowski, Lars; Braband, Anke; Struck, Torsten H; von Döhren, Jörn; Bartolomaeus, Thomas
2009-01-01
Background The new animal phylogeny established several taxa which were not identified by morphological analyses, most prominently the Ecdysozoa (arthropods, roundworms, priapulids and others) and Lophotrochozoa (molluscs, annelids, brachiopods and others). Lophotrochozoan interrelationships are under discussion, e.g. regarding the position of Nemertea (ribbon worms), which were discussed to be sister group to e.g. Mollusca, Brachiozoa or Platyhelminthes. Mitochondrial genomes contributed well with sequence data and gene order characters to the deep metazoan phylogeny debate. Results In this study we present the first complete mitochondrial genome record for a member of the Nemertea, Lineus viridis. Except two trnP and trnT, all genes are located on the same strand. While gene order is most similar to that of the brachiopod Terebratulina retusa, sequence based analyses of mitochondrial genes place nemerteans close to molluscs, phoronids and entoprocts without clear preference for one of these taxa as sister group. Conclusion Almost all recent analyses with large datasets show good support for a taxon comprising Annelida, Mollusca, Brachiopoda, Phoronida and Nemertea. But the relationships among these taxa vary between different studies. The analysis of gene order differences gives evidence for a multiple independent occurrence of a large inversion in the mitochondrial genome of Lophotrochozoa and a re-inversion of the same part in gastropods. We hypothesize that some regions of the genome have a higher chance for intramolecular recombination than others and gene order data have to be analysed carefully to detect convergent rearrangement events. PMID:19660126
Fadhlaoui-Zid, Karima; Knittweis, Leyla; Aurelle, Didier; Nafkha, Chaala; Ezzeddine, Soufia; Fiorentino, Fabio; Ghmati, Hisham; Ceriola, Luca; Jarboui, Othman; Maltagliati, Ferruccio
2012-01-01
The polymorphism of the mitochondrial gene cytochrome oxidase III was studied in the Mediterranean octopus, Octopus vulgaris Cuvier, 1797. A total of 202 specimens from seven sampling sites were analysed with the aim of elucidating patterns of genetic structure in the central Mediterranean Sea and to give an insight into the phylogeny of the Octopus genus. Phylogenetic analyses showed that individuals from the central Mediterranean belong to the O. vulgaris species whose limits should nevertheless be clarified. Concerning genetic structure, two high-frequency haplotypes were present in all locations. The overall genetic divergence (Φ(ST)=0.05, P<0.05) indicated a significant genetic structuring in the study area and an AMOVA highlighted a significant break between western and eastern Mediterranean basins (Φ(CT)=0.094, P<0.05). Possible explanations for the observed patterns of genetic structuring are discussed with reference to their relevance for fisheries management. Copyright © 2012. Published by Elsevier SAS.
Phylogeny and Divergence Times of Gymnosperms Inferred from Single-Copy Nuclear Genes
Guo, Dong-Mei; Yang, Zu-Yu; Wang, Xiao-Quan
2014-01-01
Phylogenetic reconstruction is fundamental to study evolutionary biology and historical biogeography. However, there was not a molecular phylogeny of gymnosperms represented by extensive sampling at the genus level, and most published phylogenies of this group were constructed based on cytoplasmic DNA markers and/or the multi-copy nuclear ribosomal DNA. In this study, we use LFY and NLY, two single-copy nuclear genes that originated from an ancient gene duplication in the ancestor of seed plants, to reconstruct the phylogeny and estimate divergence times of gymnosperms based on a complete sampling of extant genera. The results indicate that the combined LFY and NLY coding sequences can resolve interfamilial relationships of gymnosperms and intergeneric relationships of most families. Moreover, the addition of intron sequences can improve the resolution in Podocarpaceae but not in cycads, although divergence times of the cycad genera are similar to or longer than those of the Podocarpaceae genera. Our study strongly supports cycads as the basal-most lineage of gymnosperms rather than sister to Ginkgoaceae, and a sister relationship between Podocarpaceae and Araucariaceae and between Cephalotaxaceae-Taxaceae and Cupressaceae. In addition, intergeneric relationships of some families that were controversial, and the relationships between Taxaceae and Cephalotaxaceae and between conifers and Gnetales are discussed based on the nuclear gene evidence. The molecular dating analysis suggests that drastic extinctions occurred in the early evolution of gymnosperms, and extant coniferous genera in the Northern Hemisphere are older than those in the Southern Hemisphere on average. This study provides an evolutionary framework for future studies on gymnosperms. PMID:25222863
Schrago, Carlos G; Menezes, Albert N; Furtado, Carolina; Bonvicino, Cibele R; Seuanez, Hector N
2014-11-05
Neotropical primates (NP) are presently distributed in the New World from Mexico to northern Argentina, comprising three large families, Cebidae, Atelidae, and Pitheciidae, consequently to their diversification following their separation from Old World anthropoids near the Eocene/Oligocene boundary, some 40 Ma. The evolution of NP has been intensively investigated in the last decade by studies focusing on their phylogeny and timescale. However, despite major efforts, the phylogenetic relationship between these three major clades and the age of their last common ancestor are still controversial because these inferences were based on limited numbers of loci and dating analyses that did not consider the evolutionary variation associated with the distribution of gene trees within the proposed phylogenies. We show, by multispecies coalescent analyses of selected genome segments, spanning along 92,496,904 bp that the early diversification of extant NP was marked by a 2-fold increase of their effective population size and that Atelids and Cebids are more closely related respective to Pitheciids. The molecular phylogeny of NP has been difficult to solve because of population-level phenomena at the early evolution of the lineage. The association of evolutionary variation with the distribution of gene trees within proposed phylogenies is crucial for distinguishing the mean genetic divergence between species (the mean coalescent time between loci) from speciation time. This approach, based on extensive genomic data provided by new generation DNA sequencing, provides more accurate reconstructions of phylogenies and timescales for all organisms. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Phylogeny and Systematics of Leptomyxid Amoebae (Amoebozoa, Tubulinea, Leptomyxida).
Smirnov, Alexey; Nassonova, Elena; Geisen, Stefan; Bonkowski, Michael; Kudryavtsev, Alexander; Berney, Cedric; Glotova, Anna; Bondarenko, Natalya; Dyková, Iva; Mrva, Martin; Fahrni, Jose; Pawlowski, Jan
2017-04-01
We describe four new species of Flabellula, Leptomyxa and Rhizamoeba and publish new SSU rRNA gene and actin gene sequences of leptomyxids. Using these data we provide the most comprehensive SSU phylogeny of leptomyxids to date. Based on the analyses of morphological data and results of the SSU rRNA gene phylogeny we suggest changes in the systematics of the order Leptomyxida (Amoebozoa: Lobosa: Tubulinea). We propose to merge the genera Flabellula and Paraflabellula (the genus Flabellula remains valid by priority rule). The genus Rhizamoeba is evidently polyphyletic in all phylogenetic trees; we suggest retaining the generic name Rhizamoeba for the group unifying R. saxonica, R.matisi n. sp. and R. polyura, the latter remains the type species of the genus Rhizamoeba. Based on molecular and morphological evidence we move all remaining Rhizamoeba species to the genus Leptomyxa. New family Rhizamoebidae is established here in order to avoid paraphyly of the family Leptomyxidae. With the suggested changes both molecular and morphological systems of the order Leptomyxida are now fully congruent to each other. Copyright © 2016 Elsevier GmbH. All rights reserved.
Chen, Yongmei; Hou, Yansong; Guo, Zixiao; Wang, Wenqing; Zhong, Cairong; Zhou, Renchao; Shi, Suhua
2015-01-01
The genus Rhizophora is one of the most important components of mangrove forests. It is an ideal system for studying biogeography, molecular evolution, population genetics, hybridization and conservation genetics of mangroves. However, there are no sufficient molecular markers to address these topics. Here, we developed 77 pairs of nuclear gene primers, which showed successful PCR amplifications across all five Rhizophora species and sequencing in R. apiculata. Here, we present three tentative applications using a subset of the developed nuclear genes to (I) reconstruct the phylogeny, (II) examine the genetic structure and (III) identify natural hybridization in Rhizophora. Phylogenetic analyses support the hypothesis that Rhizophora had disappeared in the Atlantic-East Pacific (AEP) region and was re-colonized from the IWP region approximately 12.7 Mya. Population genetics analyses in four natural populations of R. apiculata in Hainan, China, revealed extremely low genetic diversity, strong population differentiation and extensive admixture, suggesting that the Pleistocene glaciations, particularly the last glacial maximum, greatly influenced the population dynamics of R. apiculata in Hainan. We also verified the hybrid status of a morphologically intermediate individual between R. apiculata and R. stylosa in Hainan. Based on the sequences of five nuclear genes and one chloroplast intergenic spacer, this individual is likely to be an F1 hybrid, with R. stylosa as its maternal parent. The nuclear gene markers developed in this study should be of great value for characterizing the hybridization and introgression patterns in other cases of this genus and testing the role of natural selection using population genomics approaches.
Plastome phylogeny and early diversification of Brassicaceae.
Guo, Xinyi; Liu, Jianquan; Hao, Guoqian; Zhang, Lei; Mao, Kangshan; Wang, Xiaojuan; Zhang, Dan; Ma, Tao; Hu, Quanjun; Al-Shehbaz, Ihsan A; Koch, Marcus A
2017-02-16
The family Brassicaceae encompasses diverse species, many of which have high scientific and economic importance. Early diversifications and phylogenetic relationships between major lineages or clades remain unclear. Here we re-investigate Brassicaceae phylogeny with complete plastomes from 51 species representing all four lineages or 5 of 6 major clades (A, B, C, E and F) as identified in earlier studies. Bayesian and maximum likelihood phylogenetic analyses using a partitioned supermatrix of 77 protein coding genes resulted in nearly identical tree topologies exemplified by highly supported relationships between clades. All four lineages were well identified and interrelationships between them were resolved. The previously defined Clade C was found to be paraphyletic (the genus Megadenia formed a separate lineage), while the remaining clades were monophyletic. Clade E (lineage III) was sister to clades B + C rather than to all core Brassicaceae (clades A + B + C or lineages I + II), as suggested by a previous transcriptome study. Molecular dating based on plastome phylogeny supported the origin of major lineages or clades between late Oligocene and early Miocene, and the following radiative diversification across the family took place within a short timescale. In addition, gene losses in the plastomes occurred multiple times during the evolutionary diversification of the family. Plastome phylogeny illustrates the early diversification of cruciferous species. This phylogeny will facilitate our further understanding of evolution and adaptation of numerous species in the model family Brassicaceae.
Identification and characterization of the grape WRKY family.
Zhang, Ying; Feng, Jian Can
2014-01-01
WRKY transcription factors have functions in plant growth and development and in response to biotic and abiotic stresses. Many studies have focused on functional identification of WRKY transcription factors, but little is known about the molecular phylogeny or global expression patterns of the complete WRKY family. In this study, we identified 80 WRKY proteins encoded in the grape genome. Based on the structural features of these proteins, the grape WRKY genes were classified into three groups (groups 1-3). Analysis of WRKY genes expression profiles indicated that 28 WRKY genes were differentially expressed in response to biotic stress caused by grape whiterot and/or salicylic acid (SA). In that 16 WRKY genes upregulated both by whiterot pathogenic bacteria and SA. The results indicated that 16 WRKY proteins participated in SA-dependent defense signal pathway. This study provides a basis for cloning genes with specific functions from grape.
Seelan, Jaya Seelan Sathiya; Justo, Alfredo; Nagy, Laszlo G; Grand, Edward A; Redhead, Scott A; Hibbett, David
2015-01-01
The genus Lentinus (Polyporaceae, Basidiomycota) is widely documented from tropical and temperate forests and is taxonomically controversial. Here we studied the relationships between Lentinus subg. Lentinus sensu Pegler (i.e. sections Lentinus, Tigrini, Dicholamellatae, Rigidi, Lentodiellum and Pleuroti and polypores that share similar morphological characters). We generated sequences of internal transcribed spacers (ITS) and partial 28S regions of nuc rDNA and genes encoding the largest subunit of RNA polymerase II (RPB1), focusing on Lentinus subg. Lentinus sensu Pegler and the Neofavolus group, combined these data with sequences from GenBank (including RPB2 gene sequences) and performed phylogenetic analyses with maximum likelihood and Bayesian methods. We also evaluated the transition in hymenophore morphology between Lentinus, Neofavolus and related polypores with ancestral state reconstruction. Single-gene phylogenies and phylogenies combining ITS and 28S with RPB1 and RPB2 genes all support existence of a Lentinus/Polyporellus clade and a separate Neofavolus clade. Polyporellus (represented by P. arcularius, P. ciliatus, P. brumalis) forms a clade with species representing Lentinus subg. Lentinus sensu Pegler (1983), excluding L. suavissimus. Lentinus tigrinus appears as the sister group of Polyporellus in the four-gene phylogeny, but this placement was weakly supported. All three multigene analyses and the single-gene analysis using ITS strongly supported Polyporus tricholoma as the sister group of the Lentinus/Polyporellus clade; only the 28S rRNA phylogeny failed to support this placement. Under parsimony the ancestral hymenophoral configuration for the Lentinus/Polyporellus clade is estimated to be circular pores, with independent transitions to angular pores and lamellae. The ancestral state for the Neofavolus clade is estimated to be angular pores, with a single transition to lamellae in L. suavissimus. We propose that Lentinus suavissimus (section Pleuroti) should be reclassified as Neofavolus suavissimus comb. nov. © 2015 by The Mycological Society of America.
Smith, Stephen A; Moore, Michael J; Brown, Joseph W; Yang, Ya
2015-08-05
The use of transcriptomic and genomic datasets for phylogenetic reconstruction has become increasingly common as researchers attempt to resolve recalcitrant nodes with increasing amounts of data. The large size and complexity of these datasets introduce significant phylogenetic noise and conflict into subsequent analyses. The sources of conflict may include hybridization, incomplete lineage sorting, or horizontal gene transfer, and may vary across the phylogeny. For phylogenetic analysis, this noise and conflict has been accommodated in one of several ways: by binning gene regions into subsets to isolate consistent phylogenetic signal; by using gene-tree methods for reconstruction, where conflict is presumed to be explained by incomplete lineage sorting (ILS); or through concatenation, where noise is presumed to be the dominant source of conflict. The results provided herein emphasize that analysis of individual homologous gene regions can greatly improve our understanding of the underlying conflict within these datasets. Here we examined two published transcriptomic datasets, the angiosperm group Caryophyllales and the aculeate Hymenoptera, for the presence of conflict, concordance, and gene duplications in individual homologs across the phylogeny. We found significant conflict throughout the phylogeny in both datasets and in particular along the backbone. While some nodes in each phylogeny showed patterns of conflict similar to what might be expected with ILS alone, the backbone nodes also exhibited low levels of phylogenetic signal. In addition, certain nodes, especially in the Caryophyllales, had highly elevated levels of strongly supported conflict that cannot be explained by ILS alone. This study demonstrates that phylogenetic signal is highly variable in phylogenomic data sampled across related species and poses challenges when conducting species tree analyses on large genomic and transcriptomic datasets. Further insight into the conflict and processes underlying these complex datasets is necessary to improve and develop adequate models for sequence analysis and downstream applications. To aid this effort, we developed the open source software phyparts ( https://bitbucket.org/blackrim/phyparts ), which calculates unique, conflicting, and concordant bipartitions, maps gene duplications, and outputs summary statistics such as internode certainy (ICA) scores and node-specific counts of gene duplications.
Single-Copy Genes as Molecular Markers for Phylogenomic Studies in Seed Plants
De La Torre, Amanda R.; Sterck, Lieven; Cánovas, Francisco M.; Avila, Concepción; Merino, Irene; Cabezas, José Antonio; Cervera, María Teresa; Ingvarsson, Pär K.
2017-01-01
Phylogenetic relationships among seed plant taxa, especially within the gymnosperms, remain contested. In contrast to angiosperms, for which several genomic, transcriptomic and phylogenetic resources are available, there are few, if any, molecular markers that allow broad comparisons among gymnosperm species. With few gymnosperm genomes available, recently obtained transcriptomes in gymnosperms are a great addition to identifying single-copy gene families as molecular markers for phylogenomic analysis in seed plants. Taking advantage of an increasing number of available genomes and transcriptomes, we identified single-copy genes in a broad collection of seed plants and used these to infer phylogenetic relationships between major seed plant taxa. This study aims at extending the current phylogenetic toolkit for seed plants, assessing its ability for resolving seed plant phylogeny, and discussing potential factors affecting phylogenetic reconstruction. In total, we identified 3,072 single-copy genes in 31 gymnosperms and 2,156 single-copy genes in 34 angiosperms. All studied seed plants shared 1,469 single-copy genes, which are generally involved in functions like DNA metabolism, cell cycle, and photosynthesis. A selected set of 106 single-copy genes provided good resolution for the seed plant phylogeny except for gnetophytes. Although some of our analyses support a sister relationship between gnetophytes and other gymnosperms, phylogenetic trees from concatenated alignments without 3rd codon positions and amino acid alignments under the CAT + GTR model, support gnetophytes as a sister group to Pinaceae. Our phylogenomic analyses demonstrate that, in general, single-copy genes can uncover both recent and deep divergences of seed plant phylogeny. PMID:28460034
Kutschera, Verena E.; Bidon, Tobias; Hailer, Frank; Rodi, Julia L.; Fain, Steven R.; Janke, Axel
2014-01-01
Ursine bears are a mammalian subfamily that comprises six morphologically and ecologically distinct extant species. Previous phylogenetic analyses of concatenated nuclear genes could not resolve all relationships among bears, and appeared to conflict with the mitochondrial phylogeny. Evolutionary processes such as incomplete lineage sorting and introgression can cause gene tree discordance and complicate phylogenetic inferences, but are not accounted for in phylogenetic analyses of concatenated data. We generated a high-resolution data set of autosomal introns from several individuals per species and of Y-chromosomal markers. Incorporating intraspecific variability in coalescence-based phylogenetic and gene flow estimation approaches, we traced the genealogical history of individual alleles. Considerable heterogeneity among nuclear loci and discordance between nuclear and mitochondrial phylogenies were found. A species tree with divergence time estimates indicated that ursine bears diversified within less than 2 My. Consistent with a complex branching order within a clade of Asian bear species, we identified unidirectional gene flow from Asian black into sloth bears. Moreover, gene flow detected from brown into American black bears can explain the conflicting placement of the American black bear in mitochondrial and nuclear phylogenies. These results highlight that both incomplete lineage sorting and introgression are prominent evolutionary forces even on time scales up to several million years. Complex evolutionary patterns are not adequately captured by strictly bifurcating models, and can only be fully understood when analyzing multiple independently inherited loci in a coalescence framework. Phylogenetic incongruence among gene trees hence needs to be recognized as a biologically meaningful signal. PMID:24903145
NASA Astrophysics Data System (ADS)
Gadeken, K.; Dorgan, K. M.; Moore, J.; Berke, S. K.
2016-02-01
Evolutionary relationships may shed light on observed patterns of diversity and functional traits when viewed through the lens of phylogeny. The potential for phylogenetic information to be used to explain patterns in community structure, such as niche partitioning and responses to stress, is extensive. Differential distribution of related species with similar functional traits suggests niche partitioning, and local redundancy in functional traits may indicate the potential for interspecific competition. In this study, we investigated phylogenetic and functional diversity as a function of habitat for sites with varying levels of oil contamination in the Northern Gulf of Mexico. Our study was conducted in a shallow benthic community at the Chandeleur Islands, a group of uninhabited barrier islands. Infauna were sampled from seagrass (Halodule wrightii) and bare sediment at three sites along the island chain that experienced variable levels of oil impact from the Deepwater Horizon oil spill. Individuals were preserved and 18S and COI genes sequenced, and a phylogenetic tree was constructed of the local community using maximum likelihood. Phylogenetic diversity and evenness were quantified. Ecologically important functional traits were then compiled into respective distance matrices, evaluated through different functional diversity indices, and assessed for correlation with the phylogeny. This integration of functional and phylogenetic diversity has the potential to provide greater insight into factors driving community structure than either metric alone. Determining relevant metrics of diversity is critical to understanding the ecological effects of major disturbances such as oil spills.
Mondragón-Palomino, Mariana; Trontin, Charlotte
2011-01-01
Background and Aims The TCP family is an ancient group of plant developmental transcription factors that regulate cell division in vegetative and reproductive structures and are essential in the establishment of flower zygomorphy. In-depth research on eudicot TCPs has documented their evolutionary and developmental role. This has not happened to the same extent in monocots, although zygomorphy has been critical for the diversification of Orchidaceae and Poaceae, the largest families of this group. Investigating the evolution and function of TCP-like genes in a wider group of monocots requires a detailed phylogenetic analysis of all available sequence information and a system that facilitates comparing genetic and functional information. Methods The phylogenetic relationships of TCP-like genes in monocots were investigated by analysing sequences from the genomes of Zea mays, Brachypodium distachyon, Oryza sativa and Sorghum bicolor, as well as EST data from several other monocot species. Key Results All available monocot TCP-like sequences are associated in 20 major groups with an average identity ≥64 % and most correspond to well-supported clades of the phylogeny. Their sequence motifs and relationships of orthology were documented and it was found that 67 % of the TCP-like genes of Sorghum, Oryza, Zea and Brachypodium are in microsyntenic regions. This analysis suggests that two rounds of whole genome duplication drove the expansion of TCP-like genes in these species. Conclusions A system of classification is proposed where putative or recognized monocot TCP-like genes are assigned to a specific clade of PCF-, CIN- or CYC/tb1-like genes. Specific biases in sequence data of this family that must be tackled when studying its molecular evolution and phylogeny are documented. Finally, the significant retention of duplicated TCP genes from Zea mays is considered in the context of balanced gene drive. PMID:21444336
Nørskov-Lauritsen, Niels; Overballe, Merete D.; Kilian, Mogens
2009-01-01
To obtain more information on the much-debated definition of prokaryotic species, we investigated the borders of Haemophilus influenzae by comparative analysis of H. influenzae reference strains with closely related bacteria including strains assigned to Haemophilus haemolyticus, cryptic genospecies biotype IV, and the never formally validated species “Haemophilus intermedius”. Multilocus sequence phylogeny based on six housekeeping genes separated a cluster encompassing the type and the reference strains of H. influenzae from 31 more distantly related strains. Comparison of 16S rRNA gene sequences supported this delineation but was obscured by a conspicuously high number of polymorphic sites in many of the strains that did not belong to the core group of H. influenzae strains. The division was corroborated by the differential presence of genes encoding H. influenzae adhesion and penetration protein, fuculokinase, and Cu,Zn-superoxide dismutase, whereas immunoglobulin A1 protease activity or the presence of the iga gene was of limited discriminatory value. The existence of porphyrin-synthesizing strains (“H. intermedius”) closely related to H. influenzae was confirmed. Several chromosomally encoded hemin biosynthesis genes were identified, and sequence analysis showed these genes to represent an ancestral genotype rather than recent transfers from, e.g., Haemophilus parainfluenzae. Strains previously assigned to H. haemolyticus formed several separate lineages within a distinct but deeply branching cluster, intermingled with strains of “H. intermedius” and cryptic genospecies biotype IV. Although H. influenzae is phenotypically more homogenous than some other Haemophilus species, the genetic diversity and multicluster structure of strains traditionally associated with H. influenzae make it difficult to define the natural borders of that species. PMID:19060144
Wu, Chung-Shien; Wang, Ya-Nan; Hsu, Chi-Yao; Chaw, Shu-Miaw
2011-01-01
The relationships among the extant five gymnosperm groups—gnetophytes, Pinaceae, non-Pinaceae conifers (cupressophytes), Ginkgo, and cycads—remain equivocal. To clarify this issue, we sequenced the chloroplast genomes (cpDNAs) from two cupressophytes, Cephalotaxus wilsoniana and Taiwania cryptomerioides, and 53 common chloroplast protein-coding genes from another three cupressophytes, Agathis dammara, Nageia nagi, and Sciadopitys verticillata, and a non-Cycadaceae cycad, Bowenia serrulata. Comparative analyses of 11 conifer cpDNAs revealed that Pinaceae and cupressophytes each lost a different copy of inverted repeats (IRs), which contrasts with the view that the same IR has been lost in all conifers. Based on our structural finding, the character of an IR loss no longer conflicts with the “gnepines” hypothesis (gnetophytes sister to Pinaceae). Chloroplast phylogenomic analyses of amino acid sequences recovered incongruent topologies using different tree-building methods; however, we demonstrated that high heterotachous genes (genes that have highly different rates in different lineages) contributed to the long-branch attraction (LBA) artifact, resulting in incongruence of phylogenomic estimates. Additionally, amino acid compositions appear more heterogeneous in high than low heterotachous genes among the five gymnosperm groups. Removal of high heterotachous genes alleviated the LBA artifact and yielded congruent and robust tree topologies in which gnetophytes and Pinaceae formed a sister clade to cupressophytes (the gnepines hypothesis) and Ginkgo clustered with cycads. Adding more cupressophyte taxa could not improve the accuracy of chloroplast phylogenomics for the five gymnosperm groups. In contrast, removal of high heterotachous genes from data sets is simple and can increase confidence in evaluating the phylogeny of gymnosperms. PMID:21933779
Wu, Chung-Shien; Wang, Ya-Nan; Hsu, Chi-Yao; Lin, Ching-Ping; Chaw, Shu-Miaw
2011-01-01
The relationships among the extant five gymnosperm groups--gnetophytes, Pinaceae, non-Pinaceae conifers (cupressophytes), Ginkgo, and cycads--remain equivocal. To clarify this issue, we sequenced the chloroplast genomes (cpDNAs) from two cupressophytes, Cephalotaxus wilsoniana and Taiwania cryptomerioides, and 53 common chloroplast protein-coding genes from another three cupressophytes, Agathis dammara, Nageia nagi, and Sciadopitys verticillata, and a non-Cycadaceae cycad, Bowenia serrulata. Comparative analyses of 11 conifer cpDNAs revealed that Pinaceae and cupressophytes each lost a different copy of inverted repeats (IRs), which contrasts with the view that the same IR has been lost in all conifers. Based on our structural finding, the character of an IR loss no longer conflicts with the "gnepines" hypothesis (gnetophytes sister to Pinaceae). Chloroplast phylogenomic analyses of amino acid sequences recovered incongruent topologies using different tree-building methods; however, we demonstrated that high heterotachous genes (genes that have highly different rates in different lineages) contributed to the long-branch attraction (LBA) artifact, resulting in incongruence of phylogenomic estimates. Additionally, amino acid compositions appear more heterogeneous in high than low heterotachous genes among the five gymnosperm groups. Removal of high heterotachous genes alleviated the LBA artifact and yielded congruent and robust tree topologies in which gnetophytes and Pinaceae formed a sister clade to cupressophytes (the gnepines hypothesis) and Ginkgo clustered with cycads. Adding more cupressophyte taxa could not improve the accuracy of chloroplast phylogenomics for the five gymnosperm groups. In contrast, removal of high heterotachous genes from data sets is simple and can increase confidence in evaluating the phylogeny of gymnosperms.
Hussain, Sabir; Devers-Lamrani, Marion; El Azhari, Najoi; Martin-Laurent, Fabrice
2011-06-01
The phenylurea herbicide isoproturon, 3-(4-isopropylphenyl)-1,1-dimethylurea (IPU), was found to be rapidly mineralized in an agricultural soil in France that had been periodically exposed to IPU. Enrichment cultures from samples of this soil isolated a bacterial strain able to mineralize IPU. 16S rRNA sequence analysis showed that this strain belonged to the phylogeny of the genus Sphingomonas (96% similarity with Sphingomonas sp. JEM-14, AB219361) and was designated Sphingomonas sp. strain SH. From this strain, a partial sequence of a 1,2-dioxygenase (catA) gene coding for an enzyme degrading catechol putatively formed during IPU mineralization was amplified. Phylogenetic analysis revealed that the catA sequence was related to Sphingomonas spp. and showed a lack of congruence between the catA and 16S rRNA based phylogenies, implying horizontal gene transfer of the catA gene cluster between soil microbiota. The IPU degrading ability of strain SH was strongly influenced by pH with maximum degradation taking place at pH 7.5. SH was only able to mineralize IPU and its known metabolites including 4-isopropylaniline and it could not degrade other structurally related phenylurea herbicides such as diuron, linuron, monolinuron and chlorotoluron or their aniline derivatives. These observations suggest that the catabolic abilities of the strain SH are highly specific to the metabolism of IPU.
Graf, Louis; Kim, Yae Jin; Cho, Ga Youn; Miller, Kathy Ann
2017-01-01
Coccophora langsdorfii (Turner) Greville (Fucales) is an intertidal brown alga that is endemic to Northeast Asia and increasingly endangered by habitat loss and climate change. We sequenced the complete circular plastid and mitochondrial genomes of C. langsdorfii. The circular plastid genome is 124,450 bp and contains 139 protein-coding, 28 tRNA and 6 rRNA genes. The circular mitochondrial genome is 35,660 bp and contains 38 protein-coding, 25 tRNA and 3 rRNA genes. The structure and gene content of the C. langsdorfii plastid genome is similar to those of other species in the Fucales. The plastid genomes of brown algae in other orders share similar gene content but exhibit large structural recombination. The large in-frame insert in the cox2 gene in the mitochondrial genome of C. langsdorfii is typical of other brown algae. We explored the effect of this insertion on the structure and function of the cox2 protein. We estimated the usefulness of 135 plastid genes and 35 mitochondrial genes for developing molecular markers. This study shows that 29 organellar genes will prove efficient for resolving brown algal phylogeny. In addition, we propose a new molecular marker suitable for the study of intraspecific genetic diversity that should be tested in a large survey of populations of C. langsdorfii. PMID:29095864
Jang, Kuem Hee; Hwang, Ui Wook
2009-01-01
Background The phylogenetic position of Bryozoa is one of the most controversial issues in metazoan phylogeny. In an attempt to address this issue, the first bryozoan mitochondrial genome from Flustrellidra hispida (Gymnolaemata, Ctenostomata) was recently sequenced and characterized. Unfortunately, it has extensive gene translocation and extremely reduced size. In addition, the phylogenies obtained from the result were conflicting, so they failed to assign a reliable phylogenetic position to Bryozoa or to clarify lophophorate phylogeny. Thus, it is necessary to characterize further mitochondrial genomes from slowly-evolving bryozoans to obtain a more credible lophophorate phylogeny. Results The complete mitochondrial genome (15,433 bp) of Bugula neritina (Bryozoa, Gymnolaemata, Cheilostomata), one of the most widely distributed cheliostome bryozoans, is sequenced. This second bryozoan mitochondrial genome contains the set of 37 components generally observed in other metazoans, differing from that of F. hispida (Bryozoa, Gymnolaemata, Ctenostomata), which has only 36 components with loss of tRNAser(ucn) genes. The B. neritina mitochondrial genome possesses 27 multiple noncoding regions. The gene order is more similar to those of the two remaining lophophorate phyla (Brachiopoda and Phoronida) and a chiton Katharina tunicate than to that of F. hispida. Phylogenetic analyses based on the nucleotide sequences or amino acid residues of 12 protein-coding genes showed consistently that, within the Lophotrochozoa, the monophyly of the bryozoan class Gymnolaemata (B. neritina and F. hispida) was strongly supported and the bryozoan clade was grouped with brachiopods. Echiura appeared as a subtaxon of Annelida, and Entoprocta as a sister taxon of Phoronida. The clade of Bryozoa + Brachiopoda was clustered with either the clade of Annelida-Echiura or that of Phoronida + Entoprocta. Conclusion This study presents the complete mitochondrial genome of a cheliostome bryozoan, B. neritina. The phylogenetic analyses suggest a close relationship between Bryozoa and Brachiopoda within the Lophotrochozoa. However, the sister group of Bryozoa + Brachiopoda is still ambiguous, although it has some attractions with Annelida-Echiura or Phoronida + Entoprocta. If the latter is a true phylogeny, lophophorate monophyly including Entoprocta is supported. Consequently, the present results imply that Brachiozoa (= Brachiopoda + Phoronida) and the recently-resurrected Bryozoa concept comprising Ectoprocta and Entoprocta may be refuted. PMID:19379522
H. Thorsten Lumbsch; Ekaphan Kraichak; Sittiporn Parnmen; Eimy Rivas Plata; Andre Aptroot; Marcela E.S. Caceres; Damien Ertz; Shirley Cunha Feuerstein; Joel A. Mercado-Diaz; Bettina Staiger; Dries Van den Broeck; Robert Lücking
2014-01-01
We provide an updated skeleton phylogeny of the lichenized family Graphidaceae (excluding subfamily Gomphilloideae), based on three loci (mtSSU, nuLSU, RPB2), to elucidate the position of four new genera, Aggregatorygma, Borinquenotrema, Corticorygma, and Paratopeliopsis, as well as the placement of the enigmatic species Diorygma erythrellum, Fissurina monilifera, and...
Shi, Qing-Hui; Sun, Xiao-Yan; Wang, Yun-Liang; Hao, Jia-Sheng; Yang, Qun
2015-01-01
Nymphalidae is the largest family of butterflies with their phylogenetic relationships not adequately approached to date. The mitochondrial genomes (mitogenomes) of 11 new nymphalid species were reported and a comparative mitogenomic analysis was conducted together with other 22 available nymphalid mitogenomes. A phylogenetic analysis of the 33 species from all 13 currently recognized nymphalid subfamilies was done based on the mitogenomic data set with three Lycaenidae species as the outgroups. The mitogenome comparison showed that the eleven new mitogenomes were similar with those of other butterflies in gene content and order. The reconstructed phylogenetic trees reveal that the nymphalids are made up of five major clades (the nymphaline, heliconiine, satyrine, danaine and libytheine clades), with sister relationship between subfamilies Cyrestinae and Biblidinae, and most likely between subfamilies Morphinae and Satyrinae. This whole mitogenome-based phylogeny is generally congruent with those of former studies based on nuclear-gene and mitogenomic analyses, but differs considerably from the result of morphological cladistic analysis, such as the basal position of Libytheinae in morpho-phylogeny is not confirmed in molecular studies. However, we found that the mitogenomic phylogeny established herein is compatible with selected morphological characters (including developmental and adult morpho-characters).
Identification, Classification, and Expression Analysis of GRAS Gene Family in Malus domestica
Fan, Sheng; Zhang, Dong; Gao, Cai; Zhao, Ming; Wu, Haiqin; Li, Youmei; Shen, Yawen; Han, Mingyu
2017-01-01
GRAS genes encode plant-specific transcription factors that play important roles in plant growth and development. However, little is known about the GRAS gene family in apple. In this study, 127 GRAS genes were identified in the apple (Malus domestica Borkh.) genome and named MdGRAS1 to MdGRAS127 according to their chromosomal locations. The chemical characteristics, gene structures and evolutionary relationships of the MdGRAS genes were investigated. The 127 MdGRAS genes could be grouped into eight subfamilies based on their structural features and phylogenetic relationships. Further analysis of gene structures, segmental and tandem duplication, gene phylogeny and tissue-specific expression with ArrayExpress database indicated their diversification in quantity, structure and function. We further examined the expression pattern of MdGRAS genes during apple flower induction with transcriptome sequencing. Eight higher MdGRAS (MdGRAS6, 26, 28, 44, 53, 64, 107, and 122) genes were surfaced. Further quantitative reverse transcription PCR indicated that the candidate eight genes showed distinct expression patterns among different tissues (leaves, stems, flowers, buds, and fruits). The transcription levels of eight genes were also investigated with various flowering related treatments (GA3, 6-BA, and sucrose) and different flowering varieties (Yanfu No. 6 and Nagafu No. 2). They all were affected by flowering-related circumstance and showed different expression level. Changes in response to these hormone or sugar related treatments indicated their potential involvement during apple flower induction. Taken together, our results provide rich resources for studying GRAS genes and their potential clues in genetic improvement of apple flowering, which enriches biological theories of GRAS genes in apple and their involvement in flower induction of fruit trees. PMID:28503152
Identification, Classification, and Expression Analysis of GRAS Gene Family in Malus domestica.
Fan, Sheng; Zhang, Dong; Gao, Cai; Zhao, Ming; Wu, Haiqin; Li, Youmei; Shen, Yawen; Han, Mingyu
2017-01-01
GRAS genes encode plant-specific transcription factors that play important roles in plant growth and development. However, little is known about the GRAS gene family in apple. In this study, 127 GRAS genes were identified in the apple ( Malus domestica Borkh.) genome and named MdGRAS1 to MdGRAS127 according to their chromosomal locations. The chemical characteristics, gene structures and evolutionary relationships of the MdGRAS genes were investigated. The 127 MdGRAS genes could be grouped into eight subfamilies based on their structural features and phylogenetic relationships. Further analysis of gene structures, segmental and tandem duplication, gene phylogeny and tissue-specific expression with ArrayExpress database indicated their diversification in quantity, structure and function. We further examined the expression pattern of MdGRAS genes during apple flower induction with transcriptome sequencing. Eight higher MdGRAS ( MdGRAS6, 26, 28, 44, 53, 64, 107 , and 122 ) genes were surfaced. Further quantitative reverse transcription PCR indicated that the candidate eight genes showed distinct expression patterns among different tissues (leaves, stems, flowers, buds, and fruits). The transcription levels of eight genes were also investigated with various flowering related treatments (GA 3 , 6-BA, and sucrose) and different flowering varieties (Yanfu No. 6 and Nagafu No. 2). They all were affected by flowering-related circumstance and showed different expression level. Changes in response to these hormone or sugar related treatments indicated their potential involvement during apple flower induction. Taken together, our results provide rich resources for studying GRAS genes and their potential clues in genetic improvement of apple flowering, which enriches biological theories of GRAS genes in apple and their involvement in flower induction of fruit trees.
Molecular phylogeny and evolution of alcohol dehydrogenase (Adh) genes in legumes
Fukuda, Tatsuya; Yokoyama, Jun; Nakamura, Toru; Song, In-Ja; Ito, Takuro; Ochiai, Toshinori; Kanno, Akira; Kameya, Toshiaki; Maki, Masayuki
2005-01-01
Background Nuclear genes determine the vast range of phenotypes that are responsible for the adaptive abilities of organisms in nature. Nevertheless, the evolutionary processes that generate the structures and functions of nuclear genes are only now be coming understood. The aim of our study is to isolate the alcohol dehydrogenase (Adh) genes in two distantly related legumes, and use these sequences to examine the molecular evolutionary history of this nuclear gene. Results We isolated the expressed Adh genes from two species of legumes, Sophora flavescens Ait. and Wisteria floribunda DC., by a RT-PCR based approach and found a new Adh locus in addition to homologues of the Adh genes found previously in legumes. To examine the evolution of these genes, we compared the species and gene trees and found gene duplication of the Adh loci in the legumes occurred as an ancient event. Conclusion This is the first report revealing that some legume species have at least two Adh gene loci belonging to separate clades. Phylogenetic analyses suggest that these genes resulted from relatively ancient duplication events. PMID:15836788
Lescat, Mathilde; Hoede, Claire; Clermont, Olivier; Garry, Louis; Darlu, Pierre; Tuffery, Pierre; Denamur, Erick; Picard, Bertrand
2009-12-29
Previous studies have established a correlation between electrophoretic polymorphism of esterase B, and virulence and phylogeny of Escherichia coli. Strains belonging to the phylogenetic group B2 are more frequently implicated in extraintestinal infections and include esterase B2 variants, whereas phylogenetic groups A, B1 and D contain less virulent strains and include esterase B1 variants. We investigated esterase B as a marker of phylogeny and/or virulence, in a thorough analysis of the esterase B-encoding gene. We identified the gene encoding esterase B as the acetyl-esterase gene (aes) using gene disruption. The analysis of aes nucleotide sequences in a panel of 78 reference strains, including the E. coli reference (ECOR) strains, demonstrated that the gene is under purifying selection. The phylogenetic tree reconstructed from aes sequences showed a strong correlation with the species phylogenetic history, based on multi-locus sequence typing using six housekeeping genes. The unambiguous distinction between variants B1 and B2 by electrophoresis was consistent with Aes amino-acid sequence analysis and protein modelling, which showed that substituted amino acids in the two esterase B variants occurred mostly at different sites on the protein surface. Studies in an experimental mouse model of septicaemia using mutant strains did not reveal a direct link between aes and extraintestinal virulence. Moreover, we did not find any genes in the chromosomal region of aes to be associated with virulence. Our findings suggest that aes does not play a direct role in the virulence of E. coli extraintestinal infection. However, this gene acts as a powerful marker of phylogeny, illustrating the extensive divergence of B2 phylogenetic group strains from the rest of the species.
On the Complexity of Duplication-Transfer-Loss Reconciliation with Non-Binary Gene Trees.
Kordi, Misagh; Bansal, Mukul S
2017-01-01
Duplication-Transfer-Loss (DTL) reconciliation has emerged as a powerful technique for studying gene family evolution in the presence of horizontal gene transfer. DTL reconciliation takes as input a gene family phylogeny and the corresponding species phylogeny, and reconciles the two by postulating speciation, gene duplication, horizontal gene transfer, and gene loss events. Efficient algorithms exist for finding optimal DTL reconciliations when the gene tree is binary. However, gene trees are frequently non-binary. With such non-binary gene trees, the reconciliation problem seeks to find a binary resolution of the gene tree that minimizes the reconciliation cost. Given the prevalence of non-binary gene trees, many efficient algorithms have been developed for this problem in the context of the simpler Duplication-Loss (DL) reconciliation model. Yet, no efficient algorithms exist for DTL reconciliation with non-binary gene trees and the complexity of the problem remains unknown. In this work, we resolve this open question by showing that the problem is, in fact, NP-hard. Our reduction applies to both the dated and undated formulations of DTL reconciliation. By resolving this long-standing open problem, this work will spur the development of both exact and heuristic algorithms for this important problem.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Corbin, Cyrielle; Drouet, Samantha; Markulin, Lucija
Identification of DIR encoding genes in flax genome. Analysis of phylogeny, gene/protein structures and evolution. Identification of new conserved motifs linked to biochemical functions. Investigation of spatio-temporal gene expression and response to stress. Dirigent proteins (DIRs) were discovered during 8-8' lignan biosynthesis studies, through identification of stereoselective coupling to afford either (+)- or (-)-pinoresinols from E-coniferyl alcohol. DIRs are also involved or potentially involved in terpenoid, allyl/propenyl phenol lignan, pterocarpan and lignin biosynthesis. DIRs have very large multigene families in different vascular plants including flax, with most still of unknown function. DIR studies typically focus on a small subset ofmore » genes and identification of biochemical/physiological functions. Herein, a genome-wide analysis and characterization of the predicted flax DIR 44-membered multigene family was performed, this species being a rich natural grain source of 8-8' linked secoisolariciresinol-derived lignan oligomers. All predicted DIR sequences, including their promoters, were analyzed together with their public gene expression datasets. Expression patterns of selected DIRs were examined using qPCR, as well as through clustering analysis of DIR gene expression. These analyses further implicated roles for specific DIRs in (-)-pinoresinol formation in seed-coats, as well as (+)-pinoresinol in vegetative organs and/or specific responses to stress. Phylogeny and gene expression analysis segregated flax DIRs into six distinct clusters with new cluster-specific motifs identified. We propose that these findings can serve as a foundation to further systematically determine functions of DIRs, i.e. other than those already known in lignan biosynthesis in flax and other species. Given the differential expression profiles and inducibility of the flax DIR family, we provisionally propose that some DIR genes of unknown function could be involved in different aspects of secondary cell wall biosynthesis and plant defense.« less
Corbin, Cyrielle; Drouet, Samantha; Markulin, Lucija; Auguin, Daniel; Lainé, Éric; Davin, Laurence B; Cort, John R; Lewis, Norman G; Hano, Christophe
2018-05-01
Identification of DIR encoding genes in flax genome. Analysis of phylogeny, gene/protein structures and evolution. Identification of new conserved motifs linked to biochemical functions. Investigation of spatio-temporal gene expression and response to stress. Dirigent proteins (DIRs) were discovered during 8-8' lignan biosynthesis studies, through identification of stereoselective coupling to afford either (+)- or (-)-pinoresinols from E-coniferyl alcohol. DIRs are also involved or potentially involved in terpenoid, allyl/propenyl phenol lignan, pterocarpan and lignin biosynthesis. DIRs have very large multigene families in different vascular plants including flax, with most still of unknown function. DIR studies typically focus on a small subset of genes and identification of biochemical/physiological functions. Herein, a genome-wide analysis and characterization of the predicted flax DIR 44-membered multigene family was performed, this species being a rich natural grain source of 8-8' linked secoisolariciresinol-derived lignan oligomers. All predicted DIR sequences, including their promoters, were analyzed together with their public gene expression datasets. Expression patterns of selected DIRs were examined using qPCR, as well as through clustering analysis of DIR gene expression. These analyses further implicated roles for specific DIRs in (-)-pinoresinol formation in seed-coats, as well as (+)-pinoresinol in vegetative organs and/or specific responses to stress. Phylogeny and gene expression analysis segregated flax DIRs into six distinct clusters with new cluster-specific motifs identified. We propose that these findings can serve as a foundation to further systematically determine functions of DIRs, i.e. other than those already known in lignan biosynthesis in flax and other species. Given the differential expression profiles and inducibility of the flax DIR family, we provisionally propose that some DIR genes of unknown function could be involved in different aspects of secondary cell wall biosynthesis and plant defense.
Castalanelli, M A; Baker, A M; Munyard, K A; Grimm, M; Groth, D M
2012-02-01
To date, a molecular phylogenetic approach has not been used to investigate the evolutionary structure of Trogoderma and closely related genera. Using two mitochondrial genes, Cytochrome Oxidase I and Cytochrome B, and the nuclear gene, 18S, the reported polyphyletic positioning of Trogoderma was examined. Paraphyly in Trogoderma was observed, with one Australian Trogoderma species reconciled as sister to all Dermestidae and the Anthrenocerus genus deeply nested within the Australian Trogoderma clade. In addition, time to most recent common ancestor for a number of Dermestidae was calculated. Based on these estimations, the Dermestidae origin exceeded 175 million years, placing the origins of this family in Pangaea.
Cheng, Shawn; Kirton, Laurence G.; Panandam, Jothi M.; Siraj, Siti S.; Ng, Kevin Kit-Siong; Tan, Soon-Guan
2011-01-01
Termites of the genus Odontotermes are important decomposers in the Old World tropics and are sometimes important pests of crops, timber and trees. The species within the genus often have overlapping size ranges and are difficult to differentiate based on morphology. As a result, the taxonomy of Odontotermes in Peninsular Malaysia has not been adequately worked out. In this study, we examined the phylogeny of 40 samples of Odontotermes from Peninsular Malaysia using two mitochondrial DNA regions, that is, the 16S ribosomal RNA and cytochrome oxidase subunit I genes, to aid in elucidating the number of species in the peninsula. Phylogenies were reconstructed from the individual gene and combined gene data sets using parsimony and likelihood criteria. The phylogenies supported the presence of up to eleven species in Peninsular Malaysia, which were identified as O. escherichi, O. hainanensis, O. javanicus, O. longignathus, O. malaccensis, O. oblongatus, O. paraoblongatus, O. sarawakensis, and three possibly new species. Additionally, some of our taxa are thought to comprise a complex of two or more species. The number of species found in this study using DNA methods was more than the initial nine species thought to occur in Peninsular Malaysia. The support values for the clades and morphology of the soldiers provided further evidence for the existence of eleven or more species. Higher resolution genetic markers such as microsatellites would be required to confirm the presence of cryptic species in some taxa. PMID:21687629
2010-01-01
Background Terpenoids are among the most important constituents of grape flavour and wine bouquet, and serve as useful metabolite markers in viticulture and enology. Based on the initial 8-fold sequencing of a nearly homozygous Pinot noir inbred line, 89 putative terpenoid synthase genes (VvTPS) were predicted by in silico analysis of the grapevine (Vitis vinifera) genome assembly [1]. The finding of this very large VvTPS family, combined with the importance of terpenoid metabolism for the organoleptic properties of grapevine berries and finished wines, prompted a detailed examination of this gene family at the genomic level as well as an investigation into VvTPS biochemical functions. Results We present findings from the analysis of the up-dated 12-fold sequencing and assembly of the grapevine genome that place the number of predicted VvTPS genes at 69 putatively functional VvTPS, 20 partial VvTPS, and 63 VvTPS probable pseudogenes. Gene discovery and annotation included information about gene architecture and chromosomal location. A dense cluster of 45 VvTPS is localized on chromosome 18. Extensive FLcDNA cloning, gene synthesis, and protein expression enabled functional characterization of 39 VvTPS; this is the largest number of functionally characterized TPS for any species reported to date. Of these enzymes, 23 have unique functions and/or phylogenetic locations within the plant TPS gene family. Phylogenetic analyses of the TPS gene family showed that while most VvTPS form species-specific gene clusters, there are several examples of gene orthology with TPS of other plant species, representing perhaps more ancient VvTPS, which have maintained functions independent of speciation. Conclusions The highly expanded VvTPS gene family underpins the prominence of terpenoid metabolism in grapevine. We provide a detailed experimental functional annotation of 39 members of this important gene family in grapevine and comprehensive information about gene structure and phylogeny for the entire currently known VvTPS gene family. PMID:20964856
Chidebe, Ifeoma N; Jaiswal, Sanjay K; Dakora, Felix D
2018-01-15
Cowpea derives most of its N nutrition from biological nitrogen fixation (BNF) via symbiotic bacteroids in root nodules. In Sub-Saharan Africa, the diversity and biogeographic distribution of bacterial microsymbionts nodulating cowpea and other indigenous legumes are not well understood, though needed for increased legume production. The aim of this study was to describe the distribution and phylogenies of rhizobia at different agroecological regions of Mozambique using PCR of the BOX element (BOX-PCR), restriction fragment length polymorphism of the internal transcribed spacer (ITS-RFLP), and sequence analysis of ribosomal, symbiotic, and housekeeping genes. A total of 122 microsymbionts isolated from two cowpea varieties (IT-1263 and IT-18) grouped into 17 clades within the BOX-PCR dendrogram. The PCR-ITS analysis yielded 17 ITS types for the bacterial isolates, while ITS-RFLP analysis placed all test isolates in six distinct clusters (I to VI). BLAST n sequence analysis of 16S rRNA and four housekeeping genes ( glnII , gyrB , recA , and rpoB ) showed their alignment with Rhizobium and Bradyrhizobium species. The results revealed a group of highly diverse and adapted cowpea-nodulating microsymbionts which included Bradyrhizobium pachyrhizi , Bradyrhizobium arachidis , Bradyrhizobium yuanmingense , and a novel Bradyrhizobium sp., as well as Rhizobium tropici , Rhizobium pusense , and Neorhizobium galegae in Mozambican soils. Discordances observed in single-gene phylogenies could be attributed to horizontal gene transfer and/or subsequent recombinations of the genes. Natural deletion of 60 bp of the gyrB region was observed in isolate TUTVU7; however, this deletion effect on DNA gyrase function still needs to be confirmed. The inconsistency of nifH with core gene phylogenies suggested differences in the evolutionary history of both chromosomal and symbiotic genes. IMPORTANCE A diverse group of both Bradyrhizobium and Rhizobium species responsible for cowpea nodulation in Mozambique was found in this study. Future studies could prove useful in evaluating these bacterial isolates for symbiotic efficiency and strain competitiveness in Mozambican soils. Copyright © 2018 Chidebe et al.
Stephens, Jessica D; Rogers, Willie L; Heyduk, Karolina; Cruse-Sanders, Jennifer M; Determann, Ron O; Glenn, Travis C; Malmberg, Russell L
2015-04-01
The North American carnivorous pitcher plant genus Sarracenia (Sarraceniaceae) is a relatively young clade (<3 million years ago) displaying a wide range of morphological diversity in complex trapping structures. This recently radiated group is a promising system to examine the structural evolution and diversification of carnivorous plants; however, little is known regarding evolutionary relationships within the genus. Previous attempts at resolving the phylogeny have been unsuccessful, most likely due to few parsimony-informative sites compounded by incomplete lineage sorting. Here, we applied a target enrichment approach using multiple accessions to assess the relationships of Sarracenia species. This resulted in 199 nuclear genes from 75 accessions covering the putative 8-11 species and 8 subspecies/varieties. In addition, we recovered 42kb of plastome sequence from each accession to estimate a cpDNA-derived phylogeny. Unsurprisingly, the cpDNA had few parsimony-informative sites (0.5%) and provided little information on species relationships. In contrast, use of the targeted nuclear loci in concatenation and coalescent frameworks elucidated many relationships within Sarracenia even with high heterogeneity among gene trees. Results were largely consistent for both concatenation and coalescent approaches. The only major disagreement was with the placement of the purpurea complex. Moreover, results suggest an Appalachian massif biogeographic origin of the genus. Overall, this study highlights the utility of target enrichment using multiple accessions to resolve relationships in recently radiated taxa. Copyright © 2015 Elsevier Inc. All rights reserved.
When outgroups fail; phylogenomics of rooting the emerging pathogen, Coxiella burnetii.
Pearson, Talima; Hornstra, Heidie M; Sahl, Jason W; Schaack, Sarah; Schupp, James M; Beckstrom-Sternberg, Stephen M; O'Neill, Matthew W; Priestley, Rachael A; Champion, Mia D; Beckstrom-Sternberg, James S; Kersh, Gilbert J; Samuel, James E; Massung, Robert F; Keim, Paul
2013-09-01
Rooting phylogenies is critical for understanding evolution, yet the importance, intricacies and difficulties of rooting are often overlooked. For rooting, polymorphic characters among the group of interest (ingroup) must be compared to those of a relative (outgroup) that diverged before the last common ancestor (LCA) of the ingroup. Problems arise if an outgroup does not exist, is unknown, or is so distant that few characters are shared, in which case duplicated genes originating before the LCA can be used as proxy outgroups to root diverse phylogenies. Here, we describe a genome-wide expansion of this technique that can be used to solve problems at the other end of the evolutionary scale: where ingroup individuals are all very closely related to each other, but the next closest relative is very distant. We used shared orthologous single nucleotide polymorphisms (SNPs) from 10 whole genome sequences of Coxiella burnetii, the causative agent of Q fever in humans, to create a robust, but unrooted phylogeny. To maximize the number of characters informative about the rooting, we searched entire genomes for polymorphic duplicated regions where orthologs of each paralog could be identified so that the paralogs could be used to root the tree. Recent radiations, such as those of emerging pathogens, often pose rooting challenges due to a lack of ingroup variation and large genomic differences with known outgroups. Using a phylogenomic approach, we created a robust, rooted phylogeny for C. burnetii. [Coxiella burnetii; paralog SNPs; pathogen evolution; phylogeny; recent radiation; root; rooting using duplicated genes.].
Resolution of ray-finned fish phylogeny and timing of diversification.
Near, Thomas J; Eytan, Ron I; Dornburg, Alex; Kuhn, Kristen L; Moore, Jon A; Davis, Matthew P; Wainwright, Peter C; Friedman, Matt; Smith, W Leo
2012-08-21
Ray-finned fishes make up half of all living vertebrate species. Nearly all ray-finned fishes are teleosts, which include most commercially important fish species, several model organisms for genomics and developmental biology, and the dominant component of marine and freshwater vertebrate faunas. Despite the economic and scientific importance of ray-finned fishes, the lack of a single comprehensive phylogeny with corresponding divergence-time estimates has limited our understanding of the evolution and diversification of this radiation. Our analyses, which use multiple nuclear gene sequences in conjunction with 36 fossil age constraints, result in a well-supported phylogeny of all major ray-finned fish lineages and molecular age estimates that are generally consistent with the fossil record. This phylogeny informs three long-standing problems: specifically identifying elopomorphs (eels and tarpons) as the sister lineage of all other teleosts, providing a unique hypothesis on the radiation of early euteleosts, and offering a promising strategy for resolution of the "bush at the top of the tree" that includes percomorphs and other spiny-finned teleosts. Contrasting our divergence time estimates with studies using a single nuclear gene or whole mitochondrial genomes, we find that the former underestimates ages of the oldest ray-finned fish divergences, but the latter dramatically overestimates ages for derived teleost lineages. Our time-calibrated phylogeny reveals that much of the diversification leading to extant groups of teleosts occurred between the late Mesozoic and early Cenozoic, identifying this period as the "Second Age of Fishes."
Resolution of ray-finned fish phylogeny and timing of diversification
Near, Thomas J.; Eytan, Ron I.; Dornburg, Alex; Kuhn, Kristen L.; Moore, Jon A.; Davis, Matthew P.; Wainwright, Peter C.; Friedman, Matt; Smith, W. Leo
2012-01-01
Ray-finned fishes make up half of all living vertebrate species. Nearly all ray-finned fishes are teleosts, which include most commercially important fish species, several model organisms for genomics and developmental biology, and the dominant component of marine and freshwater vertebrate faunas. Despite the economic and scientific importance of ray-finned fishes, the lack of a single comprehensive phylogeny with corresponding divergence-time estimates has limited our understanding of the evolution and diversification of this radiation. Our analyses, which use multiple nuclear gene sequences in conjunction with 36 fossil age constraints, result in a well-supported phylogeny of all major ray-finned fish lineages and molecular age estimates that are generally consistent with the fossil record. This phylogeny informs three long-standing problems: specifically identifying elopomorphs (eels and tarpons) as the sister lineage of all other teleosts, providing a unique hypothesis on the radiation of early euteleosts, and offering a promising strategy for resolution of the “bush at the top of the tree” that includes percomorphs and other spiny-finned teleosts. Contrasting our divergence time estimates with studies using a single nuclear gene or whole mitochondrial genomes, we find that the former underestimates ages of the oldest ray-finned fish divergences, but the latter dramatically overestimates ages for derived teleost lineages. Our time-calibrated phylogeny reveals that much of the diversification leading to extant groups of teleosts occurred between the late Mesozoic and early Cenozoic, identifying this period as the “Second Age of Fishes.” PMID:22869754
Romiguier, Jonathan; Ranwez, Vincent; Delsuc, Frédéric; Galtier, Nicolas; Douzery, Emmanuel J P
2013-09-01
Despite the rapid increase of size in phylogenomic data sets, a number of important nodes on animal phylogeny are still unresolved. Among these, the rooting of the placental mammal tree is still a controversial issue. One difficulty lies in the pervasive phylogenetic conflicts among genes, with each one telling its own story, which may be reliable or not. Here, we identified a simple criterion, that is, the GC content, which substantially helps in determining which gene trees best reflect the species tree. We assessed the ability of 13,111 coding sequence alignments to correctly reconstruct the placental phylogeny. We found that GC-rich genes induced a higher amount of conflict among gene trees and performed worse than AT-rich genes in retrieving well-supported, consensual nodes on the placental tree. We interpret this GC effect mainly as a consequence of genome-wide variations in recombination rate. Indeed, recombination is known to drive GC-content evolution through GC-biased gene conversion and might be problematic for phylogenetic reconstruction, for instance, in an incomplete lineage sorting context. When we focused on the AT-richest fraction of the data set, the resolution level of the placental phylogeny was greatly increased, and a strong support was obtained in favor of an Afrotheria rooting, that is, Afrotheria as the sister group of all other placentals. We show that in mammals most conflicts among gene trees, which have so far hampered the resolution of the placental tree, are concentrated in the GC-rich regions of the genome. We argue that the GC content-because it is a reliable indicator of the long-term recombination rate-is an informative criterion that could help in identifying the most reliable molecular markers for species tree inference.
Kutschera, Verena E; Bidon, Tobias; Hailer, Frank; Rodi, Julia L; Fain, Steven R; Janke, Axel
2014-08-01
Ursine bears are a mammalian subfamily that comprises six morphologically and ecologically distinct extant species. Previous phylogenetic analyses of concatenated nuclear genes could not resolve all relationships among bears, and appeared to conflict with the mitochondrial phylogeny. Evolutionary processes such as incomplete lineage sorting and introgression can cause gene tree discordance and complicate phylogenetic inferences, but are not accounted for in phylogenetic analyses of concatenated data. We generated a high-resolution data set of autosomal introns from several individuals per species and of Y-chromosomal markers. Incorporating intraspecific variability in coalescence-based phylogenetic and gene flow estimation approaches, we traced the genealogical history of individual alleles. Considerable heterogeneity among nuclear loci and discordance between nuclear and mitochondrial phylogenies were found. A species tree with divergence time estimates indicated that ursine bears diversified within less than 2 My. Consistent with a complex branching order within a clade of Asian bear species, we identified unidirectional gene flow from Asian black into sloth bears. Moreover, gene flow detected from brown into American black bears can explain the conflicting placement of the American black bear in mitochondrial and nuclear phylogenies. These results highlight that both incomplete lineage sorting and introgression are prominent evolutionary forces even on time scales up to several million years. Complex evolutionary patterns are not adequately captured by strictly bifurcating models, and can only be fully understood when analyzing multiple independently inherited loci in a coalescence framework. Phylogenetic incongruence among gene trees hence needs to be recognized as a biologically meaningful signal. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Moore, Abigail J; Vos, Jurriaan M De; Hancock, Lillian P; Goolsby, Eric; Edwards, Erika J
2018-05-01
Hybrid enrichment is an increasingly popular approach for obtaining hundreds of loci for phylogenetic analysis across many taxa quickly and cheaply. The genes targeted for sequencing are typically single-copy loci, which facilitate a more straightforward sequence assembly and homology assignment process. However, this approach limits the inclusion of most genes of functional interest, which often belong to multi-gene families. Here, we demonstrate the feasibility of including large gene families in hybrid enrichment protocols for phylogeny reconstruction and subsequent analyses of molecular evolution, using a new set of bait sequences designed for the "portullugo" (Caryophyllales), a moderately sized lineage of flowering plants (~ 2200 species) that includes the cacti and harbors many evolutionary transitions to C$_{\\mathrm{4}}$ and CAM photosynthesis. Including multi-gene families allowed us to simultaneously infer a robust phylogeny and construct a dense sampling of sequences for a major enzyme of C$_{\\mathrm{4}}$ and CAM photosynthesis, which revealed the accumulation of adaptive amino acid substitutions associated with C$_{\\mathrm{4}}$ and CAM origins in particular paralogs. Our final set of matrices for phylogenetic analyses included 75-218 loci across 74 taxa, with ~ 50% matrix completeness across data sets. Phylogenetic resolution was greatly improved across the tree, at both shallow and deep levels. Concatenation and coalescent-based approaches both resolve the sister lineage of the cacti with strong support: Anacampserotaceae $+$ Portulacaceae, two lineages of mostly diminutive succulent herbs of warm, arid regions. In spite of this congruence, BUCKy concordance analyses demonstrated strong and conflicting signals across gene trees. Our results add to the growing number of examples illustrating the complexity of phylogenetic signals in genomic-scale data.
Delamuta, Jakeline Renata Marçon; Menna, Pâmela; Ribeiro, Renan Augusto; Hungria, Mariangela
2017-07-01
Bradyrhizobium comprises most tropical symbiotic nitrogen-fixing strains, but the correlation between symbiotic and core genes with host specificity is still unclear. In this study, the phylogenies of the nodY/K and nifH genes of 45 Bradyrhizobium strains isolated from legumes of economic and environmental importance in Brazil (Arachis hypogaea, Acacia auriculiformis, Glycine max, Lespedeza striata, Lupinus albus, Stylosanthes sp. and Vigna unguiculata) were compared to 16S rRNA gene phylogeny and genetic diversity by rep-PCR. In the 16S rRNA tree, strains were distributed into two superclades-B. japonicum and B. elkanii-with several strains being very similar within each clade. The rep-PCR analysis also revealed high intra-species diversity. Clustering of strains in the nodY/K and nifH trees was identical: 39 strains isolated from soybean grouped with Bradyrhizobium type species symbionts of soybean, whereas five others occupied isolated positions. Only one strain isolated from Stylosanthes sp. showed similar nodY/K and nifH sequences to soybean strains, and it also nodulated soybean. Twenty-one representative strains of the 16S rRNA phylogram were selected and taxonomically classified using a concatenated glnII-recA phylogeny; nodC sequences were also compared and revealed the same clusters as observed in the nodY/K and nifH phylograms. The analyses of symbiotic genes indicated that a large group of strains from the B. elkanii superclade comprised the novel symbiovar sojae, whereas for another group, including B. pachyrhizi, the symbiovar pachyrhizi could be proposed. Other potential new symbiovars were also detected. The co-evolution hypotheses is discussed and it is suggested that nodY/K analysis would be useful for investigating the symbiotic diversity of the genus Bradyrhizobium. Copyright © 2017 Elsevier GmbH. All rights reserved.
The evolutionary landscape of intergenic trans-splicing events in insects
Kong, Yimeng; Zhou, Hongxia; Yu, Yao; Chen, Longxian; Hao, Pei; Li, Xuan
2015-01-01
To explore the landscape of intergenic trans-splicing events and characterize their functions and evolutionary dynamics, we conduct a mega-data study of a phylogeny containing eight species across five orders of class Insecta, a model system spanning 400 million years of evolution. A total of 1,627 trans-splicing events involving 2,199 genes are identified, accounting for 1.58% of the total genes. Homology analysis reveals that mod(mdg4)-like trans-splicing is the only conserved event that is consistently observed in multiple species across two orders, which represents a unique case of functional diversification involving trans-splicing. Thus, evolutionarily its potential for generating proteins with novel function is not broadly utilized by insects. Furthermore, 146 non-mod trans-spliced transcripts are found to resemble canonical genes from different species. Trans-splicing preserving the function of ‘breakup' genes may serve as a general mechanism for relaxing the constraints on gene structure, with profound implications for the evolution of genes and genomes. PMID:26521696
Molecular phylogeny and systematics of the Echinostomatoidea Looss, 1899 (Platyhelminthes: Digenea).
Tkach, Vasyl V; Kudlai, Olena; Kostadinova, Aneta
2016-03-01
The Echinostomatoidea is a large, cosmopolitan group of digeneans currently including nine families and 105 genera, the vast majority parasitic, as adults, in birds with relatively few taxa parasitising mammals, reptiles and, exceptionally, fish. Despite the complex structure, diverse content and substantial species richness of the group, almost no attempt has been made to elucidate its phylogenetic relationships at the suprageneric level based on molecules due to the lack of data. Herein, we evaluate the consistency of the present morphology-based classification system of the Echinostomatoidea with the phylogenetic relationships of its members based on partial sequences of the nuclear lsrRNA gene for a broad diversity of taxa (80 species, representing eight families and 40 genera), including representatives of five subfamilies of the Echinostomatidae, which currently exhibits the most complex taxonomic structure within the superfamily. This first comprehensive phylogeny for the Echinostomatoidea challenged the current systematic framework based on comparative morphology. A morphology-based evaluation of this new molecular framework resulted in a number of systematic and nomenclatural changes consistent with the phylogenetic estimates of the generic and suprageneric boundaries and a new phylogeny-based classification of the Echinostomatoidea. In the current systematic treatment: (i) the rank of two family level lineages, the former Himasthlinae and Echinochasminae, is elevated to full family status; (ii) Caballerotrema is distinguished at the family level; (iii) the content and diagnosis of the Echinostomatidae (sensu stricto) (s. str.) are revised to reflect its phylogeny, resulting in the abolition of the Nephrostominae and Chaunocephalinae as synonyms of the Echinostomatidae (s. str.); (iv) Artyfechinostomum, Cathaemasia, Rhopalias and Ribeiroia are re-allocated within the Echinostomatidae (s. str.), resulting in the abolition of the Cathaemasiidae, Rhopaliidae and Ribeiroiinae, which become synonyms of the Echinostomatidae (s. str.); and (v) refinements of the generic boundaries within the Echinostomatidae (s. str.), Psilostomidae and Fasciolidae are made. Copyright © 2015 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
Graupner, Nadine; Bock, Christina; Wodniok, Sabina; Grossmann, Lars; Vos, Matthijs; Sures, Bernd
2017-01-01
Background Chrysophytes are protist model species in ecology and ecophysiology and important grazers of bacteria-sized microorganisms and primary producers. However, they have not yet been investigated in detail at the molecular level, and no genomic and only little transcriptomic information is available. Chrysophytes exhibit different trophic modes: while phototrophic chrysophytes perform only photosynthesis, mixotrophs can gain carbon from bacterial food as well as from photosynthesis, and heterotrophs solely feed on bacteria-sized microorganisms. Recent phylogenies and megasystematics demonstrate an immense complexity of eukaryotic diversity with numerous transitions between phototrophic and heterotrophic organisms. The question we aim to answer is how the diverse nutritional strategies, accompanied or brought about by a reduction of the plasmid and size reduction in heterotrophic strains, affect physiology and molecular processes. Results We sequenced the mRNA of 18 chrysophyte strains on the Illumina HiSeq platform and analysed the transcriptomes to determine relations between the trophic mode (mixotrophic vs. heterotrophic) and gene expression. We observed an enrichment of genes for photosynthesis, porphyrin and chlorophyll metabolism for phototrophic and mixotrophic strains that can perform photosynthesis. Genes involved in nutrient absorption, environmental information processing and various transporters (e.g., monosaccharide, peptide, lipid transporters) were present or highly expressed only in heterotrophic strains that have to sense, digest and absorb bacterial food. We furthermore present a transcriptome-based alignment-free phylogeny construction approach using transcripts assembled from short reads to determine the evolutionary relationships between the strains and the possible influence of nutritional strategies on the reconstructed phylogeny. We discuss the resulting phylogenies in comparison to those from established approaches based on ribosomal RNA and orthologous genes. Finally, we make functionally annotated reference transcriptomes of each strain available to the community, significantly enhancing publicly available data on Chrysophyceae. Conclusions Our study is the first comprehensive transcriptomic characterisation of a diverse set of Chrysophyceaen strains. In addition, we showcase the possibility of inferring phylogenies from assembled transcriptomes using an alignment-free approach. The raw and functionally annotated data we provide will prove beneficial for further examination of the diversity within this taxon. Our molecular characterisation of different trophic modes presents a first such example. PMID:28097055
Graham Reynolds, R; Niemiller, Matthew L; Revell, Liam J
2014-02-01
Snakes in the families Boidae and Pythonidae constitute some of the most spectacular reptiles and comprise an enormous diversity of morphology, behavior, and ecology. While many species of boas and pythons are familiar, taxonomy and evolutionary relationships within these families remain contentious and fluid. A major effort in evolutionary and conservation biology is to assemble a comprehensive Tree-of-Life, or a macro-scale phylogenetic hypothesis, for all known life on Earth. No previously published study has produced a species-level molecular phylogeny for more than 61% of boa species or 65% of python species. Using both novel and previously published sequence data, we have produced a species-level phylogeny for 84.5% of boid species and 82.5% of pythonid species, contextualized within a larger phylogeny of henophidian snakes. We obtained new sequence data for three boid, one pythonid, and two tropidophiid taxa which have never previously been included in a molecular study, in addition to generating novel sequences for seven genes across an additional 12 taxa. We compiled an 11-gene dataset for 127 taxa, consisting of the mitochondrial genes CYTB, 12S, and 16S, and the nuclear genes bdnf, bmp2, c-mos, gpr35, rag1, ntf3, odc, and slc30a1, totaling up to 7561 base pairs per taxon. We analyzed this dataset using both maximum likelihood and Bayesian inference and recovered a well-supported phylogeny for these species. We found significant evidence of discordance between taxonomy and evolutionary relationships in the genera Tropidophis, Morelia, Liasis, and Leiopython, and we found support for elevating two previously suggested boid species. We suggest a revised taxonomy for the boas (13 genera, 58 species) and pythons (8 genera, 40 species), review relationships between our study and the many other molecular phylogenetic studies of henophidian snakes, and present a taxonomic database and alignment which may be easily used and built upon by other researchers. Copyright © 2013 Elsevier Inc. All rights reserved.
Fulton, Tara Lynn; Strobeck, Curtis
2010-04-07
Despite decades of study, some aspects of Phocidae (Pinnipedia, Carnivora) phylogeny still remain unresolved. Using the largest novel dataset to date, including all extant phocids and comprising 15 nuclear and 13 mitochondrial genes, we illustrate the utility of including multiple individuals per species in resolving rapid radiations, and provide new insight into phocid phylogeny. In line with longstanding morphological views, Pusa is recovered as monophyletic for the first time with genetic data. The data are also used to explore the relationship between genetic distance and taxonomic rank. Intraspecific sampling also highlights the discrepancy between molecular and morphological rates of evolution within Phocidae.
MANTIS: a phylogenetic framework for multi-species genome comparisons.
Tzika, Athanasia C; Helaers, Raphaël; Van de Peer, Yves; Milinkovitch, Michel C
2008-01-15
Practitioners of comparative genomics face huge analytical challenges as whole genome sequences and functional/expression data accumulate. Furthermore, the field would greatly benefit from a better integration of this wealth of data with evolutionary concepts. Here, we present MANTIS, a relational database for the analysis of (i) gains and losses of genes on specific branches of the metazoan phylogeny, (ii) reconstructed genome content of ancestral species and (iii) over- or under-representation of functions/processes and tissue specificity of gained, duplicated and lost genes. MANTIS estimates the most likely positions of gene losses on the true phylogeny using a maximum-likelihood function. A user-friendly interface and an extensive query system allow to investigate questions pertaining to gene identity, phylogenetic mapping and function/expression parameters. MANTIS is freely available at http://www.mantisdb.org and constitutes the missing link between multi-species genome comparisons and functional analyses.
Construction of a Species-Level Tree of Life for the Insects and Utility in Taxonomic Profiling
Chesters, Douglas
2017-01-01
Abstract Although comprehensive phylogenies have proven an invaluable tool in ecology and evolution, their construction is made increasingly challenging both by the scale and structure of publically available sequences. The distinct partition between gene-rich (genomic) and species-rich (DNA barcode) data is a feature of data that has been largely overlooked, yet presents a key obstacle to scaling supermatrix analysis. I present a phyloinformatics framework for draft construction of a species-level phylogeny of insects (Class Insecta). Matrix-building requires separately optimized pipelines for nuclear transcriptomic, mitochondrial genomic, and species-rich markers, whereas tree-building requires hierarchical inference in order to capture species-breadth while retaining deep-level resolution. The phylogeny of insects contains 49,358 species, 13,865 genera, 760 families. Deep-level splits largely reflected previous findings for sections of the tree that are data rich or unambiguous, such as inter-ordinal Endopterygota and Dictyoptera, the recently evolved and relatively homogeneous Lepidoptera, Hymenoptera, Brachycera (Diptera), and Cucujiformia (Coleoptera). However, analysis of bias, matrix construction and gene-tree variation suggests confidence in some relationships (such as in Polyneoptera) is less than has been indicated by the matrix bootstrap method. To assess the utility of the insect tree as a tool in query profiling several tree-based taxonomic assignment methods are compared. Using test data sets with existing taxonomic annotations, a tendency is observed for greater accuracy of species-level assignments where using a fixed comprehensive tree of life in contrast to methods generating smaller de novo reference trees. Described herein is a solution to the discrepancy in the way data are fit into supermatrices. The resulting tree facilitates wider studies of insect diversification and application of advanced descriptions of diversity in community studies, among other presumed applications. PMID:27798407
Koç, Ibrahim; Caetano-Anollés, Gustavo
2017-01-01
The origin and natural history of molecular functions hold the key to the emergence of cellular organization and modern biochemistry. Here we use a genomic census of Gene Ontology (GO) terms to reconstruct phylogenies at the three highest (1, 2 and 3) and the lowest (terminal) levels of the hierarchy of molecular functions, which reflect the broadest and the most specific GO definitions, respectively. These phylogenies define evolutionary timelines of functional innovation. We analyzed 249 free-living organisms comprising the three superkingdoms of life, Archaea, Bacteria, and Eukarya. Phylogenies indicate catalytic, binding and transport functions were the oldest, suggesting a ‘metabolism-first’ origin scenario for biochemistry. Metabolism made use of increasingly complicated organic chemistry. Primordial features of ancient molecular functions and functional recruitments were further distilled by studying the oldest child terms of the oldest level 1 GO definitions. Network analyses showed the existence of an hourglass pattern of enzyme recruitment in the molecular functions of the directed acyclic graph of molecular functions. Older high-level molecular functions were thoroughly recruited at younger lower levels, while very young high-level functions were used throughout the timeline. This pattern repeated in every one of the three mappings, which gave a criss-cross pattern. The timelines and their mappings were remarkable. They revealed the progressive evolutionary development of functional toolkits, starting with the early rise of metabolic activities, followed chronologically by the rise of macromolecular biosynthesis, the establishment of controlled interactions with the environment and self, adaptation to oxygen, and enzyme coordinated regulation, and ending with the rise of structural and cellular complexity. This historical account holds important clues for dissection of the emergence of biomcomplexity and life. PMID:28467492
te Biesebeke, Rob; Levasseur, Anthony; Boussier, Amandine; Record, Eric; van den Hondel, Cees A M J J; Punt, Peter J
2010-01-01
The fhbA genes encoding putative flavohemoglobins (FHb) from Aspergillus niger and Aspergillus oryzae were isolated. Comparison of the deduced amino acid sequence of the A. niger fhbA gene and other putative filamentous fungal FHb-encoding genes to that of Ralstonia eutropha shows an overall conserved gene structure and completely conserved catalytic amino acids. Several yeasts and filamentous fungi, including both Aspergillus species have been found to contain a small FHb gene family mostly consisting of two family members. Based on these sequences the evolutionary history of the fungal FHb family was reconstructed. The isolated fhbA genes from A. oryzae and A. niger belong to a phylogenetic group, which exclusively contains Aspergillus genes. Different experimental approaches show that fhbA transcript levels appear during active hyphal growth. Moreover, in a pclA-disrupted strain with a hyperbranching growth phenotype, the transcript levels of the fhbA gene were 2–5 times higher compared to the wild-type. These results suggest that FHb from filamentous fungi have a function that is correlated to the hyphal growth phenotype.
Genome-wide identification, phylogeny, and expression analysis of the SWEET gene family in tomato.
Feng, Chao-Yang; Han, Jia-Xuan; Han, Xiao-Xue; Jiang, Jing
2015-12-01
The SWEET (Sugars Will Eventually Be Exported Transporters) gene family encodes membrane-embedded sugar transporters containing seven transmembrane helices harboring two MtN3 and saliva domain. SWEETs play important roles in diverse biological processes, including plant growth, development, and response to environmental stimuli. Here, we conducted an exhaustive search of the tomato genome, leading to the identification of 29 SWEET genes. We analyzed the structures, conserved domains, and phylogenetic relationships of these protein-coding genes in detail. We also analyzed the transcript levels of SWEET genes in various tissues, organs, and developmental stages to obtain information about their functions. Furthermore, we investigated the expression patterns of the SWEET genes in response to exogenous sugar and adverse environmental stress (high and low temperatures). Some family members exhibited tissue-specific expression, whereas others were more ubiquitously expressed. Numerous stress-responsive candidate genes were obtained. The results of this study provide insights into the characteristics of the SWEET genes in tomato and may serve as a basis for further functional studies of such genes. Copyright © 2015 Elsevier B.V. All rights reserved.
A salmonid EST genomic study: genes, duplications, phylogeny and microarrays
USDA-ARS?s Scientific Manuscript database
Background: Salmonids are of interest because of their relatively recent genome duplication, and their extensive use in wild fisheries and aquaculture. A comprehensive gene list and a comparison of genes in some of the different species provide valuable genomic information for one of the most wide...
Inferring Epidemic Contact Structure from Phylogenetic Trees
Leventhal, Gabriel E.; Kouyos, Roger; Stadler, Tanja; von Wyl, Viktor; Yerly, Sabine; Böni, Jürg; Cellerai, Cristina; Klimkait, Thomas; Günthard, Huldrych F.; Bonhoeffer, Sebastian
2012-01-01
Contact structure is believed to have a large impact on epidemic spreading and consequently using networks to model such contact structure continues to gain interest in epidemiology. However, detailed knowledge of the exact contact structure underlying real epidemics is limited. Here we address the question whether the structure of the contact network leaves a detectable genetic fingerprint in the pathogen population. To this end we compare phylogenies generated by disease outbreaks in simulated populations with different types of contact networks. We find that the shape of these phylogenies strongly depends on contact structure. In particular, measures of tree imbalance allow us to quantify to what extent the contact structure underlying an epidemic deviates from a null model contact network and illustrate this in the case of random mixing. Using a phylogeny from the Swiss HIV epidemic, we show that this epidemic has a significantly more unbalanced tree than would be expected from random mixing. PMID:22412361
Nadimi, Maryam; Daubois, Laurence; Hijri, Mohamed
2016-05-01
Mitochondrial (mt) genes, such as cytochrome C oxidase genes (cox), have been widely used for barcoding in many groups of organisms, although this approach has been less powerful in the fungal kingdom due to the rapid evolution of their mt genomes. The use of mt genes in phylogenetic studies of Dikarya has been met with success, while early diverging fungal lineages remain less studied, particularly the arbuscular mycorrhizal fungi (AMF). Advances in next-generation sequencing have substantially increased the number of publically available mtDNA sequences for the Glomeromycota. As a result, comparison of mtDNA across key AMF taxa can now be applied to assess the phylogenetic signal of individual mt coding genes, as well as concatenated subsets of coding genes. Here we show comparative analyses of publically available mt genomes of Glomeromycota, augmented with two mtDNA genomes that were newly sequenced for this study (Rhizophagus irregularis DAOM240159 and Glomus aggregatum DAOM240163), resulting in 16 complete mtDNA datasets. R. irregularis isolate DAOM240159 and G. aggregatum isolate DAOM240163 showed mt genomes measuring 72,293bp and 69,505bp with G+C contents of 37.1% and 37.3%, respectively. We assessed the phylogenies inferred from single mt genes and complete sets of coding genes, which are referred to as "supergenes" (16 concatenated coding genes), using Shimodaira-Hasegawa tests, in order to identify genes that best described AMF phylogeny. We found that rnl, nad5, cox1, and nad2 genes, as well as concatenated subset of these genes, provided phylogenies that were similar to the supergene set. This mitochondrial genomic analysis was also combined with principal coordinate and partitioning analyses, which helped to unravel certain evolutionary relationships in the Rhizophagus genus and for G. aggregatum within the Glomeromycota. We showed evidence to support the position of G. aggregatum within the R. irregularis 'species complex'. Copyright © 2016 Elsevier Inc. All rights reserved.
Rickettsia Phylogenomics: Unwinding the Intricacies of Obligate Intracellular Life
Gillespie, Joseph J.; Williams, Kelly; Shukla, Maulik; Snyder, Eric E.; Nordberg, Eric K.; Ceraul, Shane M.; Dharmanolla, Chitti; Rainey, Daphne; Soneja, Jeetendra; Shallom, Joshua M.; Vishnubhat, Nataraj Dongre; Wattam, Rebecca; Purkayastha, Anjan; Czar, Michael; Crasta, Oswald; Setubal, Joao C.; Azad, Abdu F.; Sobral, Bruno S.
2008-01-01
Background Completed genome sequences are rapidly increasing for Rickettsia, obligate intracellular α-proteobacteria responsible for various human diseases, including epidemic typhus and Rocky Mountain spotted fever. In light of phylogeny, the establishment of orthologous groups (OGs) of open reading frames (ORFs) will distinguish the core rickettsial genes and other group specific genes (class 1 OGs or C1OGs) from those distributed indiscriminately throughout the rickettsial tree (class 2 OG or C2OGs). Methodology/Principal Findings We present 1823 representative (no gene duplications) and 259 non-representative (at least one gene duplication) rickettsial OGs. While the highly reductive (∼1.2 MB) Rickettsia genomes range in predicted ORFs from 872 to 1512, a core of 752 OGs was identified, depicting the essential Rickettsia genes. Unsurprisingly, this core lacks many metabolic genes, reflecting the dependence on host resources for growth and survival. Additionally, we bolster our recent reclassification of Rickettsia by identifying OGs that define the AG (ancestral group), TG (typhus group), TRG (transitional group), and SFG (spotted fever group) rickettsiae. OGs for insect-associated species, tick-associated species and species that harbor plasmids were also predicted. Through superimposition of all OGs over robust phylogeny estimation, we discern between C1OGs and C2OGs, the latter depicting genes either decaying from the conserved C1OGs or acquired laterally. Finally, scrutiny of non-representative OGs revealed high levels of split genes versus gene duplications, with both phenomena confounding gene orthology assignment. Interestingly, non-representative OGs, as well as OGs comprised of several gene families typically involved in microbial pathogenicity and/or the acquisition of virulence factors, fall predominantly within C2OG distributions. Conclusion/Significance Collectively, we determined the relative conservation and distribution of 14354 predicted ORFs from 10 rickettsial genomes across robust phylogeny estimation. The data, available at PATRIC (PathoSystems Resource Integration Center), provide novel information for unwinding the intricacies associated with Rickettsia pathogenesis, expanding the range of potential diagnostic, vaccine and therapeutic targets. PMID:19194535
Ghedotti, Michael J; Barton, Ryan W; Simons, Andrew M; Davis, Matthew P
2015-03-01
Bioluminescent organs that provide ventral camouflage are common among fishes in the meso-bathypelagic zones of the deep sea. However, the anatomical structures that have been modified to produce light vary substantially among different groups of fishes. Although the anatomical structure and evolutionary derivation of some of these organs have been well studied, the light organs of the naked barracudinas have received little scientific attention. This study describes the anatomy and evolution of bioluminescent organs in the Lestidiidae (naked barracudinas) in the context of a new phylogeny of barracudinas and closely related alepisauroid fishes. Gross and histological examination of bioluminescent organs or homologous structures from preserved museum specimens indicate that the ventral light organ is derived from hepatopancreatic tissue and that the antorbital spot in Lestrolepis is, in fact, a second dermal light organ. In the context of the phylogeny generated from DNA-sequence data from eight gene fragments (7 nuclear and 1 mitochondrial), a complex liver with a narrow ventral strand running along the ventral midline evolves first in the Lestidiidae. The ventral hepatopancreatic tissue later evolves into a ventral bioluminescent organ in the ancestor of Lestidium and Lestrolepis with the lineage leading to the genus Lestrolepis evolving a dermal antorbital bioluminescent organ, likely for light-intensity matching. This is the first described hepatopancreatic bioluminescent organ in fishes. © 2014 Wiley Periodicals, Inc.
Phylogeny of the bears (Ursidae) based on nuclear and mitochondrial genes.
Yu, Li; Li, Qing-wei; Ryder, O A; Zhang, Ya-ping
2004-08-01
The taxomic classification and phylogenetic relationships within the bear family remain argumentative subjects in recent years. Prior investigation has been concentrated on the application of different mitochondrial (mt) sequence data, herein we employ two nuclear single-copy gene segments, the partial exon 1 from gene encoding interphotoreceptor retinoid binding protein (IRBP) and the complete intron 1 from transthyretin (TTR) gene, in conjunction with previously published mt data, to clarify these enigmatic problems. The combined analyses of nuclear IRBP and TTR datasets not only corroborated prior hypotheses, positioning the spectacled bear most basally and grouping the brown and polar bear together but also provided new insights into the bear phylogeny, suggesting the sister-taxa association of sloth bear and sun bear with strong support. Analyses based on combination of nuclear and mt genes differed from nuclear analysis in recognizing the sloth bears as the earliest diverging species among the subfamily ursine representatives while the exact placement of the sun bear did not resolved. Asiatic and American black bears clustered as sister group in all analyses with moderate levels of bootstrap support and high posterior probabilities. Comparisons between the nuclear and mtDNA findings suggested that our combined nuclear dataset have the resolving power comparable to mtDNA dataset for the phylogenetic interpretation of the bear family. As can be seen from present study, the unanimous phylogeny for this recently derived family was still not produced and additional independent genetic markers were in need.
Mishra, Ravi P N; Tisseyre, Pierre; Melkonian, Rémy; Chaintreuil, Clémence; Miché, Lucie; Klonowska, Agnieszka; Gonzalez, Sophie; Bena, Gilles; Laguerre, Gisèle; Moulin, Lionel
2012-02-01
The genetic diversity of 221 Mimosa pudica bacterial symbionts trapped from eight soils from diverse environments in French Guiana was assessed by 16S rRNA PCR-RFLP, REP-PCR fingerprints, as well as by phylogenies of their 16S rRNA and recA housekeeping genes, and by their nifH, nodA and nodC symbiotic genes. Interestingly, we found a large diversity of beta-rhizobia, with Burkholderia phymatum and Burkholderia tuberum being the most frequent and diverse symbiotic species. Other species were also found, such as Burkholderia mimosarum, an unnamed Burkholderia species and, for the first time in South America, Cupriavidus taiwanensis. The sampling site had a strong influence on the diversity of the symbionts sampled, and the specific distributions of symbiotic populations between the soils were related to soil composition in some cases. Some alpha-rhizobial strains taxonomically close to Rhizobium endophyticum were also trapped in one soil, and these carried two copies of the nodA gene, a feature not previously reported. Phylogenies of nodA, nodC and nifH genes showed a monophyly of symbiotic genes for beta-rhizobia isolated from Mimosa spp., indicative of a long history of interaction between beta-rhizobia and Mimosa species. Based on their symbiotic gene phylogenies and legume hosts, B. tuberum was shown to contain two large biovars: one specific to the mimosoid genus Mimosa and one to South African papilionoid legumes. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.
C.L. Schoch; G.-H. Sung; F. Lopez-Giraldez
2009-01-01
We present a six-gene, 420-species maximum-likelihood phylogeny of Ascomycota, the largest phylum of fungi. This analysis is the most taxonomically complete to date with species sampled from all 15 currently circumscribed classes. A number of superclass-level nodes that have previously evaded resolution and were unnamed in classifications of the fungi are resolved for...
Zeng, Liping; Zhang, Ning; Zhang, Qiang; Endress, Peter K; Huang, Jie; Ma, Hong
2017-05-01
Explosive diversification is widespread in eukaryotes, making it difficult to resolve phylogenetic relationships. Eudicots contain c. 75% of extant flowering plants, are important for human livelihood and terrestrial ecosystems, and have probably experienced explosive diversifications. The eudicot phylogenetic relationships, especially among those of the Pentapetalae, remain unresolved. Here, we present a highly supported eudicot phylogeny and diversification rate shifts using 31 newly generated transcriptomes and 88 other datasets covering 70% of eudicot orders. A highly supported eudicot phylogeny divided Pentapetalae into two groups: one with rosids, Saxifragales, Vitales and Santalales; the other containing asterids, Caryophyllales and Dilleniaceae, with uncertainty for Berberidopsidales. Molecular clock analysis estimated that crown eudicots originated c. 146 Ma, considerably earlier than earliest tricolpate pollen fossils and most other molecular clock estimates, and Pentapetalae sequentially diverged into eight major lineages within c. 15 Myr. Two identified increases of diversification rate are located in the stems leading to Pentapetalae and asterids, and lagged behind the gamma hexaploidization. The nuclear genes from newly generated transcriptomes revealed a well-resolved eudicot phylogeny, sequential separation of major core eudicot lineages and temporal mode of diversifications, providing new insights into the evolutionary trend of morphologies and contributions to the diversification of eudicots. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Kinship structures create persistent channels for language transmission
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lansing, J. Stephen; Abundo, Cheryl; Jacobs, Guy S.
Here, languages are transmitted through channels created by kinship systems. Given sufficient time, these kinship channels can change the genetic and linguistic structure of populations. In traditional societies of eastern Indonesia, finely resolved cophylogenies of languages and genes reveal persistent movements between stable speech communities facilitated by kinship rules. When multiple languages are present in a region and postmarital residence rules encourage sustained directional movement between speech communities, then languages should be channeled along uniparental lines. We find strong evidence for this pattern in 982 individuals from 25 villages on two adjacent islands, where different kinship rules have been followed.more » Core groups of close relatives have stayed together for generations, while remaining in contact with, and marrying into, surrounding groups. Over time, these kinship systems shaped their gene and language phylogenies: Consistently following a postmarital residence rule turned social communities into speech communities.« less
Kinship structures create persistent channels for language transmission
Lansing, J. Stephen; Abundo, Cheryl; Jacobs, Guy S.; ...
2017-11-20
Here, languages are transmitted through channels created by kinship systems. Given sufficient time, these kinship channels can change the genetic and linguistic structure of populations. In traditional societies of eastern Indonesia, finely resolved cophylogenies of languages and genes reveal persistent movements between stable speech communities facilitated by kinship rules. When multiple languages are present in a region and postmarital residence rules encourage sustained directional movement between speech communities, then languages should be channeled along uniparental lines. We find strong evidence for this pattern in 982 individuals from 25 villages on two adjacent islands, where different kinship rules have been followed.more » Core groups of close relatives have stayed together for generations, while remaining in contact with, and marrying into, surrounding groups. Over time, these kinship systems shaped their gene and language phylogenies: Consistently following a postmarital residence rule turned social communities into speech communities.« less
Kinship structures create persistent channels for language transmission.
Lansing, J Stephen; Abundo, Cheryl; Jacobs, Guy S; Guillot, Elsa G; Thurner, Stefan; Downey, Sean S; Chew, Lock Yue; Bhattacharya, Tanmoy; Chung, Ning Ning; Sudoyo, Herawati; Cox, Murray P
2017-12-05
Languages are transmitted through channels created by kinship systems. Given sufficient time, these kinship channels can change the genetic and linguistic structure of populations. In traditional societies of eastern Indonesia, finely resolved cophylogenies of languages and genes reveal persistent movements between stable speech communities facilitated by kinship rules. When multiple languages are present in a region and postmarital residence rules encourage sustained directional movement between speech communities, then languages should be channeled along uniparental lines. We find strong evidence for this pattern in 982 individuals from 25 villages on two adjacent islands, where different kinship rules have been followed. Core groups of close relatives have stayed together for generations, while remaining in contact with, and marrying into, surrounding groups. Over time, these kinship systems shaped their gene and language phylogenies: Consistently following a postmarital residence rule turned social communities into speech communities. Copyright © 2017 the Author(s). Published by PNAS.
Zheng, Xiaoyan; Cai, Danying; Potter, Daniel; Postman, Joseph; Liu, Jing; Teng, Yuanwen
2014-11-01
Reconstructing the phylogeny of Pyrus has been difficult due to the wide distribution of the genus and lack of informative data. In this study, we collected 110 accessions representing 25 Pyrus species and constructed both phylogenetic trees and phylogenetic networks based on multiple DNA sequence datasets. Phylogenetic trees based on both cpDNA and nuclear LFY2int2-N (LN) data resulted in poor resolution, especially, only five primary species were monophyletic in the LN tree. A phylogenetic network of LN suggested that reticulation caused by hybridization is one of the major evolutionary processes for Pyrus species. Polytomies of the gene trees and star-like structure of cpDNA networks suggested rapid radiation is another major evolutionary process, especially for the occidental species. Pyrus calleryana and P. regelii were the earliest diverged Pyrus species. Two North African species, P. cordata, P. spinosa and P. betulaefolia were descendent of primitive stock Pyrus species and still share some common molecular characters. Southwestern China, where a large number of P. pashia populations are found, is probably the most important diversification center of Pyrus. More accessions and nuclear genes are needed for further understanding the evolutionary histories of Pyrus. Copyright © 2014 Elsevier Inc. All rights reserved.
Berenger, Byron M; Berry, Chrystal; Peterson, Trevor; Fach, Patrick; Delannoy, Sabine; Li, Vincent; Tschetter, Lorelee; Nadon, Celine; Honish, Lance; Louie, Marie; Chui, Linda
2015-01-01
A standardised method for determining Escherichia coli O157:H7 strain relatedness using whole genome sequencing or virulence gene profiling is not yet established. We sought to assess the capacity of either high-throughput polymerase chain reaction (PCR) of 49 virulence genes, core-genome single nt variants (SNVs) or k-mer clustering to discriminate between outbreak-associated and sporadic E. coli O157:H7 isolates. Three outbreaks and multiple sporadic isolates from the province of Alberta, Canada were included in the study. Two of the outbreaks occurred concurrently in 2014 and one occurred in 2012. Pulsed-field gel electrophoresis (PFGE) and multilocus variable-number tandem repeat analysis (MLVA) were employed as comparator typing methods. The virulence gene profiles of isolates from the 2012 and 2014 Alberta outbreak events and contemporary sporadic isolates were mostly identical; therefore the set of virulence genes chosen in this study were not discriminatory enough to distinguish between outbreak clusters. Concordant with PFGE and MLVA results, core genome SNV and k-mer phylogenies clustered isolates from the 2012 and 2014 outbreaks as distinct events. k-mer phylogenies demonstrated increased discriminatory power compared with core SNV phylogenies. Prior to the widespread implementation of whole genome sequencing for routine public health use, issues surrounding cost, technical expertise, software standardisation, and data sharing/comparisons must be addressed.
Navarro, Aaron; Martínez-Murcia, Antonio
2018-04-19
The phylogenies derived from housekeeping gene sequence alignments, although mere evolutionary hypotheses, have increased our knowledge about the Aeromonas genetic diversity, providing a robust species delineation framework invaluable for reliable, easy and fast species identification. Previous classifications of Aeromonas, have been fully surpassed by recently developed phylogenetic (natural) classification obtained from the analysis of so-called "molecular chronometers". Despite ribosomal RNAs cannot split all known Aeromonas species, the conserved nature of 16S rRNA offers reliable alignments containing mosaics of sequence signatures which may serve as targets of genus-specific oligonucleotides for subsequent identification/detection tests in samples without culturing. On the contrary, some housekeeping genes coding for proteins show a much better chronometric capacity to discriminate highly related strains. Although both, species and loci, do not all evolve at exactly the same rate, published Aeromonas phylogenies were congruent to each other, indicating that, phylogenetic markers are synchronized and a concatenated multi-gene phylogeny, may be "the mirror" of the entire genomic relationships. Thanks to MLPA approaches, the discovery of new Aeromonas species and strains of rarely isolated species is today more frequent and, consequently, should be extensively promoted for isolate screening and species identification. Although, accumulated data still should be carefully catalogued to inherit a reliable database. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Thongbai, Benjarong; Miller, Steven L.; Stadler, Marc; Wittstein, Kathrin; Hyde, Kevin D.; Lumyong, Saisamorn
2017-01-01
Amanita ballerina and A. brunneitoxicaria spp. nov. are introduced from Thailand. Amanita fuligineoides is also reported for the first time from Thailand, increasing the known distribution of this taxon. Together, those findings support our view that many taxa are yet to be discovered in the region. While both morphological characters and a multiple-gene phylogeny clearly place A. brunneitoxicaria and A. fuligineoides in sect. Phalloideae (Fr.) Quél., the placement of A. ballerina is problematic. On the one hand, the morphology of A. ballerina shows clear affinities with stirps Limbatula of sect. Lepidella. On the other hand, in a multiple-gene phylogeny including taxa of all sections in subg. Lepidella, A. ballerina and two other species, including A. zangii, form a well-supported clade sister to the Phalloideae sensu Bas 1969, which include the lethal “death caps” and “destroying angels”. Together, the A. ballerina-A. zangii clade and the Phalloideae sensu Bas 1969 also form a well-supported clade. We therefore screened for two of the most notorious toxins by HPLC-MS analysis of methanolic extracts from the basidiomata. Interestingly, neither α-amanitin nor phalloidin was found in A. ballerina, whereas Amanita fuligineoides was confirmed to contain both α-amanitin and phalloidin, and A. brunneitoxicaria contained only α-amanitin. Together with unique morphological characteristics, the position in the phylogeny indicates that A. ballerina is either an important link in the evolution of the deadly Amanita sect. Phalloideae species, or a member of a new section also including A. zangii. PMID:28767681
Trait Differentiation within the Fungus-Feeding (Mycophagous) Bacterial Genus Collimonas
Ballhausen, Max-Bernhard; Vandamme, Peter; de Boer, Wietse
2016-01-01
The genus Collimonas consists of facultative, fungus-feeding (mycophagous) bacteria. To date, 3 species (C. fungivorans, C. pratensis and C. arenae) have been described and over 100 strains have been isolated from different habitats. Functional traits of Collimonas bacteria that are potentially involved in interactions with soil fungi mostly negatively (fungal inhibition e.g.), but also positively (mineral weathering e.g.), affect fungal fitness. We hypothesized that variation in such traits between Collimonas strains leads to different mycophagous bacterial feeding patterns. We investigated a) whether phylogenetically closely related Collimonas strains possess similar traits, b) how far phylogenetic resolution influences the detection of phylogenetic signal (possession of similar traits by related strains) and c) if there is a pattern of co-occurrence among the studied traits. We measured genetically encoded (nifH genes, antifungal collimomycin gene cluster e.g.) as well as phenotypically expressed traits (chitinase- and siderophore production, fungal inhibition and others) and related those to a high-resolution phylogeny (MLSA), constructed by sequencing the housekeeping genes gyrB and rpoB and concatenating those with partial 16S rDNA sequences. Additionally, high-resolution and 16S rDNA derived phylogenies were compared. We show that MLSA is superior to 16SrDNA phylogeny when analyzing trait distribution and relating it to phylogeny at fine taxonomic resolution (a single bacterial genus). We observe that several traits involved in the interaction of collimonads and their host fungus (fungal inhibition e.g.) carry phylogenetic signal. Furthermore, we compare Collimonas trait possession with sister genera like Herbaspirillum and Janthinobacterium. PMID:27309848
Using secondary structure to identify ribosomal numts: cautionary examples from the human genome.
Olson, Link E; Yoder, Anne D
2002-01-01
The identification of inadvertently sequenced mitochondrial pseudogenes (numts) is critical to any study employing mitochondrial DNA sequence data. Failure to discriminate numts correctly can confound phylogenetic reconstruction and studies of molecular evolution. This is especially problematic for ribosomal mtDNA genes. Unlike protein-coding loci, whose pseudogenes tend to accumulate diagnostic frameshift or premature stop mutations, functional ribosomal genes are not constrained to maintain a reading frame and can accumulate insertion-deletion events of varying length, particularly in nonpairing regions. Several authors have advocated using structural features of the transcribed rRNA molecule to differentiate functional mitochondrial rRNA genes from their nuclear paralogs. We explored this approach using the mitochondrial 12S rRNA gene and three known 12S numts from the human genome in the context of anthropoid phylogeny and the inferred secondary structure of primate 12S rRNA. Contrary to expectation, each of the three human numts exhibits striking concordance with secondary structure models, with little, if any, indication of their pseudogene status, and would likely escape detection based on structural criteria alone. Furthermore, we show that the unwitting inclusion of a particularly ancient (18-25 Myr old) and surprisingly cryptic human numt in a phylogenetic analysis would yield a well-supported but dramatically incorrect conclusion regarding anthropoid relationships. Though we endorse the use of secondary structure models for inferring positional homology wholeheartedly, we caution against reliance on structural criteria for the discrimination of rRNA numts, given the potential fallibility of this approach.
Genome-wide analysis of Atlantic salmon (Salmo salar) mucin genes and their role as biomarkers
Grammes, Fabian Thomas; Ytteborg, Elisabeth; Takle, Harald; Jørgensen, Sven Martin
2017-01-01
The aim of this study was to identify potential mucin genes in the Atlantic salmon genome and evaluate tissue-specific distribution and transcriptional regulation in response to aquaculture-relevant stress conditions in post-smolts. Seven secreted gel-forming mucin genes were identified based on several layers of evidence; annotation, transcription, phylogeny and domain structure. Two genes were annotated as muc2 and five genes as muc5. The muc2 genes were predominantly transcribed in the intestinal region while the different genes in the muc5 family were mainly transcribed in either skin, gill or pyloric caeca. In order to investigate transcriptional regulation of mucins during stress conditions, two controlled experiments were conducted. In the first experiment, handling stress induced mucin transcription in the gill, while transcription decreased in the skin and intestine. In the second experiment, long term intensive rearing conditions (fish biomass ~125 kg/m3) interrupted by additional confinement led to increased transcription of mucin genes in the skin at one, seven and fourteen days post-confinement. PMID:29236729
When Outgroups Fail; Phylogenomics of Rooting the Emerging Pathogen, Coxiella burnetii
Pearson, Talima; Hornstra, Heidie M.; Sahl, Jason W.; Schaack, Sarah; Schupp, James M.; Beckstrom-Sternberg, Stephen M.; O'Neill, Matthew W.; Priestley, Rachael A.; Champion, Mia D.; Beckstrom-Sternberg, James S.; Kersh, Gilbert J.; Samuel, James E.; Massung, Robert F.; Keim, Paul
2013-01-01
Rooting phylogenies is critical for understanding evolution, yet the importance, intricacies and difficulties of rooting are often overlooked. For rooting, polymorphic characters among the group of interest (ingroup) must be compared to those of a relative (outgroup) that diverged before the last common ancestor (LCA) of the ingroup. Problems arise if an outgroup does not exist, is unknown, or is so distant that few characters are shared, in which case duplicated genes originating before the LCA can be used as proxy outgroups to root diverse phylogenies. Here, we describe a genome-wide expansion of this technique that can be used to solve problems at the other end of the evolutionary scale: where ingroup individuals are all very closely related to each other, but the next closest relative is very distant. We used shared orthologous single nucleotide polymorphisms (SNPs) from 10 whole genome sequences of Coxiella burnetii, the causative agent of Q fever in humans, to create a robust, but unrooted phylogeny. To maximize the number of characters informative about the rooting, we searched entire genomes for polymorphic duplicated regions where orthologs of each paralog could be identified so that the paralogs could be used to root the tree. Recent radiations, such as those of emerging pathogens, often pose rooting challenges due to a lack of ingroup variation and large genomic differences with known outgroups. Using a phylogenomic approach, we created a robust, rooted phylogeny for C. burnetii. [Coxiella burnetii; paralog SNPs; pathogen evolution; phylogeny; recent radiation; root; rooting using duplicated genes.] PMID:23736103
A phylogeny and revised classification of Squamata, including 4161 species of lizards and snakes
2013-01-01
Background The extant squamates (>9400 known species of lizards and snakes) are one of the most diverse and conspicuous radiations of terrestrial vertebrates, but no studies have attempted to reconstruct a phylogeny for the group with large-scale taxon sampling. Such an estimate is invaluable for comparative evolutionary studies, and to address their classification. Here, we present the first large-scale phylogenetic estimate for Squamata. Results The estimated phylogeny contains 4161 species, representing all currently recognized families and subfamilies. The analysis is based on up to 12896 base pairs of sequence data per species (average = 2497 bp) from 12 genes, including seven nuclear loci (BDNF, c-mos, NT3, PDC, R35, RAG-1, and RAG-2), and five mitochondrial genes (12S, 16S, cytochrome b, ND2, and ND4). The tree provides important confirmation for recent estimates of higher-level squamate phylogeny based on molecular data (but with more limited taxon sampling), estimates that are very different from previous morphology-based hypotheses. The tree also includes many relationships that differ from previous molecular estimates and many that differ from traditional taxonomy. Conclusions We present a new large-scale phylogeny of squamate reptiles that should be a valuable resource for future comparative studies. We also present a revised classification of squamates at the family and subfamily level to bring the taxonomy more in line with the new phylogenetic hypothesis. This classification includes new, resurrected, and modified subfamilies within gymnophthalmid and scincid lizards, and boid, colubrid, and lamprophiid snakes. PMID:23627680
2010-01-01
Background Mitochondria are a valuable resource for studying the evolutionary process and deducing phylogeny. A few mitochondria genomes have been sequenced, but a comprehensive picture of the domestication event for silkworm mitochondria remains to be established. In this study, we integrate the extant data, and perform a whole genome resequencing of Japanese wild silkworm to obtain breakthrough results in silkworm mitochondrial (mt) population, and finally use these to deduce a more comprehensive phylogeny of the Bombycidae. Results We identified 347 single nucleotide polymorphisms (SNPs) in the mt genome, but found no past recombination event to have occurred in the silkworm progenitor. A phylogeny inferred from these whole genome SNPs resulted in a well-classified tree, confirming that the domesticated silkworm, Bombyx mori, most recently diverged from the Chinese wild silkworm, rather than from the Japanese wild silkworm. We showed that the population sizes of the domesticated and Chinese wild silkworms both experience neither expansion nor contraction. We also discovered that one mt gene, named cytochrome b, shows a strong signal of positive selection in the domesticated clade. This gene is related to energy metabolism, and may have played an important role during silkworm domestication. Conclusions We present a comparative analysis on 41 mt genomes of B. mori and B. mandarina from China and Japan. With these, we obtain a much clearer picture of the evolution history of the silkworm. The data and analyses presented here aid our understanding of the silkworm in general, and provide a crucial insight into silkworm phylogeny. PMID:20334646
NASA Technical Reports Server (NTRS)
Giribet, Gonzalo; Edgecombe, Gregory D.; Wheeler, Ward C.; Babbitt, Courtney
2002-01-01
The ordinal level phylogeny of the Arachnida and the suprafamilial level phylogeny of the Opiliones were studied on the basis of a combined analysis of 253 morphological characters, the complete sequence of the 18S rRNA gene, and the D3 region of the 28S rRNA gene. Molecular data were collected for 63 terminal taxa. Morphological data were collected for 35 exemplar taxa of Opiliones, but groundplans were applied to some of the remaining chelicerate groups. Six extinct terminals, including Paleozoic scorpions, are scored for morphological characters. The data were analyzed using strict parsimony for the morphological data matrix and via direct optimization for the molecular and combined data matrices. A sensitivity analysis of 15 parameter sets was undertaken, and character congruence was used as the optimality criterion to choose among competing hypotheses. The results obtained are unstable for the high-level chelicerate relationships (except for Tetrapulmonata, Pedipalpi, and Camarostomata), and the sister group of the Opiliones is not clearly established, although the monophyly of Dromopoda is supported under many parameter sets. However, the internal phylogeny of the Opiliones is robust to parameter choice and allows the discarding of previous hypotheses of opilionid phylogeny such as the "Cyphopalpatores" or "Palpatores." The topology obtained is congruent with the previous hypothesis of "Palpatores" paraphyly as follows: (Cyphophthalmi (Eupnoi (Dyspnoi + Laniatores))). Resolution within the Eupnoi, Dyspnoi, and Laniatores (the latter two united as Dyspnolaniatores nov.) is also stable to the superfamily level, permitting a new classification system for the Opiliones. c2002 The Willi Hennig Society.
Phylogenetic composition of host plant communities drives plant-herbivore food web structure.
Volf, Martin; Pyszko, Petr; Abe, Tomokazu; Libra, Martin; Kotásková, Nela; Šigut, Martin; Kumar, Rajesh; Kaman, Ondřej; Butterill, Philip T; Šipoš, Jan; Abe, Haruka; Fukushima, Hiroaki; Drozd, Pavel; Kamata, Naoto; Murakami, Masashi; Novotny, Vojtech
2017-05-01
Insects tend to feed on related hosts. The phylogenetic composition of host plant communities thus plays a prominent role in determining insect specialization, food web structure, and diversity. Previous studies showed a high preference of insect herbivores for congeneric and confamilial hosts suggesting that some levels of host plant relationships may play more prominent role that others. We aim to quantify the effects of host phylogeny on the structure of quantitative plant-herbivore food webs. Further, we identify specific patterns in three insect guilds with different life histories and discuss the role of host plant phylogeny in maintaining their diversity. We studied herbivore assemblages in three temperate forests in Japan and the Czech Republic. Sampling from a canopy crane, a cherry picker and felled trees allowed a complete census of plant-herbivore interactions within three 0·1 ha plots for leaf chewing larvae, miners, and gallers. We analyzed the effects of host phylogeny by comparing the observed food webs with randomized models of host selection. Larval leaf chewers exhibited high generality at all three sites, whereas gallers and miners were almost exclusively monophagous. Leaf chewer generality dropped rapidly when older host lineages (5-80 myr) were collated into a single lineage but only decreased slightly when the most closely related congeneric hosts were collated. This shows that leaf chewer generality has been maintained by feeding on confamilial hosts while only a few herbivores were shared between more distant plant lineages and, surprisingly, between some congeneric hosts. In contrast, miner and galler generality was maintained mainly by the terminal nodes of the host phylogeny and dropped immediately after collating congeneric hosts into single lineages. We show that not all levels of host plant phylogeny are equal in their effect on structuring plant-herbivore food webs. In the case of generalist guilds, it is the phylogeny of deeper plant lineages that drives the food web structure whereas the terminal relationships play minor roles. In contrast, the specialization and abundance of monophagous guilds are affected mainly by the terminal parts of the plant phylogeny and do not generally reflect deeper host phylogeny. © 2017 The Authors. Journal of Animal Ecology © 2017 British Ecological Society.
Bello, María A.; Cubas, Pilar; Álvarez, Inés; Sanjuanbenito, Guillermo; Fuertes-Aguilar, Javier
2017-01-01
Homologs of the CYC/TB1 gene family have been independently recruited many times across the eudicots to control aspects of floral symmetry The family Asteraceae exhibits the largest known diversification in this gene paralog family accompanied by a parallel morphological floral richness in its specialized head-like inflorescence. In Asteraceae, whether or not CYC/TB1 gene floral symmetry function is preserved along organismic and gene lineages is unknown. In this study, we used phylogenetic, structural and expression analyses focused on the highly derived genus Anacyclus (tribe Anthemidae) to address this question. Phylogenetic reconstruction recovered eight main gene lineages present in Asteraceae: two from CYC1, four from CYC2 and two from CYC3-like genes. The species phylogeny was recovered in most of the gene lineages, allowing the delimitation of orthologous sets of CYC/TB1 genes in Asteraceae. Quantitative real-time PCR analysis indicated that in Anacyclus three of the four isolated CYC2 genes are more highly expressed in ray flowers. The expression of the four AcCYC2 genes overlaps in several organs including the ligule of ray flowers, as well as in anthers and ovules throughout development. PMID:28487706
Merckel, Michael C; Huiskonen, Juha T; Bamford, Dennis H; Goldman, Adrian; Tuma, Roman
2005-04-15
Comparisons of bacteriophage PRD1 and adenovirus protein structures and virion architectures have been instrumental in unraveling an evolutionary relationship and have led to a proposal of a phylogeny-based virus classification. The structure of the PRD1 spike protein P5 provides further insight into the evolution of viral proteins. The crystallized P5 fragment comprises two structural domains: a globular knob and a fibrous shaft. The head folds into a ten-stranded jelly roll beta barrel, which is structurally related to the tumor necrosis factor (TNF) and the PRD1 coat protein domains. The shaft domain is a structural counterpart to the adenovirus spike shaft. The structural relationships between PRD1, TNF, and adenovirus proteins suggest that the vertex proteins may have originated from an ancestral TNF-like jelly roll coat protein via a combination of gene duplication and deletion.
Asaf, Sajjad; Khan, Abdul Latif; Khan, Muhammad Aaqil; Waqas, Muhammad; Kang, Sang-Mo; Yun, Byung-Wook; Lee, In-Jung
2017-08-08
We investigated the complete chloroplast (cp) genomes of non-model Arabidopsis halleri ssp. gemmifera and Arabidopsis lyrata ssp. petraea using Illumina paired-end sequencing to understand their genetic organization and structure. Detailed bioinformatics analysis revealed genome sizes of both subspecies ranging between 154.4~154.5 kbp, with a large single-copy region (84,197~84,158 bp), a small single-copy region (17,738~17,813 bp) and pair of inverted repeats (IRa/IRb; 26,264~26,259 bp). Both cp genomes encode 130 genes, including 85 protein-coding genes, eight ribosomal RNA genes and 37 transfer RNA genes. Whole cp genome comparison of A. halleri ssp. gemmifera and A. lyrata ssp. petraea, along with ten other Arabidopsis species, showed an overall high degree of sequence similarity, with divergence among some intergenic spacers. The location and distribution of repeat sequences were determined, and sequence divergences of shared genes were calculated among related species. Comparative phylogenetic analysis of the entire genomic data set and 70 shared genes between both cp genomes confirmed the previous phylogeny and generated phylogenetic trees with the same topologies. The sister species of A. halleri ssp. gemmifera is A. umezawana, whereas the closest relative of A. lyrata spp. petraea is A. arenicola.
Kress, W John; Erickson, David L; Swenson, Nathan G; Thompson, Jill; Uriarte, Maria; Zimmerman, Jess K
2010-11-09
Species number, functional traits, and phylogenetic history all contribute to characterizing the biological diversity in plant communities. The phylogenetic component of diversity has been particularly difficult to quantify in species-rich tropical tree assemblages. The compilation of previously published (and often incomplete) data on evolutionary relationships of species into a composite phylogeny of the taxa in a forest, through such programs as Phylomatic, has proven useful in building community phylogenies although often of limited resolution. Recently, DNA barcodes have been used to construct a robust community phylogeny for nearly 300 tree species in a forest dynamics plot in Panama using a supermatrix method. In that study sequence data from three barcode loci were used to generate a well-resolved species-level phylogeny. Here we expand upon this earlier investigation and present results on the use of a phylogenetic constraint tree to generate a community phylogeny for a diverse, tropical forest dynamics plot in Puerto Rico. This enhanced method of phylogenetic reconstruction insures the congruence of the barcode phylogeny with broadly accepted hypotheses on the phylogeny of flowering plants (i.e., APG III) regardless of the number and taxonomic breadth of the taxa sampled. We also compare maximum parsimony versus maximum likelihood estimates of community phylogenetic relationships as well as evaluate the effectiveness of one- versus two- versus three-gene barcodes in resolving community evolutionary history. As first demonstrated in the Panamanian forest dynamics plot, the results for the Puerto Rican plot illustrate that highly resolved phylogenies derived from DNA barcode sequence data combined with a constraint tree based on APG III are particularly useful in comparative analysis of phylogenetic diversity and will enhance research on the interface between community ecology and evolution.
Near, Thomas J; Dornburg, Alex; Friedman, Matt
2014-11-01
The Gonorynchiformes are the sister lineage of the species-rich Otophysi and provide important insights into the diversification of ostariophysan fishes. Phylogenies of gonorynchiforms inferred using morphological characters and mtDNA gene sequences provide differing resolutions with regard to the sister lineage of all other gonorynchiforms (Chanos vs. Gonorynchus) and support for monophyly of the two miniaturized lineages Cromeria and Grasseichthys. In this study the phylogeny and divergence times of gonorynchiforms are investigated with DNA sequences sampled from nine nuclear genes and a published morphological character matrix. Bayesian phylogenetic analyses reveal substantial congruence among individual gene trees with inferences from eight genes placing Gonorynchus as the sister lineage to all other gonorynchiforms. Seven gene trees resolve Cromeria and Grasseichthys as a clade, supporting previous inferences using morphological characters. Phylogenies resulting from either concatenating the nuclear genes, performing a multispecies coalescent species tree analysis, or combining the morphological and nuclear gene DNA sequences resolve Gonorynchus as the living sister lineage of all other gonorynchiforms, strongly support the monophyly of Cromeria and Grasseichthys, and resolve a clade containing Parakneria, Cromeria, and Grasseichthys. The morphological dataset, which includes 13 gonorynchiform fossil taxa that range in age from Early Cretaceous to Eocene, was analyzed in combination with DNA sequences from the nine nuclear genes and a relaxed molecular clock to estimate times of evolutionary divergence. This "tip dating" strategy accommodates uncertainty in the phylogenetic resolution of fossil taxa that provide calibration information in the relaxed molecular clock analysis. The estimated age of the most recent common ancestor (MRCA) of living gonorynchiforms is slightly older than estimates from previous node dating efforts, but the molecular tip dating estimated ages of Kneriinae (Kneria, Parakneria, Cromeria, and Grasseichthys) and the two paedomorphic lineages, Cromeria and Grasseichthys, are considerably younger. Copyright © 2014 Elsevier Inc. All rights reserved.
Structural analysis of the α subunit of Na(+)/K(+) ATPase genes in invertebrates.
Thabet, Rahma; Rouault, J-D; Ayadi, Habib; Leignel, Vincent
2016-01-01
The Na(+)/K(+) ATPase is a ubiquitous pump coordinating the transport of Na(+) and K(+) across the membrane of cells and its role is fundamental to cellular functions. It is heteromer in eukaryotes including two or three subunits (α, β and γ which is specific to the vertebrates). The catalytic functions of the enzyme have been attributed to the α subunit. Several complete α protein sequences are available, but only few gene structures were characterized. We identified the genomic sequences coding the α-subunit of the Na(+)/K(+) ATPase, from the whole-genome shotgun contigs (WGS), NCBI Genomes (chromosome), Genomic Survey Sequences (GSS) and High Throughput Genomic Sequences (HTGS) databases across distinct phyla. One copy of the α subunit gene was found in Annelida, Arthropoda, Cnidaria, Echinodermata, Hemichordata, Mollusca, Placozoa, Porifera, Platyhelminthes, Urochordata, but the nematodes seem to possess 2 to 4 copies. The number of introns varied from 0 (Platyhelminthes) to 26 (Porifera); and their localization and length are also highly variable. Molecular phylogenies (Maximum Likelihood and Maximum Parsimony methods) showed some clusters constituted by (Chordata/(Echinodermata/Hemichordata)) or (Plathelminthes/(Annelida/Mollusca)) and a basal position for Porifera. These structural analyses increase our knowledge about the evolutionary events of the α subunit genes in the invertebrates. Copyright © 2016 Elsevier Inc. All rights reserved.
Wang, Cheng; Dong, Da; Strong, P J; Zhu, Weijing; Ma, Zhuang; Qin, Yong; Wu, Weixiang
2017-08-16
Animal manure is a reservoir of antibiotic resistance genes (ARGs) that pose a potential health risk globally, especially for resistance to the antibiotics commonly used in livestock production (such as tetracycline, sulfonamide, and fluoroquinolone). Currently, the effects of biological treatment (composting) on the transcriptional response of manure ARGs and their microbial hosts are not well characterized. Composting is a dynamic process that consists of four distinct phases that are distinguished by the temperature resulting from microbial activity, namely the mesophilic, thermophilic, cooling, and maturing phases. In this study, changes of resistome expression were determined and related to active microbiome profiles during the dynamic composting process. This was achieved by integrating metagenomic and time series metatranscriptomic data for the evolving microbial community during composting. Composting noticeably reduced the aggregated expression level of the manure resistome, which primarily consisted of genes encoding for tetracycline, vancomycin, fluoroquinolone, beta-lactam, and aminoglycoside resistance, as well as efflux pumps. Furthermore, a varied transcriptional response of resistome to composting at the ARG levels was highlighted. The expression of tetracycline resistance genes (tetM-tetW-tetO-tetS) decreased during composting, where distinctive shifts in the four phases of composting were related to variations in antibiotic concentration. Composting had no effect on the expression of sulfonamide and fluoroquinolone resistance genes, which increased slightly during the thermophilic phase and then decreased to initial levels. As indigenous populations switched greatly throughout the dynamic composting, the core resistome persisted and their reservoir hosts' composition was significantly correlated with dynamic active microbial phylogenetic structure. Hosts for sulfonamide and fuoroquinolone resistance genes changed notably in phylognetic structure and underwent an initial increase and then a decrease in abundance. By contrast, hosts for tetracycline resistance genes (tetM-tetW-tetO-tetS) exhibited a constant decline through time. The transcriptional patterns of a core resistome over the course of composting were identified, and microbial phylogeny was the key determinant in defining the varied transcriptional response of resistome to this dynamic biological process. This research demonstrated the benefits of composting for manure treatment. It reduced the risk of emerging environmental contaminants such as tetracyclines, tetracycline resistance genes, and clinically relevant pathogens carrying ARGs, as well as RNA viruses and bacteriophages.
Corradi, Nicolas; Hijri, Mohamed; Fumagalli, Luca; Sanders, Ian R
2004-11-01
The genes encoding alpha- and beta-tubulins have been widely sampled in most major fungal phyla and they are useful tools for fungal phylogeny. Here, we report the first isolation of alpha-tubulin sequences from arbuscular mycorrhizal fungi (AMF). In parallel, AMF beta-tubulins were sampled and analysed to identify the presence of paralogs of this gene. The AMF alpha-tubulin amino acid phylogeny was congruent with the results previously reported for AMF beta-tubulins and showed that AMF tubulins group together at a basal position in the fungal clade and showed high sequence similarities with members of the Chytridiomycota. This is in contrast with phylogenies for other regions of the AMF genome. The amount and nature of substitutions are consistent with an ancient divergence of both orthologs and paralogs of AMF tubulins. At the amino acid level, however, AMF tubulins have hardly evolved from those of the chytrids. This is remarkable given that these two groups are ancient and the monophyletic Glomeromycota probably diverged from basal fungal ancestors at least 500 million years ago. The specific primers we designed for the AMF tubulins, together with the high molecular variation we found among the AMF species we analysed, make AMF tubulin sequences potentially useful for AMF identification purposes.
Phylogeny of Rieske/cytb Complexes with a Special Focus on the Haloarchaeal Enzymes
Baymann, Frauke; Schoepp-Cothenet, Barbara; Lebrun, Evelyne; van Lis, Robert; Nitschke, Wolfgang
2012-01-01
Rieske/cytochrome b (Rieske/cytb) complexes are proton pumping quinol oxidases that are present in most bacteria and Archaea. The phylogeny of their subunits follows closely the 16S-rRNA phylogeny, indicating that chemiosmotic coupling was already present in the last universal common ancestor of Archaea and bacteria. Haloarchaea are the only organisms found so far that acquired Rieske/cytb complexes via interdomain lateral gene transfer. They encode two Rieske/cytb complexes in their genomes; one of them is found in genetic context with nitrate reductase genes and has its closest relatives among Actinobacteria and the Thermus/Deinococcus group. It is likely to function in nitrate respiration. The second Rieske/cytb complex of Haloarchaea features a split cytochrome b sequence as do Cyanobacteria, chloroplasts, Heliobacteria, and Bacilli. It seems that Haloarchaea acquired this complex from an ancestor of the above-mentioned phyla. Its involvement in the bioenergetic reaction chains of Haloarchaea is unknown. We present arguments in favor of the hypothesis that the ancestor of Haloarchaea, which relied on a highly specialized bioenergetic metabolism, that is, methanogenesis, and was devoid of quinones and most enzymes of anaerobic or aerobic bioenergetic reaction chains, integrated laterally transferred genes into its genome to respond to a change in environmental conditions that made methanogenesis unfavorable. PMID:22798450
Frequent gene flow blurred taxonomic boundaries of sections in Lilium L. (Liliaceae)
Liu, Shih-Hui; Chiang, Tzen-Yuh
2017-01-01
Gene flow between species may last a long time in plants. Reticulation inevitably causes difficulties in phylogenetic reconstruction. In this study, we looked into the genetic divergence and phylogeny of 20 Lilium species based on multilocus analyses of 8 genes of chloroplast DNA (cpDNA), the internally transcribed nuclear ribosomal DNA (nrITS) spacer and 20 loci extracted from the expressed sequence tag (EST) libraries of L. longiflorum Thunb. and L. formosanum Wallace. The phylogeny based on the combined data of the maternally inherited cpDNA and nrITS was largely consistent with the taxonomy of Lilium sections. This phylogeny was deemed the hypothetical species tree and uncovered three groups, i.e., Cluster A consisting of 4 taxa from the sections Pseudolirium and Liriotypus, Cluster B consisting of the 4 taxa from the sections Leucolirion, Archelirion and Daurolirion, and Cluster C comprising 10 taxa mostly from the sections Martagon and Sinomartagon. In contrast, systematic inconsistency occurred across the EST loci, with up to 19 genes (95%) displaying tree topologies deviating from the hypothetical species tree. The phylogenetic incongruence was likely attributable to the frequent genetic exchanges between species/sections, as indicated by the high levels of genetic recombination and the IMa analyses with the EST loci. Nevertheless, multilocus analysis could provide complementary information among the loci on the species split and the extent of gene flow between the species. In conclusion, this study not only detected frequent gene flow among Lilium sections that resulted in phylogenetic incongruence but also reconstructed a hypothetical species tree that gave insights into the nature of the complex relationships among Lilium species. PMID:28841664
Regier, Jerome C.; Brown, John W.; Mitter, Charles; Baixeras, Joaquín; Cho, Soowon; Cummings, Michael P.; Zwick, Andreas
2012-01-01
Background Tortricidae, one of the largest families of microlepidopterans, comprise about 10,000 described species worldwide, including important pests, biological control agents and experimental models. Understanding of tortricid phylogeny, the basis for a predictive classification, is currently provisional. We present the first detailed molecular estimate of relationships across the tribes and subfamilies of Tortricidae, assess its concordance with previous morphological evidence, and re-examine postulated evolutionary trends in host plant use and biogeography. Methodology/Principal Findings We sequenced up to five nuclear genes (6,633 bp) in each of 52 tortricids spanning all three subfamilies and 19 of the 22 tribes, plus up to 14 additional genes, for a total of 14,826 bp, in 29 of those taxa plus all 14 outgroup taxa. Maximum likelihood analyses yield trees that, within Tortricidae, differ little among data sets and character treatments and are nearly always strongly supported at all levels of divergence. Support for several nodes was greatly increased by the additional 14 genes sequenced in just 29 of 52 tortricids, with no evidence of phylogenetic artifacts from deliberately incomplete gene sampling. There is strong support for the monophyly of Tortricinae and of Olethreutinae, and for grouping of these to the exclusion of Chlidanotinae. Relationships among tribes are robustly resolved in Tortricinae and mostly so in Olethreutinae. Feeding habit (internal versus external) is strongly conserved on the phylogeny. Within Tortricinae, a clade characterized by eggs being deposited in large clusters, in contrast to singly or in small batches, has markedly elevated incidence of polyphagous species. The five earliest-branching tortricid lineages are all species-poor tribes with mainly southern/tropical distributions, consistent with a hypothesized Gondwanan origin for the family. Conclusions/Significance We present the first robustly supported phylogeny for Tortricidae, and a revised classification in which all of the sampled tribes are now monophyletic. PMID:22536410
Rearrangement moves on rooted phylogenetic networks
Gambette, Philippe; van Iersel, Leo; Jones, Mark; Scornavacca, Celine
2017-01-01
Phylogenetic tree reconstruction is usually done by local search heuristics that explore the space of the possible tree topologies via simple rearrangements of their structure. Tree rearrangement heuristics have been used in combination with practically all optimization criteria in use, from maximum likelihood and parsimony to distance-based principles, and in a Bayesian context. Their basic components are rearrangement moves that specify all possible ways of generating alternative phylogenies from a given one, and whose fundamental property is to be able to transform, by repeated application, any phylogeny into any other phylogeny. Despite their long tradition in tree-based phylogenetics, very little research has gone into studying similar rearrangement operations for phylogenetic network—that is, phylogenies explicitly representing scenarios that include reticulate events such as hybridization, horizontal gene transfer, population admixture, and recombination. To fill this gap, we propose “horizontal” moves that ensure that every network of a certain complexity can be reached from any other network of the same complexity, and “vertical” moves that ensure reachability between networks of different complexities. When applied to phylogenetic trees, our horizontal moves—named rNNI and rSPR—reduce to the best-known moves on rooted phylogenetic trees, nearest-neighbor interchange and rooted subtree pruning and regrafting. Besides a number of reachability results—separating the contributions of horizontal and vertical moves—we prove that rNNI moves are local versions of rSPR moves, and provide bounds on the sizes of the rNNI neighborhoods. The paper focuses on the most biologically meaningful versions of phylogenetic networks, where edges are oriented and reticulation events clearly identified. Moreover, our rearrangement moves are robust to the fact that networks with higher complexity usually allow a better fit with the data. Our goal is to provide a solid basis for practical phylogenetic network reconstruction. PMID:28763439
Hughes, A L
1998-03-01
Protein phylogenies were used to test the hypothesis that aspects of the innate immune system of vertebrates have been conserved since the last common ancestor of vertebrates and arthropods. The phylogeny of lysozymes showed evidence of conservation of function, but phylogenies of seven other protein families did not. Natural resistance-associated macrophage protein, nitric oxide synthetase, and serine protease families all showed a pattern of gene duplication within vertebrates after their divergence from arthropods, giving rise to immune system-expressed genes in vertebrates. Insect hemolin, a member of the immunoglobulin superfamily, was found not to be closely related to members of that family having an immune system role in vertebrates; rather, it appeared most closely related to both arthropod and vertebrate molecules expressed in the nervous system. Thus, hemolin seems to have evolved its role independently in insects, probably through duplication of a neuroglian-like ancestor. Furthermore, vertebrate immune system-expressed serpins, chitinases, and pentraxins were found to lack orthologous relationships with arthropod members of the same families also functioning in immunity. Therefore members of these families have evolved immune system functions independently in the two phyla. It is now widely recognized that the specific immune system of vertebrates has no counterpart in invertebrates; these phylogenetic analyses suggest that there is a similar evolutionary discontinuity with respect to innate immunity as well.
Dumack, Kenneth; Mylnikov, Alexander P; Bonkowski, Michael
2017-07-01
The genus Kraken represents a distinct lineage of filose amoebae within the Cercozoa. Currently a single species, Kraken carinae, has been described. SSU rDNA phylogeny showed an affiliation to the Cercomonadida, branching with weak support at its base, close to Paracercomonas, Metabolomonas, and Brevimastigomonas. Light microscopical analyses showed several unique features of the genus Kraken, but ultrastructure data were lacking. In this study, K. carinae has been studied by electron microscopy, these data conjoined with a two-gene phylogeny were used to give more insight into the evolutionary relationship of the genus Kraken within Cercozoa. The data confirmed the absence of flagella, but also showed novel characteristics, such as the presence of extrusomes, osmiophilic bodies, and mitochondria with flat cristae. Surprising was the presence of single-tier scales which are carried by cell outgrowths, much of what is expected of the last common ancestor of the class Imbricatea. The phylogenetic analyses however confirmed previous results, indicating Kraken as a sister group to Paracercomonas in Sarcomonadea with an increased but still low support of 0.98 PP/63 BP. Based on the unique features of Kraken we establish the Krakenidae fam. nov. that we, due to contradictory results in morphology and phylogeny, assign incertae sedis, Monadofilosa. Copyright © 2017 Elsevier GmbH. All rights reserved.
Dorn, Patricia L; de la Rúa, Nicholas M; Axen, Heather; Smith, Nicholas; Richards, Bethany R; Charabati, Jirias; Suarez, Julianne; Woods, Adrienne; Pessoa, Rafaela; Monroy, Carlota; Kilpatrick, C William; Stevens, Lori
2016-10-01
The widespread and diverse Triatoma dimidiata is the kissing bug species most important for Chagas disease transmission in Central America and a secondary vector in Mexico and northern South America. Its diversity may contribute to different Chagas disease prevalence in different localities and has led to conflicting systematic hypotheses describing various populations as subspecies or cryptic species. To resolve these conflicting hypotheses, we sequenced a nuclear (internal transcribed spacer 2, ITS-2) and mitochondrial gene (cytochrome b) from an extensive sampling of T. dimidiata across its geographic range. We evaluated the congruence of ITS-2 and cyt b phylogenies and tested the support for the previously proposed subspecies (inferred from ITS-2) by: (1) overlaying the ITS-2 subspecies assignments on a cyt b tree and, (2) assessing the statistical support for a cyt b topology constrained by the subspecies hypothesis. Unconstrained phylogenies inferred from ITS-2 and cyt b are congruent and reveal three clades including two putative cryptic species in addition to T. dimidiata sensu stricto. Neither the cyt b phylogeny nor hypothesis testing support the proposed subspecies inferred from ITS-2. Additionally, the two cryptic species are supported by phylogenies inferred from mitochondrially-encoded genes cytochrome c oxidase I and NADH dehydrogenase 4. In summary, our results reveal two cryptic species. Phylogenetic relationships indicate T. dimidiata sensu stricto is not subdivided into monophyletic clades consistent with subspecies. Based on increased support by hypothesis testing, we propose an updated systematic hypothesis for T. dimidiata based on extensive taxon sampling and analysis of both mitochondrial and nuclear genes. Copyright © 2016 Elsevier B.V. All rights reserved.
Tekle, Yonas I; Anderson, O Roger; Katz, Laura A; Maurer-Alcalá, Xyrus X; Romero, Mario Alberto Cerón; Molestina, Robert
2016-06-01
The majority of amoeboid lineages with flattened body forms are placed under a taxonomic hypothetical class 'Discosea' sensu Smirnov et al. (2011), which encompasses some of the most diverse morphs within Amoebozoa. However, its taxonomy and phylogeny is poorly understood. This is partly due to lack of support in studies that are based on limited gene sampling. In this study we use a phylogenomic approach including newly-generated RNA-Seq data and comprehensive taxon sampling to resolve the phylogeny of 'Discosea'. Our analysis included representatives from all orders of 'Discosea' and up to 550 genes, the largest gene sampling in Amoebozoa to date. We conducted extensive analyses to assess the robustness of our resulting phylogenies to effects of missing data and outgroup choice using probabilistic methods. All of our analyses, which explore the impact of varying amounts of missing data, consistently recover well-resolved and supported groups of Amoebozoa. Our results neither support the monophyly nor dichotomy of 'Discosea' as defined by Smirnov et al. (2011). Rather, we recover a robust well-resolved clade referred to as Eudiscosea encompassing the majority of discosean orders (seven of the nine studied here), while the Dactylopodida, Thecamoebida and Himatismenida, previously included in 'Discosea,' are non-monophyletic. We also recover novel relationships within the Eudiscosea that are largely congruent with morphology. Our analyses enabled us to place some incertae sedis lineages and previously unstable lineages such as Vermistella, Mayorella, Gocevia, and Stereomyxa. We recommend some phylogeny-based taxonomic amendments highlighting the new findings of this study and discuss the evolution of the group based on our current understanding. Copyright © 2016 Elsevier Inc. All rights reserved.
Liu, Jun; Li, Qi; Kong, Lingfeng; Yu, Hong; Zheng, Xiaodong
2011-09-01
Oysters (family Ostreidae), with high levels of phenotypic plasticity and wide geographic distribution, are a challenging group for taxonomists and phylogenetics. As a useful tool for molecular species identification, DNA barcoding might offer significant potential for oyster identification and taxonomy. This study used two mitochondrial fragments, cytochrome c oxidase I (COI) and the large ribosomal subunit (16S rDNA), to assess whether oyster species could be identified by phylogeny and distance-based DNA barcoding techniques. Relationships among species were estimated by the phylogenetic analyses of both genes, and then pairwise inter- and intraspecific genetic divergences were assessed. Species forming well-differentiated clades in the molecular phylogenies were identical for both genes even when the closely related species were included. Intraspecific variability of 16S rDNA overlapped with interspecific divergence. However, average intra- and interspecific genetic divergences for COI were 0-1.4% (maximum 2.2%) and 2.6-32.2% (minimum 2.2%), respectively, indicating the existence of a barcoding gap. These results confirm the efficacy of species identification in oysters via DNA barcodes and phylogenetic analysis. © 2011 Blackwell Publishing Ltd.
Orr, Russell J. S.; Murray, Shauna A.; Stüken, Anke; Rhodes, Lesley; Jakobsen, Kjetill S.
2012-01-01
The dinoflagellates are a diverse lineage of microbial eukaryotes. Dinoflagellate monophyly and their position within the group Alveolata are well established. However, phylogenetic relationships between dinoflagellate orders remain unresolved. To date, only a limited number of dinoflagellate studies have used a broad taxon sample with more than two concatenated markers. This lack of resolution makes it difficult to determine the evolution of major phenotypic characters such as morphological features or toxin production e.g. saxitoxin. Here we present an improved dinoflagellate phylogeny, based on eight genes, with the broadest taxon sampling to date. Fifty-five sequences for eight phylogenetic markers from nuclear and mitochondrial regions were amplified from 13 species, four orders, and concatenated phylogenetic inferences were conducted with orthologous sequences. Phylogenetic resolution is increased with addition of support for the deepest branches, though can be improved yet further. We show for the first time that the characteristic dinoflagellate thecal plates, cellulosic material that is present within the sub-cuticular alveoli, appears to have had a single origin. In addition, the monophyly of most dinoflagellate orders is confirmed: the Dinophysiales, the Gonyaulacales, the Prorocentrales, the Suessiales, and the Syndiniales. Our improved phylogeny, along with results of PCR to detect the sxtA gene in various lineages, allows us to suggest that this gene was probably acquired separately in Gymnodinium and the common ancestor of Alexandrium and Pyrodinium and subsequently lost in some descendent species of Alexandrium. PMID:23185516
Teske, Peter R; Cherry, Michael I; Matthee, Conrad A
2004-02-01
Sequence data derived from four markers (the nuclear RP1 and Aldolase and the mitochondrial 16S rRNA and cytochrome b genes) were used to determine the phylogenetic relationships among 32 species belonging to the genus Hippocampus. There were marked differences in the rate of evolution among these gene fragments, with Aldolase evolving the slowest and the mtDNA cytochrome b gene the fastest. The RP1 gene recovered the highest number of nodes supported by >70% bootstrap values from parsimony analysis and >95% posterior probabilities from Bayesian inference. The combined analysis based on 2317 nucleotides resulted in the most robust phylogeny. A distinct phylogenetic split was identified between the pygmy seahorse, Hippocampus bargibanti, and a clade including all other species. Three species from the western Pacific Ocean included in our study, namely H. bargibanti, H. breviceps, and H. abdominalis occupy basal positions in the phylogeny. This and the high species richness in the region suggests that the genus evolved somewhere in the West Pacific. There is also fairly strong molecular support for the remaining species being subdivided into three main evolutionary lineages: two West Pacific clades and a clade of species present in both the Indo-Pacific and the Atlantic Ocean. The phylogeny obtained herein suggests at least two independent colonization events of the Atlantic Ocean, once before the closure of the Tethyan seaway, and once afterwards.
Chaw, Shu-Miaw; Walters, Terrence W; Chang, Chien-Chang; Hu, Shu-Hsuan; Chen, Shin-Hsiao
2005-10-01
Phylogenetic relationships among the three families and 12 living genera of cycads were reconstructed by distance and parsimony criteria using three markers: the chloroplast matK gene, the chloroplast trnK intron and the nuclear ITS/5.8S rDNA sequence. All datasets indicate that Cycadaceae (including only the genus Cycas) is remotely related to other cycads, in which Dioon was resolved as the basal-most clade, followed by Bowenia and a clade containing the remaining nine genera. Encephalartos and Lepidozamia are closer to each other than to Macrozamia. The African genus Stangeria is embedded within the New World subfamily Zamiodeae. Therefore, Bowenia is an unlikely sister to Stangeria, contrary to the view that they form the Stangeriaceae. The generic status of Dyerocycas and Chigua is unsupportable as they are paraphyletic with Cycas and the Zamia, respectively. Nonsense mutations in the matK gene and indels in the other two datasets lend evidence to reinforce the above conclusions. According to the phylogenies, the past geography of the genera of cycads and the evolution of character states are hypothesized and discussed. Within the suborder Zamiieae, Stangeria, and the tribe Zamieae evolved significantly faster than other genera. The matK gene and ITS/5.8S region contain more useful information than the trnK intron in addressing phylogeny. Redelimitations of Zamiaceae, Stangeriaceae, subfamily Encephalartoideae and subtribe Macrozamiineae are necessary.
Lanier, Hayley C; Knowles, L Lacey
2015-02-01
Coalescent-based methods for species-tree estimation are becoming a dominant approach for reconstructing species histories from multi-locus data, with most of the studies examining these methodologies focused on recently diverged species. However, deeper phylogenies, such as the datasets that comprise many Tree of Life (ToL) studies, also exhibit gene-tree discordance. This discord may also arise from the stochastic sorting of gene lineages during the speciation process (i.e., reflecting the random coalescence of gene lineages in ancestral populations). It remains unknown whether guidelines regarding methodologies and numbers of loci established by simulation studies at shallow tree depths translate into accurate species relationships for deeper phylogenetic histories. We address this knowledge gap and specifically identify the challenges and limitations of species-tree methods that account for coalescent variance for deeper phylogenies. Using simulated data with characteristics informed by empirical studies, we evaluate both the accuracy of estimated species trees and the characteristics associated with recalcitrant nodes, with a specific focus on whether coalescent variance is generally responsible for the lack of resolution. By determining the proportion of coalescent genealogies that support a particular node, we demonstrate that (1) species-tree methods account for coalescent variance at deep nodes and (2) mutational variance - not gene-tree discord arising from the coalescent - posed the primary challenge for accurate reconstruction across the tree. For example, many nodes were accurately resolved despite predicted discord from the random coalescence of gene lineages and nodes with poor support were distributed across a range of depths (i.e., they were not restricted to a particular recent divergences). Given their broad taxonomic scope and large sampling of taxa, deep level phylogenies pose several potential methodological complications including difficulties with MCMC convergence and estimation of requisite population genetic parameters for coalescent-based approaches. Despite these difficulties, the findings generally support the utility of species-tree analyses for the estimation of species relationships throughout the ToL. We discuss strategies for successful application of species-tree approaches to deep phylogenies. Copyright © 2014 Elsevier Inc. All rights reserved.
Eiler, Alexander; Zaremba-Niedzwiedzka, Katarzyna; Martínez-García, Manuel; McMahon, Katherine D; Stepanauskas, Ramunas; Andersson, Siv G E; Bertilsson, Stefan
2014-01-01
Little is known about the diversity and structuring of freshwater microbial communities beyond the patterns revealed by tracing their distribution in the landscape with common taxonomic markers such as the ribosomal RNA. To address this gap in knowledge, metagenomes from temperate lakes were compared to selected marine metagenomes. Taxonomic analyses of rRNA genes in these freshwater metagenomes confirm the previously reported dominance of a limited subset of uncultured lineages of freshwater bacteria, whereas Archaea were rare. Diversification into marine and freshwater microbial lineages was also reflected in phylogenies of functional genes, and there were also significant differences in functional beta-diversity. The pathways and functions that accounted for these differences are involved in osmoregulation, active transport, carbohydrate and amino acid metabolism. Moreover, predicted genes orthologous to active transporters and recalcitrant organic matter degradation were more common in microbial genomes from oligotrophic versus eutrophic lakes. This comparative metagenomic analysis allowed us to formulate a general hypothesis that oceanic- compared with freshwater-dwelling microorganisms, invest more in metabolism of amino acids and that strategies of carbohydrate metabolism differ significantly between marine and freshwater microbial communities. PMID:24118837
Wu, Hai-Yan; Ji, Xiao-Yu; Yu, Wei-Wei; Du, Yu-Zhou
2014-03-10
We present the complete mitogenome of a stonefly, Cryptoperla stilifera Sivec (Plecoptera; Peltoperlidae). The mitogenome was a circular molecule consisting of 15,633 nucleotides, 37 genes and a A+T-rich region. C. stilifera mitogenome was similar to Pteronarcys princeps mitogenome (Plecoptera; Pteronarcyidae). All transfer RNA genes (tRNAs) had typical cloverleaf secondary structures except for trnSer (AGN), where the stem-loop structure of the dihydrouridine (DHU) arm was missing. The A+T-rich region of C. stilifera had two stem-loops and each had two interlink. Three conserved sequence blocks (CSBs) were present in the A+T-rich regions of C. stilifera, Peltoperla tarteri and Peltoperla arcuata. Moreover, many polynucleotide stretches (Poly N, N=A, T and C) in the A+T-rich region of C. stilifera Phylogenetic relationships of Polyneopteran species were constructed based on the nucleotide sequences of 13 protein coding genes (PCGs). Both maximum likelihood (ML) and Bayesian inference (BI) analyses supported Grylloblattodea as the sister group to Plecoptera+Dermaptera and Embiidina and Phasmatodea as sister groups. Copyright © 2014 Elsevier B.V. All rights reserved.
Alternaria section Alternaria: Species, formae speciales or pathotypes?
Woudenberg, J.H.C.; Seidl, M.F.; Groenewald, J.Z.; de Vries, M.; Stielow, J.B.; Thomma, B.P.H.J.; Crous, P.W.
2015-01-01
The cosmopolitan fungal genus Alternaria consists of multiple saprophytic and pathogenic species. Based on phylogenetic and morphological studies, the genus is currently divided into 26 sections. Alternaria sect. Alternaria contains most of the small-spored Alternaria species with concatenated conidia, including important plant, human and postharvest pathogens. Species within sect. Alternaria have been mostly described based on morphology and / or host-specificity, yet molecular variation between them is minimal. To investigate whether the described morphospecies within sect. Alternaria are supported by molecular data, whole-genome sequencing of nine Alternaria morphospecies supplemented with transcriptome sequencing of 12 Alternaria morphospecies as well as multi-gene sequencing of 168 Alternaria isolates was performed. The assembled genomes ranged in size from 33.3–35.2 Mb within sect. Alternaria and from 32.0–39.1 Mb for all Alternaria genomes. The number of repetitive sequences differed significantly between the different Alternaria genomes; ranging from 1.4–16.5 %. The repeat content within sect. Alternaria was relatively low with only 1.4–2.7 % of repeats. Whole-genome alignments revealed 96.7–98.2 % genome identity between sect. Alternaria isolates, compared to 85.1–89.3 % genome identity for isolates from other sections to the A. alternata reference genome. Similarly, 1.4–2.8 % and 0.8–1.8 % single nucleotide polymorphisms (SNPs) were observed in genomic and transcriptomic sequences, respectively, between isolates from sect. Alternaria, while the percentage of SNPs found in isolates from different sections compared to the A. alternata reference genome was considerably higher; 8.0–10.3 % and 6.1–8.5 %. The topology of a phylogenetic tree based on the whole-genome and transcriptome reads was congruent with multi-gene phylogenies based on commonly used gene regions. Based on the genome and transcriptome data, a set of core proteins was extracted, and primers were designed on two gene regions with a relatively low degree of conservation within sect. Alternaria (96.8 and 97.3 % conservation). Their potential discriminatory power within sect. Alternaria was tested next to nine commonly used gene regions in sect. Alternaria, namely the SSU, LSU, ITS, gapdh, rpb2, tef1, Alt a 1, endoPG and OPA10-2 gene regions. The phylogenies from the two gene regions with a relatively low conservation, KOG1058 and KOG1077, could not distinguish the described morphospecies within sect. Alternaria more effectively than the phylogenies based on the commonly used gene regions for Alternaria. Based on genome and transcriptome comparisons and molecular phylogenies, Alternaria sect. Alternaria consists of only 11 phylogenetic species and one species complex. Thirty-five morphospecies, which cannot be distinguished based on the multi-gene phylogeny, are synonymised under A. alternata. By providing guidelines for the naming and identification of phylogenetic species in Alternaria sect. Alternaria, this manuscript provides a clear and stable species classification in this section. PMID:26951037
Mitochondrial genomes of parasitic flatworms.
Le, Thanh H; Blair, David; McManus, Donald P
2002-05-01
Complete or near-complete mitochondrial genomes are now available for 11 species or strains of parasitic flatworms belonging to the Trematoda and the Cestoda. The organization of these genomes is not strikingly different from those of other eumetazoans, although one gene (atp8) commonly found in other phyla is absent from flatworms. The gene order in most flatworms has similarities to those seen in higher protostomes such as annelids. However, the gene order has been drastically altered in Schistosoma mansoni, which obscures this possible relationship. Among the sequenced taxa, base composition varies considerably, creating potential difficulties for phylogeny reconstruction. Long non-coding regions are present in all taxa, but these vary in length from only a few hundred to approximately 10000 nucleotides. Among Schistosoma spp., the long non-coding regions are rich in repeats and length variation among individuals is known. Data from mitochondrial genomes are valuable for studies on species identification, phylogenies and biogeography.
Setoguchi, H; Watanabe, I
2000-06-01
Hybridization and introgression play important roles in plant evolution, and their occurrence on the oceanic islands provides good examples of plant speciation and diversification. Restriction fragment length polymorphisms (RFLPs) and trnL (UAA) 3'exon-trnF (GAA) intergenic spacer (IGS) sequences of chloroplast DNA (cpDNA), and the sequences of internal transcribed spacer (ITS) of nuclear ribosomal DNA were examined to investigate the occurrence of gene transfer in Ilex species on the Bonin Islands and the Ryukyu Islands in Japan. A gene phylogeny for the plastid genome is in agreement with the morphologically based taxonomy, whereas the nuclear genome phylogeny clusters putatively unrelated endemics both on the Bonin and the Ryukyu Islands. Intersectional hybridization and nuclear gene flow were independently observed in insular endemics of Ilex on both sets of islands without evidence of plastid introgression. Gene flow observed in these island systems can be explained by ecological features of insular endemics, i.e., limits of distribution range or sympatric distribution in a small land area.
Owen, Christopher L; Marshall, David C; Hill, Kathy B R; Simon, Chris
2015-02-01
The Pauropsalta generic complex is a large group of cicadas (72 described spp.; >82 undescribed spp.) endemic to Australia. No previous molecular work on deep level relationships within this complex has been conducted, but a recent morphological revision and phylogenetic analysis proposed relationships among the 11 genera. We present here the first comprehensive molecular phylogeny of the complex using five loci (1 mtDNA, 4 nDNA), two of which are from nuclear genes new to cicada systematics. We compare the molecular phylogeny to the morphological phylogeny. We evaluate the phylogenetic informativeness of the new loci to traditional cicada systematics loci to generate a baseline of performance and behavior to aid in gene choice decisions in future systematic and phylogenomic studies. Our maximum likelihood and Bayesian inference phylogenies strongly support the monophyly of most of the newly described genera; however, relationships among genera differ from the morphological phylogeny. A comparison of phylogenetic informativeness among all loci revealed that COI 3rd positions dominate the informativeness profiles relative to all other loci but exhibit some among taxon nucleotide bias. After removing COI 3rd positions, COI 1st positions dominate near the terminals, while the period intron has the most phylogenetic informativeness near the root. Among the nuclear loci, ARD1 and QtRNA have lower phylogenetic informativeness than period intron and elongation factor 1 alpha intron, but the informativeness increases at you move from the tips to the root. The increase in phylogenetic informativeness deeper in the tree suggests these loci may be useful for resolving older relationships. Copyright © 2015. Published by Elsevier Inc.
Dubey, Bhawna; Meganathan, P R; Haque, Ikramul
2012-07-01
This paper reports the complete mitochondrial genome sequence of an endangered Indian snake, Python molurus molurus (Indian Rock Python). A typical snake mitochondrial (mt) genome of 17258 bp length comprising of 37 genes including the 13 protein coding genes, 22 tRNA genes, and 2 ribosomal RNA genes along with duplicate control regions is described herein. The P. molurus molurus mt. genome is relatively similar to other snake mt. genomes with respect to gene arrangement, composition, tRNA structures and skews of AT/GC bases. The nucleotide composition of the genome shows that there are more A-C % than T-G% on the positive strand as revealed by positive AT and CG skews. Comparison of individual protein coding genes, with other snake genomes suggests that ATP8 and NADH3 genes have high divergence rates. Codon usage analysis reveals a preference of NNC codons over NNG codons in the mt. genome of P. molurus. Also, the synonymous and non-synonymous substitution rates (ka/ks) suggest that most of the protein coding genes are under purifying selection pressure. The phylogenetic analyses involving the concatenated 13 protein coding genes of P. molurus molurus conformed to the previously established snake phylogeny.
Systematics of the genus Nectria based on a six-gene phylogeny
USDA-ARS?s Scientific Manuscript database
The genus Nectria sensu stricto is characterized by red, fleshy, warted perithecia that become cupulate when dry, and sporodochial conidiomata in Tubercularia and pycnidial anamorphs in Gyrostroma and Zythiostroma. Nectria is the type of the family Nectriaceae currently considered to include 21 gene...
The complete plastid genome of the middle Asian endemic of Stipa lipskyi (Poaceae).
Myszczyński, Kamil; Nobis, Marcin; Szczecinska, Monika; Sawicki, Jakub; Nowak, Arkadiusz
2016-11-01
The structure of the Stipa lipskyi (GenBank accession no. KT692644) plastid genome is similar to that of closely related Poaceae species: it has a total length of 137 755 bp, the base composition of the plastome is the following: A (30.7%), C (19.3%), G (19.4%) and T (30.5%). The S. lipskyi plastid genome contains 71 genes, excluding second IR region. A complete plastome sequence of S. lipskyi will help the development of primers for examining phylogeny and hybridization events in this taxonomically difficult genus.
2013-01-01
Background In vertebrates, it has been repeatedly demonstrated that genes encoding proteins involved in pathogen-recognition by adaptive immunity (e.g. MHC) are subject to intensive diversifying selection. On the other hand, the role and the type of selection processes shaping the evolution of innate-immunity genes are currently far less clear. In this study we analysed the natural variation and the evolutionary processes acting on two genes involved in the innate-immunity recognition of Microbe-Associated Molecular Patterns (MAMPs). Results We sequenced genes encoding Toll-like receptor 4 (Tlr4) and 7 (Tlr7), two of the key bacterial- and viral-sensing receptors of innate immunity, across 23 species within the subfamily Murinae. Although we have shown that the phylogeny of both Tlr genes is largely congruent with the phylogeny of rodents based on a comparably sized non-immune sequence dataset, we also identified several potentially important discrepancies. The sequence analyses revealed that major parts of both Tlrs are evolving under strong purifying selection, likely due to functional constraints. Yet, also several signatures of positive selection have been found in both genes, with more intense signal in the bacterial-sensing Tlr4 than in the viral-sensing Tlr7. 92% and 100% of sites evolving under positive selection in Tlr4 and Tlr7, respectively, were located in the extracellular domain. Directly in the Ligand-Binding Region (LBR) of TLR4 we identified two rapidly evolving amino acid residues and one site under positive selection, all three likely involved in species-specific recognition of lipopolysaccharide of gram-negative bacteria. In contrast, all putative sites of LBRTLR7 involved in the detection of viral nucleic acids were highly conserved across rodents. Interspecific differences in the predicted 3D-structure of the LBR of both Tlrs were not related to phylogenetic history, while analyses of protein charges clearly discriminated Rattini and Murini clades. Conclusions In consequence of the constraints given by the receptor protein function purifying selection has been a dominant force in evolution of Tlrs. Nevertheless, our results show that episodic diversifying parasite-mediated selection has shaped the present species-specific variability in rodent Tlrs. The intensity of diversifying selection was higher in Tlr4 than in Tlr7, presumably due to structural properties of their ligands. PMID:24028551
Gómez, Africa; Serra, Manuel; Carvalho, Gary R; Lunt, David H
2002-07-01
Continental lake-dwelling zooplanktonic organisms have long been considered cosmopolitan species with little geographic variation in spite of the isolation of their habitats. Evidence of morphological cohesiveness and high dispersal capabilities support this interpretation. However, this view has been challenged recently as many such species have been shown either to comprise cryptic species complexes or to exhibit marked population genetic differentiation and strong phylogeographic structuring at a regional scale. Here we investigate the molecular phylogeny of the cosmopolitan passively dispersing rotifer Brachionus plicatilis (Rotifera: Monogononta) species complex using nucleotide sequence variation from both nuclear (ribosomal internal transcribed spacer 1, ITS1) and mitochondrial (cytochrome c oxidase subunit I, COI) genes. Analysis of rotifer resting eggs from 27 salt lakes in the Iberian Peninsula plus lakes from four continents revealed nine genetically divergent lineages. The high level of sequence divergence, absence of hybridization, and extensive sympatry observed support the specific status of these lineages. Sequence divergence estimates indicate that the B. plicatilis complex began diversifying many millions of years ago, yet has showed relatively high levels of morphological stasis. We discuss these results in relation to the ecology and genetics of aquatic invertebrates possessing dispersive resting propagules and address the apparent contradiction between zooplanktonic population structure and their morphological stasis.
Graña-Miraglia, Lucía; Lozano, Luis F.; Velázquez, Consuelo; Volkow-Fernández, Patricia; Pérez-Oseguera, Ángeles; Cevallos, Miguel A.; Castillo-Ramírez, Santiago
2017-01-01
Genome sequencing has been useful to gain an understanding of bacterial evolution. It has been used for studying the phylogeography and/or the impact of mutation and recombination on bacterial populations. However, it has rarely been used to study gene turnover at microevolutionary scales. Here, we sequenced Mexican strains of the human pathogen Acinetobacter baumannii sampled from the same locale over a 3 year period to obtain insights into the microevolutionary dynamics of gene content variability. We found that the Mexican A. baumannii population was recently founded and has been emerging due to a rapid clonal expansion. Furthermore, we noticed that on average the Mexican strains differed from each other by over 300 genes and, notably, this gene content variation has accrued more frequently and faster than the accumulation of mutations. Moreover, due to its rapid pace, gene content variation reflects the phylogeny only at very short periods of time. Additionally, we found that the external branches of the phylogeny had almost 100 more genes than the internal branches. All in all, these results show that rapid gene turnover has been of paramount importance in producing genetic variation within this population and demonstrate the utility of genome sequencing to study alternative forms of genetic variation. PMID:28979253
Graña-Miraglia, Lucía; Lozano, Luis F; Velázquez, Consuelo; Volkow-Fernández, Patricia; Pérez-Oseguera, Ángeles; Cevallos, Miguel A; Castillo-Ramírez, Santiago
2017-01-01
Genome sequencing has been useful to gain an understanding of bacterial evolution. It has been used for studying the phylogeography and/or the impact of mutation and recombination on bacterial populations. However, it has rarely been used to study gene turnover at microevolutionary scales. Here, we sequenced Mexican strains of the human pathogen Acinetobacter baumannii sampled from the same locale over a 3 year period to obtain insights into the microevolutionary dynamics of gene content variability. We found that the Mexican A. baumannii population was recently founded and has been emerging due to a rapid clonal expansion. Furthermore, we noticed that on average the Mexican strains differed from each other by over 300 genes and, notably, this gene content variation has accrued more frequently and faster than the accumulation of mutations. Moreover, due to its rapid pace, gene content variation reflects the phylogeny only at very short periods of time. Additionally, we found that the external branches of the phylogeny had almost 100 more genes than the internal branches. All in all, these results show that rapid gene turnover has been of paramount importance in producing genetic variation within this population and demonstrate the utility of genome sequencing to study alternative forms of genetic variation.
Xu, Teng; Qin, Song; Hu, Yongwu; Song, Zhijian; Ying, Jianchao; Li, Peizhen; Dong, Wei; Zhao, Fangqing; Yang, Huanming; Bao, Qiyu
2016-01-01
Arthrospira platensis is a multi-cellular and filamentous non-N2-fixing cyanobacterium that is capable of performing oxygenic photosynthesis. In this study, we determined the nearly complete genome sequence of A. platensis YZ. A. platensis YZ genome is a single, circular chromosome of 6.62 Mb in size. Phylogenetic and comparative genomic analyses revealed that A. platensis YZ was more closely related to A. platensis NIES-39 than Arthrospira sp. PCC 8005 and A. platensis C1. Broad gene gains were identified between A. platensis YZ and three other Arthrospira speices, some of which have been previously demonstrated that can be laterally transferred among different species, such as restriction-modification systems-coding genes. Moreover, unprecedented extensive chromosomal rearrangements among different strains were observed. The chromosomal rearrangements, particularly the chromosomal inversions, were analysed and estimated to be closely related to palindromes that involved long inverted repeat sequences and the extensively distributed type IIR restriction enzyme in the Arthrospira genome. In addition, species from genus Arthrospira unanimously contained the highest rate of repetitive sequence compared with the other species of order Oscillatoriales, suggested that sequence duplication significantly contributed to Arthrospira genome phylogeny. These results provided in-depth views into the genomic phylogeny and structural variation of A. platensis, as well as provide a valuable resource for functional genomics studies. PMID:27330141
Kyrillos, Alexandra; Arora, Gaurav; Murray, Bradley; Rosenwald, Anne G
2016-06-01
The bacterium Helicobacter pylori is associated with ulcers and the development of gastric cancer. Several genes, including cytotoxin-associated gene A (CagA) and vacuolating cytotoxin A (VacA), are associated with increased gastric cancer risk. Some strains of H. pylori also contain sequences related to bacteriophage phiHP33; however, the significance of these phage-related sequences remains unknown. We assessed the extent to which phiHP33-related sequences are present in 335 H. pylori strains using homology searches then mapped shared genes between phiHP33 and H. pylori strains onto an existing phylogeny. One hundred and twenty-one H. pylori strains contain phage orthologous sequences, and the presence of the phage-related sequences correlates with the presence of CagA and VacA. Mapping of the phage orthologs onto a phylogeny of H. pylori is consistent with the hypothesis that these genes were acquired by horizontal gene transfer. phiHP33 phage orthologous sequences might be of significance in understanding virulence of different H. pylori strains. © 2015 John Wiley & Sons Ltd.
Arthropod phylogeny based on eight molecular loci and morphology
NASA Technical Reports Server (NTRS)
Giribet, G.; Edgecombe, G. D.; Wheeler, W. C.
2001-01-01
The interrelationships of major clades within the Arthropoda remain one of the most contentious issues in systematics, which has traditionally been the domain of morphologists. A growing body of DNA sequences and other types of molecular data has revitalized study of arthropod phylogeny and has inspired new considerations of character evolution. Novel hypotheses such as a crustacean-hexapod affinity were based on analyses of single or few genes and limited taxon sampling, but have received recent support from mitochondrial gene order, and eye and brain ultrastructure and neurogenesis. Here we assess relationships within Arthropoda based on a synthesis of all well sampled molecular loci together with a comprehensive data set of morphological, developmental, ultrastructural and gene-order characters. The molecular data include sequences of three nuclear ribosomal genes, three nuclear protein-coding genes, and two mitochondrial genes (one protein coding, one ribosomal). We devised new optimization procedures and constructed a parallel computer cluster with 256 central processing units to analyse molecular data on a scale not previously possible. The optimal 'total evidence' cladogram supports the crustacean-hexapod clade, recognizes pycnogonids as sister to other euarthropods, and indicates monophyly of Myriapoda and Mandibulata.
A Comprehensive Analysis of Transcript-Supported De Novo Genes in Saccharomyces sensu stricto Yeasts
Lu, Tzu-Chiao; Leu, Jun-Yi; Lin, Wen-Chang
2017-01-01
Abstract Novel genes arising from random DNA sequences (de novo genes) have been suggested to be widespread in the genomes of different organisms. However, our knowledge about the origin and evolution of de novo genes is still limited. To systematically understand the general features of de novo genes, we established a robust pipeline to analyze >20,000 transcript-supported coding sequences (CDSs) from the budding yeast Saccharomyces cerevisiae. Our analysis pipeline combined phylogeny, synteny, and sequence alignment information to identify possible orthologs across 20 Saccharomycetaceae yeasts and discovered 4,340 S. cerevisiae-specific de novo genes and 8,871 S. sensu stricto-specific de novo genes. We further combine information on CDS positions and transcript structures to show that >65% of de novo genes arose from transcript isoforms of ancient genes, especially in the upstream and internal regions of ancient genes. Fourteen identified de novo genes with high transcript levels were chosen to verify their protein expressions. Ten of them, including eight transcript isoform-associated CDSs, showed translation signals and five proteins exhibited specific cytosolic localizations. Our results suggest that de novo genes frequently arise in the S. sensu stricto complex and have the potential to be quickly integrated into ancient cellular network. PMID:28981695
Yamaguchi, M; Miya, M; Okiyama, M; Nishida, M
2000-04-01
Larvae of the deep-sea lanternfish genus Hygophum (Myctophidae) exhibit a remarkable morphological diversity that is quite unexpected, considering their homogeneous adult morphology. In an attempt to elucidate the evolutionary patterns of such larval morphological diversity, nucleotide sequences of a portion of the mitochondrially encoded 16S ribosomal RNA gene were determined for seven Hygophum species and three outgroup taxa. Secondary structure-based alignment resulted in a character matrix consisting of 1172 bp of unambiguously aligned sequences, which were subjected to phylogenetic analyses using maximum-parsimony, maximum-likelihood, and neighbor-joining methods. The resultant tree topologies from the three methods were congruent, with most nodes, including that of the genus Hygophum, being strongly supported by various tree statistics. The most parsimonious reconstruction of the three previously recognized, distinct larval morphs onto the molecular phylogeny revealed that one of the morphs had originated as the common ancestor of the genus, the other two having diversified separately in two subsequent major clades. The patterns of such diversification are discussed in terms of the unusual larval eye morphology and geographic distribution. Copyright 2000 Academic Press.
Refuting phylogenetic relationships
Bucknam, James; Boucher, Yan; Bapteste, Eric
2006-01-01
Background Phylogenetic methods are philosophically grounded, and so can be philosophically biased in ways that limit explanatory power. This constitutes an important methodologic dimension not often taken into account. Here we address this dimension in the context of concatenation approaches to phylogeny. Results We discuss some of the limits of a methodology restricted to verificationism, the philosophy on which gene concatenation practices generally rely. As an alternative, we describe a software which identifies and focuses on impossible or refuted relationships, through a simple analysis of bootstrap bipartitions, followed by multivariate statistical analyses. We show how refuting phylogenetic relationships could in principle facilitate systematics. We also apply our method to the study of two complex phylogenies: the phylogeny of the archaea and the phylogeny of the core of genes shared by all life forms. While many groups are rejected, our results left open a possible proximity of N. equitans and the Methanopyrales, of the Archaea and the Cyanobacteria, and as well the possible grouping of the Methanobacteriales/Methanoccocales and Thermosplasmatales, of the Spirochaetes and the Actinobacteria and of the Proteobacteria and firmicutes. Conclusion It is sometimes easier (and preferable) to decide which species do not group together than which ones do. When possible topologies are limited, identifying local relationships that are rejected may be a useful alternative to classical concatenation approaches aiming to find a globally resolved tree on the basis of weak phylogenetic markers. Reviewers This article was reviewed by Mark Ragan, Eugene V Koonin and J Peter Gogarten. PMID:16956399
MADS-Box gene diversity in seed plants 300 million years ago.
Becker, A; Winter, K U; Meyer, B; Saedler, H; Theissen, G
2000-10-01
MADS-box genes encode a family of transcription factors which control diverse developmental processes in flowering plants ranging from root development to flower and fruit development. Through phylogeny reconstructions, most of these genes can be subdivided into defined monophyletic gene clades whose members share similar expression patterns and functions. Therefore, the establishment of the diversity of gene clades was probably an important event in land plant evolution. In order to determine when these clades originated, we isolated cDNAs of 19 different MADS-box genes from Gnetum gnemon, a gymnosperm model species and thus a representative of the sister group of the angiosperms. Phylogeny reconstructions involving all published MADS-box genes were then used to identify gene clades containing putative orthologs from both angiosperm and gymnosperm lineages. Thus, the minimal number of MADS-box genes that were already present in the last common ancestor of extant gymnosperms and angiosperms was determined. Comparative expression studies involving pairs of putatively orthologous genes revealed a diversity of patterns that has been largely conserved since the time when the angiosperm and gymnosperm lineages separated. Taken together, our data suggest that there were already at least seven different MADS-box genes present at the base of extant seed plants about 300 MYA. These genes were probably already quite diverse in terms of both sequence and function. In addition, our data demonstrate that the MADS-box gene families of extant gymnosperms and angiosperms are of similar complexities.
Construction of a Species-Level Tree of Life for the Insects and Utility in Taxonomic Profiling.
Chesters, Douglas
2017-05-01
Although comprehensive phylogenies have proven an invaluable tool in ecology and evolution, their construction is made increasingly challenging both by the scale and structure of publically available sequences. The distinct partition between gene-rich (genomic) and species-rich (DNA barcode) data is a feature of data that has been largely overlooked, yet presents a key obstacle to scaling supermatrix analysis. I present a phyloinformatics framework for draft construction of a species-level phylogeny of insects (Class Insecta). Matrix-building requires separately optimized pipelines for nuclear transcriptomic, mitochondrial genomic, and species-rich markers, whereas tree-building requires hierarchical inference in order to capture species-breadth while retaining deep-level resolution. The phylogeny of insects contains 49,358 species, 13,865 genera, 760 families. Deep-level splits largely reflected previous findings for sections of the tree that are data rich or unambiguous, such as inter-ordinal Endopterygota and Dictyoptera, the recently evolved and relatively homogeneous Lepidoptera, Hymenoptera, Brachycera (Diptera), and Cucujiformia (Coleoptera). However, analysis of bias, matrix construction and gene-tree variation suggests confidence in some relationships (such as in Polyneoptera) is less than has been indicated by the matrix bootstrap method. To assess the utility of the insect tree as a tool in query profiling several tree-based taxonomic assignment methods are compared. Using test data sets with existing taxonomic annotations, a tendency is observed for greater accuracy of species-level assignments where using a fixed comprehensive tree of life in contrast to methods generating smaller de novo reference trees. Described herein is a solution to the discrepancy in the way data are fit into supermatrices. The resulting tree facilitates wider studies of insect diversification and application of advanced descriptions of diversity in community studies, among other presumed applications. [Data integration; data mining; insects; phylogenomics; phyloinformatics; tree of life.]. © The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
A study on the characterization of Propionibacterium acnes isolated from ocular clinical specimens.
Sowmiya, Murali; Malathi, Jambulingam; Swarnali, Sen; Priya, Jeyavel Padma; Therese, Kulandai Lily; Madhavan, Hajib N
2015-10-01
There are only a few reports available on characterization of Propionibacterium acnes isolated from various ocular clinical specimens. We undertook this study to evaluate the role of P. acnes in ocular infections and biofilm production, and also do the phylogenetic analysis of the bacilli. One hundred isolates of P. acnes collected prospectively from ocular clinical specimens at a tertiary care eye hospital between January 2010 and December 2011, were studied for their association with various ocular disease conditions. The isolates were also subjected to genotyping and phylogenetic analysis, and were also tested for their ability to produce biofilms. Among preoperative conjunctival swabs, P. acnes was a probably significant pathogen in one case; a possibly significant pathogen in two cases. In other clinical conditions, 13 per cent isolates were probably significant pathogens and 38 per cent as possibly significant pathogens. The analysis of 16S rRNA gene revealed four different phylogenies whereas analysis of recA gene showed two phylogenies confirming that recA gene was more reliable than 16S rRNA with less sequence variation. Results of polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) had 100 per cent concordance with phylogenetic results. No association was seen between P. acnes subtypes and biofilm production. RecA gene phylogenetic studies revealed two different phylogenies. RFLP technique was found to be cost-effective with high sensitivity and specificity in phylogenetic analysis. No association between P. acnes subtypes and pathogenetic ability was observed. Biofilm producing isolates showed increased antibiotic resistance compared with non-biofilm producing isolates.
Evolution of Chemical Diversity in Echinocandin Lipopeptide Antifungal Metabolites
Yue, Qun; Chen, Li; Zhang, Xiaoling; Li, Kuan; Sun, Jingzu; Liu, Xingzhong
2015-01-01
The echinocandins are a class of antifungal drugs that includes caspofungin, micafungin, and anidulafungin. Gene clusters encoding most of the structural complexity of the echinocandins provided a framework for hypotheses about the evolutionary history and chemical logic of echinocandin biosynthesis. Gene orthologs among echinocandin-producing fungi were identified. Pathway genes, including the nonribosomal peptide synthetases (NRPSs), were analyzed phylogenetically to address the hypothesis that these pathways represent descent from a common ancestor. The clusters share cooperative gene contents and linkages among the different strains. Individual pathway genes analyzed in the context of similar genes formed unique echinocandin-exclusive phylogenetic lineages. The echinocandin NRPSs, along with the NRPS from the inp gene cluster in Aspergillus nidulans and its orthologs, comprise a novel lineage among fungal NRPSs. NRPS adenylation domains from different species exhibited a one-to-one correspondence between modules and amino acid specificity that is consistent with models of tandem duplication and subfunctionalization. Pathway gene trees and Ascomycota phylogenies are congruent and consistent with the hypothesis that the echinocandin gene clusters have a common origin. The disjunct Eurotiomycete-Leotiomycete distribution appears to be consistent with a scenario of vertical descent accompanied by incomplete lineage sorting and loss of the clusters from most lineages of the Ascomycota. We present evidence for a single evolutionary origin of the echinocandin family of gene clusters and a progression of structural diversification in two fungal classes that diverged approximately 290 to 390 million years ago. Lineage-specific gene cluster evolution driven by selection of new chemotypes contributed to diversification of the molecular functionalities. PMID:26024901
Baute, Gregory J; Owens, Gregory L; Bock, Dan G; Rieseberg, Loren H
2016-12-01
Wild sunflowers harbor considerable genetic diversity and are a major resource for improvement of the cultivated sunflower, Helianthus annuus. The Helianthus genus is also well known for its propensity for gene flow between taxa. We surveyed genomic diversity of 292 samples of wild Helianthus from 22 taxa that are cross-compatible with the cultivar using genotyping by sequencing. With these data, we derived a high-resolution phylogeny of the taxa, interrogated genome-wide levels of diversity, explored H. annuus population structure, and identified localized gene flow between H. annuus and its close relatives. Our phylogenomic analyses confirmed a number of previously established interspecific relationships and indicated for the first time that a newly described annual sunflower, H. winteri, is nested within H. annuus. Principal component analyses showed that H. annuus has geographic population structure with most notable subpopulations occurring in California and Texas. While gene flow was identified between H. annuus and H. bolanderi in California and between H. annuus and H. argophyllus in Texas, this genetic exchange does not appear to drive observed patterns of H. annuus population structure. Wild H. annuus remains an excellent resource for cultivated sunflower breeding effort because of its diversity and the ease with which it can be crossed with cultivated H. annuus. Cases of interspecific gene flow such as those documented here also indicate wild H. annuus can act as a bridge to capture alleles from other wild taxa; continued breeding efforts with it may therefore reap the largest rewards. © 2016 Botanical Society of America.
Yang, Zefeng; Gu, Shiliang; Wang, Xuefeng; Li, Wenjuan; Tang, Zaixiang; Xu, Chenwu
2008-09-01
CPP-like genes are members of a small family which features the existence of two similar Cys-rich domains termed CXC domains in their protein products and are distributed widely in plants and animals but do not exist in yeast. The members of this family in plants play an important role in development of reproductive tissue and control of cell division. To gain insights into how CPP-like genes evolved in plants, we conducted a comparative phylogenetic and molecular evolutionary analysis of the CPP-like gene family in Arabidopsis and rice. The results of phylogeny revealed that both gene loss and species-specific expansion contributed to the evolution of this family in Arabidopsis and rice. Both intron gain and intron loss were observed through intron/exon structure analysis for duplicated genes. Our results also suggested that positive selection was a major force during the evolution of CPP-like genes in plants, and most amino acid residues under positive selection were disproportionately located in the region outside the CXC domains. Further analysis revealed that two CXC domains and sequences connecting them might have coevolved during the long evolutionary period.
Ren, Ren; Sun, Yazhou; Zhao, Yue; Geiser, David
2016-01-01
Abstract A comprehensive and reliable eukaryotic tree of life is important for many aspects of biological studies from comparative developmental and physiological analyses to translational medicine and agriculture. Both gene-rich and taxon-rich approaches are effective strategies to improve phylogenetic accuracy and are greatly facilitated by marker genes that are universally distributed, well conserved, and orthologous among divergent eukaryotes. In this article, we report the identification of 943 low-copy eukaryotic genes and we show that many of these genes are promising tools in resolving eukaryotic phylogenies, despite the challenges of determining deep eukaryotic relationships. As a case study, we demonstrate that smaller subsets of ∼20 and 52 genes could resolve controversial relationships among widely divergent taxa and provide strong support for deep relationships such as the monophyly and branching order of several eukaryotic supergroups. In addition, the use of these genes resulted in fungal phylogenies that are congruent with previous phylogenomic studies that used much larger datasets, and successfully resolved several difficult relationships (e.g., forming a highly supported clade with Microsporidia, Mitosporidium and Rozella sister to other fungi). We propose that these genes are excellent for both gene-rich and taxon-rich analyses and can be applied at multiple taxonomic levels and facilitate a more complete understanding of the eukaryotic tree of life. PMID:27604879
Liu, Hongyun; Qin, Jiajia; Fan, Hui; Cheng, Jinjin; Li, Lin; Liu, Zheng
2017-07-01
As a member of the GRAS gene family, SCARECROW - LIKE ( SCL ) genes encode transcriptional regulators that are involved in plant information transmission and signal transduction. In this study, 44 SCL genes including two SCARECROW genes in millet were identified to be distributed on eight chromosomes, except chromosome 6. All the millet genes contain motifs 6-8, indicating that these motifs are conserved during the evolution. SCL genes of millet were divided into eight groups based on the phylogenetic relationship and classification of Arabidopsis SCL genes. Several putative millet orthologous genes in Arabidopsis , maize and rice were identified. High throughput RNA sequencing revealed that the expressions of millet SCL genes in root, stem, leaf, spica, and along leaf gradient varied greatly. Analyses combining the gene expression patterns, gene structures, motif compositions, promoter cis -elements identification, alternative splicing of transcripts and phylogenetic relationship of SCL genes indicate that the these genes may play diverse functions. Functionally characterized SCL genes in maize, rice and Arabidopsis would provide us some clues for future characterization of their homologues in millet. To the best of our knowledge, this is the first study of millet SCL genes at the genome wide level. Our work provides a useful platform for functional analysis of SCL genes in millet, a model crop for C 4 photosynthesis and bioenergy studies.
ERIC Educational Resources Information Center
White, Stephanie A.
2010-01-01
Could a mutation in a single gene be the evolutionary lynchpin supporting the development of human language? A rare mutation in the molecule known as FOXP2 discovered in a human family seemed to suggest so, and its sequence phylogeny reinforced a Chomskian view that language emerged wholesale in humans. Spurred by this discovery, research in…
Osca, David; Templado, José; Zardoya, Rafael
2014-09-01
The complete nucleotide sequence of the mitochondrial (mt) genome of the deep-sea vent snail Ifremeria nautilei (Gastropoda: Abyssochrysoidea) was determined. The double stranded circular molecule is 15,664 pb in length and encodes for the typical 37 metazoan mitochondrial genes. The gene arrangement of the Ifremeria mt genome is most similar to genome organization of caenogastropods and differs only on the relative position of the trnW gene. The deduced amino acid sequences of the mt protein coding genes of Ifremeria mt genome were aligned with orthologous sequences from representatives of the main lineages of gastropods and phylogenetic relationships were inferred. The reconstructed phylogeny supports that Ifremeria belongs to Caenogastropoda and that it is closely related to hypsogastropod superfamilies. Results were compared with a reconstructed nuclear-based phylogeny. Moreover, a relaxed molecular-clock timetree calibrated with fossils dated the divergence of Abyssochrysoidea in the Late Jurassic-Early Cretaceous indicating a relatively modern colonization of deep-sea environments by these snails. Copyright © 2014 Elsevier B.V. All rights reserved.
Lee, I M; Bartoszyk, I M; Gundersen-Rindal, D E; Davis, R E
1997-07-01
A phylogenetic analysis by parsimony of 16S rRNA gene sequences (16S rDNA) revealed that species and subspecies of Clavibacter and Rathayibacter form a discrete monophyletic clade, paraphyletic to Corynebacterium species. Within the Clavibacter-Rathayibacter clade, four major phylogenetic groups (subclades) with a total of 10 distinct taxa were recognized: (I) species C. michiganensis; (II) species C. xyli; (III) species R. iranicus and R. tritici; and (IV) species R. rathayi. The first three groups form a monophyletic cluster, paraphyletic to R. rathayi. On the basis of the phylogeny inferred, reclassification of members of Clavibacter-Rathayibacter group is proposed. A system for classification of taxa in Clavibacter and Rathayibacter was developed based on restriction fragment length polymorphism (RFLP) analysis of the PCR-amplified 16S rDNA sequences. The groups delineated on the basis of RFLP patterns of 16S rDNA coincided well with the subclades delineated on the basis of phylogeny. In contrast to previous classification systems, which are based primarily on phenotypic properties and are laborious, the RFLP analyses allow for rapid differentiation among species and subspecies in the two genera.
Duplicated growth hormone genes in a passerine bird, the jungle crow (Corvus macrorhynchos).
Arai, Natsumi; Iigo, Masayuki
2010-07-02
Molecular cloning, molecular phylogeny, gene structure and expression analyses of growth hormone (GH) were performed in a passerine bird, the jungle crow (Corvus macrorhynchos). Unexpectedly, duplicated GH cDNA and genes were identified and designated as GH1A and GH1B. In silico analyses identified the zebra finch orthologs. Both GH genes encode 217 amino acid residues and consist of five exons and four introns, spanning 5.2 kbp in GH1A and 4.2 kbp in GH1B. Predicted GH proteins of the jungle crow and zebra finch contain four conserved cysteine residues, suggesting duplicated GH genes are functional. Molecular phylogenetic analysis revealed that duplication of GH genes occur after divergence of the passerine lineage from the other avian orders as has been suggested from partial genomic DNA sequences of passerine GH genes. RT-PCR analyses confirmed expression of GH1A and GH1B in the pituitary gland. In addition, GH1A gene is expressed in all the tissues examined. However, expression of GH1B is confined to several brain areas and blood cells. These results indicate that the regulatory mechanisms of duplicated GH genes are different and that duplicated GH genes exert both endocrine and autocrine/paracrine functions. Copyright 2010 Elsevier Inc. All rights reserved.
Endosymbiotic gene transfer from prokaryotic pangenomes: Inherited chimerism in eukaryotes.
Ku, Chuan; Nelson-Sathi, Shijulal; Roettger, Mayo; Garg, Sriram; Hazkani-Covo, Einat; Martin, William F
2015-08-18
Endosymbiotic theory in eukaryotic-cell evolution rests upon a foundation of three cornerstone partners--the plastid (a cyanobacterium), the mitochondrion (a proteobacterium), and its host (an archaeon)--and carries a corollary that, over time, the majority of genes once present in the organelle genomes were relinquished to the chromosomes of the host (endosymbiotic gene transfer). However, notwithstanding eukaryote-specific gene inventions, single-gene phylogenies have never traced eukaryotic genes to three single prokaryotic sources, an issue that hinges crucially upon factors influencing phylogenetic inference. In the age of genomes, single-gene trees, once used to test the predictions of endosymbiotic theory, now spawn new theories that stand to eventually replace endosymbiotic theory with descriptive, gene tree-based variants featuring supernumerary symbionts: prokaryotic partners distinct from the cornerstone trio and whose existence is inferred solely from single-gene trees. We reason that the endosymbiotic ancestors of mitochondria and chloroplasts brought into the eukaryotic--and plant and algal--lineage a genome-sized sample of genes from the proteobacterial and cyanobacterial pangenomes of their respective day and that, even if molecular phylogeny were artifact-free, sampling prokaryotic pangenomes through endosymbiotic gene transfer would lead to inherited chimerism. Recombination in prokaryotes (transduction, conjugation, transformation) differs from recombination in eukaryotes (sex). Prokaryotic recombination leads to pangenomes, and eukaryotic recombination leads to vertical inheritance. Viewed from the perspective of endosymbiotic theory, the critical transition at the eukaryote origin that allowed escape from Muller's ratchet--the origin of eukaryotic recombination, or sex--might have required surprisingly little evolutionary innovation.
Hemipteran Mitochondrial Genomes: Features, Structures and Implications for Phylogeny
Wang, Yuan; Chen, Jing; Jiang, Li-Yun; Qiao, Ge-Xia
2015-01-01
The study of Hemipteran mitochondrial genomes (mitogenomes) began with the Chagas disease vector, Triatoma dimidiata, in 2001. At present, 90 complete Hemipteran mitogenomes have been sequenced and annotated. This review examines the history of Hemipteran mitogenomes research and summarizes the main features of them including genome organization, nucleotide composition, protein-coding genes, tRNAs and rRNAs, and non-coding regions. Special attention is given to the comparative analysis of repeat regions. Gene rearrangements are an additional data type for a few families, and most mitogenomes are arranged in the same order to the proposed ancestral insect. We also discuss and provide insights on the phylogenetic analyses of a variety of taxonomic levels. This review is expected to further expand our understanding of research in this field and serve as a valuable reference resource. PMID:26039239
Wang, Xiao-Jing; Wang, Xiao-Xing; Wang, Ya-Jun; Wang, Xi-Zhong; He, Guang-Xin; Chen, Hong-Wei; Fei, Li-Song
2002-09-01
Activin, which is included in the transforming growth factor-beta (TGF beta) superfamily of proteins and receptors, is known to have broad-ranging effects in the creatures. The mature peptide of beta A subunit of this gene, one of the most highly conserved sequence, can elevate the basal secretion of follicle-stimulating hormone (FSH) in the pituitary and FSH is pivotal to organism's reproduction. Reproduction block is one of the main reasons which cause giant panda to extinct. The sequence of Activin beta A subunit gene mature peptides has been successfully amplified from giant panda, red panda and malayan sun bear's genomic DNA by using polymerase chain reaction (PCR) with a pair of degenerate primers. The PCR products were cloned into the vector pBlueScript+ of Esherichia coli. Sequence analysis of Activin beta A subunit gene mature peptides shows that the length of this gene segment is the same (359 bp) and there is no intron in all three species. The sequence encodes a peptide of 119 amino acid residues. The homology comparison demonstrates 93.9% DNA homology and 99% homology in amino acid among these three species. Both GenBank blast search result and restriction enzyme map reveal that the sequences of Activin beta A subunit gene mature peptides of different species are highly conserved during the evolution process. Phylogeny analysis is performed with PHYLIP software package. A consistent phylogeny tree has been drawn with three different methods. The software analysis outcome accords with the academic view that giant panda has a closer relationship to the malayan sun bear than the red panda. Giant panda should be grouped into the bear family (Uersidae) with the malayan sun bear. As to the red panda, it would be better that this animal be grouped into the unique family (red panda family) because of great difference between the red panda and the bears (Uersidae).
Ferla, Matteo P.; Thrash, J. Cameron; Giovannoni, Stephen J.; Patrick, Wayne M.
2013-01-01
Bacteria in the class Alphaproteobacteria have a wide variety of lifestyles and physiologies. They include pathogens of humans and livestock, agriculturally valuable strains, and several highly abundant marine groups. The ancestor of mitochondria also originated in this clade. Despite significant effort to investigate the phylogeny of the Alphaproteobacteria with a variety of methods, there remains considerable disparity in the placement of several groups. Recent emphasis on phylogenies derived from multiple protein-coding genes remains contentious due to disagreement over appropriate gene selection and the potential influences of systematic error. We revisited previous investigations in this area using concatenated alignments of the small and large subunit (SSU and LSU) rRNA genes, as we show here that these loci have much lower GC bias than whole genomes. This approach has allowed us to update the canonical 16S rRNA gene tree of the Alphaproteobacteria with additional important taxa that were not previously included, and with added resolution provided by concatenating the SSU and LSU genes. We investigated the topological stability of the Alphaproteobacteria by varying alignment methods, rate models, taxon selection and RY-recoding to circumvent GC content bias. We also introduce RYMK-recoding and show that it avoids some of the information loss in RY-recoding. We demonstrate that the topology of the Alphaproteobacteria is sensitive to inclusion of several groups of taxa, but it is less affected by the choice of alignment and rate methods. The majority of topologies and comparative results from Approximately Unbiased tests provide support for positioning the Rickettsiales and the mitochondrial branch within a clade. This composite clade is a sister group to the abundant marine SAR11 clade (Pelagibacterales). Furthermore, we add support for taxonomic assignment of several recently sequenced taxa. Accordingly, we propose three subclasses within the Alphaproteobacteria: the Caulobacteridae, the Rickettsidae, and the Magnetococcidae. PMID:24349502
Yi, Zhenzhen; Song, Weibo; Clamp, John C; Chen, Zigui; Gao, Shan; Zhang, Qianqian
2009-03-01
Comprehensive molecular analyses of phylogenetic relationships within euplotid ciliates are relatively rare, and the relationships among some families remain questionable. We performed phylogenetic analyses of the order Euplotida based on new sequences of the gene coding for small-subunit RNA (SSrRNA) from a variety of taxa across the entire order as well as sequences from some of these taxa of other genes (ITS1-5.8S-ITS2 region and histone H4) that have not been included in previous analyses. Phylogenetic trees based on SSrRNA gene sequences constructed with four different methods had a consistent branching pattern that included the following features: (1) the "typical" euplotids comprised a paraphyletic assemblage composed of two divergent clades (family Uronychiidae and families Euplotidae-Certesiidae-Aspidiscidae-Gastrocirrhidae), (2) in the family Uronychiidae, the genera Uronychia and Paradiophrys formed a clearly outlined, well-supported clade that seemed to be rather divergent from Diophrys and Diophryopsis, suggesting that the Diophrys-complex may have had a longer and more separate evolutionary history than previously supposed, (3) inclusion of 12 new SSrRNA sequences in analyses of Euplotidae revealed two new clades of species within the family and cast additional doubt on the present classification of genera within the family, and (4) the intraspecific divergence among five species of Aspidisca was far greater than those of closely related genera. The ITS1-5.8S-ITS2 coding regions and partial histone H4 genes of six morphospecies in the Diophrys-complex were sequenced along with their SSrRNA genes and used to compare phylogenies constructed from single data sets to those constructed from combined sets. Results indicated that combined analyses could be used to construct more reliable, less ambiguous phylogenies of complex groups like the order Euplotida, because they provide a greater amount and diversity of information.
Masuda, R; Lopez, J V; Slattery, J P; Yuhki, N; O'Brien, S J
1996-12-01
Molecular phylogeny of the cat family Felidae is derived using two mitochondrial genes, cytochrome b and 12S rRNA. Phylogenetic methods of weighted maximum parsimony and minimum evolution estimated by neighbor-joining are employed to reconstruct topologies among 20 extant felid species. Sequence analyses of 363 bp of cytochrome b and 376 bp of the 12S rRNA genes yielded average pair-wise similarity values between felids ranging from 94 to 99% and from 85 to 99%, respectively. Phylogenetic reconstruction supports more recent, intralineage associations but fails to completely resolve interlineage relationships. Both genes produce a monophyletic group of Felis species but vary in the placement of the pallas cat. The ocelot lineage represents an early divergence within the Felidae, with strong associations between ocelot and margay, Geoffroy's cat and kodkod, and pampas cat and tigrina. Implications of the relative recency of felid evolution, presence of ancestral polymorphisms, and influence of outgroups in placement of the topological root are discussed.
Using MOEA with Redistribution and Consensus Branches to Infer Phylogenies.
Min, Xiaoping; Zhang, Mouzhao; Yuan, Sisi; Ge, Shengxiang; Liu, Xiangrong; Zeng, Xiangxiang; Xia, Ningshao
2017-12-26
In recent years, to infer phylogenies, which are NP-hard problems, more and more research has focused on using metaheuristics. Maximum Parsimony and Maximum Likelihood are two effective ways to conduct inference. Based on these methods, which can also be considered as the optimal criteria for phylogenies, various kinds of multi-objective metaheuristics have been used to reconstruct phylogenies. However, combining these two time-consuming methods results in those multi-objective metaheuristics being slower than a single objective. Therefore, we propose a novel, multi-objective optimization algorithm, MOEA-RC, to accelerate the processes of rebuilding phylogenies using structural information of elites in current populations. We compare MOEA-RC with two representative multi-objective algorithms, MOEA/D and NAGA-II, and a non-consensus version of MOEA-RC on three real-world datasets. The result is, within a given number of iterations, MOEA-RC achieves better solutions than the other algorithms.
Genomic insights into the taxonomic status of the Bacillus cereus group
Liu, Yang; Lai, Qiliang; Göker, Markus; Meier-Kolthoff, Jan P.; Wang, Meng; Sun, Yamin; Wang, Lei; Shao, Zongze
2015-01-01
The identification and phylogenetic relationships of bacteria within the Bacillus cereus group are controversial. This study aimed at determining the taxonomic affiliations of these strains using the whole-genome sequence-based Genome BLAST Distance Phylogeny (GBDP) approach. The GBDP analysis clearly separated 224 strains into 30 clusters, representing eleven known, partially merged species and accordingly 19–20 putative novel species. Additionally, 16S rRNA gene analysis, a novel variant of multi-locus sequence analysis (nMLSA) and screening of virulence genes were performed. The 16S rRNA gene sequence was not sufficient to differentiate the bacteria within this group due to its high conservation. The nMLSA results were consistent with GBDP. Moreover, a fast typing method was proposed using the pycA gene, and where necessary, the ccpA gene. The pXO plasmids and cry genes were widely distributed, suggesting little correlation with the phylogenetic positions of the host bacteria. This might explain why classifications based on virulence characteristics proved unsatisfactory in the past. In summary, this is the first large-scale and systematic study of the taxonomic status of the bacteria within the B. cereus group using whole-genome sequences, and is likely to contribute to further insights into their pathogenicity, phylogeny and adaptation to diverse environments. PMID:26373441
Using nearly full-genome HIV sequence data improves phylogeny reconstruction in a simulated epidemic
Yebra, Gonzalo; Hodcroft, Emma B.; Ragonnet-Cronin, Manon L.; Pillay, Deenan; Brown, Andrew J. Leigh; Fraser, Christophe; Kellam, Paul; de Oliveira, Tulio; Dennis, Ann; Hoppe, Anne; Kityo, Cissy; Frampton, Dan; Ssemwanga, Deogratius; Tanser, Frank; Keshani, Jagoda; Lingappa, Jairam; Herbeck, Joshua; Wawer, Maria; Essex, Max; Cohen, Myron S.; Paton, Nicholas; Ratmann, Oliver; Kaleebu, Pontiano; Hayes, Richard; Fidler, Sarah; Quinn, Thomas; Novitsky, Vladimir; Haywards, Andrew; Nastouli, Eleni; Morris, Steven; Clark, Duncan; Kozlakidis, Zisis
2016-01-01
HIV molecular epidemiology studies analyse viral pol gene sequences due to their availability, but whole genome sequencing allows to use other genes. We aimed to determine what gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTR + Γ), and compared their topologies to the corresponding true tree’s using CompareTree. The accuracy of the trees was significantly proportional to the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences. PMID:28008945
Yebra, Gonzalo; Hodcroft, Emma B; Ragonnet-Cronin, Manon L; Pillay, Deenan; Brown, Andrew J Leigh
2016-12-23
HIV molecular epidemiology studies analyse viral pol gene sequences due to their availability, but whole genome sequencing allows to use other genes. We aimed to determine what gene(s) provide(s) the best approximation to the real phylogeny by analysing a simulated epidemic (created as part of the PANGEA_HIV project) with a known transmission tree. We sub-sampled a simulated dataset of 4662 sequences into different combinations of genes (gag-pol-env, gag-pol, gag, pol, env and partial pol) and sampling depths (100%, 60%, 20% and 5%), generating 100 replicates for each case. We built maximum-likelihood trees for each combination using RAxML (GTR + Γ), and compared their topologies to the corresponding true tree's using CompareTree. The accuracy of the trees was significantly proportional to the length of the sequences used, with the gag-pol-env datasets showing the best performance and gag and partial pol sequences showing the worst. The lowest sampling depths (20% and 5%) greatly reduced the accuracy of tree reconstruction and showed high variability among replicates, especially when using the shortest gene datasets. In conclusion, using longer sequences derived from nearly whole genomes will improve the reliability of phylogenetic reconstruction. With low sample coverage, results can be highly variable, particularly when based on short sequences.
Marine, Rachel L; Nasko, Daniel J; Wray, Jeffrey; Polson, Shawn W; Wommack, K Eric
2017-01-01
Chaperonins are protein-folding machinery found in all cellular life. Chaperonin genes have been documented within a few viruses, yet, surprisingly, analysis of metagenome sequence data indicated that chaperonin-carrying viruses are common and geographically widespread in marine ecosystems. Also unexpected was the discovery of viral chaperonin sequences related to thermosome proteins of archaea, indicating the presence of virioplankton populations infecting marine archaeal hosts. Virioplankton large subunit chaperonin sequences (GroELs) were divergent from bacterial sequences, indicating that viruses have carried this gene over long evolutionary time. Analysis of viral metagenome contigs indicated that: the order of large and small subunit genes was linked to the phylogeny of GroEL; both lytic and temperate phages may carry group I chaperonin genes; and viruses carrying a GroEL gene likely have large double-stranded DNA (dsDNA) genomes (>70 kb). Given these connections, it is likely that chaperonins are critical to the biology and ecology of virioplankton populations that carry these genes. Moreover, these discoveries raise the intriguing possibility that viral chaperonins may more broadly alter the structure and function of viral and cellular proteins in infected host cells. PMID:28731469
Marine, Rachel L; Nasko, Daniel J; Wray, Jeffrey; Polson, Shawn W; Wommack, K Eric
2017-11-01
Chaperonins are protein-folding machinery found in all cellular life. Chaperonin genes have been documented within a few viruses, yet, surprisingly, analysis of metagenome sequence data indicated that chaperonin-carrying viruses are common and geographically widespread in marine ecosystems. Also unexpected was the discovery of viral chaperonin sequences related to thermosome proteins of archaea, indicating the presence of virioplankton populations infecting marine archaeal hosts. Virioplankton large subunit chaperonin sequences (GroELs) were divergent from bacterial sequences, indicating that viruses have carried this gene over long evolutionary time. Analysis of viral metagenome contigs indicated that: the order of large and small subunit genes was linked to the phylogeny of GroEL; both lytic and temperate phages may carry group I chaperonin genes; and viruses carrying a GroEL gene likely have large double-stranded DNA (dsDNA) genomes (>70 kb). Given these connections, it is likely that chaperonins are critical to the biology and ecology of virioplankton populations that carry these genes. Moreover, these discoveries raise the intriguing possibility that viral chaperonins may more broadly alter the structure and function of viral and cellular proteins in infected host cells.
NASA Astrophysics Data System (ADS)
Xue, Zhuang; Li, Hui; Liu, Yang; Zhou, Wei; Sun, Jing; Wang, Xiuli
2017-12-01
As a `living fossil' of species origin and `rich treasure' of food and nutrition development, sea cucumber has received a lot of attentions from researchers. The cDNA library construction and EST sequencing of blood had been conducted previously in our lab. The bioinformatic analysis provided a gene fragment which is highly homologous with the genes of lectin family, named AjL ( Apostichopus japonicus lectin). To characterize and determine the phylogeny of AjL genes in early evolution, we isolated a full-length cDNA of lectin gene from the body wall of A. japonicus. The open reading frame of this gene contained 489 bp and encoded a 163 amino acids secretory protein being homologous to lectins of mammals and aquatic organisms. The deduced protein included a lectin-like domain. SDS-PAGE analysis showed that AjL migrated as a specific band (about 36.09 kDa under reducing), and agglutinated against rabbit red blood cells. AjL was similar to chain A of CEL-IV in space structure. We predicted that AjL may play the same role of CEL-IV. Our results suggested that more than one lectin gene functioned in sea cucumber and most of other species, which was fused by uncertain sequences during the evolution and encoded different proteins with diverse functions. Our findings provided the insights into the function and characteristics of lectin genes invertebrates. The results will also be helpful for the identification and structural, functional, and evolutionary analyses of lectin genes.
Xia, Rong; Durand, Jean-Dominique; Fu, Cuizhang
2016-03-01
The interrelationships among mugilids (Mugiliformes: Mugilidae) remain highly debated. Using a mitochondrial gene-based phylogeny as criterion, a revised classification with 25 genera in the Mugilidae has recently been proposed. However, phylogenetic relationships of major mitochondrial lineages remain unresolved and to gain a general acceptance the classification requires confirmation based on multilocus evidence and diagnostic morphological characters. Here, we construct a species-tree using twelve nuclear and three mitochondrial loci and infer the evolution of 71 morphological characters. Our multilocus phylogeny does not agree with previous morphology-based hypotheses for the relationships within Mugilidae, confirms the revised classification with 25 genera and further resolves their phylogenetic relationships. Using the well-resolved multilocus phylogeny as the criterion, we reclassify Mugilidae genera into three new subfamilies (Myxinae, Rhinomugilinae, and Cheloninae) and one new, recombined, subfamily (Mugilinae). The Rhinomugilinae subfamily is further divided into four tribes. The revised classification of Mugilidae is supported by morpho-anatomical synapomorphies or a combination of characters. These characters are used to erect a key to the subfamilies and genera. Copyright © 2015 Elsevier Inc. All rights reserved.
Bowe, L M; Coat, G; dePamphilis, C W
2000-04-11
Efforts to resolve Darwin's "abominable mystery"-the origin of angiosperms-have led to the conclusion that Gnetales and various fossil groups are sister to angiosperms, forming the "anthophytes." Morphological homologies, however, are difficult to interpret, and molecular data have not provided clear resolution of relationships among major groups of seed plants. We introduce two sequence data sets from slowly evolving mitochondrial genes, cox1 and atpA, which unambiguously reject the anthophyte hypothesis, favoring instead a close relationship between Gnetales and conifers. Parsimony- and likelihood-based analyses of plastid rbcL and nuclear 18S rDNA alone and with cox1 and atpA also strongly support a gnetophyte-conifer grouping. Surprisingly, three of four genes (all but nuclear rDNA) and combined three-genome analyses also suggest or strongly support Gnetales as derived conifers, sister to Pinaceae. Analyses with outgroups screened to avoid long branches consistently identify all gymnosperms as a monophyletic sister group to angiosperms. Combined three- and four-gene rooted analyses resolve the branching order for the remaining major groups-cycads separate from other gymnosperms first, followed by Ginkgo and then (Gnetales + Pinaceae) sister to a monophyletic group with all other conifer families. The molecular phylogeny strongly conflicts with current interpretations of seed plant morphology, and implies that many similarities between gnetophytes and angiosperms, such as "flower-like" reproductive structures and double fertilization, were independently derived, whereas other characters could emerge as synapomorphies for an expanded conifer group including Gnetales. An initial angiosperm-gymnosperm split implies a long stem lineage preceding the explosive Mesozoic radiation of flowering plants and suggests that angiosperm origins and homologies should be sought among extinct seed plant groups.
Miao, Wenwen; Sun, Lirong; Tian, Mi; Wang, Ji
2017-01-01
Abscisic acid (ABA) receptor pyrabactin resistance1/PYR1-like/regulatory components of ABA receptor (PYR1/PYL/RCAR) (named PYLs for simplicity) are core regulators of ABA signaling, and have been well studied in Arabidopsis and rice. However, knowledge is limited about the PYL family regarding genome organization, gene structure, phylogenesis, gene expression and protein interaction with downstream targets in Gossypium. A comprehensive analysis of the Gossypium PYL family was carried out, and 21, 20, 40 and 39 PYL genes were identified in the genomes from the diploid progenitor G. arboretum, G. raimondii and the tetraploid G. hirsutum and G. barbadense, respectively. Characterization of the physical properties, chromosomal locations, structures and phylogeny of these family members revealed that Gossypium PYLs were quite conservative among the surveyed cotton species. Segmental duplication might be the main force promoting the expansion of PYLs, and the majority of the PYLs underwent evolution under purifying selection in Gossypium. Additionally, the expression profiles of GhPYL genes were specific in tissues. Transcriptions of many GhPYL genes were inhibited by ABA treatments and induced by osmotic stress. A number of GhPYLs can interact with GhABI1A or GhABID in the presence and/or absence of ABA by the yeast-two hybrid method in cotton. PMID:29230363
Zhang, Gaofeng; Lu, Tingting; Miao, Wenwen; Sun, Lirong; Tian, Mi; Wang, Ji; Hao, Fushun
2017-01-01
Abscisic acid (ABA) receptor pyrabactin resistance1/PYR1-like/regulatory components of ABA receptor (PYR1/PYL/RCAR) (named PYLs for simplicity) are core regulators of ABA signaling, and have been well studied in Arabidopsis and rice. However, knowledge is limited about the PYL family regarding genome organization, gene structure, phylogenesis, gene expression and protein interaction with downstream targets in Gossypium . A comprehensive analysis of the Gossypium PYL family was carried out, and 21, 20, 40 and 39 PYL genes were identified in the genomes from the diploid progenitor G. arboretum , G. raimondii and the tetraploid G. hirsutum and G. barbadense , respectively. Characterization of the physical properties, chromosomal locations, structures and phylogeny of these family members revealed that Gossypium PYLs were quite conservative among the surveyed cotton species. Segmental duplication might be the main force promoting the expansion of PYLs , and the majority of the PYLs underwent evolution under purifying selection in Gossypium . Additionally, the expression profiles of GhPYL genes were specific in tissues. Transcriptions of many GhPYL genes were inhibited by ABA treatments and induced by osmotic stress. A number of GhPYLs can interact with GhABI1A or GhABID in the presence and/or absence of ABA by the yeast-two hybrid method in cotton.
A monkey's tale: The origin of Plasmodium vivax as a human malaria parasite
Escalante, Ananias A.; Cornejo, Omar E.; Freeland, Denise E.; Poe, Amanda C.; Durrego, Ester; Collins, William E.; Lal, Altaf A.
2005-01-01
The high prevalence of Duffy negativity (lack of the Duffy blood group antigen) among human populations in sub-Saharan Africa has been used to argue that Plasmodium vivax originated on that continent. Here, we investigate the phylogenetic relationships among 10 species of Plasmodium that infect primates by using three genes, two nuclear (β-tubulin and cell division cycle 2) and a gene from the plastid genome (the elongation factor Tu). We find compelling evidence that P. vivax is derived from a species that inhabited macaques in Southeast Asia. Specifically, those phylogenies that include P. vivax as an ancient lineage from which all of the macaque parasites could originate are significantly less likely to explain the data. We estimate the time to the most recent common ancestor at four neutral gene loci from Asian and South American isolates (a minimum sample of seven isolates per locus). Our analysis estimates that the extant populations of P. vivax originated between 45,680 and 81,607 years ago. The phylogeny and the estimated time frame for the origination of current P. vivax populations are consistent with an “out of Asia” origin for P. vivax as hominoid parasite. The current debate regarding how the Duffy negative trait became fixed in Africa needs to be revisited, taking into account not only human genetic data but also the genetic diversity observed in the extant P. vivax populations and the phylogeny of the genus Plasmodium. PMID:15684081
USDA-ARS?s Scientific Manuscript database
Background: Serovars of the human pathogen Chlamydia trachomatis occupy one of three specific tissue niches. Genomic analyses indicate that the serovars have a phylogeny congruent with their pathobiology and have an average substitution rate of less than one nucleotide per kilobase. The ompA gene, h...
I. Alvarez; R. Cronn; J.F. Wendel
2005-01-01
American diploid cottons (Gossypium L., subgenus Houzingenia Fryxell) form a monophyletic group of 13 species distributed mainly in western Mexico, extending into Arizona, Baja California, and with one disjunct species each in the Galapagos Islands and Peru. Prior phylogenetic analyses based on an alcohol dehydrogenase gene (...
USDA-ARS?s Scientific Manuscript database
Arecaceae tribe Cocoseae is the most economically important tribe of palms, including both coconut and African oil palm. It is mostly represented in the Neotropics, with one and two genera endemic to South Africa and Madagascar, respectively. Using primers for six single copy WRKY gene family loci...
USDA-ARS?s Scientific Manuscript database
Next-generation sequencing has taken a central role in studies of microbial ecology, especially with regard to culture-independent methods based on molecular phylogenies of the small-subunit ribosomal RNA gene (16S rRNA gene). The ability to relate trends at the species or genus level to host/envir...
Payyavula, Raja S.; Navarre, Duroy A.
2013-01-01
Much remains unknown about how transcription factors and sugars regulate phenylpropanoid metabolism in tuber crops like potato (Solanum tuberosum). Based on phylogeny and protein similarity to known regulators of phenylpropanoid metabolism, 15 transcription factors were selected and their expression was compared in white, yellow, red, and purple genotypes with contrasting phenolic and anthocyanin profiles. Red and purple genotypes had increased phenylalanine ammonia lyase (PAL) enzyme activity, markedly higher levels of phenylpropanoids, and elevated expression of most phenylpropanoid structural genes, including a novel anthocyanin O-methyltransferase. The transcription factors Anthocyanin1 (StAN1), basic Helix Loop Helix1 (StbHLH1), and StWD40 were more strongly expressed in red and purple potatoes. Expression of 12 other transcription factors was not associated with phenylpropanoid content, except for StMYB12B, which showed a negative relationship. Increased expression of AN1, bHLH1, and WD40 was also associated with environmentally mediated increases in tuber phenylpropanoids. Treatment of potato plantlets with sucrose induced hydroxycinnamic acids, flavonols, anthocyanins, structural genes, AN1, bHLH1, WD40, and genes encoding the sucrose-hydrolysing enzymes SUSY1, SUSY4, and INV2. Transient expression of StAN1 in tobacco leaves induced bHLH1, structural genes, SUSY1, SUSY4, and INV1, and increased phenylpropanoid amounts. StAN1 infiltration into tobacco leaves decreased sucrose and glucose concentrations. In silico promoter analysis revealed the presence of MYB and bHLH regulatory elements on sucrolytic gene promoters and sucrose-responsive elements on the AN1 promoter. These findings reveal an interesting dynamic between AN1, sucrose, and sucrose metabolic genes in modulating potato phenylpropanoids. PMID:24098049
Presence and transcriptional activity of anaerobic fungi in agricultural biogas plants.
Dollhofer, Veronika; Callaghan, Tony M; Griffith, Gareth W; Lebuhn, Michael; Bauer, Johann
2017-07-01
Bioaugmentation with anaerobic fungi (AF) is promising for improved biogas generation from lignocelluloses-rich substrates. However, before implementing AF into biogas processes it is necessary to investigate their natural occurrence, community structure and transcriptional activity in agricultural biogas plants. Thus, AF were detected with three specific PCR based methods: (i) Copies of their 18S genes were found in 7 of 10 biogas plants. (ii) Transcripts of a GH5 endoglucanase gene were present at low level in two digesters, indicating transcriptional cellulolytic activity of AF. (iii) Phylogeny of the AF-community was inferred with the 28S gene. A new Piromyces species was isolated from a PCR-positive digester. Evidence for AF was only found in biogas plants operated with high proportions of animal feces. Thus, AF were most likely transferred into digesters with animal derived substrates. Additionally, high process temperatures in combination with long retention times seemed to impede AF survival and activity. Copyright © 2017 Elsevier Ltd. All rights reserved.
A molecular phylogeny of anseriformes based on mitochondrial DNA analysis.
Donne-Goussé, Carole; Laudet, Vincent; Hänni, Catherine
2002-06-01
To study the phylogenetic relationships among Anseriformes, sequences for the complete mitochondrial control region (CR) were determined from 45 waterfowl representing 24 genera, i.e., half of the existing genera. To confirm the results based on CR analysis we also analyzed representative species based on two mitochondrial protein-coding genes, cytochrome b (cytb) and NADH dehydrogenase subunit 2 (ND2). These data allowed us to construct a robust phylogeny of the Anseriformes and to compare it with existing phylogenies based on morphological or molecular data. Chauna and Dendrocygna were identified as early offshoots of the Anseriformes. All the remaining taxa fell into two clades that correspond to the two subfamilies Anatinae and Anserinae. Within Anserinae Branta and Anser cluster together, whereas Coscoroba, Cygnus, and Cereopsis form a relatively weak clade with Cygnus diverging first. Five clades are clearly recognizable among Anatinae: (i) the Anatini with Anas and Lophonetta; (ii) the Aythyini with Aythya and Netta; (iii) the Cairinini with Cairina and Aix; (iv) the Mergini with Mergus, Bucephala, Melanitta, Callonetta, Somateria, and Clangula, and (v) the Tadornini with Tadorna, Chloephaga, and Alopochen. The Tadornini diverged early on from the Anatinae; then the Mergini and a large group that comprises the Anatini, Aythyini, Cairinini, and two isolated genera, Chenonetta and Marmaronetta, diverged. The phylogeny obtained with the control region appears more robust than the one obtained with mitochondrial protein-coding genes such as ND2 and cytb. This suggests that the CR is a powerful tool for bird phylogeny, not only at a small scale (i.e., relationships between species) but also at the family level. Whereas morphological analysis effectively resolved the split between Anatinae and Anserinae and the existence of some of the clades, the precise composition of the clades are different when morphological and molecular data are compared. (c) 2002 Elsevier Science (USA).
Gutiérrez, Verónica; Rego, Natalia; Naya, Hugo; García, Graciela
2015-10-28
Among teleosts, the South American genus Austrolebias (Cyprinodontiformes: Rivulidae) includes 42 taxa of annual fishes divided into five different species groups. It is a monophyletic genus, but morphological and molecular data do not resolve the relationship among intrageneric clades and high rates of substitution have been previously described in some mitochondrial genes. In this work, the complete mitogenome of a species of the genus was determined for the first time. We determined its structure, gene order and evolutionary peculiar features, which will allow us to evaluate the performance of mitochondrial genes in the phylogenetic resolution at different taxonomic levels. Regarding gene content and order, the circular mitogenome of A. charrua (17,271 pb) presents the typical pattern of vertebrate mitogenomes. It contains the full complement of 13 proteins-coding genes, 22 tRNA, 2 rRNA and one non-coding control region. Notably, the tRNA-Cys was only 57 bp in length and lacks the D-loop arm. In three full sibling individuals, heteroplasmatic condition was detected due to a total of 12 variable sites in seven protein-coding genes. Among cyprinodontiforms, the mitogenome of A. charrua exhibits the lowest G+C content (37 %) and GCskew, as well as the highest strand asymmetry with a net difference of T over A at 1st and 3rd codon positions. Considering the 12 coding-genes of the H strand, correspondence analyses of nucleotide composition and codon usage show that A and T at 1st and 3rd codon positions have the highest weight in the first axis, and segregate annual species from the other cyprinodontiforms analyzed. Given the annual life-style, their mitogenomes could be under different selective pressures. All 13 protein-coding genes are under strong purifying selection and we did not find any significant evidence of nucleotide sites showing episodic selection (dN >dS) at annual lineages. When fast evolving third codon positions were removed from alignments, the "supergene" tree recovers our reference species phylogeny as well as the Cytb, ND4L and ND6 genes. Therefore, third codon positions seem to be saturated in the aforementioned coding regions at intergeneric Cyprinodontiformes comparisons. The complete mitogenome obtained in present work, offers relevant data for further comparative studies on molecular phylogeny and systematics of this taxonomic controversial endemic genus of annual fishes.
Variance to mean ratio, R(t), for poisson processes on phylogenetic trees.
Goldman, N
1994-09-01
The ratio of expected variance to mean, R(t), of numbers of DNA base substitutions for contemporary sequences related by a "star" phylogeny is widely seen as a measure of the adherence of the sequences' evolution to a Poisson process with a molecular clock, as predicted by the "neutral theory" of molecular evolution under certain conditions. A number of estimators of R(t) have been proposed, all predicted to have mean 1 and distributions based on the chi 2. Various genes have previously been analyzed and found to have values of R(t) far in excess of 1, calling into question important aspects of the neutral theory. In this paper, I use Monte Carlo simulation to show that the previously suggested means and distributions of estimators of R(t) are highly inaccurate. The analysis is applied to star phylogenies and to general phylogenetic trees, and well-known gene sequences are reanalyzed. For star phylogenies the results show that Kimura's estimators ("The Neutral Theory of Molecular Evolution," Cambridge Univ. Press, Cambridge, 1983) are unsatisfactory for statistical testing of R(t), but confirm the accuracy of Bulmer's correction factor (Genetics 123: 615-619, 1989). For all three nonstar phylogenies studied, attained values of all three estimators of R(t), although larger than 1, are within their true confidence limits under simple Poisson process models. This shows that lineage effects can be responsible for high estimates of R(t), restoring some limited confidence in the molecular clock and showing that the distinction between lineage and molecular clock effects is vital.(ABSTRACT TRUNCATED AT 250 WORDS)
Evolution and diversity of Rickettsia bacteria
Weinert, Lucy A; Werren, John H; Aebi, Alexandre; Stone, Graham N; Jiggins, Francis M
2009-01-01
Background Rickettsia are intracellular symbionts of eukaryotes that are best known for infecting and causing serious diseases in humans and other mammals. All known vertebrate-associated Rickettsia are vectored by arthropods as part of their life-cycle, and many other Rickettsia are found exclusively in arthropods with no known secondary host. However, little is known about the biology of these latter strains. Here, we have identified 20 new strains of Rickettsia from arthropods, and constructed a multi-gene phylogeny of the entire genus which includes these new strains. Results We show that Rickettsia are primarily arthropod-associated bacteria, and identify several novel groups within the genus. Rickettsia do not co-speciate with their hosts but host shifts most often occur between related arthropods. Rickettsia have evolved adaptations including transmission through vertebrates and killing males in some arthropod hosts. We uncovered one case of horizontal gene transfer among Rickettsia, where a strain is a chimera from two distantly related groups, but multi-gene analysis indicates that different parts of the genome tend to share the same phylogeny. Conclusion Approximately 150 million years ago, Rickettsia split into two main clades, one of which primarily infects arthropods, and the other infects a diverse range of protists, other eukaryotes and arthropods. There was then a rapid radiation about 50 million years ago, which coincided with the evolution of life history adaptations in a few branches of the phylogeny. Even though Rickettsia are thought to be primarily transmitted vertically, host associations are short lived with frequent switching to new host lineages. Recombination throughout the genus is generally uncommon, although there is evidence of horizontal gene transfer. A better understanding of the evolution of Rickettsia will help in the future to elucidate the mechanisms of pathogenicity, transmission and virulence. PMID:19187530
Shi, Cheng-Min; Yang, Ziheng
2018-01-01
Abstract The phylogenetic relationships among extant gibbon species remain unresolved despite numerous efforts using morphological, behavorial, and genetic data and the sequencing of whole genomes. A major challenge in reconstructing the gibbon phylogeny is the radiative speciation process, which resulted in extremely short internal branches in the species phylogeny and extensive incomplete lineage sorting with extensive gene-tree heterogeneity across the genome. Here, we analyze two genomic-scale data sets, with ∼10,000 putative noncoding and exonic loci, respectively, to estimate the species tree for the major groups of gibbons. We used the Bayesian full-likelihood method bpp under the multispecies coalescent model, which naturally accommodates incomplete lineage sorting and uncertainties in the gene trees. For comparison, we included three heuristic coalescent-based methods (mp-est, SVDQuartets, and astral) as well as concatenation. From both data sets, we infer the phylogeny for the four extant gibbon genera to be (Hylobates, (Nomascus, (Hoolock, Symphalangus))). We used simulation guided by the real data to evaluate the accuracy of the methods used. Astral, while not as efficient as bpp, performed well in estimation of the species tree even in presence of excessive incomplete lineage sorting. Concatenation, mp-est and SVDQuartets were unreliable when the species tree contains very short internal branches. Likelihood ratio test of gene flow suggests a small amount of migration from Hylobates moloch to H. pileatus, while cross-genera migration is absent or rare. Our results highlight the utility of coalescent-based methods in addressing challenging species tree problems characterized by short internal branches and rampant gene tree-species tree discordance. PMID:29087487
Genome-Wide Identification of the Invertase Gene Family in Populus.
Chen, Zhong; Gao, Kai; Su, Xiaoxing; Rao, Pian; An, Xinmin
2015-01-01
Invertase plays a crucial role in carbohydrate partitioning and plant development as it catalyses the irreversible hydrolysis of sucrose into glucose and fructose. The invertase family in plants is composed of two sub-families: acid invertases, which are targeted to the cell wall and vacuole; and neutral/alkaline invertases, which function in the cytosol. In this study, 5 cell wall invertase genes (PtCWINV1-5), 3 vacuolar invertase genes (PtVINV1-3) and 16 neutral/alkaline invertase genes (PtNINV1-16) were identified in the Populus genome and found to be distributed on 14 chromosomes. A comprehensive analysis of poplar invertase genes was performed, including structures, chromosome location, phylogeny, evolutionary pattern and expression profiles. Phylogenetic analysis indicated that the two sub-families were both divided into two clades. Segmental duplication is contributed to neutral/alkaline sub-family expansion. Furthermore, the Populus invertase genes displayed differential expression in roots, stems, leaves, leaf buds and in response to salt/cold stress and pathogen infection. In addition, the analysis of enzyme activity and sugar content revealed that invertase genes play key roles in the sucrose metabolism of various tissues and organs in poplar. This work lays the foundation for future functional analysis of the invertase genes in Populus and other woody perennials.
Genome-Wide Identification of the Invertase Gene Family in Populus
Su, Xiaoxing; Rao, Pian; An, Xinmin
2015-01-01
Invertase plays a crucial role in carbohydrate partitioning and plant development as it catalyses the irreversible hydrolysis of sucrose into glucose and fructose. The invertase family in plants is composed of two sub-families: acid invertases, which are targeted to the cell wall and vacuole; and neutral/alkaline invertases, which function in the cytosol. In this study, 5 cell wall invertase genes (PtCWINV1-5), 3 vacuolar invertase genes (PtVINV1-3) and 16 neutral/alkaline invertase genes (PtNINV1-16) were identified in the Populus genome and found to be distributed on 14 chromosomes. A comprehensive analysis of poplar invertase genes was performed, including structures, chromosome location, phylogeny, evolutionary pattern and expression profiles. Phylogenetic analysis indicated that the two sub-families were both divided into two clades. Segmental duplication is contributed to neutral/alkaline sub-family expansion. Furthermore, the Populus invertase genes displayed differential expression in roots, stems, leaves, leaf buds and in response to salt/cold stress and pathogen infection. In addition, the analysis of enzyme activity and sugar content revealed that invertase genes play key roles in the sucrose metabolism of various tissues and organs in poplar. This work lays the foundation for future functional analysis of the invertase genes in Populus and other woody perennials. PMID:26393355
Malviya, N; Gupta, S; Singh, V K; Yadav, M K; Bisht, N C; Sarangi, B K; Yadav, D
2015-02-01
The DNA binding with One Finger (Dof) protein is a plant specific transcription factor involved in the regulation of wide range of processes. The analysis of whole genome sequence of pigeonpea has identified 38 putative Dof genes (CcDof) distributed on 8 chromosomes. A total of 17 out of 38 CcDof genes were found to be intronless. A comprehensive in silico characterization of CcDof gene family including the gene structure, chromosome location, protein motif, phylogeny, gene duplication and functional divergence has been attempted. The phylogenetic analysis resulted in 3 major clusters with closely related members in phylogenetic tree revealed common motif distribution. The in silico cis-regulatory element analysis revealed functional diversity with predominance of light responsive and stress responsive elements indicating the possibility of these CcDof genes to be associated with photoperiodic control and biotic and abiotic stress. The duplication pattern showed that tandem duplication is predominant over segmental duplication events. The comparative phylogenetic analysis of these Dof proteins along with 78 soybean, 36 Arabidopsis and 30 rice Dof proteins revealed 7 major clusters. Several groups of orthologs and paralogs were identified based on phylogenetic tree constructed. Our study provides useful information for functional characterization of CcDof genes.
Boyd, Eric S.; Barkay, Tamar
2012-01-01
Mercuric mercury (Hg[II]) is a highly toxic and mobile element that is likely to have had a pronounced and adverse effect on biology since Earth’s oxygenation ∼2.4 billion years ago due to its high affinity for protein sulfhydryl groups, which upon binding destabilize protein structure and decrease enzyme activity, resulting in a decreased organismal fitness. The central enzyme in the microbial mercury detoxification system is the mercuric reductase (MerA) protein, which catalyzes the reduction of Hg(II) to volatile Hg(0). In addition to MerA, mer operons encode for proteins involved in regulation, Hg binding, and organomercury degradation. Mer-mediated approaches have had broad applications in the bioremediation of mercury-contaminated environments and industrial waste streams. Here, we examine the composition of 272 individual mer operons and quantitatively map the distribution of mer-encoded functions on both taxonomic SSU rRNA gene and MerA phylogenies. The results indicate an origin and early evolution of MerA among thermophilic bacteria and an overall increase in the complexity of mer operons through evolutionary time, suggesting continual gene recruitment and evolution leading to an improved efficiency and functional potential of the Mer detoxification system. Consistent with a positive relationship between the evolutionary history and topology of MerA and SSU rRNA gene phylogenies (Mantel R = 0.81, p < 0.01), the distribution of the majority of mer functions, when mapped on these phylograms, indicates an overall tendency to inherit mer-encoded functions through vertical descent. However, individual mer functions display evidence of a variable degree of vertical inheritance, with several genes exhibiting strong evidence for acquisition via lateral gene transfer and/or gene loss. Collectively, these data suggest that (i) mer has evolved from a simple system in geothermal environments to a widely distributed and more complex and efficient detoxification system, and (ii) merA is a suitable biomarker for examining the functional diversity of Hg detoxification and for predicting the composition of mer operons in natural environments. PMID:23087676
Keshri, Jitendra; Mishra, Avinash; Jha, Bhavanath
2013-03-30
Population indices of bacteria and archaea were investigated from saline-alkaline soil and a possible microbe-environment pattern was established using gene targeted metagenomics. Clone libraries were constructed using 16S rRNA and functional gene(s) involved in carbon fixation (cbbL), nitrogen fixation (nifH), ammonia oxidation (amoA) and sulfur metabolism (apsA). Molecular phylogeny revealed the dominance of Actinobacteria, Firmicutes and Proteobacteria along with archaeal members of Halobacteraceae. The library consisted of novel bacterial (20%) and archaeal (38%) genera showing ≤95% similarity to previously retrieved sequences. Phylogenetic analysis indicated ability of inhabitant to survive in stress condition. The 16S rRNA gene libraries contained novel gene sequences and were distantly homologous with cultured bacteria. Functional gene libraries were found unique and most of the clones were distantly related to Proteobacteria, while clones of nifH gene library also showed homology with Cyanobacteria and Firmicutes. Quantitative real-time PCR exhibited that bacterial abundance was two orders of magnitude higher than archaeal. The gene(s) quantification indicated the size of the functional guilds harboring relevant key genes. The study provides insights on microbial ecology and different metabolic interactions occurring in saline-alkaline soil, possessing phylogenetically diverse groups of bacteria and archaea, which may be explored further for gene cataloging and metabolic profiling. Copyright © 2012 Elsevier GmbH. All rights reserved.
Sielaff, Malte; Schmidt, Hanno; Struck, Torsten H; Rosenkranz, David; Mark Welch, David B; Hankeln, Thomas; Herlyn, Holger
2016-03-01
A monophyletic origin of endoparasitic thorny-headed worms (Acanthocephala) and wheel-animals (Rotifera) is widely accepted. However, the phylogeny inside the clade, be it called Syndermata or Rotifera, has lacked validation by mitochondrial (mt) data. Herein, we present the first mt genome of the key taxon Seison and report conflicting results of phylogenetic analyses: while mt sequence-based topologies showed monophyletic Lemniscea (Bdelloidea+Acanthocephala), gene order analyses supported monophyly of Pararotatoria (Seisonidea+Acanthocephala) and Hemirotifera (Bdelloidea+Pararotatoria). Sequence-based analyses obviously suffered from substitution saturation, compositional bias, and branch length heterogeneity; however, we observed no compromising effects in gene order analyses. Moreover, gene order-based topologies were robust to changes in coding (genes vs. gene pairs, two-state vs. multistate, aligned vs. non-aligned), tree reconstruction methods, and the treatment of the two monogonont mt genomes. Thus, mt gene order verifies seisonids as sister to acanthocephalans within monophyletic Hemirotifera, while deviating results of sequence-based analyses reflect artificial signal. This conclusion implies that the complex life cycle of extant acanthocephalans evolved from a free-living state, as retained by most monogononts and bdelloids, via an epizoic state with a simple life cycle, as shown by seisonids. Hence, Acanthocephala represent a rare example where ancestral transitional stages have counterparts amongst the closest relatives. Copyright © 2015 Elsevier Inc. All rights reserved.
Zhang, Ning; Wen, Jun; Zimmer, Elizabeth A.
2015-01-01
Vitaceae is well-known for having one of the most economically important fruits, i.e., the grape (Vitis vinifera). The deep phylogeny of the grape family was not resolved until a recent phylogenomic analysis of 417 nuclear genes from transcriptome data. However, it has been reported extensively that topologies based on nuclear and organellar genes may be incongruent due to differences in their evolutionary histories. Therefore, it is important to reconstruct a backbone phylogeny of the grape family using plastomes and mitochondrial genes. In this study, next-generation sequencing data sets of 27 species were obtained using genome skimming with total DNAs from silica-gel preserved tissue samples on an Illumina HiSeq 2500 instrument. Plastomes were assembled using the combination of de novo and reference genome (of V. vinifera) methods. Sixteen mitochondrial genes were also obtained via genome skimming using the reference genome of V. vinifera. Extensive phylogenetic analyses were performed using maximum likelihood and Bayesian methods. The topology based on either plastome data or mitochondrial genes is congruent with the one using hundreds of nuclear genes, indicating that the grape family did not exhibit significant reticulation at the deep level. The results showcase the power of genome skimming in capturing extensive phylogenetic data: especially from chloroplast and mitochondrial DNAs. PMID:26656830
Zhang, Ning; Wen, Jun; Zimmer, Elizabeth A
2015-01-01
Vitaceae is well-known for having one of the most economically important fruits, i.e., the grape (Vitis vinifera). The deep phylogeny of the grape family was not resolved until a recent phylogenomic analysis of 417 nuclear genes from transcriptome data. However, it has been reported extensively that topologies based on nuclear and organellar genes may be incongruent due to differences in their evolutionary histories. Therefore, it is important to reconstruct a backbone phylogeny of the grape family using plastomes and mitochondrial genes. In this study,next-generation sequencing data sets of 27 species were obtained using genome skimming with total DNAs from silica-gel preserved tissue samples on an Illumina NextSeq 500 instrument [corrected]. Plastomes were assembled using the combination of de novo and reference genome (of V. vinifera) methods. Sixteen mitochondrial genes were also obtained via genome skimming using the reference genome of V. vinifera. Extensive phylogenetic analyses were performed using maximum likelihood and Bayesian methods. The topology based on either plastome data or mitochondrial genes is congruent with the one using hundreds of nuclear genes, indicating that the grape family did not exhibit significant reticulation at the deep level. The results showcase the power of genome skimming in capturing extensive phylogenetic data: especially from chloroplast and mitochondrial DNAs.
Peng, Yingmei; Cai, Jing; Wang, Wen; Su, Bing
2012-01-01
Pepcase is a gene encoding phosphoenolpyruvate carboxylase that exists in bacteria, archaea and plants,playing an important role in plant metabolism and development. Most plants have two or more pepcase genes belonging to two gene sub-families, while only one gene exists in other organisms. Previous research categorized one plant pepcase gene as plant-type pepcase (PTPC) while the other as bacteria-type pepcase (BTPC) because of its similarity with the pepcase gene found in bacteria. Phylogenetic reconstruction showed that PTPC is the ancestral lineage of plant pepcase, and that all bacteria, protistpepcase and BTPC in plants are derived from a lineage of pepcase closely related with PTPC in algae. However, their phylogeny contradicts the species tree and traditional chronology of organism evolution. Because the diversification of bacteria occurred much earlier than the origin of plants, presumably all bacterialpepcase derived from the ancestral PTPC of algal plants after divergingfrom the ancestor of vascular plant PTPC. To solve this contradiction, we reconstructed the phylogeny of pepcase gene family. Our result showed that both PTPC and BTPC are derived from an ancestral lineage of gamma-proteobacteriapepcases, possibly via an ancient inter-kingdom horizontal gene transfer (HGT) from bacteria to the eukaryotic common ancestor of plants, protists and cellular slime mold. Our phylogenetic analysis also found 48other pepcase genes originated from inter-kingdom HGTs. These results imply that inter-kingdom HGTs played important roles in the evolution of the pepcase gene family and furthermore that HGTsare a more frequent evolutionary event than previouslythought.
Carella, Mirco; Agell, Gemma; Cárdenas, Paco; Uriz, Maria J.
2016-01-01
Species of Tetillidae are distributed worldwide. However, some genera are unresolved and only a few genera and species of this family have been described from the Antarctic. The incorporation of 25 new COI and 18S sequences of Antarctic Tetillidae to those used recently for assessing the genera phylogeny, has allowed us to improve the resolution of some poorly resolved nodes and to confirm the monophyly of previously identified clades. Classical genera such as Craniella recovered their traditional diagnosis by moving the Antarctic Tetilla from Craniella, where they were placed in the previous family phylogeny, to Antarctotetilla gen. nov. The morphological re-examination of specimens used in the previous phylogeny and their comparison to the type material revealed misidentifications. The proposed monotypic new genus Levantinella had uncertain phylogenetic relationships depending on the gene partition used. Two more clades would require the inclusion of additional species to be formally established as new genera. The parsimony tree based on morphological characters and the secondary structure of the 18S (V4 region) almost completely matched the COI M1-M6 and the COI+18S concatenated phylogenies. Morphological synapomorphies have been identified for the genera proposed. New 15 28S (D3-D5) and 11 COI I3-M11 partitions were exclusively sequenced for the Antarctic species subset. Remarkably, species within the Antarctic genera Cinachyra (C. barbata and C. antarctica) and Antarctotetilla (A. leptoderma, A. grandis, and A. sagitta), which are clearly distinguishable morphologically, were not genetically differentiated with any of the markers assayed. Thus, as it has been reported for other Antarctic sponges, both the mitochondrial and nuclear partitions used did not differentiate species that were well characterized morphologically. Antarctic Tetillidae offers a rare example of genetically cryptic (with the traditional markers used for sponges), morphologically distinct species. PMID:27557130
Phylogeny of metabolic networks: a spectral graph theoretical approach.
Deyasi, Krishanu; Banerjee, Anirban; Deb, Bony
2015-10-01
Many methods have been developed for finding the commonalities between different organisms in order to study their phylogeny. The structure of metabolic networks also reveals valuable insights into metabolic capacity of species as well as into the habitats where they have evolved. We constructed metabolic networks of 79 fully sequenced organisms and compared their architectures. We used spectral density of normalized Laplacian matrix for comparing the structure of networks. The eigenvalues of this matrix reflect not only the global architecture of a network but also the local topologies that are produced by different graph evolutionary processes like motif duplication or joining. A divergence measure on spectral densities is used to quantify the distances between various metabolic networks, and a split network is constructed to analyse the phylogeny from these distances. In our analysis, we focused on the species that belong to different classes, but appear more related to each other in the phylogeny. We tried to explore whether they have evolved under similar environmental conditions or have similar life histories. With this focus, we have obtained interesting insights into the phylogenetic commonality between different organisms.
Insight into the Evolution of the Histidine Triad Protein (HTP) Family in Streptococcus
Pan, Xiu-Zhen; Wang, Bin; Chen, Jian-Qun
2013-01-01
The Histidine Triad Proteins (HTPs), also known as Pht proteins in Streptococcus pneumoniae, constitute a family of surface-exposed proteins that exist in many pathogenic streptococcal species. Although many studies have revealed the importance of HTPs in streptococcal physiology and pathogenicity, little is known about their origin and evolution. In this study, after identifying all htp homologs from 105 streptococcal genomes representing 38 different species/subspecies, we analyzed their domain structures, positions in genome, and most importantly, their evolutionary histories. By further projecting this information onto the streptococcal phylogeny, we made several major findings. First, htp genes originated earlier than the Streptococcus genus and gene-loss events have occurred among three streptococcal groups, resulting in the absence of the htp gene in the Bovis, Mutans and Salivarius groups. Second, the copy number of htp genes in other groups of Streptococcus is variable, ranging from one to four functional copies. Third, both phylogenetic evidence and domain structure analyses support the division of two htp subfamilies, designated as htp I and htp II. Although present mainly in the pyogenic group and in Streptococcus suis, htp II members are distinct from htp I due to the presence of an additional leucine-rich-repeat domain at the C-terminus. Finally, htp genes exhibit a faster nucleotide substitution rate than do housekeeping genes. Specifically, the regions outside the HTP domains are under strong positive selection. This distinct evolutionary pattern likely helped Streptococcus to easily escape from recognition by host immunity. PMID:23527301
A Stochastic Evolutionary Model for Protein Structure Alignment and Phylogeny
Challis, Christopher J.; Schmidler, Scott C.
2012-01-01
We present a stochastic process model for the joint evolution of protein primary and tertiary structure, suitable for use in alignment and estimation of phylogeny. Indels arise from a classic Links model, and mutations follow a standard substitution matrix, whereas backbone atoms diffuse in three-dimensional space according to an Ornstein–Uhlenbeck process. The model allows for simultaneous estimation of evolutionary distances, indel rates, structural drift rates, and alignments, while fully accounting for uncertainty. The inclusion of structural information enables phylogenetic inference on time scales not previously attainable with sequence evolution models. The model also provides a tool for testing evolutionary hypotheses and improving our understanding of protein structural evolution. PMID:22723302
Ruane, Sara; Raxworthy, Christopher J; Lemmon, Alan R; Lemmon, Emily Moriarty; Burbrink, Frank T
2015-10-12
Using molecular data generated by high throughput next generation sequencing (NGS) platforms to infer phylogeny is becoming common as costs go down and the ability to capture loci from across the genome goes up. While there is a general consensus that greater numbers of independent loci should result in more robust phylogenetic estimates, few studies have compared phylogenies resulting from smaller datasets for commonly used genetic markers with the large datasets captured using NGS. Here, we determine how a 5-locus Sanger dataset compares with a 377-locus anchored genomics dataset for understanding the evolutionary history of the pseudoxyrhophiine snake radiation centered in Madagascar. The Pseudoxyrhophiinae comprise ~86 % of Madagascar's serpent diversity, yet they are poorly known with respect to ecology, behavior, and systematics. Using the 377-locus NGS dataset and the summary statistics species-tree methods STAR and MP-EST, we estimated a well-supported species tree that provides new insights concerning intergeneric relationships for the pseudoxyrhophiines. We also compared how these and other methods performed with respect to estimating tree topology using datasets with varying numbers of loci. Using Sanger sequencing and an anchored phylogenomics approach, we sequenced datasets comprised of 5 and 377 loci, respectively, for 23 pseudoxyrhophiine taxa. For each dataset, we estimated phylogenies using both gene-tree (concatenation) and species-tree (STAR, MP-EST) approaches. We determined the similarity of resulting tree topologies from the different datasets using Robinson-Foulds distances. In addition, we examined how subsets of these data performed compared to the complete Sanger and anchored datasets for phylogenetic accuracy using the same tree inference methodologies, as well as the program *BEAST to determine if a full coalescent model for species tree estimation could generate robust results with fewer loci compared to the summary statistics species tree approaches. We also examined the individual gene trees in comparison to the 377-locus species tree using the program MetaTree. Using the full anchored dataset under a variety of methods gave us the same, well-supported phylogeny for pseudoxyrhophiines. The African pseudoxyrhophiine Duberria is the sister taxon to the Malagasy pseudoxyrhophiines genera, providing evidence for a monophyletic radiation in Madagascar. In addition, within Madagascar, the two major clades inferred correspond largely to the aglyphous and opisthoglyphous genera, suggesting that feeding specializations associated with tooth venom delivery may have played a major role in the early diversification of this radiation. The comparison of tree topologies from the concatenated and species-tree methods using different datasets indicated the 5-locus dataset cannot beused to infer a correct phylogeny for the pseudoxyrhophiines under any method tested here and that summary statistics methods require 50 or more loci to consistently recover the species-tree inferred using the complete anchored dataset. However, as few as 15 loci may infer the correct topology when using the full coalescent species tree method *BEAST. MetaTree analyses of each gene tree from the Sanger and anchored datasets found that none of the individual gene trees matched the 377-locus species tree, and that no gene trees were identical with respect to topology. Our results suggest that ≥50 loci may be necessary to confidently infer phylogenies when using summaryspecies-tree methods, but that the coalescent-based method *BEAST consistently recovers the same topology using only 15 loci. These results reinforce that datasets with small numbers of markers may result in misleading topologies, and further, that the method of inference used to generate a phylogeny also has a major influence on the number of loci necessary to infer robust species trees.
2008-01-01
Background Within the subfamily Murinae, African murines represent 25% of species biodiversity, making this group ideal for detailed studies of the patterns and timing of diversification of the African endemic fauna and its relationships with Asia. Here we report the results of phylogenetic analyses of the endemic African murines through a broad sampling of murine diversity from all their distribution area, based on the mitochondrial cytochrome b gene and the two nuclear gene fragments (IRBP exon 1 and GHR). Results A combined analysis of one mitochondrial and two nuclear gene sequences consistently identified and robustly supported ten primary lineages within Murinae. We propose to formalize a new tribal arrangement within the Murinae that reflects this phylogeny. The diverse African murine assemblage includes members of five of the ten tribes and clearly derives from multiple faunal exchanges between Africa and Eurasia. Molecular dating analyses using a relaxed Bayesian molecular clock put the first colonization of Africa around 11 Mya, which is consistent with the fossil record. The main period of African murine diversification occurred later following disruption of the migration route between Africa and Asia about 7–9 Mya. A second period of interchange, dating to around 5–6.5 Mya, saw the arrival in Africa of Mus (leading to the speciose endemic Nannomys), and explains the appearance of several distinctive African lineages in the late Miocene and Pliocene fossil record of Eurasia. Conclusion Our molecular survey of Murinae, which includes the most complete sampling so far of African taxa, indicates that there were at least four separate radiations within the African region, as well as several phases of dispersal between Asia and Africa during the last 12 My. We also reconstruct the phylogenetic structure of the Murinae, and propose a new classification at tribal level for this traditionally problematic group. PMID:18616808
Figueroa, Diego F.; Baco, Amy R.
2015-01-01
We use full mitochondrial genomes to test the robustness of the phylogeny of the Octocorallia, to determine the evolutionary pathway for the five known mitochondrial gene rearrangements in octocorals, and to test the suitability of using mitochondrial genomes for higher taxonomic-level phylogenetic reconstructions. Our phylogeny supports three major divisions within the Octocorallia and show that Paragorgiidae is paraphyletic, with Sibogagorgia forming a sister branch to the Coralliidae. Furthermore, Sibogagorgia cauliflora has what is presumed to be the ancestral gene order in octocorals, but the presence of a pair of inverted repeat sequences suggest that this gene order was not conserved but rather evolved back to this apparent ancestral state. Based on this we recommend the resurrection of the family Sibogagorgiidae to fix the paraphyly of the Paragorgiidae. This is the first study to show that in the Octocorallia, mitochondrial gene orders have evolved back to an ancestral state after going through a gene rearrangement, with at least one of the gene orders evolving independently in different lineages. A number of studies have used gene boundaries to determine the type of mitochondrial gene arrangement present. However, our findings suggest that this method known as gene junction screening may miss evolutionary reversals. Additionally, substitution saturation analysis demonstrates that while whole mitochondrial genomes can be used effectively for phylogenetic analyses within Octocorallia, their utility at higher taxonomic levels within Cnidaria is inadequate. Therefore for phylogenetic reconstruction at taxonomic levels higher than subclass within the Cnidaria, nuclear genes will be required, even when whole mitochondrial genomes are available. PMID:25539723
Krak, Karol; Alvarez, Inés; Caklová, Petra; Costa, Andrea; Chrtek, Jindrich; Fehrer, Judith
2012-02-01
The development of three low-copy nuclear markers for low taxonomic level phylogenies in Asteraceae with emphasis on the subtribe Hieraciinae is reported. Marker candidates were selected by comparing a Lactuca complementary DNA (cDNA) library with public DNA sequence databases. Interspecific variation and phylogenetic signal of the selected genes were investigated for diploid taxa from the subtribe Hieraciinae and compared to a reference phylogeny. Their ability to cross-amplify was assessed for other Asteraceae tribes. All three markers had higher variation (2.1-4.5 times) than the internal transcribed spacer (ITS) in Hieraciinae. Cross-amplification was successful in at least seven other tribes of the Asteraceae. Only three cases indicating the presence of paralogs or pseudogenes were detected. The results demonstrate the potential of these markers for phylogeny reconstruction in the Hieraciinae as well as in other Asteraceae tribes, especially for very closely related species.
Carvalho, Tiago P; Arce H, Mariangeles; Reis, Roberto E; Sabaj, Mark H
2018-04-30
The family Aspredinidae is a moderately diverse and broadly distributed group of freshwater fishes endemic to South America. Commonly known as Banjo Catfishes, Aspredinidae currently includes 44 valid species divided among 13 genera. The first species-comprehensive hypothesis on phylogenetic relationships among aspredinids is presented. The phylogeny is based on DNA sequence data for five gene fragments (mitochondrial 16S and COI; nuclear RAG1, MYH6 and SH3PX3) from 114 individuals representing 31 species in 12 aspredinid genera. Analyses of molecular data support the monophyly of most genera (Bunocephalus excepted) and several higher-level relationships previously proposed by morphological studies. Based on the molecular phylogeny, a new suprageneric classification for Aspredinidae is proposed with the new monotypic subfamily Pseudobunocephalinae as the sister taxon to all other aspredinids. Copyright © 2018 Elsevier Inc. All rights reserved.
Li, Jinlu; Yu, Jing; Wang, Ling; Yang, Xueying
2018-01-01
Maleae consists of economically and ecologically important plants. However, there are considerable disputes on generic circumscription due to the lack of a reliable phylogeny at generic level. In this study, molecular phylogeny of 35 generally accepted genera in Maleae is established using 15 chloroplast regions. Gillenia is the most basal clade of Maleae, followed by Kageneckia + Lindleya, Vauquelinia, and a typical radiation clade, the core Maleae, suggesting that the proposal of four subtribes is reasonable. In the core Maleae including 31 genera, chloroplast gene data support that the four Malus-related genera should better be merged into one genus and the six Sorbus-related genera would be classified into two genera, whereas all Photinia-related genera should be accepted as distinct genera. Although the phylogenetic relationships among the genera in Maleae are much clearer than before, it is still premature to make a formal taxonomic treatment for these genera. PMID:29750171
Phylogeny of the owlet-nightjars (Aves: Aegothelidae) based on mitochondrial DNA sequence
Dumbacher, J.P.; Pratt, T.K.; Fleischer, R.C.
2003-01-01
The avian family Aegothelidae (Owlet-nightjars) comprises nine extant species and one extinct species, all of which are currently classified in a single genus, Aegotheles. Owlet-nightjars are secretive nocturnal birds of the South Pacific. They are relatively poorly studied and some species are known from only a few specimens. Furthermore, their confusing morphological variation has made it difficult to cluster existing specimens unambiguously into hierarchical taxonomic units. Here we sample all extant owlet-nightjar species and all but three currently recognized subspecies. We use DNA extracted primarily from museum specimens to obtain mitochondrial gene sequences and construct a molecular phylogeny. Our phylogeny suggests that most species are reciprocally monophyletic, however A. albertisi appears paraphyletic. Our data also suggest splitting A. bennettii into two species and splitting A. insignis and A. tatei as suggested in another recent paper. ?? 2003 Elsevier Science (USA). All rights reserved.
2007-01-01
TYPE 3. DATES COVERED 00-00-2007 to 00-00-2007 4. TITLE AND SUBTITLE Phylogeny of the Leucosphyrus Group of Anopheles (Cellia) (Diptera...ACRONYM(S) 11. SPONSOR/MONITOR’S REPORT NUMBER(S) 12. DISTRIBUTION/ AVAILABILITY STATEMENT Approved for public release; distribution unlimited 13...by 4. cycles of 45 s at 94°C. 45 s at 50°C and 1 min at 7’r’C, with a final extension of 7 min at 72°C. PeR products were elec- trophoresed in 2
The Biogeography of Deep Time Phylogenetic Reticulation.
Burbrink, Frank T; Gehara, Marcelo
2018-03-09
Most phylogenies are typically represented as purely bifurcating. However, as genomic data has become more common in phylogenetic studies, it is not unusual to find reticulation among terminal lineages or among internal nodes (deep time reticulation; DTR). In these situations, gene flow must have happened in the same or adjacent geographic areas for these DTRs to have occurred and therefore biogeographic reconstruction should provide similar area estimates for parental nodes, provided extinction or dispersal has not eroded these patterns. We examine the phylogeny of the widely distributed New World kingsnakes (Lampropeltis), determine if DTR is present in this group, and estimate the ancestral area for reticulation. Importantly, we develop a new method that uses coalescent simulations in a machine learning framework to show conclusively that this phylogeny is best represented as reticulating at deeper time. Using joint probabilities of ancestral area reconstructions on the bifurcating parental lineages from the reticulating node, we show that this reticulation likely occurred in northwestern Mexico/southwestern US and subsequently led to the diversification of the Mexican kingsnakes. This region has been previously identified as an area important for understanding speciation and secondary contact with gene flow in snakes and other squamates. This research shows that phylogenetic reticulation is common, even in well-studied groups, and that the geographic scope of ancient hybridization is recoverable.
Brouard, Jean-Simon; Otis, Christian; Lemieux, Claude; Turmel, Monique
2008-01-01
Background To gain insight into the branching order of the five main lineages currently recognized in the green algal class Chlorophyceae and to expand our understanding of chloroplast genome evolution, we have undertaken the sequencing of chloroplast DNA (cpDNA) from representative taxa. The complete cpDNA sequences previously reported for Chlamydomonas (Chlamydomonadales), Scenedesmus (Sphaeropleales), and Stigeoclonium (Chaetophorales) revealed tremendous variability in their architecture, the retention of only few ancestral gene clusters, and derived clusters shared by Chlamydomonas and Scenedesmus. Unexpectedly, our recent phylogenies inferred from these cpDNAs and the partial sequences of three other chlorophycean cpDNAs disclosed two major clades, one uniting the Chlamydomonadales and Sphaeropleales (CS clade) and the other uniting the Oedogoniales, Chaetophorales and Chaetopeltidales (OCC clade). Although molecular signatures provided strong support for this dichotomy and for the branching of the Oedogoniales as the earliest-diverging lineage of the OCC clade, more data are required to validate these phylogenies. We describe here the complete cpDNA sequence of Oedogonium cardiacum (Oedogoniales). Results Like its three chlorophycean homologues, the 196,547-bp Oedogonium chloroplast genome displays a distinctive architecture. This genome is one of the most compact among photosynthetic chlorophytes. It has an atypical quadripartite structure, is intron-rich (17 group I and 4 group II introns), and displays 99 different conserved genes and four long open reading frames (ORFs), three of which are clustered in the spacious inverted repeat of 35,493 bp. Intriguingly, two of these ORFs (int and dpoB) revealed high similarities to genes not usually found in cpDNA. At the gene content and gene order levels, the Oedogonium genome most closely resembles its Stigeoclonium counterpart. Characters shared by these chlorophyceans but missing in members of the CS clade include the retention of psaM, rpl32 and trnL(caa), the loss of petA, the disruption of three ancestral clusters and the presence of five derived gene clusters. Conclusion The Oedogonium chloroplast genome disclosed additional characters that bolster the evidence for a close alliance between the Oedogoniales and Chaetophorales. Our unprecedented finding of int and dpoB in this cpDNA provides a clear example that novel genes were acquired by the chloroplast genome through horizontal transfers, possibly from a mitochondrial genome donor. PMID:18558012
Kropáčková, Lucie; Těšický, Martin; Albrecht, Tomáš; Kubovčiak, Jan; Čížková, Dagmar; Tomášek, Oldřich; Martin, Jean-François; Bobek, Lukáš; Králová, Tereza; Procházka, Petr; Kreisinger, Jakub
2017-10-01
Vertebrate gut microbiota (GM) is comprised of a taxonomically diverse consortium of symbiotic and commensal microorganisms that have a pronounced effect on host physiology, immune system function and health status. Despite much research on interactions between hosts and their GM, the factors affecting inter- and intraspecific GM variation in wild populations are still poorly known. We analysed data on faecal microbiota composition in 51 passerine species (319 individuals) using Illumina MiSeq sequencing of bacterial 16S rRNA (V3-V4 variable region). Despite pronounced interindividual variation, GM composition exhibited significant differences at the interspecific level, accounting for approximately 20%-30% of total GM variation. We also observed a significant correlation between GM composition divergence and host's phylogenetic divergence, with strength of correlation higher than that of GM vs. ecological or life history traits and geographic variation. The effect of host's phylogeny on GM composition was significant, even after statistical control for these confounding factors. Hence, our data do not support codiversification of GM and passerine phylogeny solely as a by-product of their ecological divergence. Furthermore, our findings do not support that GM vs. host's phylogeny codiversification is driven primarily through trans-generational GM transfer as the GM vs. phylogeny correlation does not increase with higher sequence similarity used when delimiting operational taxonomic units. Instead, we hypothesize that the GM vs. phylogeny correlation may arise as a consequence of interspecific divergence of genes that directly or indirectly modulate composition of GM. © 2017 John Wiley & Sons Ltd.
Turner, Hubert; Lieshout, Niek; Van Ginkel, Wil E.; Menken, Steph B. J.
2010-01-01
Background The small ermine moth genus Yponomeuta (Lepidoptera, Yponomeutidae) contains 76 species that are specialist feeders on hosts from Celastraceae, Rosaceae, Salicaceae, and several other plant families. The genus is a model for studies in the evolution of phytophagous insects and their host-plant associations. Here, we reconstruct the phylogeny to provide a solid framework for these studies, and to obtain insight into the history of host-plant use and the biogeography of the genus. Methodology/Principal Findings DNA sequences from an internal transcribed spacer region (ITS-1) and from the 16S rDNA (16S) and cytochrome oxidase (COII) mitochondrial genes were collected from 20–23 (depending on gene) species and two outgroup taxa to reconstruct the phylogeny of the Palaearctic members of this genus. Sequences were analysed using three different phylogenetic methods (parsimony, likelihood, and Bayesian inference). Conclusions/Significance Roughly the same patterns are retrieved irrespective of the method used, and they are similar among the three genes. Monophyly is well supported for a clade consisting of the Japanese (but not the Dutch) population of Yponomeuta sedellus and Y. yanagawanus, a Y. kanaiellus–polystictus clade, and a Rosaceae-feeding, western Palaearctic clade (Y. cagnagellus–irrorellus clade). Within these clades, relationships are less well supported, and the patterns between the different gene trees are not so similar. The position of the remaining taxa is also variable among the gene trees and rather weakly supported. The phylogenetic information was used to elucidate patterns of biogeography and resource use. In the Palaearctic, the genus most likely originated in the Far East, feeding on Celastraceae, dispersing to the West concomitant with a shift to Rosaceae and further to Salicaceae. The association of Y. cagnagellus with Euonymus europaeus (Celastraceae), however, is a reversal. The only oligophagous species, Y. padellus, belongs to the derived western Palaearctic clade, evidence that specialisation is reversible. PMID:20360968
Genome-wide analysis of putative peroxiredoxin in unicellular and filamentous cyanobacteria.
Cui, Hongli; Wang, Yipeng; Wang, Yinchu; Qin, Song
2012-11-16
Cyanobacteria are photoautotrophic prokaryotes with wide variations in genome sizes and ecological habitats. Peroxiredoxin (PRX) is an important protein that plays essential roles in protecting own cells against reactive oxygen species (ROS). PRXs have been identified from mammals, fungi and higher plants. However, knowledge on cyanobacterial PRXs still remains obscure. With the availability of 37 sequenced cyanobacterial genomes, we performed a comprehensive comparative analysis of PRXs and explored their diversity, distribution, domain structure and evolution. Overall 244 putative prx genes were identified, which were abundant in filamentous diazotrophic cyanobacteria, Acaryochloris marina MBIC 11017, and unicellular cyanobacteria inhabiting freshwater and hot-springs, while poor in all Prochlorococcus and marine Synechococcus strains. Among these putative genes, 25 open reading frames (ORFs) encoding hypothetical proteins were identified as prx gene family members and the others were already annotated as prx genes. All 244 putative PRXs were classified into five major subfamilies (1-Cys, 2-Cys, BCP, PRX5_like, and PRX-like) according to their domain structures. The catalytic motifs of the cyanobacterial PRXs were similar to those of eukaryotic PRXs and highly conserved in all but the PRX-like subfamily. Classical motif (CXXC) of thioredoxin was detected in protein sequences from the PRX-like subfamily. Phylogenetic tree constructed of catalytic domains coincided well with the domain structures of PRXs and the phylogenies based on 16s rRNA. The distribution of genes encoding PRXs in different unicellular and filamentous cyanobacteria especially those sub-families like PRX-like or 1-Cys PRX correlate with the genome size, eco-physiology, and physiological properties of the organisms. Cyanobacterial and eukaryotic PRXs share similar conserved motifs, indicating that cyanobacteria adopt similar catalytic mechanisms as eukaryotes. All cyanobacterial PRX proteins share highly similar structures, implying that these genes may originate from a common ancestor. In this study, a general framework of the sequence-structure-function connections of the PRXs was revealed, which may facilitate functional investigations of PRXs in various organisms.
Genome-wide analysis of putative peroxiredoxin in unicellular and filamentous cyanobacteria
2012-01-01
Background Cyanobacteria are photoautotrophic prokaryotes with wide variations in genome sizes and ecological habitats. Peroxiredoxin (PRX) is an important protein that plays essential roles in protecting own cells against reactive oxygen species (ROS). PRXs have been identified from mammals, fungi and higher plants. However, knowledge on cyanobacterial PRXs still remains obscure. With the availability of 37 sequenced cyanobacterial genomes, we performed a comprehensive comparative analysis of PRXs and explored their diversity, distribution, domain structure and evolution. Results Overall 244 putative prx genes were identified, which were abundant in filamentous diazotrophic cyanobacteria, Acaryochloris marina MBIC 11017, and unicellular cyanobacteria inhabiting freshwater and hot-springs, while poor in all Prochlorococcus and marine Synechococcus strains. Among these putative genes, 25 open reading frames (ORFs) encoding hypothetical proteins were identified as prx gene family members and the others were already annotated as prx genes. All 244 putative PRXs were classified into five major subfamilies (1-Cys, 2-Cys, BCP, PRX5_like, and PRX-like) according to their domain structures. The catalytic motifs of the cyanobacterial PRXs were similar to those of eukaryotic PRXs and highly conserved in all but the PRX-like subfamily. Classical motif (CXXC) of thioredoxin was detected in protein sequences from the PRX-like subfamily. Phylogenetic tree constructed of catalytic domains coincided well with the domain structures of PRXs and the phylogenies based on 16s rRNA. Conclusions The distribution of genes encoding PRXs in different unicellular and filamentous cyanobacteria especially those sub-families like PRX-like or 1-Cys PRX correlate with the genome size, eco-physiology, and physiological properties of the organisms. Cyanobacterial and eukaryotic PRXs share similar conserved motifs, indicating that cyanobacteria adopt similar catalytic mechanisms as eukaryotes. All cyanobacterial PRX proteins share highly similar structures, implying that these genes may originate from a common ancestor. In this study, a general framework of the sequence-structure-function connections of the PRXs was revealed, which may facilitate functional investigations of PRXs in various organisms. PMID:23157370
Comparative analysis of chloroplast genomes of the genus Citrus and its close relatives.
Liu, Xiaogang; Wu, Hongkun; Luo, Yan; Xi, Wanpeng; Zhou, Zhiqin
2017-01-01
The genus Citrus and its close relatives are economically and nutritionally important fruit trees. However, the huge controversy over the phylogeny of key wild species, as well as the genetic relationship between the cultivated species and their putative wild progenitors, remains unresolved. Comparative analyses of chloroplast (cp) genomes have been useful in resolving various phylogenetic issues. Thus far, the cp genomes of only two Citrus species have been sequenced. In this study, we sequenced six complete cp genomes, four belonging to the genus Citrus, and two belonging to the genera Fortunella and Poncirus, respectively. These newly sequenced genomes together with the two publicly available were used for comparative analyses of the genus Citrus and its close relatives. All eight cp genomes share similar basic structure, gene order and gene content. Phylogenetic analyses supported the monophyly of the three genera in the order Sapindales within the major clade Malvidae.
Major clades of Agaricales: a multilocus phylogenetic overview.
P. Brandon Matheny; Judd M. Curtis; Valerie Hofstetter; M. Catherine Aime; Jean-Marc Moncalvo; Zai-Wei Ge; Zhu-Liang Yang; Joseph F. Ammirati; Timothy J. Baroni; Neale L. Bougher; Karen W. Lodge Hughes; Richard W. Kerrigan; Michelle T. Seidl; Aanen; Matthew Duur K. DeNitis; Graciela M. Daniele; Dennis E. Desjardin; Bradley R. Kropp; Lorelei L. Norvell; Andrew Parker; Else C. Vellinga; Rytas Vilgalys; David S. Hibbett
2006-01-01
An overview of the phylogeny of the Agaricales is presented based on a multilocus analysis of a six-gene region supermatrix. Bayesian analyses of 5611 nucleotide characters of rpb1, rpb1-intron 2, rpb2 and 18S, 25S, and 5.8S ribosomal RNA genes recovered six major clades, which are recognized informally and labeled the Agaricoid, Tricholomatoid, Marasmioid, Pluteoid,...
Amy L. Ross-Davis; John W. Hanna; Mee-Sook Kim; Ned B. Klopfenstein
2012-01-01
The translation elongation factor-1 alpha gene was used to examine the phylogenetic relationships among 30 previously characterized isolates representing ten North American Armillaria species: A. solidipes (=A. ostoyae), A. gemina, A. calvescens, A. sinapina, A. mellea, A. gallica, A. nabsnona, North American biological species X, A. cepistipes, and A. tabescens. The...
USDA-ARS?s Scientific Manuscript database
This study is focused on the characterization and expression of genes in the red flour beetle, Tribolium castaneum, encoding proteins that possess six-cysteine-containing chitin-binding domains (CBDs) related to the peritrophin A domain (ChtBD2). An exhaustive bioinformatics search of the genome of...
Murray, Gemma G. R.; Weinert, Lucy A.; Rhule, Emma L.; Welch, John J.
2016-01-01
Rickettsia is a genus of intracellular bacteria whose hosts and transmission strategies are both impressively diverse, and this is reflected in a highly dynamic genome. Some previous studies have described the evolutionary history of Rickettsia as non-tree-like, due to incongruity between phylogenetic reconstructions using different portions of the genome. Here, we reconstruct the Rickettsia phylogeny using whole-genome data, including two new genomes from previously unsampled host groups. We find that a single topology, which is supported by multiple sources of phylogenetic signal, well describes the evolutionary history of the core genome. We do observe extensive incongruence between individual gene trees, but analyses of simulations over a single topology and interspersed partitions of sites show that this is more plausibly attributed to systematic error than to horizontal gene transfer. Some conflicting placements also result from phylogenetic analyses of accessory genome content (i.e., gene presence/absence), but we argue that these are also due to systematic error, stemming from convergent genome reduction, which cannot be accommodated by existing phylogenetic methods. Our results show that, even within a single genus, tests for gene exchange based on phylogenetic incongruence may be susceptible to false positives. PMID:26559010
Thollesson, M.
1999-01-01
The phylogeny of Euthyneura is analysed by using DNA sequences of the mitochondrial 16S rRNA gene. Despite the common notion that this gene is too variable to provide useful information at high taxonomic levels, such as in the present study, bootstrap proportions are high for several clades in the study. This indicates that there is a useful amount of variation despite the noise due to multiple substitutions. The analyses furthermore indicate that (i) Gymnosomata (represented by Clione) is not a part of Euthyneura, but Clione forms a clade with the caenogastropods; (ii) Acteon is the sister group to the remaining euthyneuran taxa in the study; (iii) the nudibranch taxa form two clades, one comprising Dendronotoidea, Arminoidea and Aeolidoidea (together Cladobranchia) with Notaspidea (represented by Berthella) as sister group, while the fourth nudibranch taxon, Doridoidea, forms a separate clade; (iv) Cephalaspidea s.s. and Anaspidea form clades that are each other's sister groups (together Pleurocoela). Finally, there is no clade present in the analyses corresponding to the taxon Opisthobranchia in the traditional sense, and the use of this name is probably better abandoned altogether.
Detection and characterization of Pasteuria 16S rRNA gene sequences from nematodes and soils.
Duan, Y P; Castro, H F; Hewlett, T E; White, J H; Ogram, A V
2003-01-01
Various bacterial species in the genus Pasteuria have great potential as biocontrol agents against plant-parasitic nematodes, although study of this important genus is hampered by the current inability to cultivate Pasteuria species outside their host. To aid in the study of this genus, an extensive 16S rRNA gene sequence phylogeny was constructed and this information was used to develop cultivation-independent methods for detection of Pasteuria in soils and nematodes. Thirty new clones of Pasteuria 16S rRNA genes were obtained directly from nematodes and soil samples. These were sequenced and used to construct an extensive phylogeny of this genus. These sequences were divided into two deeply branching clades within the low-G + C, Gram-positive division; some sequences appear to represent novel species within the genus Pasteuria. In addition, a surprising degree of 16S rRNA gene sequence diversity was observed within what had previously been designated a single strain of Pasteuria penetrans (P-20). PCR primers specific to Pasteuria 16S rRNA for detection of Pasteuria in soils were also designed and evaluated. Detection limits for soil DNA were 100-10,000 Pasteuria endospores (g soil)(-1).
Modeling adaptive kernels from probabilistic phylogenetic trees.
Nicotra, Luca; Micheli, Alessio
2009-01-01
Modeling phylogenetic interactions is an open issue in many computational biology problems. In the context of gene function prediction we introduce a class of kernels for structured data leveraging on a hierarchical probabilistic modeling of phylogeny among species. We derive three kernels belonging to this setting: a sufficient statistics kernel, a Fisher kernel, and a probability product kernel. The new kernels are used in the context of support vector machine learning. The kernels adaptivity is obtained through the estimation of the parameters of a tree structured model of evolution using as observed data phylogenetic profiles encoding the presence or absence of specific genes in a set of fully sequenced genomes. We report results obtained in the prediction of the functional class of the proteins of the budding yeast Saccharomyces cerevisae which favorably compare to a standard vector based kernel and to a non-adaptive tree kernel function. A further comparative analysis is performed in order to assess the impact of the different components of the proposed approach. We show that the key features of the proposed kernels are the adaptivity to the input domain and the ability to deal with structured data interpreted through a graphical model representation.
Chen, Jing Yu; Gu, Jun; Wang, En Tao; Ma, Xing Xian; Kang, Shi Tong; Huang, Ling Zi; Cao, Xue Ping; Li, Liang Bing; Wu, Yan Ling
2014-10-01
Aiming at learning the microsymbionts of Arachis duranensis, a diploid ancestor of cultivated peanut, genetic and symbiotic characterization of 32 isolates from root nodules of this plant grown in its new habitat Guangzhou was performed. Based upon the phylogeny of 16S rRNA, atpD and recA genes, diverse bacteria belonging to Bradyrhizobium yuanmingense, Bradyrhizobium elkanii, Bradyrhizobium iriomotense and four new lineages of Bradyrhizobium (19 isolates), Rhizobium/Agrobacterium (9 isolates), Herbaspirillum (2 isolates) and Burkholderia (2 isolates) were defined. In the nodulation test on peanut, only the bradyrhizobial strains were able to induce effective nodules. Phylogeny of nodC divided the Bradyrhizobium isolates into four lineages corresponding to the grouping results in phylogenetic analysis of housekeeping genes, suggesting that this symbiosis gene was mainly maintained by vertical gene transfer. These results demonstrate that A. duranensis is a promiscuous host preferred the Bradyrhizobium species with different symbiotic gene background as microsymbionts, and that it might have selected some native rhizobia, especially the novel lineages Bradyrhizobium sp. I and sp. II, in its new habitat Guangzhou. These findings formed a basis for further study on adaptation and evolution of symbiosis between the introduced legumes and the indigenous rhizobia. Copyright © 2014 Elsevier GmbH. All rights reserved.
Revised phylogeny of the Cellulose Synthase gene superfamily: insights into cell wall evolution.
Little, Alan; Schwerdt, Julian G; Shirley, Neil J; Khor, Shi F; Neumann, Kylie; O'Donovan, Lisa A; Lahnstein, Jelle; Collins, Helen M; Henderson, Marilyn; Fincher, Geoffrey B; Burton, Rachel A
2018-05-20
Cell walls are crucial for the integrity and function of all land plants, and are of central importance in human health, livestock production, and as a source of renewable bioenergy. Many enzymes that mediate the biosynthesis of cell wall polysaccharides are encoded by members of the large cellulose synthase (CesA) gene superfamily. Here, we analyzed 29 sequenced genomes and 17 transcriptomes to revise the phylogeny of the CesA gene superfamily in angiosperms. Our results identify ancestral gene clusters that predate the monocot-eudicot divergence and reveal several novel evolutionary observations, including the expansion of the Poaceae-specific cellulose synthase-like CslF family to the graminids and restiids and the characterisation of a previously unreported eudicot lineage, CslM, that forms a reciprocally monophyletic eudicot-monocot grouping with the CslJ clade. The CslM lineage is widely distributed in eudicots, and the CslJ clade, which was previously thought to be restricted to the Poales, is widely distributed in monocots. Our analyses show that some members of the CslJ lineage, but not the newly identified CslM genes, are capable of directing (1,3;1,4)-β-glucan biosynthesis, which, contrary to current dogma, is not restricted to Poaceae. {copyright, serif} 2018 American Society of Plant Biologists. All rights reserved.
Schorn, Michelle A; Alanjary, Mohammad M; Aguinaldo, Kristen; Korobeynikov, Anton; Podell, Sheila; Patin, Nastassia; Lincecum, Tommie; Jensen, Paul R; Ziemert, Nadine; Moore, Bradley S
2016-12-01
Traditional natural product discovery methods have nearly exhausted the accessible diversity of microbial chemicals, making new sources and techniques paramount in the search for new molecules. Marine actinomycete bacteria have recently come into the spotlight as fruitful producers of structurally diverse secondary metabolites, and remain relatively untapped. In this study, we sequenced 21 marine-derived actinomycete strains, rarely studied for their secondary metabolite potential and under-represented in current genomic databases. We found that genome size and phylogeny were good predictors of biosynthetic gene cluster diversity, with larger genomes rivalling the well-known marine producers in the Streptomyces and Salinispora genera. Genomes in the Micrococcineae suborder, however, had consistently the lowest number of biosynthetic gene clusters. By networking individual gene clusters into gene cluster families, we were able to computationally estimate the degree of novelty each genus contributed to the current sequence databases. Based on the similarity measures between all actinobacteria in the Joint Genome Institute's Atlas of Biosynthetic gene Clusters database, rare marine genera show a high degree of novelty and diversity, with Corynebacterium, Gordonia, Nocardiopsis, Saccharomonospora and Pseudonocardia genera representing the highest gene cluster diversity. This research validates that rare marine actinomycetes are important candidates for exploration, as they are relatively unstudied, and their relatives are historically rich in secondary metabolites.
Schorn, Michelle A.; Alanjary, Mohammad M.; Aguinaldo, Kristen; Korobeynikov, Anton; Podell, Sheila; Patin, Nastassia; Lincecum, Tommie; Jensen, Paul R.; Ziemert, Nadine
2016-01-01
Traditional natural product discovery methods have nearly exhausted the accessible diversity of microbial chemicals, making new sources and techniques paramount in the search for new molecules. Marine actinomycete bacteria have recently come into the spotlight as fruitful producers of structurally diverse secondary metabolites, and remain relatively untapped. In this study, we sequenced 21 marine-derived actinomycete strains, rarely studied for their secondary metabolite potential and under-represented in current genomic databases. We found that genome size and phylogeny were good predictors of biosynthetic gene cluster diversity, with larger genomes rivalling the well-known marine producers in the Streptomyces and Salinispora genera. Genomes in the Micrococcineae suborder, however, had consistently the lowest number of biosynthetic gene clusters. By networking individual gene clusters into gene cluster families, we were able to computationally estimate the degree of novelty each genus contributed to the current sequence databases. Based on the similarity measures between all actinobacteria in the Joint Genome Institute's Atlas of Biosynthetic gene Clusters database, rare marine genera show a high degree of novelty and diversity, with Corynebacterium, Gordonia, Nocardiopsis, Saccharomonospora and Pseudonocardia genera representing the highest gene cluster diversity. This research validates that rare marine actinomycetes are important candidates for exploration, as they are relatively unstudied, and their relatives are historically rich in secondary metabolites. PMID:27902408
Chung, Won-Hyong; Jeong, Namhee; Kim, Jiwoong; Lee, Woo Kyu; Lee, Yun-Gyeong; Lee, Sang-Heon; Yoon, Woongchang; Kim, Jin-Hyun; Choi, Ik-Young; Choi, Hong-Kyu; Moon, Jung-Kyung; Kim, Namshin; Jeong, Soon-Chun
2014-01-01
Despite the importance of soybean as a major crop, genome-wide variation and evolution of cultivated soybeans are largely unknown. Here, we catalogued genome variation in an annual soybean population by high-depth resequencing of 10 cultivated and 6 wild accessions and obtained 3.87 million high-quality single-nucleotide polymorphisms (SNPs) after excluding the sites with missing data in any accession. Nuclear genome phylogeny supported a single origin for the cultivated soybeans. We identified 10-fold longer linkage disequilibrium (LD) in the wild soybean relative to wild maize and rice. Despite the small population size, the long LD and large SNP data allowed us to identify 206 candidate domestication regions with significantly lower diversity in the cultivated, but not in the wild, soybeans. Some of the genes in these candidate regions were associated with soybean homologues of canonical domestication genes. However, several examples, which are likely specific to soybean or eudicot crop plants, were also observed. Consequently, the variation data identified in this study should be valuable for breeding and for identifying agronomically important genes in soybeans. However, the long LD of wild soybeans may hinder pinpointing causal gene(s) in the candidate regions. PMID:24271940
Parks, Matthew B; Wickett, Norman J; Alverson, Andrew J
2018-01-01
Abstract Diatoms (Bacillariophyta) are a species-rich group of eukaryotic microbes diverse in morphology, ecology, and metabolism. Previous reconstructions of the diatom phylogeny based on one or a few genes have resulted in inconsistent resolution or low support for critical nodes. We applied phylogenetic paralog pruning techniques to a data set of 94 diatom genomes and transcriptomes to infer perennially difficult species relationships, using concatenation and summary-coalescent methods to reconstruct species trees from data sets spanning a wide range of thresholds for taxon and column occupancy in gene alignments. Conflicts between gene and species trees decreased with both increasing taxon occupancy and bootstrap cutoffs applied to gene trees. Concordance between gene and species trees was lowest for short internodes and increased logarithmically with increasing edge length, suggesting that incomplete lineage sorting disproportionately affects species tree inference at short internodes, which are a common feature of the diatom phylogeny. Although species tree topologies were largely consistent across many data treatments, concatenation methods appeared to outperform summary-coalescent methods for sparse alignments. Our results underscore that approaches to species-tree inference based on few loci are likely to be misled by unrepresentative sampling of gene histories, particularly in lineages that may have diversified rapidly. In addition, phylogenomic studies of diatoms, and potentially other hyperdiverse groups, should maximize the number of gene trees with high taxon occupancy, though there is clearly a limit to how many of these genes will be available. PMID:29040712
DOE Office of Scientific and Technical Information (OSTI.GOV)
Medina, Monica; Collins, Timothy M.; Walsh, Patrick J.
2000-08-10
Sea hares within the genus Aplysia are important neurobiological model organisms, and as studies based on different Aplysia species appear in the literature, a phylogenetic framework has become essential. We present a phylogenetic hypothesis for this genus, based on portions of two mitochondrial genes (12S and 16S). In addition, we reconstruct the evolution of several behavioral characters of interest to neurobiologists in order to illustrate the potential benefits of a phylogeny for the genus Aplysia. These benefits include the determination of ancestral traits, the direction and timing of evolution of characters, prediction of the distribution of traits, and identification ofmore » cases of independent acquisition of traits within lineages. This last benefit may prove especially useful in understanding the linkage between behaviors and their underlying neurological basis.« less
Phylogeny and Phylogeography of Rhizobial Symbionts Nodulating Legumes of the Tribe Genisteae
Stępkowski, Tomasz; Banasiewicz, Joanna; Granada, Camille E.; Andrews, Mitchell; Passaglia, Luciane M. P.
2018-01-01
The legume tribe Genisteae comprises 618, predominantly temperate species, showing an amphi-Atlantic distribution that was caused by several long-distance dispersal events. Seven out of the 16 authenticated rhizobial genera can nodulate particular Genisteae species. Bradyrhizobium predominates among rhizobia nodulating Genisteae legumes. Bradyrhizobium strains that infect Genisteae species belong to both the Bradyrhizobium japonicum and Bradyrhizobium elkanii superclades. In symbiotic gene phylogenies, Genisteae bradyrhizobia are scattered among several distinct clades, comprising strains that originate from phylogenetically distant legumes. This indicates that the capacity for nodulation of Genisteae spp. has evolved independently in various symbiotic gene clades, and that it has not been a long-multi-step process. The exception is Bradyrhizobium Clade II, which unlike other clades comprises strains that are specialized in nodulation of Genisteae, but also Loteae spp. Presumably, Clade II represents an example of long-lasting co-evolution of bradyrhizobial symbionts with their legume hosts. PMID:29538303
Isolation with Migration Models for More Than Two Populations
Hey, Jody
2010-01-01
A method for studying the divergence of multiple closely related populations is described and assessed. The approach of Hey and Nielsen (2007, Integration within the Felsenstein equation for improved Markov chain Monte Carlo methods in population genetics. Proc Natl Acad Sci USA. 104:2785–2790) for fitting an isolation-with-migration model was extended to the case of multiple populations with a known phylogeny. Analysis of simulated data sets reveals the kinds of history that are accessible with a multipopulation analysis. Necessarily, processes associated with older time periods in a phylogeny are more difficult to estimate; and histories with high levels of gene flow are particularly difficult with more than two populations. However, for histories with modest levels of gene flow, or for very large data sets, it is possible to study large complex divergence problems that involve multiple closely related populations or species. PMID:19955477
Isolation with migration models for more than two populations.
Hey, Jody
2010-04-01
A method for studying the divergence of multiple closely related populations is described and assessed. The approach of Hey and Nielsen (2007, Integration within the Felsenstein equation for improved Markov chain Monte Carlo methods in population genetics. Proc Natl Acad Sci USA. 104:2785-2790) for fitting an isolation-with-migration model was extended to the case of multiple populations with a known phylogeny. Analysis of simulated data sets reveals the kinds of history that are accessible with a multipopulation analysis. Necessarily, processes associated with older time periods in a phylogeny are more difficult to estimate; and histories with high levels of gene flow are particularly difficult with more than two populations. However, for histories with modest levels of gene flow, or for very large data sets, it is possible to study large complex divergence problems that involve multiple closely related populations or species.
Benítez-Burraco, A
FOXP2 is the first gene linked to a hereditary variant of specific language impairment and seems to code for a transcriptional repressor that intervenes in the regulation of the development and the functioning of certain thalamic-cortical-striatal circuits. In the last three years, significant progress has been made in the determination of the structural and functional properties of the gene. These advances essentially have to do with the precise analysis of the most important structural motifs of the protein that it codes for and the main parameters that determine its interaction with DNA. They also concern the determination of the functional and behavioural properties in vivo of the main isoforms of the FOXP2 protein, the exact determination of the pattern of expression of new orthologues of the gene, and the identification of the different target genes for factor FOXP2. This new evidence suggests that protein FOXP2 protein has a high degree of versatility in vivo when it comes to binding to DNA; that its different isoforms are biologically functional; and that the FOXP2 gene is functional during embryonic development and during the adult phase. It also suggests that it is involved in the development and/or functioning of the thalamic-cortical-striatal circuits associated to motor planning, sequential behaviour and procedural learning (a significant saving in developmental terms of the regulatory mechanism in which the gene is involved), as well as the accuracy of the models of linguistic processing that consider language to be, to a large extent, the result of an interaction between certain cortical and subcortical structures.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhengqiu, C.; Penaflor, C.; Kuehl, J.V.
2006-06-01
The magnoliids represent the largest basal angiosperm clade with four orders, 19 families and 8,500 species. Although several recent angiosperm molecular phylogenies have supported the monophyly of magnoliids and suggested relationships among the orders, the limited number of genes examined resulted in only weak support, and these issues remain controversial. Furthermore, considerable incongruence has resulted in phylogenies supporting three different sets of relationships among magnoliids and the two large angiosperm clades, monocots and eudicots. This is one of the most important remaining issues concerning relationships among basal angiosperms. We sequenced the chloroplast genomes of three magnoliids, Drimys (Canellales), Liriodendron (Magnoliales),more » and Piper (Piperales), and used these data in combination with 32 other completed angiosperm chloroplast genomes to assess phylogenetic relationships among magnoliids. The Drimys and Piper chloroplast genomes are nearly identical in size at 160,606 and 160,624 bp, respectively. The genomes include a pair of inverted repeats of 26,649 bp (Drimys) and 27,039 (Piper), separated by a small single copy region of 18,621 (Drimys) and 18,878 (Piper) and a large single copy region of 88,685 bp (Drimys) and 87,666 bp (Piper). The gene order of both taxa is nearly identical to many other unrearranged angiosperm chloroplast genomes, including Calycanthus, the other published magnoliid genome. Comparisons of angiosperm chloroplast genomes indicate that GC content is not uniformly distributed across the genome. Overall GC content ranges from 34-39%, and coding regions have a substantially higher GC content than non-coding regions (both intergenic spacers and introns). Among protein-coding genes, GC content varies by codon position with 1st codon > 2nd codon > 3rd codon, and it varies by functional group with photosynthetic genes having the highest percentage and NADH genes the lowest. Across the genome, GC content is highest in the inverted repeat due to the presence of rRNA genes and lowest in the small single copy region where most NADH genes are located. Phylogenetic analyses using maximum parsimony and maximum likelihood methods were performed on DNA sequences of 61 protein-coding genes. Trees from both analyses provided strong support for the monophyly of magnoliids and two strongly supported groups were identified, the Canellales/Piperales and the Laurales/Magnoliales. The phylogenies also provided moderate to strong support for the basal position of Amborella, and a sister relationship of magnoliids to a clade that includes monocots and eudicots. The complete sequences of three magnoliid chloroplast genomes provide new data from the largest basal angiosperm clade. Evolutionary comparisons of these new genome sequences, combined with other published angiosperm genome, confirm that GC content is unevenly distributed across the genome by location, codon position, and functional group. Furthermore, phylogenetic analyses provide the strongest support so far for the hypothesis that the magnoliids are sister to a large clade that includes both monocots and eudicots.« less
Wang, Houshuai; Fan, Xiaoling; Owada, Mamoru; Wang, Min; Nylin, Sören
2014-01-01
The genus Panolis is a small group of noctuid moths with six recognized species distributed from Europe to East Asia, and best known for containing the widespread Palearctic pest species P. flammea, the pine beauty moth. However, a reliable classification and robust phylogenetic framework for this group of potentially economic importance are currently lacking. Here, we use morphological and molecular data (mitochondrial genes cytochrome c oxidase subunit I and 16S ribosomal RNA, nuclear gene elongation factor-1 alpha) to reconstruct the phylogeny of this genus, with a comprehensive systematic revision of all recognized species and a new one, P. ningshan sp. nov. The analysis results of maximum parsimony, maximum likelihood and Bayesian inferring methods for the combined morphological and molecular data sets are highly congruent, resulting in a robust phylogeny and identification of two clear species groups, i.e., the P. flammea species group and the P. exquisita species group. We also estimate the divergence times of Panolis moths using two conventional mutation rates for the arthropod mitochondrial COI gene with a comparison of two molecular clock models, as well as reconstruct their ancestral areas. Our results suggest that 1) Panolis is a young clade, originating from the Oriental region in China in the Late Miocene (6–10Mya), with an ancestral species in the P. flammea group extending northward to the Palearctic region some 3–6 Mya; 2) there is a clear possibility for a representative of the Palearctic clade to become established as an invasive species in the Nearctic taiga. PMID:24603596
Imhoff, Johannes F.; Rahn, Tanja; Künzel, Sven; Neulinger, Sven C.
2018-01-01
Two different photosystems for performing bacteriochlorophyll-mediated photosynthetic energy conversion are employed in different bacterial phyla. Those bacteria employing a photosystem II type of photosynthetic apparatus include the phototrophic purple bacteria (Proteobacteria), Gemmatimonas and Chloroflexus with their photosynthetic relatives. The proteins of the photosynthetic reaction center PufL and PufM are essential components and are common to all bacteria with a type-II photosynthetic apparatus, including the anaerobic as well as the aerobic phototrophic Proteobacteria. Therefore, PufL and PufM proteins and their genes are perfect tools to evaluate the phylogeny of the photosynthetic apparatus and to study the diversity of the bacteria employing this photosystem in nature. Almost complete pufLM gene sequences and the derived protein sequences from 152 type strains and 45 additional strains of phototrophic Proteobacteria employing photosystem II were compared. The results give interesting and comprehensive insights into the phylogeny of the photosynthetic apparatus and clearly define Chromatiales, Rhodobacterales, Sphingomonadales as major groups distinct from other Alphaproteobacteria, from Betaproteobacteria and from Caulobacterales (Brevundimonas subvibrioides). A special relationship exists between the PufLM sequences of those bacteria employing bacteriochlorophyll b instead of bacteriochlorophyll a. A clear phylogenetic association of aerobic phototrophic purple bacteria to anaerobic purple bacteria according to their PufLM sequences is demonstrated indicating multiple evolutionary lines from anaerobic to aerobic phototrophic purple bacteria. The impact of pufLM gene sequences for studies on the environmental diversity of phototrophic bacteria is discussed and the possibility of their identification on the species level in environmental samples is pointed out. PMID:29472894
Evolution, phylogeny, and molecular epidemiology of Chlamydia.
Nunes, Alexandra; Gomes, João P
2014-04-01
The Chlamydiaceae are a family of obligate intracellular bacteria characterized by a unique biphasic developmental cycle. It encompasses the single genus Chlamydia, which involves nine species that affect a wide range of vertebral hosts, causing infections with serious impact on human health (mainly due to Chlamydia trachomatis infections) and on farming and veterinary industries. It is believed that Chlamydiales originated ∼700mya, whereas C. trachomatis likely split from the other Chlamydiaceae during the last 6mya. This corresponds to the emergence of modern human lineages, with the first descriptions of chlamydial infections as ancient as four millennia. Chlamydiaceae have undergone a massive genome reduction, on behalf of the deletional bias "use it or lose it", stabilizing at 1-1.2Mb and keeping a striking genome synteny. Their phylogeny reveals species segregation according to biological properties, with huge differences in terms of host range, tissue tropism, and disease outcomes. Genome differences rely on the occurrence of mutations in the >700 orthologous genes, as well as on events of recombination, gene loss, inversion, and paralogous expansion, affecting both a hypervariable region named the plasticity zone, and genes essentially encoding polymorphic and transmembrane head membrane proteins, type III secretion effectors and some metabolic pathways. Procedures for molecular typing are still not consensual but have allowed the knowledge of molecular epidemiology patterns for some species as well as the identification of outbreaks and emergence of successful clones for C. trachomatis. This manuscript intends to provide a comprehensive review on the evolution, phylogeny, and molecular epidemiology of Chlamydia. Copyright © 2014 Elsevier B.V. All rights reserved.
Liu, Jun; Liu, Helu; Zhang, Haibin
2018-04-22
The marine mussels (Mytilidae) are distributed in the oceans worldwide and occupy various habitats with diverse life styles. However, their taxonomy and phylogeny remain unclear from genus to family level due to equivocal morphological and anatomical characters among some taxa. In this study, we inferred the deep phylogenetic relationships among 42 mytiloid species, 19 genera, and five subfamilies of the extant marine mussels by using two mitochondrial (COI and 16S rRNA) and three nuclear (18S and 28S rRNA, and histone H3) genes. Phylogeny was reconstructed with a combination of five genes using Bayesian inference and maximum likelihood method, and divergence time was estimated for the major nodes using a relaxed clock model with three fossil calibrations. Phylogenetic trees revealed two major clades (Clades 1 and 2). In Clade 1, the deep-sea mussels (subfamily Bathymodiolinae) were sister to subfamily Modiolinae (represented by Modiolus), and then was clustered with Leiosolenus (subfamily Lithophaginae). Clade 2 comprised Lithophaga (Lithophaginae) and subfamily Mytilinae. Additionally, a Modiolus species and Musculus senhousia (subfamily Crenellinae) were positioned within the subfamily Mytilinae. The phylogenetic results strongly indicated monophyly of Mytilidae and Bathymodiolinae, polyphyly of Modiolinae and Lithophaginae, and paraphyly of Mytilinae. Divergence time estimation showed an ancient and gradual divergence in most mussel groups, whereas the deep-sea mussels originated recently and diverged rapidly during the Paleogene. The present study provides new insight into the evolutionary history of the marine mussels, and supports taxonomic revision for this important bivalve group. Copyright © 2018 Elsevier Inc. All rights reserved.
de Oliveira Ceita, Geruza; Vilas-Boas, Laurival Antônio; Castilho, Marcelo Santos; Carazzolle, Marcelo Falsarella; Pirovani, Carlos Priminho; Selbach-Schnadelbach, Alessandra; Gramacho, Karina Peres; Ramos, Pablo Ivan Pereira; Barbosa, Luciana Veiga; Pereira, Gonçalo Amarante Guimarães; Góes-Neto, Aristóteles
2014-10-01
The phytopathogenic fungus Moniliophthora perniciosa (Stahel) Aime & Philips-Mora, causal agent of witches' broom disease of cocoa, causes countless damage to cocoa production in Brazil. Molecular studies have attempted to identify genes that play important roles in fungal survival and virulence. In this study, sequences deposited in the M. perniciosa Genome Sequencing Project database were analyzed to identify potential biological targets. For the first time, the ergosterol biosynthetic pathway in M. perniciosa was studied and the lanosterol 14α-demethylase gene (ERG11) that encodes the main enzyme of this pathway and is a target for fungicides was cloned, characterized molecularly and its phylogeny analyzed. ERG11 genomic DNA and cDNA were characterized and sequence analysis of the ERG11 protein identified highly conserved domains typical of this enzyme, such as SRS1, SRS4, EXXR and the heme-binding region (HBR). Comparison of the protein sequences and phylogenetic analysis revealed that the M. perniciosa enzyme was most closely related to that of Coprinopsis cinerea.
de Oliveira Ceita, Geruza; Vilas-Boas, Laurival Antônio; Castilho, Marcelo Santos; Carazzolle, Marcelo Falsarella; Pirovani, Carlos Priminho; Selbach-Schnadelbach, Alessandra; Gramacho, Karina Peres; Ramos, Pablo Ivan Pereira; Barbosa, Luciana Veiga; Pereira, Gonçalo Amarante Guimarães; Góes-Neto, Aristóteles
2014-01-01
The phytopathogenic fungus Moniliophthora perniciosa (Stahel) Aime & Philips-Mora, causal agent of witches’ broom disease of cocoa, causes countless damage to cocoa production in Brazil. Molecular studies have attempted to identify genes that play important roles in fungal survival and virulence. In this study, sequences deposited in the M. perniciosa Genome Sequencing Project database were analyzed to identify potential biological targets. For the first time, the ergosterol biosynthetic pathway in M. perniciosa was studied and the lanosterol 14α-demethylase gene (ERG11) that encodes the main enzyme of this pathway and is a target for fungicides was cloned, characterized molecularly and its phylogeny analyzed. ERG11 genomic DNA and cDNA were characterized and sequence analysis of the ERG11 protein identified highly conserved domains typical of this enzyme, such as SRS1, SRS4, EXXR and the heme-binding region (HBR). Comparison of the protein sequences and phylogenetic analysis revealed that the M. perniciosa enzyme was most closely related to that of Coprinopsis cinerea. PMID:25505843
Hartl, Daniel L.
2008-01-01
Simple models of molecular evolution assume that sequences evolve by a Poisson process in which nucleotide or amino acid substitutions occur as rare independent events. In these models, the expected ratio of the variance to the mean of substitution counts equals 1, and substitution processes with a ratio greater than 1 are called overdispersed. Comparing the genomes of 10 closely related species of Drosophila, we extend earlier evidence for overdispersion in amino acid replacements as well as in four-fold synonymous substitutions. The observed deviation from the Poisson expectation can be described as a linear function of the rate at which substitutions occur on a phylogeny, which implies that deviations from the Poisson expectation arise from gene-specific temporal variation in substitution rates. Amino acid sequences show greater temporal variation in substitution rates than do four-fold synonymous sequences. Our findings provide a general phenomenological framework for understanding overdispersion in the molecular clock. Also, the presence of substantial variation in gene-specific substitution rates has broad implications for work in phylogeny reconstruction and evolutionary rate estimation. PMID:18480070
Brucker, Robert M; Bordenstein, Seth R
2012-02-01
The comparative structure of bacterial communities among closely related host species remains relatively unexplored. For instance, as speciation events progress from incipient to complete stages, does divergence in the composition of the species' microbial communities parallel the divergence of host nuclear genes? To address this question, we used the recently diverged species of the parasitoid wasp genus Nasonia to test whether the evolutionary relationships of their bacterial microbiotas recapitulate the Nasonia phylogenetic history. We also assessed microbial diversity in Nasonia at different stages of development to determine the role that host age plays in microbiota structure. The results indicate that all three species of Nasonia share simple larval microbiotas dominated by the γ-proteobacteria class; however, bacterial species diversity increases as Nasonia develop into pupae and adults. Finally, under identical environmental conditions, the relationships of the microbial communities reflect the phylogeny of the Nasonia host species at multiple developmental stages, which suggests that the structure of an animal's microbial community is closely allied with divergence of host genes. These findings highlight the importance of host evolutionary relationships on microbiota composition and have broad implications for future studies of microbial symbiosis and animal speciation. © 2011 The Author(s). Evolution© 2011 The Society for the Study of Evolution.
The Evolution of SINEs and LINEs in the genus Chironomus (Diptera).
Papusheva, Ekaterina; Gruhl, Mary C; Berezikov, Eugene; Groudieva, Tatiana; Scherbik, Svetlana V; Martin, Jon; Blinov, Alexander; Bergtrom, Gerald
2004-03-01
Genomic DNA amplification from 51 species of the family Chironomidae shows that most contain relatives of NLRCth1 LINE and CTRT1 SINE retrotransposons first found in Chironomus thummi. More than 300 cloned PCR products were sequenced. The amplified region of the reverse transcriptase gene in the LINEs is intact and highly conserved, suggesting active elements. The SINEs are less conserved, consistent with minimal/no selection after transposition. A mitochondrial gene phylogeny resolves the Chironomus genus into six lineages (Guryev et al. 2001). LINE and SINE phylogenies resolve five of these lineages, indicating their monophyletic origin and vertical inheritance. However, both the LINE and the SINE tree topologies differ from the species phylogeny, resolving the elements into "clusters I-IV" and "cluster V" families. The data suggest a descent of all LINE and SINE subfamilies from two major families. Based on the species phylogeny, a few LINEs and a larger number of SINEs are cladisitically misplaced. Most misbranch with LINEs or SINEs from species with the same families of elements. From sequence comparisons, cladistically misplaced LINEs and several misplaced SINEs arose by convergent base substitutions. More diverged SINEs result from early transposition and some are derived from multiple source SINEs in the same species. SINEs from two species (C. dorsalis, C. pallidivittatus), expected to belong to the clusters I-IV family, branch instead with cluster V family SINEs; apparently both families predate separation of cluster V from clusters I-IV species. Correlation of the distribution of active SINEs and LINEs, as well as similar 3' sequence motifs in CTRT1 and NLRCth1, suggests coevolving retrotransposon pairs in which CTRT1 transposition depends on enzymes active during NLRCth1 LINE mobility.
Springer, Mark S.; Meredith, Robert W.; Gatesy, John; Emerling, Christopher A.; Park, Jong; Rabosky, Daniel L.; Stadler, Tanja; Steiner, Cynthia; Ryder, Oliver A.; Janečka, Jan E.; Fisher, Colleen A.; Murphy, William J.
2012-01-01
Phylogenetic relationships, divergence times, and patterns of biogeographic descent among primate species are both complex and contentious. Here, we generate a robust molecular phylogeny for 70 primate genera and 367 primate species based on a concatenation of 69 nuclear gene segments and ten mitochondrial gene sequences, most of which were extracted from GenBank. Relaxed clock analyses of divergence times with 14 fossil-calibrated nodes suggest that living Primates last shared a common ancestor 71–63 Ma, and that divergences within both Strepsirrhini and Haplorhini are entirely post-Cretaceous. These results are consistent with the hypothesis that the Cretaceous-Paleogene mass extinction of non-avian dinosaurs played an important role in the diversification of placental mammals. Previous queries into primate historical biogeography have suggested Africa, Asia, Europe, or North America as the ancestral area of crown primates, but were based on methods that were coopted from phylogeny reconstruction. By contrast, we analyzed our molecular phylogeny with two methods that were developed explicitly for ancestral area reconstruction, and find support for the hypothesis that the most recent common ancestor of living Primates resided in Asia. Analyses of primate macroevolutionary dynamics provide support for a diversification rate increase in the late Miocene, possibly in response to elevated global mean temperatures, and are consistent with the fossil record. By contrast, diversification analyses failed to detect evidence for rate-shift changes near the Eocene-Oligocene boundary even though the fossil record provides clear evidence for a major turnover event (“Grande Coupure”) at this time. Our results highlight the power and limitations of inferring diversification dynamics from molecular phylogenies, as well as the sensitivity of diversification analyses to different species concepts. PMID:23166696
Knapp, Jenny; Gottstein, Bruno; Saarma, Urmas; Millon, Laurence
2015-10-30
Alveolar echinococcosis, caused by the tapeworm Echinococcus multilocularis, is one of the most severe parasitic diseases in humans and represents one of the 17 neglected diseases prioritised by the World Health Organisation (WHO) in 2012. Considering the major medical and veterinary importance of this parasite, the phylogeny of the genus Echinococcus is of considerable importance; yet, despite numerous efforts with both mitochondrial and nuclear data, it has remained unresolved. The genus is clearly complex, and this is one of the reasons for the incomplete understanding of its taxonomy. Although taxonomic studies have recognised E. multilocularis as a separate entity from the Echinococcus granulosus complex and other members of the genus, it would be premature to draw firm conclusions about the taxonomy of the genus before the phylogeny of the whole genus is fully resolved. The recent sequencing of E. multilocularis and E. granulosus genomes opens new possibilities for performing in-depth phylogenetic analyses. In addition, whole genome data provide the possibility of inferring phylogenies based on a large number of functional genes, i.e. genes that trace the evolutionary history of adaptation in E. multilocularis and other members of the genus. Moreover, genomic data open new avenues for studying the molecular epidemiology of E. multilocularis: genotyping studies with larger panels of genetic markers allow the genetic diversity and spatial dynamics of parasites to be evaluated with greater precision. There is an urgent need for international coordination of genotyping of E. multilocularis isolates from animals and human patients. This could be fundamental for a better understanding of the transmission of alveolar echinococcosis and for designing efficient healthcare strategies. Copyright © 2015 Elsevier B.V. All rights reserved.
Janssen, Toon; Vizoso, Dita B; Schulte, Gregor; Littlewood, D Timothy J; Waeschenbach, Andrea; Schärer, Lukas
2015-11-01
The Macrostomorpha-an early branching and species-rich clade of free-living flatworms-is attracting interest because it contains Macrostomum lignano, a versatile model organism increasingly used in evolutionary, developmental, and molecular biology. We elucidate the macrostomorphan molecular phylogeny inferred from both nuclear (18S and 28S rDNA) and mitochondrial (16S rDNA and COI) marker genes from 40 representatives. Although our phylogeny does not recover the Macrostomorpha as a statistically supported monophyletic grouping, it (i) confirms many taxa previously proposed based on morphological evidence, (ii) permits the first placement of many families and genera, and (iii) reveals a number of unexpected placements. Specifically, Myozona and Bradynectes are outside the three classic families (Macrostomidae, Microstomidae and Dolichomacrostomidae) and the asexually fissioning Myomacrostomum belongs to a new subfamily, the Myozonariinae nov. subfam. (Dolichomacrostomidae), rather than diverging early. While this represents the first evidence for asexuality among the Dolichomacrostomidae, we show that fissioning also occurs in another Myozonariinae, Myozonaria fissipara nov. sp. Together with the placement of the (also fissioning) Microstomidae, namely as the sister taxon of Dolichomacrostomidae, this suggests that fissioning is not basal within the Macrostomorpha, but rather restricted to the new taxon Dolichomicrostomida (Dolichomacrostomidae+Microstomidae). Furthermore, our phylogeny allows new insights into the evolution of the reproductive system, as ancestral state reconstructions reveal convergent evolution of gonads, and male and female genitalia. Finally, the convergent evolution of sperm storage organs in the female genitalia appears to be linked to the widespread occurrence of hypodermic insemination among the Macrostomorpha. Copyright © 2015 Elsevier Inc. All rights reserved.
Chai, Wenbo; Jiang, Pengfei; Huang, Guoyu; Jiang, Haiyang; Li, Xiaoyu
2017-10-01
The TCP family is a group of plant-specific transcription factors. TCP genes encode proteins harboring bHLH structure, which is implicated in DNA binding and protein-protein interactions and known as the TCP domain. TCP genes play important roles in plant development and have been evolutionarily and functionally elaborated in various plants, however, no overall phylogenetic analysis or expression profiling of TCP genes in Zea mays has been reported. In the present study, a systematic analysis of molecular evolution and functional prediction of TCP family genes in maize ( Z . mays L.) has been conducted. We performed a genome-wide survey of TCP genes in maize, revealing the gene structure, chromosomal location and phylogenetic relationship of family members. Microsynteny between grass species and tissue-specific expression profiles were also investigated. In total, 29 TCP genes were identified in the maize genome, unevenly distributed on the 10 maize chromosomes. Additionally, ZmTCP genes were categorized into nine classes based on phylogeny and purifying selection may largely be responsible for maintaining the functions of maize TCP genes. What's more, microsynteny analysis suggested that TCP genes have been conserved during evolution. Finally, expression analysis revealed that most TCP genes are expressed in the stem and ear, which suggests that ZmTCP genes influence stem and ear growth. This result is consistent with the previous finding that maize TCP genes represses the growth of axillary organs and enables the formation of female inflorescences. Altogether, this study presents a thorough overview of TCP family in maize and provides a new perspective on the evolution of this gene family. The results also indicate that TCP family genes may be involved in development stage in plant growing conditions. Additionally, our results will be useful for further functional analysis of the TCP gene family in maize.
Bayesian phylogenetic analysis supports an agricultural origin of Japonic languages
Lee, Sean; Hasegawa, Toshikazu
2011-01-01
Languages, like genes, evolve by a process of descent with modification. This striking similarity between biological and linguistic evolution allows us to apply phylogenetic methods to explore how languages, as well as the people who speak them, are related to one another through evolutionary history. Language phylogenies constructed with lexical data have so far revealed population expansions of Austronesian, Indo-European and Bantu speakers. However, how robustly a phylogenetic approach can chart the history of language evolution and what language phylogenies reveal about human prehistory must be investigated more thoroughly on a global scale. Here we report a phylogeny of 59 Japonic languages and dialects. We used this phylogeny to estimate time depth of its root and compared it with the time suggested by an agricultural expansion scenario for Japanese origin. In agreement with the scenario, our results indicate that Japonic languages descended from a common ancestor approximately 2182 years ago. Together with archaeological and biological evidence, our results suggest that the first farmers of Japan had a profound impact on the origins of both people and languages. On a broader level, our results are consistent with a theory that agricultural expansion is the principal factor for shaping global linguistic diversity. PMID:21543358
Heavy metal resistant strains are widespread along Streptomyces phylogeny.
Alvarez, Analía; Catalano, Santiago A; Amoroso, María Julia
2013-03-01
The genus Streptomyces comprises a group of bacteria species with high economic importance. Several of these species are employed at industrial scale for the production of useful compounds. Other characteristic found in different strains within this genus is their capability to tolerate high level of substances toxic for humans, heavy metals among them. Although several studies have been conducted in different species of the genus in order to disentangle the mechanisms associated to heavy metal resistance, little is known about how they have evolved along Streptomyces phylogeny. In this study we built the largest Streptomyces phylogeny generated up to date comprising six genes, 113 species of Streptomyces and 27 outgroups. The parsimony-based phylogenetic analysis indicated that (i) Streptomyces is monophyletic and (ii) it appears as sister clade of a group formed by Kitasatospora and Streptacidiphilus species, both genera also monophyletic. Streptomyces strains resistant to heavy metals are not confined to a single lineage but widespread along Streptomyces phylogeny. Our result in combination with genomic, physiological and biochemical data suggest that the resistance to heavy metals originated several times and by different mechanisms in Streptomyces history. Copyright © 2012 Elsevier Inc. All rights reserved.
Higher-level phylogeny of paraneopteran insects inferred from mitochondrial genome sequences
Li, Hu; Shao, Renfu; Song, Nan; Song, Fan; Jiang, Pei; Li, Zhihong; Cai, Wanzhi
2015-01-01
Mitochondrial (mt) genome data have been proven to be informative for animal phylogenetic studies but may also suffer from systematic errors, due to the effects of accelerated substitution rate and compositional heterogeneity. We analyzed the mt genomes of 25 insect species from the four paraneopteran orders, aiming to better understand how accelerated substitution rate and compositional heterogeneity affect the inferences of the higher-level phylogeny of this diverse group of hemimetabolous insects. We found substantial heterogeneity in base composition and contrasting rates in nucleotide substitution among these paraneopteran insects, which complicate the inference of higher-level phylogeny. The phylogenies inferred with concatenated sequences of mt genes using maximum likelihood and Bayesian methods and homogeneous models failed to recover Psocodea and Hemiptera as monophyletic groups but grouped, instead, the taxa that had accelerated substitution rates together, including Sternorrhyncha (a suborder of Hemiptera), Thysanoptera, Phthiraptera and Liposcelididae (a family of Psocoptera). Bayesian inference with nucleotide sequences and heterogeneous models (CAT and CAT + GTR), however, recovered Psocodea, Thysanoptera and Hemiptera each as a monophyletic group. Within Psocodea, Liposcelididae is more closely related to Phthiraptera than to other species of Psocoptera. Furthermore, Thysanoptera was recovered as the sister group to Hemiptera. PMID:25704094
Brammer, Colin A; von Dohlen, Carol D
2007-05-01
Stratiomyidae is a cosmopolitan family of Brachycera (Diptera) that contains over 2800 species. This study focused on the relationships of members of the subfamily Clitellariinae, which has had a complicated taxonomic history. To investigate the monophyly of the Clitellariinae, the relationships of its genera, and the ages of Stratiomyidae lineages, representatives for all 12 subfamilies of Stratiomyidae, totaling 68 taxa, were included in a phylogenetic reconstruction. A Xylomyidae representative, Solva sp., was used as an outgroup. Sequences of EF-1alpha and 28S rRNA genes were analyzed under maximum parsimony with bootstrapping, and Bayesian methods to recover the best estimate of phylogeny. A chronogram with estimated dates for all nodes in the phylogeny was generated with the program, r8s, and divergence dates and confidence intervals were further explored with the program, multidivtime. All subfamilies of Stratiomyidae with more than one representative were found to be monophyletic, except for Stratiomyinae and Clitellariinae. Clitellariinae were distributed among five separate clades in the phylogeny, and Raphiocerinae were nested within Stratiomyinae. Dating analysis suggested an early Cretaceous origin for the common ancestor of extant Stratiomyidae, and a radiation of several major Stratiomyidae lineages in the Late Cretaceous.
Molecular Evolution of the Non-Coding Eosinophil Granule Ontogeny Transcript
Rose, Dominic; Stadler, Peter F.
2011-01-01
Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs). The evolutionary history of mlncRNAs is still largely uncharted territory. In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT), an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs). EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny, analyze patterns of sequence conservation, and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrate here for the first time that the whole EGOT locus is highly structured, containing several evolutionary conserved, and thermodynamic stable secondary structures. Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element. PMID:22303364
Li, Meiying; Ren, Licheng; Xu, Biyu; Yang, Xiaoliang; Xia, Qiyu; He, Pingping; Xiao, Susheng; Guo, Anping; Hu, Wei; Jin, Zhiqiang
2016-01-01
Plant 14-3-3 proteins act as critical components of various cellular signaling processes and play an important role in regulating multiple physiological processes. However, less information is known about the 14-3-3 gene family in banana. In this study, 25 14-3-3 genes were identified from the banana genome. Based on the evolutionary analysis, banana 14-3-3 proteins were clustered into ε and non-ε groups. Conserved motif analysis showed that all identified banana 14-3-3 genes had the typical 14-3-3 motif. The gene structure of banana 14-3-3 genes showed distinct class-specific divergence between the ε group and the non-ε group. Most banana 14-3-3 genes showed strong transcript accumulation changes during fruit development and postharvest ripening in two banana varieties, indicating that they might be involved in regulating fruit development and ripening. Moreover, some 14-3-3 genes also showed great changes after osmotic, cold, and salt treatments in two banana varieties, suggested their potential role in regulating banana response to abiotic stress. Taken together, this systemic analysis reveals the involvement of banana 14-3-3 genes in fruit development, postharvest ripening, and response to abiotic stress and provides useful information for understanding the functions of 14-3-3 genes in banana. PMID:27713761
Zanotto, Paolo Marinho de Andrade; Krakauer, David C.
2008-01-01
We consider the concerted evolution of viral genomes in four families of DNA viruses. Given the high rate of horizontal gene transfer among viruses and their hosts, it is an open question as to how representative particular genes are of the evolutionary history of the complete genome. To address the concerted evolution of viral genes, we compared genomic evolution across four distinct, extant viral families. For all four viral families we constructed DNA-dependent DNA polymerase-based (DdDp) phylogenies and in addition, whole genome sequence, as quantitative descriptions of inter-genome relationships. We found that the history of the polymerase gene was highly predictive of the history of the genome as a whole, which we explain in terms of repeated, co-divergence events of the core DdDp gene accompanied by a number of satellite, accessory genetic loci. We also found that the rate of gene gain in baculovirus and poxviruses proceeds significantly more quickly than the rate of gene loss and that there is convergent acquisition of satellite functions promoting contextual adaptation when distinct viral families infect related hosts. The congruence of the genome and polymerase trees suggests that a large set of viral genes, including polymerase, derive from a phylogenetically conserved core of genes of host origin, secondarily reinforced by gene acquisition from common hosts or co-infecting viruses within the host. A single viral genome can be thought of as a mutualistic network, with the core genes acting as an effective host and the satellite genes as effective symbionts. Larger virus genomes show a greater departure from linkage equilibrium between core and satellites functions. PMID:18941535
Wang, Yijun; Deng, Dexiang; Shi, Yating; Miao, Nan; Bian, Yunlong; Yin, Zhitong
2012-03-01
Auxin response factors (ARFs), member of the plant-specific B3 DNA binding superfamily, target specifically to auxin response elements (AuxREs) in promoters of primary auxin-responsive genes and heterodimerize with Aux/IAA proteins in auxin signaling transduction cascade. In previous research, we have isolated and characterized maize Aux/IAA genes in whole-genome scale. Here, we report the comprehensive analysis of ARF genes in maize. A total of 36 ARF genes were identified and validated from the B73 maize genome through an iterative strategy. Thirty-six maize ARF genes are distributed in all maize chromosomes except chromosome 7. Maize ARF genes expansion is mainly due to recent segmental duplications. Maize ARF proteins share one B3 DNA binding domain which consists of seven-stranded β sheets and two short α helixes. Twelve maize ARFs with glutamine-rich middle regions could be as activators in modulating expression of auxin-responsive genes. Eleven maize ARF proteins are lack of homo- and heterodimerization domains. Putative cis-elements involved in phytohormones and light signaling responses, biotic and abiotic stress adaption locate in promoters of maize ARF genes. Expression patterns vary greatly between clades and sister pairs of maize ARF genes. The B3 DNA binding and auxin response factor domains of maize ARF proteins are primarily subjected to negative selection during selective sweep. The mixed selective forces drive the diversification and evolution of genomic regions outside of B3 and ARF domains. Additionally, the dicot-specific proliferation of ARF genes was detected. Comparative genomics analysis indicated that maize, sorghum and rice duplicate chromosomal blocks containing ARF homologs are highly syntenic. This study provides insights into the distribution, phylogeny and evolution of ARF gene family.
Galewski, Thomas; Tilak, Marie-ka; Sanchez, Sophie; Chevret, Pascale; Paradis, Emmanuel; Douzery, Emmanuel JP
2006-01-01
Background Mitochondrial and nuclear genes have generally been employed for different purposes in molecular systematics, the former to resolve relationships within recently evolved groups and the latter to investigate phylogenies at a deeper level. In the case of rapid and recent evolutionary radiations, mitochondrial genes like cytochrome b (CYB) are often inefficient for resolving phylogenetic relationships. One of the best examples is illustrated by Arvicolinae rodents (Rodentia; Muridae), the most impressive mammalian radiation of the Northern Hemisphere which produced voles, lemmings and muskrats. Here, we compare the relative contribution of a nuclear marker – the exon 10 of the growth hormone receptor (GHR) gene – to the one of the mitochondrial CYB for inferring phylogenetic relationships among the major lineages of arvicoline rodents. Results The analysis of GHR sequences improves the overall resolution of the Arvicolinae phylogeny. Our results show that the Caucasian long-clawed vole (Prometheomys schaposnikowi) is one of the basalmost arvicolines, and confirm that true lemmings (Lemmus) and collared lemmings (Dicrostonyx) are not closely related as suggested by morphology. Red-backed voles (Myodini) are found as the sister-group of a clade encompassing water vole (Arvicola), snow vole (Chionomys), and meadow voles (Microtus and allies). Within the latter, no support is recovered for the generic recognition of Blanfordimys, Lasiopodomys, Neodon, and Phaiomys as suggested by morphology. Comparisons of parameter estimates for branch lengths, base composition, among sites rate heterogeneity, and GTR relative substitution rates indicate that CYB sequences consistently exhibit more heterogeneity among codon positions than GHR. By analyzing the contribution of each codon position to node resolution, we show that the apparent higher efficiency of GHR is due to their third positions. Although we focus on speciation events spanning the last 10 million years (Myr), CYB sequences display highly saturated codon positions contrary to the nuclear exon. Lastly, variable length bootstrap predicts a significant increase in resolution of arvicoline phylogeny through the sequencing of nuclear data in an order of magnitude three to five times greater than the size of GHR exon 10. Conclusion Our survey provides a first resolved gene tree for Arvicolinae. The comparison of CYB and GHR phylogenetic efficiency supports recent assertions that nuclear genes are useful for resolving relationships of recently evolved animals. The superiority of nuclear exons may reside both in (i) less heterogeneity among sites, and (ii) the presence of highly informative sites in third codon positions, that evolve rapidly enough to accumulate synapomorphies, but slow enough to avoid substitutional saturation. PMID:17029633
Chromosome phylogenies of man, great apes, and Old World monkeys.
De Grouchy, J
1987-08-31
The karyotypes of man and of the closely related Pongidae--chimpanzee, gorilla, and orangutan--differ by a small number of well known rearrangements, mainly pericentric inversions and one fusion which reduced the chromosome number from 48 in the Pongidae to 46 in man. Dutrillaux et al. (1973, 1975, 1979) reconstructed the chromosomal phylogeny of the entire primate order. More and more distantly related species were compared thus moving backward in evolution to the common ancestors of the Pongidae, of the Cercopithecoidae, the Catarrhini, the Platyrrhini, the Prosimians, and finally the common ancestor of all primates. Descending the pyramid it becomes possible to assign the rearrangements that occurred in each phylum, and the one that led to man in particular. The main conclusions are that this phylogeny is compatible with the occurrence during evolution of simple chromosome rearrangements--inversions, fusions, reciprocal translocation, acquisition or loss of heterochromatin--and that it is entirely consistent with the known primate phylogeny based on physical morphology and molecular evolution. If heterochromatin is not taken into account, man has in common with the other primates practically all of his chromosomal material as determined by chromosome banding. However, it is arranged differently, according to species, on account of chromosome rearrangements. This interpretation has been confirmed by comparative gene mapping, which established that the same chromosome segments, identified by banding, carry the same genes (Finaz et al., 1973; Human Gene Mapping 8, 1985). A remarkable observation made by Dutrillaux is that different primate phyla seem to have adopted different chromosome rearrangements in the course of evolution: inversions for the Pongidae, Robertsonian fusions for the lemurs, etc. This observation may raise many questions, among which is that of an organized evolution. Also, the breakpoints of chromosomal rearrangements observed during evolution, in human chromosomal diseases, and after ionizing irradiation do not seem to be distributed at random. Chromosomal rearrangements observed in evolution are known to be harmful in humans, leading to complete or partial sterility through abnormal offspring in the heterozygous state but not in the homozygous state. They then become a robust reproductive barrier capable of creating new species, far more powerful than gene mutations advocated by neo-Darwinism. The homozygous state may be achieved especially through inbreeding, which must have played a major role during primate evolution.(ABSTRACT TRUNCATED AT 400 WORDS)
Majeský, Ľuboš; Schwarzacher, Trude; Gornall, Richard; Heslop-Harrison, Pat
2017-01-01
Chloroplast DNA sequences show substantial variation between higher plant species, and less variation within species, so are typically excellent markers to investigate evolutionary, population and genetic relationships and phylogenies. We sequenced the plastomes of Taraxacum obtusifrons Markl. (O978); T. stridulum Trávniček ined. (S3); and T. amplum Markl. (A978), three apomictic triploid (2n = 3x = 24) dandelions from the T. officinale agg. We aimed to characterize the variation in plastomes, define relationships and correlations with the apomictic microspecies status, and refine placement of the microspecies in the evolutionary or phylogenetic context of the Asteraceae. The chloroplast genomes of accessions O978 and S3 were identical and 151,322 bp long (where the nuclear genes are known to show variation), while A978 was 151,349 bp long. All three genomes contained 135 unique genes, with an additional copy of the trnF-GGA gene in the LSC region and 20 duplicated genes in the IR region, along with short repeats, the typical major Inverted Repeats (IR1 and IR2, 24,431bp long), and Large and Small Single Copy regions (LSC 83,889bp and SSC 18,571bp in O978). Between the two Taraxacum plastomes types, we identified 28 SNPs. The distribution of polymorphisms suggests some parts of the Taraxacum plastome are evolving at a slower rate. There was a hemi-nested inversion in the LSC region that is common to Asteraceae, and an SSC inversion from ndhF to rps15 found only in some Asteraceae lineages. A comparative repeat analysis showed variation between Taraxacum and the phylogenetically close genus Lactuca, with many more direct repeats of 40bp or more in Lactuca (1% larger plastome than Taraxacum). When individual genes and non-coding regions were for Asteraceae phylogeny reconstruction, not all showed the same evolutionary scenario suggesting care is needed for interpretation of relationships if a limited number of markers are used. Studying genotypic diversity in plastomes is important to characterize the nature of evolutionary processes in nuclear and cytoplasmic genomes with the different selection pressures, population structures and breeding systems. PMID:28182646
M Salih, Rubar Hussein; Majeský, Ľuboš; Schwarzacher, Trude; Gornall, Richard; Heslop-Harrison, Pat
2017-01-01
Chloroplast DNA sequences show substantial variation between higher plant species, and less variation within species, so are typically excellent markers to investigate evolutionary, population and genetic relationships and phylogenies. We sequenced the plastomes of Taraxacum obtusifrons Markl. (O978); T. stridulum Trávniček ined. (S3); and T. amplum Markl. (A978), three apomictic triploid (2n = 3x = 24) dandelions from the T. officinale agg. We aimed to characterize the variation in plastomes, define relationships and correlations with the apomictic microspecies status, and refine placement of the microspecies in the evolutionary or phylogenetic context of the Asteraceae. The chloroplast genomes of accessions O978 and S3 were identical and 151,322 bp long (where the nuclear genes are known to show variation), while A978 was 151,349 bp long. All three genomes contained 135 unique genes, with an additional copy of the trnF-GGA gene in the LSC region and 20 duplicated genes in the IR region, along with short repeats, the typical major Inverted Repeats (IR1 and IR2, 24,431bp long), and Large and Small Single Copy regions (LSC 83,889bp and SSC 18,571bp in O978). Between the two Taraxacum plastomes types, we identified 28 SNPs. The distribution of polymorphisms suggests some parts of the Taraxacum plastome are evolving at a slower rate. There was a hemi-nested inversion in the LSC region that is common to Asteraceae, and an SSC inversion from ndhF to rps15 found only in some Asteraceae lineages. A comparative repeat analysis showed variation between Taraxacum and the phylogenetically close genus Lactuca, with many more direct repeats of 40bp or more in Lactuca (1% larger plastome than Taraxacum). When individual genes and non-coding regions were for Asteraceae phylogeny reconstruction, not all showed the same evolutionary scenario suggesting care is needed for interpretation of relationships if a limited number of markers are used. Studying genotypic diversity in plastomes is important to characterize the nature of evolutionary processes in nuclear and cytoplasmic genomes with the different selection pressures, population structures and breeding systems.
Bakhoum, Niokhor; Galiana, Antoine; Le Roux, Christine; Kane, Aboubacry; Duponnois, Robin; Ndoye, Fatou; Fall, Dioumacor; Noba, Kandioura; Sylla, Samba Ndao; Diouf, Diégane
2015-04-01
Acacia senegal and Acacia seyal are small, deciduous legume trees, most highly valued for nitrogen fixation and for the production of gum arabic, a commodity of international trade since ancient times. Symbiotic nitrogen fixation by legumes represents the main natural input of atmospheric N2 into ecosystems which may ultimately benefit all organisms. We analyzed the nod and nif symbiotic genes and symbiotic properties of root-nodulating bacteria isolated from A. senegal and A. seyal in Senegal. The symbiotic genes of rhizobial strains from the two Acacia species were closed to those of Mesorhizobium plurifarium and grouped separately in the phylogenetic trees. Phylogeny of rhizobial nitrogen fixation gene nifH was similar to those of nodulation genes (nodA and nodC). All A. senegal rhizobial strains showed identical nodA, nodC, and nifH gene sequences. By contrast, A. seyal rhizobial strains exhibited different symbiotic gene sequences. Efficiency tests demonstrated that inoculation of both Acacia species significantly affected nodulation, total dry weight, acetylene reduction activity (ARA), and specific acetylene reduction activity (SARA) of plants. However, these cross-inoculation tests did not show any specificity of Mesorhizobium strains toward a given Acacia host species in terms of infectivity and efficiency as stated by principal component analysis (PCA). This study demonstrates that large-scale inoculation of A. senegal and A. seyal in the framework of reafforestation programs requires a preliminary step of rhizobial strain selection for both Acacia species.
Koopman, W J; Guetta, E; van de Wiel, C C; Vosman, B; van den Berg, R G
1998-11-01
Internal transcribed spacer (ITS-1) sequences from 97 accessions representing 23 species of Lactuca and related genera were determined and used to evaluate species relationships of Lactuca sensu lato (s.l.). The ITS-1 phylogenies, calculated using PAUP and PHYLIP, correspond better to the classification of Feráková than to other classifications evaluated, although the inclusion of sect. Lactuca subsect. Cyanicae is not supported. Therefore, exclusion of subsect. Cyanicae from Lactuca sensu Feráková is proposed. The amended genus contains the entire gene pool (sensu Harlan and De Wet) of cultivated lettuce (Lactuca sativa). The position of the species in the amended classification corresponds to their position in the lettuce gene pool. In the ITS-1 phylogenies, a clade with L. sativa, L. serriola, L. dregeana, L. altaica, and L. aculeata represents the primary gene pool. L. virosa and L. saligna, branching off closest to this clade, encompass the secondary gene pool. L. virosa is possibly of hybrid origin. The primary and secondary gene pool species are classified in sect. Lactuca subsect. Lactuca. The species L. quercina, L. viminea, L. sibirica, and L. tatarica, branching off next, represent the tertiary gene pool. They are classified in Lactuca sect. Lactucopsis, sect. Phaenixopus, and sect. Mulgedium, respectively. L. perennis and L. tenerrima, classified in sect. Lactuca subsect. Cyanicae, form clades with species from related genera and are not part of the lettuce gene pool.
Bolsheva, Nadezhda L; Melnikova, Nataliya V; Kirov, Ilya V; Speranskaya, Anna S; Krinitsina, Anastasia A; Dmitriev, Alexey A; Belenikin, Maxim S; Krasnov, George S; Lakunina, Valentina A; Snezhkina, Anastasiya V; Rozhmina, Tatiana A; Samatadze, Tatiana E; Yurkevich, Olga Yu; Zoshchuk, Svyatoslav A; Amosova, Аlexandra V; Kudryavtseva, Anna V; Muravenko, Olga V
2017-12-28
The species relationships within the genus Linum have already been studied several times by means of different molecular and phylogenetic approaches. Nevertheless, a number of ambiguities in phylogeny of Linum still remain unresolved. In particular, the species relationships within the sections Stellerolinum and Dasylinum need further clarification. Also, the question of independence of the species of the section Adenolinum still remains unanswered. Moreover, the relationships of L. narbonense and other species of the section Linum require further clarification. Additionally, the origin of tetraploid species of the section Linum (2n = 30) including the cultivated species L. usitatissimum has not been explored. The present study examines the phylogeny of blue-flowered species of Linum by comparisons of 5S rRNA gene sequences as well as ITS1 and ITS2 sequences of 35S rRNA genes. High-throughput sequencing has been used for analysis of multicopy rRNA gene families. In addition to the molecular phylogenetic analysis, the number and chromosomal localization of 5S and 35S rDNA sites has been determined by FISH. Our findings confirm that L. stelleroides forms a basal branch from the clade of blue-flowered flaxes which is independent of the branch formed by species of the sect. Dasylinum. The current molecular phylogenetic approaches, the cytogenetic analysis as well as different genomic DNA fingerprinting methods applied previously did not discriminate certain species within the sect. Adenolinum. The allotetraploid cultivated species L. usitatissimum and its wild ancestor L. angustifolium (2n = 30) could originate either as the result of hybridization of two diploid species (2n = 16) related to the modern L. gandiflorum and L. decumbens, or hybridization of a diploid species (2n = 16) and a diploid ancestor of modern L. narbonense (2n = 14). High-throughput sequencing of multicopy rRNA gene families allowed us to make several adjustments to the phylogeny of blue-flowered flax species and also reveal intra- and interspecific divergence of the rRNA gene sequences.
Horizontal Gene Transfer and the History of Life
Daubin, Vincent; Szöllősi, Gergely J.
2016-01-01
Microbes acquire DNA from a variety of sources. The last decades, which have seen the development of genome sequencing, have revealed that horizontal gene transfer has been a major evolutionary force that has constantly reshaped genomes throughout evolution. However, because the history of life must ultimately be deduced from gene phylogenies, the lack of methods to account for horizontal gene transfer has thrown into confusion the very concept of the tree of life. As a result, many questions remain open, but emerging methodological developments promise to use information conveyed by horizontal gene transfer that remains unexploited today. PMID:26801681
Novel molecular markers of Chlamydia pecorum genetic diversity in the koala (Phascolarctos cinereus)
2011-01-01
Background Chlamydia pecorum is an obligate intracellular bacterium and the causative agent of reproductive and ocular disease in several animal hosts including koalas, sheep, cattle and goats. C. pecorum strains detected in koalas are genetically diverse, raising interesting questions about the origin and transmission of this species within koala hosts. While the ompA gene remains the most widely-used target in C. pecorum typing studies, it is generally recognised that surface protein encoding genes are not suited for phylogenetic analysis and it is becoming increasingly apparent that the ompA gene locus is not congruent with the phylogeny of the C. pecorum genome. Using the recently sequenced C. pecorum genome sequence (E58), we analysed 10 genes, including ompA, to evaluate the use of ompA as a molecular marker in the study of koala C. pecorum genetic diversity. Results Three genes (incA, ORF663, tarP) were found to contain sufficient nucleotide diversity and discriminatory power for detailed analysis and were used, with ompA, to genotype 24 C. pecorum PCR-positive koala samples from four populations. The most robust representation of the phylogeny of these samples was achieved through concatenation of all four gene sequences, enabling the recreation of a "true" phylogenetic signal. OmpA and incA were of limited value as fine-detailed genetic markers as they were unable to confer accurate phylogenetic distinctions between samples. On the other hand, the tarP and ORF663 genes were identified as useful "neutral" and "contingency" markers respectively, to represent the broad evolutionary history and intra-species genetic diversity of koala C. pecorum. Furthermore, the concatenation of ompA, incA and ORF663 sequences highlighted the monophyletic nature of koala C. pecorum infections by demonstrating a single evolutionary trajectory for koala hosts that is distinct from that seen in non-koala hosts. Conclusions While the continued use of ompA as a fine-detailed molecular marker for epidemiological analysis appears justified, the tarP and ORF663 genes also appear to be valuable markers of phylogenetic or biogeographic divisions at the C. pecorum intra-species level. This research has significant implications for future typing studies to understand the phylogeny, genetic diversity, and epidemiology of C. pecorum infections in the koala and other animal species. PMID:21496349
Marsh, James; Kollipara, Avinash; Timms, Peter; Polkinghorne, Adam
2011-04-18
Chlamydia pecorum is an obligate intracellular bacterium and the causative agent of reproductive and ocular disease in several animal hosts including koalas, sheep, cattle and goats. C. pecorum strains detected in koalas are genetically diverse, raising interesting questions about the origin and transmission of this species within koala hosts. While the ompA gene remains the most widely-used target in C. pecorum typing studies, it is generally recognised that surface protein encoding genes are not suited for phylogenetic analysis and it is becoming increasingly apparent that the ompA gene locus is not congruent with the phylogeny of the C. pecorum genome. Using the recently sequenced C. pecorum genome sequence (E58), we analysed 10 genes, including ompA, to evaluate the use of ompA as a molecular marker in the study of koala C. pecorum genetic diversity. Three genes (incA, ORF663, tarP) were found to contain sufficient nucleotide diversity and discriminatory power for detailed analysis and were used, with ompA, to genotype 24 C. pecorum PCR-positive koala samples from four populations. The most robust representation of the phylogeny of these samples was achieved through concatenation of all four gene sequences, enabling the recreation of a "true" phylogenetic signal. OmpA and incA were of limited value as fine-detailed genetic markers as they were unable to confer accurate phylogenetic distinctions between samples. On the other hand, the tarP and ORF663 genes were identified as useful "neutral" and "contingency" markers respectively, to represent the broad evolutionary history and intra-species genetic diversity of koala C. pecorum. Furthermore, the concatenation of ompA, incA and ORF663 sequences highlighted the monophyletic nature of koala C. pecorum infections by demonstrating a single evolutionary trajectory for koala hosts that is distinct from that seen in non-koala hosts. While the continued use of ompA as a fine-detailed molecular marker for epidemiological analysis appears justified, the tarP and ORF663 genes also appear to be valuable markers of phylogenetic or biogeographic divisions at the C. pecorum intra-species level. This research has significant implications for future typing studies to understand the phylogeny, genetic diversity, and epidemiology of C. pecorum infections in the koala and other animal species.
Quantifying the mechanisms of domain gain in animal proteins.
Buljan, Marija; Frankish, Adam; Bateman, Alex
2010-01-01
Protein domains are protein regions that are shared among different proteins and are frequently functionally and structurally independent from the rest of the protein. Novel domain combinations have a major role in evolutionary innovation. However, the relative contributions of the different molecular mechanisms that underlie domain gains in animals are still unknown. By using animal gene phylogenies we were able to identify a set of high confidence domain gain events and by looking at their coding DNA investigate the causative mechanisms. Here we show that the major mechanism for gains of new domains in metazoan proteins is likely to be gene fusion through joining of exons from adjacent genes, possibly mediated by non-allelic homologous recombination. Retroposition and insertion of exons into ancestral introns through intronic recombination are, in contrast to previous expectations, only minor contributors to domain gains and have accounted for less than 1% and 10% of high confidence domain gain events, respectively. Additionally, exonization of previously non-coding regions appears to be an important mechanism for addition of disordered segments to proteins. We observe that gene duplication has preceded domain gain in at least 80% of the gain events. The interplay of gene duplication and domain gain demonstrates an important mechanism for fast neofunctionalization of genes.
2012-01-01
Background The marine environment is comprised of numerous divergent organisms living under similar selective pressures, often resulting in the evolution of convergent structures such as the fusiform body shape of pelagic squids, fishes, and some marine mammals. However, little is known about the frequency of, and circumstances leading to, convergent evolution in the open ocean. Here, we present a comparative study of the molluscan class Cephalopoda, a marine group known to occupy habitats from the intertidal to the deep sea. Several lineages bear features that may coincide with a benthic or pelagic existence, making this a valuable group for testing hypotheses of correlated evolution. To test for convergence and correlation, we generate the most taxonomically comprehensive multi-gene phylogeny of cephalopods to date. We then create a character matrix of habitat type and morphological characters, which we use to infer ancestral character states and test for correlation between habitat and morphology. Results Our study utilizes a taxonomically well-sampled phylogeny to show convergent evolution in all six morphological characters we analyzed. Three of these characters also correlate with habitat. The presence of an autogenic photophore (those relying upon autonomous enzymatic light reactions) is correlated with a pelagic habitat, while the cornea and accessory nidamental gland correlate with a benthic lifestyle. Here, we present the first statistical tests for correlation between convergent traits and habitat in cephalopods to better understand the evolutionary history of characters that are adaptive in benthic or pelagic environments, respectively. Discussion Our study supports the hypothesis that habitat has influenced convergent evolution in the marine environment: benthic organisms tend to exhibit similar characteristics that confer protection from invasion by other benthic taxa, while pelagic organisms possess features that facilitate crypsis and communication in an environment lacking physical refuges. Features that have originated multiple times in distantly related lineages are likely adaptive for the organisms inhabiting a particular environment: studying the frequency and evolutionary history of such convergent characters can increase understanding of the underlying forces driving ecological and evolutionary transitions in the marine environment. PMID:22839506
Figueroa, Diego F; Baco, Amy R
2014-12-24
We use full mitochondrial genomes to test the robustness of the phylogeny of the Octocorallia, to determine the evolutionary pathway for the five known mitochondrial gene rearrangements in octocorals, and to test the suitability of using mitochondrial genomes for higher taxonomic-level phylogenetic reconstructions. Our phylogeny supports three major divisions within the Octocorallia and show that Paragorgiidae is paraphyletic, with Sibogagorgia forming a sister branch to the Coralliidae. Furthermore, Sibogagorgia cauliflora has what is presumed to be the ancestral gene order in octocorals, but the presence of a pair of inverted repeat sequences suggest that this gene order was not conserved but rather evolved back to this apparent ancestral state. Based on this we recommend the resurrection of the family Sibogagorgiidae to fix the paraphyly of the Paragorgiidae. This is the first study to show that in the Octocorallia, mitochondrial gene orders have evolved back to an ancestral state after going through a gene rearrangement, with at least one of the gene orders evolving independently in different lineages. A number of studies have used gene boundaries to determine the type of mitochondrial gene arrangement present. However, our findings suggest that this method known as gene junction screening may miss evolutionary reversals. Additionally, substitution saturation analysis demonstrates that while whole mitochondrial genomes can be used effectively for phylogenetic analyses within Octocorallia, their utility at higher taxonomic levels within Cnidaria is inadequate. Therefore for phylogenetic reconstruction at taxonomic levels higher than subclass within the Cnidaria, nuclear genes will be required, even when whole mitochondrial genomes are available. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Fu, Wen-Bo; Li, Bo; He, Zheng-Bo
2018-01-01
Chemosensory proteins (CSP) are soluble carrier proteins that may function in odorant reception in insects. CSPs have not been thoroughly studied at whole-genome level, despite the availability of insect genomes. Here, we identified/reidentified 283 CSP genes in the genomes of 22 mosquitoes. All 283 CSP genes possess a highly conserved OS-D domain. We comprehensively analyzed these CSP genes and determined their conserved domains, structure, genomic distribution, phylogeny, and evolutionary patterns. We found an average of seven CSP genes in each of 19 Anopheles genomes, 27 CSP genes in Cx. quinquefasciatus, 43 in Ae. aegypti, and 83 in Ae. albopictus. The Anopheles CSP genes had a simple genomic organization with a relatively consistent gene distribution, while most of the Culicinae CSP genes were distributed in clusters on the scaffolds. Our phylogenetic analysis clustered the CSPs into two major groups: CSP1-8 and CSE1-3. The CSP1-8 groups were all monophyletic with good bootstrap support. The CSE1-3 groups were an expansion of the CSP family of genes specific to the three Culicinae species. The Ka/Ks ratios indicated that the CSP genes had been subject to purifying selection with relatively slow evolution. Our results provide a comprehensive framework for the study of the CSP gene family in these 22 mosquito species, laying a foundation for future work on CSP function in the detection of chemical cues in the surrounding environment. PMID:29304168
Mei, Ting; Fu, Wen-Bo; Li, Bo; He, Zheng-Bo; Chen, Bin
2018-01-01
Chemosensory proteins (CSP) are soluble carrier proteins that may function in odorant reception in insects. CSPs have not been thoroughly studied at whole-genome level, despite the availability of insect genomes. Here, we identified/reidentified 283 CSP genes in the genomes of 22 mosquitoes. All 283 CSP genes possess a highly conserved OS-D domain. We comprehensively analyzed these CSP genes and determined their conserved domains, structure, genomic distribution, phylogeny, and evolutionary patterns. We found an average of seven CSP genes in each of 19 Anopheles genomes, 27 CSP genes in Cx. quinquefasciatus, 43 in Ae. aegypti, and 83 in Ae. albopictus. The Anopheles CSP genes had a simple genomic organization with a relatively consistent gene distribution, while most of the Culicinae CSP genes were distributed in clusters on the scaffolds. Our phylogenetic analysis clustered the CSPs into two major groups: CSP1-8 and CSE1-3. The CSP1-8 groups were all monophyletic with good bootstrap support. The CSE1-3 groups were an expansion of the CSP family of genes specific to the three Culicinae species. The Ka/Ks ratios indicated that the CSP genes had been subject to purifying selection with relatively slow evolution. Our results provide a comprehensive framework for the study of the CSP gene family in these 22 mosquito species, laying a foundation for future work on CSP function in the detection of chemical cues in the surrounding environment.
Lin, Y H; Zhang, W; Li, J W; Zhang, H W; Chen, D Y
2017-01-01
In vertebrates, evolutionarily conserved signaling intermediate in the Toll pathway (ECSIT) interacts with the TNF-receptor associated factor 6 (TRAF6) to regulate the processing of MEKK1, activate NF-κB, and also control BMP target genes. However, the role of ECSIT in invertebrates remains largely unexplored. We performed comparative investigations of the expression, gene structure, and phylogeny of ECSIT, Toll-like receptor (TLR), and Smad4 in the cephalochordate Branchiostoma belcheri. Phylogenetic analysis indicated that, in amphioxus, ECSIT, TLR, and Smad4 form independent clusters at the base of Chordate clusters. Interestingly, overall gene structures were comparable to those in vertebrate orthologs. Transcripts of AmphiECSIT were detectable at the mid-neural stage, and continued to be expressed in the epithelium of the pharyngeal region at later stages. In adult animals, strong expression was observed in the nerve cord, endostyle, epithelial cells of the gut and wheel organ, genital membrane of the testis, and coelom and lymphoid cavities, what is highly similar to AmphiTLR and AmphiSmad4 expression patterns during development and in adult organisms. Our data suggests that ECSIT is evolutionarily conserved. Its amphioxus ortholog functions during embryonic development and as part of the innate immune system and may be involved in TLR/BMP signaling.
Angus, Robert B.; Ribera, Ignacio; Jia, Fenglong
2017-01-01
Abstract Karyotypes are given for Boreonectes emmerichi (Falkenström, 1936) from its type locality at Kangding, China, and for B. alpestris (Dutton & Angus, 2007) from the St Gotthard and San Bernardino passes in the Swiss Alps. A phylogeny based on sequence data from a combination of mitochondrial and nuclear genes recovered western Palaearctic species of Boreonectes as monophyletic with strong support. Boreonectes emmerichi was placed as sister to the north American forms of B. griseostriatus (De Geer, 1774), although with low support. The diversity of Palaearctic species of the B. griseostriatus species group is discussed. PMID:28919958
Angus, Robert B; Ribera, Ignacio; Jia, Fenglong
2017-01-01
Karyotypes are given for Boreonectes emmerichi (Falkenström, 1936) from its type locality at Kangding, China, and for B. alpestris (Dutton & Angus, 2007) from the St Gotthard and San Bernardino passes in the Swiss Alps. A phylogeny based on sequence data from a combination of mitochondrial and nuclear genes recovered western Palaearctic species of Boreonectes as monophyletic with strong support. Boreonectes emmerichi was placed as sister to the north American forms of B. griseostriatus (De Geer, 1774), although with low support. The diversity of Palaearctic species of the B. griseostriatus species group is discussed.
Washburne, Alex D; Silverman, Justin D; Leff, Jonathan W; Bennett, Dominic J; Darcy, John L; Mukherjee, Sayan; Fierer, Noah; David, Lawrence A
2017-01-01
Marker gene sequencing of microbial communities has generated big datasets of microbial relative abundances varying across environmental conditions, sample sites and treatments. These data often come with putative phylogenies, providing unique opportunities to investigate how shared evolutionary history affects microbial abundance patterns. Here, we present a method to identify the phylogenetic factors driving patterns in microbial community composition. We use the method, "phylofactorization," to re-analyze datasets from the human body and soil microbial communities, demonstrating how phylofactorization is a dimensionality-reducing tool, an ordination-visualization tool, and an inferential tool for identifying edges in the phylogeny along which putative functional ecological traits may have arisen.
Faille, Arnaud; Bourdeau, Charles; Fresneda, Javier
2012-01-01
Abstract A molecular phylogeny of the species from the Trechus brucki clade (previously Trechus uhagoni group)based on fragments of four mitochondrial genes and one nuclear gene is given. We describe Trechus (Trechus) bouilloni sp. n. from the western pre–Pyrenees: Sierras de Urbasa–Andía, Navarra, Spain. The species was collected in mesovoid shallow substratum (mss), a subterranean environment. Molecular as well as morphological evidences demonstrate that the new species belongs to the Trechus brucki clade. A narrow endemic species of high altitude in western French Pyrenees merged with Trechus brucki Fairmaire, 1862a, Trechus bruckoides sp. n., is described. A lectotype is designated for Trechus brucki and Trechus planiusculus Fairmaire, 1862b (junior synonym of Trechus brucki). The species group is redefined based on molecular and morphological characters, and renamed as the brucki group, as Trechus brucki was the first described species of the clade. A unique synapomorphy of the male genitalia, a characteristic secondary sclerotization of the sperm duct, which is shared by all the species of the brucki group sensu novo, is described and illustrated. The Trechus brucki group sensu novo is composed of Trechus beusti (Schaufuss, 1863), Trechus bouilloni sp. n., Trechus brucki, Trechus bruckoides sp. n., Trechus grenieri Pandellé, 1867, T. uhagoni uhagoni Crotch, 1869, T. uhagoni ruteri Colas, 1935 and Trechus pieltaini Jeannel, 1920. We discuss the taxonomy of the group and provide illustrations of structures showing the differences between the species, along with distribution data and biogeographical comments. PMID:22977341
Aiese Cigliano, Riccardo; Sanseverino, Walter; Cremona, Gaetana; Ercolano, Maria R; Conicella, Clara; Consiglio, Federica M
2013-01-28
Histone post-translational modifications (HPTMs) including acetylation and methylation have been recognized as playing a crucial role in epigenetic regulation of plant growth and development. Although Solanum lycopersicum is a dicot model plant as well as an important crop, systematic analysis and expression profiling of histone modifier genes (HMs) in tomato are sketchy. Based on recently released tomato whole-genome sequences, we identified in silico 32 histone acetyltransferases (HATs), 15 histone deacetylases (HDACs), 52 histone methytransferases (HMTs) and 26 histone demethylases (HDMs), and compared them with those detected in Arabidopsis (Arabidopsis thaliana), maize (Zea mays) and rice (Oryza sativa) orthologs. Comprehensive analysis of the protein domain architecture and phylogeny revealed the presence of non-canonical motifs and new domain combinations, thereby suggesting for HATs the existence of a new family in plants. Due to species-specific diversification during evolutionary history tomato has fewer HMs than Arabidopsis. The transcription profiles of HMs within tomato organs revealed a broad functional role for some HMs and a more specific activity for others, suggesting key HM regulators in tomato development. Finally, we explored S. pennellii introgression lines (ILs) and integrated the map position of HMs, their expression profiles and the phenotype of ILs. We thereby proved that the strategy was useful to identify HM candidates involved in carotenoid biosynthesis in tomato fruits. In this study, we reveal the structure, phylogeny and spatial expression of members belonging to the classical families of HMs in tomato. We provide a framework for gene discovery and functional investigation of HMs in other Solanaceae species.
De-la-Mora, Marisol; Piñero, Daniel; Oyama, Ken; Farrell, Brian; Magallón, Susana; Núñez-Farfán, Juan
2018-07-01
The family Curculionidae (Coleoptera), the "true" weevils, have diversified tightly linked to the evolution of flowering plants. Here, we aim to assess diversification at a lower taxonomic level. We analyze the evolution of the genus Trichobaris in association with their host plants. Trichobaris comprises eight to thirteen species; their larvae feed inside the fruits of Datura spp. or inside the stem of wild and cultivated species of Solanaceae, such as potato, tobacco and tomato. We ask the following questions: (1) does the rostrum of Trichobaris species evolve according to the plant tissue used to oviposit, i.e., shorter rostrum to dig in stems and longer to dig in fruits? and (2) does Trichobaris diversify mainly in relation to the use of Datura species? For the first question, we estimated the phylogeny of Trichobaris based on four gene sequences (nuclear 18S and 28S rRNA genes and mitochondrial 16S rRNA and COI genes). Then, we carried out morphogeometric analyses of the Trichobaris species using 75 landmarks. For the second question, we calibrated a COI haplotype phylogeny using a constant rate of divergence to infer the diversification time of Trichobaris species, and we traced the host plant species on the haplotype network. We performed an ancestral state reconstruction analysis to infer recent colonization events and conserved associations with host plant species. We found that ancestral species in the Trichobaris phylogeny use the stem of Solanum plants for oviposition and display weak sexual dimorphism of rostrum size, whereas other, more recent species of Trichobaris display sexual dimorphism in rostrum size and use the fruits of Datura species, and a possible reversion to use the stem of Solanaceae was detected in one Trichobaris species. The use of Datura species by Trichobaris species is widely distributed on haplotype networks and restricted to Trichobaris species that originated ca. 5 ± 1.5 Ma. Given that the origin of Trichobaris is estimated to be ca. 6 ± 1.5 Ma, it is likely that Datura has played a role in its diversification. Copyright © 2018 Elsevier Inc. All rights reserved.
Raymond, James A; Morgan-Kiss, Rachael
2017-08-01
Ice-associated algae produce ice-binding proteins (IBPs) to prevent freezing damage. The IBPs of the three chlorophytes that have been examined so far share little similarity across species, making it likely that they were acquired by horizontal gene transfer (HGT). To clarify the importance and source of IBPs in chlorophytes, we sequenced the IBP genes of another Antarctic chlorophyte, Chlamydomonas sp. ICE-MDV (Chlamy-ICE). Genomic DNA and total RNA were sequenced and screened for known ice-associated genes. Chlamy-ICE has as many as 50 IBP isoforms, indicating that they have an important role in survival. The IBPs are of the DUF3494 type and have similar exon structures. The DUF3494 sequences are much more closely related to prokaryotic sequences than they are to sequences in other chlorophytes, and the chlorophyte IBP and ribosomal 18S phylogenies are dissimilar. The multiple IBP isoforms found in Chlamy-ICE and other algae may allow the algae to adapt to a greater variety of ice conditions than prokaryotes, which typically have a single IBP gene. The predicted structure of the DUF3494 domain has an ice-binding face with an orderly array of hydrophilic side chains. The results indicate that Chlamy-ICE acquired its IBP genes by HGT in a single event. The acquisitions of IBP genes by this and other species of Antarctic algae by HGT appear to be key evolutionary events that allowed algae to extend their ranges into polar environments. © 2017 Phycological Society of America.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Leebens-Mack, Jim; Raubeson, Linda A.; Cui, Liying
2005-05-27
While there has been strong support for Amborella and Nymphaeales (water lilies) as branching from basal-most nodes in the angiosperm phylogeny, this hypothesis has recently been challenged by phylogenetic analyses of 61 protein-coding genes extracted from the chloroplast genome sequences of Amborella, Nymphaea and 12 other available land plant chloroplast genomes. These character-rich analyses placed the monocots, represented by three grasses (Poaceae), as sister to all other extant angiosperm lineages. We have extracted protein-coding regions from draft sequences for six additional chloroplast genomes to test whether this surprising result could be an artifact of long-branch attraction due to limited taxonmore » sampling. The added taxa include three monocots (Acorus, Yucca and Typha), a water lily (Nuphar), a ranunculid(Ranunculus), and a gymnosperm (Ginkgo). Phylogenetic analyses of the expanded DNA and protein datasets together with microstructural characters (indels) provided unambiguous support for Amborella and the Nymphaeales as branching from the basal-most nodes in the angiospermphylogeny. However, their relative positions proved to be dependent on method of analysis, with parsimony favoring Amborella as sister to all other angiosperms, and maximum likelihood and neighbor-joining methods favoring an Amborella + Nympheales clade as sister. The maximum likelihood phylogeny supported the later hypothesis, but the likelihood for the former hypothesis was not significantly different. Parametric bootstrap analysis, single gene phylogenies, estimated divergence dates and conflicting in del characters all help to illuminate the nature of the conflict in resolution of the most basal nodes in the angiospermphylogeny. Molecular dating analyses provided median age estimates of 161 mya for the most recent common ancestor of all extant angiosperms and 145 mya for the most recent common ancestor of monocots, magnoliids andeudicots. Whereas long sequences reduce variance in branch lengths and molecular dating estimates, the impact of improved taxon sampling on the rooting of the angiosperm phylogeny together with the results of parametric bootstrap analyses demonstrate how long-branch attraction can mislead genome-scale phylogenetic analyses.« less
Enkh-Amgalan, Jigjiddorj; Kawasaki, Hiroko; Seki, Tatsuji
2006-01-01
A major nif cluster was detected in the strictly anaerobic, Gram-positive phototrophic bacterium Heliobacterium chlorum. The cluster consisted of 11 genes arranged within a 10 kb region in the order nifI1, nifI2, nifH, nifD, nifK, nifE, nifN, nifX, fdx, nifB and nifV. The phylogenetic position of Hbt. chlorum was the same in the NifH, NifD, NifK, NifE and NifN trees; Hbt. chlorum formed a cluster with Desulfitobacterium hafniense, the closest neighbour of heliobacteria based on the 16S rRNA phylogeny, and two species of the genus Geobacter belonging to the Deltaproteobacteria. Two nifI genes, known to occur in the nif clusters of methanogenic archaea between nifH and nifD, were found upstream of the nifH gene of Hbt. chlorum. The organization of the nif operon and the phylogeny of individual and concatenated gene products showed that the Hbt. chlorum nif operon carrying nifI genes upstream of the nifH gene was an intermediate between the nif operon with nifI downstream of nifH (group II and III of the nitrogenase classification) and the nif operon lacking nifI (group I). Thus, the phylogenetic position of Hbt. chlorum nitrogenase may reflect an evolutionary stage of a divergence of the two nitrogenase groups, with group I consisting of the aerobic diazotrophs and group II consisting of strictly anaerobic prokaryotes.
Gnat, Sebastian; Małek, Wanda; Oleńska, Ewa; Wdowiak-Wróbel, Sylwia; Kalita, Michał; Łotocka, Barbara; Wójcik, Magdalena
2015-01-01
The phylogeny of symbiotic genes of Astragalus glycyphyllos L. (liquorice milkvetch) nodule isolates was studied by comparative sequence analysis of nodA, nodC, nodH and nifH loci. In all these genes phylograms, liquorice milkvetch rhizobia (closely related to bacteria of three species, i.e. Mesorhizobium amorphae, Mesorhizobium septentrionale and Mesorhizobium ciceri) formed one clearly separate cluster suggesting the horizontal transfer of symbiotic genes from a single ancestor to the bacteria being studied. The high sequence similarity of the symbiotic genes of A. glycyphyllos rhizobia (99-100% in the case of nodAC and nifH genes, and 98-99% in the case of nodH one) points to the relatively recent (in evolutionary scale) lateral transfer of these genes. In the nodACH and nifH phylograms, A. glycyphyllos nodule isolates were grouped together with the genus Mesorhizobium species in one monophyletic clade, close to M. ciceri, Mesorhizobium opportunistum and Mesorhizobium australicum symbiovar biserrulae bacteria, which correlates with the close relationship of these rhizobia host plants. Plant tests revealed the narrow host range of A. glycyphyllos rhizobia. They formed effective symbiotic interactions with their native host (A. glycyphyllos) and Amorpha fruticosa but not with 11 other fabacean species. The nodules induced on A. glycyphyllos roots were indeterminate with apical, persistent meristem, an age gradient of nodule tissues and cortical vascular bundles. To reflect the symbiosis-adaptive phenotype of rhizobia, specific for A. glycyphyllos, we propose for these bacteria the new symbiovar "glycyphyllae", based on nodA and nodC genes sequences.
Gnat, Sebastian; Małek, Wanda; Oleńska, Ewa; Wdowiak-Wróbel, Sylwia; Kalita, Michał; Łotocka, Barbara; Wójcik, Magdalena
2015-01-01
The phylogeny of symbiotic genes of Astragalus glycyphyllos L. (liquorice milkvetch) nodule isolates was studied by comparative sequence analysis of nodA, nodC, nodH and nifH loci. In all these genes phylograms, liquorice milkvetch rhizobia (closely related to bacteria of three species, i.e. Mesorhizobium amorphae, Mesorhizobium septentrionale and Mesorhizobium ciceri) formed one clearly separate cluster suggesting the horizontal transfer of symbiotic genes from a single ancestor to the bacteria being studied. The high sequence similarity of the symbiotic genes of A. glycyphyllos rhizobia (99–100% in the case of nodAC and nifH genes, and 98–99% in the case of nodH one) points to the relatively recent (in evolutionary scale) lateral transfer of these genes. In the nodACH and nifH phylograms, A. glycyphyllos nodule isolates were grouped together with the genus Mesorhizobium species in one monophyletic clade, close to M. ciceri, Mesorhizobium opportunistum and Mesorhizobium australicum symbiovar biserrulae bacteria, which correlates with the close relationship of these rhizobia host plants. Plant tests revealed the narrow host range of A. glycyphyllos rhizobia. They formed effective symbiotic interactions with their native host (A. glycyphyllos) and Amorpha fruticosa but not with 11 other fabacean species. The nodules induced on A. glycyphyllos roots were indeterminate with apical, persistent meristem, an age gradient of nodule tissues and cortical vascular bundles. To reflect the symbiosis-adaptive phenotype of rhizobia, specific for A. glycyphyllos, we propose for these bacteria the new symbiovar “glycyphyllae”, based on nodA and nodC genes sequences. PMID:26496493
Ghosh, Jayadri Sekhar; Bhattacharya, Samik; Pal, Amita
2017-06-01
The unavailability of the reproductive structure and unpredictability of vegetative characters for the identification and phylogenetic study of bamboo prompted the application of molecular techniques for greater resolution and consensus. We first employed internal transcribed spacer (ITS1, 5.8S rRNA and ITS2) sequences to construct the phylogenetic tree of 21 tropical bamboo species. While the sequence alone could grossly reconstruct the traditional phylogeny amongst the 21-tropical species studied, some anomalies were encountered that prompted a further refinement of the phylogenetic analyses. Therefore, we integrated the secondary structure of the ITS sequences to derive individual sequence-structure matrix to gain more resolution on the phylogenetic reconstruction. The results showed that ITS sequence-structure is the reliable alternative to the conventional phenotypic method for the identification of bamboo species. The best-fit topology obtained by the sequence-structure based phylogeny over the sole sequence based one underscores closer clustering of all the studied Bambusa species (Sub-tribe Bambusinae), while Melocanna baccifera, which belongs to Sub-Tribe Melocanneae, disjointedly clustered as an out-group within the consensus phylogenetic tree. In this study, we demonstrated the dependability of the combined (ITS sequence+structure-based) approach over the only sequence-based analysis for phylogenetic relationship assessment of bamboo.
Ludwig, A; Belfiore, N M; Pitra, C; Svirsky, V; Jenneckens, I
2001-07-01
Sturgeon (order Acipenserformes) provide an ideal taxonomic context for examination of genome duplication events. Multiple levels of ploidy exist among these fish. In a novel microsatellite approach, data from 962 fish from 20 sturgeon species were used for analysis of ploidy in sturgeon. Allele numbers in a sample of individuals were assessed at six microsatellite loci. Species with approximately 120 chromosomes are classified as functional diploid species, species with approximately 250 chromosomes as functional tetraploid species, and with approximately 500 chromosomes as functional octaploids. A molecular phylogeny of the sturgeon was determined on the basis of sequences of the entire mitochondrial cytochrome b gene. By mapping the estimated levels of ploidy on this proposed phylogeny we demonstrate that (I) polyploidization events independently occurred in the acipenseriform radiation; (II) the process of functional genome reduction is nearly finished in species with approximately 120 chromosomes and more active in species with approximately 250 chromosomes and approximately 500 chromosomes; and (III) species with approximately 250 and approximately 500 chromosomes arose more recently than those with approximately 120 chromosomes. These results suggest that gene silencing, chromosomal rearrangements, and transposition events played an important role in the acipenseriform genome formation. Furthermore, this phylogeny is broadly consistent with previous hypotheses but reveals a highly supported oceanic (Atlantic-Pacific) subdivision within the Acipenser/Huso complex.
Ludwig, A; Belfiore, N M; Pitra, C; Svirsky, V; Jenneckens, I
2001-01-01
Sturgeon (order Acipenserformes) provide an ideal taxonomic context for examination of genome duplication events. Multiple levels of ploidy exist among these fish. In a novel microsatellite approach, data from 962 fish from 20 sturgeon species were used for analysis of ploidy in sturgeon. Allele numbers in a sample of individuals were assessed at six microsatellite loci. Species with approximately 120 chromosomes are classified as functional diploid species, species with approximately 250 chromosomes as functional tetraploid species, and with approximately 500 chromosomes as functional octaploids. A molecular phylogeny of the sturgeon was determined on the basis of sequences of the entire mitochondrial cytochrome b gene. By mapping the estimated levels of ploidy on this proposed phylogeny we demonstrate that (I) polyploidization events independently occurred in the acipenseriform radiation; (II) the process of functional genome reduction is nearly finished in species with approximately 120 chromosomes and more active in species with approximately 250 chromosomes and approximately 500 chromosomes; and (III) species with approximately 250 and approximately 500 chromosomes arose more recently than those with approximately 120 chromosomes. These results suggest that gene silencing, chromosomal rearrangements, and transposition events played an important role in the acipenseriform genome formation. Furthermore, this phylogeny is broadly consistent with previous hypotheses but reveals a highly supported oceanic (Atlantic-Pacific) subdivision within the Acipenser/Huso complex. PMID:11454768
The prehistory of potyviruses: their initial radiation was during the dawn of agriculture.
Gibbs, Adrian J; Ohshima, Kazusato; Phillips, Matthew J; Gibbs, Mark J
2008-06-25
Potyviruses are found world wide, are spread by probing aphids and cause considerable crop damage. Potyvirus is one of the two largest plant virus genera and contains about 15% of all named plant virus species. When and why did the potyviruses become so numerous? Here we answer the first question and discuss the other. We have inferred the phylogenies of the partial coat protein gene sequences of about 50 potyviruses, and studied in detail the phylogenies of some using various methods and evolutionary models. Their phylogenies have been calibrated using historical isolation and outbreak events: the plum pox virus epidemic which swept through Europe in the 20th century, incursions of potyviruses into Australia after agriculture was established by European colonists, the likely transport of cowpea aphid-borne mosaic virus in cowpea seed from Africa to the Americas with the 16th century slave trade and the similar transport of papaya ringspot virus from India to the Americas. Our studies indicate that the partial coat protein genes of potyviruses have an evolutionary rate of about 1.15x10(-4) nucleotide substitutions/site/year, and the initial radiation of the potyviruses occurred only about 6,600 years ago, and hence coincided with the dawn of agriculture. We discuss the ways in which agriculture may have triggered the prehistoric emergence of potyviruses and fostered their speciation.
The Prehistory of Potyviruses: Their Initial Radiation Was during the Dawn of Agriculture
Gibbs, Adrian J.; Ohshima, Kazusato; Phillips, Matthew J.; Gibbs, Mark J.
2008-01-01
Background Potyviruses are found world wide, are spread by probing aphids and cause considerable crop damage. Potyvirus is one of the two largest plant virus genera and contains about 15% of all named plant virus species. When and why did the potyviruses become so numerous? Here we answer the first question and discuss the other. Methods and Findings We have inferred the phylogenies of the partial coat protein gene sequences of about 50 potyviruses, and studied in detail the phylogenies of some using various methods and evolutionary models. Their phylogenies have been calibrated using historical isolation and outbreak events: the plum pox virus epidemic which swept through Europe in the 20th century, incursions of potyviruses into Australia after agriculture was established by European colonists, the likely transport of cowpea aphid-borne mosaic virus in cowpea seed from Africa to the Americas with the 16th century slave trade and the similar transport of papaya ringspot virus from India to the Americas. Conclusions/Significance Our studies indicate that the partial coat protein genes of potyviruses have an evolutionary rate of about 1.15×10−4 nucleotide substitutions/site/year, and the initial radiation of the potyviruses occurred only about 6,600 years ago, and hence coincided with the dawn of agriculture. We discuss the ways in which agriculture may have triggered the prehistoric emergence of potyviruses and fostered their speciation. PMID:18575612
Li, Xiaofang; Zhu, Yong-Guan; Shaban, Babak; Bruxner, Timothy J. C.; Bond, Philip L.; Huang, Longbin
2015-01-01
Characterizing the genetic diversity of microbial copper (Cu) resistance at the community level remains challenging, mainly due to the polymorphism of the core functional gene copA. In this study, a local BLASTN method using a copA database built in this study was developed to recover full-length putative copA sequences from an assembled tailings metagenome; these sequences were then screened for potentially functioning CopA using conserved metal-binding motifs, inferred by evolutionary trace analysis of CopA sequences from known Cu resistant microorganisms. In total, 99 putative copA sequences were recovered from the tailings metagenome, out of which 70 were found with high potential to be functioning in Cu resistance. Phylogenetic analysis of selected copA sequences detected in the tailings metagenome showed that topology of the copA phylogeny is largely congruent with that of the 16S-based phylogeny of the tailings microbial community obtained in our previous study, indicating that the development of copA diversity in the tailings might be mainly through vertical descent with few lateral gene transfer events. The method established here can be used to explore copA (and potentially other metal resistance genes) diversity in any metagenome and has the potential to exhaust the full-length gene sequences for downstream analyses. PMID:26286020
Isolation, phylogeny and evolution of the SymRK gene in the legume genus Lupinus L.
Mahé, Frédéric; Markova, Dragomira; Pasquet, Rémy; Misset, Marie-Thérèse; Aïnouche, Abdelkader
2011-07-01
SymRK is one of the key genes involved in initial steps of legume symbiotic association with fungi (mycorrhization) and nitrogen-fixing bacteria (nodulation). A large portion of the sequence encoding the extracellular domain of SYMRK was obtained for 38 lupine accessions and 2 outgroups in order to characterize this region, to evaluate its phylogenetic utility, and to examine whether its molecular evolutionary pattern is correlated with rhizobial diversity and specificity in Lupinus. The data suggested that, in Lupinus, SymRK is a single copy gene that shows good phylogenetic potential. Accordingly, SymRK provided additional support to previous molecular phylogenies, and shed additional light on relationships within the Old World group of Lupinus, especially among the African species. Similar to results of other studies, analyses of SymRK sequences were unable to resolve placement of the Florida unifoliolate lineage, whose relationship was weakly supported to either the Old or the New World lupines. Our data are consistent with strong purifying selection operating on SymRK in Lupinus, preserving rather than diversifying its function. Thus, although SymRK was demonstrated to be a vital gene in the early stages of the root-bacterial symbiotic associations, no evidence from present analyses indicate that this gene is involved in changes in rhizobial specificity in Lupinus. Copyright © 2011 Elsevier Inc. All rights reserved.
Mihali, Troco K; Kellmann, Ralf; Neilan, Brett A
2009-03-30
Saxitoxin and its analogues collectively known as the paralytic shellfish toxins (PSTs) are neurotoxic alkaloids and are the cause of the syndrome named paralytic shellfish poisoning. PSTs are produced by a unique biosynthetic pathway, which involves reactions that are rare in microbial metabolic pathways. Nevertheless, distantly related organisms such as dinoflagellates and cyanobacteria appear to produce these toxins using the same pathway. Hypothesised explanations for such an unusual phylogenetic distribution of this shared uncommon metabolic pathway, include a polyphyletic origin, an involvement of symbiotic bacteria, and horizontal gene transfer. We describe the identification, annotation and bioinformatic characterisation of the putative paralytic shellfish toxin biosynthesis clusters in an Australian isolate of Anabaena circinalis and an American isolate of Aphanizomenon sp., both members of the Nostocales. These putative PST gene clusters span approximately 28 kb and contain genes coding for the biosynthesis and export of the toxin. A putative insertion/excision site in the Australian Anabaena circinalis AWQC131C was identified, and the organization and evolution of the gene clusters are discussed. A biosynthetic pathway leading to the formation of saxitoxin and its analogues in these organisms is proposed. The PST biosynthesis gene cluster presents a mosaic structure, whereby genes have apparently transposed in segments of varying size, resulting in different gene arrangements in all three sxt clusters sequenced so far. The gene cluster organizational structure and sequence similarity seems to reflect the phylogeny of the producer organisms, indicating that the gene clusters have an ancient origin, or that their lateral transfer was also an ancient event. The knowledge we gain from the characterisation of the PST biosynthesis gene clusters, including the identity and sequence of the genes involved in the biosynthesis, may also afford the identification of these gene clusters in dinoflagellates, the cause of human mortalities and significant financial loss to the tourism and shellfish industries.
Mihali, Troco K; Kellmann, Ralf; Neilan, Brett A
2009-01-01
Background Saxitoxin and its analogues collectively known as the paralytic shellfish toxins (PSTs) are neurotoxic alkaloids and are the cause of the syndrome named paralytic shellfish poisoning. PSTs are produced by a unique biosynthetic pathway, which involves reactions that are rare in microbial metabolic pathways. Nevertheless, distantly related organisms such as dinoflagellates and cyanobacteria appear to produce these toxins using the same pathway. Hypothesised explanations for such an unusual phylogenetic distribution of this shared uncommon metabolic pathway, include a polyphyletic origin, an involvement of symbiotic bacteria, and horizontal gene transfer. Results We describe the identification, annotation and bioinformatic characterisation of the putative paralytic shellfish toxin biosynthesis clusters in an Australian isolate of Anabaena circinalis and an American isolate of Aphanizomenon sp., both members of the Nostocales. These putative PST gene clusters span approximately 28 kb and contain genes coding for the biosynthesis and export of the toxin. A putative insertion/excision site in the Australian Anabaena circinalis AWQC131C was identified, and the organization and evolution of the gene clusters are discussed. A biosynthetic pathway leading to the formation of saxitoxin and its analogues in these organisms is proposed. Conclusion The PST biosynthesis gene cluster presents a mosaic structure, whereby genes have apparently transposed in segments of varying size, resulting in different gene arrangements in all three sxt clusters sequenced so far. The gene cluster organizational structure and sequence similarity seems to reflect the phylogeny of the producer organisms, indicating that the gene clusters have an ancient origin, or that their lateral transfer was also an ancient event. The knowledge we gain from the characterisation of the PST biosynthesis gene clusters, including the identity and sequence of the genes involved in the biosynthesis, may also afford the identification of these gene clusters in dinoflagellates, the cause of human mortalities and significant financial loss to the tourism and shellfish industries. PMID:19331657
TDR Targets: a chemogenomics resource for neglected diseases.
Magariños, María P; Carmona, Santiago J; Crowther, Gregory J; Ralph, Stuart A; Roos, David S; Shanmugam, Dhanasekaran; Van Voorhis, Wesley C; Agüero, Fernán
2012-01-01
The TDR Targets Database (http://tdrtargets.org) has been designed and developed as an online resource to facilitate the rapid identification and prioritization of molecular targets for drug development, focusing on pathogens responsible for neglected human diseases. The database integrates pathogen specific genomic information with functional data (e.g. expression, phylogeny, essentiality) for genes collected from various sources, including literature curation. This information can be browsed and queried using an extensive web interface with functionalities for combining, saving, exporting and sharing the query results. Target genes can be ranked and prioritized using numerical weights assigned to the criteria used for querying. In this report we describe recent updates to the TDR Targets database, including the addition of new genomes (specifically helminths), and integration of chemical structure, property and bioactivity information for biological ligands, drugs and inhibitors and cheminformatic tools for querying and visualizing these chemical data. These changes greatly facilitate exploration of linkages (both known and predicted) between genes and small molecules, yielding insight into whether particular proteins may be druggable, effectively allowing the navigation of chemical space in a genomics context.
TDR Targets: a chemogenomics resource for neglected diseases
Magariños, María P.; Carmona, Santiago J.; Crowther, Gregory J.; Ralph, Stuart A.; Roos, David S.; Shanmugam, Dhanasekaran; Van Voorhis, Wesley C.; Agüero, Fernán
2012-01-01
The TDR Targets Database (http://tdrtargets.org) has been designed and developed as an online resource to facilitate the rapid identification and prioritization of molecular targets for drug development, focusing on pathogens responsible for neglected human diseases. The database integrates pathogen specific genomic information with functional data (e.g. expression, phylogeny, essentiality) for genes collected from various sources, including literature curation. This information can be browsed and queried using an extensive web interface with functionalities for combining, saving, exporting and sharing the query results. Target genes can be ranked and prioritized using numerical weights assigned to the criteria used for querying. In this report we describe recent updates to the TDR Targets database, including the addition of new genomes (specifically helminths), and integration of chemical structure, property and bioactivity information for biological ligands, drugs and inhibitors and cheminformatic tools for querying and visualizing these chemical data. These changes greatly facilitate exploration of linkages (both known and predicted) between genes and small molecules, yielding insight into whether particular proteins may be druggable, effectively allowing the navigation of chemical space in a genomics context. PMID:22116064
Joy, Linu; Mohitha, C; Divya, P R; Gopalakrishnan, A; Basheer, V S; Jena, J K
2016-07-01
Cobia, Rachycentron canadum, is an economically important migratory fish distributed in tropical waters worldwide and is a candidate fish species for aquaculture practices. The genetic stock structure of R. canadum distributed along the Indian waters was identified using mitochondrial ATPase 6 and 8 genes. A total of 842 bp sequence of ATPase 6/8 genes obtained in this study revealed 15 haplotypes with mean low nucleotide diversity (π = 0.001) and high haplotype diversity (h = 0.785). AMOVA indicated the genetic differentiation of 90.47% for individuals within the population. This is well supported by co-efficient of genetic differentiation (FST) values obtained for pairwise populations that were low and non-significant with an overall value of 0.002. The parsimony network tree revealed star-like phylogeny and all the haplotypes were connected with each other by single mutational event. The findings of the present study indicated the panmixia nature of the species which can be managed as a unit stock in Indian waters.
Stamatakis, Alexandros; Ott, Michael
2008-12-27
The continuous accumulation of sequence data, for example, due to novel wet-laboratory techniques such as pyrosequencing, coupled with the increasing popularity of multi-gene phylogenies and emerging multi-core processor architectures that face problems of cache congestion, poses new challenges with respect to the efficient computation of the phylogenetic maximum-likelihood (ML) function. Here, we propose two approaches that can significantly speed up likelihood computations that typically represent over 95 per cent of the computational effort conducted by current ML or Bayesian inference programs. Initially, we present a method and an appropriate data structure to efficiently compute the likelihood score on 'gappy' multi-gene alignments. By 'gappy' we denote sampling-induced gaps owing to missing sequences in individual genes (partitions), i.e. not real alignment gaps. A first proof-of-concept implementation in RAXML indicates that this approach can accelerate inferences on large and gappy alignments by approximately one order of magnitude. Moreover, we present insights and initial performance results on multi-core architectures obtained during the transition from an OpenMP-based to a Pthreads-based fine-grained parallelization of the ML function.
Genome-wide analysis of the WRKY transcription factors in aegilops tauschii.
Ma, Jianhui; Zhang, Daijing; Shao, Yun; Liu, Pei; Jiang, Lina; Li, Chunxi
2014-01-01
The WRKY transcription factors (TFs) play important roles in responding to abiotic and biotic stress in plants. However, due to its unfinished genome sequencing, relatively few WRKY TFs with full-length coding sequences (CDSs) have been identified in wheat. Instead, the Aegilops tauschii genome, which is the D-genome progenitor of the hexaploid wheat genome, provides important resources for the discovery of new genes. In this study, we performed a bioinformatics analysis to identify WRKY TFs with full-length CDSs from the A. tauschii genome. A detailed evolutionary analysis for all these TFs was conducted, and quantitative real-time PCR was carried out to investigate the expression patterns of the abiotic stress-related WRKY TFs under different abiotic stress conditions in A. tauschii seedlings. A total of 93 WRKY TFs were identified from A. tauschii, and 79 of them were found to be newly discovered genes compared with wheat. Gene phylogeny, gene structure and chromosome location of the 93 WRKY TFs were fully analyzed. These studies provide a global view of the WRKY TFs from A. tauschii and a firm foundation for further investigations in both A. tauschii and wheat. © 2015 S. Karger AG, Basel.
Pagaling, Eulyn; Gatica, Joao; Yang, Kun; Cytryn, Eddie; Yan, Tao
2016-09-01
The aim of this study was to determine the phylogenetic diversity of ceftriaxone resistance and the presence of known extended-spectrum β-lactamase (ESBL) genes in culturable soil resistomes. Libraries of soil bacterial isolates resistant to ceftriaxone were established from six physicochemically diverse soils collected in Hawaii (USA) and Israel. The phylogenetic affiliation, ceftriaxone and multidrug resistance levels, and presence of known ESBL genes of the isolates were determined. The soil bacterial isolates were phylogenetically grouped with the Alphaproteobacteria, Betaproteobacteria, Gammaproteobacteria, Actinobacteria, Firmicutes and Bacteroidetes. Ceftriaxone minimum inhibitory concentrations (MICs) largely followed the phylogeny structure and higher levels of ceftriaxone resistance corresponded to higher multidrug resistance. Three distinct blaTEM variants were detected in soil bacterial isolates belonging to nine different genera. In conclusion, the culturable soil resistomes for ceftriaxone exhibited high phylogenetic diversity and multidrug resistance. blaTEM was the only known ESBL detected in the soil resistomes, and its distribution in different phylogenetic groups suggests its ubiquitous presence and/or possible horizontal gene transfer within the soil microbiomes. Copyright © 2016 International Society for Chemotherapy of Infection and Cancer. Published by Elsevier Ltd. All rights reserved.
Xu, Jinshi; Chen, Yu; Zhang, Lixia; Chai, Yongfu; Wang, Mao; Guo, Yaoxin; Li, Ting; Yue, Ming
2017-07-01
Community assembly processes is the primary focus of community ecology. Using phylogenetic-based and functional trait-based methods jointly to explore these processes along environmental gradients are useful ways to explain the change of assembly mechanisms under changing world. Our study combined these methods to test assembly processes in wide range gradients of elevation and other habitat environmental factors. We collected our data at 40 plots in Taibai Mountain, China, with more than 2,300 m altitude difference in study area and then measured traits and environmental factors. Variance partitioning was used to distinguish the main environment factors leading to phylogeny and traits change among 40 plots. Principal component analysis (PCA) was applied to colligate other environment factors. Community assembly patterns along environmental gradients based on phylogenetic and functional methods were studied for exploring assembly mechanisms. Phylogenetic signal was calculated for each community along environmental gradients in order to detect the variation of trait performance on phylogeny. Elevation showed a better explanatory power than other environment factors for phylogenetic and most traits' variance. Phylogenetic and several functional structure clustered at high elevation while some conserved traits overdispersed. Convergent tendency which might be caused by filtering or competition along elevation was detected based on functional traits. Leaf dry matter content (LDMC) and leaf nitrogen content along PCA 1 axis showed conflicting patterns comparing to patterns showed on elevation. LDMC exhibited the strongest phylogenetic signal. Only the phylogenetic signal of maximum plant height showed explicable change along environmental gradients. Synthesis . Elevation is the best environment factors for predicting phylogeny and traits change. Plant's phylogenetic and some functional structures show environmental filtering in alpine region while it shows different assembly processes in middle- and low-altitude region by different trait/phylogeny. The results highlight deterministic processes dominate community assembly in large-scale environmental gradients. Performance of phylogeny and traits along gradients may be independent with each other. The novel method for calculating functional structure which we used in this study and the focus of phylogenetic signal change along gradients may provide more useful ways to detect community assembly mechanisms.
Inferring explicit weighted consensus networks to represent alternative evolutionary histories
2013-01-01
Background The advent of molecular biology techniques and constant increase in availability of genetic material have triggered the development of many phylogenetic tree inference methods. However, several reticulate evolution processes, such as horizontal gene transfer and hybridization, have been shown to blur the species evolutionary history by causing discordance among phylogenies inferred from different genes. Methods To tackle this problem, we hereby describe a new method for inferring and representing alternative (reticulate) evolutionary histories of species as an explicit weighted consensus network which can be constructed from a collection of gene trees with or without prior knowledge of the species phylogeny. Results We provide a way of building a weighted phylogenetic network for each of the following reticulation mechanisms: diploid hybridization, intragenic recombination and complete or partial horizontal gene transfer. We successfully tested our method on some synthetic and real datasets to infer the above-mentioned evolutionary events which may have influenced the evolution of many species. Conclusions Our weighted consensus network inference method allows one to infer, visualize and validate statistically major conflicting signals induced by the mechanisms of reticulate evolution. The results provided by the new method can be used to represent the inferred conflicting signals by means of explicit and easy-to-interpret phylogenetic networks. PMID:24359207
Hu, Qianni; Sun, Genlou
2017-06-01
Two single-copy nuclear genes, the second largest subunit of RNA polymerase II (RPB2) and thioredoxin-like gene (HTL), were used to explore the phylogeny and origin of polyploid species in Hordeum. Our results were partly in accord with previous studies, but disclosed additional complexity. Both RPB2 and HTL trees confirmed the presence of Xa genome in H. capense and H. secalinum, and that H. depressum originated from H. californicum together with other American diploids, either H. intercedens or H. pusillum. American diploids solely contributed to the origin of H. depressum. The Asian diploids, either H. bogdanii or H. brevisubulatum, contributed to the formation of American polyploids except H. depressum. RPB2 and HTL sequences showed that H. roshevitzii did not contribute to the origin of American tetraploids. Our data showed a close relationship between the hexaploids H. procerum and H. parodii and the tetraploids H. brachyantherum, H. fuegianum, H. guatemalense, H. jubatum, and H. tetraploidum. The involvement of the diploid H. pusillum and the tetraploid H. jubatum in the formation of H. arizonicum was also indicated in the HTL phylogeny. Our results suggested a possible gene introgression of W- and P-genome species into the tetraploid H. jubatum and the hexaploid H. procerum.
Phylogenomics of the Zygomycete lineages: Exploring phylogeny and genome evolution
USDA-ARS?s Scientific Manuscript database
The Zygomycete lineages mark the major transition from zoosporic life histories of the common ancestors of Fungi and the earliest diverging chytrid lineages (Chytridiomycota and Blastocladiomycota). Genome comparisons from these lineages may reveal gene content changes that reflect the transition to...
Computational analysis of molt-inhibiting hormone from selected crustaceans.
C, Kumaraswamy Naidu; Y, Suneetha; P, Sreenivasula Reddy
2013-12-01
Molt-inhibiting hormone (MIH) is a principal endocrine hormone regulating the growth in crustaceans. In total, nine MIH peptide sequences representing members of the family Penaeidae (Penaeus monodon, Litopenaeus vannamei, Marsupenaeus japonicus), Portunidae (Portunus trituberculatus, Charybdis japonica, Charybdis feriata), Cambaridae (Procambarus bouvieri), Parastacidae (Cherax quadricarinatus) and Varunidae (Eriocheir sinensis) were selected for our study. In order to develop a structure based phylogeny, predict functionally important regions and to define stability changes upon single site mutations, the 3D structure of MIH for the crustaceans were built by using homology modeling based on the known structure of MIH from M. japonicus (1J0T). Structure based phylogeny showed a close relationship between P. bouvieri and C. japonica. ConSurf server analysis showed that the residues Cys(8), Arg(15), Cys(25), Asp(27), Cys(28), Asn(30), Arg(33), Cys(41), Cys(45), Phe(51), and Cys(54) may be functionally significant among the MIH of crustaceans. Single amino acid substitutions 'Y' and 'G' at the positions 71 and 72 of the MIH C-terminal region showed an alteration in the stability indicating that a change in this region may alter the function of MIH. In conclusion, we proposed a computational approach to analyze the structure, phylogeny and stability of MIH from crustaceans. © 2013.
2010-01-01
Background Cinnamyl Alcohol Dehydrogenase (CAD) proteins function in lignin biosynthesis and play a critical role in wood development and plant defense against stresses. Previous phylogenetic studies did not include genes from seedless plants and did not reflect the deep evolutionary history of this gene family. We reanalyzed the phylogeny of CAD and CAD-like genes using a representative dataset including lycophyte and bryophyte sequences. Many CAD/CAD-like genes do not seem to be associated with wood development under normal growth conditions. To gain insight into the functional evolution of CAD/CAD-like genes, we analyzed their expression in Populus plant tissues in response to feeding damage by gypsy moth larvae (Lymantria dispar L.). Expression of CAD/CAD-like genes in Populus tissues (xylem, leaves, and barks) was analyzed in herbivore-treated and non-treated plants by real time quantitative RT-PCR. Results CAD family genes were distributed in three classes based on sequence conservation. All the three classes are represented by seedless as well as seed plants, including the class of bona fide lignin pathway genes. The expression of some CAD/CAD-like genes that are not associated with xylem development were induced following herbivore damage in leaves, while other genes were induced in only bark or xylem tissues. Five of the CAD/CAD-like genes, however, showed a shift in expression from one tissue to another between non-treated and herbivore-treated plants. Systemic expression of the CAD/CAD-like genes was generally suppressed. Conclusions Our results indicated a correlation between the evolution of the CAD gene family and lignin and that the three classes of genes may have evolved in the ancestor of land plants. Our results also suggest that the CAD/CAD-like genes have evolved a diversity of expression profiles and potentially different functions, but that they are nonetheless co-regulated under stress conditions. PMID:20509918
Pair-flowered cymes in the Lamiales: structure, distribution and origin
Weber, Anton
2013-01-01
Background and Aims In the Lamiales, indeterminate thyrses (made up of axillary cymes) represent a significant inflorescence type. However, it has been largely overlooked that there occur two types of cymes: (1) ordinary cymes, and (2) ‘pair-flowered cymes’ (PFCs), with a flower pair (terminal and front flower) topping each cyme unit. PFCs are unique to the Lamiales and their distribution, origin and phylogeny are not well understood. Methods The Lamiales are screened as to the occurrence of PFCs, ordinary cymes and single flowers (constituting racemic inflorescences). Key Results PFCs are shown to exhibit a considerable morphological and developmental diversity and are documented to occur in four neighbouring taxa of Lamiales: Calceolariaceae, Sanango, Gesneriaceae and Plantaginaceae. They are omnipresent in the Calceolariaceae and almost so in the Gesneriaceae. In the Plantaginaceae, PFCs are restricted to the small sister tribes Russelieae and Cheloneae (while the large remainder has single flowers in the leaf/bract axils; ordinary cymes do not occur). Regarding the origin of PFCs, the inflorescences of the genus Peltanthera (unplaced as to family; sister to Calceolariaceae, Sanango and Gesneriaceae in most molecular phylogenies) support the idea that PFCs have originated from paniculate systems, with the front-flowers representing remnant flowers. Conclusions From the exclusive occurrence of PFCs in the Lamiales and the proximity of the respective taxa in molecular phylogenies it may be expected that PFCs have originated once, representing a synapomorphy for this group of taxa and fading out within the Plantaginaceae. However, molecular evidence is ambiguous. Depending on the position of Peltanthera (depending in turn on the kind and number of genes and taxa analysed) a single, a double (the most probable scenario) or a triple origin appears conceivable. PMID:23884395
Yutin, Natalya; Galperin, Michael Y.
2014-01-01
Summary The class Clostridia in the phylum Firmicutes (formerly low-G+C Gram-positive bacteria) includes diverse bacteria of medical, environmental, and biotechnological importance. The Selenomonas-Megasphaera-Sporomusa branch, which unifies members of the Firmicutes with Gram-negative-type cell envelopes, was recently moved from Clostridia to a separate class Negativicutes. However, draft genome sequences of the spore-forming members of the Negativicutes revealed typically clostridial sets of sporulation genes. To address this and other questions in clostridial phylogeny, we have compared a phylogenetic tree for a concatenated set of 50 widespread ribosomal proteins with the trees for beta subunits of the RNA polymerase (RpoB) and DNA gyrase (GyrB) and with the 16S rRNA-based phylogeny. The results obtained by these methods showed remarkable consistency, suggesting that they reflect the true evolutionary history of these bacteria. These data put the Selenomonas-Megasphaera-Sporomusa group back within the Clostridia. They also support placement of Clostridium difficile and its close relatives within the family Peptostreptococcaceae; we suggest resolving the long-standing naming conundrum by renaming it Peptoclostridium difficile. These data also indicate the existence of a group of cellulolytic clostridia that belong to the family Ruminococcaceae. As a tentative solution to resolve the current taxonomical problems, we propose assigning 78 validly described Clostridium species that clearly fall outside the family Clostridiaceae to six new genera: Peptoclostridium, Lachnoclostridium, Ruminiclostridium, Erysipelatoclostridium, Gottschalkia, and Tyzzerella. This work reaffirms that 16S rRNA and ribosomal protein sequences are better indicators of evolutionary proximity than phenotypic traits, even such key ones as the structure of the cell envelope and Gram-staining pattern. PMID:23834245
Erickson, David L.; Jones, Frank A.; Swenson, Nathan G.; Pei, Nancai; Bourg, Norman A.; Chen, Wenna; Davies, Stuart J.; Ge, Xue-jun; Hao, Zhanqing; Howe, Robert W.; Huang, Chun-Lin; Larson, Andrew J.; Lum, Shawn K. Y.; Lutz, James A.; Ma, Keping; Meegaskumbura, Madhava; Mi, Xiangcheng; Parker, John D.; Fang-Sun, I.; Wright, S. Joseph; Wolf, Amy T.; Ye, W.; Xing, Dingliang; Zimmerman, Jess K.; Kress, W. John
2014-01-01
Forest dynamics plots, which now span longitudes, latitudes, and habitat types across the globe, offer unparalleled insights into the ecological and evolutionary processes that determine how species are assembled into communities. Understanding phylogenetic relationships among species in a community has become an important component of assessing assembly processes. However, the application of evolutionary information to questions in community ecology has been limited in large part by the lack of accurate estimates of phylogenetic relationships among individual species found within communities, and is particularly limiting in comparisons between communities. Therefore, streamlining and maximizing the information content of these community phylogenies is a priority. To test the viability and advantage of a multi-community phylogeny, we constructed a multi-plot mega-phylogeny of 1347 species of trees across 15 forest dynamics plots in the ForestGEO network using DNA barcode sequence data (rbcL, matK, and psbA-trnH) and compared community phylogenies for each individual plot with respect to support for topology and branch lengths, which affect evolutionary inference of community processes. The levels of taxonomic differentiation across the phylogeny were examined by quantifying the frequency of resolved nodes throughout. In addition, three phylogenetic distance (PD) metrics that are commonly used to infer assembly processes were estimated for each plot [PD, Mean Phylogenetic Distance (MPD), and Mean Nearest Taxon Distance (MNTD)]. Lastly, we examine the partitioning of phylogenetic diversity among community plots through quantification of inter-community MPD and MNTD. Overall, evolutionary relationships were highly resolved across the DNA barcode-based mega-phylogeny, and phylogenetic resolution for each community plot was improved when estimated within the context of the mega-phylogeny. Likewise, when compared with phylogenies for individual plots, estimates of phylogenetic diversity in the mega-phylogeny were more consistent, thereby removing a potential source of bias at the plot-level, and demonstrating the value of assessing phylogenetic relationships simultaneously within a mega-phylogeny. An unexpected result of the comparisons among plots based on the mega-phylogeny was that the communities in the ForestGEO plots in general appear to be assemblages of more closely related species than expected by chance, and that differentiation among communities is very low, suggesting deep floristic connections among communities and new avenues for future analyses in community ecology. PMID:25414723
An experimental phylogeny to benchmark ancestral sequence reconstruction
Randall, Ryan N.; Radford, Caelan E.; Roof, Kelsey A.; Natarajan, Divya K.; Gaucher, Eric A.
2016-01-01
Ancestral sequence reconstruction (ASR) is a still-burgeoning method that has revealed many key mechanisms of molecular evolution. One criticism of the approach is an inability to validate its algorithms within a biological context as opposed to a computer simulation. Here we build an experimental phylogeny using the gene of a single red fluorescent protein to address this criticism. The evolved phylogeny consists of 19 operational taxonomic units (leaves) and 17 ancestral bifurcations (nodes) that display a wide variety of fluorescent phenotypes. The 19 leaves then serve as ‘modern' sequences that we subject to ASR analyses using various algorithms and to benchmark against the known ancestral genotypes and ancestral phenotypes. We confirm computer simulations that show all algorithms infer ancient sequences with high accuracy, yet we also reveal wide variation in the phenotypes encoded by incorrectly inferred sequences. Specifically, Bayesian methods incorporating rate variation significantly outperform the maximum parsimony criterion in phenotypic accuracy. Subsampling of extant sequences had minor effect on the inference of ancestral sequences. PMID:27628687
McGregor, Glenn B; Sendall, Barbara C
2015-02-01
Three populations of the freshwater filamentous cyanobacterium Lyngbya wollei (Farlow ex Gomont) Speziale and Dyck have been putatively identified from north-eastern Australia and found to produce the potent cyanotoxin cylindrospermopsin (CYN) and its analog deoxy-cylindrospermopsin (deoxy-CYN). We investigated the phylogeny and toxicology of strains and mats isolated from two of these populations using a combination of molecular and morphological techniques. Morphologically the strains corresponded to the type description, however, the frequency of false-branching was low, and variable over time. Strains and mat samples from both sites were positive for the cyrF and cyrJ genes associated with CYN biosynthesis. Phylogenetic analysis of these genes from Australian L. wollei sequences and comparable cyanobacterial sequences revealed that the genes in L. wollei were more closely related to homologous genes in Oscillatoria sp. PCC 6506 than to homologs in Nostocalean CYN-producers. These data suggest a common evolutionary origin of CYN biosynthesis in L. wollei and Oscillatoria. In both the 16S rRNA and nifH phylogenies, the Australian L. wollei strains formed well-supported clades with United States L. wollei (= Plectonema wollei) strains. Pair-wise sequence similarities within the 16S rRNA clade containing all eleven L. wollei strains were high, ranging from 97% to 100%. This group was distantly related (<92% nucleotide similarity) to other taxa within the group previously considered under the genus Lyngbya sensu lato (C. Agardh ex Gomont). Collectively, these results suggest that this toxigenic group is evolutionarily distinct and sufficiently distant as to be considered a separate genus, which we have described as Microseira gen. nov. and hence transfer to it the type M. wollei comb. nov. © 2014 State of Queensland. Journal of Phycology © 2014 Phycological Society of America.
Panzera, Alejandra; Leaché, Adam D; D'Elía, Guillermo; Victoriano, Pedro F
2017-01-01
The genus Liolaemus is one of the most ecologically diverse and species-rich genera of lizards worldwide. It currently includes more than 250 recognized species, which have been subject to many ecological and evolutionary studies. Nevertheless, Liolaemus lizards have a complex taxonomic history, mainly due to the incongruence between morphological and genetic data, incomplete taxon sampling, incomplete lineage sorting and hybridization. In addition, as many species have restricted and remote distributions, this has hampered their examination and inclusion in molecular systematic studies. The aims of this study are to infer a robust phylogeny for a subsample of lizards representing the Chilean clade (subgenus Liolaemus sensu stricto ), and to test the monophyly of several of the major species groups. We use a phylogenomic approach, targeting 541 ultra-conserved elements (UCEs) and 44 protein-coding genes for 16 taxa. We conduct a comparison of phylogenetic analyses using maximum-likelihood and several species tree inference methods. The UCEs provide stronger support for phylogenetic relationships compared to the protein-coding genes; however, the UCEs outnumber the protein-coding genes by 10-fold. On average, the protein-coding genes contain over twice the number of informative sites. Based on our phylogenomic analyses, all the groups sampled are polyphyletic. Liolaemus tenuis tenuis is difficult to place in the phylogeny, because only a few loci (nine) were recovered for this species. Topologies or support values did not change dramatically upon exclusion of L. t. tenuis from analyses, suggesting that missing data did not had a significant impact on phylogenetic inference in this data set. The phylogenomic analyses provide strong support for sister group relationships between L. fuscus , L. monticola , L. nigroviridis and L. nitidus , and L. platei and L. velosoi . Despite our limited taxon sampling, we have provided a reliable starting hypothesis for the relationships among many major groups of the Chilean clade of Liolaemus that will help future work aimed at resolving the Liolaemus phylogeny.
A reassessment of the emergence time of European bat lyssavirus type 1.
Hughes, Gareth J
2008-12-01
The previous study of the evolutionary rates of European bat lyssavirus type 1 (EBLV-1) used a strict molecular clock to estimate substitution rates of the nucleoprotein gene and in turn times of the most recent common ancestor (tMRCA) of the entire genotype and the two major EBLV-1 lineages (EBLV-1A and EBLV-1B). The results of that study suggested that the evolutionary rate of EBLV-1 was one of the lowest recorded for RNA viruses and that genetic diversity of EBLV-1 arose 500-750 years ago. Here I have shown that the use of a relaxed molecular clock (allowing branch rates to vary within a phylogeny) shows that these previous estimates should be revised. The relaxed clock provides a significantly better fit to all datasets. The substitution rate of EBLV-1B is compatible to that expected given previous estimates for the N gene of rabies virus whilst rate estimations for EBLV-1A appear to be confounded by substantial rate variation within the phylogeny. The relaxed clock substitution rate for EBLV-1 (1.1 x 10(-4)) is higher than had been estimated previously, and closer to that expected for the N gene. Moreover, tMRCA estimates for EBLV-1 are substantially reduced using the relaxed molecular clock (70-300 years) although the differing dynamics of EBLV-1A and EBLV-1B confound the confidence in this estimate. Current diversity of both EBLV-1A and EBLV-1B appears to have emerged within the last 100 years. Reconstruction of the population histories suggests that EBLV-1B may be emerging whilst the signal derived from the EBLV-1A phylogeny may be dampened by clade-specific dynamics.
Quest for Orthologs Entails Quest for Tree of Life: In Search of the Gene Stream
Boeckmann, Brigitte; Marcet-Houben, Marina; Rees, Jonathan A.; Forslund, Kristoffer; Huerta-Cepas, Jaime; Muffato, Matthieu; Yilmaz, Pelin; Xenarios, Ioannis; Bork, Peer; Lewis, Suzanna E.; Gabaldón, Toni
2015-01-01
Quest for Orthologs (QfO) is a community effort with the goal to improve and benchmark orthology predictions. As quality assessment assumes prior knowledge on species phylogenies, we investigated the congruency between existing species trees by comparing the relationships of 147 QfO reference organisms from six Tree of Life (ToL)/species tree projects: The National Center for Biotechnology Information (NCBI) taxonomy, Opentree of Life, the sequenced species/species ToL, the 16S ribosomal RNA (rRNA) database, and trees published by Ciccarelli et al. (Ciccarelli FD, et al. 2006. Toward automatic reconstruction of a highly resolved tree of life. Science 311:1283–1287) and by Huerta-Cepas et al. (Huerta-Cepas J, Marcet-Houben M, Gabaldon T. 2014. A nested phylogenetic reconstruction approach provides scalable resolution in the eukaryotic Tree Of Life. PeerJ PrePrints 2:223) Our study reveals that each species tree suggests a different phylogeny: 87 of the 146 (60%) possible splits of a dichotomous and rooted tree are congruent, while all other splits are incongruent in at least one of the species trees. Topological differences are observed not only at deep speciation events, but also within younger clades, such as Hominidae, Rodentia, Laurasiatheria, or rosids. The evolutionary relationships of 27 archaea and bacteria are highly inconsistent. By assessing 458,108 gene trees from 65 genomes, we show that consistent species topologies are more often supported by gene phylogenies than contradicting ones. The largest concordant species tree includes 77 of the QfO reference organisms at the most. Results are summarized in the form of a consensus ToL (http://swisstree.vital-it.ch/species_tree) that can serve different benchmarking purposes. PMID:26133389
Zhang, Bin; He, Kai; Wan, Tao; Chen, Peng; Sun, Guozheng; Liu, Shaoying; Nguyen, Truong Son; Lin, Liangkong; Jiang, Xuelong
2016-12-01
Niviventer is a genus of white-bellied rats that are among the most common rodents in the Indo-Sundaic region. The taxonomy of the genus has undergone extensive revisions and remains controversial. The current phylogeny is unresolved and was developed primarily on the basis of mitochondrial genes. Identification is extremely difficult, and a large number of GenBank sequences seem to be problematic. We extensively sampled specimens of Niviventer in China and neighboring northern Vietnam, including topotypes of the most reported species (n = 6), subspecies (n = 8), and synonyms (n = 4). We estimated phylogenetic relationships on the basis of one mitochondrial and three nuclear genes, using concatenation and coalescent-based approaches. We also employed molecular species delimitation approaches to test the existence of cryptic and putative new species. Our phylogeny was finely resolved, especially for the N. confucianus-like species. Our data provided the first support for N. brahma and N. eha as sister species, an assignment that is congruent with their morphological similarities. Species delimitation analyses provided new insight into species diversity and systematics. Three geographic populations of N. confucianus and one of N. fulvescens were supported as genetically distinct in our species delimitation analyses, while three recognized species (N. coninga, N. huang, and N. lotipes) were not strongly supported as distinct. Our results suggested that several genetically distinct species may be contained within the species currently known as N. confucianus and N. fulvescens. In addition, the results of Bayesian Phylogenetics and Phylogeography (BPP) for N. coninga, N. huang, and N. lotipes indicated that either inter-specific gene flow had occurred or imperfect taxonomy was present. Morphological examinations and morphometric analyses are warranted to examine the molecular results.
Multigene phylogeny and taxonomic revision of yeasts and related fungi in the Ustilaginomycotina.
Wang, Q-M; Begerow, D; Groenewald, M; Liu, X-Z; Theelen, B; Bai, F-Y; Boekhout, T
2015-06-01
The subphylum Ustilaginomycotina (Basidiomycota, Fungi) comprises mainly plant pathogenic fungi (smuts). Some of the lineages possess cultivable unicellular stages that are usually classified as yeast or yeast-like species in a largely artificial taxonomic system which is independent from and largely incompatible with that of the smut fungi. Here we performed phylogenetic analyses based on seven genes including three nuclear ribosomal RNA genes and four protein coding genes to address the molecular phylogeny of the ustilaginomycetous yeast species and their filamentous counterparts. Taxonomic revisions were proposed to reflect this phylogeny and to implement the 'One Fungus = One Name' principle. The results confirmed that the yeast-containing classes Malasseziomycetes, Moniliellomycetes and Ustilaginomycetes are monophyletic, whereas Exobasidiomycetes in the current sense remains paraphyletic. Four new genera, namely Dirkmeia gen. nov., Kalmanozyma gen. nov., Golubevia gen. nov. and Robbauera gen. nov. are proposed to accommodate Pseudozyma and Tilletiopsis species that are distinct from the other smut taxa and belong to clades that are separate from those containing type species of the hitherto described genera. Accordingly, new orders Golubeviales ord. nov. with Golubeviaceae fam. nov. and Robbauerales ord. nov. with Robbaueraceae fam. nov. are proposed to accommodate the sisterhood of Golubevia gen. nov. and Robbauera gen. nov. with other orders of Exobasidiomycetes. The majority of the remaining anamorphic yeast species are transferred to corresponding teleomorphic genera based on strongly supported phylogenetic affinities, resulting in the proposal of 28 new combinations. The taxonomic status of a few Pseudozyma species remains to be determined because of their uncertain phylogenetic positions. We propose to use the term pro tempore or pro tem. in abbreviation to indicate the single-species lineages that are temporarily maintained.
Homology and phylogeny and their automated inference
NASA Astrophysics Data System (ADS)
Fuellen, Georg
2008-06-01
The analysis of the ever-increasing amount of biological and biomedical data can be pushed forward by comparing the data within and among species. For example, an integrative analysis of data from the genome sequencing projects for various species traces the evolution of the genomes and identifies conserved and innovative parts. Here, I review the foundations and advantages of this “historical” approach and evaluate recent attempts at automating such analyses. Biological data is comparable if a common origin exists (homology), as is the case for members of a gene family originating via duplication of an ancestral gene. If the family has relatives in other species, we can assume that the ancestral gene was present in the ancestral species from which all the other species evolved. In particular, describing the relationships among the duplicated biological sequences found in the various species is often possible by a phylogeny, which is more informative than homology statements. Detecting and elaborating on common origins may answer how certain biological sequences developed, and predict what sequences are in a particular species and what their function is. Such knowledge transfer from sequences in one species to the homologous sequences of the other is based on the principle of ‘my closest relative looks and behaves like I do’, often referred to as ‘guilt by association’. To enable knowledge transfer on a large scale, several automated ‘phylogenomics pipelines’ have been developed in recent years, and seven of these will be described and compared. Overall, the examples in this review demonstrate that homology and phylogeny analyses, done on a large (and automated) scale, can give insights into function in biology and biomedicine.
Steinke, Dirk; Salzburger, Walter; Meyer, Axel
2006-06-01
The power of comparative phylogenomic analyses also depends on the amount of data that are included in such studies. We used expressed sequence tags (ESTs) from fish model species as a proof of principle approach in order to test the reliability of using ESTs for phylogenetic inference. As expected, the robustness increases with the amount of sequences. Although some progress has been made in the elucidation of the phylogeny of teleosts, relationships among the main lineages of the derived fish (Euteleostei) remain poorly defined and are still debated. We performed a phylogenomic analysis of a set of 42 of orthologous genes from 10 available fish model systems from seven different orders (Salmoniformes, Siluriformes, Cypriniformes, Tetraodontiformes, Cyprinodontiformes, Beloniformes, and Perciformes) of euteleostean fish to estimate divergence times and evolutionary relationships among those lineages. All 10 fish species serve as models for developmental, aquaculture, genomic, and comparative genetic studies. The phylogenetic signal and the strength of the contribution of each of the 42 orthologous genes were estimated with randomly chosen data subsets. Our study revealed a molecular phylogeny of higher-level relationships of derived teleosts, which indicates that the use of multiple genes produces robust phylogenies, a finding that is expected to apply to other phylogenetic issues among distantly related taxa. Our phylogenomic analyses confirm that the euteleostean superorders Ostariophysi and Acanthopterygii are monophyletic and the Protacanthopterygii and Ostariophysi are sister clades. In addition, and contrary to the traditional phylogenetic hypothesis, our analyses determine that killifish (Cyprinodontiformes), medaka (Beloniformes), and cichlids (Perciformes) appear to be more closely related to each other than either of them is to pufferfish (Tetraodontiformes). All 10 lineages split before or during the fragmentation of the supercontinent Pangea in the Jurassic.
Nozaki, Hisayoshi; Yang, Yi; Maruyama, Shinichiro; Suzaki, Toshinobu
2012-01-01
Recent multigene phylogenetic analyses have contributed much to our understanding of eukaryotic phylogeny. However, the phylogenetic positions of various lineages within the eukaryotes have remained unresolved or in conflict between different phylogenetic studies. These phylogenetic ambiguities might have resulted from mixtures or integration from various factors including limited taxon sampling, missing data in the alignment, saturations of rapidly evolving genes, mixed analyses of short- and long-branched operational taxonomic units (OTUs), intracellular endoparasite and ciliate OTUs with unusual substitution etc. In order to evaluate the effects from intracellular endoparasite and ciliate OTUs co-analyzed on the eukaryotic phylogeny and simplify the results, we here used two different sets of data matrices of multiple slowly evolving genes with small amounts of missing data and examined the phylogenetic position of the secondary photosynthetic chromalveolates Haptophyta, one of the most abundant groups of oceanic phytoplankton and significant primary producers. In both sets, a robust sister relationship between Haptophyta and SAR (stramenopiles, alveolates, rhizarians, or SA [stramenopiles and alveolates]) was resolved when intracellular endoparasite/ciliate OTUs were excluded, but not in their presence. Based on comparisons of character optimizations on a fixed tree (with a clade composed of haptophytes and SAR or SA), disruption of the monophyly between haptophytes and SAR (or SA) in the presence of intracellular endoparasite/ciliate OTUs can be considered to be a result of multiple evolutionary reversals of character positions that supported the synapomorphy of the haptophyte and SAR (or SA) clade in the absence of intracellular endoparasite/ciliate OTUs.
Hwang, Jung Shan; Takaku, Yasuharu; Momose, Tsuyoshi; Adamczyk, Patrizia; Özbek, Suat; Ikeo, Kazuho; Khalturin, Konstantin; Hemmrich, Georg; Bosch, Thomas C. G.; Holstein, Thomas W.; David, Charles N.; Gojobori, Takashi
2010-01-01
Taxonomically restricted genes or lineage-specific genes contribute to morphological diversification in metazoans and provide unique functions for particular taxa in adapting to specific environments. To understand how such genes arise and participate in morphological evolution, we have investigated a gene called nematogalectin in Hydra, which has a structural role in the formation of nematocysts, stinging organelles that are unique to the phylum Cnidaria. Nematogalectin is a 28-kDa protein with an N-terminal GlyXY domain (glycine followed by two hydrophobic amino acids), which can form a collagen triple helix, followed by a galactose-binding lectin domain. Alternative splicing of the nematogalectin transcript allows the gene to encode two proteins, nematogalectin A and nematogalectin B. We demonstrate that expression of nematogalectin A and B is mutually exclusive in different nematocyst types: Desmonemes express nematogalectin B, whereas stenoteles and isorhizas express nematogalectin B early in differentiation, followed by nematogalectin A. Like Hydra, the marine hydrozoan Clytia also has two nematogalectin transcripts, which are expressed in different nematocyte types. By comparison, anthozoans have only one nematogalectin gene. Gene phylogeny indicates that tandem duplication of nematogalectin B exons gave rise to nematogalectin A before the divergence of Anthozoa and Medusozoa and that nematogalectin A was subsequently lost in Anthozoa. The emergence of nematogalectin A may have played a role in the morphological diversification of nematocysts in the medusozoan lineage. PMID:20937891
Greenwold, Matthew J; Sawyer, Roger H
2013-09-01
The archosauria consist of two living groups, crocodilians, and birds. Here we compare the structure, expression, and phylogeny of the beta (β)-keratins in two crocodilian genomes and two avian genomes to gain a better understanding of the evolutionary origin of the feather β-keratins. Unlike squamates such as the green anole with 40 β-keratins in its genome, the chicken and zebra finch genomes have over 100 β-keratin genes in their genomes, while the American alligator has 20 β-keratin genes, and the saltwater crocodile has 21 β-keratin genes. The crocodilian β-keratins are similar to those of birds and these structural proteins have a central filament domain and N- and C-termini, which contribute to the matrix material between the twisted β-sheets, which form the 2-3 nm filament. Overall the expression of alligator β-keratin genes in the integument increases during development. Phylogenetic analysis demonstrates that a crocodilian β-keratin clade forms a monophyletic group with the avian scale and feather β-keratins, suggesting that avian scale and feather β-keratins along with a subset of crocodilian β-keratins evolved from a common ancestral gene/s. Overall, our analyses support the view that the epidermal appendages of basal archosaurs used a diverse array of β-keratins, which evolved into crocodilian and avian specific clades. In birds, the scale and feather subfamilies appear to have evolved independently in the avian lineage from a subset of archosaurian claw β-keratins. The expansion of the avian specific feather β-keratin genes accompanied the diversification of birds and the evolution of feathers. Copyright © 2013 Wiley Periodicals, Inc.
Zhi, Wei; Ge, Zheng; He, Zhen; Zhang, Husen
2014-11-01
Microbial fuel cells (MFCs) employ microorganisms to recover electric energy from organic matter. However, fundamental knowledge of electrochemically active bacteria is still required to maximize MFCs power output for practical applications. This review presents microbiological and electrochemical techniques to help researchers choose the appropriate methods for the MFCs study. Pre-genomic and genomic techniques such as 16S rRNA based phylogeny and metagenomics have provided important information in the structure and genetic potential of electrode-colonizing microbial communities. Post-genomic techniques such as metatranscriptomics allow functional characterizations of electrode biofilm communities by quantifying gene expression levels. Isotope-assisted phylogenetic analysis can further link taxonomic information to microbial metabolisms. A combination of electrochemical, phylogenetic, metagenomic, and post-metagenomic techniques offers opportunities to a better understanding of the extracellular electron transfer process, which in turn can lead to process optimization for power output. Copyright © 2014 Elsevier Ltd. All rights reserved.
2012-01-01
Background GDSL esterases/lipases are a newly discovered subclass of lipolytic enzymes that are very important and attractive research subjects because of their multifunctional properties, such as broad substrate specificity and regiospecificity. Compared with the current knowledge regarding these enzymes in bacteria, our understanding of the plant GDSL enzymes is very limited, although the GDSL gene family in plant species include numerous members in many fully sequenced plant genomes. Only two genes from a large rice GDSL esterase/lipase gene family were previously characterised, and the majority of the members remain unknown. In the present study, we describe the rice OsGELP (Oryza sativa GDSL esterase/lipase protein) gene family at the genomic and proteomic levels, and use this knowledge to provide insights into the multifunctionality of the rice OsGELP enzymes. Results In this study, an extensive bioinformatics analysis identified 114 genes in the rice OsGELP gene family. A complete overview of this family in rice is presented, including the chromosome locations, gene structures, phylogeny, and protein motifs. Among the OsGELPs and the plant GDSL esterase/lipase proteins of known functions, 41 motifs were found that represent the core secondary structure elements or appear specifically in different phylogenetic subclades. The specification and distribution of identified putative conserved clade-common and -specific peptide motifs, and their location on the predicted protein three dimensional structure may possibly signify their functional roles. Potentially important regions for substrate specificity are highlighted, in accordance with protein three-dimensional model and location of the phylogenetic specific conserved motifs. The differential expression of some representative genes were confirmed by quantitative real-time PCR. The phylogenetic analysis, together with protein motif architectures, and the expression profiling were analysed to predict the possible biological functions of the rice OsGELP genes. Conclusions Our current genomic analysis, for the first time, presents fundamental information on the organization of the rice OsGELP gene family. With combination of the genomic, phylogenetic, microarray expression, protein motif distribution, and protein structure analyses, we were able to create supported basis for the functional prediction of many members in the rice GDSL esterase/lipase family. The present study provides a platform for the selection of candidate genes for further detailed functional study. PMID:22793791
Liu, Nai-Yong; Xu, Wei; Dong, Shuang-Lin; Zhu, Jia-Ying; Xu, Yu-Xing; Anderson, Alisha
2018-05-22
The functions of the Ionotropic Receptor (IR) family have been well studied in Drosophila melanogaster, but only limited information is available in Lepidoptera. Here, we conducted a large-scale genome-wide analysis of the IR gene repertoire in 13 moths and 16 butterflies. Combining a homology-based approach and manual efforts, totally 996 IR candidates are identified including 31 pseudogenes and 825 full-length sequences, representing the most current comprehensive annotation in lepidopteran species. The phylogeny, expression and sequence characteristics classify Lepidoptera IRs into three sub-families: antennal IRs (A-IRs), divergent IRs (D-IRs) and Lepidoptera-specific IRs (LS-IRs), which is distinct from the case of Drosophila IRs. In comparison to LS-IRs and D-IRs, A-IRs members share a higher degree of protein identity and are distinguished into 16 orthologous groups in the phylogeny, showing conservation of gene structure. Analysis of selective forces on 27 orthologous groups reveals that these lepidopteran IRs have evolved under strong purifying selection (dN/dS≪1). Most notably, lineage-specific gene duplications that contribute primarily to gene number variations across Lepidoptera not only exist in D-IRs, but are present in the two other sub-families including members of IR41a, 76b, 87a, 100a and 100b. Expression profiling analysis reveals that over 80% (21/26) of Helicoverpa armigera A-IRs are expressed more highly in antennae of adults or larvae than other tissues, consistent with its proposed function in olfaction. However, some are also detected in taste organs like proboscises and legs. These results suggest that some A-IRs in H. armigera likely bear a dual function with their involvement in olfaction and gustation. Results from mating experiments show that two HarmIRs (IR1.2 and IR75d) expression is significantly up-regulated in antennae of mated female moths. However, no expression difference is observed between unmated female and male adults, suggesting an association with female host-searching behaviors. Our current study has greatly extended the IR gene repertoire resource in Lepidoptera, and more importantly, identifies potential IR candidates for olfactory, gustatory and oviposition behaviors in the cotton bollworm. Crown Copyright © 2018. Published by Elsevier Ltd. All rights reserved.
Pan-genome and phylogeny of Bacillus cereus sensu lato.
Bazinet, Adam L
2017-08-02
Bacillus cereus sensu lato (s. l.) is an ecologically diverse bacterial group of medical and agricultural significance. In this study, I use publicly available genomes and novel bioinformatic workflows to characterize the B. cereus s. l. pan-genome and perform the largest phylogenetic and population genetic analyses of this group to date in terms of the number of genes and taxa included. With these fundamental data in hand, I identify genes associated with particular phenotypic traits (i.e., "pan-GWAS" analysis), and quantify the degree to which taxa sharing common attributes are phylogenetically clustered. A rapid k-mer based approach (Mash) was used to create reduced representations of selected Bacillus genomes, and a fast distance-based phylogenetic analysis of this data (FastME) was performed to determine which species should be included in B. cereus s. l. The complete genomes of eight B. cereus s. l. species were annotated de novo with Prokka, and these annotations were used by Roary to produce the B. cereus s. l. pan-genome. Scoary was used to associate gene presence and absence patterns with various phenotypes. The orthologous protein sequence clusters produced by Roary were filtered and used to build HaMStR databases of gene models that were used in turn to construct phylogenetic data matrices. Phylogenetic analyses used RAxML, DendroPy, ClonalFrameML, PAUP*, and SplitsTree. Bayesian model-based population genetic analysis assigned taxa to clusters using hierBAPS. The genealogical sorting index was used to quantify the phylogenetic clustering of taxa sharing common attributes. The B. cereus s. l. pan-genome currently consists of ≈60,000 genes, ≈600 of which are "core" (common to at least 99% of taxa sampled). Pan-GWAS analysis revealed genes associated with phenotypes such as isolation source, oxygen requirement, and ability to cause diseases such as anthrax or food poisoning. Extensive phylogenetic analyses using an unprecedented amount of data produced phylogenies that were largely concordant with each other and with previous studies. Phylogenetic support as measured by bootstrap probabilities increased markedly when all suitable pan-genome data was included in phylogenetic analyses, as opposed to when only core genes were used. Bayesian population genetic analysis recommended subdividing the three major clades of B. cereus s. l. into nine clusters. Taxa sharing common traits and species designations exhibited varying degrees of phylogenetic clustering. All phylogenetic analyses recapitulated two previously used classification systems, and taxa were consistently assigned to the same major clade and group. By including accessory genes from the pan-genome in the phylogenetic analyses, I produced an exceptionally well-supported phylogeny of 114 complete B. cereus s. l. genomes. The best-performing methods were used to produce a phylogeny of all 498 publicly available B. cereus s. l. genomes, which was in turn used to compare three different classification systems and to test the monophyly status of various B. cereus s. l. species. The majority of the methodology used in this study is generic and could be leveraged to produce pan-genome estimates and similarly robust phylogenetic hypotheses for other bacterial groups.
Segatto, Ana Lúcia Anversa; Thompson, Claudia Elizabeth; Freitas, Loreta Brandão
2016-01-01
Abstract Developmental genes are believed to contribute to major changes during plant evolution, from infrageneric to higher levels. Due to their putative high sequence conservation, developmental genes are rarely used as molecular markers, and few studies including these sequences at low taxonomic levels exist. WUSCHEL-related homeobox genes (WOX) are transcription factors exclusively present in plants and are involved in developmental processes. In this study, we characterized the infrageneric genetic variation of Petunia WOX genes. We obtained phylogenetic relationships consistent with other phylogenies based on nuclear markers, but with higher statistical support, resolution in terminals, and compatibility with flower morphological changes. PMID:27768156
Phylogenetic congruence between subtropical trees and their associated fungi.
Liu, Xubing; Liang, Minxia; Etienne, Rampal S; Gilbert, Gregory S; Yu, Shixiao
2016-12-01
Recent studies have detected phylogenetic signals in pathogen-host networks for both soil-borne and leaf-infecting fungi, suggesting that pathogenic fungi may track or coevolve with their preferred hosts. However, a phylogenetically concordant relationship between multiple hosts and multiple fungi in has rarely been investigated. Using next-generation high-throughput DNA sequencing techniques, we analyzed fungal taxa associated with diseased leaves, rotten seeds, and infected seedlings of subtropical trees. We compared the topologies of the phylogenetic trees of the soil and foliar fungi based on the internal transcribed spacer (ITS) region with the phylogeny of host tree species based on matK , rbcL , atpB, and 5.8S genes. We identified 37 foliar and 103 soil pathogenic fungi belonging to the Ascomycota and Basidiomycota phyla and detected significantly nonrandom host-fungus combinations, which clustered on both the fungus phylogeny and the host phylogeny. The explicit evidence of congruent phylogenies between tree hosts and their potential fungal pathogens suggests either diffuse coevolution among the plant-fungal interaction networks or that the distribution of fungal species tracked spatially associated hosts with phylogenetically conserved traits and habitat preferences. Phylogenetic conservatism in plant-fungal interactions within a local community promotes host and parasite specificity, which is integral to the important role of fungi in promoting species coexistence and maintaining biodiversity of forest communities.
Shajitha, P P; Dhanesh, N R; Ebin, P J; Laly, Joseph; Aneesha, Devassy; Reshma, John; Augustine, Jomy; Linu, Mathew
2016-12-01
Only a few Impatiens spp. from South India (one of the five centers of diversity for Impatiens species) were included in the published datum of molecular phylogeny of the family Balsaminaceae. The present investigation is a novel attempt to reveal the phylogenetic association of Impatiens species of South India, by placing them in the global phylogeny of Impatiens based on a combined analysis of two chloroplast genes. Thirty species of genus Impatiens were collected from different locations of South India. Total genomic DNA was extracted from fresh plant leaf, and polymerase chain reaction was carried out using atpB-rbcL and trnL-F intergenic spacer-specific forward and reverse primers. Thirteen sequences of Impatiens species from three centers of diversity were obtained from GenBank for reconstructing the evolutionary relationships within the genus Impatiens. Bayesian inference analysis was carried out in MrBayes v.3.2.2. This analysis supported Southeast Asia as the ancestral place of origin of extant Impatiens species. Molecular phylogeny of South Indian Impatiens spp. based on combined chloroplast sequences showed the same association as that of morphological taxonomy. Sections Scapigerae, Tomentosae, Sub-Umbellatae, and Racemosae showed Southeast Asian relationship, while sections Annuae and Microsepalae showed African affinity.
Kumar, S; Gadagkar, S R
2000-12-01
The neighbor-joining (NJ) method is widely used in reconstructing large phylogenies because of its computational speed and the high accuracy in phylogenetic inference as revealed in computer simulation studies. However, most computer simulation studies have quantified the overall performance of the NJ method in terms of the percentage of branches inferred correctly or the percentage of replications in which the correct tree is recovered. We have examined other aspects of its performance, such as the relative efficiency in correctly reconstructing shallow (close to the external branches of the tree) and deep branches in large phylogenies; the contribution of zero-length branches to topological errors in the inferred trees; and the influence of increasing the tree size (number of sequences), evolutionary rate, and sequence length on the efficiency of the NJ method. Results show that the correct reconstruction of deep branches is no more difficult than that of shallower branches. The presence of zero-length branches in realized trees contributes significantly to the overall error observed in the NJ tree, especially in large phylogenies or slowly evolving genes. Furthermore, the tree size does not influence the efficiency of NJ in reconstructing shallow and deep branches in our simulation study, in which the evolutionary process is assumed to be homogeneous in all lineages.
Molecular Phylogeny of Heme Peroxidases
NASA Astrophysics Data System (ADS)
Zámocký, Marcel; Obinger, Christian
All currently available gene sequences of heme peroxidases can be phylogenetically divided in two superfamilies and three families. In this chapter, the phylogenetics and genomic distribution of each group are presented. Within the peroxidase-cyclooxygenase superfamily, the main evolutionary direction developed peroxidatic heme proteins involved in the innate immune defense system and in biosynthesis of (iodinated) hormones. The peroxidase-catalase superfamily is widely spread mainly among bacteria, fungi, and plants, and particularly in Class I led to the evolution of bifunctional catalase-peroxidases. Its numerous fungal representatives of Class II are involved in carbon recycling via lignin degradation, whereas Class III secretory peroxidases from algae and plants are included in various forms of secondary metabolism. The family of di-heme peroxidases are predominantly bacteria-inducible enzymes; however, a few corresponding genes were also detected in archaeal genomes. Four subfamilies of dyp-type peroxidases capable of degradation of various xenobiotics are abundant mainly among bacteria and fungi. Heme-haloperoxidase genes are widely spread among sac and club fungi, but corresponding genes were recently found also among oomycetes. All described families herein represent heme peroxidases of broad diversity in structure and function. Our accumulating knowledge about the evolution of various enzymatic functions and physiological roles can be exploited in future directed evolution approaches for engineering peroxidase genes de novo for various demands.
Phylogeny-dominant classification of J-proteins in Arabidopsis thaliana and Brassica oleracea.
Zhang, Bin; Qiu, Han-Lin; Qu, Dong-Hai; Ruan, Ying; Chen, Dong-Hong
2018-04-05
Hsp40s or DnaJ/J-proteins are evolutionarily conserved in all organisms as co-chaperones of molecular chaperone HSP70s that mainly participate in maintaining cellular protein homeostasis, such as protein folding, assembly, stabilization, and translocation under normal conditions as well as refolding and degradation under environmental stresses. It has been reported that Arabidopsis J-proteins are classified into four classes (types A-D) according to domain organization, but their phylogenetic relationships are unknown. Here, we identified 129 J-proteins in the world-wide popular vegetable Brassica oleracea, a close relative of the model plant Arabidopsis, and also revised the information of Arabidopsis J-proteins based on the latest online bioresources. According to phylogenetic analysis with domain organization and gene structure as references, the J-proteins from Arabidopsis and B. oleracea were classified into 15 main clades (I-XV) separated by a number of undefined small branches with remote relationship. Based on the number of members, they respectively belong to multigene clades, oligo-gene clades, and mono-gene clades. The J-protein genes from different clades may function together or separately to constitute a complicated regulatory network. This study provides a constructive viewpoint for J-protein classification and an informative platform for further functional dissection and resistant genes discovery related to genetic improvement of crop plants.
Joint amalgamation of most parsimonious reconciled gene trees
Scornavacca, Celine; Jacox, Edwin; Szöllősi, Gergely J.
2015-01-01
Motivation: Traditionally, gene phylogenies have been reconstructed solely on the basis of molecular sequences; this, however, often does not provide enough information to distinguish between statistically equivalent relationships. To address this problem, several recent methods have incorporated information on the species phylogeny in gene tree reconstruction, leading to dramatic improvements in accuracy. Although probabilistic methods are able to estimate all model parameters but are computationally expensive, parsimony methods—generally computationally more efficient—require a prior estimate of parameters and of the statistical support. Results: Here, we present the Tree Estimation using Reconciliation (TERA) algorithm, a parsimony based, species tree aware method for gene tree reconstruction based on a scoring scheme combining duplication, transfer and loss costs with an estimate of the sequence likelihood. TERA explores all reconciled gene trees that can be amalgamated from a sample of gene trees. Using a large scale simulated dataset, we demonstrate that TERA achieves the same accuracy as the corresponding probabilistic method while being faster, and outperforms other parsimony-based methods in both accuracy and speed. Running TERA on a set of 1099 homologous gene families from complete cyanobacterial genomes, we find that incorporating knowledge of the species tree results in a two thirds reduction in the number of apparent transfer events. Availability and implementation: The algorithm is implemented in our program TERA, which is freely available from http://mbb.univ-montp2.fr/MBB/download_sources/16__TERA. Contact: celine.scornavacca@univ-montp2.fr, ssolo@angel.elte.hu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25380957
Recurrent invasion and extinction of a selfish gene.
Goddard, M R; Burt, A
1999-11-23
Homing endonuclease genes show super-Mendelian inheritance, which allows them to spread in populations even when they are of no benefit to the host organism. To test the idea that regular horizontal transmission is necessary for the long-term persistence of these genes, we surveyed 20 species of yeasts for the omega-homing endonuclease gene and associated group I intron. The status of omega could be categorized into three states (functional, nonfunctional, or absent), and status was not clustered on the host phylogeny. Moreover, the phylogeny of omega differed significantly from that of the host, strong evidence of horizontal transmission. Further analyses indicate that horizontal transmission is more common than transposition, and that it occurs preferentially between closely related species. Parsimony analysis and coalescent theory suggest that there have been 15 horizontal transmission events in the ancestry of our yeast species, through simulations indicate that this value is probably an underestimate. Overall, the data support a cyclical model of invasion, degeneration, and loss, followed by reinvasion, and each of these transitions is estimated to occur about once every 2 million years. The data are thus consistent with the idea that frequent horizontal transmission is necessary for the long-term persistence of homing endonuclease genes, and further, that this requirement limits these genes to organisms with easily accessible germ lines. The data also show that mitochondrial DNA sequences are transferred intact between yeast species; if other genes do not show such high levels of horizontal transmission, it would be due to lack of selection, rather than lack of opportunity.
MacLeod, Dave; Charlebois, Robert L; Doolittle, Ford; Bapteste, Eric
2005-01-01
Background When organismal phylogenies based on sequences of single marker genes are poorly resolved, a logical approach is to add more markers, on the assumption that weak but congruent phylogenetic signal will be reinforced in such multigene trees. Such approaches are valid only when the several markers indeed have identical phylogenies, an issue which many multigene methods (such as the use of concatenated gene sequences or the assembly of supertrees) do not directly address. Indeed, even when the true history is a mixture of vertical descent for some genes and lateral gene transfer (LGT) for others, such methods produce unique topologies. Results We have developed software that aims to extract evidence for vertical and lateral inheritance from a set of gene trees compared against an arbitrary reference tree. This evidence is then displayed as a synthesis showing support over the tree for vertical inheritance, overlaid with explicit lateral gene transfer (LGT) events inferred to have occurred over the history of the tree. Like splits-tree methods, one can thus identify nodes at which conflict occurs. Additionally one can make reasonable inferences about vertical and lateral signal, assigning putative donors and recipients. Conclusion A tool such as ours can serve to explore the reticulated dimensionality of molecular evolution, by dissecting vertical and lateral inheritance at high resolution. By this, we mean that individual nodes can be examined not only for congruence, but also for coherence in light of LGT. We assert that our tools will facilitate the comparison of phylogenetic trees, and the interpretation of conflicting data. PMID:15819979
Impact of recent molecular phylogenetic studies on classification of ascomycete yeasts
USDA-ARS?s Scientific Manuscript database
Analyses of concatenated gene sequences as well as whole genome sequences are resolving relationships among the ascomycete yeasts (Saccharomycotina), thus allowing classification of members of this subphylum to be based on phylogeny. In addition, changes implemented in the new Botanical Code [Intern...
Horizontal transfer of potential mobile units in phytoplasmas
Ku, Chuan; Lo, Wen-Sui; Kuo, Chih-Horng
2013-01-01
Phytoplasmas are uncultivated phytopathogenic bacteria that cause diseases in a wide range of economically important plants. Through secretion of effector proteins, they are able to manipulate their plant hosts to facilitate their multiplication and dispersal by insect vectors. The genome sequences of several phytoplasmas have been characterized to date and a group of putative composite transposons called potential mobile units (PMUs) are found in these highly reduced genomes. Recently, our team reported the genome sequence and comparative analysis of a peanut witches’ broom (PnWB) phytoplasma, the first representative of the phytoplasma 16SrII group. Comparisons between the species phylogeny and the phylogenies of the PMU genes revealed that the PnWB PMU is likely to have been transferred from the 16SrI group. This indicates that PMUs are not only the DNA unit for transposition within a genome, but also for horizontal transfer among divergent phytoplasma lineages. Given the association of PMUs with effector genes, the mobility of PMUs across genomes has important implications for phytoplasma ecology and evolution. PMID:24251068
Electing a candidate: a speculative history of the bacterial phylum OP10.
Dunfield, Peter F; Tamas, Ivica; Lee, Kevin C; Morgan, Xochitl C; McDonald, Ian R; Stott, Matthew B
2012-12-01
In 1998, a cultivation-independent survey of the microbial community in Obsidian Pool, Yellowstone National Park, detected 12 new phyla within the Domain Bacteria. These were dubbed 'candidate divisions' OP1 to OP12. Since that time the OP10 candidate division has been commonly detected in various environments, usually as part of the rare biosphere, but occasionally as a predominant community component. Based on 16S rRNA gene phylogeny, OP10 comprises at least 12 class-level subdivisions. However, despite this broad ecological and evolutionary diversity, all OP10 bacteria have eluded cultivation until recently. In 2011, two reference species of OP10 were taxonomically validated, removing the phylum from its 'candidate' status. Construction of a highly resolved phylogeny based on 29 universally conserved genes verifies its standing as a unique bacterial phylum. In the following paper we summarize what is known and what is suspected about the newest described bacterial phylum, the Armatimonadetes. © 2012 Society for Applied Microbiology and Blackwell Publishing Ltd.
Horizontal transfer of potential mobile units in phytoplasmas.
Ku, Chuan; Lo, Wen-Sui; Kuo, Chih-Horng
2013-09-01
Phytoplasmas are uncultivated phytopathogenic bacteria that cause diseases in a wide range of economically important plants. Through secretion of effector proteins, they are able to manipulate their plant hosts to facilitate their multiplication and dispersal by insect vectors. The genome sequences of several phytoplasmas have been characterized to date and a group of putative composite transposons called potential mobile units (PMUs) are found in these highly reduced genomes. Recently, our team reported the genome sequence and comparative analysis of a peanut witches' broom (PnWB) phytoplasma, the first representative of the phytoplasma 16SrII group. Comparisons between the species phylogeny and the phylogenies of the PMU genes revealed that the PnWB PMU is likely to have been transferred from the 16SrI group. This indicates that PMUs are not only the DNA unit for transposition within a genome, but also for horizontal transfer among divergent phytoplasma lineages. Given the association of PMUs with effector genes, the mobility of PMUs across genomes has important implications for phytoplasma ecology and evolution.
A Unique Box in 28S rRNA Is Shared by the Enigmatic Insect Order Zoraptera and Dictyoptera
Dang, Kai; Wu, Haoyang; Wang, Ying; Xie, Qiang; Bu, Wenjun
2013-01-01
The position of the Zoraptera remains one of the most challenging and uncertain concerns in ordinal-level phylogenies of the insects. Zoraptera have been viewed as having a close relationship with five different groups of Polyneoptera, or as being allied to the Paraneoptera or even Holometabola. Although rDNAs have been widely used in phylogenetic studies of insects, the application of the complete 28S rDNA are still scattered in only a few orders. In this study, a secondary structure model of the complete 28S rRNAs of insects was reconstructed based on all orders of Insecta. It was found that one length-variable region, D3-4, is particularly distinctive. The length and/or sequence of D3-4 is conservative within each order of Polyneoptera, but it can be divided into two types between the different orders of the supercohort, of which the enigmatic order Zoraptera and Dictyoptera share one type, while the remaining orders of Polyneoptera share the other. Additionally, independent evidence from phylogenetic results support the clade (Zoraptera+Dictyoptera) as well. Thus, the similarity of D3-4 between Zoraptera and Dictyoptera can serve as potentially valuable autapomorphy or synapomorphy in phylogeny reconstruction. The clades of (Plecoptera+Dermaptera) and ((Grylloblattodea+Mantophasmatodea)+(Embiodea+Phasmatodea)) were also recovered in the phylogenetic study. In addition, considering the other studies based on rDNAs, this study reached the highest congruence with previous phylogenetic studies of Holometabola based on nuclear protein coding genes or morphology characters. Future comparative studies of secondary structures across deep divergences and additional taxa are likely to reveal conserved patterns, structures and motifs that can provide support for major phylogenetic lineages. PMID:23301099
Deep phylogeny, ancestral groups and the four ages of life
Cavalier-Smith, Thomas
2010-01-01
Organismal phylogeny depends on cell division, stasis, mutational divergence, cell mergers (by sex or symbiogenesis), lateral gene transfer and death. The tree of life is a useful metaphor for organismal genealogical history provided we recognize that branches sometimes fuse. Hennigian cladistics emphasizes only lineage splitting, ignoring most other major phylogenetic processes. Though methodologically useful it has been conceptually confusing and harmed taxonomy, especially in mistakenly opposing ancestral (paraphyletic) taxa. The history of life involved about 10 really major innovations in cell structure. In membrane topology, there were five successive kinds of cell: (i) negibacteria, with two bounding membranes, (ii) unibacteria, with one bounding and no internal membranes, (iii) eukaryotes with endomembranes and mitochondria, (iv) plants with chloroplasts and (v) finally, chromists with plastids inside the rough endoplasmic reticulum. Membrane chemistry divides negibacteria into the more advanced Glycobacteria (e.g. Cyanobacteria and Proteobacteria) with outer membrane lipolysaccharide and primitive Eobacteria without lipopolysaccharide (deserving intenser study). It also divides unibacteria into posibacteria, ancestors of eukaryotes, and archaebacteria—the sisters (not ancestors) of eukaryotes and the youngest bacterial phylum. Anaerobic eobacteria, oxygenic cyanobacteria, desiccation-resistant posibacteria and finally neomura (eukaryotes plus archaebacteria) successively transformed Earth. Accidents and organizational constraints are as important as adaptiveness in body plan evolution. PMID:20008390
2017-01-01
Although oral dental tissue is a vertebrate attribute, trunk dental tissue evolved in several extinct vertebrate lineages but is rare among living species. The question of which processes trigger dental-tissue formation in the trunk remains open, and would shed light on odontogenesis evolution. Extra-oral dental structures (odontodes) in the trunk are associated with underlying dermal bony plates, leading us to ask whether the formation of trunk bony plates is necessary for trunk odontodes to emerge. To address this question, we focus on Loricarioidei: an extant, highly diverse group of catfish whose species all have odontodes. We examined the location and cover of odontodes and trunk dermal bony plates for all six loricarioid families and 17 non-loricarioid catfish families for comparison. We inferred the phylogeny of Loricarioidei using a new 10-gene dataset, eight time-calibration points, and noise-reduction techniques. Based on this phylogeny, we reconstructed the ancestral states of odontode and bony plate cover, and find that trunk odontodes emerged before dermal bony plates in Loricarioidei. Yet we discovered that when bony plates are absent, other surface bones are always associated with odontodes, suggesting a link between osteogenic and odontogenic developmental pathways, and indicating a remarkable trunk odontogenic potential in Loricarioidei. PMID:29046381
Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen).
Rambaut, Andrew; Lam, Tommy T; Max Carvalho, Luiz; Pybus, Oliver G
2016-01-01
Gene sequences sampled at different points in time can be used to infer molecular phylogenies on a natural timescale of months or years, provided that the sequences in question undergo measurable amounts of evolutionary change between sampling times. Data sets with this property are termed heterochronous and have become increasingly common in several fields of biology, most notably the molecular epidemiology of rapidly evolving viruses. Here we introduce the cross-platform software tool, TempEst (formerly known as Path-O-Gen), for the visualization and analysis of temporally sampled sequence data. Given a molecular phylogeny and the dates of sampling for each sequence, TempEst uses an interactive regression approach to explore the association between genetic divergence through time and sampling dates. TempEst can be used to (1) assess whether there is sufficient temporal signal in the data to proceed with phylogenetic molecular clock analysis, and (2) identify sequences whose genetic divergence and sampling date are incongruent. Examination of the latter can help identify data quality problems, including errors in data annotation, sample contamination, sequence recombination, or alignment error. We recommend that all users of the molecular clock models implemented in BEAST first check their data using TempEst prior to analysis.
Progress, pitfalls and parallel universes: a history of insect phylogenetics
Simon, Chris; Yavorskaya, Margarita; Beutel, Rolf G.
2016-01-01
The phylogeny of insects has been both extensively studied and vigorously debated for over a century. A relatively accurate deep phylogeny had been produced by 1904. It was not substantially improved in topology until recently when phylogenomics settled many long-standing controversies. Intervening advances came instead through methodological improvement. Early molecular phylogenetic studies (1985–2005), dominated by a few genes, provided datasets that were too small to resolve controversial phylogenetic problems. Adding to the lack of consensus, this period was characterized by a polarization of philosophies, with individuals belonging to either parsimony or maximum-likelihood camps; each largely ignoring the insights of the other. The result was an unfortunate detour in which the few perceived phylogenetic revolutions published by both sides of the philosophical divide were probably erroneous. The size of datasets has been growing exponentially since the mid-1980s accompanied by a wave of confidence that all relationships will soon be known. However, large datasets create new challenges, and a large number of genes does not guarantee reliable results. If history is a guide, then the quality of conclusions will be determined by an improved understanding of both molecular and morphological evolution, and not simply the number of genes analysed. PMID:27558853
Liu, Jingjing; Sun, Faqian; Wang, Liang; Ju, Xi; Wu, Weixiang; Chen, Yingxu
2014-01-01
Methane can be used as an alternative carbon source in biological denitrification because it is nontoxic, widely available and relatively inexpensive. A microbial consortium involved in methane oxidation coupled to denitrification (MOD) was enriched with nitrite and nitrate as electron acceptors under micro-aerobic conditions. The 16S rRNA gene combined with pmoA phylogeny of methanotrophs and nirK phylogeny of denitrifiers were analysed to reveal the dominant microbial populations and functional microorganisms. Real-time quantitative polymerase chain reaction results showed high numbers of methanotrophs and denitrifiers in the enriched consortium. The 16S rRNA gene clone library revealed that Methylococcaceae and Methylophilaceae were the dominant populations in the MOD ecosystem. Phylogenetic analyses of pmoA gene clone libraries indicated that all methanotrophs belonged to Methylococcaceae, a type I methanotroph employing the ribulose monophosphate pathway for methane oxidation. Methylotrophic denitrifiers of the Methylophilaceae that can utilize organic intermediates (i.e. formaldehyde, citrate and acetate) released from the methanotrophs played a vital role in aerobic denitrification. This study is the first report to confirm micro-aerobic denitrification and to make phylogenetic and functional assignments for some members of the microbial assemblages involved in MOD. PMID:24245852
Vianna, J.A.; Bonde, R.K.; Caballero, S.; Giraldo, J.P.; Lima, R.P.; Clark, A.; Marmontel, M.; Morales-Vela, B.; De Souza, M. J.; Parr, L.; Rodriguez-Lopez, M.A.; Mignucci-Giannoni, A. A.; Powell, J.A.; Santos, F.R.
2006-01-01
The three living species of manatees, West Indian (Trichechus manatus), Amazonian (Trichechus inunguis) and West African (Trichechus senegalensis), are distributed across the shallow tropical and subtropical waters of America and the western coast of Africa. We have sequenced the mitochondrial DNA control region in 330 Trichechus to compare their phylogeographic patterns. In T. manatus we observed a marked population structure with the identification of three haplotype clusters showing a distinct spatial distribution. A geographic barrier represented by the continuity of the Lesser Antilles to Trinidad Island, near the mouth of the Orinoco River in Venezuela, appears to have restricted the gene flow historically in T. manatus. However, for T. inunguis we observed a single expanding population cluster, with a high diversity of very closely related haplotypes. A marked geographic population structure is likely present in T. senegalensis with at least two distinct clusters. Phylogenetic analyses with the mtDNA cytochrome b gene suggest a clade of the marine Trichechus species, with T. inunguis as the most basal trichechid. This is in agreement with previous morphological analyses. Mitochondrial DNA, autosomal microsatellites and cytogenetic analyses revealed the presence of hybrids between the T. manatus and T. inunguis species at the mouth of the Amazon River in Brazil, extending to the Guyanas and probably as far as the mouth of the Orinoco River. Future conservation strategies should consider the distinct population structure of manatee species, as well as the historical barriers to gene flow and the likely occurrence of interspecific hybridization. ?? 2006 Blackwell Publishing Ltd.
Zhang, Yanjie; Sun, Jin; Li, Xinzheng; Qiu, Jian-Wen
2016-01-01
We reported a nearly complete mitochondrial genome (mitogenome) from the glass sponge Lophophysema eversa, the second mitogenome in the order Amphidiscosida and the ninth in the class Hexactinellida. It is 20,651 base pairs in length and contains 39 genes including 13 protein-coding genes, 2 ribosomal RNA subunit genes and 24 tRNA genes. The gene content and order of L. eversa are identical to those of Tabachnickia sp., the other species with a sequenced mitogenome in Amphidiscosida, except with two additional tRNAs and three tRNA translocations. The cob gene has a +1 translational frameshift. These results will contribute to a better understanding of the phylogeny of glass sponges.
Schuelke, Taruna; Pereira, Tiago José; Hardy, Sarah M; Bik, Holly M
2018-04-01
Studies of host-associated microbes are critical for advancing our understanding of ecology and evolution across diverse taxa and ecosystems. Nematode worms are ubiquitous across most habitats on earth, yet little is known about host-associated microbial assemblages within the phylum. Free-living nematodes are globally abundant and diverse in marine sediments, with species exhibiting distinct buccal cavity (mouth) morphologies that are thought to play an important role in feeding ecology and life history strategies. Here, we investigated patterns in marine nematode microbiomes, by characterizing host-associated microbial taxa in 281 worms isolated from a range of habitat types (deep-sea, shallow water, methane seeps, Lophelia coral mounds, kelp holdfasts) across three distinct geographic regions (Arctic, Southern California and Gulf of Mexico). Microbiome profiles were generated from single worms spanning 33 distinct morphological genera, using a two-gene metabarcoding approach to amplify the V4 region of the 16S ribosomal RNA (rRNA) gene targeting bacteria/archaea and the V1-V2 region of the 18S rRNA gene targeting microbial eukaryotes. Contrary to our expectations, nematode microbiome profiles demonstrated no distinct patterns either globally (across depths and ocean basins) or locally (within site); prokaryotic and eukaryotic microbial assemblages did not correlate with nematode feeding morphology, host phylogeny or morphological identity, ocean region or marine habitat type. However, fine-scale analysis of nematode microbiomes revealed a variety of novel ecological interactions, including putative parasites and symbionts, and potential associations with bacterial/archaeal taxa involved in nitrogen and methane cycling. Our results suggest that in marine habitats, free-living nematodes may utilize diverse and generalist foraging strategies that are not correlated with host genotype or feeding morphology. Furthermore, some abiotic factors such as geographic region and habitat type do not appear to play an obvious role in structuring host-microbe associations or feeding preferences. © 2018 John Wiley & Sons Ltd.
McNeal, Joel R; Arumugunathan, Kathiravetpilla; Kuehl, Jennifer V; Boore, Jeffrey L; Depamphilis, Claude W
2007-12-13
The genus Cuscuta L. (Convolvulaceae), commonly known as dodders, are epiphytic vines that invade the stems of their host with haustorial feeding structures at the points of contact. Although they lack expanded leaves, some species are noticeably chlorophyllous, especially as seedlings and in maturing fruits. Some species are reported as crop pests of worldwide distribution, whereas others are extremely rare and have local distributions and apparent niche specificity. A strong phylogenetic framework for this large genus is essential to understand the interesting ecological, morphological and molecular phenomena that occur within these parasites in an evolutionary context. Here we present a well-supported phylogeny of Cuscuta using sequences of the nuclear ribosomal internal transcribed spacer and plastid rps2, rbcL and matK from representatives across most of the taxonomic diversity of the genus. We use the phylogeny to interpret morphological and plastid genome evolution within the genus. At least three currently recognized taxonomic sections are not monophyletic and subgenus Cuscuta is unequivocally paraphyletic. Plastid genes are extremely variable with regards to evolutionary constraint, with rbcL exhibiting even higher levels of purifying selection in Cuscuta than photosynthetic relatives. Nuclear genome size is highly variable within Cuscuta, particularly within subgenus Grammica, and in some cases may indicate the existence of cryptic species in this large clade of morphologically similar species. Some morphological characters traditionally used to define major taxonomic splits within Cuscuta are homoplastic and are of limited use in defining true evolutionary groups. Chloroplast genome evolution seems to have evolved in a punctuated fashion, with episodes of loss involving suites of genes or tRNAs followed by stabilization of gene content in major clades. Nearly all species of Cuscuta retain some photosynthetic ability, most likely for nutrient apportionment to their seeds, while complete loss of photosynthesis and possible loss of the entire chloroplast genome is limited to a single small clade of outcrossing species found primarily in western South America.
McNeal, Joel R; Arumugunathan, Kathiravetpilla; Kuehl, Jennifer V; Boore, Jeffrey L; dePamphilis, Claude W
2007-01-01
Background The genus Cuscuta L. (Convolvulaceae), commonly known as dodders, are epiphytic vines that invade the stems of their host with haustorial feeding structures at the points of contact. Although they lack expanded leaves, some species are noticeably chlorophyllous, especially as seedlings and in maturing fruits. Some species are reported as crop pests of worldwide distribution, whereas others are extremely rare and have local distributions and apparent niche specificity. A strong phylogenetic framework for this large genus is essential to understand the interesting ecological, morphological and molecular phenomena that occur within these parasites in an evolutionary context. Results Here we present a well-supported phylogeny of Cuscuta using sequences of the nuclear ribosomal internal transcribed spacer and plastid rps2, rbcL and matK from representatives across most of the taxonomic diversity of the genus. We use the phylogeny to interpret morphological and plastid genome evolution within the genus. At least three currently recognized taxonomic sections are not monophyletic and subgenus Cuscuta is unequivocally paraphyletic. Plastid genes are extremely variable with regards to evolutionary constraint, with rbcL exhibiting even higher levels of purifying selection in Cuscuta than photosynthetic relatives. Nuclear genome size is highly variable within Cuscuta, particularly within subgenus Grammica, and in some cases may indicate the existence of cryptic species in this large clade of morphologically similar species. Conclusion Some morphological characters traditionally used to define major taxonomic splits within Cuscuta are homoplastic and are of limited use in defining true evolutionary groups. Chloroplast genome evolution seems to have evolved in a punctuated fashion, with episodes of loss involving suites of genes or tRNAs followed by stabilization of gene content in major clades. Nearly all species of Cuscuta retain some photosynthetic ability, most likely for nutrient apportionment to their seeds, while complete loss of photosynthesis and possible loss of the entire chloroplast genome is limited to a single small clade of outcrossing species found primarily in western South America. PMID:18078516
Réblová, Martina; Jaklitsch, Walter M.; Réblová, Kamila; Štěpánek, Václav
2015-01-01
The Calosphaeriales is revisited with new collection data, living cultures, morphological studies of ascoma centrum, secondary structures of the internal transcribed spacer (ITS) rDNA and phylogeny based on novel DNA sequences of five nuclear ribosomal and protein-coding loci. Morphological features, molecular evidence and information from predicted RNA secondary structures of ITS converged upon robust phylogenies of the Calosphaeriales and Togniniales. The current concept of the Calosphaeriales includes the Calosphaeriaceae and Pleurostomataceae encompassing five monophyletic genera, Calosphaeria, Flabellascus gen. nov., Jattaea, Pleurostoma and Togniniella, strongly supported by Bayesian and Maximum Likelihood methods. The structural elements of ITS1 form characteristic patterns that are phylogenetically conserved, corroborate observations based on morphology and have a high predictive value at the generic level. Three major clades containing 44 species of Phaeoacremonium were recovered in the closely related Togniniales based on ITS, actin and β-tubulin sequences. They are newly characterized by sexual and RNA structural characters and ecology. This approach is a first step towards understanding of the molecular systematics of Phaeoacremonium and possibly its new classification. In the Calosphaeriales, Jattaea aphanospora sp. nov. and J. ribicola sp. nov. are introduced, Calosphaeria taediosa is combined in Jattaea and epitypified. The sexual morph of Phaeoacremonium cinereum was encountered for the first time on decaying wood and obtained in vitro. In order to achieve a single nomenclature, the genera of asexual morphs linked with the Calosphaeriales are transferred to synonymy of their sexual morphs following the principle of priority, i.e. Calosphaeriophora to Calosphaeria, Phaeocrella to Togniniella and Pleurostomophora to Pleurostoma. Three new combinations are proposed, i.e. Pleurostoma ochraceum comb. nov., P. repens comb. nov. and P. richardsiae comb. nov. The morphology-based key is provided to facilitate identification of genera accepted in the Calosphaeriales. PMID:26699541
Crottini, Angelica; Dordel, Janina; Köhler, Jörn; Glaw, Frank; Schmitz, Andreas; Vences, Miguel
2009-10-01
A phylogeny for 29 species of scincine lizards from Madagascar, based on 3693 bp of six mitochondrial and five nuclear genes, revealed multiple parallel evolution of adaptations for a burrowing life, and unexpected relationships of the monotypic genera Androngo and Cryptoscincus. Androngo trivittatus was sister to Pygomeles braconnieri, and Cryptoscincus minimus was deeply nested within the genus Paracontias, all of these being fossorial taxa of elongated bodies and partly or fully reduced limbs. To account for these results, we place Cryptoscincus as a junior synonym of Paracontias, and discuss possible taxonomic consequences that may affect the status of Androngo, once additional data become available.
Genetic Regulatory Networks in Embryogenesis and Evolution
NASA Technical Reports Server (NTRS)
1998-01-01
The article introduces a series of papers that were originally presented at a workshop titled Genetic Regulatory Network in Embryogenesis and Evaluation. Contents include the following: evolution of cleavage programs in relationship to axial specification and body plan evolution, changes in cell lineage specification elucidate evolutionary relations in spiralia, axial patterning in the leech: developmental mechanisms and evolutionary implications, hox genes in arthropod development and evolution, heterochronic genes in development and evolution, a common theme for LIM homeobox gene function across phylogeny, and mechanisms of specification in ascidian embryos.
Toward image phylogeny forests: automatically recovering semantically similar image relationships.
Dias, Zanoni; Goldenstein, Siome; Rocha, Anderson
2013-09-10
In the past few years, several near-duplicate detection methods appeared in the literature to identify the cohabiting versions of a given document online. Following this trend, there are some initial attempts to go beyond the detection task, and look into the structure of evolution within a set of related images overtime. In this paper, we aim at automatically identify the structure of relationships underlying the images, correctly reconstruct their past history and ancestry information, and group them in distinct trees of processing history. We introduce a new algorithm that automatically handles sets of images comprising different related images, and outputs the phylogeny trees (also known as a forest) associated with them. Image phylogeny algorithms have many applications such as finding the first image within a set posted online (useful for tracking copyright infringement perpetrators), hint at child pornography content creators, and narrowing down a list of suspects for online harassment using photographs. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Hoffmann, Federico G.; Opazo, Juan C.; Storz, Jay F.
2010-01-01
Natural selection often promotes evolutionary innovation by coopting preexisting genes for new functions, and this process may be greatly facilitated by gene duplication. Here we report an example of cooptive convergence where paralogous members of the globin gene superfamily independently evolved a specialized O2 transport function in the two deepest branches of the vertebrate family tree. Specifically, phylogenetic evidence demonstrates that erythroid-specific O2 transport hemoglobins evolved independently from different ancestral precursor proteins in jawed vertebrates (gnathostomes) and jawless fish (cyclostomes, represented by lamprey and hagfish). A comprehensive phylogenetic analysis of the vertebrate globin gene superfamily revealed that the erythroid hemoglobins of cyclostomes are orthologous to the cytoglobin protein of gnathostome vertebrates, a hexacoordinate globin that has no O2 transport function and that is predominantly expressed in fibroblasts and related cell types. The phylogeny reconstruction also revealed that vertebrate-specific globins are grouped into four main clades: (i) cyclostome hemoglobin + cytoglobin, (ii) myoglobin + globin E, (iii) globin Y, and (iv) the α- and β-chain hemoglobins of gnathostomes. In the hemoglobins of gnathostomes and cyclostomes, multisubunit quaternary structures provide the basis for cooperative O2 binding and allosteric regulation by coupling the effects of ligand binding at individual subunits with interactions between subunits. However, differences in numerous structural details belie their independent origins. This example of convergent evolution of protein function provides an impressive demonstration of the ability of natural selection to cobble together complex design solutions by tinkering with different variations of the same basic protein scaffold. PMID:20660759
Liu, Jian; Zhang, Shouzhou; Nagalingum, Nathalie S; Chiang, Yu-Chung; Lindstrom, Anders J; Gong, Xun
2018-05-18
The gymnosperm genus Cycas is the sole member of Cycadaceae, and is the largest genus of extant cycads. There are about 115 accepted Cycas species mainly distributed in the paleotropics. Based on morphology, the genus has been divided into six sections and eight subsections, but this taxonomy has not yet been tested in a molecular phylogenetic framework. Although the monophyly of Cycas is broadly accepted, the intrageneric relationships inferred from previous molecular phylogenetic analyses are unclear due to insufficient sampling or uninformative DNA sequence data. In this study, we reconstructed a phylogeny of Cycas using four chloroplast intergenic spacers and seven low-copy nuclear genes and sampling 90% of extant Cycas species. The maximum likelihood and Bayesian inference phylogenies suggest: (1) matrices of either concatenated cpDNA markers or of concatenated nDNA lack sufficient informative sites to resolve the phylogeny alone, however, the phylogeny from the combined cpDNA-nDNA dataset suggests the genus can be roughly divided into 13 clades and six sections that are in agreement with the current classification of the genus; (2) although with partial support, a clade combining sections Panzhihuaenses + Asiorientales is resolved as the earliest diverging branch; (3) section Stangerioides is not monophyletic because the species resolve as a grade; (4) section Indosinenses is not monophyletic as it includes Cycas macrocarpa and C. pranburiensis from section Cycas; (5) section Cycas is the most derived group and its subgroups correspond with geography. Copyright © 2018 Elsevier Inc. All rights reserved.
The evolutionary history of ferns inferred from 25 low-copy nuclear genes.
Rothfels, Carl J; Li, Fay-Wei; Sigel, Erin M; Huiet, Layne; Larsson, Anders; Burge, Dylan O; Ruhsam, Markus; Deyholos, Michael; Soltis, Douglas E; Stewart, C Neal; Shaw, Shane W; Pokorny, Lisa; Chen, Tao; dePamphilis, Claude; DeGironimo, Lisa; Chen, Li; Wei, Xiaofeng; Sun, Xiao; Korall, Petra; Stevenson, Dennis W; Graham, Sean W; Wong, Gane K-S; Pryer, Kathleen M
2015-07-01
• Understanding fern (monilophyte) phylogeny and its evolutionary timescale is critical for broad investigations of the evolution of land plants, and for providing the point of comparison necessary for studying the evolution of the fern sister group, seed plants. Molecular phylogenetic investigations have revolutionized our understanding of fern phylogeny, however, to date, these studies have relied almost exclusively on plastid data.• Here we take a curated phylogenomics approach to infer the first broad fern phylogeny from multiple nuclear loci, by combining broad taxon sampling (73 ferns and 12 outgroup species) with focused character sampling (25 loci comprising 35877 bp), along with rigorous alignment, orthology inference and model selection.• Our phylogeny corroborates some earlier inferences and provides novel insights; in particular, we find strong support for Equisetales as sister to the rest of ferns, Marattiales as sister to leptosporangiate ferns, and Dennstaedtiaceae as sister to the eupolypods. Our divergence-time analyses reveal that divergences among the extant fern orders all occurred prior to ∼200 MYA. Finally, our species-tree inferences are congruent with analyses of concatenated data, but generally with lower support. Those cases where species-tree support values are higher than expected involve relationships that have been supported by smaller plastid datasets, suggesting that deep coalescence may be reducing support from the concatenated nuclear data.• Our study demonstrates the utility of a curated phylogenomics approach to inferring fern phylogeny, and highlights the need to consider underlying data characteristics, along with data quantity, in phylogenetic studies. © 2015 Botanical Society of America, Inc.
Evaluating phylogenetic congruence in the post-genomic era.
Leigh, Jessica W; Lapointe, François-Joseph; Lopez, Philippe; Bapteste, Eric
2011-01-01
Congruence is a broadly applied notion in evolutionary biology used to justify multigene phylogeny or phylogenomics, as well as in studies of coevolution, lateral gene transfer, and as evidence for common descent. Existing methods for identifying incongruence or heterogeneity using character data were designed for data sets that are both small and expected to be rarely incongruent. At the same time, methods that assess incongruence using comparison of trees test a null hypothesis of uncorrelated tree structures, which may be inappropriate for phylogenomic studies. As such, they are ill-suited for the growing number of available genome sequences, most of which are from prokaryotes and viruses, either for phylogenomic analysis or for studies of the evolutionary forces and events that have shaped these genomes. Specifically, many existing methods scale poorly with large numbers of genes, cannot accommodate high levels of incongruence, and do not adequately model patterns of missing taxa for different markers. We propose the development of novel incongruence assessment methods suitable for the analysis of the molecular evolution of the vast majority of life and support the investigation of homogeneity of evolutionary process in cases where markers do not share identical tree structures.
Evaluating Phylogenetic Congruence in the Post-Genomic Era
Leigh, Jessica W.; Lapointe, François-Joseph; Lopez, Philippe; Bapteste, Eric
2011-01-01
Congruence is a broadly applied notion in evolutionary biology used to justify multigene phylogeny or phylogenomics, as well as in studies of coevolution, lateral gene transfer, and as evidence for common descent. Existing methods for identifying incongruence or heterogeneity using character data were designed for data sets that are both small and expected to be rarely incongruent. At the same time, methods that assess incongruence using comparison of trees test a null hypothesis of uncorrelated tree structures, which may be inappropriate for phylogenomic studies. As such, they are ill-suited for the growing number of available genome sequences, most of which are from prokaryotes and viruses, either for phylogenomic analysis or for studies of the evolutionary forces and events that have shaped these genomes. Specifically, many existing methods scale poorly with large numbers of genes, cannot accommodate high levels of incongruence, and do not adequately model patterns of missing taxa for different markers. We propose the development of novel incongruence assessment methods suitable for the analysis of the molecular evolution of the vast majority of life and support the investigation of homogeneity of evolutionary process in cases where markers do not share identical tree structures. PMID:21712432
USDA-ARS?s Scientific Manuscript database
Background: Citrus (Rutaceae) comprises of many important cultivated species which generally hybridize easily. Phylogenetic study of a group showing extensive hybridization is challenging. Since the genus Citrus has diverged recently (4-12 Ma), incomplete lineage sorting of ancestral polymorphisms...
Refined NrfA phylogeny improves PCR-based nrfA gene detection
USDA-ARS?s Scientific Manuscript database
Dissimilatory nitrate reduction to ammonium (DNRA) promotes N-retention in the terrestrial nitrogen- (N-) cycle. Respiratory nitrite reduction to ammonium is catalyzed by the nitrite reductase NrfA. Prior phylogenetic analyses showed that NrfA divided into18 distinct clades amongst available sequenc...
The genome of the social amoeba Dictyostelium discoideum
Eichinger, L.; Pachebat, J.A.; Glöckner, G.; Rajandream, M.-A.; Sucgang, R.; Berriman, M.; Song, J.; Olsen, R.; Szafranski, K.; Xu, Q.; Tunggal, B.; Kummerfeld, S.; Madera, M.; Konfortov, B. A.; Rivero, F.; Bankier, A. T.; Lehmann, R.; Hamlin, N.; Davies, R.; Gaudet, P.; Fey, P.; Pilcher, K.; Chen, G.; Saunders, D.; Sodergren, E.; Davis, P.; Kerhornou, A.; Nie, X.; Hall, N.; Anjard, C.; Hemphill, L.; Bason, N.; Farbrother, P.; Desany, B.; Just, E.; Morio, T.; Rost, R.; Churcher, C.; Cooper, J.; Haydock, S.; van Driessche, N.; Cronin, A.; Goodhead, I.; Muzny, D.; Mourier, T.; Pain, A.; Lu, M.; Harper, D.; Lindsay, R.; Hauser, H.; James, K.; Quiles, M.; Babu, M. Madan; Saito, T.; Buchrieser, C.; Wardroper, A.; Felder, M.; Thangavelu, M.; Johnson, D.; Knights, A.; Loulseged, H.; Mungall, K.; Oliver, K.; Price, C.; Quail, M.A.; Urushihara, H.; Hernandez, J.; Rabbinowitsch, E.; Steffen, D.; Sanders, M.; Ma, J.; Kohara, Y.; Sharp, S.; Simmonds, M.; Spiegler, S.; Tivey, A.; Sugano, S.; White, B.; Walker, D.; Woodward, J.; Winckler, T.; Tanaka, Y.; Shaulsky, G.; Schleicher, M.; Weinstock, G.; Rosenthal, A.; Cox, E.C.; Chisholm, R. L.; Gibbs, R.; Loomis, W. F.; Platzer, M.; Kay, R. R.; Williams, J.; Dear, P. H.; Noegel, A. A.; Barrell, B.; Kuspa, A.
2005-01-01
The social amoebae are exceptional in their ability to alternate between unicellular and multicellular forms. Here we describe the genome of the best-studied member of this group, Dictyostelium discoideum. The gene-dense chromosomes encode ~12,500 predicted proteins, a high proportion of which have long repetitive amino acid tracts. There are many genes for polyketide synthases and ABC transporters, suggesting an extensive secondary metabolism for producing and exporting small molecules. The genome is rich in complex repeats, one class of which is clustered and may serve as centromeres. Partial copies of the extrachromosomal rDNA element are found at the ends of each chromosome, suggesting a novel telomere structure and the use of a common mechanism to maintain both the rDNA and chromosomal termini. A proteome-based phylogeny shows that the amoebozoa diverged from the animal/fungal lineage after the plant/animal split, but Dictyostelium appears to have retained more of the diversity of the ancestral genome than either of these two groups. PMID:15875012
Rodríguez, Ariel; Burgon, James D; Lyra, Mariana; Irisarri, Iker; Baurain, Denis; Blaustein, Leon; Göçmen, Bayram; Künzel, Sven; Mable, Barbara K; Nolte, Arne W; Veith, Michael; Steinfartz, Sebastian; Elmer, Kathryn R; Philippe, Hervé; Vences, Miguel
2017-10-01
The rise of high-throughput sequencing techniques provides the unprecedented opportunity to analyse controversial phylogenetic relationships in great depth, but also introduces a risk of being misinterpreted by high node support values influenced by unevenly distributed missing data or unrealistic model assumptions. Here, we use three largely independent phylogenomic data sets to reconstruct the controversial phylogeny of true salamanders of the genus Salamandra, a group of amphibians providing an intriguing model to study the evolution of aposematism and viviparity. For all six species of the genus Salamandra, and two outgroup species from its sister genus Lyciasalamandra, we used RNA sequencing (RNAseq) and restriction site associated DNA sequencing (RADseq) to obtain data for: (1) 3070 nuclear protein-coding genes from RNAseq; (2) 7440 loci obtained by RADseq; and (3) full mitochondrial genomes. The RNAseq and RADseq data sets retrieved fully congruent topologies when each of them was analyzed in a concatenation approach, with high support for: (1) S. infraimmaculata being sister group to all other Salamandra species; (2) S. algira being sister to S. salamandra; (3) these two species being the sister group to a clade containing S. atra, S. corsica and S. lanzai; and (4) the alpine species S. atra and S. lanzai being sister taxa. The phylogeny inferred from the mitochondrial genome sequences differed from these results, most notably by strongly supporting a clade containing S. atra and S. corsica as sister taxa. A different placement of S. corsica was also retrieved when analysing the RNAseq and RADseq data under species tree approaches. Closer examination of gene trees derived from RNAseq revealed that only a low number of them supported each of the alternative placements of S. atra. Furthermore, gene jackknife support for the S. atra - S. lanzai node stabilized only with very large concatenated data sets. The phylogeny of true salamanders thus provides a compelling example of how classical node support metrics such as bootstrap and Bayesian posterior probability can provide high confidence values in a phylogenomic topology even if the phylogenetic signal for some nodes is spurious, highlighting the importance of complementary approaches such as gene jackknifing. Yet, the general congruence among the topologies recovered from the RNAseq and RADseq data sets increases our confidence in the results, and validates the use of phylotranscriptomic approaches for reconstructing shallow relationships among closely related taxa. We hypothesize that the evolution of Salamandra has been characterized by episodes of introgressive hybridization, which would explain the difficulties of fully reconstructing their evolutionary relationships. Copyright © 2017. Published by Elsevier Inc.
Zhang, Yanan; Song, Tao; Pan, Tao; Sun, Xiaonan; Sun, Zhonglou; Qian, Lifu; Zhang, Baowei
2016-07-01
The complete sequence of the mitochondrial genome was determined for Asio flammeus, which is distributed widely in geography. The length of the complete mitochondrial genome was 18,966 bp, containing 2 rRNA genes, 22 tRNA genes, 13 protein-coding genes (PCGs), and 1 non-coding region (D-loop). All the genes were distributed on the H-strand, except for the ND6 subunit gene and eight tRNA genes which were encoded on the L-strand. The D-loop of A. flammeus contained many tandem repeats of varying lengths and repeat numbers. The molecular-based phylogeny showed that our species acted as the sister group to A. capensis and the supported Asio was the monophyletic group.
Kumar, Nitin; Lad, Ganesh; Giuntini, Elisa; Kaye, Maria E.; Udomwong, Piyachat; Shamsani, N. Jannah; Young, J. Peter W.; Bailly, Xavier
2015-01-01
Biological species may remain distinct because of genetic isolation or ecological adaptation, but these two aspects do not always coincide. To establish the nature of the species boundary within a local bacterial population, we characterized a sympatric population of the bacterium Rhizobium leguminosarum by genomic sequencing of 72 isolates. Although all strains have 16S rRNA typical of R. leguminosarum, they fall into five genospecies by the criterion of average nucleotide identity (ANI). Many genes, on plasmids as well as the chromosome, support this division: recombination of core genes has been largely within genospecies. Nevertheless, variation in ecological properties, including symbiotic host range and carbon-source utilization, cuts across these genospecies, so that none of these phenotypes is diagnostic of genospecies. This phenotypic variation is conferred by mobile genes. The genospecies meet the Mayr criteria for biological species in respect of their core genes, but do not correspond to coherent ecological groups, so periodic selection may not be effective in purging variation within them. The population structure is incompatible with traditional ‘polyphasic taxonomy′ that requires bacterial species to have both phylogenetic coherence and distinctive phenotypes. More generally, genomics has revealed that many bacterial species share adaptive modules by horizontal gene transfer, and we envisage a more consistent taxonomic framework that explicitly recognizes this. Significant phenotypes should be recognized as ‘biovars' within species that are defined by core gene phylogeny. PMID:25589577
Analysis of the Na+/Ca2+ Exchanger Gene Family within the Phylum Nematoda
He, Chao; O'Halloran, Damien M.
2014-01-01
Na+/Ca2+ exchangers are low affinity, high capacity transporters that rapidly transport calcium at the plasma membrane, mitochondrion, endoplasmic (and sarcoplasmic) reticulum, and the nucleus. Na+/Ca2+ exchangers are widely expressed in diverse cell types where they contribute homeostatic balance to calcium levels. In animals, Na+/Ca2+ exchangers are divided into three groups based upon stoichiometry: Na+/Ca2+ exchangers (NCX), Na+/Ca2+/K+ exchangers (NCKX), and Ca2+/Cation exchangers (CCX). In mammals there are three NCX genes, five NCKX genes and one CCX (NCLX) gene. The genome of the nematode Caenorhabditis elegans contains ten Na+/Ca2+ exchanger genes: three NCX; five CCX; and two NCKX genes. Here we set out to characterize structural and taxonomic specializations within the family of Na+/Ca2+ exchangers across the phylum Nematoda. In this analysis we identify Na+/Ca2+ exchanger genes from twelve species of nematodes and reconstruct their phylogenetic and evolutionary relationships. The most notable feature of the resulting phylogenies was the heterogeneous evolution observed within exchanger subtypes. Specifically, in the case of the CCX exchangers we did not detect members of this class in three Clade III nematodes. Within the Caenorhabditis and Pristionchus lineages we identify between three and five CCX representatives, whereas in other Clade V and also Clade IV nematode taxa we only observed a single CCX gene in each species, and in the Clade III nematode taxa that we sampled we identify NCX and NCKX encoding genes but no evidence of CCX representatives using our mining approach. We also provided re-annotation for predicted CCX gene structures from Heterorhabditis bacteriophora and Caenorhabditis japonica by RT-PCR and sequencing. Together, these findings reveal a complex picture of Na+/Ca2+ transporters in nematodes that suggest an incongruent evolutionary history of proteins that provide central control of calcium dynamics. PMID:25397810
Structure and Evolution of Insect Sperm: New Interpretations in the Age of Phylogenomics.
Dallai, Romano; Gottardo, Marco; Beutel, Rolf Georg
2016-01-01
This comprehensive review of the structure of sperm in all orders of insects evaluates phylogenetic implications, with the background of a phylogeny based on transcriptomes. Sperm characters strongly support several major branches of the phylogeny of insects-for instance, Cercophora, Dicondylia, and Psocodea-and also different infraordinal groups. Some closely related taxa, such as Trichoptera and Lepidoptera (Amphiesmenoptera), differ greatly in sperm structure. Sperm characters are very conservative in some groups (Heteroptera, Odonata) but highly variable in others, including Zoraptera, a small and morphologically uniform group with a tremendously accelerated rate of sperm evolution. Unusual patterns such as sperm dimorphism, the formation of bundles, or aflagellate and immotile sperm have evolved independently in several groups.
2012-01-01
Background The Nymphaeales (waterlilly and relatives) lineage has diverged as the second branch of basal angiosperms and comprises of two families: Cabombaceae and Nymphaceae. The classification of Nymphaeales and phylogeny within the flowering plants are quite intriguing as several systems (Thorne system, Dahlgren system, Cronquist system, Takhtajan system and APG III system (Angiosperm Phylogeny Group III system) have attempted to redefine the Nymphaeales taxonomy. There have been also fossil records consisting especially of seeds, pollen, stems, leaves and flowers as early as the lower Cretaceous. Here we present an in silico study of the order Nymphaeales taking maturaseK (matK) and internal transcribed spacer (ITS2) as biomarkers for phylogeny reconstruction (using character-based methods and Bayesian approach) and identification of motifs for DNA barcoding. Results The Maximum Likelihood (ML) and Bayesian approach yielded congruent fully resolved and well-supported trees using a concatenated (ITS2+ matK) supermatrix aligned dataset. The taxon sampling corroborates the monophyly of Cabombaceae. Nuphar emerges as a monophyletic clade in the family Nymphaeaceae while there are slight discrepancies in the monophyletic nature of the genera Nymphaea owing to Victoria-Euryale and Ondinea grouping in the same node of Nymphaeaceae. ITS2 secondary structures alignment corroborate the primary sequence analysis. Hydatellaceae emerged as a sister clade to Nymphaeaceae and had a basal lineage amongst the water lilly clades. Species from Cycas and Ginkgo were taken as outgroups and were rooted in the overall tree topology from various methods. Conclusions MatK genes are fast evolving highly variant regions of plant chloroplast DNA that can serve as potential biomarkers for DNA barcoding and also in generating primers for angiosperms with identification of unique motif regions. We have reported unique genus specific motif regions in the Order Nymphaeles from matK dataset which can be further validated for barcoding and designing of PCR primers. Our analysis using a novel approach of sequence-structure alignment and phylogenetic reconstruction using molecular morphometrics congrue with the current placement of Hydatellaceae within the early-divergent angiosperm order Nymphaeales. The results underscore the fact that more diverse genera, if not fully resolved to be monophyletic, should be represented by all major lineages. PMID:23282079
Ludwig, Yvonne; Zhang, Yanxiang; Hochholdinger, Frank
2013-01-01
The plant hormone auxin plays a key role in the coordination of many aspects of growth and development. AUXIN/INDOLE-3-ACETIC ACID (Aux/IAA) genes encode instable primary auxin responsive regulators of plant development that display a protein structure with four characteristic domains. In the present study, a comprehensive analysis of the 34 members of the maize Aux/IAA gene family was performed. Phylogenetic reconstructions revealed two classes of Aux/IAA proteins that can be distinguished by alterations in their domain III. Seven pairs of paralogous maize Aux/IAA proteins were discovered. Comprehensive root-type and tissue-specific expression profiling revealed unique expression patterns of the diverse members of the gene family. Remarkably, five of seven pairs of paralogous genes displayed highly correlated expression patterns in roots. All but one (ZmIAA23) tested maize Aux/IAA genes were auxin inducible, displaying two types of auxin induction within three hours of treatment. Moreover, 51 of 55 (93%) differential Aux/IAA expression patterns between different root-types followed the expression tendency: crown roots > seminal roots > primary roots > lateral roots. This pattern might imply root-type-specific regulation of Aux/IAA transcript abundance. In summary, the detailed analysis of the maize Aux/IAA gene family provides novel insights in the evolution and developmental regulation and thus the function of these genes in different root-types and tissues. PMID:24223858
DOE Office of Scientific and Technical Information (OSTI.GOV)
Norton, Jeanette M.; Klotz, Martin G; Stein, Lisa Y
2008-01-01
The complete genome of the ammonia-oxidizing bacterium, Nitrosospira multiformis (ATCC 25196T), consists of a circular chromosome and three small plasmids totaling 3,234,309 bp and encoding 2827 putative proteins. Of these, 2026 proteins have predicted functions and 801 are without conserved functional domains, yet 747 of these have similarity to other predicted proteins in databases. Gene homologs from Nitrosomonas europaea and N. eutropha were the best match for 42% of the predicted genes in N. multiformis. The genome contains three nearly identical copies of amo and hao gene clusters as large repeats. Distinguishing features compared to N. europaea include: the presencemore » of gene clusters encoding urease and hydrogenase, a RuBisCO-encoding operon of distinctive structure and phylogeny, and a relatively small complement of genes related to Fe acquisition. Systems for synthesis of a pyoverdine-like siderophore and for acyl-homoserine lactone were unique to N. multiformis among the sequenced AOB genomes. Gene clusters encoding proteins associated with outer membrane and cell envelope functions including transporters, porins, exopolysaccharide synthesis, capsule formation and protein sorting/export were abundant. Numerous sensory transduction and response regulator gene systems directed towards sensing of the extracellular environment are described. Gene clusters for glycogen, polyphosphate and cyanophycin storage and utilization were identified providing mechanisms for meeting energy requirements under substrate-limited conditions. The genome of N. multiformis encodes the core pathways for chemolithoautotrophy along with adaptations for surface growth and survival in soil environments.« less
Ludwig, Yvonne; Zhang, Yanxiang; Hochholdinger, Frank
2013-01-01
The plant hormone auxin plays a key role in the coordination of many aspects of growth and development. AUXIN/INDOLE-3-ACETIC ACID (Aux/IAA) genes encode instable primary auxin responsive regulators of plant development that display a protein structure with four characteristic domains. In the present study, a comprehensive analysis of the 34 members of the maize Aux/IAA gene family was performed. Phylogenetic reconstructions revealed two classes of Aux/IAA proteins that can be distinguished by alterations in their domain III. Seven pairs of paralogous maize Aux/IAA proteins were discovered. Comprehensive root-type and tissue-specific expression profiling revealed unique expression patterns of the diverse members of the gene family. Remarkably, five of seven pairs of paralogous genes displayed highly correlated expression patterns in roots. All but one (ZmIAA23) tested maize Aux/IAA genes were auxin inducible, displaying two types of auxin induction within three hours of treatment. Moreover, 51 of 55 (93%) differential Aux/IAA expression patterns between different root-types followed the expression tendency: crown roots > seminal roots > primary roots > lateral roots. This pattern might imply root-type-specific regulation of Aux/IAA transcript abundance. In summary, the detailed analysis of the maize Aux/IAA gene family provides novel insights in the evolution and developmental regulation and thus the function of these genes in different root-types and tissues.
Recombination and Population Mosaic of a Multifunctional Viral Gene, Adeno-Associated Virus cap
Takeuchi, Yasuhiro; Myers, Richard; Danos, Olivier
2008-01-01
Homologous recombination is a dominant force in evolution and results in genetic mosaics. To detect evidence of recombination events and assess the biological significance of genetic mosaics, genome sequences for various viral populations of reasonably large size are now available in the GenBank. We studied a multi-functional viral gene, the adeno-associated virus (AAV) cap gene, which codes for three capsid proteins, VP1, VP2 and VP3. VP1-3 share a common C-terminal domain corresponding to VP3, which forms the viral core structure, while the VP1 unique N-terminal part contains an enzymatic domain with phospholipase A2 activity. Our recombinant detection program (RecI) revealed five novel recombination events, four of which have their cross-over points in the N-terminal, VP1 and VP2 unique region. Comparison of phylogenetic trees for different cap gene regions confirmed discordant phylogenies for the recombinant sequences. Furthermore, differences in the phylogenetic tree structures for the VP1 unique (VP1u) region and the rest of cap highlighted the mosaic nature of cap gene in the AAV population: two dominant forms of VP1u sequences were identified and these forms are linked to diverse sequences in the rest of cap gene. This observation together with the finding of frequent recombination in the VP1 and 2 unique regions suggests that this region is a recombination hot spot. Recombination events in this region preserve protein blocks of distinctive functions and contribute to convergence in VP1u and divergence of the rest of cap. Additionally the possible biological significance of two dominant VP1u forms is inferred. PMID:18286191
Foster, Charles S P; Henwood, Murray J; Ho, Simon Y W
2018-05-25
Data sets comprising small numbers of genetic markers are not always able to resolve phylogenetic relationships. This has frequently been the case in molecular systematic studies of plants, with many analyses being based on sequence data from only two or three chloroplast genes. An example of this comes from the riceflowers Pimelea Banks & Sol. ex Gaertn. (Thymelaeaceae), a large genus of flowering plants predominantly distributed in Australia. Despite the considerable morphological variation in the genus, low sequence divergence in chloroplast markers has led to the phylogeny of Pimelea remaining largely uncertain. In this study, we resolve the backbone of the phylogeny of Pimelea in comprehensive Bayesian and maximum-likelihood analyses of plastome sequences from 41 taxa. However, some relationships received only moderate to poor support, and the Pimelea clade contained extremely short internal branches. By using topology-clustering analyses, we demonstrate that conflicting phylogenetic signals can be found across the trees estimated from individual chloroplast protein-coding genes. A relaxed-clock dating analysis reveals that Pimelea arose in the mid-Miocene, with most divergences within the genus occurring during a subsequent rapid diversification. Our new phylogenetic estimate offers better resolution and is more strongly supported than previous estimates, providing a platform for future taxonomic revisions of both Pimelea and the broader subfamily. Our study has demonstrated the substantial improvements in phylogenetic resolution that can be achieved using plastome-scale data sets in plant molecular systematics. Copyright © 2018 Elsevier Inc. All rights reserved.
Mingo-Casas, Patricia; Sandonís, Virginia; Obón, Elena; Berciano, José M; Vázquez-Morón, Sonia; Juste, Javier; Echevarría, Juan E
2018-04-01
Previous studies have shown that EBLV-1 strains exclusively hosted by Eptesicus isabellinus bats in the Iberian Peninsula cluster in a specific monophyletic group that is related to the EBLV-1b lineage found in the rest of Europe. More recently, enhanced passive surveillance has allowed the detection of the first EBLV-1 strains associated to Eptesicus serotinus south of the Pyrenees. The aim of this study is the reconstruction of the EBLV-1 phylogeny and phylodynamics in the Iberian Peninsula in the context of the European continent. We have sequenced 23 EBLV-1 strains detected on nine E. serotinus and 14 E. isabellinus. Phylogenetic analyses were performed on the first 400-bp-5' fragment of the Nucleoprotein (N) gene together with other 162 sequences from Europe. Besides, fragments of the variable region of the phosphoprotein (P) gene and the glycoprotein-polymerase (G-L) intergenic region were studied on Spanish samples. Phylogenies show that two of the new EBLV-1a strains from Iberian E. serotinus clustered together with French strains from the North of the Pyrenees, suggesting a recent expansion southwards of this subtype. The remaining seven Iberian strains from E. serotinus grouped, instead, within the cluster linked, so far, to E. isabellinus, indicating that spatial distribution prevails over species specificity in explaining rabies distribution and supporting interspecific transmission. The structure found within the Iberian Peninsula for EBLV-1b is in concordance with that described previously for E. isabellinus. Finally, we have found that the current EBLV-1 European strains could have emerged only 175 years ago according to our evolutionary dynamics analyses.
Ramasamy, Sukanya; Ometto, Lino; Crava, Cristina M.; Revadi, Santosh; Kaur, Rupinder; Horner, David S.; Pisani, Davide; Dekker, Teun; Anfora, Gianfranco; Rota-Stabelli, Omar
2016-01-01
How the evolution of olfactory genes correlates with adaption to new ecological niches is still a debated topic. We explored this issue in Drosophila suzukii, an emerging model that reproduces on fresh fruit rather than in fermenting substrates like most other Drosophila. We first annotated the repertoire of odorant receptors (ORs), odorant binding proteins (OBPs), and antennal ionotropic receptors (aIRs) in the genomes of two strains of D. suzukii and of its close relative Drosophila biarmipes. We then analyzed these genes on the phylogeny of 14 Drosophila species: whereas ORs and OBPs are characterized by higher turnover rates in some lineages including D. suzukii, aIRs are conserved throughout the genus. Drosophila suzukii is further characterized by a non-random distribution of OR turnover on the gene phylogeny, consistent with a change in selective pressures. In D. suzukii, we found duplications and signs of positive selection in ORs with affinity for short-chain esters, and loss of function of ORs with affinity for volatiles produced during fermentation. These receptors—Or85a and Or22a—are characterized by divergent alleles in the European and American genomes, and we hypothesize that they may have been replaced by some of the duplicated ORs in corresponding neurons, a hypothesis reciprocally confirmed by electrophysiological recordings. Our study quantifies the evolution of olfactory genes in Drosophila and reveals an array of genomic events that can be associated with the ecological adaptations of D. suzukii. PMID:27435796
Bybee, Seth M; Bracken-Grissom, Heather; Haynes, Benjamin D; Hermansen, Russell A; Byers, Robert L; Clement, Mark J; Udall, Joshua A; Wilcox, Edward R; Crandall, Keith A
2011-01-01
Next-gen sequencing technologies have revolutionized data collection in genetic studies and advanced genome biology to novel frontiers. However, to date, next-gen technologies have been used principally for whole genome sequencing and transcriptome sequencing. Yet many questions in population genetics and systematics rely on sequencing specific genes of known function or diversity levels. Here, we describe a targeted amplicon sequencing (TAS) approach capitalizing on next-gen capacity to sequence large numbers of targeted gene regions from a large number of samples. Our TAS approach is easily scalable, simple in execution, neither time-nor labor-intensive, relatively inexpensive, and can be applied to a broad diversity of organisms and/or genes. Our TAS approach includes a bioinformatic application, BarcodeCrucher, to take raw next-gen sequence reads and perform quality control checks and convert the data into FASTA format organized by gene and sample, ready for phylogenetic analyses. We demonstrate our approach by sequencing targeted genes of known phylogenetic utility to estimate a phylogeny for the Pancrustacea. We generated data from 44 taxa using 68 different 10-bp multiplexing identifiers. The overall quality of data produced was robust and was informative for phylogeny estimation. The potential for this method to produce copious amounts of data from a single 454 plate (e.g., 325 taxa for 24 loci) significantly reduces sequencing expenses incurred from traditional Sanger sequencing. We further discuss the advantages and disadvantages of this method, while offering suggestions to enhance the approach.
Bybee, Seth M.; Bracken-Grissom, Heather; Haynes, Benjamin D.; Hermansen, Russell A.; Byers, Robert L.; Clement, Mark J.; Udall, Joshua A.; Wilcox, Edward R.; Crandall, Keith A.
2011-01-01
Next-gen sequencing technologies have revolutionized data collection in genetic studies and advanced genome biology to novel frontiers. However, to date, next-gen technologies have been used principally for whole genome sequencing and transcriptome sequencing. Yet many questions in population genetics and systematics rely on sequencing specific genes of known function or diversity levels. Here, we describe a targeted amplicon sequencing (TAS) approach capitalizing on next-gen capacity to sequence large numbers of targeted gene regions from a large number of samples. Our TAS approach is easily scalable, simple in execution, neither time-nor labor-intensive, relatively inexpensive, and can be applied to a broad diversity of organisms and/or genes. Our TAS approach includes a bioinformatic application, BarcodeCrucher, to take raw next-gen sequence reads and perform quality control checks and convert the data into FASTA format organized by gene and sample, ready for phylogenetic analyses. We demonstrate our approach by sequencing targeted genes of known phylogenetic utility to estimate a phylogeny for the Pancrustacea. We generated data from 44 taxa using 68 different 10-bp multiplexing identifiers. The overall quality of data produced was robust and was informative for phylogeny estimation. The potential for this method to produce copious amounts of data from a single 454 plate (e.g., 325 taxa for 24 loci) significantly reduces sequencing expenses incurred from traditional Sanger sequencing. We further discuss the advantages and disadvantages of this method, while offering suggestions to enhance the approach. PMID:22002916
Willerslev, Eske; Gilbert, M Thomas P; Binladen, Jonas; Ho, Simon YW; Campos, Paula F; Ratan, Aakrosh; Tomsho, Lynn P; da Fonseca, Rute R; Sher, Andrei; Kuznetsova, Tatanya V; Nowak-Kemp, Malgosia; Roth, Terri L; Miller, Webb; Schuster, Stephan C
2009-01-01
Background The scientific literature contains many examples where DNA sequence analyses have been used to provide definitive answers to phylogenetic problems that traditional (non-DNA based) approaches alone have failed to resolve. One notable example concerns the rhinoceroses, a group for which several contradictory phylogenies were proposed on the basis of morphology, then apparently resolved using mitochondrial DNA fragments. Results In this study we report the first complete mitochondrial genome sequences of the extinct ice-age woolly rhinoceros (Coelodonta antiquitatis), and the threatened Javan (Rhinoceros sondaicus), Sumatran (Dicerorhinus sumatrensis), and black (Diceros bicornis) rhinoceroses. In combination with the previously published mitochondrial genomes of the white (Ceratotherium simum) and Indian (Rhinoceros unicornis) rhinoceroses, this data set putatively enables reconstruction of the rhinoceros phylogeny. While the six species cluster into three strongly supported sister-pairings: (i) The black/white, (ii) the woolly/Sumatran, and (iii) the Javan/Indian, resolution of the higher-level relationships has no statistical support. The phylogenetic signal from individual genes is highly diffuse, with mixed topological support from different genes. Furthermore, the choice of outgroup (horse vs tapir) has considerable effect on reconstruction of the phylogeny. The lack of resolution is suggestive of a hard polytomy at the base of crown-group Rhinocerotidae, and this is supported by an investigation of the relative branch lengths. Conclusion Satisfactory resolution of the rhinoceros phylogeny may not be achievable without additional analyses of substantial amounts of nuclear DNA. This study provides a compelling demonstration that, in spite of substantial sequence length, there are significant limitations with single-locus phylogenetics. We expect further examples of this to appear as next-generation, large-scale sequencing of complete mitochondrial genomes becomes commonplace in evolutionary studies. "The human factor in classification is nowhere more evident than in dealing with this superfamily (Rhinocerotoidea)." G. G. Simpson (1945) PMID:19432984
Genome-wide diversity and selective pressure in the human rhinovirus
Kistler, Amy L; Webster, Dale R; Rouskin, Silvi; Magrini, Vince; Credle, Joel J; Schnurr, David P; Boushey, Homer A; Mardis, Elaine R; Li, Hao; DeRisi, Joseph L
2007-01-01
Background The human rhinoviruses (HRV) are one of the most common and diverse respiratory pathogens of humans. Over 100 distinct HRV serotypes are known, yet only 6 genomes are available. Due to the paucity of HRV genome sequence, little is known about the genetic diversity within HRV or the forces driving this diversity. Previous comparative genome sequence analyses indicate that recombination drives diversification in multiple genera of the picornavirus family, yet it remains unclear if this holds for HRV. Results To resolve this and gain insight into the forces driving diversification in HRV, we generated a representative set of 34 fully sequenced HRVs. Analysis of these genomes shows consistent phylogenies across the genome, conserved non-coding elements, and only limited recombination. However, spikes of genetic diversity at both the nucleotide and amino acid level are detectable within every locus of the genome. Despite this, the HRV genome as a whole is under purifying selective pressure, with islands of diversifying pressure in the VP1, VP2, and VP3 structural genes and two non-structural genes, the 3C protease and 3D polymerase. Mapping diversifying residues in these factors onto available 3-dimensional structures revealed the diversifying capsid residues partition to the external surface of the viral particle in statistically significant proximity to antigenic sites. Diversifying pressure in the pleconaril binding site is confined to a single residue known to confer drug resistance (VP1 191). In contrast, diversifying pressure in the non-structural genes is less clear, mapping both nearby and beyond characterized functional domains of these factors. Conclusion This work provides a foundation for understanding HRV genetic diversity and insight into the underlying biology driving evolution in HRV. It expands our knowledge of the genome sequence space that HRV reference serotypes occupy and how the pattern of genetic diversity across HRV genomes differs from other picornaviruses. It also reveals evidence of diversifying selective pressure in both structural genes known to interact with the host immune system and in domains of unassigned function in the non-structural 3C and 3D genes, raising the possibility that diversification of undiscovered functions in these essential factors may influence HRV fitness and evolution. PMID:17477878
Phylogeny-Based Systematization of Arabidopsis Proteins with Histone H1 Globular Domain1[OPEN
Knizewski, Lukasz; Schmidt, Anja; Ginalski, Krzysztof
2017-01-01
H1 (or linker) histones are basic nuclear proteins that possess an evolutionarily conserved nucleosome-binding globular domain, GH1. They perform critical functions in determining the accessibility of chromatin DNA to trans-acting factors. In most metazoan species studied so far, linker histones are highly heterogenous, with numerous nonallelic variants cooccurring in the same cells. The phylogenetic relationships among these variants as well as their structural and functional properties have been relatively well established. This contrasts markedly with the rather limited knowledge concerning the phylogeny and structural and functional roles of an unusually diverse group of GH1-containing proteins in plants. The dearth of information and the lack of a coherent phylogeny-based nomenclature of these proteins can lead to misunderstandings regarding their identity and possible relationships, thereby hampering plant chromatin research. Based on published data and our in silico and high-throughput analyses, we propose a systematization and coherent nomenclature of GH1-containing proteins of Arabidopsis (Arabidopsis thaliana [L.] Heynh) that will be useful for both the identification and structural and functional characterization of homologous proteins from other plant species. PMID:28298478
Phylogeny-Based Systematization of Arabidopsis Proteins with Histone H1 Globular Domain.
Kotliński, Maciej; Knizewski, Lukasz; Muszewska, Anna; Rutowicz, Kinga; Lirski, Maciej; Schmidt, Anja; Baroux, Célia; Ginalski, Krzysztof; Jerzmanowski, Andrzej
2017-05-01
H1 (or linker) histones are basic nuclear proteins that possess an evolutionarily conserved nucleosome-binding globular domain, GH1. They perform critical functions in determining the accessibility of chromatin DNA to trans-acting factors. In most metazoan species studied so far, linker histones are highly heterogenous, with numerous nonallelic variants cooccurring in the same cells. The phylogenetic relationships among these variants as well as their structural and functional properties have been relatively well established. This contrasts markedly with the rather limited knowledge concerning the phylogeny and structural and functional roles of an unusually diverse group of GH1-containing proteins in plants. The dearth of information and the lack of a coherent phylogeny-based nomenclature of these proteins can lead to misunderstandings regarding their identity and possible relationships, thereby hampering plant chromatin research. Based on published data and our in silico and high-throughput analyses, we propose a systematization and coherent nomenclature of GH1-containing proteins of Arabidopsis ( Arabidopsis thaliana [L.] Heynh) that will be useful for both the identification and structural and functional characterization of homologous proteins from other plant species. © 2017 American Society of Plant Biologists. All Rights Reserved.
Chen, Yong; Shen, Yubang; Pandit, Narayan Prasad; Fu, Jianjun; Li, Da; Li, Jiale
2013-06-15
The peptide YY (PYY) is a 36 amino acid peptide involved in the food intake control in vertebrates. We have cloned and characterized a PYY gene from grass carp Ctenopharyngodon idellus. The full-length cDNA encodes a precursor protein of grass carp PYY (gcPYY) that consists of a putative 28-amino acid signal peptide, a 36-amino acid mature peptide, an amidation-proteolytic site, and a 30-amino acid carboxy-terminal extension. The gcPYY gene is comprised of 4 exons interspaced by 3 introns as seen in PYYs from other species. Amino acid alignment and gene structure comparison indicate that the structure of PYY is well preserved throughout vertebrate phylogeny. The tissue distribution and postprandial changes in gcPYY mRNA expression were evaluated by real-time PCR, which showed that the gcPYY is expressed abundantly in the central nervous system, with significantly increased expression following a single meal. During embryogenesis, the presence of gcPYY mRNA was detected in early developing embryos, and high expression levels were observed when most larvae completed their switch from endogenous nourishment to exogenous feeding. Reduced food intake by juveniles during a single meal after giving perpheral injection of gcPYY1-36 suggests a potentially important role of PYY in the food intake attenuation in grass carp. Copyright © 2013 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Morris, K. J.; Herrera, S.; Gubili, C.; Tyler, P. A.; Rogers, A.; Hauton, C.
2012-12-01
Despite being an abundant group of significant ecological importance the phylogenetic relationships of the Octocorallia remain poorly understood and very much understudied. We used 1132 bp of two mitochondrial protein-coding genes, nad2 and mtMutS (previously referred to as msh1), to construct a phylogeny for 161 octocoral specimens from the Atlantic, including both Isididae and non-Isididae species. We found that four clades were supported using a concatenated alignment. Two of these (A and B) were in general agreement with the of Holaxonia-Alcyoniina and Anthomastus-Corallium clades identified by previous work. The third and fourth clades represent a split of the Calcaxonia-Pennatulacea clade resulting in a clade containing the Pennatulacea and a small number of Isididae specimens and a second clade containing the remaining Calcaxonia. When individual genes were considered nad2 largely agreed with previous work with MtMutS also producing a fourth clade corresponding to a split of Isididae species from the Calcaxonia-Pennatulacea clade. It is expected these difference are a consequence of the inclusion of Isisdae species that have undergone a gene inversion in the mtMutS gene causing their separation in the MtMutS only tree. The fourth clade in the concatenated tree is also suspected to be a result of this gene inversion, as there were very few Isidiae species included in previous work tree and thus this separation would not be clearly resolved. A~larger phylogeny including both Isididae and non Isididae species is required to further resolve these clades.
Clavicipitaceous entomopathogens: New species of Metarhizium and a new genus Nigelia
USDA-ARS?s Scientific Manuscript database
In several surveys in the tropical forests in Thailand, specimens that looked morphologically similar to Metarhizium martialis and Cordyceps variegata, as well as Metarhizium species were collected and cultured in vitro. A combined phylogeny of several genes including the small (18S) and large (28S)...
Occultocarpon, a new monotypic genus of Gnomoniaceae on Alnus nepalensis from China
USDA-ARS?s Scientific Manuscript database
A new monotypic genus Occultocarpon and its species, O. ailaoshanense, was discovered on the bark of branches of Alnus nepalensis (Betulaceae) in Yunnan, China. A phylogeny based on three genes (LSU, rpb2, tef1-a) reveals that O. ailaoshanense belongs to the Gnomoniaceae (Diaporthales, Ascomycetes) ...
USDA-ARS?s Scientific Manuscript database
Secondary metabolite phenotypes in nine species of the Hamigera clade were analysed to assess their correlations to a multi-gene species-level phylogeny. High-pressure-liquid-chromatography-based chemical analysis revealed three distinctive patterns of secondary metabolite production: (1) the nine s...
USDA-ARS?s Scientific Manuscript database
Ascosphaera fungi are highly associated with social and solitary bees. This genus includes an important group of bee pathogens, the chalkbrood fungi, and thus proper identification of species and an understanding of their relationships are important. However, Ascosphaera spp. are often unculturable...
Wang, Jianli; Wu, Zhenying; Shen, Zhongbao; Bai, Zetao; Zhong, Peng; Ma, Lichao; Pan, Duofeng; Zhang, Ruibo; Li, Daoming; Zhang, Hailing; Fu, Chunxiang; Han, Guiqing; Guo, Changhong
2018-01-01
Auxin response factors (ARFs) have been reported to play vital roles during plant growth and development. In order to reveal specific functions related to vegetative organs in grasses, an in-depth study of the ARF gene family was carried out in switchgrass ( Panicum virgatum L.), a warm-season C4 perennial grass that is mostly used as bioenergy and animal feedstock. A total of 47 putative ARF genes ( PvARFs ) were identified in the switchgrass genome (2n = 4x = 36), 42 of which were anchored to the seven pairs of chromosomes and found to be unevenly distributed. Sixteen PvARFs were predicted to be potential targets of small RNAs (microRNA160 and 167). Phylogenetically speaking, PvARFs were divided into seven distinct subgroups based on the phylogeny, exon/intron arrangement, and conserved motif distribution. Moreover, 15 pairs of PvARFs have different temporal-spatial expression profiles in vegetative organs (2nd, 3rd, and 4th internode and leaves), which implies that different PvARFs have specific functions in switchgrass growth and development. In addition, at least 14 pairs of PvARFs respond to naphthylacetic acid (NAA) treatment, which might be helpful for us to study on auxin response in switchgrass. The comprehensive analysis, described here, will facilitate the future functional analysis of ARF genes in grasses.
Gene conversion as a secondary mechanism of short interspersed element (SINE) evolution
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kass, D.H.; Batzer, M.A.; Deininger, P.L.
The Alu repetitive family of short interspersed elements (SINEs) in primates can be subdivided into distinct subfamilies by specific diagnostic nucleotide changes. The older subfamilies are generally very abundant, while the younger subfamilies have fewer copies. Some of the youngest Alu elements are absent in the orthologous loci of nonhuman primates, indicative of recent retroposition events, the primary mode of SINE evolutions. PCR analysis of one young Alu subfamily (Sb2) member found in the low-density lipoprotein receptor gene apparently revealed the presence of this element in the green monkey, orangutan, gorilla, and chimpanzee genomes, as well as the human genome.more » However, sequence analysis of these genomes revealed a highly mutated, older, primate-specific Alu element was present at this position in the nonhuman primates. Comparison of the flanking DNA sequences upstream of this Alu insertion corresponded to evolution expected for standard primate phylogeny, but comparison of the Alu repeat sequences revealed that the human element departed from this phylogeny. The change in the human sequence apparently occurred by a gene conversion event only within the Alu element itself, converting it from one of the oldest to one of the youngest Alu subfamilies. Although gene conversions of Alu elements are clearly very rare, this finding shows that such events can occur and contribute to specific cases of SINE subfamily evolution.« less
A Burst of miRNA Innovation in the Early Evolution of Butterflies and Moths
Quah, Shan; Hui, Jerome H.L.; Holland, Peter W.H.
2015-01-01
MicroRNAs (miRNAs) are involved in posttranscriptional regulation of gene expression. Because several miRNAs are known to affect the stability or translation of developmental regulatory genes, the origin of novel miRNAs may have contributed to the evolution of developmental processes and morphology. Lepidoptera (butterflies and moths) is a species-rich clade with a well-established phylogeny and abundant genomic resources, thereby representing an ideal system in which to study miRNA evolution. We sequenced small RNA libraries from developmental stages of two divergent lepidopterans, Cameraria ohridella (Horse chestnut Leafminer) and Pararge aegeria (Speckled Wood butterfly), discovering 90 and 81 conserved miRNAs, respectively, and many species-specific miRNA sequences. Mapping miRNAs onto the lepidopteran phylogeny reveals rapid miRNA turnover and an episode of miRNA fixation early in lepidopteran evolution, implying that miRNA acquisition accompanied the early radiation of the Lepidoptera. One lepidopteran-specific miRNA gene, miR-2768, is located within an intron of the homeobox gene invected, involved in insect segmental and wing patterning. We identified cubitus interruptus (ci) as a likely direct target of miR-2768, and validated this suppression using a luciferase assay system. We propose a model by which miR-2768 modulates expression of ci in the segmentation pathway and in patterning of lepidopteran wing primordia. PMID:25576364
Neumann, Karsten; Michaux, Johan; Lebedev, Vladimir; Yigit, Nuri; Colak, Ercument; Ivanova, Natalia; Poltoraus, Andrey; Surov, Alexei; Markov, Georgi; Maak, Steffen; Neumann, Sabine; Gattermann, Rolf
2006-04-01
Despite some popularity of hamsters as pets and laboratory animals there is no reliable phylogeny of the subfamily Cricetinae available so far. Contradicting views exist not only about the actual number of species but also concerning the validity of several genera. We used partial DNA sequences of two mitochondrial (cytochrome b, 12S rRNA) and one partial nuclear gene (von Willebrand Factor exon 28) to provide a first gene tree of the Cricetinae based on 15 taxa comprising six genera. According to our data, Palaearctic hamsters fall into three distinct phylogenetic groups: Phodopus, Mesocricetus, and Cricetus-related species which evolved during the late Miocene about 7-12MY ago. Surprisingly, the genus Phodopus, which was previously thought to have appeared during the Pleistocene, forms the oldest clade. The largest number of extant hamster genera is found in a group of Cricetus-related hamsters. The genus Cricetulus itself proved to be not truly monophyletic with Cricetulus migratorius appearing more closely related to Tscherskia, Cricetus, and Allocricetulus. We propose to place the species within a new monotypic genus. Molecular clock calculations are not always in line with the dating of fossil records. DNA based divergence time estimates as well as taxonomic relationships demand a reevaluation of morphological characters previously used to identify fossils and extant hamsters.
Li, Juan; Zhu, Jin-long; Lou, Shi-di; Wang, Ping; Zhang, You-sen; Wang, Lin; Yin, Ruo-chun; Zhang, Ping-ping
2018-01-01
Abstract Coptotermes suzhouensis (Isoptera: Rhinotermitidae) is a significant subterranean termite pest of wooden structures and is widely distributed in southeastern China. The complete mitochondrial DNA sequence of C. suzhouensis was analyzed in this study. The mitogenome was a circular molecule of 15,764 bp in length, which contained 13 protein-coding genes (PCGs), 22 transfer RNA genes, two ribosomal RNA genes, and an A+T-rich region with a gene arrangement typical of Isoptera mitogenomes. All PCGs were initiated by ATN codons and terminated by complete termination codons (TAA), except COX2, ND5, and Cytb, which ended with an incomplete termination codon T. All tRNAs displayed a typical clover-leaf structure, except for tRNASer(AGN), which did not contain the stem-loop structure in the DHU arm. The A+T content (69.23%) of the A+T-rich region (949 bp) was higher than that of the entire mitogenome (65.60%), and two different sets of repeat units (A+B) were distributed in this region. Comparison of complete mitogenome sequences with those of Coptotermes formosanus indicated that the two taxa have very high genetic similarity. Forty-one representative termite species were used to construct phylogenetic trees by maximum likelihood, maximum parsimony, and Bayesian inference methods. The phylogenetic analyses also strongly supported (BPP, MLBP, and MPBP = 100%) that all C. suzhouensis and C. formosanus samples gathered into one clade with genetic distances between 0.000 and 0.002. This study provides molecular evidence for a more robust phylogenetic position of C. suzhouensis and inferrs that C. suzhouensis was the synonymy of C. formosanus. PMID:29718488
ITS2 sequence-structure phylogeny reveals diverse endophytic Pseudocercospora fungi on poplars.
Yan, Dong-Hui; Gao, Qian; Sun, Xiaoming; Song, Xiaoyu; Li, Hongchang
2018-04-01
For matching the new fungal nomenclature to abolish pleomorphic names for a fungus, a genus Pseudocercospora s. str. was suggested to host holomorphic Pseudocercosproa fungi. But the Pseudocercosproa fungi need extra phylogenetic loci to clarify their taxonomy and diversity for their existing and coming species. Internal transcribed spacer 2 (ITS2) secondary structures have been promising in charactering species phylogeny in plants, animals and fungi. In present study, a conserved model of ITS2 secondary structures was confirmed on fungi in Pseudocercospora s. str. genus using RNAshape program. The model has a typical eukaryotic four-helix ITS2 secondary structure. But a single U base occurred in conserved motif of U-U mismatch in Helix 2, and a UG emerged in UGGU motif in Helix 3 to Pseudocercospora fungi. The phylogeny analyses based on the ITS2 sequence-secondary structures with compensatory base change characterizations are able to delimit more species for Pseudocercospora s. str. than phylogenic inferences of traditional multi-loci alignments do. The model was employed to explore the diversity of endophytic Pseudocercospora fungi in poplar trees. The analysis results also showed that endophytic Pseudocercospora fungi were diverse in species and evolved a specific lineage in poplar trees. This work suggested that ITS2 sequence-structures could become as additionally significant loci for species phylogenetic and taxonomic studies on Pseudocerospora fungi, and that Pseudocercospora endophytes could be important roles to Pseudocercospora fungi's evolution and function in ecology.
Phylogeny and divergence of the pinnipeds (Carnivora: Mammalia) assessed using a multigene dataset
Higdon, Jeff W; Bininda-Emonds, Olaf RP; Beck, Robin MD; Ferguson, Steven H
2007-01-01
Background Phylogenetic comparative methods are often improved by complete phylogenies with meaningful branch lengths (e.g., divergence dates). This study presents a dated molecular supertree for all 34 world pinniped species derived from a weighted matrix representation with parsimony (MRP) supertree analysis of 50 gene trees, each determined under a maximum likelihood (ML) framework. Divergence times were determined by mapping the same sequence data (plus two additional genes) on to the supertree topology and calibrating the ML branch lengths against a range of fossil calibrations. We assessed the sensitivity of our supertree topology in two ways: 1) a second supertree with all mtDNA genes combined into a single source tree, and 2) likelihood-based supermatrix analyses. Divergence dates were also calculated using a Bayesian relaxed molecular clock with rate autocorrelation to test the sensitivity of our supertree results further. Results The resulting phylogenies all agreed broadly with recent molecular studies, in particular supporting the monophyly of Phocidae, Otariidae, and the two phocid subfamilies, as well as an Odobenidae + Otariidae sister relationship; areas of disagreement were limited to four more poorly supported regions. Neither the supertree nor supermatrix analyses supported the monophyly of the two traditional otariid subfamilies, supporting suggestions for the need for taxonomic revision in this group. Phocid relationships were similar to other recent studies and deeper branches were generally well-resolved. Halichoerus grypus was nested within a paraphyletic Pusa, although relationships within Phocina tend to be poorly supported. Divergence date estimates for the supertree were in good agreement with other studies and the available fossil record; however, the Bayesian relaxed molecular clock divergence date estimates were significantly older. Conclusion Our results join other recent studies and highlight the need for a re-evaluation of pinniped taxonomy, especially as regards the subfamilial classification of otariids and the generic nomenclature of Phocina. Even with the recent publication of new sequence data, the available genetic sequence information for several species, particularly those in Arctocephalus, remains very limited, especially for nuclear markers. However, resolution of parts of the tree will probably remain difficult, even with additional data, due to apparent rapid radiations. Our study addresses the lack of a recent pinniped phylogeny that includes all species and robust divergence dates for all nodes, and will therefore prove indispensable to comparative and macroevolutionary studies of this group of carnivores. PMID:17996107
New progress in snake mitochondrial gene rearrangement.
Chen, Nian; Zhao, Shujin
2009-08-01
To further understand the evolution of snake mitochondrial genomes, the complete mitochondrial DNA (mtDNA) sequences were determined for representative species from two snake families: the Many-banded krait, the Banded krait, the Chinese cobra, the King cobra, the Hundred-pace viper, the Short-tailed mamushi, and the Chain viper. Thirteen protein-coding genes, 22-23 tRNA genes, 2 rRNA genes, and 2 control regions were identified in these mtDNAs. Duplication of the control region and translocation of the tRNAPro gene were two notable features of the snake mtDNAs. These results from the gene rearrangement comparisons confirm the correctness of traditional classification schemes and validate the utility of comparing complete mtDNA sequences for snake phylogeny reconstruction.
Zhao, Yang; Zhou, Yuqiong; Jiang, Haiyang; Li, Xiaoyu; Gan, Defang; Peng, Xiaojian; Zhu, Suwen; Cheng, Beijiu
2011-01-01
Background Members of the homeodomain-leucine zipper (HD-Zip) gene family encode transcription factors that are unique to plants and have diverse functions in plant growth and development such as various stress responses, organ formation and vascular development. Although systematic characterization of this family has been carried out in Arabidopsis and rice, little is known about HD-Zip genes in maize (Zea mays L.). Methods and Findings In this study, we described the identification and structural characterization of HD-Zip genes in the maize genome. A complete set of 55 HD-Zip genes (Zmhdz1-55) were identified in the maize genome using Blast search tools and categorized into four classes (HD-Zip I-IV) based on phylogeny. Chromosomal location of these genes revealed that they are distributed unevenly across all 10 chromosomes. Segmental duplication contributed largely to the expansion of the maize HD-ZIP gene family, while tandem duplication was only responsible for the amplification of the HD-Zip II genes. Furthermore, most of the maize HD-Zip I genes were found to contain an overabundance of stress-related cis-elements in their promoter sequences. The expression levels of the 17 HD-Zip I genes under drought stress were also investigated by quantitative real-time PCR (qRT-PCR). All of the 17 maize HD-ZIP I genes were found to be regulated by drought stress, and the duplicated genes within a sister pair exhibited the similar expression patterns, suggesting their conserved functions during the process of evolution. Conclusions Our results reveal a comprehensive overview of the maize HD-Zip gene family and provide the first step towards the selection of Zmhdz genes for cloning and functional research to uncover their roles in maize growth and development. PMID:22164299
Zhao, Yang; Zhou, Yuqiong; Jiang, Haiyang; Li, Xiaoyu; Gan, Defang; Peng, Xiaojian; Zhu, Suwen; Cheng, Beijiu
2011-01-01
Members of the homeodomain-leucine zipper (HD-Zip) gene family encode transcription factors that are unique to plants and have diverse functions in plant growth and development such as various stress responses, organ formation and vascular development. Although systematic characterization of this family has been carried out in Arabidopsis and rice, little is known about HD-Zip genes in maize (Zea mays L.). In this study, we described the identification and structural characterization of HD-Zip genes in the maize genome. A complete set of 55 HD-Zip genes (Zmhdz1-55) were identified in the maize genome using Blast search tools and categorized into four classes (HD-Zip I-IV) based on phylogeny. Chromosomal location of these genes revealed that they are distributed unevenly across all 10 chromosomes. Segmental duplication contributed largely to the expansion of the maize HD-ZIP gene family, while tandem duplication was only responsible for the amplification of the HD-Zip II genes. Furthermore, most of the maize HD-Zip I genes were found to contain an overabundance of stress-related cis-elements in their promoter sequences. The expression levels of the 17 HD-Zip I genes under drought stress were also investigated by quantitative real-time PCR (qRT-PCR). All of the 17 maize HD-ZIP I genes were found to be regulated by drought stress, and the duplicated genes within a sister pair exhibited the similar expression patterns, suggesting their conserved functions during the process of evolution. Our results reveal a comprehensive overview of the maize HD-Zip gene family and provide the first step towards the selection of Zmhdz genes for cloning and functional research to uncover their roles in maize growth and development.
Covain, Raphaël; Fisch-Muller, Sonia; Oliveira, Claudio; Mol, Jan H; Montoya-Burgos, Juan I; Dray, Stéphane
2016-01-01
The Loricariinae belong to the Neotropical mailed catfish family Loricariidae, the most species-rich catfish family. Among loricariids, members of the Loricariinae are united by a long and flattened caudal peduncle and the absence of an adipose fin. Despite numerous studies of the Loricariidae, there is no comprehensive phylogeny of this morphologically highly diversified subfamily. To fill this gap, we present a molecular phylogeny of this group, including 350 representatives, based on the analysis of mitochondrial and nuclear genes (8426 positions). The resulting phylogeny indicates that Loricariinae are distributed into two sister tribes: Harttiini and Loricariini. The Harttiini tribe, as classically defined, constitutes a paraphyletic assemblage and is here restricted to the three genera Harttia, Cteniloricaria, and Harttiella. Two subtribes are distinguished within Loricariini: Farlowellina and Loricariina. Within Farlowellina, the nominal genus formed a paraphyletic group, as did Sturisoma and Sturisomatichthys. Within Loricariina, Loricaria, Crossoloricaria, and Apistoloricaria are also paraphyletic. To solve these issues, and given the lack of clear morphological diagnostic features, we propose here to synonymize several genera (Quiritixys with Harttia; East Andean members of Crossoloricaria, and Apistoloricaria with Rhadinoloricaria; Ixinandria, Hemiloricaria, Fonchiiichthys, and Leliella with Rineloricaria), to restrict others (Crossoloricaria, and Sturisomatichthys to the West Andean members, and Sturisoma to the East Andean species), and to revalidate the genus Proloricaria. Copyright © 2015 Elsevier Inc. All rights reserved.
Naughton, K M; O'Hara, T D; Appleton, B; Cisternas, P A
2014-09-01
In this paper we examine the phylogeny and biogeography of the temperate genera of the Ophiocomidae (Echinodermata: Ophiuroidea) which have an interesting asymmetrical anti-tropical distribution, with two genera (Ophiocomina and Ophiopteris) previously considered to have a separate species in both the North and South hemispheres, and the third (Clarkcoma) diversifying in the southern Australian/New Zealand region. Our phylogeny, generated from one mitochondrial and two nuclear markers, revealed that Ophiopteris is sister to a mixed Ophiocomina/Clarkcoma clade. Ophiocomina was polyphyletic, with O. nigra and an undescribed species from the South Atlantic Ocean sister to a clade including Clarkcoma species and O. australis. The phylogeny also revealed a number of recently diverged lineages occurring within Clarkcoma, some of which are considered to be cryptic species due to the similarity in morphology combined with the apparent absence of interbreeding in a sympatric distribution, while the status of others is less certain. The phylogeny provides support for two transequatorial events in the group under study. A molecular clock analysis places both events in the middle to late Miocene. The analysis excludes a tectonic vicariance hypothesis for the antitropical distribution associated with the breakup of Pangaea and also excludes the hypothesis of more recent gene flow associated with Plio/Pleistocene glacial cycling. Copyright © 2014 Elsevier Inc. All rights reserved.
Boo, Ga Hun; Le Gall, Line; Miller, Kathy Ann; Freshwater, D Wilson; Wernberg, Thomas; Terada, Ryuta; Yoon, Kyung Ju; Boo, Sung Min
2016-08-01
Although the Gelidiales are economically important marine red algae producing agar and agarose, the phylogeny of this order remains poorly resolved. The present study provides a molecular phylogeny based on a novel marker, nuclear-encoded CesA, plus plastid-encoded psaA, psbA, rbcL, and mitochondria-encoded cox1 from subsets of 107 species from all ten genera within the Gelidiales. Analyses of individual and combined datasets support the monophyly of three currently recognized families, and reveal a new clade. On the basis of these results, the new family Orthogonacladiaceae is described to accommodate Aphanta and a new genus Orthogonacladia that includes species previously classified as Gelidium madagascariense and Pterocladia rectangularis. Acanthopeltis is merged with Gelidium, which has nomenclatural priority. Nuclear-encoded CesA was found to be useful for improving the resolution of phylogenetic relationships within the Gelidiales and is likely to be valuable for the inference of phylogenetic relationship among other red algal taxa. Copyright © 2016 Elsevier Inc. All rights reserved.
Chen, Zhi-Teng; Zhao, Meng-Yuan; Xu, Cheng; Du, Yu-Zhou
2018-05-01
The infraorder Systellognatha is the most species-rich clade in the insect order Plecoptera and includes six families in two superfamilies: Pteronarcyoidea (Pteronarcyidae, Peltoperlidae, and Styloperlidae) and Perloidea (Perlidae, Perlodidae, and Chloroperlidae). To resolve the debatable phylogeny of Systellognatha, we carried out the first mitochondrial phylogenetic analysis covering all the six families, including three newly sequenced mitogenomes from two families (Perlodidae and Peltoperlidae) and 15 published mitogenomes. The three newly reported mitogenomes share conserved mitogenomic features with other sequenced stoneflies. For phylogenetic analyses, we assembled five datasets with two inference methods to assess their influence on topology and nodal support within Systellognatha. The results indicated that inclusion of the third codon positions of PCGs, exclusion of rRNA genes, the use of nucleotide datasets and Bayesian inference could improve the phylogenetic reconstruction of Systellognatha. The monophyly of Perloidea was supported in the mitochondrial phylogeny, but Pteronarcyoidea was recovered as paraphyletic and remained controversial. In this mitochondrial phylogenetic study, the relationships within Systellognatha were recovered as (((Perlidae + (Perlodidae + Chloroperlidae)) + (Pteronarcyidae + Styloperlidae)) + Peltoperlidae). Copyright © 2018 Elsevier B.V. All rights reserved.
Molecular phylogeny of choanoflagellates, the sister group to Metazoa
Carr, M.; Leadbeater, B. S. C.; Hassan, R.; Nelson, M.; Baldauf, S. L.
2008-01-01
Choanoflagellates are single-celled aquatic flagellates with a unique morphology consisting of a cell with a single flagellum surrounded by a “collar” of microvilli. They have long interested evolutionary biologists because of their striking resemblance to the collared cells (choanocytes) of sponges. Molecular phylogeny has confirmed a close relationship between choanoflagellates and Metazoa, and the first choanoflagellate genome sequence has recently been published. However, molecular phylogenetic studies within choanoflagellates are still extremely limited. Thus, little is known about choanoflagellate evolution or the exact nature of the relationship between choanoflagellates and Metazoa. We have sequenced four genes from a broad sampling of the morphological diversity of choanoflagellates including most species currently available in culture. Phylogenetic analyses of these sequences, alone and in combination, reject much of the traditional taxonomy of the group. The molecular data also strongly support choanoflagellate monophyly rejecting proposals that Metazoa were derived from a true choanoflagellate ancestor. Mapping of a complementary matrix of morphological and ecological traits onto the phylogeny allows a reinterpretation of choanoflagellate character evolution and predicts the nature of their last common ancestor. PMID:18922774
Evolution of Electrogenic Ammonium Transporters (AMTs)
McDonald, Tami R.; Ward, John M.
2016-03-31
The ammonium transporter gene family consists of three main clades, AMT, MEP, and Rh. The evolutionary history of the AMT/MEP/Rh gene family is characterized by multiple horizontal gene transfer events, gene family expansion and contraction, and gene loss; thus the gene tree for this family of transporters is unlike the organismal tree. The genomes of angiosperms contain genes for both electrogenic and electroneutral ammonium transporters, but it is not clear how far back in the land plant lineage electrogenic ammonium transporters occur. Here, we place Marchantia polymorpha ammonium transporters in the AMT/MEP/Rh phylogeny and we show that AMTs from themore » liverwort M. polymorpha are electrogenic. This information suggests that electrogenic ammonium transport evolved at least as early as the divergence of bryophytes in the land plant lineage.« less
Evolution of Electrogenic Ammonium Transporters (AMTs)
DOE Office of Scientific and Technical Information (OSTI.GOV)
McDonald, Tami R.; Ward, John M.
The ammonium transporter gene family consists of three main clades, AMT, MEP, and Rh. The evolutionary history of the AMT/MEP/Rh gene family is characterized by multiple horizontal gene transfer events, gene family expansion and contraction, and gene loss; thus the gene tree for this family of transporters is unlike the organismal tree. The genomes of angiosperms contain genes for both electrogenic and electroneutral ammonium transporters, but it is not clear how far back in the land plant lineage electrogenic ammonium transporters occur. Here, we place Marchantia polymorpha ammonium transporters in the AMT/MEP/Rh phylogeny and we show that AMTs from themore » liverwort M. polymorpha are electrogenic. This information suggests that electrogenic ammonium transport evolved at least as early as the divergence of bryophytes in the land plant lineage.« less
Complete Chloroplast Genome Sequences of Four Meliaceae Species and Comparative Analyses
Mader, Malte; Pakull, Birte; Blanc-Jolivet, Céline; Paulini-Drewes, Maike; Bouda, Zoéwindé Henri-Noël; Degen, Bernd; Small, Ian
2018-01-01
The Meliaceae family mainly consists of trees and shrubs with a pantropical distribution. In this study, the complete chloroplast genomes of four Meliaceae species were sequenced and compared with each other and with the previously published Azadirachta indica plastome. The five plastomes are circular and exhibit a quadripartite structure with high conservation of gene content and order. They include 130 genes encoding 85 proteins, 37 tRNAs and 8 rRNAs. Inverted repeat expansion resulted in a duplication of rps19 in the five Meliaceae species, which is consistent with that in many other Sapindales, but different from many other rosids. Compared to Azadirachta indica, the four newly sequenced Meliaceae individuals share several large deletions, which mainly contribute to the decreased genome sizes. A whole-plastome phylogeny supports previous findings that the four species form a monophyletic sister clade to Azadirachta indica within the Meliaceae. SNPs and indels identified in all complete Meliaceae plastomes might be suitable targets for the future development of genetic markers at different taxonomic levels. The extended analysis of SNPs in the matK gene led to the identification of four potential Meliaceae-specific SNPs as a basis for future validation and marker development. PMID:29494509
Dourado, Manuella Nóbrega; Andreote, Fernando Dini; Dini-Andreote, Francisco; Conti, Raphael; Araújo, Janete Magali; Araújo, Welington Luiz
2012-01-01
The genus Methylobacterium comprises pink-pigmented facultative methylotrophic (PPFM) bacteria, known to be an important plant-associated bacterial group. Species of this group, described as plant-nodulating, have the dual capacity of producing cytokinin and enzymes, such as pectinase and cellulase, involved in systemic resistance induction and nitrogen fixation under specific plant environmental conditions. The aim hereby was to evaluate the phylogenetic distribution of Methylobacterium spp. isolates from different host plants. Thus, a comparative analysis between sequences from structural (16S rRNA) and functional mxaF (which codifies for a subunit of the enzyme methanol dehydrogenase) ubiquitous genes, was undertaken. Notably, some Methylobacterium spp. isolates are generalists through colonizing more than one host plant, whereas others are exclusively found in certain specific plant-species. Congruency between phylogeny and specific host inhabitance was higher in the mxaF gene than in the 16S rRNA, a possible indication of function-based selection in this niche. Therefore, in a first stage, plant colonization by Methylobacterium spp. could represent generalist behavior, possibly related to microbial competition and adaptation to a plant environment. Otherwise, niche-specific colonization is apparently impelled by the host plant. PMID:22481887
Dourado, Manuella Nóbrega; Andreote, Fernando Dini; Dini-Andreote, Francisco; Conti, Raphael; Araújo, Janete Magali; Araújo, Welington Luiz
2012-01-01
The genus Methylobacterium comprises pink-pigmented facultative methylotrophic (PPFM) bacteria, known to be an important plant-associated bacterial group. Species of this group, described as plant-nodulating, have the dual capacity of producing cytokinin and enzymes, such as pectinase and cellulase, involved in systemic resistance induction and nitrogen fixation under specific plant environmental conditions. The aim hereby was to evaluate the phylogenetic distribution of Methylobacterium spp. isolates from different host plants. Thus, a comparative analysis between sequences from structural (16S rRNA) and functional mxaF (which codifies for a subunit of the enzyme methanol dehydrogenase) ubiquitous genes, was undertaken. Notably, some Methylobacterium spp. isolates are generalists through colonizing more than one host plant, whereas others are exclusively found in certain specific plant-species. Congruency between phylogeny and specific host inhabitance was higher in the mxaF gene than in the 16S rRNA, a possible indication of function-based selection in this niche. Therefore, in a first stage, plant colonization by Methylobacterium spp. could represent generalist behavior, possibly related to microbial competition and adaptation to a plant environment. Otherwise, niche-specific colonization is apparently impelled by the host plant.
Phylogeny and a structural model of plant MHX transporters
2013-01-01
Background The Arabidopsis thaliana MHX gene (AtMHX) encodes a Mg2+/H+ exchanger. Among non-plant proteins, AtMHX showed the highest similarity to mammalian Na+/Ca2+ exchanger (NCX) transporters, which are part of the Ca2+/cation (CaCA) exchanger superfamily. Results Sequences showing similarity to AtMHX were searched in the databases or sequenced from cDNA clones. Phylogenetic analysis showed that the MHX family is limited to plants, and constitutes a sixth family within the CaCA superfamily. Some plants include, besides a full MHX gene, partial MHX-related sequences. More than one full MHX gene was currently identified only in Oryza sativa and Mimulus guttatus, but an EST for more than one MHX was identified only in M. guttatus. MHX genes are not present in the currently available chlorophyte genomes. The prevalence of upstream ORFs in MHX genes is much higher than in most plant genes, and can limit their expression. A structural model of the MHXs, based on the resolved structure of NCX1, implies that the MHXs include nine transmembrane segments. The MHXs and NCXs share 32 conserved residues, including a GXG motif implicated in the formation of a tight-turn in a reentrant-loop. Three residues differ between all MHX and NCX proteins. Altered mobility under reducing and non-reducing conditions suggests the presence of an intramolecular disulfide-bond in AtMHX. Conclusions The absence of MHX genes in non-plant genomes and in the currently available chlorophyte genomes, and the presence of an NCX in Chlamydomonas, are consistent with the suggestion that the MHXs evolved from the NCXs after the split of the chlorophyte and streptophyte lineages of the plant kingdom. The MHXs underwent functional diploidization in most plant species. De novo duplication of MHX occurred in O. sativa before the split between the Indica and Japonica subspecies, and was apparently followed by translocation of one MHX paralog from chromosome 2 to chromosome 11 in Japonica. The structural analysis presented and the identification of elements that differ between the MHXs and the NCXs, or between the MHXs of specific plant groups, can contribute to clarification of the structural basis of the function and ion selectivity of MHX transporters. PMID:23634958
Lyubetsky, Vassily; Gershgorin, Roman; Gorbunov, Konstantin
2017-12-06
Chromosome structure is a very limited model of the genome including the information about its chromosomes such as their linear or circular organization, the order of genes on them, and the DNA strand encoding a gene. Gene lengths, nucleotide composition, and intergenic regions are ignored. Although highly incomplete, such structure can be used in many cases, e.g., to reconstruct phylogeny and evolutionary events, to identify gene synteny, regulatory elements and promoters (considering highly conserved elements), etc. Three problems are considered; all assume unequal gene content and the presence of gene paralogs. The distance problem is to determine the minimum number of operations required to transform one chromosome structure into another and the corresponding transformation itself including the identification of paralogs in two structures. We use the DCJ model which is one of the most studied combinatorial rearrangement models. Double-, sesqui-, and single-operations as well as deletion and insertion of a chromosome region are considered in the model; the single ones comprise cut and join. In the reconstruction problem, a phylogenetic tree with chromosome structures in the leaves is given. It is necessary to assign the structures to inner nodes of the tree to minimize the sum of distances between terminal structures of each edge and to identify the mutual paralogs in a fairly large set of structures. A linear algorithm is known for the distance problem without paralogs, while the presence of paralogs makes it NP-hard. If paralogs are allowed but the insertion and deletion operations are missing (and special constraints are imposed), the reduction of the distance problem to integer linear programming is known. Apparently, the reconstruction problem is NP-hard even in the absence of paralogs. The problem of contigs is to find the optimal arrangements for each given set of contigs, which also includes the mutual identification of paralogs. We proved that these problems can be reduced to integer linear programming formulations, which allows an algorithm to redefine the problems to implement a very special case of the integer linear programming tool. The results were tested on synthetic and biological samples. Three well-known problems were reduced to a very special case of integer linear programming, which is a new method of their solutions. Integer linear programming is clearly among the main computational methods and, as generally accepted, is fast on average; in particular, computation systems specifically targeted at it are available. The challenges are to reduce the size of the corresponding integer linear programming formulations and to incorporate a more detailed biological concept in our model of the reconstruction.
Bachvaroff, Tsvetan R.; Gornik, Sebastian G.; Concepcion, Gregory T.; Waller, Ross F.; Mendez, Gregory S.; Lippmeier, J. Casey; Delwiche, Charles F.
2014-01-01
The alveolates are composed of three major lineages, the ciliates, dinoflagellates, and apicomplexans. Together these ‘protist’ taxa play key roles in primary production and ecology, as well as in illness of humans and other animals. The interface between the dinoflagellate and apicomplexan clades has been an area of recent discovery, blurring the distinction between these two clades. Moreover, phylogenetic analysis has yet to determine the position of basal dinoflagellate clades hence the deepest branches of the dinoflagellate tree currently remain unresolved. Large-scale mRNA sequencing was applied to 11 species of dinoflagellates, including strains of the syndinean genera Hematodinium and Amoebophrya, parasites of crustaceans and dinoflagellates, respectively, to optimize and update the dinoflagellate tree. From the transcriptome-scale data a total of 73 ribosomal protein-coding genes were selected for phylogeny. After individual gene orthology assessment, the genes were concatenated into a >15,000 amino acid alignment with 76 taxa from dinoflagellates, apicomplexans, ciliates, and the outgroup heterokonts. Overall the tree was well resolved and supported, when the data was subsampled with gblocks or constraint trees were tested with the approximately unbiased test. The deepest branches of the dinoflagellate tree can now be resolved with strong support, and provides a clearer view of the evolution of the distinctive traits of dinoflagellates. PMID:24135237
McGowen, Michael R
2011-09-01
Oceanic dolphins (Delphinidae) are the product of a rapid radiation that yielded ∼36 extant species of small to medium-sized cetaceans that first emerged in the Late Miocene. Although they are a charismatic group of organisms that have become poster children for marine conservation, many phylogenetic relationships within Delphinidae remain elusive due to the slow molecular evolution of the group and the difficulty of resolving short branches from successive cladogenic events. Here I combine existing and newly generated sequences from four mitochondrial (mt) genes and 20 nuclear (nu) genes to reconstruct a well-supported phylogenetic hypothesis for Delphinidae. This study compares maximum-likelihood and Bayesian inference methods of several data sets including mtDNA, combined nuDNA, gene trees of individual nuDNA loci, and concatenated mtDNA+nuDNA. In addition, I contrast these standard phylogenetic analyses with the species tree reconstruction method of Bayesian concordance analysis (BCA). Despite finding discordance between mtDNA and individual nuDNA loci, the concatenated matrix recovers a completely resolved and robustly supported phylogeny that is also broadly congruent with BCA trees. This study strongly supports groupings such as Delphininae, Lissodelphininae, Globicephalinae, Sotalia+Delphininae, Steno+Orcaella+Globicephalinae, and Leucopleurus acutus, Lagenorhynchus albirostris, and Orcinus orca as basal delphinid taxa. Copyright © 2011 Elsevier Inc. All rights reserved.
Liu, Jingjing; Sun, Faqian; Wang, Liang; Ju, Xi; Wu, Weixiang; Chen, Yingxu
2014-01-01
Methane can be used as an alternative carbon source in biological denitrification because it is nontoxic, widely available and relatively inexpensive. A microbial consortium involved in methane oxidation coupled to denitrification (MOD) was enriched with nitrite and nitrate as electron acceptors under micro-aerobic conditions. The 16S rRNA gene combined with pmoA phylogeny of methanotrophs and nirK phylogeny of denitrifiers were analysed to reveal the dominant microbial populations and functional microorganisms. Real-time quantitative polymerase chain reaction results showed high numbers of methanotrophs and denitrifiers in the enriched consortium. The 16S rRNA gene clone library revealed that Methylococcaceae and Methylophilaceae were the dominant populations in the MOD ecosystem. Phylogenetic analyses of pmoA gene clone libraries indicated that all methanotrophs belonged to Methylococcaceae, a type I methanotroph employing the ribulose monophosphate pathway for methane oxidation. Methylotrophic denitrifiers of the Methylophilaceae that can utilize organic intermediates (i.e. formaldehyde, citrate and acetate) released from the methanotrophs played a vital role in aerobic denitrification. This study is the first report to confirm micro-aerobic denitrification and to make phylogenetic and functional assignments for some members of the microbial assemblages involved in MOD. © 2013 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
The Complete Chloroplast Genome of Wild Rice (Oryza minuta) and Its Comparison to Related Species.
Asaf, Sajjad; Waqas, Muhammad; Khan, Abdul L; Khan, Muhammad A; Kang, Sang-Mo; Imran, Qari M; Shahzad, Raheem; Bilal, Saqib; Yun, Byung-Wook; Lee, In-Jung
2017-01-01
Oryza minuta , a tetraploid wild relative of cultivated rice (family Poaceae), possesses a BBCC genome and contains genes that confer resistance to bacterial blight (BB) and white-backed (WBPH) and brown (BPH) plant hoppers. Based on the importance of this wild species, this study aimed to understand the phylogenetic relationships of O. minuta with other Oryza species through an in-depth analysis of the composition and diversity of the chloroplast (cp) genome. The analysis revealed a cp genome size of 135,094 bp with a typical quadripartite structure and consisting of a pair of inverted repeats separated by small and large single copies, 139 representative genes, and 419 randomly distributed microsatellites. The genomic organization, gene order, GC content and codon usage are similar to those of typical angiosperm cp genomes. Approximately 30 forward, 28 tandem and 20 palindromic repeats were detected in the O . minuta cp genome. Comparison of the complete O. minuta cp genome with another eleven Oryza species showed a high degree of sequence similarity and relatively high divergence of intergenic spacers. Phylogenetic analyses were conducted based on the complete genome sequence, 65 shared genes and matK gene showed same topologies and O. minuta forms a single clade with parental O. punctata . Thus, the complete O . minuta cp genome provides interesting insights and valuable information that can be used to identify related species and reconstruct its phylogeny.
Genome-Scale Phylogeny of the Alphavirus Genus Suggests a Marine Origin
Palacios, G.; Tesh, R. B.; Savji, N.; Guzman, H.; Sherman, M.; Weaver, S. C.; Lipkin, W. I.
2012-01-01
The genus Alphavirus comprises a diverse group of viruses, including some that cause severe disease. Using full-length sequences of all known alphaviruses, we produced a robust and comprehensive phylogeny of the Alphavirus genus, presenting a more complete evolutionary history of these viruses compared to previous studies based on partial sequences. Our phylogeny suggests the origin of the alphaviruses occurred in the southern oceans and spread equally through the Old and New World. Since lice appear to be involved in aquatic alphavirus transmission, it is possible that we are missing a louse-borne branch of the alphaviruses. Complete genome sequencing of all members of the genus also revealed conserved residues forming the structural basis of the E1 and E2 protein dimers. PMID:22190718