Science.gov

Sample records for genome comparisons reveal

  1. Genome Comparisons Reveal a Dominant Mechanism of Chromosome Number Reduction in Grasses and Accelerated Genome Evolution in Triticeae

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Single nucleotide polymorphism was employed in the construction of a high-resolution, expressed sequence tag (EST) map of Aegilops tauschii, the diploid source of the wheat D genome. Comparison of the map with the rice and sorghum genome sequences revealed 50 inversions and translocations; 2, 8, and...

  2. The Genomic Tree as Revealed from Whole Proteome Comparisons

    PubMed Central

    Tekaia, Fredj; Lazcano, Antonio; Dujon, Bernard

    1999-01-01

    The availability of a number of complete cellular genome sequences allows the development of organisms’ classification, taking into account their genome content, the loss or acquisition of genes, and overall gene similarities as signatures of common ancestry. On the basis of correspondence analysis and hierarchical classification methods, a methodological framework is introduced here for the classification of the available 20 completely sequenced genomes and partial information for Schizosaccharomyces pombe, Homo sapiens, and Mus musculus. The outcome of such an analysis leads to a classification of genomes that we call a genomic tree. Although these trees are phenograms, they carry with them strong phylogenetic signatures and are remarkably similar to 16S-like rRNA-based phylogenies. Our results suggest that duplication and deletion events that took place through evolutionary time were globally similar in related organisms. The genomic trees presented here place the Archaea in the proximity of the Bacteria when the whole gene content of each organism is considered, and when ancestral gene duplications are eliminated. Genomic trees represent an additional approach for the understanding of evolution at the genomic level and may contribute to the proper assessment of the evolutionary relationships between extant species. PMID:10400922

  3. Pathogenicity determinants in smut fungi revealed by genome comparison.

    PubMed

    Schirawski, Jan; Mannhaupt, Gertrud; Münch, Karin; Brefort, Thomas; Schipper, Kerstin; Doehlemann, Gunther; Di Stasio, Maurizio; Rössel, Nicole; Mendoza-Mendoza, Artemio; Pester, Doris; Müller, Olaf; Winterberg, Britta; Meyer, Elmar; Ghareeb, Hassan; Wollenberg, Theresa; Münsterkötter, Martin; Wong, Philip; Walter, Mathias; Stukenbrock, Eva; Güldener, Ulrich; Kahmann, Regine

    2010-12-10

    Biotrophic pathogens, such as the related maize pathogenic fungi Ustilago maydis and Sporisorium reilianum, establish an intimate relationship with their hosts by secreting protein effectors. Because secreted effectors interacting with plant proteins should rapidly evolve, we identified variable genomic regions by sequencing the genome of S. reilianum and comparing it with the U. maydis genome. We detected 43 regions of low sequence conservation in otherwise well-conserved syntenic genomes. These regions primarily encode secreted effectors and include previously identified virulence clusters. By deletion analysis in U. maydis, we demonstrate a role in virulence for four previously unknown diversity regions. This highlights the power of comparative genomics of closely related species for identification of virulence determinants.

  4. Culture Independent Genomic Comparisons Reveal Environmental Adaptations for Altiarchaeales

    PubMed Central

    Baker, Brett J.; Probst, Alexander J.; Podar, Mircea; Lloyd, Karen G.

    2016-01-01

    The recently proposed candidatus order Altiarchaeales remains an uncultured archaeal lineage composed of genetically diverse, globally widespread organisms frequently observed in anoxic subsurface environments. In spite of 15 years of studies on the psychrophilic biofilm-producing Candidatus Altiarchaeum hamiconexum and its close relatives, very little is known about the phylogenetic and functional diversity of the widespread free-living marine members of this taxon. From methanogenic sediments in the White Oak River Estuary, NC, USA, we sequenced a single cell amplified genome (SAG), WOR_SM1_SCG, and used it to identify and refine two high-quality genomes from metagenomes, WOR_SM1_79 and WOR_SM1_86-2, from the same site. These three genomic reconstructions form a monophyletic group, which also includes three previously published genomes from metagenomes from terrestrial springs and a SAG from Sakinaw Lake in a group previously designated as pMC2A384. A synapomorphic mutation in the Altiarchaeales tRNA synthetase β subunit, pheT, caused the protein to be encoded as two subunits at non-adjacent loci. Consistent with the terrestrial spring clades, our estuarine genomes contained a near-complete autotrophic metabolism, H2 or CO as potential electron donors, a reductive acetyl-CoA pathway for carbon fixation, and methylotroph-like NADP(H)-dependent dehydrogenase. Phylogenies based on 16S rRNA genes and concatenated conserved proteins identified two distinct sub-clades of Altiarchaeales, Alti-1 populated by organisms from actively flowing springs, and Alti-2 which was more widespread, diverse, and not associated with visible mats. The core Alti-1 genome suggested Alti-1 is adapted for the stream environment with lipopolysaccharide production capacity and extracellular hami structures. The core Alti-2 genome suggested members of this clade are free-living with distinct mechanisms for energy maintenance, motility, osmoregulation, and sulfur redox reactions. These data

  5. Culture independent genomic comparisons reveal environmental adaptations for Altiarchaeales

    DOE PAGES

    Bird, Jordan T.; Baker, Brett J.; Probst, Alexander J.; ...

    2016-08-05

    The recently proposed candidatus order Altiarchaeales remains an uncultured archaeal lineage composed of genetically diverse, globally widespread organisms frequently observed in anoxic subsurface environments. In spite of 15 years of studies on the psychrophilic biofilm-producing Candidatus Altiarchaeum hamiconexum and its close relatives, very little is known about the phylogenetic and functional diversity of the widespread free-living marine members of this taxon. From methanogenic sediments in the White Oak River Estuary, NC, USA, we sequenced a single cell amplified genome (SAG), WOR_SM1_SCG, and used it to identify and refine two high-quality genomes from metagenomes, WOR_SM1_79 and WOR_SM1_86-2, from the same site.more » These three genomic reconstructions form a monophyletic group, which also includes three previously published genomes from metagenomes from terrestrial springs and a SAG from Sakinaw Lake in a group previously designated as pMC2A384. A synapomorphic mutation in the Altiarchaeales tRNA synthetase β subunit, pheT, caused the protein to be encoded as two subunits at non-adjacent loci. Consistent with the terrestrial spring clades, our estuarine genomes contained a near-complete autotrophic metabolism, H2 or CO as potential electron donors, a reductive acetyl-CoA pathway for carbon fixation, and methylotroph-like NADP(H)-dependent dehydrogenase. Phylogenies based on 16S rRNA genes and concatenated conserved proteins identified two distinct sub-clades of Altiarchaeales, Alti-1 populated by organisms from actively flowing springs, and Alti-2 which was more widespread, diverse, and not associated with visible mats. The core Alti-1 genome suggested Alti-1 is adapted for the stream environment with lipopolysaccharide production capacity and extracellular hami structures. The core Alti-2 genome suggested members of this clade are free-living with distinct mechanisms for energy maintenance, motility, osmoregulation, and sulfur redox reactions. These

  6. Culture independent genomic comparisons reveal environmental adaptations for Altiarchaeales

    SciTech Connect

    Bird, Jordan T.; Baker, Brett J.; Probst, Alexander J.; Podar, Mircea; Lloyd, Karen G.

    2016-08-05

    The recently proposed candidatus order Altiarchaeales remains an uncultured archaeal lineage composed of genetically diverse, globally widespread organisms frequently observed in anoxic subsurface environments. In spite of 15 years of studies on the psychrophilic biofilm-producing Candidatus Altiarchaeum hamiconexum and its close relatives, very little is known about the phylogenetic and functional diversity of the widespread free-living marine members of this taxon. From methanogenic sediments in the White Oak River Estuary, NC, USA, we sequenced a single cell amplified genome (SAG), WOR_SM1_SCG, and used it to identify and refine two high-quality genomes from metagenomes, WOR_SM1_79 and WOR_SM1_86-2, from the same site. These three genomic reconstructions form a monophyletic group, which also includes three previously published genomes from metagenomes from terrestrial springs and a SAG from Sakinaw Lake in a group previously designated as pMC2A384. A synapomorphic mutation in the Altiarchaeales tRNA synthetase β subunit, pheT, caused the protein to be encoded as two subunits at non-adjacent loci. Consistent with the terrestrial spring clades, our estuarine genomes contained a near-complete autotrophic metabolism, H2 or CO as potential electron donors, a reductive acetyl-CoA pathway for carbon fixation, and methylotroph-like NADP(H)-dependent dehydrogenase. Phylogenies based on 16S rRNA genes and concatenated conserved proteins identified two distinct sub-clades of Altiarchaeales, Alti-1 populated by organisms from actively flowing springs, and Alti-2 which was more widespread, diverse, and not associated with visible mats. The core Alti-1 genome suggested Alti-1 is adapted for the stream environment with lipopolysaccharide production capacity and extracellular hami structures. The core Alti-2 genome suggested members of this clade are free-living with distinct mechanisms for energy maintenance, motility, osmoregulation, and sulfur redox reactions

  7. Comparison of Francisella tularensis genomes reveals evolutionary events associated with the emergence of human pathogenic strains

    PubMed Central

    Rohmer, Laurence; Fong, Christine; Abmayr, Simone; Wasnick, Michael; Larson Freeman, Theodore J; Radey, Matthew; Guina, Tina; Svensson, Kerstin; Hayden, Hillary S; Jacobs, Michael; Gallagher, Larry A; Manoil, Colin; Ernst, Robert K; Drees, Becky; Buckley, Danielle; Haugen, Eric; Bovee, Donald; Zhou, Yang; Chang, Jean; Levy, Ruth; Lim, Regina; Gillett, Will; Guenthener, Don; Kang, Allison; Shaffer, Scott A; Taylor, Greg; Chen, Jinzhi; Gallis, Byron; D'Argenio, David A; Forsman, Mats; Olson, Maynard V; Goodlett, David R; Kaul, Rajinder; Miller, Samuel I; Brittnacher, Mitchell J

    2007-01-01

    Background Francisella tularensis subspecies tularensis and holarctica are pathogenic to humans, whereas the two other subspecies, novicida and mediasiatica, rarely cause disease. To uncover the factors that allow subspecies tularensis and holarctica to be pathogenic to humans, we compared their genome sequences with the genome sequence of Francisella tularensis subspecies novicida U112, which is nonpathogenic to humans. Results Comparison of the genomes of human pathogenic Francisella strains with the genome of U112 identifies genes specific to the human pathogenic strains and reveals pseudogenes that previously were unidentified. In addition, this analysis provides a coarse chronology of the evolutionary events that took place during the emergence of the human pathogenic strains. Genomic rearrangements at the level of insertion sequences (IS elements), point mutations, and small indels took place in the human pathogenic strains during and after differentiation from the nonpathogenic strain, resulting in gene inactivation. Conclusion The chronology of events suggests a substantial role for genetic drift in the formation of pseudogenes in Francisella genomes. Mutations that occurred early in the evolution, however, might have been fixed in the population either because of evolutionary bottlenecks or because they were pathoadaptive (beneficial in the context of infection). Because the structure of Francisella genomes is similar to that of the genomes of other emerging or highly pathogenic bacteria, this evolutionary scenario may be shared by pathogens from other species. PMID:17550600

  8. Whole genome comparison of a large collection of mycobacteriophages reveals a continuum of phage genetic diversity.

    PubMed

    Pope, Welkin H; Bowman, Charles A; Russell, Daniel A; Jacobs-Sera, Deborah; Asai, David J; Cresawn, Steven G; Jacobs, William R; Hendrix, Roger W; Lawrence, Jeffrey G; Hatfull, Graham F

    2015-04-28

    The bacteriophage population is large, dynamic, ancient, and genetically diverse. Limited genomic information shows that phage genomes are mosaic, and the genetic architecture of phage populations remains ill-defined. To understand the population structure of phages infecting a single host strain, we isolated, sequenced, and compared 627 phages of Mycobacterium smegmatis. Their genetic diversity is considerable, and there are 28 distinct genomic types (clusters) with related nucleotide sequences. However, amino acid sequence comparisons show pervasive genomic mosaicism, and quantification of inter-cluster and intra-cluster relatedness reveals a continuum of genetic diversity, albeit with uneven representation of different phages. Furthermore, rarefaction analysis shows that the mycobacteriophage population is not closed, and there is a constant influx of genes from other sources. Phage isolation and analysis was performed by a large consortium of academic institutions, illustrating the substantial benefits of a disseminated, structured program involving large numbers of freshman undergraduates in scientific discovery.

  9. Whole genome comparison of a large collection of mycobacteriophages reveals a continuum of phage genetic diversity

    PubMed Central

    Pope, Welkin H; Bowman, Charles A; Russell, Daniel A; Jacobs-Sera, Deborah; Asai, David J; Cresawn, Steven G; Jacobs, William R; Hendrix, Roger W; Lawrence, Jeffrey G; Hatfull, Graham F; Abbazia, Patrick; Ababio, Amma; Adam, Naazneen

    2015-01-01

    The bacteriophage population is large, dynamic, ancient, and genetically diverse. Limited genomic information shows that phage genomes are mosaic, and the genetic architecture of phage populations remains ill-defined. To understand the population structure of phages infecting a single host strain, we isolated, sequenced, and compared 627 phages of Mycobacterium smegmatis. Their genetic diversity is considerable, and there are 28 distinct genomic types (clusters) with related nucleotide sequences. However, amino acid sequence comparisons show pervasive genomic mosaicism, and quantification of inter-cluster and intra-cluster relatedness reveals a continuum of genetic diversity, albeit with uneven representation of different phages. Furthermore, rarefaction analysis shows that the mycobacteriophage population is not closed, and there is a constant influx of genes from other sources. Phage isolation and analysis was performed by a large consortium of academic institutions, illustrating the substantial benefits of a disseminated, structured program involving large numbers of freshman undergraduates in scientific discovery. DOI: http://dx.doi.org/10.7554/eLife.06416.001 PMID:25919952

  10. Gain and Loss of Phototrophic Genes Revealed by Comparison of Two Citromicrobium Bacterial Genomes

    PubMed Central

    Zheng, Qiang; Zhang, Rui; Fogg, Paul C. M.; Beatty, J. Thomas; Wang, Yu; Jiao, Nianzhi

    2012-01-01

    Proteobacteria are thought to have diverged from a phototrophic ancestor, according to the scattered distribution of phototrophy throughout the proteobacterial clade, and so the occurrence of numerous closely related phototrophic and chemotrophic microorganisms may be the result of the loss of genes for phototrophy. A widespread form of bacterial phototrophy is based on the photochemical reaction center, encoded by puf and puh operons that typically are in a ‘photosynthesis gene cluster’ (abbreviated as the PGC) with pigment biosynthesis genes. Comparison of two closely related Citromicrobial genomes (98.1% sequence identity of complete 16S rRNA genes), Citromicrobium sp. JL354, which contains two copies of reaction center genes, and Citromicrobium strain JLT1363, which is chemotrophic, revealed evidence for the loss of phototrophic genes. However, evidence of horizontal gene transfer was found in these two bacterial genomes. An incomplete PGC (pufLMC-puhCBA) in strain JL354 was located within an integrating conjugative element, which indicates a potential mechanism for the horizontal transfer of genes for phototrophy. PMID:22558224

  11. Comparison of assembled Clostridium botulinum A1 genomes revealed their evolutionary relationship.

    PubMed

    Ng, Virginia; Lin, Wei-Jen

    2014-01-01

    Clostridium botulinum encompasses bacteria that produce at least one of the seven serotypes of botulinum neurotoxin (BoNT/A-G). The availability of genome sequences of four closely related Type A1 or A1(B) strains, as well as the A1-specific microarray, allowed the analysis of their genomic organizations and evolutionary relationship. The four genomes share >90% core genes and >96% functional groups. Phylogenetic analysis based on COG shows closer relations of the A1(B) strain, NCTC 2916, to B1 and F1 than A1 strains. Alignment of the genomes of the three A1 strains revealed a highly similar chromosomal structure with three small gaps in the genome of ATCC 19397 and one additional gap in the genome of Hall A, suggesting ATCC 19379 as an evolutionary intermediate between Hall A and ATCC 3502. Analyses of the four gap regions indicated potential horizontal gene transfer and recombination events important for the evolution of A1 strains.

  12. Comparison of 26 Sphingomonad Genomes Reveals Diverse Environmental Adaptations and Biodegradative Capabilities

    PubMed Central

    Aylward, Frank O.; McDonald, Bradon R.; Adams, Sandra M.; Valenzuela, Alejandra; Schmidt, Rebeccah A.; Goodwin, Lynne A.; Woyke, Tanja; Currie, Cameron R.; Suen, Garret

    2013-01-01

    Sphingomonads comprise a physiologically versatile group within the Alphaproteobacteria that includes strains of interest for biotechnology, human health, and environmental nutrient cycling. In this study, we compared 26 sphingomonad genome sequences to gain insight into their ecology, metabolic versatility, and environmental adaptations. Our multilocus phylogenetic and average amino acid identity (AAI) analyses confirm that Sphingomonas, Sphingobium, Sphingopyxis, and Novosphingobium are well-resolved monophyletic groups with the exception of Sphingomonas sp. strain SKA58, which we propose belongs to the genus Sphingobium. Our pan-genomic analysis of sphingomonads reveals numerous species-specific open reading frames (ORFs) but few signatures of genus-specific cores. The organization and coding potential of the sphingomonad genomes appear to be highly variable, and plasmid-mediated gene transfer and chromosome-plasmid recombination, together with prophage- and transposon-mediated rearrangements, appear to play prominent roles in the genome evolution of this group. We find that many of the sphingomonad genomes encode numerous oxygenases and glycoside hydrolases, which are likely responsible for their ability to degrade various recalcitrant aromatic compounds and polysaccharides, respectively. Many of these enzymes are encoded on megaplasmids, suggesting that they may be readily transferred between species. We also identified enzymes putatively used for the catabolism of sulfonate and nitroaromatic compounds in many of the genomes, suggesting that plant-based compounds or chemical contaminants may be sources of nitrogen and sulfur. Many of these sphingomonads appear to be adapted to oligotrophic environments, but several contain genomic features indicative of host associations. Our work provides a basis for understanding the ecological strategies employed by sphingomonads and their role in environmental nutrient cycling. PMID:23563954

  13. Whole-genome sequence comparisons reveal the evolution of Vibrio cholerae O1.

    PubMed

    Kim, Eun Jin; Lee, Chan Hee; Nair, G Balakrish; Kim, Dong Wook

    2015-08-01

    The analysis of the whole-genome sequences of Vibrio cholerae strains from previous and current cholera pandemics has demonstrated that genomic changes and alterations in phage CTX (particularly in the gene encoding the B subunit of cholera toxin) were major features in the evolution of V. cholerae. Recent studies have revealed the genetic mechanisms in these bacteria by which new variants of V. cholerae are generated from type-specific strains; these mechanisms suggest that certain strains are selected by environmental or human factors over time. By understanding the mechanisms and driving forces of historical and current changes in the V. cholerae population, it would be possible to predict the direction of such changes and the evolution of new variants; this has implications for the battle against cholera.

  14. Genome comparison and context analysis reveals putative mobile forms of restriction–modification systems and related rearrangements

    PubMed Central

    Furuta, Yoshikazu; Abe, Kentaro; Kobayashi, Ichizo

    2010-01-01

    The mobility of restriction–modification (RM) gene complexes and their association with genome rearrangements is a subject of active investigation. Here we conducted systematic genome comparisons and genome context analysis on fully sequenced prokaryotic genomes to detect RM-linked genome rearrangements. RM genes were frequently found to be linked to mobility-related genes such as integrase and transposase homologs. They were flanked by direct and inverted repeats at a significantly high frequency. Insertion by long target duplication was observed for I, II, III and IV restriction types. We found several RM genes flanked by long inverted repeats, some of which had apparently inserted into a genome with a short target duplication. In some cases, only a portion of an apparently complete RM system was flanked by inverted repeats. We also found a unit composed of RM genes and an integrase homolog that integrated into a tRNA gene. An allelic substitution of a Type III system with a linked Type I and IV system pair, and allelic diversity in the putative target recognition domain of Type IIG systems were observed. This study revealed the possible mobility of all types of RM systems, and the diversity in their mobility-related organization. PMID:20071371

  15. Genome-Wide Comparison of Magnaporthe Species Reveals a Host-Specific Pattern of Secretory Proteins and Transposable Elements

    PubMed Central

    Gowda, Malali

    2016-01-01

    Blast disease caused by the Magnaporthe species is a major factor affecting the productivity of rice, wheat and millets. This study was aimed at generating genomic information for rice and non-rice Magnaporthe isolates to understand the extent of genetic variation. We have sequenced the whole genome of the Magnaporthe isolates, infecting rice (leaf and neck), finger millet (leaf and neck), foxtail millet (leaf) and buffel grass (leaf). Rice and finger millet isolates infecting both leaf and neck tissues were sequenced, since the damage and yield loss caused due to neck blast is much higher as compared to leaf blast. The genome-wide comparison was carried out to study the variability in gene content, candidate effectors, repeat element distribution, genes involved in carbohydrate metabolism and SNPs. The analysis of repeat element footprints revealed some genes such as naringenin, 2-oxoglutarate 3-dioxygenase being targeted by Pot2 and Occan, in isolates from different host species. Some repeat insertions were host-specific while other insertions were randomly shared between isolates. The distributions of repeat elements, secretory proteins, CAZymes and SNPs showed significant variation across host-specific lineages of Magnaporthe indicating an independent genome evolution orchestrated by multiple genomic factors. PMID:27658241

  16. Genome-wide comparison of cowpox viruses reveals a new clade related to Variola virus.

    PubMed

    Dabrowski, Piotr Wojtek; Radonić, Aleksandar; Kurth, Andreas; Nitsche, Andreas

    2013-01-01

    Zoonotic infections caused by several orthopoxviruses (OPV) like monkeypox virus or vaccinia virus have a significant impact on human health. In Europe, the number of diagnosed infections with cowpox viruses (CPXV) is increasing in animals as well as in humans. CPXV used to be enzootic in cattle; however, such infections were not being diagnosed over the last decades. Instead, individual cases of cowpox are being found in cats or exotic zoo animals that transmit the infection to humans. Both animals and humans reveal local exanthema on arms and legs or on the face. Although cowpox is generally regarded as a self-limiting disease, immunosuppressed patients can develop a lethal systemic disease resembling smallpox. To date, only limited information on the complex and, compared to other OPV, sparsely conserved CPXV genomes is available. Since CPXV displays the widest host range of all OPV known, it seems important to comprehend the genetic repertoire of CPXV which in turn may help elucidate specific mechanisms of CPXV pathogenesis and origin. Therefore, 22 genomes of independent CPXV strains from clinical cases, involving ten humans, four rats, two cats, two jaguarundis, one beaver, one elephant, one marah and one mongoose, were sequenced by using massive parallel pyrosequencing. The extensive phylogenetic analysis showed that the CPXV strains sequenced clearly cluster into several distinct clades, some of which are closely related to Vaccinia viruses while others represent different clades in a CPXV cluster. Particularly one CPXV clade is more closely related to Camelpox virus, Taterapox virus and Variola virus than to any other known OPV. These results support and extend recent data from other groups who postulate that CPXV does not form a monophyletic clade and should be divided into multiple lineages.

  17. Comparison of environmental and isolate Sulfobacillus genomes reveals diverse carbon, sulfur, nitrogen, and hydrogen metabolisms

    SciTech Connect

    Justice, Nicholas B.; Norman, Anders; Brown, Christopher T.; Singh, Andrea; Thomas, Brian C.; Banfield, Jillian F.

    2014-12-15

    Bacteria of the genus Sulfobacillus are found worldwide as members of microbial communities that accelerate sulfide mineral dissolution in acid mine drainage environments (AMD), acid-rock drainage environments (ARD), as well as in industrial bioleaching operations. Despite their frequent identification in these environments, their role in biogeochemical cycling is poorly understood. Here we report draft genomes of five species of the Sulfobacillus genus (AMDSBA1-5) reconstructed by cultivation-independent sequencing of biofilms sampled from the Richmond Mine (Iron Mountain, CA). Three of these species (AMDSBA2, AMDSBA3, and AMDSBA4) have no cultured representatives while AMDSBA1 is a strain of S. benefaciens, and AMDSBA5 a strain of S. thermosulfidooxidans. We analyzed the diversity of energy conservation and central carbon metabolisms for these genomes and previously published Sulfobacillus genomes. Pathways of sulfur oxidation vary considerably across the genus, including the number and type of subunits of putative heterodisulfide reductase complexes likely involved in sulfur oxidation. The number and type of nickel-iron hydrogenase proteins varied across the genus, as does the presence of different central carbon pathways. Only the AMDSBA3 genome encodes a dissimilatory nitrate reducatase and only the AMDSBA5 and S. thermosulfidooxidans genomes encode assimilatory nitrate reductases. Lastly, within the genus, AMDSBA4 is unusual in that its electron transport chain includes a cytochrome bc type complex, a unique cytochrome c oxidase, and two distinct succinate dehydrogenase complexes. Overall, the results significantly expand our understanding of carbon, sulfur, nitrogen, and hydrogen metabolism within the Sulfobacillus genus.

  18. Comparison of environmental and isolate Sulfobacillus genomes reveals diverse carbon, sulfur, nitrogen, and hydrogen metabolisms

    DOE PAGES

    Justice, Nicholas B.; Norman, Anders; Brown, Christopher T.; ...

    2014-12-15

    Bacteria of the genus Sulfobacillus are found worldwide as members of microbial communities that accelerate sulfide mineral dissolution in acid mine drainage environments (AMD), acid-rock drainage environments (ARD), as well as in industrial bioleaching operations. Despite their frequent identification in these environments, their role in biogeochemical cycling is poorly understood. Here we report draft genomes of five species of the Sulfobacillus genus (AMDSBA1-5) reconstructed by cultivation-independent sequencing of biofilms sampled from the Richmond Mine (Iron Mountain, CA). Three of these species (AMDSBA2, AMDSBA3, and AMDSBA4) have no cultured representatives while AMDSBA1 is a strain of S. benefaciens, and AMDSBA5 amore » strain of S. thermosulfidooxidans. We analyzed the diversity of energy conservation and central carbon metabolisms for these genomes and previously published Sulfobacillus genomes. Pathways of sulfur oxidation vary considerably across the genus, including the number and type of subunits of putative heterodisulfide reductase complexes likely involved in sulfur oxidation. The number and type of nickel-iron hydrogenase proteins varied across the genus, as does the presence of different central carbon pathways. Only the AMDSBA3 genome encodes a dissimilatory nitrate reducatase and only the AMDSBA5 and S. thermosulfidooxidans genomes encode assimilatory nitrate reductases. Lastly, within the genus, AMDSBA4 is unusual in that its electron transport chain includes a cytochrome bc type complex, a unique cytochrome c oxidase, and two distinct succinate dehydrogenase complexes. Overall, the results significantly expand our understanding of carbon, sulfur, nitrogen, and hydrogen metabolism within the Sulfobacillus genus.« less

  19. Whole Genome Comparison of Campylobacter jejuni Human Isolates Using a Low-Cost Microarray Reveals Extensive Genetic Diversity

    PubMed Central

    Dorrell, Nick; Mangan, Joseph A.; Laing, Kenneth G.; Hinds, Jason; Linton, Dennis; Al-Ghusein, Hasan; Barrell, Bart G.; Parkhill, Julian; Stoker, Neil G.; Karlyshev, Andrey V.; Butcher, Philip D.; Wren, Brendan W.

    2001-01-01

    Campylobacter jejuni is the leading cause of bacterial food-borne diarrhoeal disease throughout the world, and yet is still a poorly understood pathogen. Whole genome microarray comparisons of 11 C. jejuni strains of diverse origin identified genes in up to 30 NCTC 11168 loci ranging from 0.7 to 18.7 kb that are either absent or highly divergent in these isolates. Many of these regions are associated with the biosynthesis of surface structures including flagella, lipo-oligosaccharide, and the newly identified capsule. Other strain-variable genes of known function include those responsible for iron acquisition, DNA restriction/modification, and sialylation. In fact, at least 21% of genes in the sequenced strain appear dispensable as they are absent or highly divergent in one or more of the isolates tested, thus defining 1300 C. jejuni core genes. Such core genes contribute mainly to metabolic, biosynthetic, cellular, and regulatory processes, but many virulence determinants are also conserved. Comparison of the capsule biosynthesis locus revealed conservation of all the genes in this region in strains with the same Penner serotype as strain NCTC 11168. By contrast, between 5 and 17 NCTC 11168 genes in this region are either absent or highly divergent in strains of a different serotype from the sequenced strain, providing further evidence that the capsule accounts for Penner serotype specificity. These studies reveal extensive genetic diversity among C. jejuni strains and pave the way toward identifying correlates of pathogenicity and developing improved epidemiological tools for this problematic pathogen. PMID:11591647

  20. Genome-wide comparison and taxonomic relatedness of multiple Xylella fastidiosa strains reveal the occurrence of three subspecies and a new Xylella species.

    PubMed

    Marcelletti, Simone; Scortichini, Marco

    2016-10-01

    A total of 21 Xylella fastidiosa strains were assessed by comparing their genomes to infer their taxonomic relationships. The whole-genome-based average nucleotide identity and tetranucleotide frequency correlation coefficient analyses were performed. In addition, a consensus tree based on comparisons of 956 core gene families, and a genome-wide phylogenetic tree and a Neighbor-net network were constructed with 820,088 nucleotides (i.e., approximately 30-33 % of the entire X. fastidiosa genome). All approaches revealed the occurrence of three well-demarcated genetic clusters that represent X. fastidiosa subspecies fastidiosa, multiplex and pauca, with the latter appeared to diverge. We suggest that the proposed but never formally described subspecies 'sandyi' and 'morus' are instead members of the subspecies fastidiosa. These analyses support the view that the Xylella strain isolated from Pyrus pyrifolia in Taiwan is likely to be a new species. A widely used multilocus sequence typing analysis yielded conflicting results.

  1. Pseudomonas syringae pv. actinidiae draft genomes comparison reveal strain-specific features involved in adaptation and virulence to Actinidia species.

    PubMed

    Marcelletti, Simone; Ferrante, Patrizia; Petriccione, Milena; Firrao, Giuseppe; Scortichini, Marco

    2011-01-01

    A recent re-emerging bacterial canker disease incited by Pseudomonas syringae pv. actinidiae (Psa) is causing severe economic losses to Actinidia chinensis and A. deliciosa cultivations in southern Europe, New Zealand, Chile and South Korea. Little is known about the genetic features of this pathovar. We generated genome-wide Illumina sequence data from two Psa strains causing outbreaks of bacterial canker on the A. deliciosa cv. Hayward in Japan (J-Psa, type-strain of the pathovar) and in Italy (I-Psa) in 1984 and 1992, respectively as well as from a Psa strain (I2-Psa) isolated at the beginning of the recent epidemic on A. chinensis cv. Hort16A in Italy. All strains were isolated from typical leaf spot symptoms. The phylogenetic relationships revealed that Psa is more closely related to P. s. pv. theae than to P. avellanae within genomospecies 8. Comparative genomic analyses revealed both relevant intrapathovar variations and putative pathovar-specific genomic regions in Psa. The genomic sequences of J-Psa and I-Psa were very similar. Conversely, the I2-Psa genome encodes four additional effector protein genes, lacks a 50 kb plasmid and the phaseolotoxin gene cluster, argK-tox but has acquired a 160 kb plasmid and putative prophage sequences. Several lines of evidence from the analysis of the genome sequences support the hypothesis that this strain did not evolve from the Psa population that caused the epidemics in 1984-1992 in Japan and Italy but rather is the product of a recent independent evolution of the pathovar actinidiae for infecting Actinidia spp. All Psa strains share the genetic potential for copper resistance, antibiotic detoxification, high affinity iron acquisition and detoxification of nitric oxide of plant origin. Similar to other sequenced phytopathogenic pseudomonads associated with woody plant species, the Psa strains isolated from leaves also display a set of genes involved in the catabolism of plant-derived aromatic compounds.

  2. Pseudomonas syringae pv. actinidiae Draft Genomes Comparison Reveal Strain-Specific Features Involved in Adaptation and Virulence to Actinidia Species

    PubMed Central

    Marcelletti, Simone; Ferrante, Patrizia; Petriccione, Milena; Firrao, Giuseppe; Scortichini, Marco

    2011-01-01

    A recent re-emerging bacterial canker disease incited by Pseudomonas syringae pv. actinidiae (Psa) is causing severe economic losses to Actinidia chinensis and A. deliciosa cultivations in southern Europe, New Zealand, Chile and South Korea. Little is known about the genetic features of this pathovar. We generated genome-wide Illumina sequence data from two Psa strains causing outbreaks of bacterial canker on the A. deliciosa cv. Hayward in Japan (J-Psa, type-strain of the pathovar) and in Italy (I-Psa) in 1984 and 1992, respectively as well as from a Psa strain (I2-Psa) isolated at the beginning of the recent epidemic on A. chinensis cv. Hort16A in Italy. All strains were isolated from typical leaf spot symptoms. The phylogenetic relationships revealed that Psa is more closely related to P. s. pv. theae than to P. avellanae within genomospecies 8. Comparative genomic analyses revealed both relevant intrapathovar variations and putative pathovar-specific genomic regions in Psa. The genomic sequences of J-Psa and I-Psa were very similar. Conversely, the I2-Psa genome encodes four additional effector protein genes, lacks a 50 kb plasmid and the phaseolotoxin gene cluster, argK-tox but has acquired a 160 kb plasmid and putative prophage sequences. Several lines of evidence from the analysis of the genome sequences support the hypothesis that this strain did not evolve from the Psa population that caused the epidemics in 1984–1992 in Japan and Italy but rather is the product of a recent independent evolution of the pathovar actinidiae for infecting Actinidia spp. All Psa strains share the genetic potential for copper resistance, antibiotic detoxification, high affinity iron acquisition and detoxification of nitric oxide of plant origin. Similar to other sequenced phytopathogenic pseudomonads associated with woody plant species, the Psa strains isolated from leaves also display a set of genes involved in the catabolism of plant-derived aromatic compounds. PMID

  3. Cross-Study Comparison Reveals Common Genomic, Network, and Functional Signatures of Desiccation Resistance in Drosophila melanogaster

    PubMed Central

    Telonis-Scott, Marina; Sgrò, Carla M.; Hoffmann, Ary A.; Griffin, Philippa C.

    2016-01-01

    Repeated attempts to map the genomic basis of complex traits often yield different outcomes because of the influence of genetic background, gene-by-environment interactions, and/or statistical limitations. However, where repeatability is low at the level of individual genes, overlap often occurs in gene ontology categories, genetic pathways, and interaction networks. Here we report on the genomic overlap for natural desiccation resistance from a Pool-genome-wide association study experiment and a selection experiment in flies collected from the same region in southeastern Australia in different years. We identified over 600 single nucleotide polymorphisms associated with desiccation resistance in flies derived from almost 1,000 wild-caught genotypes, a similar number of loci to that observed in our previous genomic study of selected lines, demonstrating the genetic complexity of this ecologically important trait. By harnessing the power of cross-study comparison, we narrowed the candidates from almost 400 genes in each study to a core set of 45 genes, enriched for stimulus, stress, and defense responses. In addition to gene-level overlap, there was higher order congruence at the network and functional levels, suggesting genetic redundancy in key stress sensing, stress response, immunity, signaling, and gene expression pathways. We also identified variants linked to different molecular aspects of desiccation physiology previously verified from functional experiments. Our approach provides insight into the genomic basis of a complex and ecologically important trait and predicts candidate genetic pathways to explore in multiple genetic backgrounds and related species within a functional framework. PMID:26733490

  4. Genomic Comparison of Indigenous African and Northern European Chickens Reveals Putative Mechanisms of Stress Tolerance Related to Environmental Selection Pressure.

    PubMed

    Fleming, Damarius S; Weigend, Steffen; Simianer, Henner; Weigend, Annett; Rothschild, Max; Schmidt, Carl; Ashwell, Christopher; Persia, Mike; Reecy, James; Lamont, Susan J

    2017-03-24

    Global climate change is increasing the magnitude of environmental stressors, such as temperature, pathogens, and drought that limit survivability and sustainability of livestock production. Poultry production and its expansion are dependent upon robust animals that are able to cope with stressors in multiple environments. Understanding the genetic strategies that indigenous, non-commercial breeds have evolved to survive in their environment could help to elucidate molecular mechanisms underlying biological traits of environmental adaptation. We examined poultry from diverse breeds and climates of Africa and Northern Europe for selection signatures that have allowed them to adapt to their indigenous environments. Selection signatures were studied using a combination of population genomic methods that employed FST , iHS, and runs of homozygosity procedures. All the analyses indicated differences in environment as a driver of selective pressure in both groups of populations. The analyses revealed unique differences in the genomic regions under selection pressure from the environment for each population. The African chickens showed stronger selection towards stress signaling and angiogenesis, while the Northern European chickens showed more selection pressure toward processes related to energy homeostasis. The results suggest that chromosomes 2 and 27 are the most diverged between populations and the most selected upon within the African (chromosome 27) and Northern European (chromosome 2) birds. Examination of the divergent populations have provided new insight into genes under possible selection related to tolerance of a population's indigenous environment that may be baselines for examining the genomic contribution to tolerance adaptions.

  5. Genome sequence comparison reveals a candidate gene involved in male-hermaphrodite differentiation in papaya (Carica papaya) trees.

    PubMed

    Ueno, Hiroki; Urasaki, Naoya; Natsume, Satoshi; Yoshida, Kentaro; Tarora, Kazuhiko; Shudo, Ayano; Terauchi, Ryohei; Matsumura, Hideo

    2015-04-01

    The sex type of papaya (Carica papaya) is determined by the pair of sex chromosomes (XX, female; XY, male; and XY(h), hermaphrodite), in which there is a non-recombining genomic region in the Y and Y(h) chromosomes. This region is presumed to be involved in determination of males and hermaphrodites; it is designated as the male-specific region in the Y chromosome (MSY) and the hermaphrodite-specific region in the Y(h) chromosome (HSY). Here, we identified the genes determining male and hermaphrodite sex types by comparing MSY and HSY genomic sequences. In the MSY and HSY genomic regions, we identified 14,528 nucleotide substitutions and 965 short indels with a large gap and two highly diverged regions. In the predicted genes expressed in flower buds, we found no nucleotide differences leading to amino acid changes between the MSY and HSY. However, we found an HSY-specific transposon insertion in a gene (SVP like) showing a similarity to the Short Vegetative Phase (SVP) gene. Study of SVP-like transcripts revealed that the MSY allele encoded an intact protein, while the HSY allele encoded a truncated protein. Our findings demonstrated that the SVP-like gene is a candidate gene for male-hermaphrodite determination in papaya.

  6. A Gene-Oriented Haplotype Comparison Reveals Recently Selected Genomic Regions in Temperate and Tropical Maize Germplasm

    PubMed Central

    Zhang, Jie; Li, Yongxiang; Zheng, Jun; Zhang, Hongwei; Yang, Xiaohong; Wang, Jianhua; Wang, Guoying

    2017-01-01

    The extensive genetic variation present in maize (Zea mays) germplasm makes it possible to detect signatures of positive artificial selection that occurred during temperate and tropical maize improvement. Here we report an analysis of 532,815 polymorphisms from a maize association panel consisting of 368 diverse temperate and tropical inbred lines. We developed a gene-oriented approach adapting exonic polymorphisms to identify recently selected alleles by comparing haplotypes across the maize genome. This analysis revealed evidence of selection for more than 1100 genomic regions during recent improvement, and included regulatory genes and key genes with visible mutant phenotypes. We find that selected candidate target genes in temperate maize are enriched in biosynthetic processes, and further examination of these candidates highlights two cases, sucrose flux and oil storage, in which multiple genes in a common pathway can be cooperatively selected. Finally, based on available parallel gene expression data, we hypothesize that some genes were selected for regulatory variations, resulting in altered gene expression. PMID:28099470

  7. Ontology for Genome Comparison and Genomic Rearrangements

    PubMed Central

    Flanagan, Keith; Stevens, Robert; Pocock, Matthew; Lee, Pete

    2004-01-01

    We present an ontology for describing genomes, genome comparisons, their evolution and biological function. This ontology will support the development of novel genome comparison algorithms and aid the community in discussing genomic evolution. It provides a framework for communication about comparative genomics, and a basis upon which further automated analysis can be built. The nomenclature defined by the ontology will foster clearer communication between biologists, and also standardize terms used by data publishers in the results of analysis programs. The overriding aim of this ontology is the facilitation of consistent annotation of genomes through computational methods, rather than human annotators. To this end, the ontology includes definitions that support computer analysis and automated transfer of annotations between genomes, rather than relying upon human mediation. PMID:18629137

  8. Sequencing of Seven Haloarchaeal Genomes Reveals Patterns of Genomic Flux

    PubMed Central

    Lynch, Erin A.; Langille, Morgan G. I.; Darling, Aaron; Wilbanks, Elizabeth G.; Haltiner, Caitlin; Shao, Katie S. Y.; Starr, Michael O.; Teiling, Clotilde; Harkins, Timothy T.; Edwards, Robert A.; Eisen, Jonathan A.; Facciotti, Marc T.

    2012-01-01

    We report the sequencing of seven genomes from two haloarchaeal genera, Haloferax and Haloarcula. Ease of cultivation and the existence of well-developed genetic and biochemical tools for several diverse haloarchaeal species make haloarchaea a model group for the study of archaeal biology. The unique physiological properties of these organisms also make them good candidates for novel enzyme discovery for biotechnological applications. Seven genomes were sequenced to ∼20×coverage and assembled to an average of 50 contigs (range 5 scaffolds - 168 contigs). Comparisons of protein-coding gene compliments revealed large-scale differences in COG functional group enrichment between these genera. Analysis of genes encoding machinery for DNA metabolism reveals genera-specific expansions of the general transcription factor TATA binding protein as well as a history of extensive duplication and horizontal transfer of the proliferating cell nuclear antigen. Insights gained from this study emphasize the importance of haloarchaea for investigation of archaeal biology. PMID:22848480

  9. Comparison against 186 canid whole-genome sequences reveals survival strategies of an ancient clonally transmissible canine tumor.

    PubMed

    Decker, Brennan; Davis, Brian W; Rimbault, Maud; Long, Adrienne H; Karlins, Eric; Jagannathan, Vidhya; Reiman, Rebecca; Parker, Heidi G; Drögemüller, Cord; Corneveaux, Jason J; Chapman, Erica S; Trent, Jeffery M; Leeb, Tosso; Huentelman, Matthew J; Wayne, Robert K; Karyadi, Danielle M; Ostrander, Elaine A

    2015-11-01

    Canine transmissible venereal tumor (CTVT) is a parasitic cancer clone that has propagated for thousands of years via sexual transfer of malignant cells. Little is understood about the mechanisms that converted an ancient tumor into the world's oldest known continuously propagating somatic cell lineage. We created the largest existing catalog of canine genome-wide variation and compared it against two CTVT genome sequences, thereby separating alleles derived from the founder's genome from somatic mutations that must drive clonal transmissibility. We show that CTVT has undergone continuous adaptation to its transmissible allograft niche, with overlapping mutations at every step of immunosurveillance, particularly self-antigen presentation and apoptosis. We also identified chronologically early somatic mutations in oncogenesis- and immune-related genes that may represent key initiators of clonal transmissibility. Thus, we provide the first insights into the specific genomic aberrations that underlie CTVT's dogged perseverance in canids around the world.

  10. Genome-wide comparison of African-ancestry populations from CARe and other cohorts reveals signals of natural selection.

    PubMed

    Bhatia, Gaurav; Patterson, Nick; Pasaniuc, Bogdan; Zaitlen, Noah; Genovese, Giulio; Pollack, Samuela; Mallick, Swapan; Myers, Simon; Tandon, Arti; Spencer, Chris; Palmer, Cameron D; Adeyemo, Adebowale A; Akylbekova, Ermeg L; Cupples, L Adrienne; Divers, Jasmin; Fornage, Myriam; Kao, W H Linda; Lange, Leslie; Li, Mingyao; Musani, Solomon; Mychaleckyj, Josyf C; Ogunniyi, Adesola; Papanicolaou, George; Rotimi, Charles N; Rotter, Jerome I; Ruczinski, Ingo; Salako, Babatunde; Siscovick, David S; Tayo, Bamidele O; Yang, Qiong; McCarroll, Steve; Sabeti, Pardis; Lettre, Guillaume; De Jager, Phil; Hirschhorn, Joel; Zhu, Xiaofeng; Cooper, Richard; Reich, David; Wilson, James G; Price, Alkes L

    2011-09-09

    The study of recent natural selection in human populations has important applications to human history and medicine. Positive natural selection drives the increase in beneficial alleles and plays a role in explaining diversity across human populations. By discovering traits subject to positive selection, we can better understand the population level response to environmental pressures including infectious disease. Our study examines unusual population differentiation between three large data sets to detect natural selection. The populations examined, African Americans, Nigerians, and Gambians, are genetically close to one another (F(ST) < 0.01 for all pairs), allowing us to detect selection even with moderate changes in allele frequency. We also develop a tree-based method to pinpoint the population in which selection occurred, incorporating information across populations. Our genome-wide significant results corroborate loci previously reported to be under selection in Africans including HBB and CD36. At the HLA locus on chromosome 6, results suggest the existence of multiple, independent targets of population-specific selective pressure. In addition, we report a genome-wide significant (p = 1.36 × 10(-11)) signal of selection in the prostate stem cell antigen (PSCA) gene. The most significantly differentiated marker in our analysis, rs2920283, is highly differentiated in both Africa and East Asia and has prior genome-wide significant associations to bladder and gastric cancers.

  11. Genomic comparison of multi-drug resistant invasive and colonizing Acinetobacter baumannii isolated from diverse human body sites reveals genomic plasticity

    PubMed Central

    2011-01-01

    Background Acinetobacter baumannii has recently emerged as a significant global pathogen, with a surprisingly rapid acquisition of antibiotic resistance and spread within hospitals and health care institutions. This study examines the genomic content of three A. baumannii strains isolated from distinct body sites. Isolates from blood, peri-anal, and wound sources were examined in an attempt to identify genetic features that could be correlated to each isolation source. Results Pulsed-field gel electrophoresis, multi-locus sequence typing and antibiotic resistance profiles demonstrated genotypic and phenotypic variation. Each isolate was sequenced to high-quality draft status, which allowed for comparative genomic analyses with existing A. baumannii genomes. A high resolution, whole genome alignment method detailed the phylogenetic relationships of sequenced A. baumannii and found no correlation between phylogeny and body site of isolation. This method identified genomic regions unique to both those isolates found on the surface of the skin or in wounds, termed colonization isolates, and those identified from body fluids, termed invasive isolates; these regions may play a role in the pathogenesis and spread of this important pathogen. A PCR-based screen of 74 A. baumanii isolates demonstrated that these unique genes are not exclusive to either phenotype or isolation source; however, a conserved genomic region exclusive to all sequenced A. baumannii was identified and verified. Conclusions The results of the comparative genome analysis and PCR assay show that A. baumannii is a diverse and genomically variable pathogen that appears to have the potential to cause a range of human disease regardless of the isolation source. PMID:21639920

  12. Genome sequence comparison reveals independent inactivation of the caspase-15 gene in different evolutionary lineages of mammals.

    PubMed

    Eckhart, Leopold; Uthman, Aumaid; Sipos, Wolfgang; Tschachler, Erwin

    2006-11-01

    We have recently demonstrated that placental mammalian species such as pig and dog express a novel proapoptotic protease, caspase-15, whereas mouse and humans lack this enzyme. Here we investigated the evolutionary fate of the caspase-15 gene in different mammalian lineages by analyzing whole-genome shotgun sequences of 30 mammalian species for the presence of caspase-15 orthologs. Caspase-15 gene sequences were found in representatives of all major mammalian clades except for the superorders Afrotheria (tenrec, rock hyrax, and elephant) and Euarchontoglires (rodents, rabbit, tree shrew, and primates), which either lacked any caspase-15-like sequences or contained mutated remnants of the caspase-15 gene. Polymerase chain reaction screenings confirmed the results of the database searches and showed that the caspase-15 gene is expressed not only in various placental mammals but also in the marsupial, Monodelphis domestica. The observed species distribution implies that caspase-15 has originated in an early ancestor of modern mammals and has been conserved, over more than 180 Myr, in marsupials and many placental mammals, whereas it was independently lost in 2 phylogenetically distant clades of placental mammals, that is, Afrotheria and Euarchontoglires. Our data suggest that the inactivation of the caspase-15 gene was not counteracted by, and may even have been driven by, evolutionary constraints in these clades, and therefore, caution against the uncritical use of gene absence for the inference of phylogenetic relationships.

  13. Physical and genetic map of the Lactococcus lactis subsp. cremoris MG1363 chromosome: comparison with that of Lactococcus lactis subsp. lactis IL 1403 reveals a large genome inversion.

    PubMed Central

    Le Bourgeois, P; Lautier, M; van den Berghe, L; Gasson, M J; Ritzenthaler, P

    1995-01-01

    A physical and genetic map of the chromosome of the Lactococcus lactis subsp. cremoris reference strain MG1363 was established. The physical map was constructed for NotI, ApaI, and SmaI enzymes by using a strategy that combines creation of new rare restriction sites by the random-integration vector pRL1 and ordering of restriction fragments by indirect end-labeling experiments. The MG1363 chromosome appeared to be circular and 2,560 kb long. Seventy-seven chromosomal markers were located on the physical map by hybridization experiments. Integration via homologous recombination of pRC1-derived plasmids allowed a more precise location of some lactococcal genes and determination of their orientation on the chromosome. The MG1363 chromosome contains six rRNA operons; five are clustered within 15% of the chromosome and transcribed in the same direction. Comparison of the L. lactis subsp. cremoris MG1363 physical map with those of the two L. lactis subsp. lactis strains IL1403 and DL11 revealed a high degree of restriction polymorphism. At the genetic organization level, despite an overall conservation of gene organization, strain MG1363 presents a large inversion of half of the genome in the region containing the rRNA operons. PMID:7751295

  14. Genome comparison of barley and maize smut fungi reveals targeted loss of RNA silencing components and species-specific presence of transposable elements.

    PubMed

    Laurie, John D; Ali, Shawkat; Linning, Rob; Mannhaupt, Gertrud; Wong, Philip; Güldener, Ulrich; Münsterkötter, Martin; Moore, Richard; Kahmann, Regine; Bakkeren, Guus; Schirawski, Jan

    2012-05-01

    Ustilago hordei is a biotrophic parasite of barley (Hordeum vulgare). After seedling infection, the fungus persists in the plant until head emergence when fungal spores develop and are released from sori formed at kernel positions. The 26.1-Mb U. hordei genome contains 7113 protein encoding genes with high synteny to the smaller genomes of the related, maize-infecting smut fungi Ustilago maydis and Sporisorium reilianum but has a larger repeat content that affected genome evolution at important loci, including mating-type and effector loci. The U. hordei genome encodes components involved in RNA interference and heterochromatin formation, normally involved in genome defense, that are lacking in the U. maydis genome due to clean excision events. These excision events were possibly a result of former presence of repetitive DNA and of an efficient homologous recombination system in U. maydis. We found evidence of repeat-induced point mutations in the genome of U. hordei, indicating that smut fungi use different strategies to counteract the deleterious effects of repetitive DNA. The complement of U. hordei effector genes is comparable to the other two smuts but reveals differences in family expansion and clustering. The availability of the genome sequence will facilitate the identification of genes responsible for virulence and evolution of smut fungi on their respective hosts.

  15. Genome Comparison of Barley and Maize Smut Fungi Reveals Targeted Loss of RNA Silencing Components and Species-Specific Presence of Transposable Elements[W

    PubMed Central

    Laurie, John D.; Ali, Shawkat; Linning, Rob; Mannhaupt, Gertrud; Wong, Philip; Güldener, Ulrich; Münsterkötter, Martin; Moore, Richard; Kahmann, Regine; Bakkeren, Guus; Schirawski, Jan

    2012-01-01

    Ustilago hordei is a biotrophic parasite of barley (Hordeum vulgare). After seedling infection, the fungus persists in the plant until head emergence when fungal spores develop and are released from sori formed at kernel positions. The 26.1-Mb U. hordei genome contains 7113 protein encoding genes with high synteny to the smaller genomes of the related, maize-infecting smut fungi Ustilago maydis and Sporisorium reilianum but has a larger repeat content that affected genome evolution at important loci, including mating-type and effector loci. The U. hordei genome encodes components involved in RNA interference and heterochromatin formation, normally involved in genome defense, that are lacking in the U. maydis genome due to clean excision events. These excision events were possibly a result of former presence of repetitive DNA and of an efficient homologous recombination system in U. maydis. We found evidence of repeat-induced point mutations in the genome of U. hordei, indicating that smut fungi use different strategies to counteract the deleterious effects of repetitive DNA. The complement of U. hordei effector genes is comparable to the other two smuts but reveals differences in family expansion and clustering. The availability of the genome sequence will facilitate the identification of genes responsible for virulence and evolution of smut fungi on their respective hosts. PMID:22623492

  16. Open chromatin reveals the functional maize genome

    PubMed Central

    Rodgers-Melnick, Eli; Vera, Daniel L.; Bass, Hank W.

    2016-01-01

    Cellular processes mediated through nuclear DNA must contend with chromatin. Chromatin structural assays can efficiently integrate information across diverse regulatory elements, revealing the functional noncoding genome. In this study, we use a differential nuclease sensitivity assay based on micrococcal nuclease (MNase) digestion to discover open chromatin regions in the maize genome. We find that maize MNase-hypersensitive (MNase HS) regions localize around active genes and within recombination hotspots, focusing biased gene conversion at their flanks. Although MNase HS regions map to less than 1% of the genome, they consistently explain a remarkably large amount (∼40%) of heritable phenotypic variance in diverse complex traits. MNase HS regions are therefore on par with coding sequences as annotations that demarcate the functional parts of the maize genome. These results imply that less than 3% of the maize genome (coding and MNase HS regions) may give rise to the overwhelming majority of phenotypic variation, greatly narrowing the scope of the functional genome. PMID:27185945

  17. inGeno – an integrated genome and ortholog viewer for improved genome to genome comparisons

    PubMed Central

    Liang, Chunguang; Dandekar, Thomas

    2006-01-01

    Background Systematic genome comparisons are an important tool to reveal gene functions, pathogenic features, metabolic pathways and genome evolution in the era of post-genomics. Furthermore, such comparisons provide important clues for vaccines and drug development. Existing genome comparison software often lacks accurate information on orthologs, the function of similar genes identified and genome-wide reports and lists on specific functions. All these features and further analyses are provided here in the context of a modular software tool "inGeno" written in Java with Biojava subroutines. Results InGeno provides a user-friendly interactive visualization platform for sequence comparisons (comprehensive reciprocal protein – protein comparisons) between complete genome sequences and all associated annotations and features. The comparison data can be acquired from several different sequence analysis programs in flexible formats. Automatic dot-plot analysis includes output reduction, filtering, ortholog testing and linear regression, followed by smart clustering (local collinear blocks; LCBs) to reveal similar genome regions. Further, the system provides genome alignment and visualization editor, collinear relationships and strain-specific islands. Specific annotations and functions are parsed, recognized, clustered, logically concatenated and visualized and summarized in reports. Conclusion As shown in this study, inGeno can be applied to study and compare in particular prokaryotic genomes against each other (gram positive and negative as well as close and more distantly related species) and has been proven to be sensitive and accurate. This modular software is user-friendly and easily accommodates new routines to meet specific user-defined requirements. PMID:17054788

  18. Slow but not low: genomic comparisons reveal slower evolutionary rate and higher dN/dS in conifers compared to angiosperms

    PubMed Central

    2012-01-01

    Background Comparative genomics can inform us about the processes of mutation and selection across diverse taxa. Among seed plants, gymnosperms have been lacking in genomic comparisons. Recent EST and full-length cDNA collections for two conifers, Sitka spruce (Picea sitchensis) and loblolly pine (Pinus taeda), together with full genome sequences for two angiosperms, Arabidopsis thaliana and poplar (Populus trichocarpa), offer an opportunity to infer the evolutionary processes underlying thousands of orthologous protein-coding genes in gymnosperms compared with an angiosperm orthologue set. Results Based upon pairwise comparisons of 3,723 spruce and pine orthologues, we found an average synonymous genetic distance (dS) of 0.191, and an average dN/dS ratio of 0.314. Using a fossil-established divergence time of 140 million years between spruce and pine, we extrapolated a nucleotide substitution rate of 0.68 × 10-9 synonymous substitutions per site per year. When compared to angiosperms, this indicates a dramatically slower rate of nucleotide substitution rates in conifers: on average 15-fold. Coincidentally, we found a three-fold higher dN/dS for the spruce-pine lineage compared to the poplar-Arabidopsis lineage. This joint occurrence of a slower evolutionary rate in conifers with higher dN/dS, and possibly positive selection, showcases the uniqueness of conifer genome evolution. Conclusions Our results are in line with documented reduced nucleotide diversity, conservative genome evolution and low rates of diversification in conifers on the one hand and numerous examples of local adaptation in conifers on the other hand. We propose that reduced levels of nucleotide mutation in large and long-lived conifer trees, coupled with large effective population size, were the main factors leading to slow substitution rates but retention of beneficial mutations. PMID:22264329

  19. Genes but Not Genomes Reveal Bacterial Domestication of Lactococcus Lactis

    PubMed Central

    Passerini, Delphine; Beltramo, Charlotte; Coddeville, Michele; Quentin, Yves; Ritzenthaler, Paul

    2010-01-01

    Background The population structure and diversity of Lactococcus lactis subsp. lactis, a major industrial bacterium involved in milk fermentation, was determined at both gene and genome level. Seventy-six lactococcal isolates of various origins were studied by different genotyping methods and thirty-six strains displaying unique macrorestriction fingerprints were analyzed by a new multilocus sequence typing (MLST) scheme. This gene-based analysis was compared to genomic characteristics determined by pulsed-field gel electrophoresis (PFGE). Methodology/Principal Findings The MLST analysis revealed that L. lactis subsp. lactis is essentially clonal with infrequent intra- and intergenic recombination; also, despite its taxonomical classification as a subspecies, it displays a genetic diversity as substantial as that within several other bacterial species. Genome-based analysis revealed a genome size variability of 20%, a value typical of bacteria inhabiting different ecological niches, and that suggests a large pan-genome for this subspecies. However, the genomic characteristics (macrorestriction pattern, genome or chromosome size, plasmid content) did not correlate to the MLST-based phylogeny, with strains from the same sequence type (ST) differing by up to 230 kb in genome size. Conclusion/Significance The gene-based phylogeny was not fully consistent with the traditional classification into dairy and non-dairy strains but supported a new classification based on ecological separation between “environmental” strains, the main contributors to the genetic diversity within the subspecies, and “domesticated” strains, subject to recent genetic bottlenecks. Comparison between gene- and genome-based analyses revealed little relationship between core and dispensable genome phylogenies, indicating that clonal diversification and phenotypic variability of the “domesticated” strains essentially arose through substantial genomic flux within the dispensable genome

  20. Comparative genomics reveals insights into avian genome evolution and adaptation.

    PubMed

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M; Lee, Chul; Storz, Jay F; Antunes, Agostinho; Greenwold, Matthew J; Meredith, Robert W; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S; Gatesy, John; Hoffmann, Federico G; Opazo, Juan C; Håstad, Olle; Sawyer, Roger H; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A; Green, Richard E; O'Brien, Stephen J; Griffin, Darren; Johnson, Warren E; Haussler, David; Ryder, Oliver A; Willerslev, Eske; Graves, Gary R; Alström, Per; Fjeldså, Jon; Mindell, David P; Edwards, Scott V; Braun, Edward L; Rahbek, Carsten; Burt, David W; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D; Gilbert, M Thomas P; Wang, Jun

    2014-12-12

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits.

  1. Comparative genomics reveals insights into avian genome evolution and adaptation

    PubMed Central

    Zhang, Guojie; Li, Cai; Li, Qiye; Li, Bo; Larkin, Denis M.; Lee, Chul; Storz, Jay F.; Antunes, Agostinho; Greenwold, Matthew J.; Meredith, Robert W.; Ödeen, Anders; Cui, Jie; Zhou, Qi; Xu, Luohao; Pan, Hailin; Wang, Zongji; Jin, Lijun; Zhang, Pei; Hu, Haofu; Yang, Wei; Hu, Jiang; Xiao, Jin; Yang, Zhikai; Liu, Yang; Xie, Qiaolin; Yu, Hao; Lian, Jinmin; Wen, Ping; Zhang, Fang; Li, Hui; Zeng, Yongli; Xiong, Zijun; Liu, Shiping; Zhou, Long; Huang, Zhiyong; An, Na; Wang, Jie; Zheng, Qiumei; Xiong, Yingqi; Wang, Guangbiao; Wang, Bo; Wang, Jingjing; Fan, Yu; da Fonseca, Rute R.; Alfaro-Núñez, Alonzo; Schubert, Mikkel; Orlando, Ludovic; Mourier, Tobias; Howard, Jason T.; Ganapathy, Ganeshkumar; Pfenning, Andreas; Whitney, Osceola; Rivas, Miriam V.; Hara, Erina; Smith, Julia; Farré, Marta; Narayan, Jitendra; Slavov, Gancho; Romanov, Michael N; Borges, Rui; Machado, João Paulo; Khan, Imran; Springer, Mark S.; Gatesy, John; Hoffmann, Federico G.; Opazo, Juan C.; Håstad, Olle; Sawyer, Roger H.; Kim, Heebal; Kim, Kyu-Won; Kim, Hyeon Jeong; Cho, Seoae; Li, Ning; Huang, Yinhua; Bruford, Michael W.; Zhan, Xiangjiang; Dixon, Andrew; Bertelsen, Mads F.; Derryberry, Elizabeth; Warren, Wesley; Wilson, Richard K; Li, Shengbin; Ray, David A.; Green, Richard E.; O’Brien, Stephen J.; Griffin, Darren; Johnson, Warren E.; Haussler, David; Ryder, Oliver A.; Willerslev, Eske; Graves, Gary R.; Alström, Per; Fjeldså, Jon; Mindell, David P.; Edwards, Scott V.; Braun, Edward L.; Rahbek, Carsten; Burt, David W.; Houde, Peter; Zhang, Yong; Yang, Huanming; Wang, Jian; Jarvis, Erich D.; Gilbert, M. Thomas P.; Wang, Jun

    2015-01-01

    Birds are the most species-rich class of tetrapod vertebrates and have wide relevance across many research fields. We explored bird macroevolution using full genomes from 48 avian species representing all major extant clades. The avian genome is principally characterized by its constrained size, which predominantly arose because of lineage-specific erosion of repetitive elements, large segmental deletions, and gene loss. Avian genomes furthermore show a remarkably high degree of evolutionary stasis at the levels of nucleotide sequence, gene synteny, and chromosomal structure. Despite this pattern of conservation, we detected many non-neutral evolutionary changes in protein-coding genes and noncoding regions. These analyses reveal that pan-avian genomic diversity covaries with adaptations to different lifestyles and convergent evolution of traits. PMID:25504712

  2. Open chromatin reveals the functional maize genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Every cellular process mediated through nuclear DNA must contend with chromatin. As results from ENCODE show, open chromatin assays can efficiently integrate across diverse regulatory elements, revealing functional non-coding genome. In this study, we use a MNase hypersensitivity assay to discover o...

  3. Population Genomic Analysis Reveals Highly Conserved Mitochondrial Genomes in the Yeast Species Lachancea thermotolerans

    PubMed Central

    Freel, Kelle C.; Friedrich, Anne; Hou, Jing; Schacherer, Joseph

    2014-01-01

    The increasing availability of mitochondrial (mt) sequence data from various yeasts provides a tool to study genomic evolution within and between different species. While the genomes from a range of lineages are available, there is a lack of information concerning intraspecific mtDNA diversity. Here, we analyzed the mt genomes of 50 strains from Lachancea thermotolerans, a protoploid yeast species that has been isolated from several locations (Europe, Asia, Australia, South Africa, and North / South America) and ecological sources (fruit, tree exudate, plant material, and grape and agave fermentations). Protein-coding genes from the mtDNA were used to construct a phylogeny, which reflected a similar, yet less resolved topology than the phylogenetic tree of 50 nuclear genes. In comparison to its sister species Lachancea kluyveri, L. thermotolerans has a smaller mt genome. This is due to shorter intergenic regions and fewer introns, of which the latter are only found in COX1. We revealed that L. kluyveri and L. thermotolerans share similar levels of intraspecific divergence concerning the nuclear genomes. However, L. thermotolerans has a more highly conserved mt genome with the coding regions characterized by low rates of nonsynonymous substitution. Thus, in the mt genomes of L. thermotolerans, stronger purifying selection and lower mutation rates potentially shape genome diversity in contract to what was found for L. kluyveri, demonstrating that the factors driving mt genome evolution are different even between closely related species. PMID:25212859

  4. Genomic Comparison of Escherichia coli O104:H4 Isolates from 2009 and 2011 Reveals Plasmid, and Prophage Heterogeneity, Including Shiga Toxin Encoding Phage stx2

    DTIC Science & Technology

    2012-11-01

    draft genome assemblies . We present the complete, closed genome sequences of an isolate from the 2011 outbreak (2011C–3493) and two isolates from...strains associated with the outbreak and a collection of historical E. coli O104:H4 isolates using draft genome assemblies . We present the complete...mapping tool in Genomics Workbench from CLC Bio. Variations evident in both the finished and mapped data and free of potential assembly conflicts (i.e

  5. A genome wide dosage suppressor network reveals genomic robustness

    PubMed Central

    Patra, Biranchi; Kon, Yoshiko; Yadav, Gitanjali; Sevold, Anthony W.; Frumkin, Jesse P.; Vallabhajosyula, Ravishankar R.; Hintze, Arend; Østman, Bjørn; Schossau, Jory; Bhan, Ashish; Marzolf, Bruz; Tamashiro, Jenna K.; Kaur, Amardeep; Baliga, Nitin S.; Grayhack, Elizabeth J.; Adami, Christoph; Galas, David J.; Raval, Alpan; Phizicky, Eric M.; Ray, Animesh

    2017-01-01

    Genomic robustness is the extent to which an organism has evolved to withstand the effects of deleterious mutations. We explored the extent of genomic robustness in budding yeast by genome wide dosage suppressor analysis of 53 conditional lethal mutations in cell division cycle and RNA synthesis related genes, revealing 660 suppressor interactions of which 642 are novel. This collection has several distinctive features, including high co-occurrence of mutant-suppressor pairs within protein modules, highly correlated functions between the pairs and higher diversity of functions among the co-suppressors than previously observed. Dosage suppression of essential genes encoding RNA polymerase subunits and chromosome cohesion complex suggests a surprising degree of functional plasticity of macromolecular complexes, and the existence of numerous degenerate pathways for circumventing the effects of potentially lethal mutations. These results imply that organisms and cancer are likely able to exploit the genomic robustness properties, due the persistence of cryptic gene and pathway functions, to generate variation and adapt to selective pressures. PMID:27899637

  6. Distinctive Genome Reduction Rates Revealed by Genomic Analyses of Two Coxiella-Like Endosymbionts in Ticks.

    PubMed

    Gottlieb, Yuval; Lalzar, Itai; Klasson, Lisa

    2015-05-28

    Genome reduction is a hallmark of symbiotic genomes, and the rate and patterns of gene loss associated with this process have been investigated in several different symbiotic systems. However, in long-term host-associated coevolving symbiont clades, the genome size differences between strains are normally quite small and hence patterns of large-scale genome reduction can only be inferred from distant relatives. Here we present the complete genome of a Coxiella-like symbiont from Rhipicephalus turanicus ticks (CRt), and compare it with other genomes from the genus Coxiella in order to investigate the process of genome reduction in a genus consisting of intracellular host-associated bacteria with variable genome sizes. The 1.7-Mb CRt genome is larger than the genomes of most obligate mutualists but has a very low protein-coding content (48.5%) and an extremely high number of identifiable pseudogenes, indicating that it is currently undergoing genome reduction. Analysis of encoded functions suggests that CRt is an obligate tick mutualist, as indicated by the possible provisioning of the tick with biotin (B7), riboflavin (B2) and other cofactors, and by the loss of most genes involved in host cell interactions, such as secretion systems. Comparative analyses between CRt and the 2.5 times smaller genome of Coxiella from the lone star tick Amblyomma americanum (CLEAA) show that many of the same gene functions are lost and suggest that the large size difference might be due to a higher rate of genome evolution in CLEAA generated by the loss of the mismatch repair genes mutSL. Finally, sequence polymorphisms in the CRt population sampled from field collected ticks reveal up to one distinct strain variant per tick, and analyses of mutational patterns within the population suggest that selection might be acting on synonymous sites. The CRt genome is an extreme example of a symbiont genome caught in the act of genome reduction, and the comparison between CLEAA and CRt

  7. Distinctive Genome Reduction Rates Revealed by Genomic Analyses of Two Coxiella-Like Endosymbionts in Ticks

    PubMed Central

    Gottlieb, Yuval; Lalzar, Itai; Klasson, Lisa

    2015-01-01

    Genome reduction is a hallmark of symbiotic genomes, and the rate and patterns of gene loss associated with this process have been investigated in several different symbiotic systems. However, in long-term host-associated coevolving symbiont clades, the genome size differences between strains are normally quite small and hence patterns of large-scale genome reduction can only be inferred from distant relatives. Here we present the complete genome of a Coxiella-like symbiont from Rhipicephalus turanicus ticks (CRt), and compare it with other genomes from the genus Coxiella in order to investigate the process of genome reduction in a genus consisting of intracellular host-associated bacteria with variable genome sizes. The 1.7-Mb CRt genome is larger than the genomes of most obligate mutualists but has a very low protein-coding content (48.5%) and an extremely high number of identifiable pseudogenes, indicating that it is currently undergoing genome reduction. Analysis of encoded functions suggests that CRt is an obligate tick mutualist, as indicated by the possible provisioning of the tick with biotin (B7), riboflavin (B2) and other cofactors, and by the loss of most genes involved in host cell interactions, such as secretion systems. Comparative analyses between CRt and the 2.5 times smaller genome of Coxiella from the lone star tick Amblyomma americanum (CLEAA) show that many of the same gene functions are lost and suggest that the large size difference might be due to a higher rate of genome evolution in CLEAA generated by the loss of the mismatch repair genes mutSL. Finally, sequence polymorphisms in the CRt population sampled from field collected ticks reveal up to one distinct strain variant per tick, and analyses of mutational patterns within the population suggest that selection might be acting on synonymous sites. The CRt genome is an extreme example of a symbiont genome caught in the act of genome reduction, and the comparison between CLEAA and CRt

  8. Interpreting Mammalian Evolution using Fugu Genome Comparisons

    SciTech Connect

    Stubbs, L; Ovcharenko, I; Loots, G G

    2004-04-02

    Comparative sequence analysis of the human and the pufferfish Fugu rubripes (fugu) genomes has revealed several novel functional coding and noncoding regions in the human genome. In particular, the fugu genome has been extremely valuable for identifying transcriptional regulatory elements in human loci harboring unusually high levels of evolutionary conservation to rodent genomes. In such regions, the large evolutionary distance between human and fishes provides an additional filter through which functional noncoding elements can be detected with high efficiency.

  9. Insights from Human/Mouse genome comparisons

    SciTech Connect

    Pennacchio, Len A.

    2003-03-30

    Large-scale public genomic sequencing efforts have provided a wealth of vertebrate sequence data poised to provide insights into mammalian biology. These include deep genomic sequence coverage of human, mouse, rat, zebrafish, and two pufferfish (Fugu rubripes and Tetraodon nigroviridis) (Aparicio et al. 2002; Lander et al. 2001; Venter et al. 2001; Waterston et al. 2002). In addition, a high-priority has been placed on determining the genomic sequence of chimpanzee, dog, cow, frog, and chicken (Boguski 2002). While only recently available, whole genome sequence data have provided the unique opportunity to globally compare complete genome contents. Furthermore, the shared evolutionary ancestry of vertebrate species has allowed the development of comparative genomic approaches to identify ancient conserved sequences with functionality. Accordingly, this review focuses on the initial comparison of available mammalian genomes and describes various insights derived from such analysis.

  10. Whole Genome Comparison Reveals High Levels of Inbreeding and Strain Redundancy Across the Spectrum of Commercial Wine Strains of Saccharomyces cerevisiae

    PubMed Central

    Borneman, Anthony R.; Forgan, Angus H.; Kolouchova, Radka; Fraser, James A.; Schmidt, Simon A.

    2016-01-01

    Humans have been consuming wines for more than 7000 yr . For most of this time, fermentations were presumably performed by strains of Saccharomyces cerevisiae that naturally found their way into the fermenting must . In contrast, most commercial wines are now produced by inoculation with pure yeast monocultures, ensuring consistent, reliable and reproducible fermentations, and there are now hundreds of these yeast starter cultures commercially available. In order to thoroughly investigate the genetic diversity that has been captured by over 50 yr of commercial wine yeast development and domestication, whole genome sequencing has been performed on 212 strains of S. cerevisiae, including 119 commercial wine and brewing starter strains, and wine isolates from across seven decades. Comparative genomic analysis indicates that, despite their large numbers, commercial strains, and wine strains in general, are extremely similar genetically, possessing all of the hallmarks of a population bottle-neck, and high levels of inbreeding. In addition, many commercial strains from multiple suppliers are nearly genetically identical, suggesting that the limits of effective genetic variation within this genetically narrow group may be approaching saturation. PMID:26869621

  11. Comparison of mitochondrial genome sequences of pangolins (Mammalia, Pholidota).

    PubMed

    Hassanin, Alexandre; Hugot, Jean-Pierre; van Vuuren, Bettine Jansen

    2015-04-01

    The complete mitochondrial genome was sequenced for three species of pangolins, Manis javanica, Phataginus tricuspis, and Smutsia temminckii, and comparisons were made with two other species, Manis pentadactyla and Phataginus tetradactyla. The genome of Manidae contains the 37 genes found in a typical mammalian genome, and the structure of the control region is highly conserved among species. In Manis, the overall base composition differs from that found in African genera. Phylogenetic analyses support the monophyly of the genera Manis, Phataginus, and Smutsia, as well as the basal division between Maninae and Smutsiinae. Comparisons with GenBank sequences reveal that the reference genomes of M. pentadactyla and P. tetradactyla (accession numbers NC_016008 and NC_004027) were sequenced from misidentified taxa, and that a new species of tree pangolin should be described in Gabon.

  12. The complete genome sequencing of Prevotella intermedia strain OMA14 and a subsequent fine-scale, intra-species genomic comparison reveal an unusual amplification of conjugative and mobile transposons and identify a novel Prevotella-lineage-specific repeat.

    PubMed

    Naito, Mariko; Ogura, Yoshitoshi; Itoh, Takehiko; Shoji, Mikio; Okamoto, Masaaki; Hayashi, Tetsuya; Nakayama, Koji

    2016-02-01

    Prevotella intermedia is a pathogenic bacterium involved in periodontal diseases. Here, we present the complete genome sequence of a clinical strain, OMA14, of this bacterium along with the results of comparative genome analysis with strain 17 of the same species whose genome has also been sequenced, but not fully analysed yet. The genomes of both strains consist of two circular chromosomes: the larger chromosomes are similar in size and exhibit a high overall linearity of gene organizations, whereas the smaller chromosomes show a significant size variation and have undergone remarkable genome rearrangements. Unique features of the Pre. intermedia genomes are the presence of a remarkable number of essential genes on the second chromosomes and the abundance of conjugative and mobilizable transposons (CTns and MTns). The CTns/MTns are particularly abundant in the second chromosomes, involved in its extensive genome rearrangement, and have introduced a number of strain-specific genes into each strain. We also found a novel 188-bp repeat sequence that has been highly amplified in Pre. intermedia and are specifically distributed among the Pre. intermedia-related species. These findings expand our understanding of the genetic features of Pre. intermedia and the roles of CTns and MTns in the evolution of bacteria.

  13. Direct comparison between genomic constitution and flavonoid contents in Allium multiple alien addition lines reveals chromosomal locations of genes related to biosynthesis from dihydrokaempferol to quercetin glucosides in scaly leaf of shallot (Allium cepa L.).

    PubMed

    Masuzaki, S; Shigyo, M; Yamauchi, N

    2006-02-01

    The extrachromosome 5A of shallot (Allium cepa L., genomes AA) has an important role in flavonoid biosynthesis in the scaly leaf of Allium fistulosum-shallot monosomic addition lines (FF+nA). This study deals with the production and biochemical characterisation of A. fistulosum-shallot multiple alien addition lines carrying at least 5A to determine the chromosomal locations of genes for quercetin formation. The multiple alien additions were selected from the crossing between allotriploid FFA (female symbol) and A. fistulosum (male symbol). The 113 plants obtained from this cross were analysed by a chromosome 5A-specific PGI isozyme marker of shallot. Thirty plants were preliminarily selected for an alien addition carrying 5A. The chromosome numbers of the 30 plants varied from 18 to 23. The other extrachromosomes in 19 plants were completely identified by using seven other chromosome markers of shallot. High-performance liquid chromatography analyses of the 19 multiple additions were conducted to identify the flavonoid compounds produced in the scaly leaves. Direct comparisons between the chromosomal constitution and the flavonoid contents of the multiple alien additions revealed that a flavonoid 3'-hydroxylase (F3'H) gene for the synthesis of quercetin from kaempferol was located on 7A and that an anonymous gene involved in the glucosidation of quercetin was on 3A or 4A. As a result of supplemental SCAR analyses by using genomic DNAs from two complete sets of A. fistulosum-shallot monosomic additions, we have assigned F3'H to 7A and flavonol synthase to 4A.

  14. Advancing Eucalyptus Genomics: Cytogenomics Reveals Conservation of Eucalyptus Genomes

    PubMed Central

    Ribeiro, Teresa; Barrela, Ricardo M.; Bergès, Hélène; Marques, Cristina; Loureiro, João; Morais-Cecílio, Leonor; Paiva, Jorge A. P.

    2016-01-01

    The genus Eucalyptus encloses several species with high ecological and economic value, being the subgenus Symphyomyrtus one of the most important. Species such as E. grandis and E. globulus are well characterized at the molecular level but knowledge regarding genome and chromosome organization is very scarce. Here we characterized and compared the karyotypes of three economically important species, E. grandis, E. globulus, and E. calmadulensis, and three with ecological relevance, E. pulverulenta, E. cornuta, and E. occidentalis, through an integrative approach including genome size estimation, fluorochrome banding, rDNA FISH, and BAC landing comprising genes involved in lignin biosynthesis. All karyotypes show a high degree of conservation with pericentromeric 35S and 5S rDNA loci in the first and third pairs, respectively. GC-rich heterochromatin was restricted to the 35S rDNA locus while the AT-rich heterochromatin pattern was species-specific. The slight differences in karyotype formulas and distribution of AT-rich heterochromatin, along with genome sizes estimations, support the idea of Eucalyptus genome evolution by local expansions of heterochromatin clusters. The unusual co-localization of both rDNA with AT-rich heterochromatin was attributed mainly to the presence of silent transposable elements in those loci. The cinnamoyl CoA reductase gene (CCR1) previously assessed to linkage group 10 (LG10) was clearly localized distally at the long arm of chromosome 9 establishing an unexpected correlation between the cytogenetic chromosome 9 and the LG10. Our work is novel and contributes to the understanding of Eucalyptus genome organization which is essential to develop successful advanced breeding strategies for this genus. PMID:27148332

  15. Genome-Wide Scan Reveals Mutation Associated with Melanoma

    MedlinePlus

    ... historical) Genome-Wide Scan Reveals Mutation Associated with Melanoma A team of international researchers supported by the ... when they divide and grow uncontrollably, develop into melanoma. Also, MITF activity is known to be amplified ...

  16. Understanding the recent evolution of the human genome: insights from human-chimpanzee genome comparisons.

    PubMed

    Kehrer-Sawatzki, Hildegard; Cooper, David N

    2007-02-01

    The sequencing of the chimpanzee genome and the comparison with its human counterpart have begun to reveal the spectrum of genetic changes that has accompanied human evolution. In addition to gross karyotypic rearrangements such as the fusion that formed human chromosome 2 and the human-specific pericentric inversions of chromosomes 1 and 18, there is considerable submicroscopic structural variation involving deletions, duplications, and inversions. Lineage-specific segmental duplications, detected by array comparative genomic hybridization and direct sequence comparison, have made a very significant contribution to this structural divergence, which is at least three-fold greater than that due to nucleotide substitutions. Since structural genomic changes may have given rise to irreversible functional differences between the diverging species, their detailed analysis could help to identify the biological processes that have accompanied speciation. To this end, interspecies comparisons have revealed numerous human-specific gains and losses of genes as well as changes in gene expression. The very considerable structural diversity (polymorphism) evident within both lineages has, however, hampered the analysis of the structural divergence between the human and chimpanzee genomes. The concomitant evaluation of genetic divergence and diversity at the nucleotide level has nevertheless served to identify many genes that have evolved under positive selection and may thus have been involved in the development of human lineage-specific traits. Genes that display signs of weak negative selection have also been identified and could represent candidate loci for complex genomic disorders. Here, we review recent progress in comparing the human and chimpanzee genomes and discuss how the differences detected have improved our understanding of the evolution of the human genome.

  17. Comparative genomic hybridizations reveal absence of large Streptomyces coelicolor genomic islands in Streptomyces lividans

    PubMed Central

    Jayapal, Karthik P; Lian, Wei; Glod, Frank; Sherman, David H; Hu, Wei-Shou

    2007-01-01

    Background The genomes of Streptomyces coelicolor and Streptomyces lividans bear a considerable degree of synteny. While S. coelicolor is the model streptomycete for studying antibiotic synthesis and differentiation, S. lividans is almost exclusively considered as the preferred host, among actinomycetes, for cloning and expression of exogenous DNA. We used whole genome microarrays as a comparative genomics tool for identifying the subtle differences between these two chromosomes. Results We identified five large S. coelicolor genomic islands (larger than 25 kb) and 18 smaller islets absent in S. lividans chromosome. Many of these regions show anomalous GC bias and codon usage patterns. Six of them are in close vicinity of tRNA genes while nine are flanked with near perfect repeat sequences indicating that these are probable recent evolutionary acquisitions into S. coelicolor. Embedded within these segments are at least four DNA methylases and two probable methyl-sensing restriction endonucleases. Comparison with S. coelicolor transcriptome and proteome data revealed that some of the missing genes are active during the course of growth and differentiation in S. coelicolor. In particular, a pair of methylmalonyl CoA mutase (mcm) genes involved in polyketide precursor biosynthesis, an acyl-CoA dehydrogenase implicated in timing of actinorhodin synthesis and bldB, a developmentally significant regulator whose mutation causes complete abrogation of antibiotic synthesis belong to this category. Conclusion Our findings provide tangible hints for elucidating the genetic basis of important phenotypic differences between these two streptomycetes. Importantly, absence of certain genes in S. lividans identified here could potentially explain the relative ease of DNA transformations and the conditional lack of actinorhodin synthesis in S. lividans. PMID:17623098

  18. In silico Comparison of 19 Porphyromonas gingivalis Strains in Genomics, Phylogenetics, Phylogenomics and Functional Genomics.

    PubMed

    Chen, Tsute; Siddiqui, Huma; Olsen, Ingar

    2017-01-01

    Currently, genome sequences of a total of 19 Porphyromonas gingivalis strains are available, including eight completed genomes (strains W83, ATCC 33277, TDC60, HG66, A7436, AJW4, 381, and A7A1-28) and 11 high-coverage draft sequences (JCVI SC001, F0185, F0566, F0568, F0569, F0570, SJD2, W4087, W50, Ando, and MP4-504) that are assembled into fewer than 300 contigs. The objective was to compare these genomes at both nucleotide and protein sequence levels in order to understand their phylogenetic and functional relatedness. Four copies of 16S rRNA gene sequences were identified in each of the eight complete genomes and one in the other 11 unfinished genomes. These 43 16S rRNA sequences represent only 24 unique sequences and the derived phylogenetic tree suggests a possible evolutionary history for these strains. Phylogenomic comparison based on shared proteins and whole genome nucleotide sequences consistently showed two groups with closely related members: one consisted of ATCC 33277, 381, and HG66, another of W83, W50, and A7436. At least 1,037 core/shared proteins were identified in the 19 P. gingivalis genomes based on the most stringent detecting parameters. Comparative functional genomics based on genome-wide comparisons between NCBI and RAST annotations, as well as additional approaches, revealed functions that are unique or missing in individual P. gingivalis strains, or species-specific in all P. gingivalis strains, when compared to a neighboring species P. asaccharolytica. All the comparative results of this study are available online for download at ftp://www.homd.org/publication_data/20160425/.

  19. In silico Comparison of 19 Porphyromonas gingivalis Strains in Genomics, Phylogenetics, Phylogenomics and Functional Genomics

    PubMed Central

    Chen, Tsute; Siddiqui, Huma; Olsen, Ingar

    2017-01-01

    Currently, genome sequences of a total of 19 Porphyromonas gingivalis strains are available, including eight completed genomes (strains W83, ATCC 33277, TDC60, HG66, A7436, AJW4, 381, and A7A1-28) and 11 high-coverage draft sequences (JCVI SC001, F0185, F0566, F0568, F0569, F0570, SJD2, W4087, W50, Ando, and MP4-504) that are assembled into fewer than 300 contigs. The objective was to compare these genomes at both nucleotide and protein sequence levels in order to understand their phylogenetic and functional relatedness. Four copies of 16S rRNA gene sequences were identified in each of the eight complete genomes and one in the other 11 unfinished genomes. These 43 16S rRNA sequences represent only 24 unique sequences and the derived phylogenetic tree suggests a possible evolutionary history for these strains. Phylogenomic comparison based on shared proteins and whole genome nucleotide sequences consistently showed two groups with closely related members: one consisted of ATCC 33277, 381, and HG66, another of W83, W50, and A7436. At least 1,037 core/shared proteins were identified in the 19 P. gingivalis genomes based on the most stringent detecting parameters. Comparative functional genomics based on genome-wide comparisons between NCBI and RAST annotations, as well as additional approaches, revealed functions that are unique or missing in individual P. gingivalis strains, or species-specific in all P. gingivalis strains, when compared to a neighboring species P. asaccharolytica. All the comparative results of this study are available online for download at ftp://www.homd.org/publication_data/20160425/. PMID:28261563

  20. Gorilla genome structural variation reveals evolutionary parallelisms with chimpanzee.

    PubMed

    Ventura, Mario; Catacchio, Claudia R; Alkan, Can; Marques-Bonet, Tomas; Sajjadian, Saba; Graves, Tina A; Hormozdiari, Fereydoun; Navarro, Arcadi; Malig, Maika; Baker, Carl; Lee, Choli; Turner, Emily H; Chen, Lin; Kidd, Jeffrey M; Archidiacono, Nicoletta; Shendure, Jay; Wilson, Richard K; Eichler, Evan E

    2011-10-01

    Structural variation has played an important role in the evolutionary restructuring of human and great ape genomes. Recent analyses have suggested that the genomes of chimpanzee and human have been particularly enriched for this form of genetic variation. Here, we set out to assess the extent of structural variation in the gorilla lineage by generating 10-fold genomic sequence coverage from a western lowland gorilla and integrating these data into a physical and cytogenetic framework of structural variation. We discovered and validated over 7665 structural changes within the gorilla lineage, including sequence resolution of inversions, deletions, duplications, and mobile element insertions. A comparison with human and other ape genomes shows that the gorilla genome has been subjected to the highest rate of segmental duplication. We show that both the gorilla and chimpanzee genomes have experienced independent yet convergent patterns of structural mutation that have not occurred in humans, including the formation of subtelomeric heterochromatic caps, the hyperexpansion of segmental duplications, and bursts of retroviral integrations. Our analysis suggests that the chimpanzee and gorilla genomes are structurally more derived than either orangutan or human genomes.

  1. Gorilla genome structural variation reveals evolutionary parallelisms with chimpanzee

    PubMed Central

    Ventura, Mario; Catacchio, Claudia R.; Alkan, Can; Marques-Bonet, Tomas; Sajjadian, Saba; Graves, Tina A.; Hormozdiari, Fereydoun; Navarro, Arcadi; Malig, Maika; Baker, Carl; Lee, Choli; Turner, Emily H.; Chen, Lin; Kidd, Jeffrey M.; Archidiacono, Nicoletta; Shendure, Jay; Wilson, Richard K.; Eichler, Evan E.

    2011-01-01

    Structural variation has played an important role in the evolutionary restructuring of human and great ape genomes. Recent analyses have suggested that the genomes of chimpanzee and human have been particularly enriched for this form of genetic variation. Here, we set out to assess the extent of structural variation in the gorilla lineage by generating 10-fold genomic sequence coverage from a western lowland gorilla and integrating these data into a physical and cytogenetic framework of structural variation. We discovered and validated over 7665 structural changes within the gorilla lineage, including sequence resolution of inversions, deletions, duplications, and mobile element insertions. A comparison with human and other ape genomes shows that the gorilla genome has been subjected to the highest rate of segmental duplication. We show that both the gorilla and chimpanzee genomes have experienced independent yet convergent patterns of structural mutation that have not occurred in humans, including the formation of subtelomeric heterochromatic caps, the hyperexpansion of segmental duplications, and bursts of retroviral integrations. Our analysis suggests that the chimpanzee and gorilla genomes are structurally more derived than either orangutan or human genomes. PMID:21685127

  2. Comparison of the genome of the oral pathogen Treponema denticola with other spirochete genomes

    PubMed Central

    Seshadri, Rekha; Myers, Garry S. A.; Tettelin, Hervé; Eisen, Jonathan A.; Heidelberg, John F.; Dodson, Robert J.; Davidsen, Tanja M.; DeBoy, Robert T.; Fouts, Derrick E.; Haft, Dan H.; Selengut, Jeremy; Ren, Qinghu; Brinkac, Lauren M.; Madupu, Ramana; Kolonay, Jamie; Durkin, Scott A.; Daugherty, Sean C.; Shetty, Jyoti; Shvartsbeyn, Alla; Gebregeorgis, Elizabeth; Geer, Keita; Tsegaye, Getahun; Malek, Joel; Ayodeji, Bola; Shatsman, Sofiya; McLeod, Michael P.; Šmajs, David; Howell, Jerrilyn K.; Pal, Sangita; Amin, Anita; Vashisth, Pankaj; McNeill, Thomas Z.; Xiang, Qin; Sodergren, Erica; Baca, Ernesto; Weinstock, George M.; Norris, Steven J.; Fraser, Claire M.; Paulsen, Ian T.

    2004-01-01

    We present the complete 2,843,201-bp genome sequence of Treponema denticola (ATCC 35405) an oral spirochete associated with periodontal disease. Analysis of the T. denticola genome reveals factors mediating coaggregation, cell signaling, stress protection, and other competitive and cooperative measures, consistent with its pathogenic nature and lifestyle within the mixed-species environment of subgingival dental plaque. Comparisons with previously sequenced spirochete genomes revealed specific factors contributing to differences and similarities in spirochete physiology as well as pathogenic potential. The T. denticola genome is considerably larger in size than the genome of the related syphilis-causing spirochete Treponema pallidum. The differences in gene content appear to be attributable to a combination of three phenomena: genome reduction, lineage-specific expansions, and horizontal gene transfer. Genes lost due to reductive evolution appear to be largely involved in metabolism and transport, whereas some of the genes that have arisen due to lineage-specific expansions are implicated in various pathogenic interactions, and genes acquired via horizontal gene transfer are largely phage-related or of unknown function. PMID:15064399

  3. Genome-wide comparison of medieval and modern Mycobacterium leprae.

    PubMed

    Schuenemann, Verena J; Singh, Pushpendra; Mendum, Thomas A; Krause-Kyora, Ben; Jäger, Günter; Bos, Kirsten I; Herbig, Alexander; Economou, Christos; Benjak, Andrej; Busso, Philippe; Nebel, Almut; Boldsen, Jesper L; Kjellström, Anna; Wu, Huihai; Stewart, Graham R; Taylor, G Michael; Bauer, Peter; Lee, Oona Y-C; Wu, Houdini H T; Minnikin, David E; Besra, Gurdyal S; Tucker, Katie; Roffey, Simon; Sow, Samba O; Cole, Stewart T; Nieselt, Kay; Krause, Johannes

    2013-07-12

    Leprosy was endemic in Europe until the Middle Ages. Using DNA array capture, we have obtained genome sequences of Mycobacterium leprae from skeletons of five medieval leprosy cases from the United Kingdom, Sweden, and Denmark. In one case, the DNA was so well preserved that full de novo assembly of the ancient bacterial genome could be achieved through shotgun sequencing alone. The ancient M. leprae sequences were compared with those of 11 modern strains, representing diverse genotypes and geographic origins. The comparisons revealed remarkable genomic conservation during the past 1000 years, a European origin for leprosy in the Americas, and the presence of an M. leprae genotype in medieval Europe now commonly associated with the Middle East. The exceptional preservation of M. leprae biomarkers, both DNA and mycolic acids, in ancient skeletons has major implications for palaeomicrobiology and human pathogen evolution.

  4. The genome of Tetranychus urticae reveals herbivorous pest adaptations

    PubMed Central

    Grbić, Miodrag; Van Leeuwen, Thomas; Clark, Richard M.; Rombauts, Stephane; Rouzé, Pierre; Grbić, Vojislava; Osborne, Edward J.; Dermauw, Wannes; Ngoc, Phuong Cao Thi; Ortego, Félix; Hernández-Crespo, Pedro; Diaz, Isabel; Martinez, Manuel; Navajas, Maria; Sucena, Élio; Magalhães, Sara; Nagy, Lisa; Pace, Ryan M.; Djuranović, Sergej; Smagghe, Guy; Iga, Masatoshi; Christiaens, Olivier; Veenstra, Jan A.; Ewer, John; Villalobos, Rodrigo Mancilla; Hutter, Jeffrey L.; Hudson, Stephen D.; Velez, Marisela; Yi, Soojin V.; Zeng, Jia; Pires-daSilva, Andre; Roch, Fernando; Cazaux, Marc; Navarro, Marie; Zhurov, Vladimir; Acevedo, Gustavo; Bjelica, Anica; Fawcett, Jeffrey A.; Bonnet, Eric; Martens, Cindy; Baele, Guy; Wissler, Lothar; Sanchez-Rodriguez, Aminael; Tirry, Luc; Blais, Catherine; Demeestere, Kristof; Henz, Stefan R.; Gregory, T. Ryan; Mathieu, Johannes; Verdon, Lou; Farinelli, Laurent; Schmutz, Jeremy; Lindquist, Erika; Feyereisen, René; Van de Peer, Yves

    2016-01-01

    The spider mite Tetranychus urticae is a cosmopolitan agricultural pest with an extensive host plant range and an extreme record of pesticide resistance. Here we present the completely sequenced and annotated spider mite genome, representing the first complete chelicerate genome. At 90 megabases T. urticae has the smallest sequenced arthropod genome. Compared with other arthropods, the spider mite genome shows unique changes in the hormonal environment and organization of the Hox complex, and also reveals evolutionary innovation of silk production. We find strong signatures of polyphagy and detoxification in gene families associated with feeding on different hosts and in new gene families acquired by lateral gene transfer. Deep transcriptome analysis of mites feeding on different plants shows how this pest responds to a changing host environment. The T. urticae genome thus offers new insights into arthropod evolution and plant–herbivore interactions, and provides unique opportunities for developing novel plant protection strategies. PMID:22113690

  5. Hybridization Reveals the Evolving Genomic Architecture of Speciation

    PubMed Central

    Kronforst, Marcus R.; Hansen, Matthew E.B.; Crawford, Nicholas G.; Gallant, Jason R.; Zhang, Wei; Kulathinal, Rob J.; Kapan, Durrell D.; Mullen, Sean P.

    2014-01-01

    SUMMARY The rate at which genomes diverge during speciation is unknown, as are the physical dynamics of the process. Here, we compare full genome sequences of 32 butterflies, representing five species from a hybridizing Heliconius butterfly community, to examine genome-wide patterns of introgression and infer how divergence evolves during the speciation process. Our analyses reveal that initial divergence is restricted to a small fraction of the genome, largely clustered around known wing-patterning genes. Over time, divergence evolves rapidly, due primarily to the origin of new divergent regions. Furthermore, divergent genomic regions display signatures of both selection and adaptive introgression, demonstrating the link between microevolutionary processes acting within species and the origin of species across macroevolutionary timescales. Our results provide a uniquely comprehensive portrait of the evolving species boundary due to the role that hybridization plays in reducing the background accumulation of divergence at neutral sites. PMID:24183670

  6. Whole-genome analyses reveal genetic instability of Acetobacter pasteurianus

    PubMed Central

    Azuma, Yoshinao; Hosoyama, Akira; Matsutani, Minenosuke; Furuya, Naoko; Horikawa, Hiroshi; Harada, Takeshi; Hirakawa, Hideki; Kuhara, Satoru; Matsushita, Kazunobu; Fujita, Nobuyuki; Shirai, Mutsunori

    2009-01-01

    Acetobacter species have been used for brewing traditional vinegar and are known to have genetic instability. To clarify the mutability, Acetobacter pasteurianus NBRC 3283, which forms a multi-phenotype cell complex, was subjected to genome DNA sequencing. The genome analysis revealed that there are more than 280 transposons and five genes with hyper-mutable tandem repeats as common features in the genome consisting of a 2.9-Mb chromosome and six plasmids. There were three single nucleotide mutations and five transposon insertions in 32 isolates from the cell complex. The A. pasteurianus hyper-mutability was applied for breeding a temperature-resistant strain grown at an unviable high-temperature (42°C). The genomic DNA sequence of a heritable mutant showing temperature resistance was analyzed by mutation mapping, illustrating that a 92-kb deletion and three single nucleotide mutations occurred in the genome during the adaptation. Alpha-proteobacteria including A. pasteurianus consists of many intracellular symbionts and parasites, and their genomes show increased evolution rates and intensive genome reduction. However, A. pasteurianus is assumed to be a free-living bacterium, it may have the potentiality to evolve to fit in natural niches of seasonal fruits and flowers with other organisms, such as yeasts and lactic acid bacteria. PMID:19638423

  7. Integrated genomics of Mucorales reveals novel therapeutic targets

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Mucormycosis is a life-threatening infection caused by Mucorales fungi. We sequenced 30 fungal genomes and performed transcriptomics with three representative Rhizopus and Mucor strains with human airway epithelial cells during fungal invasion to reveal key host and fungal determinants contributing ...

  8. Klebsormidium flaccidum genome reveals primary factors for plant terrestrial adaptation.

    PubMed

    Hori, Koichi; Maruyama, Fumito; Fujisawa, Takatomo; Togashi, Tomoaki; Yamamoto, Nozomi; Seo, Mitsunori; Sato, Syusei; Yamada, Takuji; Mori, Hiroshi; Tajima, Naoyuki; Moriyama, Takashi; Ikeuchi, Masahiko; Watanabe, Mai; Wada, Hajime; Kobayashi, Koichi; Saito, Masakazu; Masuda, Tatsuru; Sasaki-Sekimoto, Yuko; Mashiguchi, Kiyoshi; Awai, Koichiro; Shimojima, Mie; Masuda, Shinji; Iwai, Masako; Nobusawa, Takashi; Narise, Takafumi; Kondo, Satoshi; Saito, Hikaru; Sato, Ryoichi; Murakawa, Masato; Ihara, Yuta; Oshima-Yamada, Yui; Ohtaka, Kinuka; Satoh, Masanori; Sonobe, Kohei; Ishii, Midori; Ohtani, Ryosuke; Kanamori-Sato, Miyu; Honoki, Rina; Miyazaki, Daichi; Mochizuki, Hitoshi; Umetsu, Jumpei; Higashi, Kouichi; Shibata, Daisuke; Kamiya, Yuji; Sato, Naoki; Nakamura, Yasukazu; Tabata, Satoshi; Ida, Shigeru; Kurokawa, Ken; Ohta, Hiroyuki

    2014-05-28

    The colonization of land by plants was a key event in the evolution of life. Here we report the draft genome sequence of the filamentous terrestrial alga Klebsormidium flaccidum (Division Charophyta, Order Klebsormidiales) to elucidate the early transition step from aquatic algae to land plants. Comparison of the genome sequence with that of other algae and land plants demonstrate that K. flaccidum acquired many genes specific to land plants. We demonstrate that K. flaccidum indeed produces several plant hormones and homologues of some of the signalling intermediates required for hormone actions in higher plants. The K. flaccidum genome also encodes a primitive system to protect against the harmful effects of high-intensity light. The presence of these plant-related systems in K. flaccidum suggests that, during evolution, this alga acquired the fundamental machinery required for adaptation to terrestrial environments.

  9. Proteomics and comparative genomics of Nitrososphaera viennensis reveal the core genome and adaptations of archaeal ammonia oxidizers

    PubMed Central

    Kerou, Melina; Offre, Pierre; Valledor, Luis; Abby, Sophie S.; Melcher, Michael; Nagler, Matthias; Weckwerth, Wolfram; Schleper, Christa

    2016-01-01

    Ammonia-oxidizing archaea (AOA) are among the most abundant microorganisms and key players in the global nitrogen and carbon cycles. They share a common energy metabolism but represent a heterogeneous group with respect to their environmental distribution and adaptions, growth requirements, and genome contents. We report here the genome and proteome of Nitrososphaera viennensis EN76, the type species of the archaeal class Nitrososphaeria of the phylum Thaumarchaeota encompassing all known AOA. N. viennensis is a soil organism with a 2.52-Mb genome and 3,123 predicted protein-coding genes. Proteomic analysis revealed that nearly 50% of the predicted genes were translated under standard laboratory growth conditions. Comparison with genomes of closely related species of the predominantly terrestrial Nitrososphaerales as well as the more streamlined marine Nitrosopumilales [Candidatus (Ca.) order] and the acidophile “Ca. Nitrosotalea devanaterra” revealed a core genome of AOA comprising 860 genes, which allowed for the reconstruction of central metabolic pathways common to all known AOA and expressed in the N. viennensis and “Ca. Nitrosopelagicus brevis” proteomes. Concomitantly, we were able to identify candidate proteins for as yet unidentified crucial steps in central metabolisms. In addition to unraveling aspects of core AOA metabolism, we identified specific metabolic innovations associated with the Nitrososphaerales mediating growth and survival in the soil milieu, including the capacity for biofilm formation, cell surface modifications and cell adhesion, and carbohydrate conversions as well as detoxification of aromatic compounds and drugs. PMID:27864514

  10. Genome Sequencing Reveals a Phage in Helicobacter pylori

    PubMed Central

    Lehours, Philippe; Vale, Filipa F.; Bjursell, Magnus K.; Melefors, Ojar; Advani, Reza; Glavas, Steve; Guegueniat, Julia; Gontier, Etienne; Lacomme, Sabrina; Alves Matos, António; Menard, Armelle; Mégraud, Francis; Engstrand, Lars; Andersson, Anders F.

    2011-01-01

    ABSTRACT Helicobacter pylori chronically infects the gastric mucosa in more than half of the human population; in a subset of this population, its presence is associated with development of severe disease, such as gastric cancer. Genomic analysis of several strains has revealed an extensive H. pylori pan-genome, likely to grow as more genomes are sampled. Here we describe the draft genome sequence (63 contigs; 26× mean coverage) of H. pylori strain B45, isolated from a patient with gastric mucosa-associated lymphoid tissue (MALT) lymphoma. The major finding was a 24.6-kb prophage integrated in the bacterial genome. The prophage shares most of its genes (22/27) with prophage region II of Helicobacter acinonychis strain Sheeba. After UV treatment of liquid cultures, circular DNA carrying the prophage integrase gene could be detected, and intracellular tailed phage-like particles were observed in H. pylori cells by transmission electron microscopy, indicating that phage production can be induced from the prophage. PCR amplification and sequencing of the integrase gene from 341 H. pylori strains from different geographic regions revealed a high prevalence of the prophage (21.4%). Phylogenetic reconstruction showed four distinct clusters in the integrase gene, three of which tended to be specific for geographic regions. Our study implies that phages may play important roles in the ecology and evolution of H. pylori. PMID:22086490

  11. Modeling malaria genomics reveals transmission decline and rebound in Senegal.

    PubMed

    Daniels, Rachel F; Schaffner, Stephen F; Wenger, Edward A; Proctor, Joshua L; Chang, Hsiao-Han; Wong, Wesley; Baro, Nicholas; Ndiaye, Daouda; Fall, Fatou Ba; Ndiop, Medoune; Ba, Mady; Milner, Danny A; Taylor, Terrie E; Neafsey, Daniel E; Volkman, Sarah K; Eckhoff, Philip A; Hartl, Daniel L; Wirth, Dyann F

    2015-06-02

    To study the effects of malaria-control interventions on parasite population genomics, we examined a set of 1,007 samples of the malaria parasite Plasmodium falciparum collected in Thiès, Senegal between 2006 and 2013. The parasite samples were genotyped using a molecular barcode of 24 SNPs. About 35% of the samples grouped into subsets with identical barcodes, varying in size by year and sometimes persisting across years. The barcodes also formed networks of related groups. Analysis of 164 completely sequenced parasites revealed extensive sharing of genomic regions. In at least two cases we found first-generation recombinant offspring of parents whose genomes are similar or identical to genomes also present in the sample. An epidemiological model that tracks parasite genotypes can reproduce the observed pattern of barcode subsets. Quantification of likelihoods in the model strongly suggests a reduction of transmission from 2006-2010 with a significant rebound in 2012-2013. The reduced transmission and rebound were confirmed directly by incidence data from Thiès. These findings imply that intensive intervention to control malaria results in rapid and dramatic changes in parasite population genomics. The results also suggest that genomics combined with epidemiological modeling may afford prompt, continuous, and cost-effective tracking of progress toward malaria elimination.

  12. Camelid genomes reveal evolution and adaptation to desert environments.

    PubMed

    Wu, Huiguang; Guang, Xuanmin; Al-Fageeh, Mohamed B; Cao, Junwei; Pan, Shengkai; Zhou, Huanmin; Zhang, Li; Abutarboush, Mohammed H; Xing, Yanping; Xie, Zhiyuan; Alshanqeeti, Ali S; Zhang, Yanru; Yao, Qiulin; Al-Shomrani, Badr M; Zhang, Dong; Li, Jiang; Manee, Manee M; Yang, Zili; Yang, Linfeng; Liu, Yiyi; Zhang, Jilin; Altammami, Musaad A; Wang, Shenyuan; Yu, Lili; Zhang, Wenbin; Liu, Sanyang; Ba, La; Liu, Chunxia; Yang, Xukui; Meng, Fanhua; Wang, Shaowei; Li, Lu; Li, Erli; Li, Xueqiong; Wu, Kaifeng; Zhang, Shu; Wang, Junyi; Yin, Ye; Yang, Huanming; Al-Swailem, Abdulaziz M; Wang, Jun

    2014-10-21

    Bactrian camel (Camelus bactrianus), dromedary (Camelus dromedarius) and alpaca (Vicugna pacos) are economically important livestock. Although the Bactrian camel and dromedary are large, typically arid-desert-adapted mammals, alpacas are adapted to plateaus. Here we present high-quality genome sequences of these three species. Our analysis reveals the demographic history of these species since the Tortonian Stage of the Miocene and uncovers a striking correlation between large fluctuations in population size and geological time boundaries. Comparative genomic analysis reveals complex features related to desert adaptations, including fat and water metabolism, stress responses to heat, aridity, intense ultraviolet radiation and choking dust. Transcriptomic analysis of Bactrian camels further reveals unique osmoregulation, osmoprotection and compensatory mechanisms for water reservation underpinned by high blood glucose levels. We hypothesize that these physiological mechanisms represent kidney evolutionary adaptations to the desert environment. This study advances our understanding of camelid evolution and the adaptation of camels to arid-desert environments.

  13. When COI barcodes deceive: complete genomes reveal introgression in hairstreaks.

    PubMed

    Cong, Qian; Shen, Jinhui; Borek, Dominika; Robbins, Robert K; Opler, Paul A; Otwinowski, Zbyszek; Grishin, Nick V

    2017-02-08

    Two species of hairstreak butterflies from the genus Calycopis are known in the United States: C. cecrops and C. isobeon Analysis of mitochondrial COI barcodes of Calycopis revealed cecrops-like specimens from the eastern US with atypical barcodes that were 2.6% different from either USA species, but similar to Central American Calycopis species. To address the possibility that the specimens with atypical barcodes represent an undescribed cryptic species, we sequenced complete genomes of 27 Calycopis specimens of four species: C. cecrops, C. isobeon, C. quintana and C. bactra Some of these specimens were collected up to 60 years ago and preserved dry in museum collections, but nonetheless produced genomes as complete as fresh samples. Phylogenetic trees reconstructed using the whole mitochondrial and nuclear genomes were incongruent. While USA Calycopis with atypical barcodes grouped with Central American species C. quintana by mitochondria, nuclear genome trees placed them within typical USA C. cecrops in agreement with morphology, suggesting mitochondrial introgression. Nuclear genomes also show introgression, especially between C. cecrops and C. isobeon About 2.3% of each C. cecrops genome has probably (p-value < 0.01, FDR < 0.1) introgressed from C. isobeon and about 3.4% of each C. isobeon genome may have come from C. cecrops. The introgressed regions are enriched in genes encoding transmembrane proteins, mitochondria-targeting proteins and components of the larval cuticle. This study provides the first example of mitochondrial introgression in Lepidoptera supported by complete genome sequencing. Our results caution about relying solely on COI barcodes and mitochondrial DNA for species identification or discovery.

  14. Integrated Syntenic and Phylogenomic Analyses Reveal an Ancient Genome Duplication in Monocots[W

    PubMed Central

    Jiao, Yuannian; Li, Jingping; Tang, Haibao; Paterson, Andrew H.

    2014-01-01

    Unraveling widespread polyploidy events throughout plant evolution is a necessity for inferring the impacts of whole-genome duplication (WGD) on speciation, functional innovations, and to guide identification of true orthologs in divergent taxa. Here, we employed an integrated syntenic and phylogenomic analyses to reveal an ancient WGD that shaped the genomes of all commelinid monocots, including grasses, bromeliads, bananas (Musa acuminata), ginger, palms, and other plants of fundamental, agricultural, and/or horticultural interest. First, comprehensive phylogenomic analyses revealed 1421 putative gene families that retained ancient duplication shared by Musa (Zingiberales) and grass (Poales) genomes, indicating an ancient WGD in monocots. Intergenomic synteny blocks of Musa and Oryza were investigated, and 30 blocks were shown to be duplicated before Musa-Oryza divergence an estimated 120 to 150 million years ago. Synteny comparisons of four monocot (rice [Oryza sativa], sorghum [Sorghum bicolor], banana, and oil palm [Elaeis guineensis]) and two eudicot (grape [Vitis vinifera] and sacred lotus [Nelumbo nucifera]) genomes also support this additional WGD in monocots, herein called Tau (τ). Integrating synteny and phylogenomic comparisons achieves better resolution of ancient polyploidy events than either approach individually, a principle that is exemplified in the disambiguation of a WGD series of rho (ρ)-sigma (σ)-tau (τ) in the grass lineages that echoes the alpha (α)-beta (β)-gamma (γ) series previously revealed in the Arabidopsis thaliana lineage. PMID:25082857

  15. The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes.

    PubMed

    Liu, Shengyi; Liu, Yumei; Yang, Xinhua; Tong, Chaobo; Edwards, David; Parkin, Isobel A P; Zhao, Meixia; Ma, Jianxin; Yu, Jingyin; Huang, Shunmou; Wang, Xiyin; Wang, Junyi; Lu, Kun; Fang, Zhiyuan; Bancroft, Ian; Yang, Tae-Jin; Hu, Qiong; Wang, Xinfa; Yue, Zhen; Li, Haojie; Yang, Linfeng; Wu, Jian; Zhou, Qing; Wang, Wanxin; King, Graham J; Pires, J Chris; Lu, Changxin; Wu, Zhangyan; Sampath, Perumal; Wang, Zhuo; Guo, Hui; Pan, Shengkai; Yang, Limei; Min, Jiumeng; Zhang, Dong; Jin, Dianchuan; Li, Wanshun; Belcram, Harry; Tu, Jinxing; Guan, Mei; Qi, Cunkou; Du, Dezhi; Li, Jiana; Jiang, Liangcai; Batley, Jacqueline; Sharpe, Andrew G; Park, Beom-Seok; Ruperao, Pradeep; Cheng, Feng; Waminal, Nomar Espinosa; Huang, Yin; Dong, Caihua; Wang, Li; Li, Jingping; Hu, Zhiyong; Zhuang, Mu; Huang, Yi; Huang, Junyan; Shi, Jiaqin; Mei, Desheng; Liu, Jing; Lee, Tae-Ho; Wang, Jinpeng; Jin, Huizhe; Li, Zaiyun; Li, Xun; Zhang, Jiefu; Xiao, Lu; Zhou, Yongming; Liu, Zhongsong; Liu, Xuequn; Qin, Rui; Tang, Xu; Liu, Wenbin; Wang, Yupeng; Zhang, Yangyong; Lee, Jonghoon; Kim, Hyun Hee; Denoeud, France; Xu, Xun; Liang, Xinming; Hua, Wei; Wang, Xiaowu; Wang, Jun; Chalhoub, Boulos; Paterson, Andrew H

    2014-05-23

    Polyploidization has provided much genetic variation for plant adaptive evolution, but the mechanisms by which the molecular evolution of polyploid genomes establishes genetic architecture underlying species differentiation are unclear. Brassica is an ideal model to increase knowledge of polyploid evolution. Here we describe a draft genome sequence of Brassica oleracea, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks, asymmetrical amplification of transposable elements, differential gene co-retention for specific pathways and variation in gene expression, including alternative splicing, among a large number of paralogous and orthologous genes. Genes related to the production of anticancer phytochemicals and morphological variations illustrate consequences of genome duplication and gene divergence, imparting biochemical and morphological variation to B. oleracea. This study provides insights into Brassica genome evolution and will underpin research into the many important crops in this genus.

  16. The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes

    PubMed Central

    Liu, Shengyi; Liu, Yumei; Yang, Xinhua; Tong, Chaobo; Edwards, David; Parkin, Isobel A. P.; Zhao, Meixia; Ma, Jianxin; Yu, Jingyin; Huang, Shunmou; Wang, Xiyin; Wang, Junyi; Lu, Kun; Fang, Zhiyuan; Bancroft, Ian; Yang, Tae-Jin; Hu, Qiong; Wang, Xinfa; Yue, Zhen; Li, Haojie; Yang, Linfeng; Wu, Jian; Zhou, Qing; Wang, Wanxin; King, Graham J; Pires, J. Chris; Lu, Changxin; Wu, Zhangyan; Sampath, Perumal; Wang, Zhuo; Guo, Hui; Pan, Shengkai; Yang, Limei; Min, Jiumeng; Zhang, Dong; Jin, Dianchuan; Li, Wanshun; Belcram, Harry; Tu, Jinxing; Guan, Mei; Qi, Cunkou; Du, Dezhi; Li, Jiana; Jiang, Liangcai; Batley, Jacqueline; Sharpe, Andrew G; Park, Beom-Seok; Ruperao, Pradeep; Cheng, Feng; Waminal, Nomar Espinosa; Huang, Yin; Dong, Caihua; Wang, Li; Li, Jingping; Hu, Zhiyong; Zhuang, Mu; Huang, Yi; Huang, Junyan; Shi, Jiaqin; Mei, Desheng; Liu, Jing; Lee, Tae-Ho; Wang, Jinpeng; Jin, Huizhe; Li, Zaiyun; Li, Xun; Zhang, Jiefu; Xiao, Lu; Zhou, Yongming; Liu, Zhongsong; Liu, Xuequn; Qin, Rui; Tang, Xu; Liu, Wenbin; Wang, Yupeng; Zhang, Yangyong; Lee, Jonghoon; Kim, Hyun Hee; Denoeud, France; Xu, Xun; Liang, Xinming; Hua, Wei; Wang, Xiaowu; Wang, Jun; Chalhoub, Boulos; Paterson, Andrew H

    2014-01-01

    Polyploidization has provided much genetic variation for plant adaptive evolution, but the mechanisms by which the molecular evolution of polyploid genomes establishes genetic architecture underlying species differentiation are unclear. Brassica is an ideal model to increase knowledge of polyploid evolution. Here we describe a draft genome sequence of Brassica oleracea, comparing it with that of its sister species B. rapa to reveal numerous chromosome rearrangements and asymmetrical gene loss in duplicated genomic blocks, asymmetrical amplification of transposable elements, differential gene co-retention for specific pathways and variation in gene expression, including alternative splicing, among a large number of paralogous and orthologous genes. Genes related to the production of anticancer phytochemicals and morphological variations illustrate consequences of genome duplication and gene divergence, imparting biochemical and morphological variation to B. oleracea. This study provides insights into Brassica genome evolution and will underpin research into the many important crops in this genus. PMID:24852848

  17. Genome-wide analysis of HPV integration in human cancers reveals recurrent, focal genomic instability

    PubMed Central

    Akagi, Keiko; Li, Jingfeng; Broutian, Tatevik R.; Padilla-Nash, Hesed; Xiao, Weihong; Jiang, Bo; Rocco, James W.; Teknos, Theodoros N.; Kumar, Bhavna; Wangsa, Danny; He, Dandan; Ried, Thomas; Symer, David E.; Gillison, Maura L.

    2014-01-01

    Genomic instability is a hallmark of human cancers, including the 5% caused by human papillomavirus (HPV). Here we report a striking association between HPV integration and adjacent host genomic structural variation in human cancer cell lines and primary tumors. Whole-genome sequencing revealed HPV integrants flanking and bridging extensive host genomic amplifications and rearrangements, including deletions, inversions, and chromosomal translocations. We present a model of “looping” by which HPV integrant-mediated DNA replication and recombination may result in viral–host DNA concatemers, frequently disrupting genes involved in oncogenesis and amplifying HPV oncogenes E6 and E7. Our high-resolution results shed new light on a catastrophic process, distinct from chromothripsis and other mutational processes, by which HPV directly promotes genomic instability. PMID:24201445

  18. Genomic sequence of 'Candidatus Liberibacter solanacearum' haplotype C and its comparison with haplotype A and B genomes

    PubMed Central

    Haapalainen, Minna; Schott, Thomas; Thompson, Sarah M.; Smith, Grant R.; Nissinen, Anne I.; Pirhonen, Minna

    2017-01-01

    Haplotypes A and B of ‘Candidatus Liberibacter solanacearum’ (CLso) are associated with diseases of solanaceous plants, especially Zebra chip disease of potato, and haplotypes C, D and E are associated with symptoms on apiaceous plants. To date, one complete genome of haplotype B and two high quality draft genomes of haplotype A have been obtained for these unculturable bacteria using metagenomics from the psyllid vector Bactericera cockerelli. Here, we present the first genomic sequences obtained for the carrot-associated CLso. These two genomic sequences of haplotype C, FIN114 (1.24 Mbp) and FIN111 (1.20 Mbp), were obtained from carrot psyllids (Trioza apicalis) harboring CLso. Genomic comparisons between the haplotypes A, B and C revealed that the genome organization differs between these haplotypes, due to large inversions and other recombinations. Comparison of protein-coding genes indicated that the core genome of CLso consists of 885 ortholog groups, with the pan-genome consisting of 1327 ortholog groups. Twenty-seven ortholog groups are unique to CLso haplotype C, whilst 11 ortholog groups shared by the haplotypes A and B, are not found in the haplotype C. Some of these ortholog groups that are not part of the core genome may encode functions related to interactions with the different host plant and psyllid species. PMID:28158295

  19. Genomic analysis reveals selection in Chinese native black pig

    PubMed Central

    Fu, Yuhua; Li, Cencen; Tang, Qianzi; Tian, Shilin; Jin, Long; Chen, Jianhai; Li, Mingzhou; Li, Changchun

    2016-01-01

    Identification of genomic signatures that help reveal mechanisms underlying desirable traits in domesticated pigs is of significant biological, agricultural and medical importance. To identify the genomic footprints left by selection during domestication of the Enshi black pig, a typical native and meat-lard breed in China, we generated about 72-fold coverage of the pig genome using pools of genomic DNA representing three different populations of Enshi black pigs from three different locations. Combining this data with the available whole genomes of 13 Chinese wild boars, we identified 417 protein-coding genes embedded in the selected regions of Enshi black pigs. These genes are mainly involved in developmental and metabolic processes, response to stimulus, and other biological processes. Signatures of selection were detected in genes involved in body size and immunity (RPS10 and VASN), lipid metabolism (GSK3), male fertility (INSL6) and developmental processes (TBX19). These findings provide a window into the potential genetic mechanism underlying development of desirable phenotypes in Enshi black pigs during domestication and subsequent artificial selection. Thus, our results illustrate how domestication has shaped patterns of genetic variation in Enshi black pigs and provide valuable genetic resources that enable effective use of pigs in agricultural production. PMID:27808243

  20. Decelerated genome evolution in modern vertebrates revealed by analysis of multiple lancelet genomes.

    PubMed

    Huang, Shengfeng; Chen, Zelin; Yan, Xinyu; Yu, Ting; Huang, Guangrui; Yan, Qingyu; Pontarotti, Pierre Antoine; Zhao, Hongchen; Li, Jie; Yang, Ping; Wang, Ruihua; Li, Rui; Tao, Xin; Deng, Ting; Wang, Yiquan; Li, Guang; Zhang, Qiujin; Zhou, Sisi; You, Leiming; Yuan, Shaochun; Fu, Yonggui; Wu, Fenfang; Dong, Meiling; Chen, Shangwu; Xu, Anlong

    2014-12-19

    Vertebrates diverged from other chordates ~500 Myr ago and experienced successful innovations and adaptations, but the genomic basis underlying vertebrate origins are not fully understood. Here we suggest, through comparison with multiple lancelet (amphioxus) genomes, that ancient vertebrates experienced high rates of protein evolution, genome rearrangement and domain shuffling and that these rates greatly slowed down after the divergence of jawed and jawless vertebrates. Compared with lancelets, modern vertebrates retain, at least relatively, less protein diversity, fewer nucleotide polymorphisms, domain combinations and conserved non-coding elements (CNE). Modern vertebrates also lost substantial transposable element (TE) diversity, whereas lancelets preserve high TE diversity that includes even the long-sought RAG transposon. Lancelets also exhibit rapid gene turnover, pervasive transcription, fastest exon shuffling in metazoans and substantial TE methylation not observed in other invertebrates. These new lancelet genome sequences provide new insights into the chordate ancestral state and the vertebrate evolution.

  1. Joint assembly and genetic mapping of the Atlantic horseshoe crab genome reveals ancient whole genome duplication

    PubMed Central

    2014-01-01

    Background Horseshoe crabs are marine arthropods with a fossil record extending back approximately 450 million years. They exhibit remarkable morphological stability over their long evolutionary history, retaining a number of ancestral arthropod traits, and are often cited as examples of “living fossils.” As arthropods, they belong to the Ecdysozoa, an ancient super-phylum whose sequenced genomes (including insects and nematodes) have thus far shown more divergence from the ancestral pattern of eumetazoan genome organization than cnidarians, deuterostomes and lophotrochozoans. However, much of ecdysozoan diversity remains unrepresented in comparative genomic analyses. Results Here we apply a new strategy of combined de novo assembly and genetic mapping to examine the chromosome-scale genome organization of the Atlantic horseshoe crab, Limulus polyphemus. We constructed a genetic linkage map of this 2.7 Gbp genome by sequencing the nuclear DNA of 34 wild-collected, full-sibling embryos and their parents at a mean redundancy of 1.1x per sample. The map includes 84,307 sequence markers grouped into 1,876 distinct genetic intervals and 5,775 candidate conserved protein coding genes. Conclusions Comparison with other metazoan genomes shows that the L. polyphemus genome preserves ancestral bilaterian linkage groups, and that a common ancestor of modern horseshoe crabs underwent one or more ancient whole genome duplications 300 million years ago, followed by extensive chromosome fusion. These results provide a counter-example to the often noted correlation between whole genome duplication and evolutionary radiations. The new, low-cost genetic mapping method for obtaining a chromosome-scale view of non-model organism genomes that we demonstrate here does not require laboratory culture, and is potentially applicable to a broad range of other species. PMID:24987520

  2. The Physcomitrella genome reveals evolutionary insights into the conquest of land by plants

    SciTech Connect

    Rensing, Stefan A.; Lang, Daniel; Zimmer, Andreas D.; Terry, Astrid; Salamov, Asaf; Shapiro, Harris; Nishiyama, Tomaoki; Perroud, Pierre-Francois; Lindquist, Erika A.; Kamisugi, Yasuko; Tanahashi, Takako; Sakakibara, Keiko; Fujita, Tomomichi; Oishi, Kazuko; Shin, Tadasu; Kuroki, Yoko; Toyoda, Atsushi; Suzuki, Yutaka; Hashimoto, Shin-ichi; Yamaguchi, Kazuo; Sugano, Sumio; Kohara, Yuji; Fujiyama, Asao; Anterola, Aldwin; Aoki, Setsuyuki; Ashton, Neil; Barbazuk, W. Brad; Barker, Elizabeth; Bennetzen, Jeffrey L.; Blankenship, Robert; Cho, Sung Hyun; Dutcher, Susan K.; Estelle, Mark; Fawcett, Jeffrey A.; Gundlach, Heidrum; Hanada, Kousuke; Melkozernov, Alexander; Murata, Takashi; Nelson, David R.; Pils, Birgit; Prigge, Michael; Reiss, Bernd; Renner, Tanya; Rombauts, Stephane; Rushton, Paul J.; Sanderfoot, Anton; Schween, Gabriele; Shiu, Shin-Han; Stueber, Kurt; Theodoulou, Frederica L.; Tu, Hank; Van de Peer, Yves; Verrier, Paul J.; Waters, Elizabeth; Wood, Andrew; Yang, Lixing; Cove, David; Cuming, Andrew C.; Hasebe, Mitsayasu; Lucas, Susan; Mishler, Brent D.; Reski, Ralf; Grigoriev, Igor V.; Quatrano, Rakph S.; Boore, Jeffrey L.

    2007-09-18

    We report the draft genome sequence of the model moss Physcomitrella patens and compare its features with those of flowering plants, from which it is separated by more than 400 million years, and unicellular aquatic algae. This comparison reveals genomic changes concomitant with the evolutionary movement to land, including a general increase in gene family complexity; loss of genes associated with aquatic environments (e.g., flagellar arms); acquisition of genes for tolerating terrestrial stresses (e.g., variation in temperature and water availability); and the development of the auxin and abscisic acid signaling pathways for coordinating multicellular growth and dehydration response. The Physcomitrella genome provides a resource for phylogenetic inferences about gene function and for experimental analysis of plant processes through this plant's unique facility for reverse genetics.

  3. Analysis of the Mitochondrial Genome in Hypomyces aurantius Reveals a Novel Twintron Complex in Fungi

    PubMed Central

    Deng, Youjin; Zhang, Qihui; Ming, Ray; Lin, Longji; Lin, Xiangzhi; Lin, Yiying; Li, Xiao; Xie, Baogui; Wen, Zhiqiang

    2016-01-01

    Hypomyces aurantius is a mycoparasite that causes cobweb disease, a most serious disease of cultivated mushrooms. Intra-species identification is vital for disease control, however the lack of genomic data makes development of molecular markers challenging. Small size, high copy number, and high mutation rate of fungal mitochondrial genome makes it a good candidate for intra and inter species differentiation. In this study, the mitochondrial genome of H. H.a0001 was determined from genomic DNA using Illumina sequencing. The roughly 72 kb genome shows all major features found in other Hypocreales: 14 common protein genes, large and small subunit rRNAs genes and 27 tRNAs genes. Gene arrangement comparison showed conserved gene orders in Hypocreales mitochondria are relatively conserved, with the exception of Acremonium chrysogenum and Acremonium implicatum. Mitochondrial genome comparison also revealed that intron length primarily contributes to mitogenome size variation. Seventeen introns were detected in six conserved genes: five in cox1, four in rnl, three in cob, two each in atp6 and cox3, and one in cox2. Four introns were found to contain two introns or open reading frames: cox3-i2 is a twintron containing two group IA type introns; cox2-i1 is a group IB intron encoding two homing endonucleases; and cox1-i4 and cox1-i3 both contain two open reading frame (ORFs). Analyses combining secondary intronic structures, insertion sites, and similarities of homing endonuclease genes reveal two group IA introns arranged side by side within cox3-i2. Mitochondrial data for H. aurantius provides the basis for further studies relating to population genetics and species identification. PMID:27376282

  4. [Comparison of mitochondrial genomes of bivalves].

    PubMed

    SONG, Wen-Tao; GAO, Xiang-Gang; LI, Yun-Feng; LIU, Wei-Dong; LIU, Ying; HE, Chong-Bo

    2009-11-01

    The structure and organization of mitochondrial genomes of 14 marine bivalves and two freshwater bivalves were analyzed using comparative genomics and bioinformatics methods. The results showed that the organization and gene order of the mitochondrial genomes of these bivalve species studied were different from each other. The size, organization, gene numbers, and gene order of mitochondrial genomes in bivalves at different taxa were different. Phylogenetic analysis using the whole mitochondrial genomes and all the coding genes showed different results-- phylogenetic analysis conducted using the whole mitochondrial genomes was consistent with the existing classification and phylogenetic analysis conducted using all coding genes not consistent with the existing classification.

  5. Genomes of three tomato pathogens within the Ralstonia solanacearum species complex reveal significant evolutionary divergence

    PubMed Central

    2010-01-01

    Background The Ralstonia solanacearum species complex includes thousands of strains pathogenic to an unusually wide range of plant species. These globally dispersed and heterogeneous strains cause bacterial wilt diseases, which have major socio-economic impacts. Pathogenicity is an ancestral trait in R. solanacearum and strains with high genetic variation can be subdivided into four phylotypes, correlating to isolates from Asia (phylotype I), the Americas (phylotype IIA and IIB), Africa (phylotype III) and Indonesia (phylotype IV). Comparison of genome sequences strains representative of this phylogenetic diversity can help determine which traits allow this bacterium to be such a pathogen of so many different plant species and how the bacteria survive in many different habitats. Results The genomes of three tomato bacterial wilt pathogens, CFBP2957 (phy. IIA), CMR15 (phy. III) and PSI07 (phy. IV) were sequenced and manually annotated. These genomes were compared with those of three previously sequenced R. solanacearum strains: GMI1000 (tomato, phy. I), IPO1609 (potato, phy. IIB), and Molk2 (banana, phy. IIB). The major genomic features (size, G+C content, number of genes) were conserved across all of the six sequenced strains. Despite relatively high genetic distances (calculated from average nucleotide identity) and many genomic rearrangements, more than 60% of the genes of the megaplasmid and 70% of those on the chromosome are syntenic. The three new genomic sequences revealed the presence of several previously unknown traits, probably acquired by horizontal transfers, within the genomes of R. solanacearum, including a type IV secretion system, a rhi-type anti-mitotic toxin and two small plasmids. Genes involved in virulence appear to be evolving at a faster rate than the genome as a whole. Conclusions Comparative analysis of genome sequences and gene content confirmed the differentiation of R. solanacearum species complex strains into four phylotypes. Genetic

  6. Comparative genomics reveals diversity among xanthomonads infecting tomato and pepper

    PubMed Central

    2011-01-01

    Background Bacterial spot of tomato and pepper is caused by four Xanthomonas species and is a major plant disease in warm humid climates. The four species are distinct from each other based on physiological and molecular characteristics. The genome sequence of strain 85-10, a member of one of the species, Xanthomonas euvesicatoria (Xcv) has been previously reported. To determine the relationship of the four species at the genome level and to investigate the molecular basis of their virulence and differing host ranges, draft genomic sequences of members of the other three species were determined and compared to strain 85-10. Results We sequenced the genomes of X. vesicatoria (Xv) strain 1111 (ATCC 35937), X. perforans (Xp) strain 91-118 and X. gardneri (Xg) strain 101 (ATCC 19865). The genomes were compared with each other and with the previously sequenced Xcv strain 85-10. In addition, the molecular features were predicted that may be required for pathogenicity including the type III secretion apparatus, type III effectors, other secretion systems, quorum sensing systems, adhesins, extracellular polysaccharide, and lipopolysaccharide determinants. Several novel type III effectors from Xg strain 101 and Xv strain 1111 genomes were computationally identified and their translocation was validated using a reporter gene assay. A homolog to Ax21, the elicitor of XA21-mediated resistance in rice, and a functional Ax21 sulfation system were identified in Xcv. Genes encoding proteins with functions mediated by type II and type IV secretion systems have also been compared, including enzymes involved in cell wall deconstruction, as contributors to pathogenicity. Conclusions Comparative genomic analyses revealed considerable diversity among bacterial spot pathogens, providing new insights into differences and similarities that may explain the diverse nature of these strains. Genes specific to pepper pathogens, such as the O-antigen of the lipopolysaccharide cluster, and genes

  7. Genomic affinities revealed by GISH suggests intergenomic restructuring between parental genomes of the paleopolyploid genus Zea.

    PubMed

    González, Graciela Esther; Poggio, Lidia

    2015-10-01

    The present work compares the molecular affinities, revealed by GISH, with the analysis of meiotic pairing in intra- and interspecific hybrids between species of Zea obtained in previous works. The joint analysis of these data provided evidence about the evolutionary relationships among the species from the paleopolyploid genus Zea (maize and teosintes). GISH and meiotic pairing of intraspecific hybrids revealed high genomic affinity between maize (Zea mays subsp. mays) and both Zea mays subsp. parviglumis and Zea mays subsp. mexicana. On the other hand, when Zea mays subsp. huehuetenanguensis DNA was probed on maize chromosomes, a lower affinity was detected, and the pattern of hybridization suggested intergenomical restructuring between the parental genomes of maize. When DNA from Zea luxurians was used as probe, homogeneous hybridization signals were observed through all maize chromosomes. Lower genomic affinity was observed when DNA from Zea diploperennis was probed on maize chromosomes, especially at knob regions. Maize chromosomes hybridized with Zea perennis DNA showed hybridization signals on four chromosome pairs: two chromosome pairs presented hybridization signal in only one chromosomal arm, whereas four chromosome pairs did not show any hybridization. These results are in agreement with previous GISH studies, which have identified the genomic source of the chromosomes involved in the meiotic configurations of Z. perennis × maize hybrids. These findings allow postulating that maize has a parental genome not shared with Z. perennis, and the existence of intergenomic restructuring between the parental genomes of maize. Moreover, the absence of hybridization signals in all maize knobs indicate that these heterochromatic regions were lost during the Z. perennis genome evolution.

  8. Comparative genome sequencing reveals genomic signature of extreme desiccation tolerance in the anhydrobiotic midge

    PubMed Central

    Gusev, Oleg; Suetsugu, Yoshitaka; Cornette, Richard; Kawashima, Takeshi; Logacheva, Maria D.; Kondrashov, Alexey S.; Penin, Aleksey A.; Hatanaka, Rie; Kikuta, Shingo; Shimura, Sachiko; Kanamori, Hiroyuki; Katayose, Yuichi; Matsumoto, Takashi; Shagimardanova, Elena; Alexeev, Dmitry; Govorun, Vadim; Wisecaver, Jennifer; Mikheyev, Alexander; Koyanagi, Ryo; Fujie, Manabu; Nishiyama, Tomoaki; Shigenobu, Shuji; Shibata, Tomoko F.; Golygina, Veronika; Hasebe, Mitsuyasu; Okuda, Takashi; Satoh, Nori; Kikawada, Takahiro

    2014-01-01

    Anhydrobiosis represents an extreme example of tolerance adaptation to water loss, where an organism can survive in an ametabolic state until water returns. Here we report the first comparative analysis examining the genomic background of extreme desiccation tolerance, which is exclusively found in larvae of the only anhydrobiotic insect, Polypedilum vanderplanki. We compare the genomes of P. vanderplanki and a congeneric desiccation-sensitive midge P. nubifer. We determine that the genome of the anhydrobiotic species specifically contains clusters of multi-copy genes with products that act as molecular shields. In addition, the genome possesses several groups of genes with high similarity to known protective proteins. However, these genes are located in distinct paralogous clusters in the genome apart from the classical orthologues of the corresponding genes shared by both chironomids and other insects. The transcripts of these clustered paralogues contribute to a large majority of the mRNA pool in the desiccating larvae and most likely define successful anhydrobiosis. Comparison of expression patterns of orthologues between two chironomid species provides evidence for the existence of desiccation-specific gene expression systems in P. vanderplanki. PMID:25216354

  9. Comparative genomics reveals mobile pathogenicity chromosomes in Fusarium

    SciTech Connect

    Ma, Li Jun; van der Does, H. C.; Borkovich, Katherine A.; Coleman, Jeffrey J.; Daboussi, Marie-Jose; Di Pietro, Antonio; Dufresne, Marie; Freitag, Michael; Grabherr, Manfred; Henrissat, Bernard; Houterman, Petra M.; Kang, Seogchan; Shim, Won-Bo; Wolochuk, Charles; Xie, Xiaohui; Xu, Jin Rong; Antoniw, John; Baker, Scott E.; Bluhm, Burton H.; Breakspear, Andrew; Brown, Daren W.; Butchko, Robert A.; Chapman, Sinead; Coulson, Richard; Coutinho, Pedro M.; Danchin, Etienne G.; Diener, Andrew; Gale, Liane R.; Gardiner, Donald; Goff, Steven; Hammond-Kossack, Kim; Hilburn, Karen; Hua-Van, Aurelie; Jonkers, Wilfried; Kazan, Kemal; Kodira, Chinnappa D.; Koehrsen, Michael; Kumar, Lokesh; Lee, Yong Hwan; Li, Liande; Manners, John M.; Miranda-Saavedra, Diego; Mukherjee, Mala; Park, Gyungsoon; Park, Jongsun; Park, Sook Young; Proctor, Robert H.; Regev, Aviv; Ruiz-Roldan, M. C.; Sain, Divya; Sakthikumar, Sharadha; Sykes, Sean; Schwartz, David C.; Turgeon, Barbara G.; Wapinski, Ilan; Yoder, Olen; Young, Sarah; Zeng, Qiandong; Zhou, Shiguo; Galagan, James; Cuomo, Christina A.; Kistler, H. Corby; Rep, Martijn

    2010-03-18

    Fusarium species are among the most important phytopathogenic and toxigenic fungi, having significant impact on crop production and animal health. Distinctively, members of the F. oxysporum species complex exhibit wide host range but discontinuously distributed host specificity, reflecting remarkable genetic adaptability. To understand the molecular underpinnings of diverse phenotypic traits and their evolution in Fusarium, we compared the genomes of three economically important and phylogenetically related, yet phenotypically diverse plant-pathogenic species, F. graminearum, F. verticillioides and F. oxysporum f. sp. lycopersici. Our analysis revealed greatly expanded lineage-specific (LS) genomic regions in F. oxysporum that include four entire chromosomes, accounting for more than one-quarter of the genome. LS regions are rich in transposons and genes with distinct evolutionary profiles but related to pathogenicity. Experimentally, we demonstrate for the first time the transfer of two LS chromosomes between strains of F. oxysporum, resulting in the conversion of a non-pathogenic strain into a pathogen. Transfer of LS chromosomes between otherwise genetically isolated strains explains the polyphyletic origin of host specificity and the emergence of new pathogenic lineages in the F. oxysporum species complex, putting the evolution of fungal pathogenicity into a new perspective.

  10. Mitochondrial Genome Sequences Effectively Reveal the Phylogeny of Hylobates Gibbons

    PubMed Central

    Chan, Yi-Chiao; Roos, Christian; Inoue-Murayama, Miho; Inoue, Eiji; Shih, Chih-Chin; Pei, Kurtis Jai-Chyi; Vigilant, Linda

    2010-01-01

    Background Uniquely among hominoids, gibbons exist as multiple geographically contiguous taxa exhibiting distinctive behavioral, morphological, and karyotypic characteristics. However, our understanding of the evolutionary relationships of the various gibbons, especially among Hylobates species, is still limited because previous studies used limited taxon sampling or short mitochondrial DNA (mtDNA) sequences. Here we use mtDNA genome sequences to reconstruct gibbon phylogenetic relationships and reveal the pattern and timing of divergence events in gibbon evolutionary history. Methodology/Principal Findings We sequenced the mitochondrial genomes of 51 individuals representing 11 species belonging to three genera (Hylobates, Nomascus and Symphalangus) using the high-throughput 454 sequencing system with the parallel tagged sequencing approach. Three phylogenetic analyses (maximum likelihood, Bayesian analysis and neighbor-joining) depicted the gibbon phylogenetic relationships congruently and with strong support values. Most notably, we recover a well-supported phylogeny of the Hylobates gibbons. The estimation of divergence times using Bayesian analysis with relaxed clock model suggests a much more rapid speciation process in Hylobates than in Nomascus. Conclusions/Significance Use of more than 15 kb sequences of the mitochondrial genome provided more informative and robust data than previous studies of short mitochondrial segments (e.g., control region or cytochrome b) as shown by the reliable reconstruction of divergence patterns among Hylobates gibbons. Moreover, molecular dating of the mitogenomic divergence times implied that biogeographic change during the last five million years may be a factor promoting the speciation of Sundaland animals, including Hylobates species. PMID:21203450

  11. Genome-Wide Analysis in Brazilians Reveals Highly Differentiated Native American Genome Regions.

    PubMed

    Mychaleckyj, Josyf C; Havt, Alexandre; Nayak, Uma; Pinkerton, Relana; Farber, Emily; Concannon, Patrick; Lima, Aldo A; Guerrant, Richard L

    2017-03-01

    Despite its population, geographic size, and emerging economic importance, disproportionately little genome-scale research exists into genetic factors that predispose Brazilians to disease, or the population genetics of risk. After identification of suitable proxy populations and careful analysis of tri-continental admixture in 1,538 North-Eastern Brazilians to estimate individual ancestry and ancestral allele frequencies, we computed 400,000 genome-wide locus-specific branch length (LSBL) Fst statistics of Brazilian Amerindian ancestry compared to European and African; and a similar set of differentiation statistics for their Amerindian component compared with the closest Asian 1000 Genomes population (surprisingly, Bengalis in Bangladesh). After ranking SNPs by these statistics, we identified the top 10 highly differentiated SNPs in five genome regions in the LSBL tests of Brazilian Amerindian ancestry compared to European and African; and the top 10 SNPs in eight regions comparing their Amerindian component to the closest Asian 1000 Genomes population. We found SNPs within or proximal to the genes CIITA (rs6498115), SMC6 (rs1834619), and KLHL29 (rs2288697) were most differentiated in the Amerindian-specific branch, while SNPs in the genes ADAMTS9 (rs7631391), DOCK2 (rs77594147), SLC28A1 (rs28649017), ARHGAP5 (rs7151991), and CIITA (rs45601437) were most highly differentiated in the Asian comparison. These genes are known to influence immune function, metabolic and anthropometry traits, and embryonic development. These analyses have identified candidate genes for selection within Amerindian ancestry, and by comparison of the two analyses, those for which the differentiation may have arisen during the migration from Asia to the Americas.

  12. Genome-scale co-expression network comparison across Escherichia coli and Salmonella enterica serovar Typhimurium reveals significant conservation at the regulon level of local regulators despite their dissimilar lifestyles.

    PubMed

    Zarrineh, Peyman; Sánchez-Rodríguez, Aminael; Hosseinkhan, Nazanin; Narimani, Zahra; Marchal, Kathleen; Masoudi-Nejad, Ali

    2014-01-01

    Availability of genome-wide gene expression datasets provides the opportunity to study gene expression across different organisms under a plethora of experimental conditions. In our previous work, we developed an algorithm called COMODO (COnserved MODules across Organisms) that identifies conserved expression modules between two species. In the present study, we expanded COMODO to detect the co-expression conservation across three organisms by adapting the statistics behind it. We applied COMODO to study expression conservation/divergence between Escherichia coli, Salmonella enterica, and Bacillus subtilis. We observed that some parts of the regulatory interaction networks were conserved between E. coli and S. enterica especially in the regulon of local regulators. However, such conservation was not observed between the regulatory interaction networks of B. subtilis and the two other species. We found co-expression conservation on a number of genes involved in quorum sensing, but almost no conservation for genes involved in pathogenicity across E. coli and S. enterica which could partially explain their different lifestyles. We concluded that despite their different lifestyles, no significant rewiring have occurred at the level of local regulons involved for instance, and notable conservation can be detected in signaling pathways and stress sensing in the phylogenetically close species S. enterica and E. coli. Moreover, conservation of local regulons seems to depend on the evolutionary time of divergence across species disappearing at larger distances as shown by the comparison with B. subtilis. Global regulons follow a different trend and show major rewiring even at the limited evolutionary distance that separates E. coli and S. enterica.

  13. Genome-Scale Co-Expression Network Comparison across Escherichia coli and Salmonella enterica Serovar Typhimurium Reveals Significant Conservation at the Regulon Level of Local Regulators Despite Their Dissimilar Lifestyles

    PubMed Central

    Zarrineh, Peyman; Sánchez-Rodríguez, Aminael; Hosseinkhan, Nazanin; Narimani, Zahra; Marchal, Kathleen; Masoudi-Nejad, Ali

    2014-01-01

    Availability of genome-wide gene expression datasets provides the opportunity to study gene expression across different organisms under a plethora of experimental conditions. In our previous work, we developed an algorithm called COMODO (COnserved MODules across Organisms) that identifies conserved expression modules between two species. In the present study, we expanded COMODO to detect the co-expression conservation across three organisms by adapting the statistics behind it. We applied COMODO to study expression conservation/divergence between Escherichia coli, Salmonella enterica, and Bacillus subtilis. We observed that some parts of the regulatory interaction networks were conserved between E. coli and S. enterica especially in the regulon of local regulators. However, such conservation was not observed between the regulatory interaction networks of B. subtilis and the two other species. We found co-expression conservation on a number of genes involved in quorum sensing, but almost no conservation for genes involved in pathogenicity across E. coli and S. enterica which could partially explain their different lifestyles. We concluded that despite their different lifestyles, no significant rewiring have occurred at the level of local regulons involved for instance, and notable conservation can be detected in signaling pathways and stress sensing in the phylogenetically close species S. enterica and E. coli. Moreover, conservation of local regulons seems to depend on the evolutionary time of divergence across species disappearing at larger distances as shown by the comparison with B. subtilis. Global regulons follow a different trend and show major rewiring even at the limited evolutionary distance that separates E. coli and S. enterica. PMID:25101984

  14. Comparative genomic analysis reveals a distant liver enhancer upstream of the COUP-TFII gene

    SciTech Connect

    Baroukh, Nadine; Ahituv, Nadav; Chang, Jessie; Shoukry, Malak; Afzal, Veena; Rubin, Edward M.; Pennacchio, Len A.

    2004-08-20

    COUP-TFII is a central nuclear hormone receptor that tightly regulates the expression of numerous target lipid metabolism genes in vertebrates. However, it remains unclear how COUP-TFII itself is transcriptionally controlled since studies with its promoter and upstream region fail to recapitulate the genes liver expression. In an attempt to identify liver enhancers in the vicinity of COUP-TFII, we employed a comparative genomic approach. Initial comparisons between humans and mice of the 3,470kb gene poor region surrounding COUP-TFII revealed 2,023 conserved non-coding elements. To prioritize a subset of these elements for functional studies, we performed further genomic comparisons with the orthologous pufferfish (Fugu rubripes) locus and uncovered two anciently conserved non-coding sequences (CNS) upstream of COUP-TFII (CNS-62kb and CNS-66kb). Testing these two elements using reporter constructs in liver (HepG2) cells revealed that CNS-66kb, but not CNS-62kb, yielded robust in vitro enhancer activity. In addition, an in vivo reporter assay using naked DNA transfer with CNS-66kb linked to luciferase displayed strong reproducible liver expression in adult mice, further supporting its role as a liver enhancer. Together, these studies further support the utility of comparative genomics to uncover gene regulatory sequences based on evolutionary conservation and provide the substrates to better understand the regulation and expression of COUP-TFII.

  15. Insights into the genome evolution of Yersinia pestis through whole genome comparison with Yersinia pseudotuberculosis

    SciTech Connect

    Souza, B; Stoutland, P; Derbise, A; Georgescu, A; Elliott, J; Land, M; Marceau, M; Motin, V; Hinnebusch, J; Simonet, M; Medigue, C; Dacheux, D; Chenal-Francisque, V; Regala, W; Brubaker, R R; Carniel, E; Chain, P; Verguez, L; Fowler, J; Garcia, E; Lamerdin, J; Hauser, L; Larimer, F

    2004-01-24

    Yersinia pestis, the causative agent of plague, is a highly uniform clone that diverged recently from the enteric pathogen Yersinia pseudotuberculosis. Despite their close genetic relationship, they differ radically in their pathogenicity and transmission. Here we report the complete genomic sequence of Y. pseudotuberculosis IP32953 and its use for detailed genome comparisons to available Y. pestis sequences. Analyses of identified differences across a panel of Yersinia isolates from around the world reveals 32 Y. pestis chromosomal genes that, together with the two Y. pestis-specific plasmids, represent the only new genetic material in Y. pestis acquired since the divergence from Y. pseudotuberculosis. In contrast, 149 new pseudogenes (doubling the previous estimate) and 317 genes absent from Y. pestis were detected, indicating that as many as 13% of Y. pseudotuberculosis genes no longer function in Y. pestis. Extensive IS-mediated genome rearrangements and reductive evolution through massive gene loss, resulting in elimination and modification of pre-existing gene expression pathways appear to be more important than acquisition of new genes in the evolution of Y. pestis. These results provide a sobering example of how a highly virulent epidemic clone can suddenly emerge from a less virulent, closely related progenitor.

  16. Population-based 3D genome structure analysis reveals driving forces in spatial genome organization

    PubMed Central

    Li, Wenyuan; Kalhor, Reza; Dai, Chao; Hao, Shengli; Gong, Ke; Zhou, Yonggang; Li, Haochen; Zhou, Xianghong Jasmine; Le Gros, Mark A.; Larabell, Carolyn A.; Chen, Lin; Alber, Frank

    2016-01-01

    Conformation capture technologies (e.g., Hi-C) chart physical interactions between chromatin regions on a genome-wide scale. However, the structural variability of the genome between cells poses a great challenge to interpreting ensemble-averaged Hi-C data, particularly for long-range and interchromosomal interactions. Here, we present a probabilistic approach for deconvoluting Hi-C data into a model population of distinct diploid 3D genome structures, which facilitates the detection of chromatin interactions likely to co-occur in individual cells. Our approach incorporates the stochastic nature of chromosome conformations and allows a detailed analysis of alternative chromatin structure states. For example, we predict and experimentally confirm the presence of large centromere clusters with distinct chromosome compositions varying between individual cells. The stability of these clusters varies greatly with their chromosome identities. We show that these chromosome-specific clusters can play a key role in the overall chromosome positioning in the nucleus and stabilizing specific chromatin interactions. By explicitly considering genome structural variability, our population-based method provides an important tool for revealing novel insights into the key factors shaping the spatial genome organization. PMID:26951677

  17. New study reveals relatively few mutations in AML genomes - TCGA

    Cancer.gov

    Investigators for The Cancer Genome Atlas (TCGA) Research Network have detailed and broadly classified the genomic alterations that frequently underlie the development of acute myeloid leukemia (AML).

  18. The Methanosarcina barkeri genome: comparative analysis withMethanosarcina acetivorans and Methanosarcina mazei reveals extensiverearrangement within methanosarcinal genomes

    SciTech Connect

    Maeder, Dennis L.; Anderson, Iain; Brettin, Thomas S.; Bruce,David C.; Gilna, Paul; Han, Cliff S.; Lapidus, Alla; Metcalf, William W.; Saunders, Elizabeth; Tapia, Roxanne; Sowers, Kevin R.

    2006-05-19

    We report here a comparative analysis of the genome sequence of Methanosarcina barkeri with those of Methanosarcina acetivorans and Methanosarcina mazei. All three genomes share a conserved double origin of replication and many gene clusters. M. barkeri is distinguished by having an organization that is well conserved with respect to the other Methanosarcinae in the region proximal to the origin of replication with interspecies gene similarities as high as 95%. However it is disordered and marked by increased transposase frequency and decreased gene synteny and gene density in the proximal semi-genome. Of the 3680 open reading frames in M. barkeri, 678 had paralogs with better than 80% similarity to both M. acetivorans and M. mazei while 128 nonhypothetical orfs were unique (non-paralogous) amongst these species including a complete formate dehydrogenase operon, two genes required for N-acetylmuramic acid synthesis, a 14 gene gas vesicle cluster and a bacterial P450-specific ferredoxin reductase cluster not previously observed or characterized in this genus. A cryptic 36 kbp plasmid sequence was detected in M. barkeri that contains an orc1 gene flanked by a presumptive origin of replication consisting of 38 tandem repeats of a 143 nt motif. Three-way comparison of these genomes reveals differing mechanisms for the accrual of changes. Elongation of the large M. acetivorans is the result of multiple gene-scale insertions and duplications uniformly distributed in that genome, while M. barkeri is characterized by localized inversions associated with the loss of gene content. In contrast, the relatively short M. mazei most closely approximates the ancestral organizational state.

  19. Single-Cell (Meta-)Genomics of a Dimorphic Candidatus Thiomargarita nelsonii Reveals Genomic Plasticity

    PubMed Central

    Flood, Beverly E.; Fliss, Palmer; Jones, Daniel S.; Dick, Gregory J.; Jain, Sunit; Kaster, Anne-Kristin; Winkel, Matthias; Mußmann, Marc; Bailey, Jake

    2016-01-01

    The genus Thiomargarita includes the world's largest bacteria. But as uncultured organisms, their physiology, metabolism, and basis for their gigantism are not well understood. Thus, a genomics approach, applied to a single Candidatus Thiomargarita nelsonii cell was employed to explore the genetic potential of one of these enigmatic giant bacteria. The Thiomargarita cell was obtained from an assemblage of budding Ca. T. nelsonii attached to a provannid gastropod shell from Hydrate Ridge, a methane seep offshore of Oregon, USA. Here we present a manually curated genome of Bud S10 resulting from a hybrid assembly of long Pacific Biosciences and short Illumina sequencing reads. With respect to inorganic carbon fixation and sulfur oxidation pathways, the Ca. T. nelsonii Hydrate Ridge Bud S10 genome was similar to marine sister taxa within the family Beggiatoaceae. However, the Bud S10 genome contains genes suggestive of the genetic potential for lithotrophic growth on arsenite and perhaps hydrogen. The genome also revealed that Bud S10 likely respires nitrate via two pathways: a complete denitrification pathway and a dissimilatory nitrate reduction to ammonia pathway. Both pathways have been predicted, but not previously fully elucidated, in the genomes of other large, vacuolated, sulfur-oxidizing bacteria. Surprisingly, the genome also had a high number of unusual features for a bacterium to include the largest number of metacaspases and introns ever reported in a bacterium. Also present, are a large number of other mobile genetic elements, such as insertion sequence (IS) transposable elements and miniature inverted-repeat transposable elements (MITEs). In some cases, mobile genetic elements disrupted key genes in metabolic pathways. For example, a MITE interrupts hupL, which encodes the large subunit of the hydrogenase in hydrogen oxidation. Moreover, we detected a group I intron in one of the most critical genes in the sulfur oxidation pathway, dsrA. The dsrA group

  20. Comparative Genomics Analyses Reveal Extensive Chromosome Colinearity and Novel Quantitative Trait Loci in Eucalyptus.

    PubMed

    Li, Fagen; Zhou, Changpin; Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming

    2015-01-01

    Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10-56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa.

  1. Genomic analysis of primordial dwarfism reveals novel disease genes.

    PubMed

    Shaheen, Ranad; Faqeih, Eissa; Ansari, Shinu; Abdel-Salam, Ghada; Al-Hassnan, Zuhair N; Al-Shidi, Tarfa; Alomar, Rana; Sogaty, Sameera; Alkuraya, Fowzan S

    2014-02-01

    Primordial dwarfism (PD) is a disease in which severely impaired fetal growth persists throughout postnatal development and results in stunted adult size. The condition is highly heterogeneous clinically, but the use of certain phenotypic aspects such as head circumference and facial appearance has proven helpful in defining clinical subgroups. In this study, we present the results of clinical and genomic characterization of 16 new patients in whom a broad definition of PD was used (e.g., 3M syndrome was included). We report a novel PD syndrome with distinct facies in two unrelated patients, each with a different homozygous truncating mutation in CRIPT. Our analysis also reveals, in addition to mutations in known PD disease genes, the first instance of biallelic truncating BRCA2 mutation causing PD with normal bone marrow analysis. In addition, we have identified a novel locus for Seckel syndrome based on a consanguineous multiplex family and identified a homozygous truncating mutation in DNA2 as the likely cause. An additional novel PD disease candidate gene XRCC4 was identified by autozygome/exome analysis, and the knockout mouse phenotype is highly compatible with PD. Thus, we add a number of novel genes to the growing list of PD-linked genes, including one which we show to be linked to a novel PD syndrome with a distinct facial appearance. PD is extremely heterogeneous genetically and clinically, and genomic tools are often required to reach a molecular diagnosis.

  2. Algal genomes reveal evolutionary mosaicism and the fate of nucleomorphs

    SciTech Connect

    Curtis, Bruce A.; Tanifuji, Goro; Burki, Fabien; Gruber, Ansgar; Irimia, Manuuel; Maruyama, Shinichiro; Arias, Maria C.; Ball, Steven G.; Gile, Gillian H.; Hirakawa, Yoshihisa; Hopkins, Julia F.; Kuo, Alan; Rensing, Stefan A.; Schmutz, Jeremy; Symeonidi, Aikaterini; Elias, Marek; Eveleigh, Robert J. M.; Herman, Emily K.; Klute, Mary J.; Nakayama, Takuro; Obornik, Miroslav; Reyes-Prieto, Adrian; Armbrust, E. Virginia; Aves, Stephen J.; Beiko, Robert G.; Coutinho, Pedro; Dacks, Joel B.; Durnford, Dion G.; Fast, Naomi M.; Green, Beverley R.; Grisdale, Cameron J.; Hempel, Franziska; Henrissat, Bernard; Hoppner, Marc P.; Ishida, Ken-Ichiro; Kim, Eunsoo; Koreny, Ludek; Kroth, Peter G.; Liu, Yuan; Malik, Shehre-Banoo; Maier, Uwe G.; McRose, Darcy; Mock, Thomas; Neilson, Jonathan A. D.; Onodera, Naoko T.; Poole, Anthony M.; Pritham, Ellen J.; Richards, Thomas A.; Rocap, Gabrielle; Roy, Scott W.; Sarai, Chihiro; Schaack, Sarah; Shirato, Shu; Slamovits, Claudio H.; Spencer, Davie F.; Suzuki, Shigekatsu; Worden, Alexandra Z.; Zauner, Stefan; Barry, Kerrie; Bell, Callum; Bharti, Arvind K.; Crow, John A.; Grimwood, Jane; Kramer, Robin; Lindquist, Erika; Lucas, Susan; Salamov, Asaf; McFadden, Geoffrey I.; Lane, Christopher E.; Keeling, Patrick J.; Gray, Michael W.; Grigoriev, Igor V.; Archibald, John M.

    2012-08-10

    Cryptophyte and chlorarachniophyte algae are transitional forms in the widespread secondary endosymbiotic acquisition of photosynthesis by engulfment of eukaryotic algae. Unlike most secondary plastid-bearing algae, miniaturized versions of the endosymbiont nuclei (nucleomorphs) persist in cryptophytes and chlorarachniophytes. To determine why, and to address other fundamental questions about eukaryote eukaryote endosymbiosis, we sequenced the nuclear genomes of the cryptophyte Guillardia theta and the chlorarachniophyte Bigelowiella natans. Both genomes have 21,000 protein genes and are intron rich, and B. natans exhibits unprecedented alternative splicing for a single-celled organism. Phylogenomic analyses and subcellular targeting predictions reveal extensive genetic and biochemical mosaicism, with both host- and endosymbiont-derived genes servicing the mitochondrion, the host cell cytosol, the plastid and the remnant endosymbiont cytosol of both algae. Mitochondrion-to-nucleus gene transfer still occurs in both organisms but plastid-to-nucleus and nucleomorph-to-nucleus transfers do not, which explains why a small residue of essential genes remains locked in each nucleomorph.

  3. Comparative Genomic Analysis Reveals Ecological Differentiation in the Genus Carnobacterium

    PubMed Central

    Iskandar, Christelle F.; Borges, Frédéric; Taminiau, Bernard; Daube, Georges; Zagorec, Monique; Remenant, Benoît; Leisner, Jørgen J.; Hansen, Martin A.; Sørensen, Søren J.; Mangavel, Cécile; Cailliez-Grimal, Catherine; Revol-Junelles, Anne-Marie

    2017-01-01

    Lactic acid bacteria (LAB) differ in their ability to colonize food and animal-associated habitats: while some species are specialized and colonize a limited number of habitats, other are generalist and are able to colonize multiple animal-linked habitats. In the current study, Carnobacterium was used as a model genus to elucidate the genetic basis of these colonization differences. Analyses of 16S rRNA gene meta-barcoding data showed that C. maltaromaticum followed by C. divergens are the most prevalent species in foods derived from animals (meat, fish, dairy products), and in the gut. According to phylogenetic analyses, these two animal-adapted species belong to one of two deeply branched lineages. The second lineage contains species isolated from habitats where contact with animal is rare. Genome analyses revealed that members of the animal-adapted lineage harbor a larger secretome than members of the other lineage. The predicted cell-surface proteome is highly diversified in C. maltaromaticum and C. divergens with genes involved in adaptation to the animal milieu such as those encoding biopolymer hydrolytic enzymes, a heme uptake system, and biopolymer-binding adhesins. These species also exhibit genes for gut adaptation and respiration. In contrast, Carnobacterium species belonging to the second lineage encode a poorly diversified cell-surface proteome, lack genes for gut adaptation and are unable to respire. These results shed light on the important genomics traits required for adaptation to animal-linked habitats in generalist Carnobacterium. PMID:28337181

  4. Genome Alignment Spanning Major Poaceae Lineages Reveals Heterogeneous Evolutionary Rates and Alters Inferred Dates for Key Evolutionary Events.

    PubMed

    Wang, Xiyin; Wang, Jingpeng; Jin, Dianchuan; Guo, Hui; Lee, Tae-Ho; Liu, Tao; Paterson, Andrew H

    2015-06-01

    Multiple comparisons among genomes can clarify their evolution, speciation, and functional innovations. To date, the genome sequences of eight grasses representing the most economically important Poaceae (grass) clades have been published, and their genomic-level comparison is an essential foundation for evolutionary, functional, and translational research. Using a formal and conservative approach, we aligned these genomes. Direct comparison of paralogous gene pairs all duplicated simultaneously reveal striking variation in evolutionary rates among whole genomes, with nucleotide substitution slowest in rice and up to 48% faster in other grasses, adding a new dimension to the value of rice as a grass model. We reconstructed ancestral genome contents for major evolutionary nodes, potentially contributing to understanding the divergence and speciation of grasses. Recent fossil evidence suggests revisions of the estimated dates of key evolutionary events, implying that the pan-grass polyploidization occurred ∼96 million years ago and could not be related to the Cretaceous-Tertiary mass extinction as previously inferred. Adjusted dating to reflect both updated fossil evidence and lineage-specific evolutionary rates suggested that maize subgenome divergence and maize-sorghum divergence were virtually simultaneous, a coincidence that would be explained if polyploidization directly contributed to speciation. This work lays a solid foundation for Poaceae translational genomics.

  5. Upper Palaeolithic Siberian genome reveals dual ancestry of Native Americans.

    PubMed

    Raghavan, Maanasa; Skoglund, Pontus; Graf, Kelly E; Metspalu, Mait; Albrechtsen, Anders; Moltke, Ida; Rasmussen, Simon; Stafford, Thomas W; Orlando, Ludovic; Metspalu, Ene; Karmin, Monika; Tambets, Kristiina; Rootsi, Siiri; Mägi, Reedik; Campos, Paula F; Balanovska, Elena; Balanovsky, Oleg; Khusnutdinova, Elza; Litvinov, Sergey; Osipova, Ludmila P; Fedorova, Sardana A; Voevoda, Mikhail I; DeGiorgio, Michael; Sicheritz-Ponten, Thomas; Brunak, Søren; Demeshchenko, Svetlana; Kivisild, Toomas; Villems, Richard; Nielsen, Rasmus; Jakobsson, Mattias; Willerslev, Eske

    2014-01-02

    The origins of the First Americans remain contentious. Although Native Americans seem to be genetically most closely related to east Asians, there is no consensus with regard to which specific Old World populations they are closest to. Here we sequence the draft genome of an approximately 24,000-year-old individual (MA-1), from Mal'ta in south-central Siberia, to an average depth of 1×. To our knowledge this is the oldest anatomically modern human genome reported to date. The MA-1 mitochondrial genome belongs to haplogroup U, which has also been found at high frequency among Upper Palaeolithic and Mesolithic European hunter-gatherers, and the Y chromosome of MA-1 is basal to modern-day western Eurasians and near the root of most Native American lineages. Similarly, we find autosomal evidence that MA-1 is basal to modern-day western Eurasians and genetically closely related to modern-day Native Americans, with no close affinity to east Asians. This suggests that populations related to contemporary western Eurasians had a more north-easterly distribution 24,000 years ago than commonly thought. Furthermore, we estimate that 14 to 38% of Native American ancestry may originate through gene flow from this ancient population. This is likely to have occurred after the divergence of Native American ancestors from east Asian ancestors, but before the diversification of Native American populations in the New World. Gene flow from the MA-1 lineage into Native American ancestors could explain why several crania from the First Americans have been reported as bearing morphological characteristics that do not resemble those of east Asians. Sequencing of another south-central Siberian, Afontova Gora-2 dating to approximately 17,000 years ago, revealed similar autosomal genetic signatures as MA-1, suggesting that the region was continuously occupied by humans throughout the Last Glacial Maximum. Our findings reveal that western Eurasian genetic signatures in modern-day Native

  6. Genome Sequence of Thermofilum pendens Reveals an Exceptional Loss of Biosynthetic Pathways without Genome Reduction

    SciTech Connect

    Anderson, Iain; Rodriquez, Jason; Susanti, Dwi; Porat, I.; Reich, Claudia; Ulrich, Luke; Elkins, James G; Mavromatis, K; Lykidis, A; Kim, Edwin; Thompson, Linda S; Nolan, Matt; Land, Miriam L; Copeland, A; Lapidus, Alla L.; Lucas, Susan; Detter, J C; Zhulin, Igor B; Olsen, Gary; Whitman, W. B.; Mukhopadhyay, Biswarup; Bristow, James; Kyrpides, Nikos C

    2008-01-01

    We report the complete genome of Thermofilum pendens, a deep-branching member of class Thermoproteales of Crenarchaeota. T. pendens is a sulfur-dependent, anaerobic heterotroph isolated from a solfatara in Iceland. It was known to utilize peptides as an energy source, but the genome reveals substantial ability to grow on carbohydrates. T. pendens is the first Crenarchaeote and only the second archaeon found to have transporters of the phosphotransferase system. T. pendens is known to require an extract of Thermoproteus tenax for growth, and the genome sequence reveals that biosynthetic pathways for purines, most amino acids, and most cofactors are absent. T. pendens has fewer biosynthetic enzymes than any other free-living organism. In addition to heterotrophy, T. pendens may gain energy from sulfur reduction with hydrogen and formate as electron donors. It may also be capable of sulfur-independent growth on formate with formate hydrogenlyase. Additional novel features are the presence of a monomethylamine:corrinoid methyltransferase, the first time this enzyme has been found outside of Methanosarcinales, and a presenilin-related protein from a new subfamily. Predicted highly expressed proteins include ABC transporters for carbohydrates and peptides, and CRISPR-associated proteins, suggesting that defense against viruses is a high priority.

  7. Genome sequence of Thermofilum pendens reveals an exceptional loss of biosynthetic pathways without genome reduction

    SciTech Connect

    Kyrpides, Nikos; Anderson, Iain; Rodriguez, Jason; Susanti, Dwi; Porat, Iris; Reich, Claudia; Ulrich, Luke E.; Elkins, James G.; Mavromatis, Kostas; Lykidis, Athanasios; Kim, Edwin; Thompson, Linda S.; Nolan, Matt; Land, Miriam; Copeland, Alex; Lapidus, Alla; Lucas, Susan; Detter, Chris; Zhulin, Igor B.; Olsen, Gary J.; Whitman, William; Mukhopadhyay, Biswarup; Bristow, James; Kyrpides, Nikos

    2008-01-01

    We report the complete genome of Thermofilum pendens, a deep-branching, hyperthermophilic member of the order Thermoproteales within the archaeal kingdom Crenarchaeota. T. pendens is a sulfur-dependent, anaerobic heterotroph isolated from a solfatara in Iceland. It is an extracellular commensal, requiring an extract of Thermoproteus tenax for growth, and the genome sequence reveals that biosynthetic pathways for purines, most amino acids, and most cofactors are absent. In fact T. pendens has fewer biosynthetic enzymes than obligate intracellular parasites, although it does not display other features common among obligate parasites and thus does not appear to be in the process of becoming a parasite. It appears that T. pendens has adapted to life in an environment rich in nutrients. T. pendens was known to utilize peptides as an energy source, but the genome reveals substantial ability to grow on carbohydrates. T. pendens is the first crenarchaeote and only the second archaeon found to have a transporter of the phosphotransferase system. In addition to fermentation, T. pendens may gain energy from sulfur reduction with hydrogen and formate as electron donors. It may also be capable of sulfur-independent growth on formate with formate hydrogenlyase. Additional novel features are the presence of a monomethylamine:corrinoid methyltransferase, the first time this enzyme has been found outside of Methanosarcinales, and a presenilin-related protein. Predicted highly expressed proteins do not include housekeeping genes, and instead include ABC transporters for carbohydrates and peptides, and CRISPR-associated proteins.

  8. Revealing effective classifiers through network comparison

    NASA Astrophysics Data System (ADS)

    Gallos, Lazaros K.; Fefferman, Nina H.

    2014-11-01

    The ability to compare complex systems can provide new insight into the fundamental nature of the processes captured, in ways that are otherwise inaccessible to observation. Here, we introduce the n-tangle method to directly compare two networks for structural similarity, based on the distribution of edge density in network subgraphs. We demonstrate that this method can efficiently introduce comparative analysis into network science and opens the road for many new applications. For example, we show how the construction of a “phylogenetic tree” across animal taxa according to their social structure can reveal commonalities in the behavioral ecology of the populations, or how students create similar networks according to the University size. Our method can be expanded to study many additional properties, such as network classification, changes during time evolution, convergence of growth models, and detection of structural changes during damage.

  9. Identification of Sesame Genomic Variations from Genome Comparison of Landrace and Variety

    PubMed Central

    Wei, Xin; Zhu, Xiaodong; Yu, Jingyin; Wang, Linhai; Zhang, Yanxin; Li, Donghua; Zhou, Rong; Zhang, Xiurong

    2016-01-01

    Sesame (Sesamum indicum L.) is one of the main oilseed crops, providing vegetable oil and protein to human. Landrace is the gene source of variety, carrying many desire alleles for genetic improvement. Despite the importance of sesame landrace, genome of sesame landrace remains unexplored and genomic variations between landrace and variety still is not clear. To identify the genomic variations between sesame landrace and variety, two representative sesame landrace accessions, “Baizhima” and “Mishuozhima,” were selected and re-sequenced. The genome sequencing and de novo assembling of the two sesame landraces resulted in draft genomes of 267 Mb and 254 Mb, respectively, with the contig N50 more than 47 kb. Totally, 1,332,025 SNPs and 506,245 InDels were identified from the genome of “Baizhima” and “Mishuozhima” by comparison of the genome of a variety “Zhongzhi13.” Among the genomic variations, 70,018 SNPs and 8311 InDels were located in the coding regions of genes. Genomic variations may contribute to variation of sesame agronomic traits such as flowering time, plant height, and oil content. The identified genomic variations were successfully used in the QTL mapping and the black pigment synthesis gene, PPO, was found to be the candidate gene of sesame seed coat color. The comprehensively compared genomes of sesame landrace and modern variety produced massive useful genomic information, constituting a powerful tool to support genetic research, and molecular breeding of sesame. PMID:27536315

  10. Identification of Sesame Genomic Variations from Genome Comparison of Landrace and Variety.

    PubMed

    Wei, Xin; Zhu, Xiaodong; Yu, Jingyin; Wang, Linhai; Zhang, Yanxin; Li, Donghua; Zhou, Rong; Zhang, Xiurong

    2016-01-01

    Sesame (Sesamum indicum L.) is one of the main oilseed crops, providing vegetable oil and protein to human. Landrace is the gene source of variety, carrying many desire alleles for genetic improvement. Despite the importance of sesame landrace, genome of sesame landrace remains unexplored and genomic variations between landrace and variety still is not clear. To identify the genomic variations between sesame landrace and variety, two representative sesame landrace accessions, "Baizhima" and "Mishuozhima," were selected and re-sequenced. The genome sequencing and de novo assembling of the two sesame landraces resulted in draft genomes of 267 Mb and 254 Mb, respectively, with the contig N50 more than 47 kb. Totally, 1,332,025 SNPs and 506,245 InDels were identified from the genome of "Baizhima" and "Mishuozhima" by comparison of the genome of a variety "Zhongzhi13." Among the genomic variations, 70,018 SNPs and 8311 InDels were located in the coding regions of genes. Genomic variations may contribute to variation of sesame agronomic traits such as flowering time, plant height, and oil content. The identified genomic variations were successfully used in the QTL mapping and the black pigment synthesis gene, PPO, was found to be the candidate gene of sesame seed coat color. The comprehensively compared genomes of sesame landrace and modern variety produced massive useful genomic information, constituting a powerful tool to support genetic research, and molecular breeding of sesame.

  11. Genomic Analysis of the Basal Lineage Fungus Rhizopus oryzae Reveals a Whole-Genome Duplication

    PubMed Central

    Ma, Li-Jun; Ibrahim, Ashraf S.; Skory, Christopher; Grabherr, Manfred G.; Burger, Gertraud; Butler, Margi; Elias, Marek; Idnurm, Alexander; Lang, B. Franz; Sone, Teruo; Abe, Ayumi; Calvo, Sarah E.; Corrochano, Luis M.; Engels, Reinhard; Fu, Jianmin; Hansberg, Wilhelm; Kim, Jung-Mi; Kodira, Chinnappa D.; Koehrsen, Michael J.; Liu, Bo; Miranda-Saavedra, Diego; O'Leary, Sinead; Ortiz-Castellanos, Lucila; Poulter, Russell; Rodriguez-Romero, Julio; Ruiz-Herrera, José; Shen, Yao-Qing; Zeng, Qiandong; Galagan, James; Birren, Bruce W.

    2009-01-01

    Rhizopus oryzae is the primary cause of mucormycosis, an emerging, life-threatening infection characterized by rapid angioinvasive growth with an overall mortality rate that exceeds 50%. As a representative of the paraphyletic basal group of the fungal kingdom called “zygomycetes,” R. oryzae is also used as a model to study fungal evolution. Here we report the genome sequence of R. oryzae strain 99–880, isolated from a fatal case of mucormycosis. The highly repetitive 45.3 Mb genome assembly contains abundant transposable elements (TEs), comprising approximately 20% of the genome. We predicted 13,895 protein-coding genes not overlapping TEs, many of which are paralogous gene pairs. The order and genomic arrangement of the duplicated gene pairs and their common phylogenetic origin provide evidence for an ancestral whole-genome duplication (WGD) event. The WGD resulted in the duplication of nearly all subunits of the protein complexes associated with respiratory electron transport chains, the V-ATPase, and the ubiquitin–proteasome systems. The WGD, together with recent gene duplications, resulted in the expansion of multiple gene families related to cell growth and signal transduction, as well as secreted aspartic protease and subtilase protein families, which are known fungal virulence factors. The duplication of the ergosterol biosynthetic pathway, especially the major azole target, lanosterol 14α-demethylase (ERG11), could contribute to the variable responses of R. oryzae to different azole drugs, including voriconazole and posaconazole. Expanded families of cell-wall synthesis enzymes, essential for fungal cell integrity but absent in mammalian hosts, reveal potential targets for novel and R. oryzae-specific diagnostic and therapeutic treatments. PMID:19578406

  12. High-throughput genomic profiling of adult solid tumors reveals novel insights into cancer pathogenesis.

    PubMed

    Hartmaier, Ryan J; Albacker, Lee; Chmielecki, Juliann; Bailey, Mark; He, Jie; Goldberg, Michael; Ramkissoon, Shakti; Suh, James; Elvin, Julia A; Chiacchia, Samuel; Frampton, Garrett M; Ross, Jeffrey S; Miller, Vincent; Stephens, Philip J; Lipson, Doron

    2017-02-24

    Genomic profiling is widely predicted to become a standard of care in clinical oncology, but more effective data sharing to accelerate progress in precision medicine will be required. Here we describe cancer-associated genomic profiles from 18,004 unique adult cancers. The dataset was composed of 162 tumor subtypes including multiple rare and uncommon tumors. Comparison of alteration frequencies to The Cancer Genome Atlas (TCGA) identified some differences and suggested an enrichment of treatment-refractory samples in breast and lung cancer cohorts. To illustrate novelty within the dataset, we surveyed the genomic landscape of rare diseases and identified an increased frequency of NOTCH1 alterations in adenoid cystic carcinomas compared to previous studies. Analysis of tumor suppressor gene patterns revealed disease specificity for certain genes but broad inactivation of others. We identified multiple potentially druggable, novel and known kinase fusions in diseases beyond those in which they are currently recognized. Analysis of variants of unknown significance identified an enrichment of SMAD4 alterations in colon cancer and other rare alterations predicted to have functional impact. Analysis of established, clinically relevant alterations highlighted the spectrum of molecular changes for which testing is currently recommended, as well as opportunities for expansion of indications for use of approved targeted therapies. Overall, this dataset presents a new resource with which to investigate rare alterations and diseases, validate clinical relevance, and identify novel therapeutic targets.

  13. Comparative genomic analysis of Lactobacillus plantarum ZJ316 reveals its genetic adaptation and potential probiotic profiles* #

    PubMed Central

    Li, Ping; Li, Xuan; Gu, Qing; Lou, Xiu-yu; Zhang, Xiao-mei; Song, Da-feng; Zhang, Chen

    2016-01-01

    Objective: In previous studies, Lactobacillus plantarum ZJ316 showed probiotic properties, such as antimicrobial activity against various pathogens and the capacity to significantly improve pig growth and pork quality. The purpose of this study was to reveal the genes potentially related to its genetic adaptation and probiotic profiles based on comparative genomic analysis. Methods: The genome sequence of L. plantarum ZJ316 was compared with those of eight L. plantarum strains deposited in GenBank. BLASTN, Mauve, and MUMmer programs were used for genome alignment and comparison. CRISPRFinder was applied for searching the clustered regularly interspaced short palindromic repeats (CRISPRs). Results: We identified genes that encode proteins related to genetic adaptation and probiotic profiles, including carbohydrate transport and metabolism, proteolytic enzyme systems and amino acid biosynthesis, CRISPR adaptive immunity, stress responses, bile salt resistance, ability to adhere to the host intestinal wall, exopolysaccharide (EPS) biosynthesis, and bacteriocin biosynthesis. Conclusions: Comparative characterization of the L. plantarum ZJ316 genome provided the genetic basis for further elucidating the functional mechanisms of its probiotic properties. ZJ316 could be considered a potential probiotic candidate. PMID:27487802

  14. Comparative Genomics Reveals Insight into Virulence Strategies of Plant Pathogenic Oomycetes

    PubMed Central

    Adhikari, Bishwo N.; Hamilton, John P.; Zerillo, Marcelo M.; Tisserat, Ned; Lévesque, C. André; Buell, C. Robin

    2013-01-01

    The kingdom Stramenopile includes diatoms, brown algae, and oomycetes. Plant pathogenic oomycetes, including Phytophthora, Pythium and downy mildew species, cause devastating diseases on a wide range of host species and have a significant impact on agriculture. Here, we report comparative analyses on the genomes of thirteen straminipilous species, including eleven plant pathogenic oomycetes, to explore common features linked to their pathogenic lifestyle. We report the sequencing, assembly, and annotation of six Pythium genomes and comparison with other stramenopiles including photosynthetic diatoms, and other plant pathogenic oomycetes such as Phytophthora species, Hyaloperonospora arabidopsidis, and Pythium ultimum var. ultimum. Novel features of the oomycete genomes include an expansion of genes encoding secreted effectors and plant cell wall degrading enzymes in Phytophthora species and an over-representation of genes involved in proteolytic degradation and signal transduction in Pythium species. A complete lack of classical RxLR effectors was observed in the seven surveyed Pythium genomes along with an overall reduction of pathogenesis-related gene families in H. arabidopsidis. Comparative analyses revealed fewer genes encoding enzymes involved in carbohydrate metabolism in Pythium species and H. arabidopsidis as compared to Phytophthora species, suggesting variation in virulence mechanisms within plant pathogenic oomycete species. Shared features between the oomycetes and diatoms revealed common mechanisms of intracellular signaling and transportation. Our analyses demonstrate the value of comparative genome analyses for exploring the evolution of pathogenesis and survival mechanisms in the oomycetes. The comparative analyses of seven Pythium species with the closely related oomycetes, Phytophthora species and H. arabidopsidis, and distantly related diatoms provide insight into genes that underlie virulence. PMID:24124466

  15. Genome sequence of the necrotrophic plant pathogen Pythium ultimum reveals original pathogenicity mechanisms and effector repertoire

    PubMed Central

    2010-01-01

    Background Pythium ultimum is a ubiquitous oomycete plant pathogen responsible for a variety of diseases on a broad range of crop and ornamental species. Results The P. ultimum genome (42.8 Mb) encodes 15,290 genes and has extensive sequence similarity and synteny with related Phytophthora species, including the potato blight pathogen Phytophthora infestans. Whole transcriptome sequencing revealed expression of 86% of genes, with detectable differential expression of suites of genes under abiotic stress and in the presence of a host. The predicted proteome includes a large repertoire of proteins involved in plant pathogen interactions, although, surprisingly, the P. ultimum genome does not encode any classical RXLR effectors and relatively few Crinkler genes in comparison to related phytopathogenic oomycetes. A lower number of enzymes involved in carbohydrate metabolism were present compared to Phytophthora species, with the notable absence of cutinases, suggesting a significant difference in virulence mechanisms between P. ultimum and more host-specific oomycete species. Although we observed a high degree of orthology with Phytophthora genomes, there were novel features of the P. ultimum proteome, including an expansion of genes involved in proteolysis and genes unique to Pythium. We identified a small gene family of cadherins, proteins involved in cell adhesion, the first report of these in a genome outside the metazoans. Conclusions Access to the P. ultimum genome has revealed not only core pathogenic mechanisms within the oomycetes but also lineage-specific genes associated with the alternative virulence and lifestyles found within the pythiaceous lineages compared to the Peronosporaceae. PMID:20626842

  16. Comparative genomics reveals insight into virulence strategies of plant pathogenic oomycetes.

    PubMed

    Adhikari, Bishwo N; Hamilton, John P; Zerillo, Marcelo M; Tisserat, Ned; Lévesque, C André; Buell, C Robin

    2013-01-01

    The kingdom Stramenopile includes diatoms, brown algae, and oomycetes. Plant pathogenic oomycetes, including Phytophthora, Pythium and downy mildew species, cause devastating diseases on a wide range of host species and have a significant impact on agriculture. Here, we report comparative analyses on the genomes of thirteen straminipilous species, including eleven plant pathogenic oomycetes, to explore common features linked to their pathogenic lifestyle. We report the sequencing, assembly, and annotation of six Pythium genomes and comparison with other stramenopiles including photosynthetic diatoms, and other plant pathogenic oomycetes such as Phytophthora species, Hyaloperonospora arabidopsidis, and Pythium ultimum var. ultimum. Novel features of the oomycete genomes include an expansion of genes encoding secreted effectors and plant cell wall degrading enzymes in Phytophthora species and an over-representation of genes involved in proteolytic degradation and signal transduction in Pythium species. A complete lack of classical RxLR effectors was observed in the seven surveyed Pythium genomes along with an overall reduction of pathogenesis-related gene families in H. arabidopsidis. Comparative analyses revealed fewer genes encoding enzymes involved in carbohydrate metabolism in Pythium species and H. arabidopsidis as compared to Phytophthora species, suggesting variation in virulence mechanisms within plant pathogenic oomycete species. Shared features between the oomycetes and diatoms revealed common mechanisms of intracellular signaling and transportation. Our analyses demonstrate the value of comparative genome analyses for exploring the evolution of pathogenesis and survival mechanisms in the oomycetes. The comparative analyses of seven Pythium species with the closely related oomycetes, Phytophthora species and H. arabidopsidis, and distantly related diatoms provide insight into genes that underlie virulence.

  17. Comparative genomics reveals evidence of marine adaptation in Salinispora species

    PubMed Central

    2012-01-01

    Background Actinobacteria represent a consistent component of most marine bacterial communities yet little is known about the mechanisms by which these Gram-positive bacteria adapt to life in the marine environment. Here we employed a phylogenomic approach to identify marine adaptation genes in marine Actinobacteria. The focus was on the obligate marine actinomycete genus Salinispora and the identification of marine adaptation genes that have been acquired from other marine bacteria. Results Functional annotation, comparative genomics, and evidence of a shared evolutionary history with bacteria from hyperosmotic environments were used to identify a pool of more than 50 marine adaptation genes. An Actinobacterial species tree was used to infer the likelihood of gene gain or loss in accounting for the distribution of each gene. Acquired marine adaptation genes were associated with electron transport, sodium and ABC transporters, and channels and pores. In addition, the loss of a mechanosensitive channel gene appears to have played a major role in the inability of Salinispora strains to grow following transfer to low osmotic strength media. Conclusions The marine Actinobacteria for which genome sequences are available are broadly distributed throughout the Actinobacterial phylogenetic tree and closely related to non-marine forms suggesting they have been independently introduced relatively recently into the marine environment. It appears that the acquisition of transporters in Salinispora spp. represents a major marine adaptation while gene loss is proposed to play a role in the inability of this genus to survive outside of the marine environment. This study reveals fundamental differences between marine adaptations in Gram-positive and Gram-negative bacteria and no common genetic basis for marine adaptation among the Actinobacteria analyzed. PMID:22401625

  18. Genomic View of Bipolar Disorder Revealed by Whole Genome Sequencing in a Genetic Isolate

    PubMed Central

    Georgi, Benjamin; Craig, David; Kember, Rachel L.; Liu, Wencheng; Lindquist, Ingrid; Nasser, Sara; Brown, Christopher; Egeland, Janice A.; Paul, Steven M.; Bućan, Maja

    2014-01-01

    Bipolar disorder is a common, heritable mental illness characterized by recurrent episodes of mania and depression. Despite considerable effort to elucidate the genetic underpinnings of bipolar disorder, causative genetic risk factors remain elusive. We conducted a comprehensive genomic analysis of bipolar disorder in a large Old Order Amish pedigree. Microsatellite genotypes and high-density SNP-array genotypes of 388 family members were combined with whole genome sequence data for 50 of these subjects, comprising 18 parent-child trios. This study design permitted evaluation of candidate variants within the context of haplotype structure by resolving the phase in sequenced parent-child trios and by imputation of variants into multiple unsequenced siblings. Non-parametric and parametric linkage analysis of the entire pedigree as well as on smaller clusters of families identified several nominally significant linkage peaks, each of which included dozens of predicted deleterious variants. Close inspection of exonic and regulatory variants in genes under the linkage peaks using family-based association tests revealed additional credible candidate genes for functional studies and further replication in population-based cohorts. However, despite the in-depth genomic characterization of this unique, large and multigenerational pedigree from a genetic isolate, there was no convergence of evidence implicating a particular set of risk loci or common pathways. The striking haplotype and locus heterogeneity we observed has profound implications for the design of studies of bipolar and other related disorders. PMID:24625924

  19. Genomic comparison of Kingella kingae strains.

    PubMed

    Fournier, Pierre-Edouard; Rouli, Laetitia; El Karkouri, Khalid; Nguyen, Thi-Tien; Yagupsky, Pablo; Raoult, Didier

    2012-11-01

    Kingella kingae is a betaproteobacterium from the order Neisseriales, and it is an agent of invasive infections in children. We sequenced the genome from the septic arthritis strain 11220434. It is composed of a 1,990,794-bp chromosome but no plasmid, and it contains 2,042 protein-coding genes and 52 RNA genes, including 3 rRNA genes.

  20. Replication Study: Melanoma genome sequencing reveals frequent PREX2 mutations

    PubMed Central

    Horrigan, Stephen K; Courville, Pascal; Sampey, Darryl; Zhou, Faren; Cai, Steve

    2017-01-01

    In 2015, as part of the Reproducibility Project: Cancer Biology, we published a Registered Report (Chroscinski et al., 2014) that described how we intended to replicate selected experiments from the paper "Melanoma genome sequencing reveals frequent PREX2 mutations" (Berger et al., 2012). Here we report the results of those experiments. We regenerated cells stably expressing ectopic wild-type and mutant phosphatidylinositol-3,4,5-trisphosphate-dependent Rac exchange factor 2 (PREX2) using the same immortalized human NRASG12D melanocytes as the original study. Evaluation of PREX2 expression in these newly generated stable cells revealed varying levels of expression among the PREX2 isoforms, which was also observed in the stable cells made in the original study (Figure S6A; Berger et al., 2012). Additionally, ectopically expressed PREX2 was found to be at least 5 times above endogenous PREX2 expression. The monitoring of tumor formation of these stable cells in vivo resulted in no statistically significant difference in tumor-free survival driven by PREX2 variants, whereas the original study reported that these PREX2 mutations increased the rate of tumor incidence compared to controls (Figure 3B and S6B; Berger et al., 2012). Surprisingly, the median tumor-free survival was 1 week in this replication attempt, while 70% of the control mice were reported to be tumor-free after 9 weeks in the original study. The rapid tumor onset observed in this replication attempt, compared to the original study, makes the detection of accelerated tumor growth in PREX2 expressing NRASG12D melanocytes extremely difficult. Finally, we report meta-analyses for each result. DOI: http://dx.doi.org/10.7554/eLife.21634.001 PMID:28100394

  1. Genomic profiling reveals mutational landscape in parathyroid carcinomas

    PubMed Central

    Bellizzi, Justin; Lau, Chun Yee; Moe, Aye S.; Strahl, Maya; Newman, Leah C.; Fink, Marc Y.; Antipin, Yevgeniy; Yu, Willie; Stevenson, Mark; Cavaco, Branca M.; Thakker, Rajesh V.; Morreau, Hans; Schadt, Eric E.; Sebra, Robert; Li, Shuyu D.

    2017-01-01

    Parathyroid carcinoma (PC) is an extremely rare malignancy lacking effective therapeutic intervention. We generated and analyzed whole-exome sequencing data from 17 patients to identify somatic and germline genetic alterations. A panel of selected genes was sequenced in a 7-tumor expansion cohort. We show that 47% (8 of 17) of the tumors harbor somatic mutations in the CDC73 tumor suppressor, with germline inactivating variants in 4 of the 8 patients. The PI3K/AKT/mTOR pathway was altered in 21% of the 24 cases, revealing a major oncogenic pathway in PC. We observed CCND1 amplification in 29% of the 17 patients, and a previously unreported recurrent mutation in putative kinase ADCK1. We identified the first sporadic PCs with somatic mutations in the Wnt canonical pathway, complementing previously described epigenetic mechanisms mediating Wnt activation. This is the largest genomic sequencing study of PC, and represents major progress toward a full molecular characterization of this rare malignancy to inform improved and individualized treatments. PMID:28352668

  2. The Complete Chloroplast Genome of Wild Rice (Oryza minuta) and Its Comparison to Related Species

    PubMed Central

    Asaf, Sajjad; Waqas, Muhammad; Khan, Abdul L.; Khan, Muhammad A.; Kang, Sang-Mo; Imran, Qari M.; Shahzad, Raheem; Bilal, Saqib; Yun, Byung-Wook; Lee, In-Jung

    2017-01-01

    Oryza minuta, a tetraploid wild relative of cultivated rice (family Poaceae), possesses a BBCC genome and contains genes that confer resistance to bacterial blight (BB) and white-backed (WBPH) and brown (BPH) plant hoppers. Based on the importance of this wild species, this study aimed to understand the phylogenetic relationships of O. minuta with other Oryza species through an in-depth analysis of the composition and diversity of the chloroplast (cp) genome. The analysis revealed a cp genome size of 135,094 bp with a typical quadripartite structure and consisting of a pair of inverted repeats separated by small and large single copies, 139 representative genes, and 419 randomly distributed microsatellites. The genomic organization, gene order, GC content and codon usage are similar to those of typical angiosperm cp genomes. Approximately 30 forward, 28 tandem and 20 palindromic repeats were detected in the O. minuta cp genome. Comparison of the complete O. minuta cp genome with another eleven Oryza species showed a high degree of sequence similarity and relatively high divergence of intergenic spacers. Phylogenetic analyses were conducted based on the complete genome sequence, 65 shared genes and matK gene showed same topologies and O. minuta forms a single clade with parental O. punctata. Thus, the complete O. minuta cp genome provides interesting insights and valuable information that can be used to identify related species and reconstruct its phylogeny. PMID:28326093

  3. Genomic Comparison of Kingella kingae Strains

    PubMed Central

    Rouli, Laetitia; El Karkouri, Khalid; Nguyen, Thi-Tien; Yagupsky, Pablo; Raoult, Didier

    2012-01-01

    Kingella kingae is a betaproteobacterium from the order Neisseriales, and it is an agent of invasive infections in children. We sequenced the genome from the septic arthritis strain 11220434. It is composed of a 1,990,794-bp chromosome but no plasmid, and it contains 2,042 protein-coding genes and 52 RNA genes, including 3 rRNA genes. PMID:23045489

  4. Complete genomes reveal signatures of demographic and genetic declines in the woolly mammoth

    PubMed Central

    Palkopoulou, Eleftheria; Mallick, Swapan; Skoglund, Pontus; Enk, Jacob; Rohland, Nadin; Li, Heng; Omrak, Ayça; Vartanyan, Sergey; Poinar, Hendrik; Götherström, Anders; Reich, David; Dalén, Love

    2015-01-01

    Summary The processes leading up to species extinctions are typically characterized by prolonged declines in population size and geographic distribution, followed by a phase in which populations are very small and may be subject to intrinsic threats, including loss of genetic diversity and inbreeding [1]. However, whether such genetic factors have had an impact on species prior to their extinction is unclear [2, 3]; examining this would require a detailed reconstruction of a species’ demographic history as well as changes in genome-wide diversity leading up to its extinction. Here, we present high-quality complete genome sequences from two woolly mammoths (Mammuthus primigenius). The first mammoth was sequenced at 17.1-fold coverage, and dates to ~4,300 years before present, constituting one of the last surviving individuals on Wrangel Island. The second mammoth, sequenced at 11.2-fold coverage, was obtained from a ~44,800 year old specimen from the Late Pleistocene population in northeastern Siberia. The demographic trajectories inferred from the two genomes are qualitatively similar and reveal a population bottleneck during the Middle or Early Pleistocene, and a more recent severe decline in the ancestors of the Wrangel mammoth at the end of the last glaciation. A comparison of the two genomes shows that the Wrangel mammoth has a 20% reduction in heterozygosity as well as a 28-fold increase in the fraction of the genome that is comprised of runs of homozygosity. We conclude that the population on Wrangel Island, which was the last surviving woolly mammoth population, was subject to reduced genetic diversity shortly before it became extinct. PMID:25913407

  5. Complete genomes reveal signatures of demographic and genetic declines in the woolly mammoth.

    PubMed

    Palkopoulou, Eleftheria; Mallick, Swapan; Skoglund, Pontus; Enk, Jacob; Rohland, Nadin; Li, Heng; Omrak, Ayça; Vartanyan, Sergey; Poinar, Hendrik; Götherström, Anders; Reich, David; Dalén, Love

    2015-05-18

    The processes leading up to species extinctions are typically characterized by prolonged declines in population size and geographic distribution, followed by a phase in which populations are very small and may be subject to intrinsic threats, including loss of genetic diversity and inbreeding. However, whether such genetic factors have had an impact on species prior to their extinction is unclear; examining this would require a detailed reconstruction of a species' demographic history as well as changes in genome-wide diversity leading up to its extinction. Here, we present high-quality complete genome sequences from two woolly mammoths (Mammuthus primigenius). The first mammoth was sequenced at 17.1-fold coverage and dates to ∼4,300 years before present, representing one of the last surviving individuals on Wrangel Island. The second mammoth, sequenced at 11.2-fold coverage, was obtained from an ∼44,800-year-old specimen from the Late Pleistocene population in northeastern Siberia. The demographic trajectories inferred from the two genomes are qualitatively similar and reveal a population bottleneck during the Middle or Early Pleistocene, and a more recent severe decline in the ancestors of the Wrangel mammoth at the end of the last glaciation. A comparison of the two genomes shows that the Wrangel mammoth has a 20% reduction in heterozygosity as well as a 28-fold increase in the fraction of the genome that comprises runs of homozygosity. We conclude that the population on Wrangel Island, which was the last surviving woolly mammoth population, was subject to reduced genetic diversity shortly before it became extinct.

  6. Analysis of virus genomes from glacial environments reveals novel virus groups with unusual host interactions.

    PubMed

    Bellas, Christopher M; Anesio, Alexandre M; Barker, Gary

    2015-01-01

    Microbial communities in glacial ecosystems are diverse, active, and subjected to strong viral pressures and infection rates. In this study we analyse putative virus genomes assembled from three dsDNA viromes from cryoconite hole ecosystems of Svalbard and the Greenland Ice Sheet to assess the potential hosts and functional role viruses play in these habitats. We assembled 208 million reads from the virus-size fraction and developed a procedure to select genuine virus scaffolds from cellular contamination. Our curated virus library contained 546 scaffolds up to 230 Kb in length, 54 of which were circular virus consensus genomes. Analysis of virus marker genes revealed a wide range of viruses had been assembled, including bacteriophages, cyanophages, nucleocytoplasmic large DNA viruses and a virophage, with putative hosts identified as Cyanobacteria, Alphaproteobacteria, Gammaproteobacteria, Actinobacteria, Firmicutes, eukaryotic algae and amoebae. Whole genome comparisons revealed the majority of circular genome scaffolds (CGS) formed 12 novel groups, two of which contained multiple phage members with plasmid-like properties, including a group of phage-plasmids possessing plasmid-like partition genes and toxin-antitoxin addiction modules to ensure their replication and a satellite phage-plasmid group. Surprisingly we also assembled a phage that not only encoded plasmid partition genes, but a clustered regularly interspaced short palindromic repeat (CRISPR)/Cas adaptive bacterial immune system. One of the spacers was an exact match for another phage in our virome, indicating that in a novel use of the system, the lysogen was potentially capable of conferring immunity on its bacterial host against other phage. Together these results suggest that highly novel and diverse groups of viruses are present in glacial environments, some of which utilize very unusual life strategies and genes to control their replication and maintain a long-term relationship with their hosts.

  7. Analysis of virus genomes from glacial environments reveals novel virus groups with unusual host interactions

    PubMed Central

    Bellas, Christopher M.; Anesio, Alexandre M.; Barker, Gary

    2015-01-01

    Microbial communities in glacial ecosystems are diverse, active, and subjected to strong viral pressures and infection rates. In this study we analyse putative virus genomes assembled from three dsDNA viromes from cryoconite hole ecosystems of Svalbard and the Greenland Ice Sheet to assess the potential hosts and functional role viruses play in these habitats. We assembled 208 million reads from the virus-size fraction and developed a procedure to select genuine virus scaffolds from cellular contamination. Our curated virus library contained 546 scaffolds up to 230 Kb in length, 54 of which were circular virus consensus genomes. Analysis of virus marker genes revealed a wide range of viruses had been assembled, including bacteriophages, cyanophages, nucleocytoplasmic large DNA viruses and a virophage, with putative hosts identified as Cyanobacteria, Alphaproteobacteria, Gammaproteobacteria, Actinobacteria, Firmicutes, eukaryotic algae and amoebae. Whole genome comparisons revealed the majority of circular genome scaffolds (CGS) formed 12 novel groups, two of which contained multiple phage members with plasmid-like properties, including a group of phage-plasmids possessing plasmid-like partition genes and toxin-antitoxin addiction modules to ensure their replication and a satellite phage-plasmid group. Surprisingly we also assembled a phage that not only encoded plasmid partition genes, but a clustered regularly interspaced short palindromic repeat (CRISPR)/Cas adaptive bacterial immune system. One of the spacers was an exact match for another phage in our virome, indicating that in a novel use of the system, the lysogen was potentially capable of conferring immunity on its bacterial host against other phage. Together these results suggest that highly novel and diverse groups of viruses are present in glacial environments, some of which utilize very unusual life strategies and genes to control their replication and maintain a long-term relationship with their hosts

  8. Genome Sequencing of the Phytoseiid Predatory Mite Metaseiulus occidentalis Reveals Completely Atomized Hox Genes and Superdynamic Intron Evolution.

    PubMed

    Hoy, Marjorie A; Waterhouse, Robert M; Wu, Ke; Estep, Alden S; Ioannidis, Panagiotis; Palmer, William J; Pomerantz, Aaron F; Simão, Felipe A; Thomas, Jainy; Jiggins, Francis M; Murphy, Terence D; Pritham, Ellen J; Robertson, Hugh M; Zdobnov, Evgeny M; Gibbs, Richard A; Richards, Stephen

    2016-06-27

    Metaseiulus occidentalis is an eyeless phytoseiid predatory mite employed for the biological control of agricultural pests including spider mites. Despite appearances, these predator and prey mites are separated by some 400 Myr of evolution and radically different lifestyles. We present a 152-Mb draft assembly of the M. occidentalis genome: Larger than that of its favored prey, Tetranychus urticae, but considerably smaller than those of many other chelicerates, enabling an extremely contiguous and complete assembly to be built-the best arachnid to date. Aided by transcriptome data, genome annotation cataloged 18,338 protein-coding genes and identified large numbers of Helitron transposable elements. Comparisons with other arthropods revealed a particularly dynamic and turbulent genomic evolutionary history. Its genes exhibit elevated molecular evolution, with strikingly high numbers of intron gains and losses, in stark contrast to the deer tick Ixodes scapularis Uniquely among examined arthropods, this predatory mite's Hox genes are completely atomized, dispersed across the genome, and it encodes five copies of the normally single-copy RNA processing Dicer-2 gene. Examining gene families linked to characteristic biological traits of this tiny predator provides initial insights into processes of sex determination, development, immune defense, and how it detects, disables, and digests its prey. As the first reference genome for the Phytoseiidae, and for any species with the rare sex determination system of parahaploidy, the genome of the western orchard predatory mite improves genomic sampling of chelicerates and provides invaluable new resources for functional genomic analyses of this family of agriculturally important mites.

  9. The Capsaspora genome reveals a complex unicellular prehistory of animals.

    PubMed

    Suga, Hiroshi; Chen, Zehua; de Mendoza, Alex; Sebé-Pedrós, Arnau; Brown, Matthew W; Kramer, Eric; Carr, Martin; Kerner, Pierre; Vervoort, Michel; Sánchez-Pons, Núria; Torruella, Guifré; Derelle, Romain; Manning, Gerard; Lang, B Franz; Russ, Carsten; Haas, Brian J; Roger, Andrew J; Nusbaum, Chad; Ruiz-Trillo, Iñaki

    2013-01-01

    To reconstruct the evolutionary origin of multicellular animals from their unicellular ancestors, the genome sequences of diverse unicellular relatives are essential. However, only the genome of the choanoflagellate Monosiga brevicollis has been reported to date. Here we completely sequence the genome of the filasterean Capsaspora owczarzaki, the closest known unicellular relative of metazoans besides choanoflagellates. Analyses of this genome alter our understanding of the molecular complexity of metazoans' unicellular ancestors showing that they had a richer repertoire of proteins involved in cell adhesion and transcriptional regulation than previously inferred only with the choanoflagellate genome. Some of these proteins were secondarily lost in choanoflagellates. In contrast, most intercellular signalling systems controlling development evolved later concomitant with the emergence of the first metazoans. We propose that the acquisition of these metazoan-specific developmental systems and the co-option of pre-existing genes drove the evolutionary transition from unicellular protists to metazoans.

  10. The Capsaspora genome reveals a complex unicellular prehistory of animals

    PubMed Central

    Suga, Hiroshi; Chen, Zehua; de Mendoza, Alex; Sebé-Pedrós, Arnau; Brown, Matthew W.; Kramer, Eric; Carr, Martin; Kerner, Pierre; Vervoort, Michel; Sánchez-Pons, Núria; Torruella, Guifré; Derelle, Romain; Manning, Gerard; Lang, B. Franz; Russ, Carsten; Haas, Brian J.; Roger, Andrew J.; Nusbaum, Chad; Ruiz-Trillo, Iñaki

    2013-01-01

    To reconstruct the evolutionary origin of multicellular animals from their unicellular ancestors, the genome sequences of diverse unicellular relatives are essential. However, only the genome of the choanoflagellate Monosiga brevicollis has been reported to date. Here we completely sequence the genome of the filasterean Capsaspora owczarzaki, the closest known unicellular relative of metazoans besides choanoflagellates. Analyses of this genome alter our understanding of the molecular complexity of metazoans’ unicellular ancestors showing that they had a richer repertoire of proteins involved in cell adhesion and transcriptional regulation than previously inferred only with the choanoflagellate genome. Some of these proteins were secondarily lost in choanoflagellates. In contrast, most intercellular signalling systems controlling development evolved later concomitant with the emergence of the first metazoans. We propose that the acquisition of these metazoan-specific developmental systems and the co-option of pre-existing genes drove the evolutionary transition from unicellular protists to metazoans. PMID:23942320

  11. Proteomics Reveals Open Reading Frames in Mycobacterium tuberculosis H37Rv Not Predicted by Genomics

    PubMed Central

    Jungblut, Peter R.; Müller, Eva-Christina; Mattow, Jens; Kaufmann, Stefan H. E.

    2001-01-01

    Genomics revealed the sequence of 3924 genes of the H37Rv strain of Mycobacterium tuberculosis. Proteomics complements genomics in showing which genes are really expressed, and here we show the expression of six genes not predicted by genomics, as proved by two-dimensional electrophoresis and matrix-assisted laser desorption ionization and nano-electrospray mass spectrometry. PMID:11500470

  12. The mitochondrial genomes of Amphiascoides atopus and Schizopera knabeni (Harpacticoida: Miraciidae) reveal similarities between the copepod orders Harpacticoida and Poecilostomatoida.

    PubMed

    Easton, Erin E; Darrow, Emily M; Spears, Trisha; Thistle, David

    2014-03-15

    Members of subclass Copepoda are abundant, diverse, and-as a result of their variety of ecological roles in marine and freshwater environments-important, but their phylogenetic interrelationships are unclear. Recent studies of arthropods have used gene arrangements in the mitochondrial (mt) genome to infer phylogenies, but for copepods, only seven complete mt genomes have been published. These data revealed several within-order and few among-order similarities. To increase the data available for comparisons, we sequenced the complete mt genome (13,831base pairs) of Amphiascoides atopus and 10,649base pairs of the mt genome of Schizopera knabeni (both in the family Miraciidae of the order Harpacticoida). Comparison of our data to those for Tigriopus japonicus (family Harpacticidae, order Harpacticoida) revealed similarities in gene arrangement among these three species that were consistent with those found within and among families of other copepod orders. Comparison of the mt genomes of our species with those known from other copepod orders revealed the arrangement of mt genes of our Harpacticoida species to be more similar to that of Sinergasilus polycolpus (order Poecilostomatoida) than to that of T. japonicus. The similarities between S. polycolpus and our species are the first to be noted across the boundaries of copepod orders and support the possibility that mt-gene arrangement might be used to infer copepod phylogenies. We also found that our two species had extremely truncated transfer RNAs and that gene overlaps occurred much more frequently than has been reported for other copepod mt genomes.

  13. Genome Sequencing Reveals the Origin of the Allotetraploid Arabidopsis suecica.

    PubMed

    Novikova, Polina Yu; Tsuchimatsu, Takashi; Simon, Samson; Nizhynska, Viktoria; Voronin, Viktor; Burns, Robin; Fedorenko, Olga M; Holm, Svante; Säll, Torbjörn; Prat, Elisa; Marande, William; Castric, Vincent; Nordborg, Magnus

    2017-04-01

    Polyploidy is an example of instantaneous speciation when it involves the formation of a new cytotype that is incompatible with the parental species. Because new polyploid individuals are likely to be rare, establishment of a new species is unlikely unless polyploids are able to reproduce through self-fertilization (selfing), or asexually. Conversely, selfing (or asexuality) makes it possible for polyploid species to originate from a single individual-a bona fide speciation event. The extent to which this happens is not known. Here, we consider the origin of Arabidopsis suecica, a selfing allopolyploid between Arabidopsis thaliana and Arabidopsis arenosa, which has hitherto been considered to be an example of a unique origin. Based on whole-genome re-sequencing of 15 natural A. suecica accessions, we identify ubiquitous shared polymorphism with the parental species, and hence conclusively reject a unique origin in favor of multiple founding individuals. We further estimate that the species originated after the last glacial maximum in Eastern Europe or central Eurasia (rather than Sweden, as the name might suggest). Finally, annotation of the self-incompatibility loci in A. suecica revealed that both loci carry non-functional alleles. The locus inherited from the selfing A. thaliana is fixed for an ancestral non-functional allele, whereas the locus inherited from the outcrossing A. arenosa is fixed for a novel loss-of-function allele. Furthermore, the allele inherited from A. thaliana is predicted to transcriptionally silence the allele inherited from A. arenosa, suggesting that loss of self-incompatibility may have been instantaneous.

  14. Mitochondrial Genome Analysis Reveals Historical Lineages in Yellowstone Bison.

    PubMed

    Forgacs, David; Wallen, Rick L; Dobson, Lauren K; Derr, James N

    2016-01-01

    Yellowstone National Park is home to one of the only plains bison populations that have continuously existed on their present landscape since prehistoric times without evidence of domestic cattle introgression. Previous studies characterized the relatively high levels of nuclear genetic diversity in these bison, but little is known about their mitochondrial haplotype diversity. This study assessed mitochondrial genomes from 25 randomly selected Yellowstone bison and found 10 different mitochondrial haplotypes with a haplotype diversity of 0.78 (± 0.06). Spatial analysis of these mitochondrial DNA (mtDNA) haplotypes did not detect geographic population subdivision (FST = -0.06, p = 0.76). However, we identified two independent and historically important lineages in Yellowstone bison by combining data from 65 bison (defined by 120 polymorphic sites) from across North America representing a total of 30 different mitochondrial DNA haplotypes. Mitochondrial DNA haplotypes from one of the Yellowstone lineages represent descendants of the 22 indigenous bison remaining in central Yellowstone in 1902. The other mitochondrial DNA lineage represents descendants of the 18 females introduced from northern Montana in 1902 to supplement the indigenous bison population and develop a new breeding herd in the northern region of the park. Comparing modern and historical mitochondrial DNA diversity in Yellowstone bison helps uncover a historical context of park restoration efforts during the early 1900s, provides evidence against a hypothesized mitochondrial disease in bison, and reveals the signature of recent hybridization between American plains bison (Bison bison bison) and Canadian wood bison (B. b. athabascae). Our study demonstrates how mitochondrial DNA can be applied to delineate the history of wildlife species and inform future conservation actions.

  15. Mitochondrial Genome Analysis Reveals Historical Lineages in Yellowstone Bison

    PubMed Central

    Derr, James N.

    2016-01-01

    Yellowstone National Park is home to one of the only plains bison populations that have continuously existed on their present landscape since prehistoric times without evidence of domestic cattle introgression. Previous studies characterized the relatively high levels of nuclear genetic diversity in these bison, but little is known about their mitochondrial haplotype diversity. This study assessed mitochondrial genomes from 25 randomly selected Yellowstone bison and found 10 different mitochondrial haplotypes with a haplotype diversity of 0.78 (± 0.06). Spatial analysis of these mitochondrial DNA (mtDNA) haplotypes did not detect geographic population subdivision (FST = -0.06, p = 0.76). However, we identified two independent and historically important lineages in Yellowstone bison by combining data from 65 bison (defined by 120 polymorphic sites) from across North America representing a total of 30 different mitochondrial DNA haplotypes. Mitochondrial DNA haplotypes from one of the Yellowstone lineages represent descendants of the 22 indigenous bison remaining in central Yellowstone in 1902. The other mitochondrial DNA lineage represents descendants of the 18 females introduced from northern Montana in 1902 to supplement the indigenous bison population and develop a new breeding herd in the northern region of the park. Comparing modern and historical mitochondrial DNA diversity in Yellowstone bison helps uncover a historical context of park restoration efforts during the early 1900s, provides evidence against a hypothesized mitochondrial disease in bison, and reveals the signature of recent hybridization between American plains bison (Bison bison bison) and Canadian wood bison (B. b. athabascae). Our study demonstrates how mitochondrial DNA can be applied to delineate the history of wildlife species and inform future conservation actions. PMID:27880780

  16. Genome-wide SNP typing reveals signatures of population history.

    PubMed

    Hughes, Austin L; Welch, Robert; Puri, Vinita; Matthews, Casey; Haque, Kashif; Chanock, Stephen J; Yeager, Meredith

    2008-07-01

    Single-nucleotide polymorphism (SNP) arrays have become a popular technology for disease-association studies, but they also have potential for studying the genetic differentiation of human populations. Application of the Affymetrix GeneChip Human Mapping 500K Array Set to a population of 102 individuals representing the major ethnic groups in the United States (African, Asian, European, and Hispanic) revealed patterns of gene diversity and genetic distance that reflected population history. We analyzed allelic frequencies at 388,654 autosomal SNP sites that showed some variation in our study population and 10% or fewer missing values. Despite the small size (23-31 individuals) of each subpopulation, there were no fixed differences at any site between any two subpopulations. As expected from the African origin of modern humans, greater gene diversity was seen in Africans than in either Asians or Europeans, and the genetic distance between the Asian and the European populations was significantly lower than that between either of these two populations and Africans. Principal components analysis applied to a correlation matrix among individuals was able to separate completely the major continental groups of humans (Africans, Asians, and Europeans), while Hispanics overlapped all three of these groups. Genes containing two or more markers with extraordinarily high genetic distance between subpopulations were identified as candidate genes for health differences between subpopulations. The results show that, even with modest sample sizes, genome-wide SNP genotyping technologies have great promise for capturing signatures of gene frequency difference between human subpopulations, with applications in areas as diverse as forensics and the study of ethnic health disparities.

  17. ‘Candidatus Competibacter'-lineage genomes retrieved from metagenomes reveal functional metabolic diversity

    PubMed Central

    McIlroy, Simon J; Albertsen, Mads; Andresen, Eva K; Saunders, Aaron M; Kristiansen, Rikke; Stokholm-Bjerregaard, Mikkel; Nielsen, Kåre L; Nielsen, Per H

    2014-01-01

    The glycogen-accumulating organism (GAO) ‘Candidatus Competibacter' (Competibacter) uses aerobically stored glycogen to enable anaerobic carbon uptake, which is subsequently stored as polyhydroxyalkanoates (PHAs). This biphasic metabolism is key for the Competibacter to survive under the cyclic anaerobic-‘feast': aerobic-‘famine' regime of enhanced biological phosphorus removal (EBPR) wastewater treatment systems. As they do not contribute to phosphorus (P) removal, but compete for resources with the polyphosphate-accumulating organisms (PAO), thought responsible for P removal, their proliferation theoretically reduces the EBPR capacity. In this study, two complete genomes from Competibacter were obtained from laboratory-scale enrichment reactors through metagenomics. Phylogenetic analysis identified the two genomes, ‘Candidatus Competibacter denitrificans' and ‘Candidatus Contendobacter odensis', as being affiliated with Competibacter-lineage subgroups 1 and 5, respectively. Both have genes for glycogen and PHA cycling and for the metabolism of volatile fatty acids. Marked differences were found in their potential for the Embden–Meyerhof–Parnas and Entner–Doudoroff glycolytic pathways, as well as for denitrification, nitrogen fixation, fermentation, trehalose synthesis and utilisation of glucose and lactate. Genetic comparison of P metabolism pathways with sequenced PAOs revealed the absence of the Pit phosphate transporter in the Competibacter-lineage genomes—identifying a key metabolic difference with the PAO physiology. These genomes are the first from any GAO organism and provide new insights into the complex interaction and niche competition between PAOs and GAOs in EBPR systems. PMID:24173461

  18. A pangenomic analysis of the Nannochloropsis organellar genomes reveals novel genetic variations in key metabolic genes

    PubMed Central

    2014-01-01

    Background Microalgae in the genus Nannochloropsis are photosynthetic marine Eustigmatophytes of significant interest to the bioenergy and aquaculture sectors due to their ability to efficiently accumulate biomass and lipids for utilization in renewable transportation fuels, aquaculture feed, and other useful bioproducts. To better understand the genetic complement that drives the metabolic processes of these organisms, we present the assembly and comparative pangenomic analysis of the chloroplast and mitochondrial genomes from Nannochloropsis salina CCMP1776. Results The chloroplast and mitochondrial genomes of N. salina are 98.4% and 97% identical to their counterparts in Nannochloropsis gaditana. Comparison of the Nannochloropsis pangenome to other algae within and outside of the same phyla revealed regions of significant genetic divergence in key genes that encode proteins needed for regulation of branched chain amino synthesis (acetohydroxyacid synthase), carbon fixation (RuBisCO activase), energy conservation (ATP synthase), protein synthesis and homeostasis (Clp protease, ribosome). Conclusions Many organellar gene modifications in Nannochloropsis are unique and deviate from conserved orthologs found across the tree of life. Implementation of secondary and tertiary structure prediction was crucial to functionally characterize many proteins and therefore should be implemented in automated annotation pipelines. The exceptional similarity of the N. salina and N. gaditana organellar genomes suggests that N. gaditana be reclassified as a strain of N. salina. PMID:24646409

  19. Comparative genomics of three Methanocellales strains reveal novel taxonomic and metabolic features.

    PubMed

    Lyu, Zhe; Lu, Yahai

    2015-06-01

    Methanocellales represents a new order of methanogens, which is widespread in environments and plays specifically the important role in methane emissions from paddy fields. To gain more insights into Methanocellales, comparative genomic studies were performed among three Methanocellales strains through the same annotation pipeline. Genetic relationships among strains revealed by genome alignment, pan-genome reconstruction and comparison of amino average identity suggest that they should be classified in different genera. In addition, multiple copies of cell cycle regulator proteins were identified for the first time in Archaea. Core metabolisms were reconstructed, predicting certain unique and novel features for Methanocellales, including a set of methanogenesis genes potentially organized toward specialization in utilizing low concentrations of H2, a new route of disulfide reduction catalysed by a disulfide-reducing hydrogenase (Drh) complex phylogenetically related to sulfate-reducing prokaryotes, an oxidative tricarboxylic acid (TCA) cycle, a sophisticated nitrogen uptake and regulation system as well as a versatile sulfur utilization system. These core metabolisms are largely conserved among the three strains, but differences in gene copy number and metabolic diversity are evident. The present study thus adds new dimensions to the unique ecophysiology of Methanocellales and offers a road map for further experimental characterization of this methanogen lineage.

  20. Alternative splicing in teleost fish genomes: same-species and cross-species analysis and comparisons.

    PubMed

    Lu, Jianguo; Peatman, Eric; Wang, Wenqi; Yang, Qing; Abernathy, Jason; Wang, Shaolin; Kucuktas, Huseyin; Liu, Zhanjiang

    2010-06-01

    Alternative splicing (AS) is a mechanism by which the coding diversity of the genome can be greatly increased. Rates of AS are known to vary according to the complexity of eukaryotic species potentially explaining the tremendous phenotypic diversity among species with similar numbers of coding genes. Little is known, however, about the nature or rate of AS in teleost fish. Here, we report the characteristics of AS in teleost fish and classification and frequency of five canonical AS types. We conducted both same-species and cross-species analysis utilizing the Genome Mapping and Alignment Program (GMAP) and an AS pipeline (ASpipe) to study AS in four genome-enabled species (Danio rerio, Oryzias latipes, Gasterosteus aculeatus, and Takifugu rubripes) and one species lacking a complete genome sequence, Ictalurus punctatus. AS frequency was lowest in the highly duplicated genome of zebrafish (17% of mapped genes). The compact genome of the pufferfish showed the highest occurrence of AS (approximately 43% of mapped genes). An inverse correlation between AS frequency and genome size was consistent across all analyzed species. Cross-species comparisons utilizing zebrafish as the reference genome allowed the identification of additional putative AS genes not revealed by zebrafish transcripts. Approximately, 50% of AS genes identified by same-species comparisons were shared among two or more species. A searchable website, the Teleost Alternative Splicing Database, was created to allow easy identification and visualization of AS transcripts in the studied teleost genomes. Our results and associated database should further our understanding of alternative splicing as an important functional and evolutionary mechanism in the genomes of teleost fish.

  1. Coelacanth genome sequence reveals the evolutionary history of vertebrate genes.

    PubMed

    Noonan, James P; Grimwood, Jane; Danke, Joshua; Schmutz, Jeremy; Dickson, Mark; Amemiya, Chris T; Myers, Richard M

    2004-12-01

    The coelacanth is one of the nearest living relatives of tetrapods. However, a teleost species such as zebrafish or Fugu is typically used as the outgroup in current tetrapod comparative sequence analyses. Such studies are complicated by the fact that teleost genomes have undergone a whole-genome duplication event, as well as individual gene-duplication events. Here, we demonstrate the value of coelacanth genome sequence by complete sequencing and analysis of the protocadherin gene cluster of the Indonesian coelacanth, Latimeria menadoensis. We found that coelacanth has 49 protocadherin cluster genes organized in the same three ordered subclusters, alpha, beta, and gamma, as the 54 protocadherin cluster genes in human. In contrast, whole-genome and tandem duplications have generated two zebrafish protocadherin clusters comprised of at least 97 genes. Additionally, zebrafish protocadherins are far more prone to homogenizing gene conversion events than coelacanth protocadherins, suggesting that recombination- and duplication-driven plasticity may be a feature of teleost genomes. Our results indicate that coelacanth provides the ideal outgroup sequence against which tetrapod genomes can be measured. We therefore present L. menadoensis as a candidate for whole-genome sequencing.

  2. Evolutionary analysis of Arabidopsis, cyanobacterial, and chloroplast genomes reveals plastid phylogeny and thousands of cyanobacterial genes in the nucleus

    PubMed Central

    Martin, William; Rujan, Tamas; Richly, Erik; Hansen, Andrea; Cornelsen, Sabine; Lins, Thomas; Leister, Dario; Stoebe, Bettina; Hasegawa, Masami; Penny, David

    2002-01-01

    Chloroplasts were once free-living cyanobacteria that became endosymbionts, but the genomes of contemporary plastids encode only ≈5–10% as many genes as those of their free-living cousins, indicating that many genes were either lost from plastids or transferred to the nucleus during the course of plant evolution. Previous estimates have suggested that between 800 and perhaps as many as 2,000 genes in the Arabidopsis genome might come from cyanobacteria, but genome-wide phylogenetic surveys that could provide direct estimates of this number are lacking. We compared 24,990 proteins encoded in the Arabidopsis genome to the proteins from three cyanobacterial genomes, 16 other prokaryotic reference genomes, and yeast. Of 9,368 Arabidopsis proteins sufficiently conserved for primary sequence comparison, 866 detected homologues only among cyanobacteria and 834 other branched with cyanobacterial homologues in phylogenetic trees. Extrapolating from these conserved proteins to the whole genome, the data suggest that ≈4,500 of Arabidopsis protein-coding genes (≈18% of the total) were acquired from the cyanobacterial ancestor of plastids. These proteins encompass all functional classes, and the majority of them are targeted to cell compartments other than the chloroplast. Analysis of 15 sequenced chloroplast genomes revealed 117 nuclear-encoded proteins that are also still present in at least one chloroplast genome. A phylogeny of chloroplast genomes inferred from 41 proteins and 8,303 amino acids sites indicates that at least two independent secondary endosymbiotic events have occurred involving red algae and that amino acid composition bias in chloroplast proteins strongly affects plastid genome phylogeny. PMID:12218172

  3. Evolutionary analysis of Arabidopsis, cyanobacterial, and chloroplast genomes reveals plastid phylogeny and thousands of cyanobacterial genes in the nucleus.

    PubMed

    Martin, William; Rujan, Tamas; Richly, Erik; Hansen, Andrea; Cornelsen, Sabine; Lins, Thomas; Leister, Dario; Stoebe, Bettina; Hasegawa, Masami; Penny, David

    2002-09-17

    Chloroplasts were once free-living cyanobacteria that became endosymbionts, but the genomes of contemporary plastids encode only approximately 5-10% as many genes as those of their free-living cousins, indicating that many genes were either lost from plastids or transferred to the nucleus during the course of plant evolution. Previous estimates have suggested that between 800 and perhaps as many as 2,000 genes in the Arabidopsis genome might come from cyanobacteria, but genome-wide phylogenetic surveys that could provide direct estimates of this number are lacking. We compared 24,990 proteins encoded in the Arabidopsis genome to the proteins from three cyanobacterial genomes, 16 other prokaryotic reference genomes, and yeast. Of 9,368 Arabidopsis proteins sufficiently conserved for primary sequence comparison, 866 detected homologues only among cyanobacteria and 834 other branched with cyanobacterial homologues in phylogenetic trees. Extrapolating from these conserved proteins to the whole genome, the data suggest that approximately 4,500 of Arabidopsis protein-coding genes ( approximately 18% of the total) were acquired from the cyanobacterial ancestor of plastids. These proteins encompass all functional classes, and the majority of them are targeted to cell compartments other than the chloroplast. Analysis of 15 sequenced chloroplast genomes revealed 117 nuclear-encoded proteins that are also still present in at least one chloroplast genome. A phylogeny of chloroplast genomes inferred from 41 proteins and 8,303 amino acids sites indicates that at least two independent secondary endosymbiotic events have occurred involving red algae and that amino acid composition bias in chloroplast proteins strongly affects plastid genome phylogeny.

  4. Comparative analysis of the peanut witches'-broom phytoplasma genome reveals horizontal transfer of potential mobile units and effectors.

    PubMed

    Chung, Wan-Chia; Chen, Ling-Ling; Lo, Wen-Sui; Lin, Chan-Pin; Kuo, Chih-Horng

    2013-01-01

    Phytoplasmas are a group of bacteria that are associated with hundreds of plant diseases. Due to their economical importance and the difficulties involved in the experimental study of these obligate pathogens, genome sequencing and comparative analysis have been utilized as powerful tools to understand phytoplasma biology. To date four complete phytoplasma genome sequences have been published. However, these four strains represent limited phylogenetic diversity. In this study, we report the shotgun sequencing and evolutionary analysis of a peanut witches'-broom (PnWB) phytoplasma genome. The availability of this genome provides the first representative of the 16SrII group and substantially improves the taxon sampling to investigate genome evolution. The draft genome assembly contains 13 chromosomal contigs with a total size of 562,473 bp, covering ∼90% of the chromosome. Additionally, a complete plasmid sequence is included. Comparisons among the five available phytoplasma genomes reveal the differentiations in gene content and metabolic capacity. Notably, phylogenetic inferences of the potential mobile units (PMUs) in these genomes indicate that horizontal transfer may have occurred between divergent phytoplasma lineages. Because many effectors are associated with PMUs, the horizontal transfer of these transposon-like elements can contribute to the adaptation and diversification of these pathogens. In summary, the findings from this study highlight the importance of improving taxon sampling when investigating genome evolution. Moreover, the currently available sequences are inadequate to fully characterize the pan-genome of phytoplasmas. Future genome sequencing efforts to expand phylogenetic diversity are essential in improving our understanding of phytoplasma evolution.

  5. Comparative Analysis of the Peanut Witches'-Broom Phytoplasma Genome Reveals Horizontal Transfer of Potential Mobile Units and Effectors

    PubMed Central

    Lo, Wen-Sui; Lin, Chan-Pin; Kuo, Chih-Horng

    2013-01-01

    Phytoplasmas are a group of bacteria that are associated with hundreds of plant diseases. Due to their economical importance and the difficulties involved in the experimental study of these obligate pathogens, genome sequencing and comparative analysis have been utilized as powerful tools to understand phytoplasma biology. To date four complete phytoplasma genome sequences have been published. However, these four strains represent limited phylogenetic diversity. In this study, we report the shotgun sequencing and evolutionary analysis of a peanut witches'-broom (PnWB) phytoplasma genome. The availability of this genome provides the first representative of the 16SrII group and substantially improves the taxon sampling to investigate genome evolution. The draft genome assembly contains 13 chromosomal contigs with a total size of 562,473 bp, covering ∼90% of the chromosome. Additionally, a complete plasmid sequence is included. Comparisons among the five available phytoplasma genomes reveal the differentiations in gene content and metabolic capacity. Notably, phylogenetic inferences of the potential mobile units (PMUs) in these genomes indicate that horizontal transfer may have occurred between divergent phytoplasma lineages. Because many effectors are associated with PMUs, the horizontal transfer of these transposon-like elements can contribute to the adaptation and diversification of these pathogens. In summary, the findings from this study highlight the importance of improving taxon sampling when investigating genome evolution. Moreover, the currently available sequences are inadequate to fully characterize the pan-genome of phytoplasmas. Future genome sequencing efforts to expand phylogenetic diversity are essential in improving our understanding of phytoplasma evolution. PMID:23626855

  6. Comparison of the Genome Sequence of the Poultry Pathogen Bordetella avium with Those of B. bronchiseptica, B. pertussis, and B. parapertussis Reveals Extensive Diversity in Surface Structures Associated with Host Interaction

    PubMed Central

    Sebaihia, Mohammed; Preston, Andrew; Maskell, Duncan J.; Kuzmiak, Holly; Connell, Terry D.; King, Natalie D.; Orndorff, Paul E.; Miyamoto, David M.; Thomson, Nicholas R.; Harris, David; Goble, Arlette; Lord, Angela; Murphy, Lee; Quail, Michael A.; Rutter, Simon; Squares, Robert; Squares, Steven; Woodward, John; Parkhill, Julian; Temple, Louise M.

    2006-01-01

    Bordetella avium is a pathogen of poultry and is phylogenetically distinct from Bordetella bronchiseptica, Bordetella pertussis, and Bordetella parapertussis, which are other species in the Bordetella genus that infect mammals. In order to understand the evolutionary relatedness of Bordetella species and further the understanding of pathogenesis, we obtained the complete genome sequence of B. avium strain 197N, a pathogenic strain that has been extensively studied. With 3,732,255 base pairs of DNA and 3,417 predicted coding sequences, it has the smallest genome and gene complement of the sequenced bordetellae. In this study, the presence or absence of previously reported virulence factors from B. avium was confirmed, and the genetic bases for growth characteristics were elucidated. Over 1,100 genes present in B. avium but not in B. bronchiseptica were identified, and most were predicted to encode surface or secreted proteins that are likely to define an organism adapted to the avian rather than the mammalian respiratory tracts. These include genes coding for the synthesis of a polysaccharide capsule, hemagglutinins, a type I secretion system adjacent to two very large genes for secreted proteins, and unique genes for both lipopolysaccharide and fimbrial biogenesis. Three apparently complete prophages are also present. The BvgAS virulence regulatory system appears to have polymorphisms at a poly(C) tract that is involved in phase variation in other bordetellae. A number of putative iron-regulated outer membrane proteins were predicted from the sequence, and this regulation was confirmed experimentally for five of these. PMID:16885469

  7. Comparative Genomic Analyses of the Human NPHP1 Locus Reveal Complex Genomic Architecture and Its Regional Evolution in Primates

    PubMed Central

    Yuan, Bo; Liu, Pengfei; Gupta, Aditya; Beck, Christine R.; Tejomurtula, Anusha; Campbell, Ian M.; Gambin, Tomasz; Simmons, Alexandra D.; Withers, Marjorie A.; Harris, R. Alan; Rogers, Jeffrey; Schwartz, David C.; Lupski, James R.

    2015-01-01

    Many loci in the human genome harbor complex genomic structures that can result in susceptibility to genomic rearrangements leading to various genomic disorders. Nephronophthisis 1 (NPHP1, MIM# 256100) is an autosomal recessive disorder that can be caused by defects of NPHP1; the gene maps within the human 2q13 region where low copy repeats (LCRs) are abundant. Loss of function of NPHP1 is responsible for approximately 85% of the NPHP1 cases—about 80% of such individuals carry a large recurrent homozygous NPHP1 deletion that occurs via nonallelic homologous recombination (NAHR) between two flanking directly oriented ~45 kb LCRs. Published data revealed a non-pathogenic inversion polymorphism involving the NPHP1 gene flanked by two inverted ~358 kb LCRs. Using optical mapping and array-comparative genomic hybridization, we identified three potential novel structural variant (SV) haplotypes at the NPHP1 locus that may protect a haploid genome from the NPHP1 deletion. Inter-species comparative genomic analyses among primate genomes revealed massive genomic changes during evolution. The aggregated data suggest that dynamic genomic rearrangements occurred historically within the NPHP1 locus and generated SV haplotypes observed in the human population today, which may confer differential susceptibility to genomic instability and the NPHP1 deletion within a personal genome. Our study documents diverse SV haplotypes at a complex LCR-laden human genomic region. Comparative analyses provide a model for how this complex region arose during primate evolution, and studies among humans suggest that intra-species polymorphism may potentially modulate an individual’s susceptibility to acquiring disease-associated alleles. PMID:26641089

  8. The cavefish genome reveals candidate genes for eye loss

    PubMed Central

    McGaugh, Suzanne E.; Gross, Joshua B.; Aken, Bronwen; Blin, Maryline; Borowsky, Richard; Chalopin, Domitille; Hinaux, Hélène; Jeffery, William R.; Keene, Alex; Ma, Li; Minx, Patrick; Murphy, Daniel; O’Quin, Kelly E.; Rétaux, Sylvie; Rohner, Nicolas; Searle, Steve M. J.; Stahl, Bethany A.; Tabin, Cliff; Volff, Jean-Nicolas; Yoshizawa, Masato; Warren, Wesley C.

    2014-01-01

    Natural populations subjected to strong environmental selection pressures offer a window into the genetic underpinnings of evolutionary change. Cavefish populations, Astyanax mexicanus (Teleostei: Characiphysi), exhibit repeated, independent evolution for a variety of traits including eye degeneration, pigment loss, increased size and number of taste buds and mechanosensory organs, and shifts in many behavioural traits. Surface and cave forms are interfertile making this system amenable to genetic interrogation; however, lack of a reference genome has hampered efforts to identify genes responsible for changes in cave forms of A. mexicanus. Here we present the first de novo genome assembly for Astyanax mexicanus cavefish, contrast repeat elements to other teleost genomes, identify candidate genes underlying quantitative trait loci (QTL), and assay these candidate genes for potential functional and expression differences. We expect the cavefish genome to advance understanding of the evolutionary process, as well as, analogous human disease including retinal dysfunction. PMID:25329095

  9. Butterfly genome reveals promiscuous exchange of mimicry adaptations among species

    PubMed Central

    Dasmahapatra, Kanchon K; Walters, James R.; Briscoe, Adriana D.; Davey, John W.; Whibley, Annabel; Nadeau, Nicola J.; Zimin, Aleksey V.; Hughes, Daniel S. T.; Ferguson, Laura C.; Martin, Simon H.; Salazar, Camilo; Lewis, James J.; Adler, Sebastian; Ahn, Seung-Joon; Baker, Dean A.; Baxter, Simon W.; Chamberlain, Nicola L.; Chauhan, Ritika; Counterman, Brian A.; Dalmay, Tamas; Gilbert, Lawrence E.; Gordon, Karl; Heckel, David G.; Hines, Heather M.; Hoff, Katharina J.; Holland, Peter W.H.; Jacquin-Joly, Emmanuelle; Jiggins, Francis M.; Jones, Robert T.; Kapan, Durrell D.; Kersey, Paul; Lamas, Gerardo; Lawson, Daniel; Mapleson, Daniel; Maroja, Luana S.; Martin, Arnaud; Moxon, Simon; Palmer, William J.; Papa, Riccardo; Papanicolaou, Alexie; Pauchet, Yannick; Ray, David A.; Rosser, Neil; Salzberg, Steven L.; Supple, Megan A.; Surridge, Alison; Tenger-Trolander, Ayse; Vogel, Heiko; Wilkinson, Paul A.; Wilson, Derek; Yorke, James A.; Yuan, Furong; Balmuth, Alexi L.; Eland, Cathlene; Gharbi, Karim; Thomson, Marian; Gibbs, Richard A.; Han, Yi; Jayaseelan, Joy C.; Kovar, Christie; Mathew, Tittu; Muzny, Donna M.; Ongeri, Fiona; Pu, Ling-Ling; Qu, Jiaxin; Thornton, Rebecca L.; Worley, Kim C.; Wu, Yuan-Qing; Linares, Mauricio; Blaxter, Mark L.; Constant, Richard H. ffrench; Joron, Mathieu; Kronforst, Marcus R.; Mullen, Sean P.; Reed, Robert D.; Scherer, Steven E.; Richards, Stephen; Mallet, James; McMillan, W. Owen; Jiggins, Chris D.

    2012-01-01

    The evolutionary importance of hybridization and introgression has long been debated1. We used genomic tools to investigate introgression in Heliconius, a rapidly radiating genus of neotropical butterflies widely used in studies of ecology, behaviour, mimicry and speciation2-5 . We sequenced the genome of Heliconius melpomene and compared it with other taxa to investigate chromosomal evolution in Lepidoptera and gene flow among multiple Heliconius species and races. Among 12,657 predicted genes for Heliconius, biologically important expansions of families of chemosensory and Hox genes are particularly noteworthy. Chromosomal organisation has remained broadly conserved since the Cretaceous, when butterflies split from the silkmoth lineage. Using genomic resequencing, we show hybrid exchange of genes between three co-mimics, H. melpomene, H. timareta, and H. elevatus, especially at two genomic regions that control mimicry pattern. Closely related Heliconius species clearly exchange protective colour pattern genes promiscuously, implying a major role for hybridization in adaptive radiation. PMID:22722851

  10. Microsporidian genome analysis reveals evolutionary strategies for obligate intracellular growth

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Microsporidia comprise a large phylum of obligate intracellular eukaryotes that are fungalrelated parasites responsible for widespread disease, and here we address questions about microsporidia biology and evolution. We sequenced three microsporidian genomes from two species, Nematocida parisii and...

  11. Genomic Mining Reveals Deep Evolutionary Relationships between Bornaviruses and Bats

    PubMed Central

    Cui, Jie; Wang, Lin-Fa

    2015-01-01

    Bats globally harbor viruses in order Mononegavirales, such as lyssaviruses and henipaviruses; however, little is known about their relationships with bornaviruses. Previous studies showed that viral fossils of bornaviral origin are embedded in the genomes of several mammalian species such as primates, indicative of an ancient origin of exogenous bornaviruses. In this study, we mined the available 10 bat genomes and recreated a clear evolutionary relationship of endogenous bornaviral elements and bats. Comparative genomics showed that endogenization of bornaviral elements frequently occurred in vesper bats, harboring EBLLs (endogenous bornavirus-like L elements) in their genomes. Molecular dating uncovered a continuous bornavirus-bat interaction spanning 70 million years. We conclude that better understanding of modern exogenous bornaviral circulation in bat populations is warranted. PMID:26569285

  12. Three crocodilian genomes reveal ancestral patterns of evolution among archosaurs

    PubMed Central

    Green, Richard E; Braun, Edward L; Armstrong, Joel; Earl, Dent; Nguyen, Ngan; Hickey, Glenn; Vandewege, Michael W; St John, John A; Capella-Gutiérrez, Salvador; Castoe, Todd A; Kern, Colin; Fujita, Matthew K; Opazo, Juan C; Jurka, Jerzy; Kojima, Kenji K; Caballero, Juan; Hubley, Robert M; Smit, Arian F; Platt, Roy N; Lavoie, Christine A; Ramakodi, Meganathan P; Finger, John W; Suh, Alexander; Isberg, Sally R; Miles, Lee; Chong, Amanda Y; Jaratlerdsiri, Weerachai; Gongora, Jaime; Moran, Christopher; Iriarte, Andrés; McCormack, John; Burgess, Shane C; Edwards, Scott V; Lyons, Eric; Williams, Christina; Breen, Matthew; Howard, Jason T; Gresham, Cathy R; Peterson, Daniel G; Schmitz, Jürgen; Pollock, David D; Haussler, David; Triplett, Eric W; Zhang, Guojie; Irie, Naoki; Jarvis, Erich D; Brochu, Christopher A; Schmidt, Carl J; McCarthy, Fiona M; Faircloth, Brant C; Hoffmann, Federico G; Glenn, Travis C; Gabaldón, Toni; Paten, Benedict; Ray, David A

    2015-01-01

    To provide context for the diversifications of archosaurs, the group that includes crocodilians, dinosaurs and birds, we generated draft genomes of three crocodilians, Alligator mississippiensis (the American alligator), Crocodylus porosus (the saltwater crocodile), and Gavialis gangeticus (the Indian gharial). We observed an exceptionally slow rate of genome evolution within crocodilians at all levels, including nucleotide substitutions, indels, transposable element content and movement, gene family evolution, and chromosomal synteny. When placed within the context of related taxa including birds and turtles, this suggests that the common ancestor of all of these taxa also exhibited slow genome evolution and that the relatively rapid evolution of bird genomes represents an autapomorphy within that clade. The data also provided the opportunity to analyze heterozygosity in crocodilians, which indicates a likely reduction in population size for all three taxa through the Pleistocene. Finally, these new data combined with newly published bird genomes allowed us to reconstruct the partial genome of the common ancestor of archosaurs providing a tool to investigate the genetic starting material of crocodilians, birds, and dinosaurs. PMID:25504731

  13. Ecoepidemiology and complete genome comparison of different strains of severe acute respiratory syndrome-related Rhinolophus bat coronavirus in China reveal bats as a reservoir for acute, self-limiting infection that allows recombination events.

    PubMed

    Lau, Susanna K P; Li, Kenneth S M; Huang, Yi; Shek, Chung-Tong; Tse, Herman; Wang, Ming; Choi, Garnet K Y; Xu, Huifang; Lam, Carol S F; Guo, Rongtong; Chan, Kwok-Hung; Zheng, Bo-Jian; Woo, Patrick C Y; Yuen, Kwok-Yung

    2010-03-01

    Despite the identification of severe acute respiratory syndrome-related coronavirus (SARSr-CoV) in Rhinolophus Chinese horseshoe bats (SARSr-Rh-BatCoV) in China, the evolutionary and possible recombination origin of SARSr-CoV remains undetermined. We carried out the first study to investigate the migration pattern and SARSr-Rh-BatCoV genome epidemiology in Chinese horseshoe bats during a 4-year period. Of 1,401 Chinese horseshoe bats from Hong Kong and Guangdong, China, that were sampled, SARSr-Rh-BatCoV was detected in alimentary specimens from 130 (9.3%) bats, with peak activity during spring. A tagging exercise of 511 bats showed migration distances from 1.86 to 17 km. Bats carrying SARSr-Rh-BatCoV appeared healthy, with viral clearance occurring between 2 weeks and 4 months. However, lower body weights were observed in bats positive for SARSr-Rh-BatCoV, but not Rh-BatCoV HKU2. Complete genome sequencing of 10 SARSr-Rh-BatCoV strains showed frequent recombination between different strains. Moreover, recombination was detected between SARSr-Rh-BatCoV Rp3 from Guangxi, China, and Rf1 from Hubei, China, in the possible generation of civet SARSr-CoV SZ3, with a breakpoint at the nsp16/spike region. Molecular clock analysis showed that SARSr-CoVs were newly emerged viruses with the time of the most recent common ancestor (tMRCA) at 1972, which diverged between civet and bat strains in 1995. The present data suggest that SARSr-Rh-BatCoV causes acute, self-limiting infection in horseshoe bats, which serve as a reservoir for recombination between strains from different geographical locations within reachable foraging range. Civet SARSr-CoV is likely a recombinant virus arising from SARSr-CoV strains closely related to SARSr-Rh-BatCoV Rp3 and Rf1. Such frequent recombination, coupled with rapid evolution especially in ORF7b/ORF8 region, in these animals may have accounted for the cross-species transmission and emergence of SARS.

  14. Reconstruction of the lipid metabolism for the microalga Monoraphidium neglectum from its genome sequence reveals characteristics suitable for biofuel production

    PubMed Central

    2013-01-01

    Background Microalgae are gaining importance as sustainable production hosts in the fields of biotechnology and bioenergy. A robust biomass accumulating strain of the genus Monoraphidium (SAG 48.87) was investigated in this work as a potential feedstock for biofuel production. The genome was sequenced, annotated, and key enzymes for triacylglycerol formation were elucidated. Results Monoraphidium neglectum was identified as an oleaginous species with favourable growth characteristics as well as a high potential for crude oil production, based on neutral lipid contents of approximately 21% (dry weight) under nitrogen starvation, composed of predominantly C18:1 and C16:0 fatty acids. Further characterization revealed growth in a relatively wide pH range and salt concentrations of up to 1.0% NaCl, in which the cells exhibited larger structures. This first full genome sequencing of a member of the Selenastraceae revealed a diploid, approximately 68 Mbp genome with a G + C content of 64.7%. The circular chloroplast genome was assembled to a 135,362 bp single contig, containing 67 protein-coding genes. The assembly of the mitochondrial genome resulted in two contigs with an approximate total size of 94 kb, the largest known mitochondrial genome within algae. 16,761 protein-coding genes were assigned to the nuclear genome. Comparison of gene sets with respect to functional categories revealed a higher gene number assigned to the category “carbohydrate metabolic process” and in “fatty acid biosynthetic process” in M. neglectum when compared to Chlamydomonas reinhardtii and Nannochloropsis gaditana, indicating a higher metabolic diversity for applications in carbohydrate conversions of biotechnological relevance. Conclusions The genome of M. neglectum, as well as the metabolic reconstruction of crucial lipid pathways, provides new insights into the diversity of the lipid metabolism in microalgae. The results of this work provide a platform to encourage the

  15. Comparative Genomics Including the Early-Diverging Smut Fungus Ceraceosorus bombacis Reveals Signatures of Parallel Evolution within Plant and Animal Pathogens of Fungi and Oomycetes.

    PubMed

    Sharma, Rahul; Xia, Xiaojuan; Riess, Kai; Bauer, Robert; Thines, Marco

    2015-08-27

    Ceraceosorus bombacis is an early-diverging lineage of smut fungi and a pathogen of cotton trees (Bombax ceiba). To study the evolutionary genomics of smut fungi in comparison with other fungal and oomycete pathogens, the genome of C. bombacis was sequenced and comparative genomic analyses were performed. The genome of 26.09 Mb encodes for 8,024 proteins, of which 576 are putative-secreted effector proteins (PSEPs). Orthology analysis revealed 30 ortholog PSEPs among six Ustilaginomycotina genomes, the largest groups of which are lytic enzymes, such as aspartic peptidase and glycoside hydrolase. Positive selection analyses revealed the highest percentage of positively selected PSEPs in C. bombacis compared with other Ustilaginomycotina genomes. Metabolic pathway analyses revealed the absence of genes encoding for nitrite and nitrate reductase in the genome of the human skin pathogen Malassezia globosa, but these enzymes are present in the sequenced plant pathogens in smut fungi. Interestingly, these genes are also absent in cultivable oomycete animal pathogens, while nitrate reductase has been lost in cultivable oomycete plant pathogens. Similar patterns were also observed for obligate biotrophic and hemi-biotrophic fungal and oomycete pathogens. Furthermore, it was found that both fungal and oomycete animal pathogen genomes are lacking cutinases and pectinesterases. Overall, these findings highlight the parallel evolution of certain genomic traits, revealing potential common evolutionary trajectories among fungal and oomycete pathogens, shaping the pathogen genomes according to their lifestyle.

  16. The complete mitochondrial genome of Arctic Calanus hyperboreus (Copepoda, Calanoida) reveals characteristic patterns in calanoid mitochondrial genome.

    PubMed

    Kim, Sanghee; Lim, Byung-Jin; Min, Gi-Sik; Choi, Han-Gu

    2013-05-10

    Copepoda is the most diverse and abundant group of crustaceans, but its phylogenetic relationships are ambiguous. Mitochondrial (mt) genomes are useful for studying evolutionary history, but only six complete Copepoda mt genomes have been made available and these have extremely rearranged genome structures. This study determined the mt genome of Calanus hyperboreus, making it the first reported Arctic copepod mt genome and the first complete mt genome of a calanoid copepod. The mt genome of C. hyperboreus is 17,910 bp in length and it contains the entire set of 37 mt genes, including 13 protein-coding genes, 2 rRNAs, and 22 tRNAs. It has a very unusual gene structure, including the longest control region reported for a crustacean, a large tRNA gene cluster, and reversed GC skews in 11 out of 13 protein-coding genes (84.6%). Despite the unusual features, comparing this genome to published copepod genomes revealed retained pan-crustacean features, as well as a conserved calanoid-specific pattern. Our data provide a foundation for exploring the calanoid pattern and the mechanisms of mt gene rearrangement in the evolutionary history of the copepod mt genome.

  17. Evidence-based green algal genomics reveals marine diversity and ancestral characteristics of land plants

    DOE PAGES

    van Baren, Marijke J.; Bachy, Charles; Reistetter, Emily Nahas; ...

    2016-03-31

    Prasinophytes are widespread marine green algae that are related to plants. Abundance of the genus Micromonas has reportedly increased in the Arctic due to climate-induced changes. Thus, studies of these organisms are important for marine ecology and understanding Virdiplantae evolution and diversification. We generated evidence-based Micromonas gene models using proteomics and RNA-Seq to improve prasinophyte genomic resources. First, sequences of four chromosomes in the 22 Mb Micromonas pusilla (CCMP1545) genome were finished. Comparison with the finished 21 Mb Micromonas commoda (RCC299) shows they share ≤ 8,142 of ~10,000 protein-encoding genes, depending on the analysis method. Unlike RCC299 and other sequencedmore » eukaryotes, CCMP1545 has two abundant repetitive intron types and a high percent (26%) GC splice donors. Micromonas has more genus-specific protein families (19%) than other genome sequenced prasinophytes (11%). Comparative analyses using predicted proteomes from other prasinophytes reveal proteins likely related to scale formation and ancestral photosynthesis. Our studies also indicate that peptidoglycan (PG) biosynthesis enzymes have been lost in multiple independent events in select prasinophytes and most plants. However, CCMP1545, polar Micromonas CCMP2099 and prasinophytes from other claasses retain the entire PG pathway, like moss and glaucophyte algae. Multiple vascular plants that share a unique bi-domain protein also have the pathway, except the Penicillin-Binding-Protein. Alongside Micromonas experiments using antibiotics that halt bacterial PG biosynthesis, the findings highlight unrecognized phylogenetic complexity in the PG-pathway retention and implicate a role in chloroplast structure of division in several extant Vridiplantae lineages. Extensive differences in gene loss and architecture between related prasinophytes underscore their extensive divergence. PG biosynthesis genes from the cyanobacterial endosymbiont that became the

  18. Mechanisms of thermal adaptation revealed from the genomes of the Antarctic

    SciTech Connect

    Saunders, Neil F.W.; Thomas, Torsten; Curmi, Paul M.G.; Mattick, John S.; Kuczek, Elizabeth; Slade, Rob; Davis, John; Franzmann, Peter; Boone, David; Rusterholtz, Karl; Feldman, Robert; Gates, Chris; Bench, Shellie; Sowers, Kevin; Kadner, Kristen; Aerts, Andrea; Dehal, Paramvir; Detter, Chris; Glavina, Tijana; Lucas, Susan; Richardson, Paul; Larimer, Frank; Hauser , Frank; Hauser, Loren; Land, Miriam; Cavicchioli, Richard

    2003-03-01

    We generated draft genome sequences for two cold-adapted Archaea, Methanogenium frigidum and Methanococcoides burtonii, to identify genotypic characteristics that distinguish them from Archaea with a higher optimal growth temperature (OGT). Comparative genomics revealed trends in amino acid and tRNA composition, and structural features of proteins. Proteins from the cold-adapted Archaea are characterized by a higher content of non-charged polar amino acids, particularly Gln and Thr and a lower content of hydrophobic amino acids, particularly Leu. Sequence data from nine methanogen genomes (OGT 15-98 C) was used to generate 1 111 modeled protein structures. Analysis of the models from the cold-adapted Archaea showed a strong tendency in the solvent accessible area for more Gln, Thr an hydrophobic residues and fewer charged residues. A cold shock domain (CSD) protein (CspA homolog) was identified in M. frigidum, two hypothetical proteins with CSD-folds in M. burtonii, and a unique winged helix DNA-binding domain protein in M. burtonii. This suggests that these types of nucleic acid binding proteins have a critical role in cold-adapted Archaea. Structural analysis of tRNA sequences from the Archaea indicated that GC content is the major factor influencing tRNA stability in hyperthermophiles, but not in the psychrophiles, mesophiles or moderate thermophiles. Below an OGT of 60 C, the GC content in tRNA was largely unchanged, indicating that any requirement for flexibility of tRNA in psychrophiles is mediated by other means. This is the first time that comparisons have been performed with genome data from Archaea spanning the growth temperature extremes from psychrophiles to hyperthermophiles.

  19. Genome and Transcriptome Sequences Reveal the Specific Parasitism of the Nematophagous Purpureocillium lilacinum 36-1

    PubMed Central

    Xie, Jialian; Li, Shaojun; Mo, Chenmi; Xiao, Xueqiong; Peng, Deliang; Wang, Gaofeng; Xiao, Yannong

    2016-01-01

    Purpureocillium lilacinum is a promising nematophagous ascomycete able to adapt diverse environments and it is also an opportunistic fungus that infects humans. A microbial inoculant of P. lilacinum has been registered to control plant parasitic nematodes. However, the molecular mechanism of the toxicological processes is still unclear because of the relatively few reports on the subject. In this study, using Illumina paired-end sequencing, the draft genome sequence and the transcriptome of P. lilacinum strain 36-1 infecting nematode-eggs were determined. Whole genome alignment indicated that P. lilacinum 36-1 possessed a more dynamic genome in comparison with P. lilacinum India strain. Moreover, a phylogenetic analysis showed that the P. lilacinum 36-1 had a closer relation to entomophagous fungi. The protein-coding genes in P. lilacinum 36-1 occurred much more frequently than they did in other fungi, which was a result of the depletion of repeat-induced point mutations (RIP). Comparative genome and transcriptome analyses revealed the genes that were involved in pathogenicity, particularly in the recognition, adhesion of nematode-eggs, downstream signal transduction pathways and hydrolase genes. By contrast, certain numbers of cellulose and xylan degradation genes and a lack of polysaccharide lyase genes showed the potential of P. lilacinum 36-1 as an endophyte. Notably, the expression of appressorium-formation and antioxidants-related genes exhibited similar infection patterns in P. lilacinum strain 36-1 to those of the model entomophagous fungi Metarhizium spp. These results uncovered the specific parasitism of P. lilacinum and presented the genes responsible for the infection of nematode-eggs. PMID:27486440

  20. Signatures of selection in tilapia revealed by whole genome resequencing.

    PubMed

    Xia, Jun Hong; Bai, Zhiyi; Meng, Zining; Zhang, Yong; Wang, Le; Liu, Feng; Jing, Wu; Wan, Zi Yi; Li, Jiale; Lin, Haoran; Yue, Gen Hua

    2015-09-16

    Natural selection and selective breeding for genetic improvement have left detectable signatures within the genome of a species. Identification of selection signatures is important in evolutionary biology and for detecting genes that facilitate to accelerate genetic improvement. However, selection signatures, including artificial selection and natural selection, have only been identified at the whole genome level in several genetically improved fish species. Tilapia is one of the most important genetically improved fish species in the world. Using next-generation sequencing, we sequenced the genomes of 47 tilapia individuals. We identified a total of 1.43 million high-quality SNPs and found that the LD block sizes ranged from 10-100 kb in tilapia. We detected over a hundred putative selective sweep regions in each line of tilapia. Most selection signatures were located in non-coding regions of the tilapia genome. The Wnt signaling, gonadotropin-releasing hormone receptor and integrin signaling pathways were under positive selection in all improved tilapia lines. Our study provides a genome-wide map of genetic variation and selection footprints in tilapia, which could be important for genetic studies and accelerating genetic improvement of tilapia.

  1. Three crocodilian genomes reveal ancestral patterns of evolution among archosaurs.

    PubMed

    Green, Richard E; Braun, Edward L; Armstrong, Joel; Earl, Dent; Nguyen, Ngan; Hickey, Glenn; Vandewege, Michael W; St John, John A; Capella-Gutiérrez, Salvador; Castoe, Todd A; Kern, Colin; Fujita, Matthew K; Opazo, Juan C; Jurka, Jerzy; Kojima, Kenji K; Caballero, Juan; Hubley, Robert M; Smit, Arian F; Platt, Roy N; Lavoie, Christine A; Ramakodi, Meganathan P; Finger, John W; Suh, Alexander; Isberg, Sally R; Miles, Lee; Chong, Amanda Y; Jaratlerdsiri, Weerachai; Gongora, Jaime; Moran, Christopher; Iriarte, Andrés; McCormack, John; Burgess, Shane C; Edwards, Scott V; Lyons, Eric; Williams, Christina; Breen, Matthew; Howard, Jason T; Gresham, Cathy R; Peterson, Daniel G; Schmitz, Jürgen; Pollock, David D; Haussler, David; Triplett, Eric W; Zhang, Guojie; Irie, Naoki; Jarvis, Erich D; Brochu, Christopher A; Schmidt, Carl J; McCarthy, Fiona M; Faircloth, Brant C; Hoffmann, Federico G; Glenn, Travis C; Gabaldón, Toni; Paten, Benedict; Ray, David A

    2014-12-12

    To provide context for the diversification of archosaurs--the group that includes crocodilians, dinosaurs, and birds--we generated draft genomes of three crocodilians: Alligator mississippiensis (the American alligator), Crocodylus porosus (the saltwater crocodile), and Gavialis gangeticus (the Indian gharial). We observed an exceptionally slow rate of genome evolution within crocodilians at all levels, including nucleotide substitutions, indels, transposable element content and movement, gene family evolution, and chromosomal synteny. When placed within the context of related taxa including birds and turtles, this suggests that the common ancestor of all of these taxa also exhibited slow genome evolution and that the comparatively rapid evolution is derived in birds. The data also provided the opportunity to analyze heterozygosity in crocodilians, which indicates a likely reduction in population size for all three taxa through the Pleistocene. Finally, these data combined with newly published bird genomes allowed us to reconstruct the partial genome of the common ancestor of archosaurs, thereby providing a tool to investigate the genetic starting material of crocodilians, birds, and dinosaurs.

  2. Intra-species sequence comparisons for annotating genomes

    SciTech Connect

    Boffelli, Dario; Weer, Claire V.; Weng, Li; Lewis, Keith D.; Shoukry, Malak I.; Pachter, Lior; Keys, David N.; Rubin, Edward M.

    2004-07-15

    Analysis of sequence variation among members of a single species offers a potential approach to identify functional DNA elements responsible for biological features unique to that species. Due to its high rate of allelic polymorphism and ease of genetic manipulability, we chose the sea squirt, Ciona intestinalis, to explore intra-species sequence comparisons for genome annotation. A large number of C. intestinalis specimens were collected from four continents and a set of genomic intervals amplified, resequenced and analyzed to determine the mutation rates at each nucleotide in the sequence. We found that regions with low mutation rates efficiently demarcated functionally constrained sequences: these include a set of noncoding elements, which we showed in C intestinalis transgenic assays to act as tissue-specific enhancers, as well as the location of coding sequences. This illustrates that comparisons of multiple members of a species can be used for genome annotation, suggesting a path for the annotation of the sequenced genomes of organisms occupying uncharacterized phylogenetic branches of the animal kingdom and raises the possibility that the resequencing of a large number of Homo sapiens individuals might be used to annotate the human genome and identify sequences defining traits unique to our species. The sequence data from this study has been submitted to GenBank under accession nos. AY667278-AY667407.

  3. Genome analysis of the platypus reveals unique signatures of evolution.

    PubMed

    Warren, Wesley C; Hillier, LaDeana W; Marshall Graves, Jennifer A; Birney, Ewan; Ponting, Chris P; Grützner, Frank; Belov, Katherine; Miller, Webb; Clarke, Laura; Chinwalla, Asif T; Yang, Shiaw-Pyng; Heger, Andreas; Locke, Devin P; Miethke, Pat; Waters, Paul D; Veyrunes, Frédéric; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Wallis, John; Puente, Xose S; López-Otín, Carlos; Ordóñez, Gonzalo R; Eichler, Evan E; Chen, Lin; Cheng, Ze; Deakin, Janine E; Alsop, Amber; Thompson, Katherine; Kirby, Patrick; Papenfuss, Anthony T; Wakefield, Matthew J; Olender, Tsviya; Lancet, Doron; Huttley, Gavin A; Smit, Arian F A; Pask, Andrew; Temple-Smith, Peter; Batzer, Mark A; Walker, Jerilyn A; Konkel, Miriam K; Harris, Robert S; Whittington, Camilla M; Wong, Emily S W; Gemmell, Neil J; Buschiazzo, Emmanuel; Vargas Jentzsch, Iris M; Merkel, Angelika; Schmitz, Juergen; Zemann, Anja; Churakov, Gennady; Kriegs, Jan Ole; Brosius, Juergen; Murchison, Elizabeth P; Sachidanandam, Ravi; Smith, Carly; Hannon, Gregory J; Tsend-Ayush, Enkhjargal; McMillan, Daniel; Attenborough, Rosalind; Rens, Willem; Ferguson-Smith, Malcolm; Lefèvre, Christophe M; Sharp, Julie A; Nicholas, Kevin R; Ray, David A; Kube, Michael; Reinhardt, Richard; Pringle, Thomas H; Taylor, James; Jones, Russell C; Nixon, Brett; Dacheux, Jean-Louis; Niwa, Hitoshi; Sekita, Yoko; Huang, Xiaoqiu; Stark, Alexander; Kheradpour, Pouya; Kellis, Manolis; Flicek, Paul; Chen, Yuan; Webber, Caleb; Hardison, Ross; Nelson, Joanne; Hallsworth-Pepin, Kym; Delehaunty, Kim; Markovic, Chris; Minx, Pat; Feng, Yucheng; Kremitzki, Colin; Mitreva, Makedonka; Glasscock, Jarret; Wylie, Todd; Wohldmann, Patricia; Thiru, Prathapan; Nhan, Michael N; Pohl, Craig S; Smith, Scott M; Hou, Shunfeng; Nefedov, Mikhail; de Jong, Pieter J; Renfree, Marilyn B; Mardis, Elaine R; Wilson, Richard K

    2008-05-08

    We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation.

  4. Evolution of cancer suppression as revealed by mammalian comparative genomics.

    PubMed

    Tollis, Marc; Schiffman, Joshua D; Boddy, Amy M

    2017-02-02

    Cancer suppression is an important feature in the evolution of large and long-lived animals. While some tumor suppression pathways are conserved among all multicellular organisms, others mechanisms of cancer resistance are uniquely lineage specific. Comparative genomics has become a powerful tool to discover these unique and shared molecular adaptations in respect to cancer suppression. These findings may one day be translated to human patients through evolutionary medicine. Here, we will review theory and methods of comparative cancer genomics and highlight major findings of cancer suppression across mammals. Our current knowledge of cancer genomics suggests that more efficient DNA repair and higher sensitivity to DNA damage may be the key to tumor suppression in large or long-lived mammals.

  5. The genomes of four tapeworm species reveal adaptations to parasitism.

    PubMed

    Tsai, Isheng J; Zarowiecki, Magdalena; Holroyd, Nancy; Garciarrubio, Alejandro; Sanchez-Flores, Alejandro; Brooks, Karen L; Tracey, Alan; Bobes, Raúl J; Fragoso, Gladis; Sciutto, Edda; Aslett, Martin; Beasley, Helen; Bennett, Hayley M; Cai, Jianping; Camicia, Federico; Clark, Richard; Cucher, Marcela; De Silva, Nishadi; Day, Tim A; Deplazes, Peter; Estrada, Karel; Fernández, Cecilia; Holland, Peter W H; Hou, Junling; Hu, Songnian; Huckvale, Thomas; Hung, Stacy S; Kamenetzky, Laura; Keane, Jacqueline A; Kiss, Ferenc; Koziol, Uriel; Lambert, Olivia; Liu, Kan; Luo, Xuenong; Luo, Yingfeng; Macchiaroli, Natalia; Nichol, Sarah; Paps, Jordi; Parkinson, John; Pouchkina-Stantcheva, Natasha; Riddiford, Nick; Rosenzvit, Mara; Salinas, Gustavo; Wasmuth, James D; Zamanian, Mostafa; Zheng, Yadong; Cai, Xuepeng; Soberón, Xavier; Olson, Peter D; Laclette, Juan P; Brehm, Klaus; Berriman, Matthew

    2013-04-04

    Tapeworms (Cestoda) cause neglected diseases that can be fatal and are difficult to treat, owing to inefficient drugs. Here we present an analysis of tapeworm genome sequences using the human-infective species Echinococcus multilocularis, E. granulosus, Taenia solium and the laboratory model Hymenolepis microstoma as examples. The 115- to 141-megabase genomes offer insights into the evolution of parasitism. Synteny is maintained with distantly related blood flukes but we find extreme losses of genes and pathways that are ubiquitous in other animals, including 34 homeobox families and several determinants of stem cell fate. Tapeworms have specialized detoxification pathways, metabolism that is finely tuned to rely on nutrients scavenged from their hosts, and species-specific expansions of non-canonical heat shock proteins and families of known antigens. We identify new potential drug targets, including some on which existing pharmaceuticals may act. The genomes provide a rich resource to underpin the development of urgently needed treatments and control.

  6. Genome analysis of the platypus reveals unique signatures of evolution

    PubMed Central

    Warren, Wesley C.; Hillier, LaDeana W.; Marshall Graves, Jennifer A.; Birney, Ewan; Ponting, Chris P.; Grützner, Frank; Belov, Katherine; Miller, Webb; Clarke, Laura; Chinwalla, Asif T.; Yang, Shiaw-Pyng; Heger, Andreas; Locke, Devin P.; Miethke, Pat; Waters, Paul D.; Veyrunes, Frédéric; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Wallis, John; Puente, Xose S.; López-Otín, Carlos; Ordóñez, Gonzalo R.; Eichler, Evan E.; Chen, Lin; Cheng, Ze; Deakin, Janine E.; Alsop, Amber; Thompson, Katherine; Kirby, Patrick; Papenfuss, Anthony T.; Wakefield, Matthew J.; Olender, Tsviya; Lancet, Doron; Huttley, Gavin A.; Smit, Arian F. A.; Pask, Andrew; Temple-Smith, Peter; Batzer, Mark A.; Walker, Jerilyn A.; Konkel, Miriam K.; Harris, Robert S.; Whittington, Camilla M.; Wong, Emily S. W.; Gemmell, Neil J.; Buschiazzo, Emmanuel; Vargas Jentzsch, Iris M.; Merkel, Angelika; Schmitz, Juergen; Zemann, Anja; Churakov, Gennady; Kriegs, Jan Ole; Brosius, Juergen; Murchison, Elizabeth P.; Sachidanandam, Ravi; Smith, Carly; Hannon, Gregory J.; Tsend-Ayush, Enkhjargal; McMillan, Daniel; Attenborough, Rosalind; Rens, Willem; Ferguson-Smith, Malcolm; Lefèvre, Christophe M.; Sharp, Julie A.; Nicholas, Kevin R.; Ray, David A.; Kube, Michael; Reinhardt, Richard; Pringle, Thomas H.; Taylor, James; Jones, Russell C.; Nixon, Brett; Dacheux, Jean-Louis; Niwa, Hitoshi; Sekita, Yoko; Huang, Xiaoqiu; Stark, Alexander; Kheradpour, Pouya; Kellis, Manolis; Flicek, Paul; Chen, Yuan; Webber, Caleb; Hardison, Ross; Nelson, Joanne; Hallsworth-Pepin, Kym; Delehaunty, Kim; Markovic, Chris; Minx, Pat; Feng, Yucheng; Kremitzki, Colin; Mitreva, Makedonka; Glasscock, Jarret; Wylie, Todd; Wohldmann, Patricia; Thiru, Prathapan; Nhan, Michael N.; Pohl, Craig S.; Smith, Scott M.; Hou, Shunfeng; Renfree, Marilyn B.; Mardis, Elaine R.; Wilson, Richard K.

    2009-01-01

    We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation. PMID:18464734

  7. Initial sequence of the chimpanzee genome and comparison with the human genome.

    PubMed

    2005-09-01

    Here we present a draft genome sequence of the common chimpanzee (Pan troglodytes). Through comparison with the human genome, we have generated a largely complete catalogue of the genetic differences that have accumulated since the human and chimpanzee species diverged from our common ancestor, constituting approximately thirty-five million single-nucleotide changes, five million insertion/deletion events, and various chromosomal rearrangements. We use this catalogue to explore the magnitude and regional variation of mutational forces shaping these two genomes, and the strength of positive and negative selection acting on their genes. In particular, we find that the patterns of evolution in human and chimpanzee protein-coding genes are highly correlated and dominated by the fixation of neutral and slightly deleterious alleles. We also use the chimpanzee genome as an outgroup to investigate human population genetics and identify signatures of selective sweeps in recent human evolution.

  8. The Large Mitochondrial Genome of Symbiodinium minutum Reveals Conserved Noncoding Sequences between Dinoflagellates and Apicomplexans

    PubMed Central

    Shoguchi, Eiichi; Shinzato, Chuya; Hisata, Kanako; Satoh, Nori; Mungpakdee, Sutada

    2015-01-01

    Even though mitochondrial genomes, which characterize eukaryotic cells, were first discovered more than 50 years ago, mitochondrial genomics remains an important topic in molecular biology and genome sciences. The Phylum Alveolata comprises three major groups (ciliates, apicomplexans, and dinoflagellates), the mitochondrial genomes of which have diverged widely. Even though the gene content of dinoflagellate mitochondrial genomes is reportedly comparable to that of apicomplexans, the highly fragmented and rearranged genome structures of dinoflagellates have frustrated whole genomic analysis. Consequently, noncoding sequences and gene arrangements of dinoflagellate mitochondrial genomes have not been well characterized. Here we report that the continuous assembled genome (∼326 kb) of the dinoflagellate, Symbiodinium minutum, is AT-rich (∼64.3%) and that it contains three protein-coding genes. Based upon in silico analysis, the remaining 99% of the genome comprises transcriptomic noncoding sequences. RNA edited sites and unique, possible start and stop codons clarify conserved regions among dinoflagellates. Our massive transcriptome analysis shows that almost all regions of the genome are transcribed, including 27 possible fragmented ribosomal RNA genes and 12 uncharacterized small RNAs that are similar to mitochondrial RNA genes of the malarial parasite, Plasmodium falciparum. Gene map comparisons show that gene order is only slightly conserved between S. minutum and P. falciparum. However, small RNAs and intergenic sequences share sequence similarities with P. falciparum, suggesting that the function of noncoding sequences has been preserved despite development of very different genome structures. PMID:26199191

  9. Genomic Variants Revealed by Invariably Missing Genotypes in Nelore Cattle

    PubMed Central

    da Silva, Joaquim Manoel; Giachetto, Poliana Fernanda; da Silva, Luiz Otávio Campos; Cintra, Leandro Carrijo; Paiva, Samuel Rezende; Caetano, Alexandre Rodrigues; Yamagishi, Michel Eduardo Beleza

    2015-01-01

    High density genotyping panels have been used in a wide range of applications. From population genetics to genome-wide association studies, this technology still offers the lowest cost and the most consistent solution for generating SNP data. However, in spite of the application, part of the generated data is always discarded from final datasets based on quality control criteria used to remove unreliable markers. Some discarded data consists of markers that failed to generate genotypes, labeled as missing genotypes. A subset of missing genotypes that occur in the whole population under study may be caused by technical issues but can also be explained by the presence of genomic variations that are in the vicinity of the assayed SNP and that prevent genotyping probes from annealing. The latter case may contain relevant information because these missing genotypes might be used to identify population-specific genomic variants. In order to assess which case is more prevalent, we used Illumina HD Bovine chip genotypes from 1,709 Nelore (Bos indicus) samples. We found 3,200 missing genotypes among the whole population. NGS re-sequencing data from 8 sires were used to verify the presence of genomic variations within their flanking regions in 81.56% of these missing genotypes. Furthermore, we discovered 3,300 novel SNPs/Indels, 31% of which are located in genes that may affect traits of importance for the genetic improvement of cattle production. PMID:26305794

  10. Butterfly genome reveals promiscuous exchange of mimicry adaptations among species.

    PubMed

    2012-07-05

    The evolutionary importance of hybridization and introgression has long been debated. Hybrids are usually rare and unfit, but even infrequent hybridization can aid adaptation by transferring beneficial traits between species. Here we use genomic tools to investigate introgression in Heliconius, a rapidly radiating genus of neotropical butterflies widely used in studies of ecology, behaviour, mimicry and speciation. We sequenced the genome of Heliconius melpomene and compared it with other taxa to investigate chromosomal evolution in Lepidoptera and gene flow among multiple Heliconius species and races. Among 12,669 predicted genes, biologically important expansions of families of chemosensory and Hox genes are particularly noteworthy. Chromosomal organization has remained broadly conserved since the Cretaceous period, when butterflies split from the Bombyx (silkmoth) lineage. Using genomic resequencing, we show hybrid exchange of genes between three co-mimics, Heliconius melpomene, Heliconius timareta and Heliconius elevatus, especially at two genomic regions that control mimicry pattern. We infer that closely related Heliconius species exchange protective colour-pattern genes promiscuously, implying that hybridization has an important role in adaptive radiation.

  11. The spotted gar genome illuminates vertebrate evolution and facilitates human-to-teleost comparisons

    PubMed Central

    Braasch, Ingo; Gehrke, Andrew R.; Smith, Jeramiah J.; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M.; Campbell, Michael S.; Barrell, Daniel; Martin, Kyle J.; Mulley, John F.; Ravi, Vydianathan; Lee, Alison P.; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E. G.; Sun, Yi; Hertel, Jana; Beam, Michael J.; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H.; Litman, Gary W.; Litman, Ronda T.; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F.; Wang, Han; Taylor, John S.; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M. J.; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A.; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T.; Venkatesh, Byrappa; Holland, Peter W. H.; Guiguen, Yann; Bobe, Julien; Shubin, Neil H.; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H.

    2016-01-01

    To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before the teleost genome duplication (TGD). The slowly evolving gar genome conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization, and development (e.g., Hox, ParaHox, and miRNA genes). Numerous conserved non-coding elements (CNEs, often cis-regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles of such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses revealed that the sum of expression domains and levels from duplicated teleost genes often approximate patterns and levels of gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes, and the function of human regulatory sequences. PMID:26950095

  12. Comparative genomic paleontology across plant kingdom reveals the dynamics of TE-driven genome evolution.

    PubMed

    El Baidouri, Moaine; Panaud, Olivier

    2013-01-01

    Long terminal repeat-retrotransposons (LTR-RTs) are the most abundant class of transposable elements (TEs) in plants. They strongly impact the structure, function, and evolution of their host genome, and, in particular, their role in genome size variation has been clearly established. However, the dynamics of the process through which LTR-RTs have differentially shaped plant genomes is still poorly understood because of a lack of comparative studies. Using a new robust and automated family classification procedure, we exhaustively characterized the LTR-RTs in eight plant genomes for which a high-quality sequence is available (i.e., Arabidopsis thaliana, A. lyrata, grapevine, soybean, rice, Brachypodium dystachion, sorghum, and maize). This allowed us to perform a comparative genome-wide study of the retrotranspositional landscape in these eight plant lineages from both monocots and dicots. We show that retrotransposition has recurrently occurred in all plant genomes investigated, regardless their size, and through bursts, rather than a continuous process. Moreover, in each genome, only one or few LTR-RT families have been active in the recent past, and the difference in genome size among the species studied could thus mostly be accounted for by the extent of the latest transpositional burst(s). Following these bursts, LTR-RTs are efficiently eliminated from their host genomes through recombination and deletion, but we show that the removal rate is not lineage specific. These new findings lead us to propose a new model of TE-driven genome evolution in plants.

  13. Mitogenomes from The 1000 Genome Project Reveal New Near Eastern Features in Present-Day Tuscans

    PubMed Central

    Pardo-Seco, Jacobo; Amigo, Jorge; Martinón-Torres, Federico

    2015-01-01

    Background Genetic analyses have recently been carried out on present-day Tuscans (Central Italy) in order to investigate their presumable recent Near East ancestry in connection with the long-standing debate on the origins of the Etruscan civilization. We retrieved mitogenomes and genome-wide SNP data from 110 Tuscans analyzed within the context of The 1000 Genome Project. For phylogeographic and evolutionary analysis we made use of a large worldwide database of entire mitogenomes (>26,000) and partial control region sequences (>180,000). Results Different analyses reveal the presence of typical Near East haplotypes in Tuscans representing isolated members of various mtDNA phylogenetic branches. As a whole, the Near East component in Tuscan mitogenomes can be estimated at about 8%; a proportion that is comparable to previous estimates but significantly lower than admixture estimates obtained from autosomal SNP data (21%). Phylogeographic and evolutionary inter-population comparisons indicate that the main signal of Near Eastern Tuscan mitogenomes comes from Iran. Conclusions Mitogenomes of recent Near East origin in present-day Tuscans do not show local or regional variation. This points to a demographic scenario that is compatible with a recent arrival of Near Easterners to this region in Italy with no founder events or bottlenecks. PMID:25786119

  14. Yeast genome-wide screen reveals dissimilar sets of host genes affecting replication of RNA viruses

    PubMed Central

    Panavas, Tadas; Serviene, Elena; Brasher, Jeremy; Nagy, Peter D.

    2005-01-01

    Viruses are devastating pathogens of humans, animals, and plants. To further our understanding of how viruses use the resources of infected cells, we systematically tested the yeast single-gene-knockout library for the effect of each host gene on the replication of tomato bushy stunt virus (TBSV), a positive-strand RNA virus of plants. The genome-wide screen identified 96 host genes whose absence either reduced or increased the accumulation of the TBSV replicon. The identified genes are involved in the metabolism of nucleic acids, lipids, proteins, and other compounds and in protein targeting/transport. Comparison with published genome-wide screens reveals that the replication of TBSV and brome mosaic virus (BMV), which belongs to a different supergroup among plus-strand RNA viruses, is affected by vastly different yeast genes. Moreover, a set of yeast genes involved in vacuolar targeting of proteins and vesicle-mediated transport both affected replication of the TBSV replicon and enhanced the cytotoxicity of the Parkinson's disease-related α-synuclein when this protein was expressed in yeast. In addition, a set of host genes involved in ubiquitin-dependent protein catabolism affected both TBSV replication and the cytotoxicity of a mutant huntingtin protein, a candidate agent in Huntington's disease. This finding suggests that virus infection and disease-causing proteins might use or alter similar host pathways and may suggest connections between chronic diseases and prior virus infection. PMID:15883361

  15. Upper Palaeolithic genomes reveal deep roots of modern Eurasians.

    PubMed

    Jones, Eppie R; Gonzalez-Fortes, Gloria; Connell, Sarah; Siska, Veronika; Eriksson, Anders; Martiniano, Rui; McLaughlin, Russell L; Gallego Llorente, Marcos; Cassidy, Lara M; Gamba, Cristina; Meshveliani, Tengiz; Bar-Yosef, Ofer; Müller, Werner; Belfer-Cohen, Anna; Matskevich, Zinovi; Jakeli, Nino; Higham, Thomas F G; Currat, Mathias; Lordkipanidze, David; Hofreiter, Michael; Manica, Andrea; Pinhasi, Ron; Bradley, Daniel G

    2015-11-16

    We extend the scope of European palaeogenomics by sequencing the genomes of Late Upper Palaeolithic (13,300 years old, 1.4-fold coverage) and Mesolithic (9,700 years old, 15.4-fold) males from western Georgia in the Caucasus and a Late Upper Palaeolithic (13,700 years old, 9.5-fold) male from Switzerland. While we detect Late Palaeolithic-Mesolithic genomic continuity in both regions, we find that Caucasus hunter-gatherers (CHG) belong to a distinct ancient clade that split from western hunter-gatherers ∼45 kya, shortly after the expansion of anatomically modern humans into Europe and from the ancestors of Neolithic farmers ∼25 kya, around the Last Glacial Maximum. CHG genomes significantly contributed to the Yamnaya steppe herders who migrated into Europe ∼3,000 BC, supporting a formative Caucasus influence on this important Early Bronze age culture. CHG left their imprint on modern populations from the Caucasus and also central and south Asia possibly marking the arrival of Indo-Aryan languages.

  16. Upper Palaeolithic genomes reveal deep roots of modern Eurasians

    PubMed Central

    Jones, Eppie R.; Gonzalez-Fortes, Gloria; Connell, Sarah; Siska, Veronika; Eriksson, Anders; Martiniano, Rui; McLaughlin, Russell L.; Gallego Llorente, Marcos; Cassidy, Lara M.; Gamba, Cristina; Meshveliani, Tengiz; Bar-Yosef, Ofer; Müller, Werner; Belfer-Cohen, Anna; Matskevich, Zinovi; Jakeli, Nino; Higham, Thomas F. G.; Currat, Mathias; Lordkipanidze, David; Hofreiter, Michael; Manica, Andrea; Pinhasi, Ron; Bradley, Daniel G.

    2015-01-01

    We extend the scope of European palaeogenomics by sequencing the genomes of Late Upper Palaeolithic (13,300 years old, 1.4-fold coverage) and Mesolithic (9,700 years old, 15.4-fold) males from western Georgia in the Caucasus and a Late Upper Palaeolithic (13,700 years old, 9.5-fold) male from Switzerland. While we detect Late Palaeolithic–Mesolithic genomic continuity in both regions, we find that Caucasus hunter-gatherers (CHG) belong to a distinct ancient clade that split from western hunter-gatherers ∼45 kya, shortly after the expansion of anatomically modern humans into Europe and from the ancestors of Neolithic farmers ∼25 kya, around the Last Glacial Maximum. CHG genomes significantly contributed to the Yamnaya steppe herders who migrated into Europe ∼3,000 BC, supporting a formative Caucasus influence on this important Early Bronze age culture. CHG left their imprint on modern populations from the Caucasus and also central and south Asia possibly marking the arrival of Indo-Aryan languages. PMID:26567969

  17. Genome Analysis of the Fruiting Body-Forming Myxobacterium Chondromyces crocatus Reveals High Potential for Natural Product Biosynthesis

    PubMed Central

    Zaburannyi, Nestor; Bunk, Boyke; Maier, Josef; Overmann, Jörg

    2016-01-01

    Here, we report the complete genome sequence of the type strain of the myxobacterial genus Chondromyces, Chondromyces crocatus Cm c5. It presents one of the largest prokaryotic genomes featuring a single circular chromosome and no plasmids. Analysis revealed an enlarged set of tRNA genes, along with reduced pressure on preferred codon usage compared to that of other bacterial genomes. The large coding capacity and the plethora of encoded secondary metabolite biosynthetic gene clusters are in line with the capability of Cm c5 to produce an arsenal of antibacterial, antifungal, and cytotoxic compounds. Known pathways of the ajudazol, chondramide, chondrochloren, crocacin, crocapeptin, and thuggacin compound families are complemented by many more natural compound biosynthetic gene clusters in the chromosome. Whole-genome comparison of the fruiting-body-forming type strain (Cm c5, DSM 14714) to an accustomed laboratory strain which has lost this ability (nonfruiting phenotype, Cm c5 fr−) revealed genetic changes in three loci. In addition to the low synteny found with the closest sequenced representative of the same family, Sorangium cellulosum, extensive genetic information duplication and broad application of eukaryotic-type signal transduction systems are hallmarks of this 11.3-Mbp prokaryotic genome. PMID:26773087

  18. Analysis of Human mRNAs With the Reference Genome Sequence Reveals Potential Errors, Polymorphisms, and RNA Editing

    PubMed Central

    Furey, Terrence S.; Diekhans, Mark; Lu, Yontao; Graves, Tina A.; Oddy, Lachlan; Randall-Maher, Jennifer; Hillier, LaDeana W.; Wilson, Richard K.; Haussler, David

    2004-01-01

    The NCBI Reference Sequence (RefSeq) project and the NIH Mammalian Gene Collection (MGC) together define a set of ∼30,000 nonredundant human mRNA sequences with identified coding regions representing 17,000 distinct loci. These high-quality mRNA sequences allow for the identification of transcribed regions in the human genome sequence, and many researchers accept them as the correct representation of each defined gene sequence. Computational comparison of these mRNA sequences and the recently published essentially finished human genome sequence reveals several thousand undocumented nonsynonymous substitution and frame shift discrepancies between the two resources. Additional analysis is undertaken to verify that the euchromatic human genome is sufficiently complete—containing nearly the whole mRNA collection, thus allowing for a comprehensive analysis to be undertaken. Many of the discrepancies will prove to be genuine polymorphisms in the human population, somatic cell genomic variants, or examples of RNA editing. It is observed that the genome sequence variant has significant additional support from other mRNAs and ESTs, almost four times more often than does the mRNA variant, suggesting that the genome sequence is more accurate. In ∼15% of these cases, there is substantial support for both variants, suggestive of an undocumented polymorphism. An initial screening against a 24-individual genomic DNA diversity panel verified 60% of a small set of potential single nucleotide polymorphisms from which successful results could be obtained. We also find statistical evidence that a few of these discrepancies are due to RNA editing. Overall, these results suggest that the mRNA collections may contain a substantial number of errors. For current and future mRNA collections, it may be prudent to fully reconcile each genome sequence discrepancy, classifying each as a polymorphism, site of RNA editing or somatic cell variation, or genome sequence error. PMID:15489323

  19. Evolutionary comparison reveals that diverging CTCF sites are signatures of ancestral topological associating domains borders

    PubMed Central

    Gómez-Marín, Carlos; Tena, Juan J.; Acemel, Rafael D.; López-Mayorga, Macarena; Naranjo, Silvia; de la Calle-Mustienes, Elisa; Maeso, Ignacio; Beccari, Leonardo; Aneas, Ivy; Vielmas, Erika; Bovolenta, Paola; Nobrega, Marcelo A.; Carvajal, Jaime; Gómez-Skarmeta, José Luis

    2015-01-01

    Increasing evidence in the last years indicates that the vast amount of regulatory information contained in mammalian genomes is organized in precise 3D chromatin structures. However, the impact of this spatial chromatin organization on gene expression and its degree of evolutionary conservation is still poorly understood. The Six homeobox genes are essential developmental regulators organized in gene clusters conserved during evolution. Here, we reveal that the Six clusters share a deeply evolutionarily conserved 3D chromatin organization that predates the Cambrian explosion. This chromatin architecture generates two largely independent regulatory landscapes (RLs) contained in two adjacent topological associating domains (TADs). By disrupting the conserved TAD border in one of the zebrafish Six clusters, we demonstrate that this border is critical for preventing competition between promoters and enhancers located in separated RLs, thereby generating different expression patterns in genes located in close genomic proximity. Moreover, evolutionary comparison of Six-associated TAD borders reveals the presence of CCCTC-binding factor (CTCF) sites with diverging orientations in all studied deuterostomes. Genome-wide examination of mammalian HiC data reveals that this conserved CTCF configuration is a general signature of TAD borders, underscoring that common organizational principles underlie TAD compartmentalization in deuterostome evolution. PMID:26034287

  20. The genome of Romanomermis culicivorax: revealing fundamental changes in the core developmental genetic toolkit in Nematoda

    PubMed Central

    2013-01-01

    Background The genetics of development in the nematode Caenorhabditis elegans has been described in exquisite detail. The phylum Nematoda has two classes: Chromadorea (which includes C. elegans) and the Enoplea. While the development of many chromadorean species resembles closely that of C. elegans, enoplean nematodes show markedly different patterns of early cell division and cell fate assignment. Embryogenesis of the enoplean Romanomermis culicivorax has been studied in detail, but the genetic circuitry underpinning development in this species has not been explored. Results We generated a draft genome for R. culicivorax and compared its gene content with that of C. elegans, a second enoplean, the vertebrate parasite Trichinella spiralis, and a representative arthropod, Tribolium castaneum. This comparison revealed that R. culicivorax has retained components of the conserved ecdysozoan developmental gene toolkit lost in C. elegans. T. spiralis has independently lost even more of this toolkit than has C. elegans. However, the C. elegans toolkit is not simply depauperate, as many novel genes essential for embryogenesis in C. elegans are not found in, or have only extremely divergent homologues in R. culicivorax and T. spiralis. Our data imply fundamental differences in the genetic programmes not only for early cell specification but also others such as vulva formation and sex determination. Conclusions Despite the apparent morphological conservatism, major differences in the molecular logic of development have evolved within the phylum Nematoda. R. culicivorax serves as a tractable system to contrast C. elegans and understand how divergent genomic and thus regulatory backgrounds nevertheless generate a conserved phenotype. The R. culicivorax draft genome will promote use of this species as a research model. PMID:24373391

  1. Comparison of the genomes of human and mouse lays the foundation of genome zoology.

    PubMed

    Emes, Richard D; Goodstadt, Leo; Winter, Eitan E; Ponting, Chris P

    2003-04-01

    The extensive similarities between the genomes of human and model organisms are the foundation of much of modern biology, with model organism experimentation permitting valuable insights into biological function and the aetiology of human disease. In contrast, differences among genomes have received less attention. Yet these can be expected to govern the physiological and morphological distinctions apparent among species, especially if such differences are the result of evolutionary adaptation. A recent comparison of the draft sequences of mouse and human genomes has shed light on the selective forces that have predominated in their recent evolutionary histories. In particular, mouse-specific clusters of homologues associated with roles in reproduction, immunity and host defence appear to be under diversifying positive selective pressure, as indicated by high ratios of non-synonymous to synonymous substitution rates. These clusters are also frequently punctuated by homologous pseudogenes. They thus have experienced numerous gene death, as well as gene birth, events. These regions appear, therefore, to have borne the brunt of adaptive evolution that underlies physiological and behavioural innovation in mice. We predict that the availability of numerous animal genomes will give rise to a new field of genome zoology in which differences in animal physiology and ethology are illuminated by the study of genomic sequence variations.

  2. Genomic comparison of sporeforming bacilli isolated from milk

    PubMed Central

    2014-01-01

    Background Sporeformers in the order Bacillales are important contributors to spoilage of pasteurized milk. While only a few Bacillus and Viridibacillus strains can grow in milk at 6°C, the majority of Paenibacillus isolated from pasteurized fluid milk can grow under these conditions. To gain a better understanding of genomic features of these important spoilage organisms and to identify candidate genomic features that may facilitate cold growth in milk, we performed a comparative genomic analysis of selected dairy associated sporeformers representing isolates that can and cannot grow in milk at 6°C. Results The genomes for seven Paenibacillus spp., two Bacillus spp., and one Viridibacillus sp. isolates were sequenced. Across the genomes sequenced, we identified numerous genes encoding antimicrobial resistance mechanisms, bacteriocins, and pathways for synthesis of non-ribosomal peptide antibiotics. Phylogenetic analysis placed genomes representing Bacillus, Paenibacillus and Viridibacillus into three distinct well supported clades and further classified the Paenibacillus strains characterized here into three distinct clades, including (i) clade I, which contains one strain able to grow at 6°C in skim milk broth and one strain not able to grow under these conditions, (ii) clade II, which contains three strains able to grow at 6°C in skim milk broth, and (iii) clade III, which contains two strains unable to grow under these conditions. While all Paenibacillus genomes were found to include multiple copies of genes encoding β-galactosidases, clade II strains showed significantly higher numbers of genes encoding these enzymes as compared to clade III strains. Genome comparison of strains able to grow at 6°C and strains unable to grow at this temperature identified numerous genes encoding features that might facilitate the growth of Paenibacillus in milk at 6°C, including peptidases with cold-adapted features (flexibility and disorder regions in the protein

  3. High resolution genetic mapping by genome sequencing reveals genome duplication and tetraploid genetic structure of the diploid Miscanthus sinensis.

    PubMed

    Ma, Xue-Feng; Jensen, Elaine; Alexandrov, Nickolai; Troukhan, Maxim; Zhang, Liping; Thomas-Jones, Sian; Farrar, Kerrie; Clifton-Brown, John; Donnison, Iain; Swaller, Timothy; Flavell, Richard

    2012-01-01

    We have created a high-resolution linkage map of Miscanthus sinensis, using genotyping-by-sequencing (GBS), identifying all 19 linkage groups for the first time. The result is technically significant since Miscanthus has a very large and highly heterozygous genome, but has no or limited genomics information to date. The composite linkage map containing markers from both parental linkage maps is composed of 3,745 SNP markers spanning 2,396 cM on 19 linkage groups with a 0.64 cM average resolution. Comparative genomics analyses of the M. sinensis composite linkage map to the genomes of sorghum, maize, rice, and Brachypodium distachyon indicate that sorghum has the closest syntenic relationship to Miscanthus compared to other species. The comparative results revealed that each pair of the 19 M. sinensis linkages aligned to one sorghum chromosome, except for LG8, which mapped to two sorghum chromosomes (4 and 7), presumably due to a chromosome fusion event after genome duplication. The data also revealed several other chromosome rearrangements relative to sorghum, including two telomere-centromere inversions of the sorghum syntenic chromosome 7 in LG8 of M. sinensis and two paracentric inversions of sorghum syntenic chromosome 4 in LG7 and LG8 of M. sinensis. The results clearly demonstrate, for the first time, that the diploid M. sinensis is tetraploid origin consisting of two sub-genomes. This complete and high resolution composite linkage map will not only serve as a useful resource for novel QTL discoveries, but also enable informed deployment of the wealth of existing genomics resources of other species to the improvement of Miscanthus as a high biomass energy crop. In addition, it has utility as a reference for genome sequence assembly for the forthcoming whole genome sequencing of the Miscanthus genus.

  4. Genomic comparison of virulent Rickettsia rickettsii Sheila Smith and avirulent Rickettsia rickettsii Iowa.

    PubMed

    Ellison, Damon W; Clark, Tina R; Sturdevant, Daniel E; Virtaneva, Kimmo; Porcella, Stephen F; Hackstadt, Ted

    2008-02-01

    Rickettsia rickettsii is an obligate intracellular pathogen that is the causative agent of Rocky Mountain spotted fever. To identify genes involved in the virulence of R. rickettsii, the genome of an avirulent strain, R. rickettsii Iowa, was sequenced and compared to the genome of the virulent strain R. rickettsii Sheila Smith. R. rickettsii Iowa is avirulent in a guinea pig model of infection and displays altered plaque morphology with decreased lysis of infected host cells. Comparison of the two genomes revealed that R. rickettsii Iowa and R. rickettsii Sheila Smith share a high degree of sequence identity. A whole-genome alignment comparing R. rickettsii Iowa to R. rickettsii Sheila Smith revealed a total of 143 deletions for the two strains. A subsequent single-nucleotide polymorphism (SNP) analysis comparing Iowa to Sheila Smith revealed 492 SNPs for the two genomes. One of the deletions in R. rickettsii Iowa truncates rompA, encoding a major surface antigen (rickettsial outer membrane protein A [rOmpA]) and member of the autotransporter family, 660 bp from the start of translation. Immunoblotting and immunofluorescence confirmed the absence of rOmpA from R. rickettsii Iowa. In addition, R. rickettsii Iowa is defective in the processing of rOmpB, an autotransporter and also a major surface antigen of spotted fever group rickettsiae. Disruption of rompA and the defect in rOmpB processing are most likely factors that contribute to the avirulence of R. rickettsii Iowa. Genomic differences between the two strains do not significantly alter gene expression as analysis of microarrays revealed only four differences in gene expression between R. rickettsii Iowa and R. rickettsii strain R. Although R. rickettsii Iowa does not cause apparent disease, infection of guinea pigs with this strain confers protection against subsequent challenge with the virulent strain R. rickettsii Sheila Smith.

  5. Mitochondrial Genome Analysis of Wild Rice (Oryza minuta) and Its Comparison with Other Related Species

    PubMed Central

    Asaf, Sajjad; Khan, Abdul Latif; Khan, Abdur Rahim; Waqas, Muhammad; Kang, Sang-Mo; Khan, Muhammad Aaqil; Shahzad, Raheem; Seo, Chang-Woo; Shin, Jae-Ho; Lee, In-Jung

    2016-01-01

    Oryza minuta (Poaceae family) is a tetraploid wild relative of cultivated rice with a BBCC genome. O. minuta has the potential to resist against various pathogenic diseases such as bacterial blight (BB), white backed planthopper (WBPH) and brown plant hopper (BPH). Here, we sequenced and annotated the complete mitochondrial genome of O. minuta. The mtDNA genome is 515,022 bp, containing 60 protein coding genes, 31 tRNA genes and two rRNA genes. The mitochondrial genome organization and the gene content at the nucleotide level are highly similar (89%) to that of O. rufipogon. Comparison with other related species revealed that most of the genes with known function are conserved among the Poaceae members. Similarly, O. minuta mt genome shared 24 protein-coding genes, 15 tRNA genes and 1 ribosomal RNA gene with other rice species (indica and japonica). The evolutionary relationship and phylogenetic analysis revealed that O. minuta is more closely related to O. rufipogon than to any other related species. Such studies are essential to understand the evolutionary divergence among species and analyze common gene pools to combat risks in the current scenario of a changing environment. PMID:27045847

  6. Genomic Characterization of Methanomicrobiales Reveals Three Classes of Methanogens

    SciTech Connect

    Anderson, Iain; Ulrich, Luke; Lupa, Boguslaw; Susanti, Dwi; Porat, I.; Hooper, Sean; Lykidis, A; Sieprawska-Lupa, Magdalena; Dharmarajan, Lakshmi; Goltsman, Eugene; Lapidus, Alla L.; Saunders, Elizabeth H; Han, Cliff; Land, Miriam L; Lucas, Susan; Mukhopadhyay, Biswarup; Whitman, William; Woese, Carl; Bristow, James; Kyrpides, Nikos C

    2009-01-01

    Background Methanomicrobiales is the least studied order of methanogens. While these organisms appear to be more closely related to the Methanosarcinales in ribosomal-based phylogenetic analyses, they are metabolically more similar to Class I methanogens. Methodology/Principal Findings In order to improve our understanding of this lineage, we have completely sequenced the genomes of two members of this order, Methanocorpusculum labreanum Z and Methanoculleus marisnigri JR1, and compared them with the genome of a third, Methanospirillum hungatei JF-1. Similar to Class I methanogens, Methanomicrobiales use a partial reductive citric acid cycle for 2-oxoglutarate biosynthesis, and they have the Eha energy-converting hydrogenase. In common with Methanosarcinales, Methanomicrobiales possess the Ech hydrogenase and at least some of them may couple formylmethanofuran formation and heterodisulfide reduction to transmembrane ion gradients. Uniquely, M. labreanum and M. hungatei contain hydrogenases similar to the Pyrococcus furiosus Mbh hydrogenase, and all three Methanomicrobiales have anti-sigma factor and anti-anti-sigma factor regulatory proteins not found in other methanogens. Phylogenetic analysis based on seven core proteins of methanogenesis and cofactor biosynthesis places the Methanomicrobiales equidistant from Class I methanogens and Methanosarcinales. Conclusions/Significance Our results indicate that Methanomicrobiales, rather than being similar to Class I methanogens or Methanomicrobiales, share some features of both and have some unique properties. We find that there are three distinct classes of methanogens: the Class I methanogens, the Methanomicrobiales (Class II), and the Methanosarcinales (Class III).

  7. Genomic Characterization of Methanomicrobiales Reveals Three Classes of Methanogens

    SciTech Connect

    Anderson, Iain; Ulrich, Luke E.; Lupa, Boguslaw; Susanti, Dwi; Porat, Iris; Hooper, Sean D.; Lykidis, Athanasios; Sieprawska-Lupa, Magdalena; Dharmarajan, Lakshmi; Goltsman, Eugene; Lapidus, Alla; Saunders, Elizabeth; Han, Cliff; Land, Miriam; Lucas, Susan; Mukhopadhyay, Biswarup; Whitman, William B.; Woese, Carl; Bristow, James; Kyrpides, Nikos

    2009-05-01

    Methanomicrobiales is the least studied order of methanogens. While these organisms appear to be more closely related to the Methanosarcinales in ribosomal-based phylogenetic analyses, they are metabolically more similar to Class I methanogens. In order to improve our understanding of this lineage, we have completely sequenced the genomes of two members of this order, Methanocorpusculum labreanum Z and Methanoculleus marisnigri JR1, and compared them with the genome of a third, Methanospirillum hungatei JF-1. Similar to Class I methanogens, Methanomicrobiales use a partial reductive citric acid cycle for 2-oxoglutarate biosynthesis, and they have the Eha energy-converting hydrogenase. In common with Methanosarcinales, Methanomicrobiales possess the Ech hydrogenase and at least some of them may couple formylmethanofuran formation and heterodisulfide reduction to transmembrane ion gradients. Uniquely, M. labreanum and M. hungatei contain hydrogenases similar to the Pyrococcus furiosus Mbh hydrogenase, and all three Methanomicrobiales have anti-sigma factor and anti-anti-sigma factor regulatory proteins not found in other methanogens. Phylogenetic analysis based on seven core proteins of methanogenesis and cofactor biosynthesis places the Methanomicrobiales equidistant from Class I methanogens and Methanosarcinales. Our results indicate that Methanomicrobiales, rather than being similar to Class I methanogens or Methanomicrobiales, share some features of both and have some unique properties. We find that there are three distinct classes of methanogens: the Class I methanogens, the Methanomicrobiales (Class II), and the Methanosarcinales (Class III).

  8. High-resolution genomic profiling of chronic lymphocytic leukemia reveals new recurrent genomic alterations.

    PubMed

    Edelmann, Jennifer; Holzmann, Karlheinz; Miller, Florian; Winkler, Dirk; Bühler, Andreas; Zenz, Thorsten; Bullinger, Lars; Kühn, Michael W M; Gerhardinger, Andreas; Bloehdorn, Johannes; Radtke, Ina; Su, Xiaoping; Ma, Jing; Pounds, Stanley; Hallek, Michael; Lichter, Peter; Korbel, Jan; Busch, Raymonde; Mertens, Daniel; Downing, James R; Stilgenbauer, Stephan; Döhner, Hartmut

    2012-12-06

    To identify genomic alterations in chronic lymphocytic leukemia (CLL), we performed single-nucleotide polymorphism-array analysis using Affymetrix Version 6.0 on 353 samples from untreated patients entered in the CLL8 treatment trial. Based on paired-sample analysis (n = 144), a mean of 1.8 copy number alterations per patient were identified; approximately 60% of patients carried no copy number alterations other than those detected by fluorescence in situ hybridization analysis. Copy-neutral loss-of-heterozygosity was detected in 6% of CLL patients and was found most frequently on 13q, 17p, and 11q. Minimally deleted regions were refined on 13q14 (deleted in 61% of patients) to the DLEU1 and DLEU2 genes, on 11q22.3 (27% of patients) to ATM, on 2p16.1-2p15 (gained in 7% of patients) to a 1.9-Mb fragment containing 9 genes, and on 8q24.21 (5% of patients) to a segment 486 kb proximal to the MYC locus. 13q deletions exhibited proximal and distal breakpoint cluster regions. Among the most common novel lesions were deletions at 15q15.1 (4% of patients), with the smallest deletion (70.48 kb) found in the MGA locus. Sequence analysis of MGA in 59 samples revealed a truncating mutation in one CLL patient lacking a 15q deletion. MNT at 17p13.3, which in addition to MGA and MYC encodes for the network of MAX-interacting proteins, was also deleted recurrently.

  9. Genome sequencing and analysis reveals possible determinants of Staphylococcus aureus nasal carriage

    PubMed Central

    Sivaraman, Karthikeyan; Venkataraman, Nitya; Tsai, Jennifer; Dewell, Scott; Cole, Alexander M

    2008-01-01

    Background Nasal carriage of Staphylococcus aureus is a major risk factor in clinical and community settings due to the range of etiologies caused by the organism. We have identified unique immunological and ultrastructural properties associated with nasal carriage isolates denoting a role for bacterial factors in nasal carriage. However, despite extensive molecular level characterizations by several groups suggesting factors necessary for colonization on nasal epithelium, genetic determinants of nasal carriage are unknown. Herein, we have set a genomic foundation for unraveling the bacterial determinants of nasal carriage in S. aureus. Results MLST analysis revealed no lineage specific differences between carrier and non-carrier strains suggesting a role for mobile genetic elements. We completely sequenced a model carrier isolate (D30) and a model non-carrier strain (930918-3) to identify differential gene content. Comparison revealed the presence of 84 genes unique to the carrier strain and strongly suggests a role for Type VII secretion systems in nasal carriage. These genes, along with a putative pathogenicity island (SaPIBov) present uniquely in the carrier strains are likely important in affecting carriage. Further, PCR-based genotyping of other clinical isolates for a specific subset of these 84 genes raise the possibility of nasal carriage being caused by multiple gene sets. Conclusion Our data suggest that carriage is likely a heterogeneic phenotypic trait and implies a role for nucleotide level polymorphism in carriage. Complete genome level analyses of multiple carriage strains of S. aureus will be important in clarifying molecular determinants of S. aureus nasal carriage. PMID:18808706

  10. Comparative Genomics Analysis of Streptomyces Species Reveals Their Adaptation to the Marine Environment and Their Diversity at the Genomic Level

    PubMed Central

    Tian, Xinpeng; Zhang, Zhewen; Yang, Tingting; Chen, Meili; Li, Jie; Chen, Fei; Yang, Jin; Li, Wenjie; Zhang, Bing; Zhang, Zhang; Wu, Jiayan; Zhang, Changsheng; Long, Lijuan; Xiao, Jingfa

    2016-01-01

    Over 200 genomes of streptomycete strains that were isolated from various environments are available from the NCBI. However, little is known about the characteristics that are linked to marine adaptation in marine-derived streptomycetes. The particularity and complexity of the marine environment suggest that marine streptomycetes are genetically diverse. Here, we sequenced nine strains from the Streptomyces genus that were isolated from different longitudes, latitudes, and depths of the South China Sea. Then we compared these strains to 22 NCBI downloaded streptomycete strains. Thirty-one streptomycete strains are clearly grouped into a marine-derived subgroup and multiple source subgroup-based phylogenetic tree. The phylogenetic analyses have revealed the dynamic process underlying streptomycete genome evolution, and lateral gene transfer is an important driving force during the process. Pan-genomics analyses have revealed that streptomycetes have an open pan-genome, which reflects the diversity of these streptomycetes and guarantees the species a quick and economical response to diverse environments. Functional and comparative genomics analyses indicate that the marine-derived streptomycetes subgroup possesses some common characteristics of marine adaptation. Our findings have expanded our knowledge of how ocean isolates of streptomycete strains adapt to marine environments. The availability of streptomycete genomes from the South China Sea will be beneficial for further analysis on marine streptomycetes and will enrich the South China Sea’s genetic data sources. PMID:27446038

  11. Comparative Genomics Analysis of Streptomyces Species Reveals Their Adaptation to the Marine Environment and Their Diversity at the Genomic Level.

    PubMed

    Tian, Xinpeng; Zhang, Zhewen; Yang, Tingting; Chen, Meili; Li, Jie; Chen, Fei; Yang, Jin; Li, Wenjie; Zhang, Bing; Zhang, Zhang; Wu, Jiayan; Zhang, Changsheng; Long, Lijuan; Xiao, Jingfa

    2016-01-01

    Over 200 genomes of streptomycete strains that were isolated from various environments are available from the NCBI. However, little is known about the characteristics that are linked to marine adaptation in marine-derived streptomycetes. The particularity and complexity of the marine environment suggest that marine streptomycetes are genetically diverse. Here, we sequenced nine strains from the Streptomyces genus that were isolated from different longitudes, latitudes, and depths of the South China Sea. Then we compared these strains to 22 NCBI downloaded streptomycete strains. Thirty-one streptomycete strains are clearly grouped into a marine-derived subgroup and multiple source subgroup-based phylogenetic tree. The phylogenetic analyses have revealed the dynamic process underlying streptomycete genome evolution, and lateral gene transfer is an important driving force during the process. Pan-genomics analyses have revealed that streptomycetes have an open pan-genome, which reflects the diversity of these streptomycetes and guarantees the species a quick and economical response to diverse environments. Functional and comparative genomics analyses indicate that the marine-derived streptomycetes subgroup possesses some common characteristics of marine adaptation. Our findings have expanded our knowledge of how ocean isolates of streptomycete strains adapt to marine environments. The availability of streptomycete genomes from the South China Sea will be beneficial for further analysis on marine streptomycetes and will enrich the South China Sea's genetic data sources.

  12. Genomic Species Are Ecological Species as Revealed by Comparative Genomics in Agrobacterium tumefaciens

    PubMed Central

    Lassalle, Florent; Campillo, Tony; Vial, Ludovic; Baude, Jessica; Costechareyre, Denis; Chapulliot, David; Shams, Malek; Abrouk, Danis; Lavire, Céline; Oger-Desfeux, Christine; Hommais, Florence; Guéguen, Laurent; Daubin, Vincent; Muller, Daniel; Nesme, Xavier

    2011-01-01

    The definition of bacterial species is based on genomic similarities, giving rise to the operational concept of genomic species, but the reasons of the occurrence of differentiated genomic species remain largely unknown. We used the Agrobacterium tumefaciens species complex and particularly the genomic species presently called genomovar G8, which includes the sequenced strain C58, to test the hypothesis of genomic species having specific ecological adaptations possibly involved in the speciation process. We analyzed the gene repertoire specific to G8 to identify potential adaptive genes. By hybridizing 25 strains of A. tumefaciens on DNA microarrays spanning the C58 genome, we highlighted the presence and absence of genes homologous to C58 in the taxon. We found 196 genes specific to genomovar G8 that were mostly clustered into seven genomic islands on the C58 genome—one on the circular chromosome and six on the linear chromosome—suggesting higher plasticity and a major adaptive role of the latter. Clusters encoded putative functional units, four of which had been verified experimentally. The combination of G8-specific functions defines a hypothetical species primary niche for G8 related to commensal interaction with a host plant. This supports that the G8 ancestor was able to exploit a new ecological niche, maybe initiating ecological isolation and thus speciation. Searching genomic data for synapomorphic traits is a powerful way to describe bacterial species. This procedure allowed us to find such phenotypic traits specific to genomovar G8 and thus propose a Latin binomial, Agrobacterium fabrum, for this bona fide genomic species. PMID:21795751

  13. Genome Sequencing of the Phytoseiid Predatory Mite Metaseiulus occidentalis Reveals Completely Atomized Hox Genes and Superdynamic Intron Evolution

    PubMed Central

    Hoy, Marjorie A.; Waterhouse, Robert M.; Wu, Ke; Estep, Alden S.; Ioannidis, Panagiotis; Palmer, William J.; Pomerantz, Aaron F.; Simão, Felipe A.; Thomas, Jainy; Jiggins, Francis M.; Murphy, Terence D.; Pritham, Ellen J.; Robertson, Hugh M.; Zdobnov, Evgeny M.; Gibbs, Richard A.; Richards, Stephen

    2016-01-01

    Metaseiulus occidentalis is an eyeless phytoseiid predatory mite employed for the biological control of agricultural pests including spider mites. Despite appearances, these predator and prey mites are separated by some 400 Myr of evolution and radically different lifestyles. We present a 152-Mb draft assembly of the M. occidentalis genome: Larger than that of its favored prey, Tetranychus urticae, but considerably smaller than those of many other chelicerates, enabling an extremely contiguous and complete assembly to be built—the best arachnid to date. Aided by transcriptome data, genome annotation cataloged 18,338 protein-coding genes and identified large numbers of Helitron transposable elements. Comparisons with other arthropods revealed a particularly dynamic and turbulent genomic evolutionary history. Its genes exhibit elevated molecular evolution, with strikingly high numbers of intron gains and losses, in stark contrast to the deer tick Ixodes scapularis. Uniquely among examined arthropods, this predatory mite’s Hox genes are completely atomized, dispersed across the genome, and it encodes five copies of the normally single-copy RNA processing Dicer-2 gene. Examining gene families linked to characteristic biological traits of this tiny predator provides initial insights into processes of sex determination, development, immune defense, and how it detects, disables, and digests its prey. As the first reference genome for the Phytoseiidae, and for any species with the rare sex determination system of parahaploidy, the genome of the western orchard predatory mite improves genomic sampling of chelicerates and provides invaluable new resources for functional genomic analyses of this family of agriculturally important mites. PMID:26951779

  14. An Aboriginal Australian genome reveals separate human dispersals into Asia.

    PubMed

    Rasmussen, Morten; Guo, Xiaosen; Wang, Yong; Lohmueller, Kirk E; Rasmussen, Simon; Albrechtsen, Anders; Skotte, Line; Lindgreen, Stinus; Metspalu, Mait; Jombart, Thibaut; Kivisild, Toomas; Zhai, Weiwei; Eriksson, Anders; Manica, Andrea; Orlando, Ludovic; De La Vega, Francisco M; Tridico, Silvana; Metspalu, Ene; Nielsen, Kasper; Ávila-Arcos, María C; Moreno-Mayar, J Víctor; Muller, Craig; Dortch, Joe; Gilbert, M Thomas P; Lund, Ole; Wesolowska, Agata; Karmin, Monika; Weinert, Lucy A; Wang, Bo; Li, Jun; Tai, Shuaishuai; Xiao, Fei; Hanihara, Tsunehiko; van Driem, George; Jha, Aashish R; Ricaut, François-Xavier; de Knijff, Peter; Migliano, Andrea B; Gallego Romero, Irene; Kristiansen, Karsten; Lambert, David M; Brunak, Søren; Forster, Peter; Brinkmann, Bernd; Nehlich, Olaf; Bunce, Michael; Richards, Michael; Gupta, Ramneek; Bustamante, Carlos D; Krogh, Anders; Foley, Robert A; Lahr, Marta M; Balloux, Francois; Sicheritz-Pontén, Thomas; Villems, Richard; Nielsen, Rasmus; Wang, Jun; Willerslev, Eske

    2011-10-07

    We present an Aboriginal Australian genomic sequence obtained from a 100-year-old lock of hair donated by an Aboriginal man from southern Western Australia in the early 20th century. We detect no evidence of European admixture and estimate contamination levels to be below 0.5%. We show that Aboriginal Australians are descendants of an early human dispersal into eastern Asia, possibly 62,000 to 75,000 years ago. This dispersal is separate from the one that gave rise to modern Asians 25,000 to 38,000 years ago. We also find evidence of gene flow between populations of the two dispersal waves prior to the divergence of Native Americans from modern Asian ancestors. Our findings support the hypothesis that present-day Aboriginal Australians descend from the earliest humans to occupy Australia, likely representing one of the oldest continuous populations outside Africa.

  15. Efficient analysis of mouse genome sequences reveal many nonsense variants

    PubMed Central

    Steeland, Sophie; Timmermans, Steven; Van Ryckeghem, Sara; Hulpiau, Paco; Saeys, Yvan; Van Montagu, Marc; Vandenbroucke, Roosmarijn E.; Libert, Claude

    2016-01-01

    Genetic polymorphisms in coding genes play an important role when using mouse inbred strains as research models. They have been shown to influence research results, explain phenotypical differences between inbred strains, and increase the amount of interesting gene variants present in the many available inbred lines. SPRET/Ei is an inbred strain derived from Mus spretus that has ∼1% sequence difference with the C57BL/6J reference genome. We obtained a listing of all SNPs and insertions/deletions (indels) present in SPRET/Ei from the Mouse Genomes Project (Wellcome Trust Sanger Institute) and processed these data to obtain an overview of all transcripts having nonsynonymous coding sequence variants. We identified 8,883 unique variants affecting 10,096 different transcripts from 6,328 protein-coding genes, which is about 28% of all coding genes. Because only a subset of these variants results in drastic changes in proteins, we focused on variations that are nonsense mutations that ultimately resulted in a gain of a stop codon. These genes were identified by in silico changing the C57BL/6J coding sequences to the SPRET/Ei sequences, converting them to amino acid (AA) sequences, and comparing the AA sequences. All variants and transcripts affected were also stored in a database, which can be browsed using a SPRET/Ei M. spretus variants web tool (www.spretus.org), including a manual. We validated the tool by demonstrating the loss of function of three proteins predicted to be severely truncated, namely Fas, IRAK2, and IFNγR1. PMID:27147605

  16. Single-Molecule FISH Reveals Non-selective Packaging of Rift Valley Fever Virus Genome Segments

    PubMed Central

    Wichgers Schreur, Paul J.; Kortekaas, Jeroen

    2016-01-01

    The bunyavirus genome comprises a small (S), medium (M), and large (L) RNA segment of negative polarity. Although genome segmentation confers evolutionary advantages by enabling genome reassortment events with related viruses, genome segmentation also complicates genome replication and packaging. Accumulating evidence suggests that genomes of viruses with eight or more genome segments are incorporated into virions by highly selective processes. Remarkably, little is known about the genome packaging process of the tri-segmented bunyaviruses. Here, we evaluated, by single-molecule RNA fluorescence in situ hybridization (FISH), the intracellular spatio-temporal distribution and replication kinetics of the Rift Valley fever virus (RVFV) genome and determined the segment composition of mature virions. The results reveal that the RVFV genome segments start to replicate near the site of infection before spreading and replicating throughout the cytoplasm followed by translocation to the virion assembly site at the Golgi network. Despite the average intracellular S, M and L genome segments approached a 1:1:1 ratio, major differences in genome segment ratios were observed among cells. We also observed a significant amount of cells lacking evidence of M-segment replication. Analysis of two-segmented replicons and four-segmented viruses subsequently confirmed the previous notion that Golgi recruitment is mediated by the Gn glycoprotein. The absence of colocalization of the different segments in the cytoplasm and the successful rescue of a tri-segmented variant with a codon shuffled M-segment suggested that inter-segment interactions are unlikely to drive the copackaging of the different segments into a single virion. The latter was confirmed by direct visualization of RNPs inside mature virions which showed that the majority of virions lack one or more genome segments. Altogether, this study suggests that RVFV genome packaging is a non-selective process. PMID:27548280

  17. Genome sequence of the basal haplorrhine primate Tarsius syrichta reveals unusual insertions

    PubMed Central

    Schmitz, Jürgen; Noll, Angela; Raabe, Carsten A.; Churakov, Gennady; Voss, Reinhard; Kiefmann, Martin; Rozhdestvensky, Timofey; Brosius, Jürgen; Baertsch, Robert; Clawson, Hiram; Roos, Christian; Zimin, Aleksey; Minx, Patrick; Montague, Michael J.; Wilson, Richard K.; Warren, Wesley C.

    2016-01-01

    Tarsiers are phylogenetically located between the most basal strepsirrhines and the most derived anthropoid primates. While they share morphological features with both groups, they also possess uncommon primate characteristics, rendering their evolutionary history somewhat obscure. To investigate the molecular basis of such attributes, we present here a new genome assembly of the Philippine tarsier (Tarsius syrichta), and provide extended analyses of the genome and detailed history of transposable element insertion events. We describe the silencing of Alu monomers on the lineage leading to anthropoids, and recognize an unexpected abundance of long terminal repeat-derived and LINE1-mobilized transposed elements (Tarsius interspersed elements; TINEs). For the first time in mammals, we identify a complete mitochondrial genome insertion within the nuclear genome, then reveal tarsier-specific, positive gene selection and posit population size changes over time. The genomic resources and analyses presented here will aid efforts to more fully understand the ancient characteristics of primate genomes. PMID:27708261

  18. Evidence-based green algal genomics reveals marine diversity and ancestral characteristics of land plants

    SciTech Connect

    van Baren, Marijke J.; Bachy, Charles; Reistetter, Emily Nahas; Purvine, Samuel O.; Grimwood, Jane; Sudek, Sebastian; Yu, Hang; Poirier, Camille; Deerinck, Thomas J.; Kuo, Alan; Grigoriev, Igor V.; Wong, Chee -Hong; Smith, Richard D.; Callister, Stephen J.; Wei, Chia -Lin; Schmutz, Jeremy; Worden, Alexandra Z.

    2016-03-31

    Prasinophytes are widespread marine green algae that are related to plants. Abundance of the genus Micromonas has reportedly increased in the Arctic due to climate-induced changes. Thus, studies of these organisms are important for marine ecology and understanding Virdiplantae evolution and diversification. We generated evidence-based Micromonas gene models using proteomics and RNA-Seq to improve prasinophyte genomic resources. First, sequences of four chromosomes in the 22 Mb Micromonas pusilla (CCMP1545) genome were finished. Comparison with the finished 21 Mb Micromonas commoda (RCC299) shows they share ≤ 8,142 of ~10,000 protein-encoding genes, depending on the analysis method. Unlike RCC299 and other sequenced eukaryotes, CCMP1545 has two abundant repetitive intron types and a high percent (26%) GC splice donors. Micromonas has more genus-specific protein families (19%) than other genome sequenced prasinophytes (11%). Comparative analyses using predicted proteomes from other prasinophytes reveal proteins likely related to scale formation and ancestral photosynthesis. Our studies also indicate that peptidoglycan (PG) biosynthesis enzymes have been lost in multiple independent events in select prasinophytes and most plants. However, CCMP1545, polar Micromonas CCMP2099 and prasinophytes from other claasses retain the entire PG pathway, like moss and glaucophyte algae. Multiple vascular plants that share a unique bi-domain protein also have the pathway, except the Penicillin-Binding-Protein. Alongside Micromonas experiments using antibiotics that halt bacterial PG biosynthesis, the findings highlight unrecognized phylogenetic complexity in the PG-pathway retention and implicate a role in chloroplast structure of division in several extant Vridiplantae lineages. Extensive differences in gene loss and architecture between related prasinophytes underscore their extensive divergence. PG biosynthesis genes from the

  19. The Streamlined Genome of Phytomonas spp. Relative to Human Pathogenic Kinetoplastids Reveals a Parasite Tailored for Plants

    PubMed Central

    Porcel, Betina M.; Denoeud, France; Opperdoes, Fred; Noel, Benjamin; Madoui, Mohammed-Amine; Hammarton, Tansy C.; Field, Mark C.; Da Silva, Corinne; Couloux, Arnaud; Poulain, Julie; Katinka, Michael; Jabbari, Kamel; Aury, Jean-Marc; Campbell, David A.; Cintron, Roxana; Dickens, Nicholas J.; Docampo, Roberto; Sturm, Nancy R.; Koumandou, V. Lila; Fabre, Sandrine; Flegontov, Pavel; Lukeš, Julius; Michaeli, Shulamit; Mottram, Jeremy C.; Szöőr, Balázs; Zilberstein, Dan; Bringaud, Frédéric; Wincker, Patrick; Dollet, Michel

    2014-01-01

    Members of the family Trypanosomatidae infect many organisms, including animals, plants and humans. Plant-infecting trypanosomes are grouped under the single genus Phytomonas, failing to reflect the wide biological and pathological diversity of these protists. While some Phytomonas spp. multiply in the latex of plants, or in fruit or seeds without apparent pathogenicity, others colonize the phloem sap and afflict plants of substantial economic value, including the coffee tree, coconut and oil palms. Plant trypanosomes have not been studied extensively at the genome level, a major gap in understanding and controlling pathogenesis. We describe the genome sequences of two plant trypanosomatids, one pathogenic isolate from a Guianan coconut and one non-symptomatic isolate from Euphorbia collected in France. Although these parasites have extremely distinct pathogenic impacts, very few genes are unique to either, with the vast majority of genes shared by both isolates. Significantly, both Phytomonas spp. genomes consist essentially of single copy genes for the bulk of their metabolic enzymes, whereas other trypanosomatids e.g. Leishmania and Trypanosoma possess multiple paralogous genes or families. Indeed, comparison with other trypanosomatid genomes revealed a highly streamlined genome, encoding for a minimized metabolic system while conserving the major pathways, and with retention of a full complement of endomembrane organelles, but with no evidence for functional complexity. Identification of the metabolic genes of Phytomonas provides opportunities for establishing in vitro culturing of these fastidious parasites and new tools for the control of agricultural plant disease. PMID:24516393

  20. Phylogeny of Banana Streak Virus reveals recent and repetitive endogenization in the genome of its banana host (Musa sp.).

    PubMed

    Gayral, Philippe; Iskra-Caruana, Marie-Line

    2009-07-01

    Banana streak virus (BSV) is a plant dsDNA pararetrovirus (family Caulimoviridae, genus badnavirus). Although integration is not an essential step in the BSV replication cycle, the nuclear genome of banana (Musa sp.) contains BSV endogenous pararetrovirus sequences (BSV EPRVs). Some BSV EPRVs are infectious by reconstituting a functional viral genome. Recent studies revealed a large molecular diversity of episomal BSV viruses (i.e., nonintegrated) while others focused on BSV EPRV sequences only. In this study, the evolutionary history of badnavirus integration in banana was inferred from phylogenetic relationships between BSV and BSV EPRVs. The relative evolution rates and selective pressures (d(N)/d(S) ratio) were also compared between endogenous and episomal viral sequences. At least 27 recent independent integration events occurred after the divergence of three banana species, indicating that viral integration is a recent and frequent phenomenon. Relaxation of selective pressure on badnaviral sequences that experienced neutral evolution after integration in the plant genome was recorded. Additionally, a significant decrease (35%) in the EPRV evolution rate was observed compared to BSV, reflecting the difference in the evolution rate between episomal dsDNA viruses and plant genome. The comparison of our results with the evolution rate of the Musa genome and other reverse-transcribing viruses suggests that EPRVs play an active role in episomal BSV diversity and evolution.

  1. Pancreatic cancer genomes reveal aberrations in axon guidance pathway genes

    PubMed Central

    Biankin, Andrew V.; Waddell, Nicola; Kassahn, Karin S.; Gingras, Marie-Claude; Muthuswamy, Lakshmi B.; Johns, Amber L.; Miller, David K.; Wilson, Peter J.; Patch, Ann-Marie; Wu, Jianmin; Chang, David K.; Cowley, Mark J.; Gardiner, Brooke B.; Song, Sarah; Harliwong, Ivon; Idrisoglu, Senel; Nourse, Craig; Nourbakhsh, Ehsan; Manning, Suzanne; Wani, Shivangi; Gongora, Milena; Pajic, Marina; Scarlett, Christopher J.; Gill, Anthony J.; Pinho, Andreia V.; Rooman, Ilse; Anderson, Matthew; Holmes, Oliver; Leonard, Conrad; Taylor, Darrin; Wood, Scott; Xu, Qinying; Nones, Katia; Fink, J. Lynn; Christ, Angelika; Bruxner, Tim; Cloonan, Nicole; Kolle, Gabriel; Newell, Felicity; Pinese, Mark; Mead, R. Scott; Humphris, Jeremy L.; Kaplan, Warren; Jones, Marc D.; Colvin, Emily K.; Nagrial, Adnan M.; Humphrey, Emily S.; Chou, Angela; Chin, Venessa T.; Chantrill, Lorraine A.; Mawson, Amanda; Samra, Jaswinder S.; Kench, James G.; Lovell, Jessica A.; Daly, Roger J.; Merrett, Neil D.; Toon, Christopher; Epari, Krishna; Nguyen, Nam Q.; Barbour, Andrew; Zeps, Nikolajs; Kakkar, Nipun; Zhao, Fengmei; Wu, Yuan Qing; Wang, Min; Muzny, Donna M.; Fisher, William E.; Brunicardi, F. Charles; Hodges, Sally E.; Reid, Jeffrey G.; Drummond, Jennifer; Chang, Kyle; Han, Yi; Lewis, Lora R.; Dinh, Huyen; Buhay, Christian J.; Beck, Timothy; Timms, Lee; Sam, Michelle; Begley, Kimberly; Brown, Andrew; Pai, Deepa; Panchal, Ami; Buchner, Nicholas; De Borja, Richard; Denroche, Robert E.; Yung, Christina K.; Serra, Stefano; Onetto, Nicole; Mukhopadhyay, Debabrata; Tsao, Ming-Sound; Shaw, Patricia A.; Petersen, Gloria M.; Gallinger, Steven; Hruban, Ralph H.; Maitra, Anirban; Iacobuzio-Donahue, Christine A.; Schulick, Richard D.; Wolfgang, Christopher L.; Morgan, Richard A.; Lawlor, Rita T.; Capelli, Paola; Corbo, Vincenzo; Scardoni, Maria; Tortora, Giampaolo; Tempero, Margaret A.; Mann, Karen M.; Jenkins, Nancy A.; Perez-Mancera, Pedro A.; Adams, David J.; Largaespada, David A.; Wessels, Lodewyk F. A.; Rust, Alistair G.; Stein, Lincoln D.; Tuveson, David A.; Copeland, Neal G.; Musgrove, Elizabeth A.; Scarpa, Aldo; Eshleman, James R.; Hudson, Thomas J.; Sutherland, Robert L.; Wheeler, David A.; Pearson, John V.; McPherson, John D.; Gibbs, Richard A.; Grimmond, Sean M.

    2012-01-01

    Pancreatic cancer is a highly lethal malignancy with few effective therapies. We performed exome sequencing and copy number analysis to define genomic aberrations in a prospectively accrued clinical cohort (n = 142) of early (stage I and II) sporadic pancreatic ductal adenocarcinoma. Detailed analysis of 99 informative tumours identified substantial heterogeneity with 2,016 non-silent mutations and 1,628 copy-number variations. We define 16 significantly mutated genes, reaffirming known mutations (KRAS, TP53, CDKN2A, SMAD4, MLL3, TGFBR2, ARID1A and SF3B1), and uncover novel mutated genes including additional genes involved in chromatin modification (EPC1 and ARID2), DNA damage repair (ATM) and other mechanisms (ZIM2, MAP2K4, NALCN, SLC16A4 and MAGEA6). Integrative analysis with in vitro functional data and animal models provided supportive evidence for potential roles for these genetic aberrations in carcinogenesis. Pathway-based analysis of recurrently mutated genes recapitulated clustering in core signalling pathways in pancreatic ductal adenocarcinoma, and identified new mutated genes in each pathway. We also identified frequent and diverse somatic aberrations in genes described traditionally as embryonic regulators of axon guidance, particularly SLIT/ROBO signalling, which was also evident in murine Sleeping Beauty transposon-mediated somatic mutagenesis models of pancreatic cancer, providing further supportive evidence for the potential involvement of axon guidance genes in pancreatic carcinogenesis. PMID:23103869

  2. Pancreatic cancer genomes reveal aberrations in axon guidance pathway genes.

    PubMed

    Biankin, Andrew V; Waddell, Nicola; Kassahn, Karin S; Gingras, Marie-Claude; Muthuswamy, Lakshmi B; Johns, Amber L; Miller, David K; Wilson, Peter J; Patch, Ann-Marie; Wu, Jianmin; Chang, David K; Cowley, Mark J; Gardiner, Brooke B; Song, Sarah; Harliwong, Ivon; Idrisoglu, Senel; Nourse, Craig; Nourbakhsh, Ehsan; Manning, Suzanne; Wani, Shivangi; Gongora, Milena; Pajic, Marina; Scarlett, Christopher J; Gill, Anthony J; Pinho, Andreia V; Rooman, Ilse; Anderson, Matthew; Holmes, Oliver; Leonard, Conrad; Taylor, Darrin; Wood, Scott; Xu, Qinying; Nones, Katia; Fink, J Lynn; Christ, Angelika; Bruxner, Tim; Cloonan, Nicole; Kolle, Gabriel; Newell, Felicity; Pinese, Mark; Mead, R Scott; Humphris, Jeremy L; Kaplan, Warren; Jones, Marc D; Colvin, Emily K; Nagrial, Adnan M; Humphrey, Emily S; Chou, Angela; Chin, Venessa T; Chantrill, Lorraine A; Mawson, Amanda; Samra, Jaswinder S; Kench, James G; Lovell, Jessica A; Daly, Roger J; Merrett, Neil D; Toon, Christopher; Epari, Krishna; Nguyen, Nam Q; Barbour, Andrew; Zeps, Nikolajs; Kakkar, Nipun; Zhao, Fengmei; Wu, Yuan Qing; Wang, Min; Muzny, Donna M; Fisher, William E; Brunicardi, F Charles; Hodges, Sally E; Reid, Jeffrey G; Drummond, Jennifer; Chang, Kyle; Han, Yi; Lewis, Lora R; Dinh, Huyen; Buhay, Christian J; Beck, Timothy; Timms, Lee; Sam, Michelle; Begley, Kimberly; Brown, Andrew; Pai, Deepa; Panchal, Ami; Buchner, Nicholas; De Borja, Richard; Denroche, Robert E; Yung, Christina K; Serra, Stefano; Onetto, Nicole; Mukhopadhyay, Debabrata; Tsao, Ming-Sound; Shaw, Patricia A; Petersen, Gloria M; Gallinger, Steven; Hruban, Ralph H; Maitra, Anirban; Iacobuzio-Donahue, Christine A; Schulick, Richard D; Wolfgang, Christopher L; Morgan, Richard A; Lawlor, Rita T; Capelli, Paola; Corbo, Vincenzo; Scardoni, Maria; Tortora, Giampaolo; Tempero, Margaret A; Mann, Karen M; Jenkins, Nancy A; Perez-Mancera, Pedro A; Adams, David J; Largaespada, David A; Wessels, Lodewyk F A; Rust, Alistair G; Stein, Lincoln D; Tuveson, David A; Copeland, Neal G; Musgrove, Elizabeth A; Scarpa, Aldo; Eshleman, James R; Hudson, Thomas J; Sutherland, Robert L; Wheeler, David A; Pearson, John V; McPherson, John D; Gibbs, Richard A; Grimmond, Sean M

    2012-11-15

    Pancreatic cancer is a highly lethal malignancy with few effective therapies. We performed exome sequencing and copy number analysis to define genomic aberrations in a prospectively accrued clinical cohort (n = 142) of early (stage I and II) sporadic pancreatic ductal adenocarcinoma. Detailed analysis of 99 informative tumours identified substantial heterogeneity with 2,016 non-silent mutations and 1,628 copy-number variations. We define 16 significantly mutated genes, reaffirming known mutations (KRAS, TP53, CDKN2A, SMAD4, MLL3, TGFBR2, ARID1A and SF3B1), and uncover novel mutated genes including additional genes involved in chromatin modification (EPC1 and ARID2), DNA damage repair (ATM) and other mechanisms (ZIM2, MAP2K4, NALCN, SLC16A4 and MAGEA6). Integrative analysis with in vitro functional data and animal models provided supportive evidence for potential roles for these genetic aberrations in carcinogenesis. Pathway-based analysis of recurrently mutated genes recapitulated clustering in core signalling pathways in pancreatic ductal adenocarcinoma, and identified new mutated genes in each pathway. We also identified frequent and diverse somatic aberrations in genes described traditionally as embryonic regulators of axon guidance, particularly SLIT/ROBO signalling, which was also evident in murine Sleeping Beauty transposon-mediated somatic mutagenesis models of pancreatic cancer, providing further supportive evidence for the potential involvement of axon guidance genes in pancreatic carcinogenesis.

  3. Genomic analysis of regulatory network dynamics reveals large topological changes

    NASA Astrophysics Data System (ADS)

    Luscombe, Nicholas M.; Madan Babu, M.; Yu, Haiyuan; Snyder, Michael; Teichmann, Sarah A.; Gerstein, Mark

    2004-09-01

    Network analysis has been applied widely, providing a unifying language to describe disparate systems ranging from social interactions to power grids. It has recently been used in molecular biology, but so far the resulting networks have only been analysed statically. Here we present the dynamics of a biological network on a genomic scale, by integrating transcriptional regulatory information and gene-expression data for multiple conditions in Saccharomyces cerevisiae. We develop an approach for the statistical analysis of network dynamics, called SANDY, combining well-known global topological measures, local motifs and newly derived statistics. We uncover large changes in underlying network architecture that are unexpected given current viewpoints and random simulations. In response to diverse stimuli, transcription factors alter their interactions to varying degrees, thereby rewiring the network. A few transcription factors serve as permanent hubs, but most act transiently only during certain conditions. By studying sub-network structures, we show that environmental responses facilitate fast signal propagation (for example, with short regulatory cascades), whereas the cell cycle and sporulation direct temporal progression through multiple stages (for example, with highly inter-connected transcription factors). Indeed, to drive the latter processes forward, phase-specific transcription factors inter-regulate serially, and ubiquitously active transcription factors layer above them in a two-tiered hierarchy. We anticipate that many of the concepts presented here-particularly the large-scale topological changes and hub transience-will apply to other biological networks, including complex sub-systems in higher eukaryotes.

  4. Genome structure and primitive sex chromosome revealed in Populus

    SciTech Connect

    Tuskan, Gerald A; Yin, Tongming; Gunter, Lee E; Blaudez, D

    2008-01-01

    We constructed a comprehensive genetic map for Populus and ordered 332 Mb of sequence scaffolds along the 19 haploid chromosomes in order to compare chromosomal regions among diverse members of the genus. These efforts lead us to conclude that chromosome XIX in Populus is evolving into a sex chromosome. Consistent segregation distortion in favor of the sub-genera Tacamahaca alleles provided evidence of divergent selection among species, particularly at the proximal end of chromosome XIX. A large microsatellite marker (SSR) cluster was detected in the distorted region even though the genome-wide distribute SSR sites was uniform across the physical map. The differences between the genetic map and physical sequence data suggested recombination suppression was occurring in the distorted region. A gender-determination locus and an overabundance of NBS-LRR genes were also co-located to the distorted region and were put forth as the cause for divergent selection and recombination suppression. This hypothesis was verified by using fine-scale mapping of an integrated scaffold in the vicinity of the gender-determination locus. As such it appears that chromosome XIX in Populus is in the process of evolving from an autosome into a sex chromosome and that NBS-LRR genes may play important role in the chromosomal diversification process in Populus.

  5. Genomic analysis of regulatory network dynamics reveals large topological changes.

    PubMed

    Luscombe, Nicholas M; Babu, M Madan; Yu, Haiyuan; Snyder, Michael; Teichmann, Sarah A; Gerstein, Mark

    2004-09-16

    Network analysis has been applied widely, providing a unifying language to describe disparate systems ranging from social interactions to power grids. It has recently been used in molecular biology, but so far the resulting networks have only been analysed statically. Here we present the dynamics of a biological network on a genomic scale, by integrating transcriptional regulatory information and gene-expression data for multiple conditions in Saccharomyces cerevisiae. We develop an approach for the statistical analysis of network dynamics, called SANDY, combining well-known global topological measures, local motifs and newly derived statistics. We uncover large changes in underlying network architecture that are unexpected given current viewpoints and random simulations. In response to diverse stimuli, transcription factors alter their interactions to varying degrees, thereby rewiring the network. A few transcription factors serve as permanent hubs, but most act transiently only during certain conditions. By studying sub-network structures, we show that environmental responses facilitate fast signal propagation (for example, with short regulatory cascades), whereas the cell cycle and sporulation direct temporal progression through multiple stages (for example, with highly inter-connected transcription factors). Indeed, to drive the latter processes forward, phase-specific transcription factors inter-regulate serially, and ubiquitously active transcription factors layer above them in a two-tiered hierarchy. We anticipate that many of the concepts presented here--particularly the large-scale topological changes and hub transience--will apply to other biological networks, including complex sub-systems in higher eukaryotes.

  6. Diversity of Pseudomonas Genomes, Including Populus-Associated Isolates, as Revealed by Comparative Genome Analysis.

    PubMed

    Jun, Se-Ran; Wassenaar, Trudy M; Nookaew, Intawat; Hauser, Loren; Wanchai, Visanu; Land, Miriam; Timm, Collin M; Lu, Tse-Yuan S; Schadt, Christopher W; Doktycz, Mitchel J; Pelletier, Dale A; Ussery, David W

    2015-10-30

    The Pseudomonas genus contains a metabolically versatile group of organisms that are known to occupy numerous ecological niches, including the rhizosphere and endosphere of many plants. Their diversity influences the phylogenetic diversity and heterogeneity of these communities. On the basis of average amino acid identity, comparative genome analysis of >1,000 Pseudomonas genomes, including 21 Pseudomonas strains isolated from the roots of native Populus deltoides (eastern cottonwood) trees resulted in consistent and robust genomic clusters with phylogenetic homogeneity. All Pseudomonas aeruginosa genomes clustered together, and these were clearly distinct from other Pseudomonas species groups on the basis of pangenome and core genome analyses. In contrast, the genomes of Pseudomonas fluorescens were organized into 20 distinct genomic clusters, representing enormous diversity and heterogeneity. Most of our 21 Populus-associated isolates formed three distinct subgroups within the major P. fluorescens group, supported by pathway profile analysis, while two isolates were more closely related to Pseudomonas chlororaphis and Pseudomonas putida. Genes specific to Populus-associated subgroups were identified. Genes specific to subgroup 1 include several sensory systems that act in two-component signal transduction, a TonB-dependent receptor, and a phosphorelay sensor. Genes specific to subgroup 2 contain hypothetical genes, and genes specific to subgroup 3 were annotated with hydrolase activity. This study justifies the need to sequence multiple isolates, especially from P. fluorescens, which displays the most genetic variation, in order to study functional capabilities from a pangenomic perspective. This information will prove useful when choosing Pseudomonas strains for use to promote growth and increase disease resistance in plants.

  7. Comparative Genomics and Metabolic Analysis Reveals Peculiar Characteristics of Rhodococcus opacus Strain M213 Particularly for Naphthalene Degradation

    PubMed Central

    Blom, Jochen; Indest, Karl J.; Jung, Carina M.; Stothard, Paul; Bera, Gopal; Green, Stefan J.; Ogram, Andrew

    2016-01-01

    The genome of Rhodococcus opacus strain M213, isolated from a fuel-oil contaminated soil, was sequenced and annotated which revealed a genome size of 9,194,165 bp encoding 8680 putative genes and a G+C content of 66.72%. Among the protein coding genes, 71.77% were annotated as clusters of orthologous groups of proteins (COGs); 55% of the COGs were present as paralog clusters. Pulsed field gel electrophoresis (PFGE) analysis of M213 revealed the presence of three different sized replicons- a circular chromosome and two megaplasmids (pNUO1 and pNUO2) estimated to be of 750Kb 350Kb in size, respectively. Conversely, using an alternative approach of optical mapping, the plasmid replicons appeared as a circular ~1.2 Mb megaplasmid and a linear, ~0.7 Mb megaplasmid. Genome-wide comparative analysis of M213 with a cohort of sequenced Rhodococcus species revealed low syntenic affiliation with other R. opacus species including strains B4 and PD630. Conversely, a closer affiliation of M213, at the functional (COG) level, was observed with the catabolically versatile R. jostii strain RHA1 and other Rhodococcii such as R. wratislaviensis strain IFP 2016, R. imtechensis strain RKJ300, Rhodococcus sp. strain JVH1, and Rhodococcus sp. strain DK17, respectively. An in-depth, genome-wide comparison between these functional relatives revealed 971 unique genes in M213 representing 11% of its total genome; many associating with catabolic functions. Of major interest was the identification of as many as 154 genomic islands (GEIs), many with duplicated catabolic genes, in particular for PAHs; a trait that was confirmed by PCR-based identification of naphthalene dioxygenase (NDO) as a representative gene, across PFGE-resolved replicons of strain M213. Interestingly, several plasmid/GEI-encoded genes, that likely participate in degrading naphthalene (NAP) via a peculiar pathway, were also identified in strain M213 using a combination of bioinformatics, metabolic analysis and gene

  8. Similarities and differences in the nuclear genome organization within Pooideae species revealed by comparative genomic in situ hybridization (GISH).

    PubMed

    Majka, Joanna; Majka, Maciej; Kwiatek, Michał; Wiśniewska, Halina

    2016-10-14

    In this paper, we highlight the affinity between the genomes of key representatives of the Pooideae subfamily, revealed at the chromosomal level by genomic in situ hybridization (GISH). The analyses were conducted using labeled probes from each species to hybridize with chromosomes of every species used in this study based on a "round robin" rule. As a result, the whole chromosomes or chromosome regions were distinguished or variable types of signals were visualized to prove the different levels of the relationships between genomes used in this study. We observed the unexpected lack of signals in secondary constrictions of rye (RR) chromosomes probed by triticale (AABBRR) genomic DNA. We have also identified unlabeled chromosome regions, which point to species-specific sequences connected with disparate pathways of chromosome differentiation. Our results revealed a conservative character of coding sequence of 35S rDNA among selected species of the genera Aegilops, Brachypodium, Festuca, Hordeum, Lolium, Secale, and Triticum. In summary, we showed strong relationships in genomic DNA sequences between species which have been previously reported to be phylogenetically distant.

  9. Array CGH reveals genomic aberrations in human emphysema.

    PubMed

    Choi, Jin Soo; Lee, Woon Jeong; Baik, Seung Ho; Yoon, Hyoung Kyu; Lee, Kweon-Haeng; Kim, Yeul Hong; Lim, Young; Wang, Young-Pil

    2009-01-01

    Emphysema is the major component of chronic obstructive pulmonary disease (COPD), which is the fourth leading cause of death in the world. Several epidemiologic studies suggest that genetic factors may have an important role in the pathogenesis of emphysema. We analyzed the gene expression profiles of chromosomal aberrations using array comparative genomic hybridization (array CGH) in 32 patients with emphysema to identify the candidate genes that might be causally involved in the pathogenesis of emphysema. Copy number gains and losses were detected in chromosomal regions, and the corresponding genes were confirmed by real-time polymerase chain reaction. Several frequently altered loci were found, including a gain at 5p15.33 (60% of the study subjects), and a loss at 7q22.1 (31% of the study subjects). DNA gains were identified at a high frequency at 1p, 5p, 11p, 12p, 15q, 17p, 18q, 21q, and 22q, whereas DNA losses were frequently found at 7q and 22q. We found that the fold change levels were highest at the CYP4B1 (1p33), JUN (1p32.1), NOTCH2 (1p12-p11.2), SDHA (5p15.33), KCNQ1 (11p15.5-p15.4), NINJ2 (12p13.33), PCSK6 (15q26.3), ABR (17p13.3), CTDP1 (18q23), RUNX1 (21q22.12) and HDAC10 (22q13.33) gene loci. We also observed losses in the MUC17 (7q22.1), COMT (22q11.21) and GSTT1 (22q11.2) genes. These studies show that array CGH is a useful tool for the identification of gene alterations in cases of emphysema and that the aforementioned genes might represent potential candidate genes involved in the pathogenesis of emphysema.

  10. Genome resequencing in Populus: Revealing large-scale genome variation and implications on specialized-trait genomics

    SciTech Connect

    Muchero, Wellington; Labbe, Jessy L; Priya, Ranjan; DiFazio, Steven P; Tuskan, Gerald A

    2014-01-01

    To date, Populus ranks among a few plant species with a complete genome sequence and other highly developed genomic resources. With the first genome sequence among all tree species, Populus has been adopted as a suitable model organism for genomic studies in trees. However, far from being just a model species, Populus is a key renewable economic resource that plays a significant role in providing raw materials for the biofuel and pulp and paper industries. Therefore, aside from leading frontiers of basic tree molecular biology and ecological research, Populus leads frontiers in addressing global economic challenges related to fuel and fiber production. The latter fact suggests that research aimed at improving quality and quantity of Populus as a raw material will likely drive the pursuit of more targeted and deeper research in order to unlock the economic potential tied in molecular biology processes that drive this tree species. Advances in genome sequence-driven technologies, such as resequencing individual genotypes, which in turn facilitates large scale SNP discovery and identification of large scale polymorphisms are key determinants of future success in these initiatives. In this treatise we discuss implications of genome sequence-enable technologies on Populus genomic and genetic studies of complex and specialized-traits.

  11. Genomic investigation reveals evolution and lifestyle adaptation of endophytic Staphylococcus epidermidis.

    PubMed

    Chaudhry, Vasvi; Patil, Prabhu B

    2016-01-13

    Staphylococcus epidermidis is a major human associated bacterium and also an emerging nosocomial pathogen. There are reports of its association to rodents, sheep and plants. However, comparative and evolutionary studies of ecologically diverse strains of S. epidermidis are lacking. Here, we report the whole genome sequences of four S. epidermidis strains isolated from surface sterilized rice seeds along with genome sequence of type strain. Phylogenomic analysis of rice endophytic S. epidermidis (RESE) with "type strain" unequivocally established their species identity. Whole genome based tree of 93 strains of S. epidermidis revealed RESE as distinct sub-lineage which is more related to rodent sub-lineage than to majority of human lineage strains. Furthermore, comparative genomics revealed 20% variable gene-pool in S. epidermidis, suggesting that genomes of ecologically diverse strains are under flux. Interestingly, we were also able to map several genomic regions that are under flux and gave rise to RESE strains. The largest of these genomic regions encodes a cluster of genes unique to RESE that are known to be required for survival and stress tolerance, apart from those required for adaptation to plant habitat. The genomes and genes of RESE represent distinct ecological resource/sequences and provided first evolutionary insights into adaptation of S. epidermidis to plants.

  12. Genomic investigation reveals evolution and lifestyle adaptation of endophytic Staphylococcus epidermidis

    PubMed Central

    Chaudhry, Vasvi; Patil, Prabhu B.

    2016-01-01

    Staphylococcus epidermidis is a major human associated bacterium and also an emerging nosocomial pathogen. There are reports of its association to rodents, sheep and plants. However, comparative and evolutionary studies of ecologically diverse strains of S. epidermidis are lacking. Here, we report the whole genome sequences of four S. epidermidis strains isolated from surface sterilized rice seeds along with genome sequence of type strain. Phylogenomic analysis of rice endophytic S. epidermidis (RESE) with “type strain” unequivocally established their species identity. Whole genome based tree of 93 strains of S. epidermidis revealed RESE as distinct sub-lineage which is more related to rodent sub-lineage than to majority of human lineage strains. Furthermore, comparative genomics revealed 20% variable gene-pool in S. epidermidis, suggesting that genomes of ecologically diverse strains are under flux. Interestingly, we were also able to map several genomic regions that are under flux and gave rise to RESE strains. The largest of these genomic regions encodes a cluster of genes unique to RESE that are known to be required for survival and stress tolerance, apart from those required for adaptation to plant habitat. The genomes and genes of RESE represent distinct ecological resource/sequences and provided first evolutionary insights into adaptation of S. epidermidis to plants. PMID:26758912

  13. Deciphering the Cryptic Genome: Genome-wide Analyses of the Rice Pathogen Fusarium fujikuroi Reveal Complex Regulation of Secondary Metabolism and Novel Metabolites

    PubMed Central

    Studt, Lena; Niehaus, Eva-Maria; Espino, Jose J.; Huß, Kathleen; Michielse, Caroline B.; Albermann, Sabine; Wagner, Dominik; Bergner, Sonja V.; Connolly, Lanelle R.; Fischer, Andreas; Reuter, Gunter; Kleigrewe, Karin; Bald, Till; Wingfield, Brenda D.; Ophir, Ron; Freeman, Stanley; Hippler, Michael; Smith, Kristina M.; Brown, Daren W.; Proctor, Robert H.; Münsterkötter, Martin; Freitag, Michael; Humpf, Hans-Ulrich; Güldener, Ulrich; Tudzynski, Bettina

    2013-01-01

    The fungus Fusarium fujikuroi causes “bakanae” disease of rice due to its ability to produce gibberellins (GAs), but it is also known for producing harmful mycotoxins. However, the genetic capacity for the whole arsenal of natural compounds and their role in the fungus' interaction with rice remained unknown. Here, we present a high-quality genome sequence of F. fujikuroi that was assembled into 12 scaffolds corresponding to the 12 chromosomes described for the fungus. We used the genome sequence along with ChIP-seq, transcriptome, proteome, and HPLC-FTMS-based metabolome analyses to identify the potential secondary metabolite biosynthetic gene clusters and to examine their regulation in response to nitrogen availability and plant signals. The results indicate that expression of most but not all gene clusters correlate with proteome and ChIP-seq data. Comparison of the F. fujikuroi genome to those of six other fusaria revealed that only a small number of gene clusters are conserved among these species, thus providing new insights into the divergence of secondary metabolism in the genus Fusarium. Noteworthy, GA biosynthetic genes are present in some related species, but GA biosynthesis is limited to F. fujikuroi, suggesting that this provides a selective advantage during infection of the preferred host plant rice. Among the genome sequences analyzed, one cluster that includes a polyketide synthase gene (PKS19) and another that includes a non-ribosomal peptide synthetase gene (NRPS31) are unique to F. fujikuroi. The metabolites derived from these clusters were identified by HPLC-FTMS-based analyses of engineered F. fujikuroi strains overexpressing cluster genes. In planta expression studies suggest a specific role for the PKS19-derived product during rice infection. Thus, our results indicate that combined comparative genomics and genome-wide experimental analyses identified novel genes and secondary metabolites that contribute to the evolutionary success of F

  14. Genome-Wide Divergence and Linkage Disequilibrium Analyses for Capsicum baccatum Revealed by Genome-Anchored Single Nucleotide Polymorphisms.

    PubMed

    Nimmakayala, Padma; Abburi, Venkata L; Saminathan, Thangasamy; Almeida, Aldo; Davenport, Brittany; Davidson, Joshua; Reddy, C V Chandra Mohan; Hankins, Gerald; Ebert, Andreas; Choi, Doil; Stommel, John; Reddy, Umesh K

    2016-01-01

    Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to characterize population structure and species domestication of these two important incompatible cultivated pepper species. Estimated mean nucleotide diversity (π) and Tajima's D across various chromosomes revealed biased distribution toward negative values on all chromosomes (except for chromosome 4) in cultivated C. baccatum, indicating a population bottleneck during domestication of C. baccatum. In contrast, C. annuum chromosomes showed positive π and Tajima's D on all chromosomes except chromosome 8, which may be because of domestication at multiple sites contributing to wider genetic diversity. For C. baccatum, 13,129 SNPs were available, with minor allele frequency (MAF) ≥0.05; PCA of the SNPs revealed 283 C. baccatum accessions grouped into 3 distinct clusters, for strong population structure. The fixation index (FST ) between domesticated C. annuum and C. baccatum was 0.78, which indicates genome-wide divergence. We conducted extensive linkage disequilibrium (LD) analysis of C. baccatum var. pendulum cultivars on all adjacent SNP pairs within a chromosome to identify regions of high and low LD interspersed with a genome-wide average LD block size of 99.1 kb. We characterized 1742 haplotypes containing 4420 SNPs (range 9-2 SNPs per haplotype). Genome-wide association study (GWAS) of peduncle length, a trait that differentiates wild and domesticated C. baccatum types, revealed 36 significantly associated genome-wide SNPs. Population structure, identity by state (IBS) and LD patterns across the genome will be of potential use for future GWAS of economically important traits in C. baccatum peppers.

  15. Genome-Wide Divergence and Linkage Disequilibrium Analyses for Capsicum baccatum Revealed by Genome-Anchored Single Nucleotide Polymorphisms

    PubMed Central

    Nimmakayala, Padma; Abburi, Venkata L.; Saminathan, Thangasamy; Almeida, Aldo; Davenport, Brittany; Davidson, Joshua; Reddy, C. V. Chandra Mohan; Hankins, Gerald; Ebert, Andreas; Choi, Doil; Stommel, John; Reddy, Umesh K.

    2016-01-01

    Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to characterize population structure and species domestication of these two important incompatible cultivated pepper species. Estimated mean nucleotide diversity (π) and Tajima's D across various chromosomes revealed biased distribution toward negative values on all chromosomes (except for chromosome 4) in cultivated C. baccatum, indicating a population bottleneck during domestication of C. baccatum. In contrast, C. annuum chromosomes showed positive π and Tajima's D on all chromosomes except chromosome 8, which may be because of domestication at multiple sites contributing to wider genetic diversity. For C. baccatum, 13,129 SNPs were available, with minor allele frequency (MAF) ≥0.05; PCA of the SNPs revealed 283 C. baccatum accessions grouped into 3 distinct clusters, for strong population structure. The fixation index (FST) between domesticated C. annuum and C. baccatum was 0.78, which indicates genome-wide divergence. We conducted extensive linkage disequilibrium (LD) analysis of C. baccatum var. pendulum cultivars on all adjacent SNP pairs within a chromosome to identify regions of high and low LD interspersed with a genome-wide average LD block size of 99.1 kb. We characterized 1742 haplotypes containing 4420 SNPs (range 9–2 SNPs per haplotype). Genome-wide association study (GWAS) of peduncle length, a trait that differentiates wild and domesticated C. baccatum types, revealed 36 significantly associated genome-wide SNPs. Population structure, identity by state (IBS) and LD patterns across the genome will be of potential use for future GWAS of economically important traits in C. baccatum peppers. PMID:27857720

  16. Comparative Genomics of Flatworms (Platyhelminthes) Reveals Shared Genomic Features of Ecto- and Endoparastic Neodermata

    PubMed Central

    Hahn, Christoph; Fromm, Bastian; Bachmann, Lutz

    2014-01-01

    The ectoparasitic Monogenea comprise a major part of the obligate parasitic flatworm diversity. Although genomic adaptations to parasitism have been studied in the endoparasitic tapeworms (Cestoda) and flukes (Trematoda), no representative of the Monogenea has been investigated yet. We present the high-quality draft genome of Gyrodactylus salaris, an economically important monogenean ectoparasite of wild Atlantic salmon (Salmo salar). A total of 15,488 gene models were identified, of which 7,102 were functionally annotated. The controversial phylogenetic relationships within the obligate parasitic Neodermata were resolved in a phylogenomic analysis using 1,719 gene models (alignment length of >500,000 amino acids) for a set of 16 metazoan taxa. The Monogenea were found basal to the Cestoda and Trematoda, which implies ectoparasitism being plesiomorphic within the Neodermata and strongly supports a common origin of complex life cycles. Comparative analysis of seven parasitic flatworm genomes identified shared genomic features for the ecto- and endoparasitic lineages, such as a substantial reduction of the core bilaterian gene complement, including the homeodomain-containing genes, and a loss of the piwi and vasa genes, which are considered essential for animal development. Furthermore, the shared loss of functional fatty acid biosynthesis pathways and the absence of peroxisomes, the latter organelles presumed ubiquitous in eukaryotes except for parasitic protozoans, were inferred. The draft genome of G. salaris opens for future in-depth analyses of pathogenicity and host specificity of poorly characterized G. salaris strains, and will enhance studies addressing the genomics of host–parasite interactions and speciation in the highly diverse monogenean flatworms. PMID:24732282

  17. Comparative genomics of flatworms (platyhelminthes) reveals shared genomic features of ecto- and endoparastic neodermata.

    PubMed

    Hahn, Christoph; Fromm, Bastian; Bachmann, Lutz

    2014-05-01

    The ectoparasitic Monogenea comprise a major part of the obligate parasitic flatworm diversity. Although genomic adaptations to parasitism have been studied in the endoparasitic tapeworms (Cestoda) and flukes (Trematoda), no representative of the Monogenea has been investigated yet. We present the high-quality draft genome of Gyrodactylus salaris, an economically important monogenean ectoparasite of wild Atlantic salmon (Salmo salar). A total of 15,488 gene models were identified, of which 7,102 were functionally annotated. The controversial phylogenetic relationships within the obligate parasitic Neodermata were resolved in a phylogenomic analysis using 1,719 gene models (alignment length of >500,000 amino acids) for a set of 16 metazoan taxa. The Monogenea were found basal to the Cestoda and Trematoda, which implies ectoparasitism being plesiomorphic within the Neodermata and strongly supports a common origin of complex life cycles. Comparative analysis of seven parasitic flatworm genomes identified shared genomic features for the ecto- and endoparasitic lineages, such as a substantial reduction of the core bilaterian gene complement, including the homeodomain-containing genes, and a loss of the piwi and vasa genes, which are considered essential for animal development. Furthermore, the shared loss of functional fatty acid biosynthesis pathways and the absence of peroxisomes, the latter organelles presumed ubiquitous in eukaryotes except for parasitic protozoans, were inferred. The draft genome of G. salaris opens for future in-depth analyses of pathogenicity and host specificity of poorly characterized G. salaris strains, and will enhance studies addressing the genomics of host-parasite interactions and speciation in the highly diverse monogenean flatworms.

  18. Genome sequencing and comparative genomics of honey bee microsporidia, Nosema apis reveal novel insights into host-parasite interactions

    PubMed Central

    2013-01-01

    Background The microsporidia parasite Nosema contributes to the steep global decline of honey bees that are critical pollinators of food crops. There are two species of Nosema that have been found to infect honey bees, Nosema apis and N. ceranae. Genome sequencing of N. apis and comparative genome analysis with N. ceranae, a fully sequenced microsporidia species, reveal novel insights into host-parasite interactions underlying the parasite infections. Results We applied the whole-genome shotgun sequencing approach to sequence and assemble the genome of N. apis which has an estimated size of 8.5 Mbp. We predicted 2,771 protein- coding genes and predicted the function of each putative protein using the Gene Ontology. The comparative genomic analysis led to identification of 1,356 orthologs that are conserved between the two Nosema species and genes that are unique characteristics of the individual species, thereby providing a list of virulence factors and new genetic tools for studying host-parasite interactions. We also identified a highly abundant motif in the upstream promoter regions of N. apis genes. This motif is also conserved in N. ceranae and other microsporidia species and likely plays a role in gene regulation across the microsporidia. Conclusions The availability of the N. apis genome sequence is a significant addition to the rapidly expanding body of microsprodian genomic data which has been improving our understanding of eukaryotic genome diversity and evolution in a broad sense. The predicted virulent genes and transcriptional regulatory elements are potential targets for innovative therapeutics to break down the life cycle of the parasite. PMID:23829473

  19. Heteroplasmy in the mitochondrial genomes of human lice and ticks revealed by high throughput sequencing.

    PubMed

    Xiong, Haoyu; Barker, Stephen C; Burger, Thomas D; Raoult, Didier; Shao, Renfu

    2013-01-01

    The typical mitochondrial (mt) genomes of bilateral animals consist of 37 genes on a single circular chromosome. The mt genomes of the human body louse, Pediculus humanus, and the human head louse, Pediculus capitis, however, are extensively fragmented and contain 20 minichromosomes, with one to three genes on each minichromosome. Heteroplasmy, i.e. nucleotide polymorphisms in the mt genome within individuals, has been shown to be significantly higher in the mt cox1 gene of human lice than in humans and other animals that have the typical mt genomes. To understand whether the extent of heteroplasmy in human lice is associated with mt genome fragmentation, we sequenced the entire coding regions of all of the mt minichromosomes of six human body lice and six human head lice from Ethiopia, China and France with an Illumina HiSeq platform. For comparison, we also sequenced the entire coding regions of the mt genomes of seven species of ticks, which have the typical mitochondrial genome organization of bilateral animals. We found that the level of heteroplasmy varies significantly both among the human lice and among the ticks. The human lice from Ethiopia have significantly higher level of heteroplasmy than those from China and France (Pt<0.05). The tick, Amblyomma cajennense, has significantly higher level of heteroplasmy than other ticks (Pt<0.05). Our results indicate that heteroplasmy level can be substantially variable within a species and among closely related species, and does not appear to be determined by single factors such as genome fragmentation.

  20. Improved genome assembly of American alligator genome reveals conserved architecture of estrogen signaling.

    PubMed

    Rice, Edward S; Kohno, Satomi; John, John St; Pham, Son; Howard, Jonathan; Lareau, Liana F; O'Connell, Brendan L; Hickey, Glenn; Armstrong, Joel; Deran, Alden; Fiddes, Ian; Platt, Roy N; Gresham, Cathy; McCarthy, Fiona; Kern, Colin; Haan, David; Phan, Tan; Schmidt, Carl; Sanford, Jeremy R; Ray, David A; Paten, Benedict; Guillette, Louis J; Green, Richard E

    2017-01-30

    The American alligator, Alligator mississippiensis, like all crocodilians, has temperature-dependent sex determination, in which the sex of an embryo is determined by the incubation temperature of the egg during a critical period of development. The lack of genetic differences between male and female alligators leaves open the question of how the genes responsible for sex determination and differentiation are regulated. Insight into this question comes from the fact that exposing an embryo incubated at male-producing temperature to estrogen causes it to develop ovaries. Because estrogen response elements are known to regulate genes over long distances, a contiguous genome assembly is crucial for predicting and understanding their impact. We present an improved assembly of the American alligator genome, scaffolded with in vitro proximity ligation (Chicago) data. We use this assembly to scaffold two other crocodilian genomes based on synteny. We perform RNA sequencing of tissues from American alligator embryos to find genes that are differentially expressed between embryos incubated at male- versus female-producing temperature. Finally, we use the improved contiguity of our assembly along with the current model of CTCF-mediated chromatin looping to predict regions of the genome likely to contain estrogen-responsive genes. We find that these regions are significantly enriched for genes with female-biased expression in developing gonads after the critical period during which sex is determined by incubation temperature. We thus conclude that estrogen signaling is a major driver of female-biased gene expression in the post-temperature sensitive period gonads.

  1. Genome-Wide Sequencing Reveals Two Major Sub-Lineages in the Genetically Monomorphic Pathogen Xanthomonas Campestris Pathovar Musacearum

    PubMed Central

    Wasukira, Arthur; Tayebwa, Johnbosco; Thwaites, Richard; Paszkiewicz, Konrad; Aritua, Valente; Kubiriba, Jerome; Smith, Julian; Grant, Murray; Studholme, David J.

    2012-01-01

    The bacterium Xanthomonas campestris pathovar musacearum (Xcm) is the causal agent of banana Xanthomonas wilt (BXW). This disease has devastated economies based on banana and plantain crops (Musa species) in East Africa. Here we use genome-wide sequencing to discover a set of single-nucleotide polymorphisms (SNPs) among East African isolates of Xcm. These SNPs have potential as molecular markers for phylogeographic studies of the epidemiology and spread of the pathogen. Our analysis reveals two major sub-lineages of the pathogen, suggesting that the current outbreaks of BXW on Musa species in the region may have more than one introductory event, perhaps from Ethiopia. Also, based on comparisons of genome-wide sequence data from multiple isolates of Xcm and multiple strains of X. vasicola pathovar vasculorum, we identify genes specific to Xcm that could be used to specifically detect Xcm by PCR-based methods. PMID:24704974

  2. Genome-wide sequencing reveals two major sub-lineages in the genetically monomorphic pathogen xanthomonas campestris pathovar musacearum.

    PubMed

    Wasukira, Arthur; Tayebwa, Johnbosco; Thwaites, Richard; Paszkiewicz, Konrad; Aritua, Valente; Kubiriba, Jerome; Smith, Julian; Grant, Murray; Studholme, David J

    2012-07-04

    The bacterium Xanthomonas campestris pathovar musacearum (Xcm) is the causal agent of banana Xanthomonas wilt (BXW). This disease has devastated economies based on banana and plantain crops (Musa species) in East Africa. Here we use genome-wide sequencing to discover a set of single-nucleotide polymorphisms (SNPs) among East African isolates of Xcm. These SNPs have potential as molecular markers for phylogeographic studies of the epidemiology and spread of the pathogen. Our analysis reveals two major sub-lineages of the pathogen, suggesting that the current outbreaks of BXW on Musa species in the region may have more than one introductory event, perhaps from Ethiopia. Also, based on comparisons of genome-wide sequence data from multiple isolates of Xcm and multiple strains of X. vasicola pathovar vasculorum, we identify genes specific to Xcm that could be used to specifically detect Xcm by PCR-based methods.

  3. Genetic variability of mutans streptococci revealed by wide whole-genome sequencing

    PubMed Central

    2013-01-01

    Background Mutans streptococci are a group of bacteria significantly contributing to tooth decay. Their genetic variability is however still not well understood. Results Genomes of 6 clinical S. mutans isolates of different origins, one isolate of S. sobrinus (DSM 20742) and one isolate of S. ratti (DSM 20564) were sequenced and comparatively analyzed. Genome alignment revealed a mosaic-like structure of genome arrangement. Genes related to pathogenicity are found to have high variations among the strains, whereas genes for oxidative stress resistance are well conserved, indicating the importance of this trait in the dental biofilm community. Analysis of genome-scale metabolic networks revealed significant differences in 42 pathways. A striking dissimilarity is the unique presence of two lactate oxidases in S. sobrinus DSM 20742, probably indicating an unusual capability of this strain in producing H2O2 and expanding its ecological niche. In addition, lactate oxidases may form with other enzymes a novel energetic pathway in S. sobrinus DSM 20742 that can remedy its deficiency in citrate utilization pathway. Using 67 S. mutans genomes currently available including the strains sequenced in this study, we estimates the theoretical core genome size of S. mutans, and performed modeling of S. mutans pan-genome by applying different fitting models. An “open” pan-genome was inferred. Conclusions The comparative genome analyses revealed diversities in the mutans streptococci group, especially with respect to the virulence related genes and metabolic pathways. The results are helpful for better understanding the evolution and adaptive mechanisms of these oral pathogen microorganisms and for combating them. PMID:23805886

  4. Comparative hybridization reveals extensive genome variation in the AIDS-associated pathogen Cryptococcus neoformans

    PubMed Central

    Hu, Guanggan; Liu, Iris; Sham, Anita; Stajich, Jason E; Dietrich, Fred S; Kronstad, James W

    2008-01-01

    Background Genome variability can have a profound influence on the virulence of pathogenic microbes. The availability of genome sequences for two strains of the AIDS-associated fungal pathogen Cryptococcus neoformans presented an opportunity to use comparative genome hybridization (CGH) to examine genome variability between strains of different mating type, molecular subtype, and ploidy. Results Initially, CGH was used to compare the approximately 100 kilobase MATa and MATα mating-type regions in serotype A and D strains to establish the relationship between the Log2 ratios of hybridization signals and sequence identity. Subsequently, we compared the genomes of the environmental isolate NIH433 (MATa) and the clinical isolate NIH12 (MATα) with a tiling array of the genome of the laboratory strain JEC21 derived from these strains. In this case, CGH identified putative recombination sites and the origins of specific segments of the JEC21 genome. Similarly, CGH analysis revealed marked variability in the genomes of strains representing the VNI, VNII, and VNB molecular subtypes of the A serotype, including disomy for chromosome 13 in two strains. Additionally, CGH identified differences in chromosome content between three strains with the hybrid AD serotype and revealed that chromosome 1 from the serotype A genome is preferentially retained in all three strains. Conclusion The genomes of serotypes A, D, and AD strains exhibit extensive variation that spans the range from small differences (such as regions of divergence, deletion, or amplification) to the unexpected disomy for chromosome 13 in haploid strains and preferential retention of specific chromosomes in naturally occurring diploids. PMID:18294377

  5. Stepwise Evolution of Coral Biomineralization Revealed with Genome-Wide Proteomics and Transcriptomics

    PubMed Central

    Sawada, Hitoshi; Satoh, Noriyuki

    2016-01-01

    Despite the importance of stony corals in many research fields related to global issues, such as marine ecology, climate change, paleoclimatogy, and metazoan evolution, very little is known about the evolutionary origin of coral skeleton formation. In order to investigate the evolution of coral biomineralization, we have identified skeletal organic matrix proteins (SOMPs) in the skeletal proteome of the scleractinian coral, Acropora digitifera, for which large genomic and transcriptomic datasets are available. Scrupulous gene annotation was conducted based on comparisons of functional domain structures among metazoans. We found that SOMPs include not only coral-specific proteins, but also protein families that are widely conserved among cnidarians and other metazoans. We also identified several conserved transmembrane proteins in the skeletal proteome. Gene expression analysis revealed that expression of these conserved genes continues throughout development. Therefore, these genes are involved not only skeleton formation, but also in basic cellular functions, such as cell-cell interaction and signaling. On the other hand, genes encoding coral-specific proteins, including extracellular matrix domain-containing proteins, galaxins, and acidic proteins, were prominently expressed in post-settlement stages, indicating their role in skeleton formation. Taken together, the process of coral skeleton formation is hypothesized as: 1) formation of initial extracellular matrix between epithelial cells and substrate, employing pre-existing transmembrane proteins; 2) additional extracellular matrix formation using novel proteins that have emerged by domain shuffling and rapid molecular evolution and; 3) calcification controlled by coral-specific SOMPs. PMID:27253604

  6. Complete Mitochondrial Genomes Reveal Neolithic Expansion into Europe

    PubMed Central

    Fu, Qiaomei; Rudan, Pavao; Pääbo, Svante; Krause, Johannes

    2012-01-01

    The Neolithic transition from hunting and gathering to farming and cattle breeding marks one of the most drastic cultural changes in European prehistory. Short stretches of ancient mitochondrial DNA (mtDNA) from skeletons of pre-Neolithic hunter-gatherers as well as early Neolithic farmers support the demic diffusion model where a migration of early farmers from the Near East and a replacement of pre-Neolithic hunter-gatherers are largely responsible for cultural innovation and changes in subsistence strategies during the Neolithic revolution in Europe. In order to test if a signal of population expansion is still present in modern European mitochondrial DNA, we analyzed a comprehensive dataset of 1,151 complete mtDNAs from present-day Europeans. Relying upon ancient DNA data from previous investigations, we identified mtDNA haplogroups that are typical for early farmers and hunter-gatherers, namely H and U respectively. Bayesian skyline coalescence estimates were then used on subsets of complete mtDNAs from modern populations to look for signals of past population expansions. Our analyses revealed a population expansion between 15,000 and 10,000 years before present (YBP) in mtDNAs typical for hunters and gatherers, with a decline between 10,000 and 5,000 YBP. These corresponded to an analogous population increase approximately 9,000 YBP for mtDNAs typical of early farmers. The observed changes over time suggest that the spread of agriculture in Europe involved the expansion of farming populations into Europe followed by the eventual assimilation of resident hunter-gatherers. Our data show that contemporary mtDNA datasets can be used to study ancient population history if only limited ancient genetic data is available. PMID:22427842

  7. Complete mitochondrial genomes reveal neolithic expansion into Europe.

    PubMed

    Fu, Qiaomei; Rudan, Pavao; Pääbo, Svante; Krause, Johannes

    2012-01-01

    The Neolithic transition from hunting and gathering to farming and cattle breeding marks one of the most drastic cultural changes in European prehistory. Short stretches of ancient mitochondrial DNA (mtDNA) from skeletons of pre-Neolithic hunter-gatherers as well as early Neolithic farmers support the demic diffusion model where a migration of early farmers from the Near East and a replacement of pre-Neolithic hunter-gatherers are largely responsible for cultural innovation and changes in subsistence strategies during the Neolithic revolution in Europe. In order to test if a signal of population expansion is still present in modern European mitochondrial DNA, we analyzed a comprehensive dataset of 1,151 complete mtDNAs from present-day Europeans. Relying upon ancient DNA data from previous investigations, we identified mtDNA haplogroups that are typical for early farmers and hunter-gatherers, namely H and U respectively. Bayesian skyline coalescence estimates were then used on subsets of complete mtDNAs from modern populations to look for signals of past population expansions. Our analyses revealed a population expansion between 15,000 and 10,000 years before present (YBP) in mtDNAs typical for hunters and gatherers, with a decline between 10,000 and 5,000 YBP. These corresponded to an analogous population increase approximately 9,000 YBP for mtDNAs typical of early farmers. The observed changes over time suggest that the spread of agriculture in Europe involved the expansion of farming populations into Europe followed by the eventual assimilation of resident hunter-gatherers. Our data show that contemporary mtDNA datasets can be used to study ancient population history if only limited ancient genetic data is available.

  8. Asymmetric Genome Organization in an RNA Virus Revealed via Graph-Theoretical Analysis of Tomographic Data

    PubMed Central

    Geraets, James A.; Dykeman, Eric C.; Stockley, Peter G.; Ranson, Neil A.; Twarock, Reidun

    2015-01-01

    Cryo-electron microscopy permits 3-D structures of viral pathogens to be determined in remarkable detail. In particular, the protein containers encapsulating viral genomes have been determined to high resolution using symmetry averaging techniques that exploit the icosahedral architecture seen in many viruses. By contrast, structure determination of asymmetric components remains a challenge, and novel analysis methods are required to reveal such features and characterize their functional roles during infection. Motivated by the important, cooperative roles of viral genomes in the assembly of single-stranded RNA viruses, we have developed a new analysis method that reveals the asymmetric structural organization of viral genomes in proximity to the capsid in such viruses. The method uses geometric constraints on genome organization, formulated based on knowledge of icosahedrally-averaged reconstructions and the roles of the RNA-capsid protein contacts, to analyse cryo-electron tomographic data. We apply this method to the low-resolution tomographic data of a model virus and infer the unique asymmetric organization of its genome in contact with the protein shell of the capsid. This opens unprecedented opportunities to analyse viral genomes, revealing conserved structural features and mechanisms that can be targeted in antiviral drug design. PMID:25793998

  9. Comparative Genome Analyses of Vibrio anguillarum Strains Reveal a Link with Pathogenicity Traits

    PubMed Central

    Castillo, Daniel; Alvise, Paul D.; Xu, Ruiqi; Zhang, Faxing; Middelboe, Mathias

    2017-01-01

    ABSTRACT Vibrio anguillarum is a marine bacterium that can cause vibriosis in many fish and shellfish species, leading to high mortalities and economic losses in aquaculture. Although putative virulence factors have been identified, the mechanism of pathogenesis of V. anguillarum is not fully understood. Here, we analyzed whole-genome sequences of a collection of V. anguillarum strains and compared them to virulence of the strains as determined in larval challenge assays. Previously identified virulence factors were globally distributed among the strains, with some genetic diversity. However, the pan-genome revealed that six out of nine high-virulence strains possessed a unique accessory genome that was attributed to pathogenic genomic islands, prophage-like elements, virulence factors, and a new set of gene clusters involved in biosynthesis, modification, and transport of polysaccharides. In contrast, V. anguillarum strains that were medium to nonvirulent had a high degree of genomic homogeneity. Finally, we found that a phylogeny based on the core genomes clustered the strains with moderate to no virulence, while six out of nine high-virulence strains represented phylogenetically separate clusters. Hence, we suggest a link between genotype and virulence characteristics of Vibrio anguillarum, which can be used to unravel the molecular evolution of V. anguillarum and can also be important from survey and diagnostic perspectives. IMPORTANCE Comparative genome analysis of strains of a pathogenic bacterial species can be a powerful tool to discover acquisition of mobile genetic elements related to virulence. Here, we compared 28 V. anguillarum strains that differed in virulence in fish larval models. By pan-genome analyses, we found that six of nine highly virulent strains had a unique core and accessory genome. In contrast, V. anguillarum strains that were medium to nonvirulent had low genomic diversity. Integration of genomic and phenotypic features provides

  10. Comparative Genomics of the Extreme Acidophile Acidithiobacillus thiooxidans Reveals Intraspecific Divergence and Niche Adaptation

    PubMed Central

    Zhang, Xian; Feng, Xue; Tao, Jiemeng; Ma, Liyuan; Xiao, Yunhua; Liang, Yili; Liu, Xueduan; Yin, Huaqun

    2016-01-01

    Acidithiobacillus thiooxidans known for its ubiquity in diverse acidic and sulfur-bearing environments worldwide was used as the research subject in this study. To explore the genomic fluidity and intraspecific diversity of Acidithiobacillus thiooxidans (A. thiooxidans) species, comparative genomics based on nine draft genomes was performed. Phylogenomic scrutiny provided first insights into the multiple groupings of these strains, suggesting that genetic diversity might be potentially correlated with their geographic distribution as well as geochemical conditions. While these strains shared a large number of common genes, they displayed differences in gene content. Functional assignment indicated that the core genome was essential for microbial basic activities such as energy acquisition and uptake of nutrients, whereas the accessory genome was thought to be involved in niche adaptation. Comprehensive analysis of their predicted central metabolism revealed that few differences were observed among these strains. Further analyses showed evidences of relevance between environmental conditions and genomic diversification. Furthermore, a diverse pool of mobile genetic elements including insertion sequences and genomic islands in all A. thiooxidans strains probably demonstrated the frequent genetic flow (such as lateral gene transfer) in the extremely acidic environments. From another perspective, these elements might endow A. thiooxidans species with capacities to withstand the chemical constraints of their natural habitats. Taken together, our findings bring some valuable data to better understand the genomic diversity and econiche adaptation within A. thiooxidans strains. PMID:27548157

  11. Constraints on Genome Dynamics Revealed from Gene Distribution among the Ralstonia solanacearum Species

    PubMed Central

    Lefeuvre, Pierre; Cellier, Gilles; Remenant, Benoît; Chiroleu, Frédéric; Prior, Philippe

    2013-01-01

    Because it is suspected that gene content may partly explain host adaptation and ecology of pathogenic bacteria, it is important to study factors affecting genome composition and its evolution. While recent genomic advances have revealed extremely large pan-genomes for some bacterial species, it remains difficult to predict to what extent gene pool is accessible within or transferable between populations. As genomes bear imprints of the history of the organisms, gene distribution pattern analyses should provide insights into the forces and factors at play in the shaping and maintaining of bacterial genomes. In this study, we revisited the data obtained from a previous CGH microarrays analysis in order to assess the genomic plasticity of the R. solanacearum species complex. Gene distribution analyses demonstrated the remarkably dispersed genome of R. solanacearum with more than half of the genes being accessory. From the reconstruction of the ancestral genomes compositions, we were able to infer the number of gene gain and loss events along the phylogeny. Analyses of gene movement patterns reveal that factors associated with gene function, genomic localization and ecology delineate gene flow patterns. While the chromosome displayed lower rates of movement, the megaplasmid was clearly associated with hot-spots of gene gain and loss. Gene function was also confirmed to be an essential factor in gene gain and loss dynamics with significant differences in movement patterns between different COG categories. Finally, analyses of gene distribution highlighted possible highways of horizontal gene transfer. Due to sampling and design bias, we can only speculate on factors at play in this gene movement dynamic. Further studies examining precise conditions that favor gene transfer would provide invaluable insights in the fate of bacteria, species delineation and the emergence of successful pathogens. PMID:23723974

  12. Comparative Genome Sequence Analysis Reveals the Extent of Diversity and Conservation for Glycan-Associated Proteins in Burkholderia spp.

    PubMed Central

    Ong, Hui San; Mohamed, Rahmah; Firdaus-Raih, Mohd

    2012-01-01

    Members of the Burkholderia family occupy diverse ecological niches. In pathogenic family members, glycan-associated proteins are often linked to functions that include virulence, protein conformation maintenance, surface recognition, cell adhesion, and immune system evasion. Comparative analysis of available Burkholderia genomes has revealed a core set of 178 glycan-associated proteins shared by all Burkholderia of which 68 are homologous to known essential genes. The genome sequence comparisons revealed insights into species-specific gene acquisitions through gene transfers, identified an S-layer protein, and proposed that significantly reactive surface proteins are associated to sugar moieties as a potential means to circumvent host defense mechanisms. The comparative analysis using a curated database of search queries enabled us to gain insights into the extent of conservation and diversity, as well as the possible virulence-associated roles of glycan-associated proteins in members of the Burkholderia spp. The curated list of glycan-associated proteins used can also be directed to screen other genomes for glycan-associated homologs. PMID:22991502

  13. Transcriptome and methylome profiling reveals relics of genome dominance in the mesopolyploid Brassica oleracea

    PubMed Central

    2014-01-01

    Background Brassica oleracea is a valuable vegetable species that has contributed to human health and nutrition for hundreds of years and comprises multiple distinct cultivar groups with diverse morphological and phytochemical attributes. In addition to this phenotypic wealth, B. oleracea offers unique insights into polyploid evolution, as it results from multiple ancestral polyploidy events and a final Brassiceae-specific triplication event. Further, B. oleracea represents one of the diploid genomes that formed the economically important allopolyploid oilseed, Brassica napus. A deeper understanding of B. oleracea genome architecture provides a foundation for crop improvement strategies throughout the Brassica genus. Results We generate an assembly representing 75% of the predicted B. oleracea genome using a hybrid Illumina/Roche 454 approach. Two dense genetic maps are generated to anchor almost 92% of the assembled scaffolds to nine pseudo-chromosomes. Over 50,000 genes are annotated and 40% of the genome predicted to be repetitive, thus contributing to the increased genome size of B. oleracea compared to its close relative B. rapa. A snapshot of both the leaf transcriptome and methylome allows comparisons to be made across the triplicated sub-genomes, which resulted from the most recent Brassiceae-specific polyploidy event. Conclusions Differential expression of the triplicated syntelogs and cytosine methylation levels across the sub-genomes suggest residual marks of the genome dominance that led to the current genome architecture. Although cytosine methylation does not correlate with individual gene dominance, the independent methylation patterns of triplicated copies suggest epigenetic mechanisms play a role in the functional diversification of duplicate genes. PMID:24916971

  14. Adaptations to a subterranean environment and longevity revealed by the analysis of mole rat genomes

    PubMed Central

    Fang, Xiaodong; Seim, Inge; Huang, Zhiyong; Gerashchenko, Maxim V.; Xiong, Zhiqiang; Turanov, Anton A.; Zhu, Yabing; Lobanov, Alexei V.; Fan, Dingding; Yim, Sun Hee; Yao, Xiaoming; Ma, Siming; Yang, Lan; Lee, Sang-Goo; Kim, Eun Bae; Bronson, Roderick T.; Šumbera, Radim; Buffenstein, Rochelle; Zhou, Xin; Krogh, Anders; Park, Thomas J.; Zhang, Guojie; Wang, Jun; Gladyshev, Vadim N.

    2014-01-01

    SUMMARY Subterranean mammals spend their lives in dark, unventilated environments rich in carbon dioxide and ammonia, and low in oxygen. Many of these animals are also long-lived and exhibit reduced aging-associated diseases, such as neurodegenerative disorders and cancer. We sequenced the genome of the Damaraland mole rat (DMR, Fukomys damarensis) and improved the genome assembly of the naked mole rat (NMR, Heterocephalus glaber). Comparative genome analysis, along with transcriptomes of related subterranean rodents, reveal candidate molecular adaptations for subterranean life and longevity, including a divergent insulin peptide, expression of oxygen-carrying globins in the brain, prevention of high CO2-induced pain perception, and enhanced ammonia detoxification. Juxtaposition of the genomes of DMR and other more conventional animals with the genome of NMR revealed several truly exceptional NMR features: unusual thermogenesis, aberrant melatonin system, pain insensitivity, and novel processing of 28S rRNA. Together, the new genomes and transcriptomes extend our understanding of subterranean adaptations, stress resistance and longevity. PMID:25176646

  15. History of plastid DNA insertions reveals weak deletion and at mutation biases in angiosperm mitochondrial genomes.

    PubMed

    Sloan, Daniel B; Wu, Zhiqiang

    2014-11-21

    Angiosperm mitochondrial genomes exhibit many unusual properties, including heterogeneous nucleotide composition and exceptionally large and variable genome sizes. Determining the role of nonadaptive mechanisms such as mutation bias in shaping the molecular evolution of these unique genomes has proven challenging because their dynamic structures generally prevent identification of homologous intergenic sequences for comparative analyses. Here, we report an analysis of angiosperm mitochondrial DNA sequences that are derived from inserted plastid DNA (mtpts). The availability of numerous completely sequenced plastid genomes allows us to infer the evolutionary history of these insertions, including the specific nucleotide substitutions and indels that have occurred because their incorporation into the mitochondrial genome. Our analysis confirmed that many mtpts have a complex history, including frequent gene conversion and multiple examples of horizontal transfer between divergent angiosperm lineages. Nevertheless, it is clear that the majority of extant mtpt sequence in angiosperms is the product of recent transfer (or gene conversion) and is subject to rapid loss/deterioration, suggesting that most mtpts are evolving relatively free from functional constraint. The evolution of mtpt sequences reveals a pattern of biased mutational input in angiosperm mitochondrial genomes, including an excess of small deletions over insertions and a skew toward nucleotide substitutions that increase AT content. However, these mutation biases are far weaker than have been observed in many other cellular genomes, providing insight into some of the notable features of angiosperm mitochondrial architecture, including the retention of large intergenic regions and the relatively neutral GC content found in these regions.

  16. Whole genome sequence of Desulfovibrio magneticus strain RS-1 revealed common gene clusters in magnetotactic bacteria

    PubMed Central

    Nakazawa, Hidekazu; Arakaki, Atsushi; Narita-Yamada, Sachiko; Yashiro, Isao; Jinno, Koji; Aoki, Natsuko; Tsuruyama, Ai; Okamura, Yoshiko; Tanikawa, Satoshi; Fujita, Nobuyuki; Takeyama, Haruko; Matsunaga, Tadashi

    2009-01-01

    Magnetotactic bacteria are ubiquitous microorganisms that synthesize intracellular magnetite particles (magnetosomes) by accumulating Fe ions from aquatic environments. Recent molecular studies, including comprehensive proteomic, transcriptomic, and genomic analyses, have considerably improved our hypotheses of the magnetosome-formation mechanism. However, most of these studies have been conducted using pure-cultured bacterial strains of α-proteobacteria. Here, we report the whole-genome sequence of Desulfovibrio magneticus strain RS-1, the only isolate of magnetotactic microorganisms classified under δ-proteobacteria. Comparative genomics of the RS-1 and four α-proteobacterial strains revealed the presence of three separate gene regions (nuo and mamAB-like gene clusters, and gene region of a cryptic plasmid) conserved in all magnetotactic bacteria. The nuo gene cluster, encoding NADH dehydrogenase (complex I), was also common to the genomes of three iron-reducing bacteria exhibiting uncontrolled extracellular and/or intracellular magnetite synthesis. A cryptic plasmid, pDMC1, encodes three homologous genes that exhibit high similarities with those of other magnetotactic bacterial strains. In addition, the mamAB-like gene cluster, encoding the key components for magnetosome formation such as iron transport and magnetosome alignment, was conserved only in the genomes of magnetotactic bacteria as a similar genomic island-like structure. Our findings suggest the presence of core genetic components for magnetosome biosynthesis; these genes may have been acquired into the magnetotactic bacterial genomes by multiple gene-transfer events during proteobacterial evolution. PMID:19675025

  17. Genome Sequencing of the Behavior Manipulating Virus LbFV Reveals a Possible New Virus Family

    PubMed Central

    Lepetit, David; Gillet, Benjamin; Hughes, Sandrine; Kraaijeveld, Ken

    2016-01-01

    Parasites are sometimes able to manipulate the behavior of their hosts. However, the molecular cues underlying this phenomenon are poorly documented. We previously reported that the parasitoid wasp Leptopilina boulardi which develops from Drosophila larvae is often infected by an inherited DNA virus. In addition to being maternally transmitted, the virus benefits from horizontal transmission in superparasitized larvae (Drosophila that have been parasitized several times). Interestingly, the virus forces infected females to lay eggs in already parasitized larvae, thus increasing the chance of being horizontally transmitted. In a first step towards the identification of virus genes responsible for the behavioral manipulation, we present here the genome sequence of the virus, called LbFV. The sequencing revealed that its genome contains an homologous repeat sequence (hrs) found in eight regions in the genome. The presence of this hrs may explain the genomic plasticity that we observed for this genome. The genome of LbFV encodes 108 ORFs, most of them having no homologs in public databases. The virus is however related to Hytrosaviridae, although distantly. LbFV may thus represent a member of a new virus family. Several genes of LbFV were captured from eukaryotes, including two anti-apoptotic genes. More surprisingly, we found that LbFV captured from an ancestral wasp a protein with a Jumonji domain. This gene was afterwards duplicated in the virus genome. We hypothesized that this gene may be involved in manipulating the expression of wasp genes, and possibly in manipulating its behavior. PMID:28173110

  18. Gekko japonicus genome reveals evolution of adhesive toe pads and tail regeneration

    PubMed Central

    Liu, Yan; Zhou, Qian; Wang, Yongjun; Luo, Longhai; Yang, Jian; Yang, Linfeng; Liu, Mei; Li, Yingrui; Qian, Tianmei; Zheng, Yuan; Li, Meiyuan; Li, Jiang; Gu, Yun; Han, Zujing; Xu, Man; Wang, Yingjie; Zhu, Changlai; Yu, Bin; Yang, Yumin; Ding, Fei; Jiang, Jianping; Yang, Huanming; Gu, Xiaosong

    2015-01-01

    Reptiles are the most morphologically and physiologically diverse tetrapods, and have undergone 300 million years of adaptive evolution. Within the reptilian tetrapods, geckos possess several interesting features, including the ability to regenerate autotomized tails and to climb on smooth surfaces. Here we sequence the genome of Gekko japonicus (Schlegel's Japanese Gecko) and investigate genetic elements related to its physiology. We obtain a draft G. japonicus genome sequence of 2.55 Gb and annotated 22,487 genes. Comparative genomic analysis reveals specific gene family expansions or reductions that are associated with the formation of adhesive setae, nocturnal vision and tail regeneration, as well as the diversification of olfactory sensation. The obtained genomic data provide robust genetic evidence of adaptive evolution in reptiles. PMID:26598231

  19. What genomic sequence information has revealed about Vibrio ecology in the ocean--a review.

    PubMed

    Grimes, Darrell Jay; Johnson, Crystal N; Dillon, Kevin S; Flowers, Adrienne R; Noriea, Nicholas F; Berutti, Tracy

    2009-10-01

    To date, the genomes of eight Vibrio strains representing six species and three human pathogens have been fully sequenced and reported. This review compares genomic information revealed from these sequencing efforts and what we can infer about Vibrio biology and ecology from this and related genomic information. The focus of the review is on those attributes that allow the Vibrios to survive and even proliferate in their ocean habitats, which include seawater, plankton, invertebrates, fish, marine mammals, plants, man-made structures (surfaces), and particulate matter. Areas covered include general information about the eight genomes, each of which is distributed over two chromosomes; a discussion of expected and unusual genes found; attachment sites and mechanisms; utilization of particulate and dissolved organic matter; and conclusions.

  20. The complete genome sequences, unique mutational spectra and developmental potency of adult neurons revealed by cloning

    PubMed Central

    Rodriguez, Alberto R.; Ferguson, William C.; Shumilina, Svetlana; Clark, Royden A.; Boland, Michael J.; Martin, Greg; Chubukov, Pavel; Tsunemoto, Rachel K.; Torkamani, Ali; Kupriyanov, Sergey; Hall, Ira M.; Baldwin, Kristin K.

    2016-01-01

    Somatic mutation in neurons is linked to neurologic disease and implicated in cell type diversification. However, the origin, extent and patterns of genomic mutation in neurons remain unknown. We established a nuclear transfer method to clonally amplify the genomes of neurons from adult mice for whole genome sequencing. Comprehensive mutation detection and independent validation revealed that individual neurons harbor ~100 unique mutations from all classes, but lack recurrent rearrangements. Most neurons contain at least one gene disrupting mutation and rare (0-2) mobile element insertions. The frequency and gene bias of neuronal mutations differs from other lineages, potentially due to novel mechanisms governing post-mitotic mutation. Fertile mice were cloned from several neurons, establishing the compatibility of mutated adult neuronal genomes with reprogramming to pluripotency and development. PMID:26948891

  1. Comparative Genomics of Bifidobacterium animalis subsp. lactis Reveals a Strict Monophyletic Bifidobacterial Taxon

    PubMed Central

    Milani, Christian; Duranti, Sabrina; Lugli, Gabriele Andrea; Bottacini, Francesca; Strati, Francesco; Arioli, Stefania; Foroni, Elena; Turroni, Francesca; van Sinderen, Douwe

    2013-01-01

    Strains of Bifidobacterium animalis subsp. lactis are extensively exploited by the food industry as health-promoting bacteria, although the genetic variability of members belonging to this taxon has so far not received much scientific attention. In this article, we describe the complete genetic makeup of the B. animalis subsp. lactis Bl12 genome and discuss the genetic relatedness of this strain with other sequenced strains belonging to this taxon. Moreover, a detailed comparative genomic analysis of B. animalis subsp. lactis genomes was performed, which revealed a closely related and isogenic nature of all currently available B. animalis subsp. lactis strains, thus strongly suggesting a closed pan-genome structure of this bacterial group. PMID:23645200

  2. The Complete Genome Sequences, Unique Mutational Spectra, and Developmental Potency of Adult Neurons Revealed by Cloning.

    PubMed

    Hazen, Jennifer L; Faust, Gregory G; Rodriguez, Alberto R; Ferguson, William C; Shumilina, Svetlana; Clark, Royden A; Boland, Michael J; Martin, Greg; Chubukov, Pavel; Tsunemoto, Rachel K; Torkamani, Ali; Kupriyanov, Sergey; Hall, Ira M; Baldwin, Kristin K

    2016-03-16

    Somatic mutation in neurons is linked to neurologic disease and implicated in cell-type diversification. However, the origin, extent, and patterns of genomic mutation in neurons remain unknown. We established a nuclear transfer method to clonally amplify the genomes of neurons from adult mice for whole-genome sequencing. Comprehensive mutation detection and independent validation revealed that individual neurons harbor ∼100 unique mutations from all classes but lack recurrent rearrangements. Most neurons contain at least one gene-disrupting mutation and rare (0-2) mobile element insertions. The frequency and gene bias of neuronal mutations differ from other lineages, potentially due to novel mechanisms governing postmitotic mutation. Fertile mice were cloned from several neurons, establishing the compatibility of mutated adult neuronal genomes with reprogramming to pluripotency and development.

  3. COMPARISON OF COMPARATIVE GENOMIC HYBRIDIZATIONS TECHNOLOGIES ACROSS MICROARRAY PLATFORMS

    EPA Science Inventory

    Comparative Genomic Hybridization (CGH) measures DNA copy number differences between a reference genome and a test genome. The DNA samples are differentially labeled and hybridized to an immobilized substrate. In early CGH experiments, the DNA targets were hybridized to metaphase...

  4. Genomic Characterization of a Pattern D Streptococcus pyogenes emm53 Isolate Reveals a Genetic Rationale for Invasive Skin Tropicity

    PubMed Central

    Bao, Yun-Juan; Liang, Zhong; Mayfield, Jeffrey A.; Donahue, Deborah L.; Carothers, Katelyn E.; Lee, Shaun W.; Ploplis, Victoria A.

    2016-01-01

    ABSTRACT The genome of an invasive skin-tropic strain (AP53) of serotype M53 group A Streptococcus pyogenes (GAS) is composed of a circular chromosome of 1,860,554 bp and carries genetic markers for infection at skin locales, viz., emm gene family pattern D and FCT type 3. Through genome-scale comparisons of AP53 with other GAS genomes, we identified 596 candidate single-nucleotide polymorphisms (SNPs) that reveal a potential genetic basis for skin tropism. The genome of AP53 differed by ∼30 point mutations from a noninvasive pattern D serotype M53 strain (Alab49), 4 of which are located in virulence genes. One pseudogene, yielding an inactive sensor kinase (CovS−) of the two-component transcriptional regulator CovRS, a major determinant for invasiveness, severely attenuated the expression of the secreted cysteine protease SpeB and enhanced the expression of the hyaluronic acid capsule compared to the isogenic noninvasive AP53/CovS+ strain. The collagen-binding protein transcript sclB differed in the number of 5′-pentanucleotide repeats in the signal peptides of AP53 and Alab49 (9 versus 15), translating into different lengths of their signal peptides, which nonetheless maintained a full-length translatable coding frame. Furthermore, GAS strain AP53 acquired two phages that are absent in Alab49. One such phage (ΦAP53.2) contains the known virulence factor superantigen exotoxin gene tandem speK-slaA. Overall, we conclude that this bacterium has evolved in multiple ways, including mutational variations of regulatory genes, short-tandem-repeat polymorphisms, large-scale genomic alterations, and acquisition of phages, all of which may be involved in shaping the adaptation of GAS in specific infectious environments and contribute to its enhanced virulence. IMPORTANCE Infectious strains of S. pyogenes (GAS) are classified by their serotypes, relating to the surface M protein, the emm-like subfamily pattern, and their tropicity toward the nasopharynx and/or skin

  5. Genomic Analysis by Deep Sequencing of the Probiotic Lactobacillus brevis KB290 Harboring Nine Plasmids Reveals Genomic Stability

    PubMed Central

    Fukao, Masanori; Oshima, Kenshiro; Morita, Hidetoshi; Toh, Hidehiro; Suda, Wataru; Kim, Seok-Won; Suzuki, Shigenori; Yakabe, Takafumi; Hattori, Masahira; Yajima, Nobuhiro

    2013-01-01

    We determined the complete genome sequence of Lactobacillus brevis KB290, a probiotic lactic acid bacterium isolated from a traditional Japanese fermented vegetable. The genome contained a 2,395,134-bp chromosome that housed 2,391 protein-coding genes and nine plasmids that together accounted for 191 protein-coding genes. KB290 contained no virulence factor genes, and several genes related to presumptive cell wall-associated polysaccharide biosynthesis and the stress response were present in L. brevis KB290 but not in the closely related L. brevis ATCC 367. Plasmid-curing experiments revealed that the presence of plasmid pKB290-1 was essential for the strain's gastrointestinal tract tolerance and tendency to aggregate. Using next-generation deep sequencing of current and 18-year-old stock strains to detect low frequency variants, we evaluated genome stability. Deep sequencing of four periodic KB290 culture stocks with more than 1,000-fold coverage revealed 3 mutation sites and 37 minority variation sites, indicating long-term stability and providing a useful method for assessing the stability of industrial bacteria at the nucleotide level. PMID:23544154

  6. Integrated Consensus Map of Cultivated Peanut and Wild Relatives Reveals Structures of the A and B Genomes of Arachis and Divergence of the Legume Genomes

    PubMed Central

    Shirasawa, Kenta; Bertioli, David J.; Varshney, Rajeev K.; Moretzsohn, Marcio C.; Leal-Bertioli, Soraya C. M.; Thudi, Mahendar; Pandey, Manish K.; Rami, Jean-Francois; Foncéka, Daniel; Gowda, Makanahally V. C.; Qin, Hongde; Guo, Baozhu; Hong, Yanbin; Liang, Xuanqiang; Hirakawa, Hideki; Tabata, Satoshi; Isobe, Sachiko

    2013-01-01

    The complex, tetraploid genome structure of peanut (Arachis hypogaea) has obstructed advances in genetics and genomics in the species. The aim of this study is to understand the genome structure of Arachis by developing a high-density integrated consensus map. Three recombinant inbred line populations derived from crosses between the A genome diploid species, Arachis duranensis and Arachis stenosperma; the B genome diploid species, Arachis ipaënsis and Arachis magna; and between the AB genome tetraploids, A. hypogaea and an artificial amphidiploid (A. ipaënsis × A. duranensis)4×, were used to construct genetic linkage maps: 10 linkage groups (LGs) of 544 cM with 597 loci for the A genome; 10 LGs of 461 cM with 798 loci for the B genome; and 20 LGs of 1442 cM with 1469 loci for the AB genome. The resultant maps plus 13 published maps were integrated into a consensus map covering 2651 cM with 3693 marker loci which was anchored to 20 consensus LGs corresponding to the A and B genomes. The comparative genomics with genome sequences of Cajanus cajan, Glycine max, Lotus japonicus, and Medicago truncatula revealed that the Arachis genome has segmented synteny relationship to the other legumes. The comparative maps in legumes, integrated tetraploid consensus maps, and genome-specific diploid maps will increase the genetic and genomic understanding of Arachis and should facilitate molecular breeding. PMID:23315685

  7. Integrated consensus map of cultivated peanut and wild relatives reveals structures of the A and B genomes of Arachis and divergence of the legume genomes.

    PubMed

    Shirasawa, Kenta; Bertioli, David J; Varshney, Rajeev K; Moretzsohn, Marcio C; Leal-Bertioli, Soraya C M; Thudi, Mahendar; Pandey, Manish K; Rami, Jean-Francois; Foncéka, Daniel; Gowda, Makanahally V C; Qin, Hongde; Guo, Baozhu; Hong, Yanbin; Liang, Xuanqiang; Hirakawa, Hideki; Tabata, Satoshi; Isobe, Sachiko

    2013-04-01

    The complex, tetraploid genome structure of peanut (Arachis hypogaea) has obstructed advances in genetics and genomics in the species. The aim of this study is to understand the genome structure of Arachis by developing a high-density integrated consensus map. Three recombinant inbred line populations derived from crosses between the A genome diploid species, Arachis duranensis and Arachis stenosperma; the B genome diploid species, Arachis ipaënsis and Arachis magna; and between the AB genome tetraploids, A. hypogaea and an artificial amphidiploid (A. ipaënsis × A. duranensis)(4×), were used to construct genetic linkage maps: 10 linkage groups (LGs) of 544 cM with 597 loci for the A genome; 10 LGs of 461 cM with 798 loci for the B genome; and 20 LGs of 1442 cM with 1469 loci for the AB genome. The resultant maps plus 13 published maps were integrated into a consensus map covering 2651 cM with 3693 marker loci which was anchored to 20 consensus LGs corresponding to the A and B genomes. The comparative genomics with genome sequences of Cajanus cajan, Glycine max, Lotus japonicus, and Medicago truncatula revealed that the Arachis genome has segmented synteny relationship to the other legumes. The comparative maps in legumes, integrated tetraploid consensus maps, and genome-specific diploid maps will increase the genetic and genomic understanding of Arachis and should facilitate molecular breeding.

  8. The first aurochs genome reveals the breeding history of British and European cattle.

    PubMed

    Orlando, Ludovic

    2015-10-26

    The first genome sequence of the extinct European wild aurochs reveals the genetic foundation of native British and Irish landraces of cattle.See related Research article: www.dx.doi.org/10.1186/s13059-015-0790-2.

  9. Genome-wide transcript profiling reveals novel breast cancer-associated intronic sense RNAs.

    PubMed

    Kim, Sang Woo; Fishilevich, Elane; Arango-Argoty, Gustavo; Lin, Yuefeng; Liu, Guodong; Li, Zhihua; Monaghan, A Paula; Nichols, Mark; John, Bino

    2015-01-01

    Non-coding RNAs (ncRNAs) play major roles in development and cancer progression. To identify novel ncRNAs that may identify key pathways in breast cancer development, we performed high-throughput transcript profiling of tumor and normal matched-pair tissue samples. Initial transcriptome profiling using high-density genome-wide tiling arrays revealed changes in over 200 novel candidate genomic regions that map to intronic regions. Sixteen genomic loci were identified that map to the long introns of five key protein-coding genes, CRIM1, EPAS1, ZEB2, RBMS1, and RFX2. Consistent with the known role of the tumor suppressor ZEB2 in the cancer-associated epithelial to mesenchymal transition (EMT), in situ hybridization reveals that the intronic regions deriving from ZEB2 as well as those from RFX2 and EPAS1 are down-regulated in cells of epithelial morphology, suggesting that these regions may be important for maintaining normal epithelial cell morphology. Paired-end deep sequencing analysis reveals a large number of distinct genomic clusters with no coding potential within the introns of these genes. These novel transcripts are only transcribed from the coding strand. A comprehensive search for breast cancer associated genes reveals enrichment for transcribed intronic regions from these loci, pointing to an underappreciated role of introns or mechanisms relating to their biology in EMT and breast cancer.

  10. Genome-Wide Transcript Profiling Reveals Novel Breast Cancer-Associated Intronic Sense RNAs

    PubMed Central

    Lin, Yuefeng; Liu, Guodong; Li, Zhihua; Monaghan, A. Paula; Nichols, Mark; John, Bino

    2015-01-01

    Non-coding RNAs (ncRNAs) play major roles in development and cancer progression. To identify novel ncRNAs that may identify key pathways in breast cancer development, we performed high-throughput transcript profiling of tumor and normal matched-pair tissue samples. Initial transcriptome profiling using high-density genome-wide tiling arrays revealed changes in over 200 novel candidate genomic regions that map to intronic regions. Sixteen genomic loci were identified that map to the long introns of five key protein-coding genes, CRIM1, EPAS1, ZEB2, RBMS1, and RFX2. Consistent with the known role of the tumor suppressor ZEB2 in the cancer-associated epithelial to mesenchymal transition (EMT), in situ hybridization reveals that the intronic regions deriving from ZEB2 as well as those from RFX2 and EPAS1 are down-regulated in cells of epithelial morphology, suggesting that these regions may be important for maintaining normal epithelial cell morphology. Paired-end deep sequencing analysis reveals a large number of distinct genomic clusters with no coding potential within the introns of these genes. These novel transcripts are only transcribed from the coding strand. A comprehensive search for breast cancer associated genes reveals enrichment for transcribed intronic regions from these loci, pointing to an underappreciated role of introns or mechanisms relating to their biology in EMT and breast cancer. PMID:25798919

  11. Genome sequence of the necrotrophic plant pathogen Pythium ultimum reveals original pathogenicity mechanisms and effector repertoire.

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The P. ultimum DAOM BR144 (=CBS 805.95 = ATCC200006) genome (42.8 Mb) encodes 15,290 genes, and has extensive sequence similarity and synteny with related Phytophthora spp., including the potato late blight pathogen Phytophthora infestans. Whole transcriptome sequencing revealed expression of 86 % o...

  12. Draft Genome Sequence of Arthrobacter crystallopoietes Strain BAB-32, Revealing Genes for Bioremediation

    PubMed Central

    Joshi, M. N.; Pandit, A. S.; Sharma, A.; Pandya, R. V.; Desai, S. M.; Saxena, A. K.

    2013-01-01

    Arthrobacter crystallopoietes strain BAB-32, a Gram-positive obligate aerobic actinobacterium having potential application in bioremediation and bioreduction of a few metals, was isolated from rhizosphere soil of Gandhinagar, Gujarat, India. The draft genome (4.3 Mb) of the strain revealed a few vital gene clusters involved in the metabolism of aromatic compounds, zinc, and sulfur. PMID:23833141

  13. Comparative genomic analysis of clinical and environmental Vibrio vulnificus isolates revealed biotype 3 evolutionary relationships

    PubMed Central

    Koton, Yael; Gordon, Michal; Chalifa-Caspi, Vered; Bisharat, Naiel

    2015-01-01

    In 1996 a common-source outbreak of severe soft tissue and bloodstream infections erupted among Israeli fish farmers and fish consumers due to changes in fish marketing policies. The causative pathogen was a new strain of Vibrio vulnificus, named biotype 3, which displayed a unique biochemical and genotypic profile. Initial observations suggested that the pathogen erupted as a result of genetic recombination between two distinct populations. We applied a whole genome shotgun sequencing approach using several V. vulnificus strains from Israel in order to study the pan genome of V. vulnificus and determine the phylogenetic relationship of biotype 3 with existing populations. The core genome of V. vulnificus based on 16 draft and complete genomes consisted of 3068 genes, representing between 59 and 78% of the whole genome of 16 strains. The accessory genome varied in size from 781 to 2044 kbp. Phylogenetic analysis based on whole, core, and accessory genomes displayed similar clustering patterns with two main clusters, clinical (C) and environmental (E), all biotype 3 strains formed a distinct group within the E cluster. Annotation of accessory genomic regions found in biotype 3 strains and absent from the core genome yielded 1732 genes, of which the vast majority encoded hypothetical proteins, phage-related proteins, and mobile element proteins. A total of 1916 proteins (including 713 hypothetical proteins) were present in all human pathogenic strains (both biotype 3 and non-biotype 3) and absent from the environmental strains. Clustering analysis of the non-hypothetical proteins revealed 148 protein clusters shared by all human pathogenic strains; these included transcriptional regulators, arylsulfatases, methyl-accepting chemotaxis proteins, acetyltransferases, GGDEF family proteins, transposases, type IV secretory system (T4SS) proteins, and integrases. Our study showed that V. vulnificus biotype 3 evolved from environmental populations and formed a genetically

  14. Complete chloroplast genome of a valuable medicinal plant, Huperzia serrata (Lycopodiaceae), and comparison with its congener1

    PubMed Central

    Guo, Zhi-You; Zhang, Hong-Rui; Shrestha, Nawal; Zhang, Xian-Chun

    2016-01-01

    Premise of the study: Here we report the complete chloroplast genome of the important medicinal species Huperzia serrata (Lycopodiaceae) and compare it to the chloroplast genome of the congeneric species H. lucidula. Methods and Results: The whole chloroplast genome of H. serrata was sequenced using an Illumina platform and assembled with Geneious version R9.0.5. The genome size of H. serrata was 154,176 bp, with 36.3% GC content. The complete chloroplast genome contained 120 unique genes, including 86 coding genes, four rRNA genes, and 30 tRNA genes. Comparison with the chloroplast genome of H. lucidula revealed three highly variable regions (rps16-chlB, ycf12-trnR, and ycf1) between these two species and 252 mutation events including 27 insertion/deletion polymorphisms and 225 single-nucleotide polymorphisms (SNPs). Ninety-two SNPs were identified in the gene-coding regions. In addition, 18 microsatellite sites were found, which can potentially be used in phylogeographic studies. Conclusions: The complete chloroplast genome of H. serrata is reported here, and will be a valuable genome resource for further phylogenetic, evolutionary, and medical studies of medicinal plants in the genus Huperzia. PMID:27843724

  15. Comparative Genomic Analyses of Multiple Pseudomonas Strains Infecting Corylus avellana Trees Reveal the Occurrence of Two Genetic Clusters with Both Common and Distinctive Virulence and Fitness Traits.

    PubMed

    Marcelletti, Simone; Scortichini, Marco

    2015-01-01

    The European hazelnut (Corylus avellana) is threatened in Europe by several pseudomonads which cause symptoms ranging from twig dieback to tree death. A comparison of the draft genomes of nine Pseudomonas strains isolated from symptomatic C. avellana trees was performed to identify common and distinctive genomic traits. The thorough assessment of genetic relationships among the strains revealed two clearly distinct clusters: P. avellanae and P. syringae. The latter including the pathovars avellanae, coryli and syringae. Between these two clusters, no recombination event was found. A genomic island of approximately 20 kb, containing the hrp/hrc type III secretion system gene cluster, was found to be present without any genomic difference in all nine pseudomonads. The type III secretion system effector repertoires were remarkably different in the two groups, with P. avellanae showing a higher number of effectors. Homologue genes of the antimetabolite mangotoxin and ice nucleation activity clusters were found solely in all P. syringae pathovar strains, whereas the siderophore yersiniabactin was only present in P. avellanae. All nine strains have genes coding for pectic enzymes and sucrose metabolism. By contrast, they do not have genes coding for indolacetic acid and anti-insect toxin. Collectively, this study reveals that genomically different Pseudomonas can converge on the same host plant by suppressing the host defence mechanisms with the use of different virulence weapons. The integration into their genomes of a horizontally acquired genomic island could play a fundamental role in their evolution, perhaps giving them the ability to exploit new ecological niches.

  16. Comparative Genomic Analyses of Multiple Pseudomonas Strains Infecting Corylus avellana Trees Reveal the Occurrence of Two Genetic Clusters with Both Common and Distinctive Virulence and Fitness Traits

    PubMed Central

    Marcelletti, Simone; Scortichini, Marco

    2015-01-01

    The European hazelnut (Corylus avellana) is threatened in Europe by several pseudomonads which cause symptoms ranging from twig dieback to tree death. A comparison of the draft genomes of nine Pseudomonas strains isolated from symptomatic C. avellana trees was performed to identify common and distinctive genomic traits. The thorough assessment of genetic relationships among the strains revealed two clearly distinct clusters: P. avellanae and P. syringae. The latter including the pathovars avellanae, coryli and syringae. Between these two clusters, no recombination event was found. A genomic island of approximately 20 kb, containing the hrp/hrc type III secretion system gene cluster, was found to be present without any genomic difference in all nine pseudomonads. The type III secretion system effector repertoires were remarkably different in the two groups, with P. avellanae showing a higher number of effectors. Homologue genes of the antimetabolite mangotoxin and ice nucleation activity clusters were found solely in all P. syringae pathovar strains, whereas the siderophore yersiniabactin was only present in P. avellanae. All nine strains have genes coding for pectic enzymes and sucrose metabolism. By contrast, they do not have genes coding for indolacetic acid and anti-insect toxin. Collectively, this study reveals that genomically different Pseudomonas can converge on the same host plant by suppressing the host defence mechanisms with the use of different virulence weapons. The integration into their genomes of a horizontally acquired genomic island could play a fundamental role in their evolution, perhaps giving them the ability to exploit new ecological niches. PMID:26147218

  17. The genome and linkage map of the northern pike (Esox lucius): conserved synteny revealed between the salmonid sister group and the Neoteleostei.

    PubMed

    Rondeau, Eric B; Minkley, David R; Leong, Jong S; Messmer, Amber M; Jantzen, Johanna R; von Schalburg, Kristian R; Lemon, Craig; Bird, Nathan H; Koop, Ben F

    2014-01-01

    The northern pike is the most frequently studied member of the Esociformes, the closest order to the diverse and economically important Salmoniformes. The ancestor of all salmonids purportedly experienced a whole-genome duplication (WGD) event, making salmonid species ideal for studying the early impacts of genome duplication while complicating their use in wider analyses of teleost evolution. Studies suggest that the Esociformes diverged from the salmonid lineage prior to the WGD, supporting the use of northern pike as a pre-duplication outgroup. Here we present the first genome assembly, reference transcriptome and linkage map for northern pike, and evaluate the suitability of this species to provide a representative pre-duplication genome for future studies of salmonid and teleost evolution. The northern pike genome sequence is composed of 94,267 contigs (N50 = 16,909 bp) contained in 5,688 scaffolds (N50 = 700,535 bp); the total scaffolded genome size is 878 million bases. Multiple lines of evidence suggest that over 96% of the protein-coding genome is present in the genome assembly. The reference transcriptome was constructed from 13 tissues and contains 38,696 transcripts, which are accompanied by normalized expression data in all tissues. Gene-prediction analysis produced a total of 19,601 northern pike-specific gene models. The first-generation linkage map identifies 25 linkage groups, in agreement with northern pike's diploid karyotype of 2N = 50, and facilitates the placement of 46% of assembled bases onto linkage groups. Analyses reveal a high degree of conserved synteny between northern pike and other model teleost genomes. While conservation of gene order is limited to smaller syntenic blocks, the wider conservation of genome organization implies the northern pike exhibits a suitable approximation of a non-duplicated Protacanthopterygiian genome. This dataset will facilitate future studies of esocid biology and empower ongoing examinations of the

  18. Large-Scale Comparative Genomics Meta-Analysis of Campylobacter jejuni Isolates Reveals Low Level of Genome Plasticity

    PubMed Central

    Taboada, Eduardo N.; Acedillo, Rey R.; Carrillo, Catherine D.; Findlay, Wendy A.; Medeiros, Diane T.; Mykytczuk, Oksana L.; Roberts, Michael J.; Valencia, C. Alexander; Farber, Jeffrey M.; Nash, John H. E.

    2004-01-01

    We have used comparative genomic hybridization (CGH) on a full-genome Campylobacter jejuni microarray to examine genome-wide gene conservation patterns among 51 strains isolated from food and clinical sources. These data have been integrated with data from three previous C. jejuni CGH studies to perform a meta-analysis that included 97 strains from the four separate data sets. Although many genes were found to be divergent across multiple strains (n = 350), many genes (n = 249) were uniquely variable in single strains. Thus, the strains in each data set comprise strains with a unique genetic diversity not found in the strains in the other data sets. Despite the large increase in the collective number of variable C. jejuni genes (n = 599) found in the meta-analysis data set, nearly half of these (n = 276) mapped to previously defined variable loci, and it therefore appears that large regions of the C. jejuni genome are genetically stable. A detailed analysis of the microarray data revealed that divergent genes could be differentiated on the basis of the amplitudes of their differential microarray signals. Of 599 variable genes, 122 could be classified as highly divergent on the basis of CGH data. Nearly all highly divergent genes (117 of 122) had divergent neighbors and showed high levels of intraspecies variability. The approach outlined here has enabled us to distinguish global trends of gene conservation in C. jejuni and has enabled us to define this group of genes as a robust set of variable markers that can become the cornerstone of a new generation of genotyping methods that use genome-wide C. jejuni gene variability data. PMID:15472310

  19. Comparative and functional triatomine genomics reveals reductions and expansions in insecticide resistance-related gene families

    PubMed Central

    Traverso, Lucila; Lavore, Andrés; Sierra, Ivana; Palacio, Victorio; Martinez-Barnetche, Jesús; Latorre-Estivalis, José Manuel; Mougabure-Cueto, Gaston; Francini, Flavio; Lorenzo, Marcelo G.; Rodríguez, Mario Henry; Ons, Sheila; Rivera-Pomar, Rolando V.

    2017-01-01

    Background Triatomine insects are vectors of Trypanosoma cruzi, a protozoan parasite that is the causative agent of Chagas’ disease. This is a neglected disease affecting approximately 8 million people in Latin America. The existence of diverse pyrethroid resistant populations of at least two species demonstrates the potential of triatomines to develop high levels of insecticide resistance. Therefore, the incorporation of strategies for resistance management is a main concern for vector control programs. Three enzymatic superfamilies are thought to mediate xenobiotic detoxification and resistance: Glutathione Transferases (GSTs), Cytochromes P450 (CYPs) and Carboxyl/Cholinesterases (CCEs). Improving our knowledge of key triatomine detoxification enzymes will strengthen our understanding of insecticide resistance processes in vectors of Chagas’ disease. Methods and findings The discovery and description of detoxification gene superfamilies in normalized transcriptomes of three triatomine species: Triatoma dimidiata, Triatoma infestans and Triatoma pallidipennis is presented. Furthermore, a comparative analysis of these superfamilies among the triatomine transcriptomes and the genome of Rhodnius prolixus, also a triatomine vector of Chagas’ disease, and other well-studied insect genomes was performed. The expression pattern of detoxification genes in R. prolixus transcriptomes from key organs was analyzed. The comparisons reveal gene expansions in Sigma class GSTs, CYP3 in CYP superfamily and clade E in CCE superfamily. Moreover, several CYP families identified in these triatomines have not yet been described in other insects. Conversely, several groups of insecticide resistance related enzymes within each enzyme superfamily are reduced or lacking in triatomines. Furthermore, our qRT-PCR results showed an increase in the expression of a CYP4 gene in a T. infestans population resistant to pyrethroids. These results could point to an involvement of metabolic

  20. Chloroplast Genome Sequence of Pigeonpea (Cajanus cajan (L.) Millspaugh) and Cajanus scarabaeoides (L.) Thouars: Genome Organization and Comparison with Other Legumes

    PubMed Central

    Kaila, Tanvi; Chaduvla, Pavan K.; Saxena, Swati; Bahadur, Kaushlendra; Gahukar, Santosh J.; Chaudhury, Ashok; Sharma, T. R.; Singh, N. K.; Gaikwad, Kishor

    2016-01-01

    Pigeonpea (Cajanus cajan (L.) Millspaugh), a diploid (2n = 22) legume crop with a genome size of 852 Mbp, serves as an important source of human dietary protein especially in South East Asian and African regions. In this study, the draft chloroplast genomes of Cajanus cajan and Cajanus scarabaeoides (L.) Thouars were generated. Cajanus scarabaeoides is an important species of the Cajanus gene pool and has also been used for developing promising CMS system by different groups. A male sterile genotype harboring the C. scarabaeoides cytoplasm was used for sequencing the plastid genome. The cp genome of C. cajan is 152,242bp long, having a quadripartite structure with LSC of 83,455 bp and SSC of 17,871 bp separated by IRs of 25,398 bp. Similarly, the cp genome of C. scarabaeoides is 152,201bp long, having a quadripartite structure in which IRs of 25,402 bp length separates 83,423 bp of LSC and 17,854 bp of SSC. The pigeonpea cp genome contains 116 unique genes, including 30 tRNA, 4 rRNA, 78 predicted protein coding genes and 5 pseudogenes. A 50 kb inversion was observed in the LSC region of pigeonpea cp genome, consistent with other legumes. Comparison of cp genome with other legumes revealed the contraction of IR boundaries due to the absence of rps19 gene in the IR region. Chloroplast SSRs were mined and a total of 280 and 292 cpSSRs were identified in C. scarabaeoides and C. cajan respectively. RNA editing was observed at 37 sites in both C. scarabaeoides and C. cajan, with maximum occurrence in the ndh genes. The pigeonpea cp genome sequence would be beneficial in providing informative molecular markers which can be utilized for genetic diversity analysis and aid in understanding the plant systematics studies among major grain legumes. PMID:28018385

  1. Chloroplast Genome Sequence of Pigeonpea (Cajanus cajan (L.) Millspaugh) and Cajanus scarabaeoides (L.) Thouars: Genome Organization and Comparison with Other Legumes.

    PubMed

    Kaila, Tanvi; Chaduvla, Pavan K; Saxena, Swati; Bahadur, Kaushlendra; Gahukar, Santosh J; Chaudhury, Ashok; Sharma, T R; Singh, N K; Gaikwad, Kishor

    2016-01-01

    Pigeonpea (Cajanus cajan (L.) Millspaugh), a diploid (2n = 22) legume crop with a genome size of 852 Mbp, serves as an important source of human dietary protein especially in South East Asian and African regions. In this study, the draft chloroplast genomes of Cajanus cajan and Cajanus scarabaeoides (L.) Thouars were generated. Cajanus scarabaeoides is an important species of the Cajanus gene pool and has also been used for developing promising CMS system by different groups. A male sterile genotype harboring the C. scarabaeoides cytoplasm was used for sequencing the plastid genome. The cp genome of C. cajan is 152,242bp long, having a quadripartite structure with LSC of 83,455 bp and SSC of 17,871 bp separated by IRs of 25,398 bp. Similarly, the cp genome of C. scarabaeoides is 152,201bp long, having a quadripartite structure in which IRs of 25,402 bp length separates 83,423 bp of LSC and 17,854 bp of SSC. The pigeonpea cp genome contains 116 unique genes, including 30 tRNA, 4 rRNA, 78 predicted protein coding genes and 5 pseudogenes. A 50 kb inversion was observed in the LSC region of pigeonpea cp genome, consistent with other legumes. Comparison of cp genome with other legumes revealed the contraction of IR boundaries due to the absence of rps19 gene in the IR region. Chloroplast SSRs were mined and a total of 280 and 292 cpSSRs were identified in C. scarabaeoides and C. cajan respectively. RNA editing was observed at 37 sites in both C. scarabaeoides and C. cajan, with maximum occurrence in the ndh genes. The pigeonpea cp genome sequence would be beneficial in providing informative molecular markers which can be utilized for genetic diversity analysis and aid in understanding the plant systematics studies among major grain legumes.

  2. Genome-wide gene expression profiling reveals unsuspected molecular alterations in pemphigus foliaceus

    PubMed Central

    Malheiros, Danielle; Panepucci, Rodrigo A; Roselino, Ana M; Araújo, Amélia G; Zago, Marco A; Petzl-Erler, Maria Luiza

    2014-01-01

    Pemphigus foliaceus (PF) is a complex autoimmune disease characterized by bullous skin lesions and the presence of antibodies against desmoglein 1. In this study we sought to contribute to a better understanding of the molecular processes in endemic PF, as the identification of factors that participate in the pathogenesis is a prerequisite for understanding its biological basis and may lead to novel therapeutic interventions. CD4+ T lymphocytes are central to the development of the disease. Therefore, we compared genome-wide gene expression profiles of peripheral CD4+ T cells of various PF patient subgroups with each other and with that of healthy individuals. The patient sample was subdivided into three groups: untreated patients with the generalized form of the disease, patients submitted to immunosuppressive treatment, and patients with the localized form of the disease. Comparisons between different subgroups resulted in 135, 54 and 64 genes differentially expressed. These genes are mainly related to lymphocyte adhesion and migration, apoptosis, cellular proliferation, cytotoxicity and antigen presentation. Several of these genes were differentially expressed when comparing lesional and uninvolved skin from the same patient. The chromosomal regions 19q13 and 12p13 concentrate differentially expressed genes and are candidate regions for PF susceptibility genes and disease markers. Our results reveal genes involved in disease severity, potential therapeutic targets and previously unsuspected processes involved in the pathogenesis. Besides, this study adds original information that will contribute to the understanding of PF's pathogenesis and of the still poorly defined in vivo functions of most of these genes. PMID:24813052

  3. Comparative chloroplast genomics reveals the evolution of Pinaceae genera and subfamilies.

    PubMed

    Lin, Ching-Ping; Huang, Jen-Pan; Wu, Chung-Shien; Hsu, Chih-Yao; Chaw, Shu-Miaw

    2010-01-01

    As the largest and the basal-most family of conifers, Pinaceae provides key insights into the evolutionary history of conifers. We present comparative chloroplast genomics and analysis of concatenated 49 chloroplast protein-coding genes common to 19 gymnosperms, including 15 species from 8 Pinaceous genera, to address the long-standing controversy about Pinaceae phylogeny. The complete cpDNAs of Cathaya argyrophylla and Cedrus deodara (Abitoideae) and draft cpDNAs of Larix decidua, Picea morrisonicola, and Pseudotsuga wilsoniana are reported. We found 21- and 42-kb inversions in congeneric species and different populations of Pinaceous species, which indicates that structural polymorphics may be common and ancient in Pinaceae. Our phylogenetic analyses reveal that Cedrus is clustered with Abies-Keteleeria rather than the basal-most genus of Pinaceae and that Cathaya is closer to Pinus than to Picea or Larix-Pseudotsuga. Topology and structural change tests and indel-distribution comparisons lend further evidence to our phylogenetic finding. Our molecular datings suggest that Pinaceae first evolved during Early Jurassic, and diversification of Pinaceous subfamilies and genera took place during Mid-Jurassic and Lower Cretaceous, respectively. Using different maximum-likelihood divergences as thresholds, we conclude that 2 (Abietoideae and Larix-Pseudotsuga-Piceae-Cathaya-Pinus), 4 (Cedrus, non-Cedrus Abietoideae, Larix-Pseudotsuga, and Piceae-Cathaya-Pinus), or 5 (Cedrus, non-Cedrus Abietoideae, Larix-Pseudotsuga, Picea, and Cathaya-Pinus) groups/subfamilies are more reasonable delimitations for Pinaceae. Specifically, our views on subfamilial classifications differ from previous studies in terms of the rank of Cedrus and with recognition of more than two subfamilies.

  4. Comparative Chloroplast Genomics Reveals the Evolution of Pinaceae Genera and Subfamilies

    PubMed Central

    Lin, Ching-Ping; Huang, Jen-Pan; Wu, Chung-Shien; Hsu, Chih-Yao; Chaw, Shu-Miaw

    2010-01-01

    As the largest and the basal-most family of conifers, Pinaceae provides key insights into the evolutionary history of conifers. We present comparative chloroplast genomics and analysis of concatenated 49 chloroplast protein-coding genes common to 19 gymnosperms, including 15 species from 8 Pinaceous genera, to address the long-standing controversy about Pinaceae phylogeny. The complete cpDNAs of Cathaya argyrophylla and Cedrus deodara (Abitoideae) and draft cpDNAs of Larix decidua, Picea morrisonicola, and Pseudotsuga wilsoniana are reported. We found 21- and 42-kb inversions in congeneric species and different populations of Pinaceous species, which indicates that structural polymorphics may be common and ancient in Pinaceae. Our phylogenetic analyses reveal that Cedrus is clustered with Abies–Keteleeria rather than the basal-most genus of Pinaceae and that Cathaya is closer to Pinus than to Picea or Larix–Pseudotsuga. Topology and structural change tests and indel-distribution comparisons lend further evidence to our phylogenetic finding. Our molecular datings suggest that Pinaceae first evolved during Early Jurassic, and diversification of Pinaceous subfamilies and genera took place during Mid-Jurassic and Lower Cretaceous, respectively. Using different maximum-likelihood divergences as thresholds, we conclude that 2 (Abietoideae and Larix–Pseudotsuga–Piceae–Cathaya–Pinus), 4 (Cedrus, non-Cedrus Abietoideae, Larix–Pseudotsuga, and Piceae–Cathaya–Pinus), or 5 (Cedrus, non-Cedrus Abietoideae, Larix–Pseudotsuga, Picea, and Cathaya–Pinus) groups/subfamilies are more reasonable delimitations for Pinaceae. Specifically, our views on subfamilial classifications differ from previous studies in terms of the rank of Cedrus and with recognition of more than two subfamilies. PMID:20651328

  5. Multiple genome sequences reveal adaptations of a phototrophic bacterium to sediment microenvironments.

    SciTech Connect

    Oda, Yasuhiro; Larimer, Frank W; Chain, Patrick S. G.; Malfatti, Stephanie; Shin, Maria V; Vergez, Lisa; Hauser, Loren John; Land, Miriam L; Braatsch, Stephan; Beatty, Thomas; Pelletier, Dale A; Schaefer, Amy L; Harwood, Caroline S

    2008-11-01

    The bacterial genus Rhodopseudomonas is comprised of photosynthetic bacteria found widely distributed in aquatic sediments. Members of the genus catalyze hydrogen gas production, carbon dioxide sequestration, and biomass turnover. The genome sequence of Rhodopseudomonas palustris CGA009 revealed a surprising richness of metabolic versatility that would seem to explain its ability to live in a heterogeneous environment like sediment. However, there is considerable genotypic diversity among Rhodopseudomonas isolates. Here we report the complete genome sequences of four additional members of the genus isolated from a restricted geographical area. The sequences confirm that the isolates belong to a coherent taxonomic unit, but they also have significant differences. Whole genome alignments show that the circular chromosomes of the isolates consist of a collinear backbone with a moderate number of genomic rearrangements that impact local gene order and orientation. There are 3,319 genes, 70% of the genes in each genome, shared by four or more strains. Between 10% and 18% of the genes in each genome are strain specific. Some of these genes suggest specialized physiological traits, which we verified experimentally, that include expanded light harvesting, oxygen respiration, and nitrogen fixation capabilities, as well as anaerobic fermentation. Strain-specific adaptations include traits that may be useful in bioenergy applications. This work suggests that against a backdrop of metabolic versatility that is a defining characteristic of Rhodopseudomonas, different ecotypes have evolved to take advantage of physical and chemical conditions in sediment microenvironments that are too small for human observation.

  6. The genome of the heartworm, Dirofilaria immitis, reveals drug and vaccine targets

    PubMed Central

    Godel, Christelle; Kumar, Sujai; Koutsovoulos, Georgios; Ludin, Philipp; Nilsson, Daniel; Comandatore, Francesco; Wrobel, Nicola; Thompson, Marian; Schmid, Christoph D.; Goto, Susumu; Bringaud, Frédéric; Wolstenholme, Adrian; Bandi, Claudio; Epe, Christian; Kaminsky, Ronald; Blaxter, Mark; Mäser, Pascal

    2012-01-01

    The heartworm Dirofilaria immitis is an important parasite of dogs. Transmitted by mosquitoes in warmer climatic zones, it is spreading across southern Europe and the Americas at an alarming pace. There is no vaccine, and chemotherapy is prone to complications. To learn more about this parasite, we have sequenced the genomes of D. immitis and its endosymbiont Wolbachia. We predict 10,179 protein coding genes in the 84.2 Mb of the nuclear genome, and 823 genes in the 0.9-Mb Wolbachia genome. The D. immitis genome harbors neither DNA transposons nor active retrotransposons, and there is very little genetic variation between two sequenced isolates from Europe and the United States. The differential presence of anabolic pathways such as heme and nucleotide biosynthesis hints at the intricate metabolic interrelationship between the heartworm and Wolbachia. Comparing the proteome of D. immitis with other nematodes and with mammalian hosts, we identify families of potential drug targets, immune modulators, and vaccine candidates. This genome sequence will support the development of new tools against dirofilariasis and aid efforts to combat related human pathogens, the causative agents of lymphatic filariasis and river blindness.—Godel, C., Kumar, S., Koutsovoulos, G., Ludin, P., Nilsson, D., Comandatore, F., Wrobel, N., Thompson, M., Schmid, C. D., Goto, S., Bringaud, F., Wolstenholme, A., Bandi, C., Epe, C., Kaminsky, R., Blaxter, M., Mäser, P. The genome of the heartworm, Dirofilaria immitis, reveals drug and vaccine targets. PMID:22889830

  7. Whole-Genome Sequencing Reveals Genetic Variation in the Asian House Rat

    PubMed Central

    Teng, Huajing; Zhang, Yaohua; Shi, Chengmin; Mao, Fengbiao; Hou, Lingling; Guo, Hongling; Sun, Zhongsheng; Zhang, Jianxu

    2016-01-01

    Whole-genome sequencing of wild-derived rat species can provide novel genomic resources, which may help decipher the genetics underlying complex phenotypes. As a notorious pest, reservoir of human pathogens, and colonizer, the Asian house rat, Rattus tanezumi, is successfully adapted to its habitat. However, little is known regarding genetic variation in this species. In this study, we identified over 41,000,000 single-nucleotide polymorphisms, plus insertions and deletions, through whole-genome sequencing and bioinformatics analyses. Moreover, we identified over 12,000 structural variants, including 143 chromosomal inversions. Further functional analyses revealed several fixed nonsense mutations associated with infection and immunity-related adaptations, and a number of fixed missense mutations that may be related to anticoagulant resistance. A genome-wide scan for loci under selection identified various genes related to neural activity. Our whole-genome sequencing data provide a genomic resource for future genetic studies of the Asian house rat species and have the potential to facilitate understanding of the molecular adaptations of rats to their ecological niches. PMID:27172215

  8. Sequencing the genome of Marssonina brunnea reveals fungus-poplar co-evolution

    PubMed Central

    2012-01-01

    Background The fungus Marssonina brunnea is a causal pathogen of Marssonina leaf spot that devastates poplar plantations by defoliating susceptible trees before normal fall leaf drop. Results We sequence the genome of M. brunnea with a size of 52 Mb assembled into 89 scaffolds, representing the first sequenced Dermateaceae genome. By inoculating this fungus onto a poplar hybrid clone, we investigate how M. brunnea interacts and co-evolves with its host to colonize poplar leaves. While a handful of virulence genes in M. brunnea, mostly from the LysM family, are detected to up-regulate during infection, the poplar down-regulates its resistance genes, such as nucleotide binding site domains and leucine rich repeats, in response to infection. From 10,027 predicted proteins of M. brunnea in a comparison with those from poplar, we identify four poplar transferases that stimulate the host to resist M. brunnea. These transferas-encoding genes may have driven the co-evolution of M. brunnea and Populus during the process of infection and anti-infection. Conclusions Our results from the draft sequence of the M. brunnea genome provide evidence for genome-genome interactions that play an important role in poplar-pathogen co-evolution. This knowledge could help to design effective strategies for controlling Marssonina leaf spot in poplar. PMID:22876864

  9. Bacterial DNA Sifted from the Trichoplax adhaerens (Animalia: Placozoa) Genome Project Reveals a Putative Rickettsial Endosymbiont

    PubMed Central

    Driscoll, Timothy; Gillespie, Joseph J.; Nordberg, Eric K.; Azad, Abdu F.; Sobral, Bruno W.

    2013-01-01

    Eukaryotic genome sequencing projects often yield bacterial DNA sequences, data typically considered as microbial contamination. However, these sequences may also indicate either symbiont genes or lateral gene transfer (LGT) to host genomes. These bacterial sequences can provide clues about eukaryote–microbe interactions. Here, we used the genome of the primitive animal Trichoplax adhaerens (Metazoa: Placozoa), which is known to harbor an uncharacterized Gram-negative endosymbiont, to search for the presence of bacterial DNA sequences. Bioinformatic and phylogenomic analyses of extracted data from the genome assembly (181 bacterial coding sequences [CDS]) and trace read archive (16S rDNA) revealed a dominant proteobacterial profile strongly skewed to Rickettsiales (Alphaproteobacteria) genomes. By way of phylogenetic analysis of 16S rDNA and 113 proteins conserved across proteobacterial genomes, as well as identification of 27 rickettsial signature genes, we propose a Rickettsiales endosymbiont of T. adhaerens (RETA). The majority (93%) of the identified bacterial CDS belongs to small scaffolds containing prokaryotic-like genes; however, 12 CDS were identified on large scaffolds comprised of eukaryotic-like genes, suggesting that T. adhaerens might have recently acquired bacterial genes. These putative LGTs may coincide with the placozoan’s aquatic niche and symbiosis with RETA. This work underscores the rich, and relatively untapped, resource of eukaryotic genome projects for harboring data pertinent to host–microbial interactions. The nature of unknown (or poorly characterized) bacterial species may only emerge via analysis of host genome sequencing projects, particularly if these species are resistant to cell culturing, as are many obligate intracellular microbes. Our work provides methodological insight for such an approach. PMID:23475938

  10. Geographic Population Structure in Epstein-Barr Virus Revealed by Comparative Genomics

    PubMed Central

    Chiara, Matteo; Manzari, Caterina; Lionetti, Claudia; Mechelli, Rosella; Anastasiadou, Eleni; Chiara Buscarinu, Maria; Ristori, Giovanni; Salvetti, Marco; Picardi, Ernesto; D’Erchia, Anna Maria; Pesole, Graziano; Horner, David S.

    2016-01-01

    Epstein-Barr virus (EBV) latently infects the majority of the human population and is implicated as a causal or contributory factor in numerous diseases. We sequenced 27 complete EBV genomes from a cohort of Multiple Sclerosis (MS) patients and healthy controls from Italy, although no variants showed a statistically significant association with MS. Taking advantage of the availability of ∼130 EBV genomes with known geographical origins, we reveal a striking geographic distribution of EBV sub-populations with distinct allele frequency distributions. We discuss mechanisms that potentially explain these observations, and their implications for understanding the association of EBV with human disease. PMID:27635051

  11. Genome-wide sequencing of Phytophthora lateralis reveals genetic variation among isolates from Lawson cypress (Chamaecyparis lawsoniana) in Northern Ireland.

    PubMed

    Quinn, Lisa; O'Neill, Paul A; Harrison, James; Paskiewicz, Konrad H; McCracken, Alistair R; Cooke, Louise R; Grant, Murray R; Studholme, David J

    2013-07-01

    Phytophthora lateralis is a fungus-like (oomycete) pathogen of trees in the family Cupressaceae, including Chamaecyparis lawsoniana (Lawson cypress or Port Orford cedar). Known in North America since the 1920s, presumably having been accidentally introduced from its assumed East Asian centre of origin, until recently, this pathogen has not been identified causing disease in Europe except for a few isolated outbreaks. However, since 2010, there have been several reports of infection of C. lawsoniana by P. lateralis in the United Kingdom, including Northern Ireland. We sequenced the genomes of four isolates of P. lateralis from two sites in Northern Ireland in 2011. Comparison with the closely related tree and shrub pathogen P. ramorum (cause of ramorum disease of larch and other species in the UK) shows that P. lateralis shares 91.47% nucleotide sequence identity over the core conserved compartments of the genome. The genomes of the four Northern Ireland isolates are almost identical, but we identified several single-nucleotide polymorphisms (SNPs) that distinguish between isolates, thereby presenting potential molecular markers of use for tracking routes of spread and in epidemiological studies. Our data reveal very low rates of heterozygosity (compared with P. ramorum), consistent with inbreeding within this P. lateralis population.

  12. Post-genomic analyses of fungal lignocellulosic biomass degradation reveal the unexpected potential of the plant pathogen Ustilago maydis

    PubMed Central

    2012-01-01

    Background Filamentous fungi are potent biomass degraders due to their ability to thrive in ligno(hemi)cellulose-rich environments. During the last decade, fungal genome sequencing initiatives have yielded abundant information on the genes that are putatively involved in lignocellulose degradation. At present, additional experimental studies are essential to provide insights into the fungal secreted enzymatic pools involved in lignocellulose degradation. Results In this study, we performed a wide analysis of 20 filamentous fungi for which genomic data are available to investigate their biomass-hydrolysis potential. A comparison of fungal genomes and secretomes using enzyme activity profiling revealed discrepancies in carbohydrate active enzymes (CAZymes) sets dedicated to plant cell wall. Investigation of the contribution made by each secretome to the saccharification of wheat straw demonstrated that most of them individually supplemented the industrial Trichoderma reesei CL847 enzymatic cocktail. Unexpectedly, the most striking effect was obtained with the phytopathogen Ustilago maydis that improved the release of total sugars by 57% and of glucose by 22%. Proteomic analyses of the best-performing secretomes indicated a specific enzymatic mechanism of U. maydis that is likely to involve oxido-reductases and hemicellulases. Conclusion This study provides insight into the lignocellulose-degradation mechanisms by filamentous fungi and allows for the identification of a number of enzymes that are potentially useful to further improve the industrial lignocellulose bioconversion process. PMID:22300648

  13. CAGO: a software tool for dynamic visual comparison and correlation measurement of genome organization.

    PubMed

    Chang, Yi-Feng; Chang, Chuan-Hsiung

    2011-01-01

    CAGO (Comparative Analysis of Genome Organization) is developed to address two critical shortcomings of conventional genome atlas plotters: lack of dynamic exploratory functions and absence of signal analysis for genomic properties. With dynamic exploratory functions, users can directly manipulate chromosome tracks of a genome atlas and intuitively identify distinct genomic signals by visual comparison. Signal analysis of genomic properties can further detect inconspicuous patterns from noisy genomic properties and calculate correlations between genomic properties across various genomes. To implement dynamic exploratory functions, CAGO presents each genome atlas in Scalable Vector Graphics (SVG) format and allows users to interact with it using a SVG viewer through JavaScript. Signal analysis functions are implemented using R statistical software and a discrete wavelet transformation package waveslim. CAGO is not only a plotter for generating complex genome atlases, but also a platform for exploring genome atlases with dynamic exploratory functions for visual comparison and with signal analysis for comparing genomic properties across multiple organisms. The web-based application of CAGO, its source code, user guides, video demos, and live examples are publicly available and can be accessed at http://cbs.ym.edu.tw/cago.

  14. Comparative genome analyses of Mycobacterium avium reveal genomic features of its subspecies and strains that cause progression of pulmonary disease

    PubMed Central

    Uchiya, Kei-ichi; Tomida, Shuta; Nakagawa, Taku; Asahi, Shoki; Nikai, Toshiaki; Ogawa, Kenji

    2017-01-01

    Pulmonary disease caused by nontuberculous mycobacteria (NTM) is increasing worldwide. Mycobacterium avium is the most clinically significant NTM species in humans and animals, and comprises four subspecies: M. avium subsp. avium (MAA), M. avium subsp. silvaticum (MAS), M. avium subsp. paratuberculosis (MAP), and M. avium subsp. hominissuis (MAH). To improve our understanding of the genetic landscape and diversity of M. avium and its role in disease, we performed a comparative genome analysis of 79 M. avium strains. Our analysis demonstrated that MAH is an open pan-genome species. Phylogenetic analysis based on single nucleotide variants showed that MAH had the highest degree of sequence variability among the subspecies, and MAH strains isolated in Japan and those isolated abroad possessed distinct phylogenetic features. Furthermore, MAP strains, MAS and MAA strains isolated from birds, and many MAH strains that cause the progression of pulmonary disease were grouped in each specific cluster. Comparative genome analysis revealed the presence of genetic elements specific to each lineage, which are thought to be acquired via horizontal gene transfer during the evolutionary process, and identified potential genetic determinants accounting for the pathogenic and host range characteristics of M. avium. PMID:28045086

  15. Trans genomic capture and sequencing of primate exomes reveals new targets of positive selection.

    PubMed

    George, Renee D; McVicker, Graham; Diederich, Rachel; Ng, Sarah B; MacKenzie, Alexandra P; Swanson, Willie J; Shendure, Jay; Thomas, James H

    2011-10-01

    Comparison of protein-coding DNA sequences from diverse primates can provide insight into these species' evolutionary history and uncover the molecular basis for their phenotypic differences. Currently, the number of available primate reference genomes limits these genome-wide comparisons. Here we use targeted capture methods designed for human to sequence the protein-coding regions, or exomes, of four non-human primate species (three Old World monkeys and one New World monkey). Despite average sequence divergence of up to 4% from the human sequence probes, we are able to capture ~96% of coding sequences. Using a combination of mapping and assembly techniques, we generated high-quality full-length coding sequences for each species. Both the number of nucleotide differences and the distribution of insertion and deletion (indel) lengths indicate that the quality of the assembled sequences is very high and exceeds that of most reference genomes. Using this expanded set of primate coding sequences, we performed a genome-wide scan for genes experiencing positive selection and identified a novel class of adaptively evolving genes involved in the conversion of epithelial cells in skin, hair, and nails to keratin. Interestingly, the genes we identify under positive selection also exhibit significantly increased allele frequency differences among human populations, suggesting that they play a role in both recent and long-term adaptation. We also identify several genes that have been lost on specific primate lineages, which illustrate the broad utility of this data set for other evolutionary analyses. These results demonstrate the power of second-generation sequencing in comparative genomics and greatly expand the repertoire of available primate coding sequences.

  16. Mutational strand asymmetries in cancer genomes reveal mechanisms of DNA damage and repair

    PubMed Central

    Haradhvala, Nicholas J.; Polak, Paz; Stojanov, Petar; Covington, Kyle R.; Shinbrot, Eve; Hess, Julian; Rheinbay, Esther; Kim, Jaegil; Maruvka, Yosef; Braunstein, Lior Z.; Kamburov, Atanas; Hanawalt, Philip C.; Wheeler, David A.; Koren, Amnon; Lawrence, Michael S.; Getz, Gad

    2016-01-01

    Mutational processes constantly shape the somatic genome, leading to immunity, aging, and other diseases. When cancer is the outcome, we are afforded a glimpse into these processes by the clonal expansion of the malignant cell. Here, we characterize a less explored layer of the mutational landscape of cancer: mutational asymmetries between the two DNA strands. Analyzing whole genome sequences of 590 tumors from 14 different cancer types, we reveal widespread asymmetries across mutagenic processes, with transcriptional (“T-class”) asymmetry dominating UV-, smoking-, and liver-cancer-associated mutations, and replicative (“R-class”) asymmetry dominating POLE-, APOBEC-, and MSI-associated mutations. We report a striking phenomenon of Transcription-Coupled Damage (TCD) on the non-transcribed DNA strand, and provide evidence that APOBEC mutagenesis occurs on the lagging-strand template during DNA replication. As more genomes are sequenced, studying and classifying their asymmetries will illuminate the underlying biological mechanisms of DNA damage and repair. PMID:26806129

  17. Whole genome comparison of donor and cloned dogs.

    PubMed

    Kim, Hak-Min; Cho, Yun Sung; Kim, Hyunmin; Jho, Sungwoong; Son, Bongjun; Choi, Joung Yoon; Kim, Sangsoo; Lee, Byeong Chun; Bhak, Jong; Jang, Goo

    2013-10-21

    Cloning is a process that produces genetically identical organisms. However, the genomic degree of genetic resemblance in clones needs to be determined. In this report, the genomes of a cloned dog and its donor were compared. Compared with a human monozygotic twin, the genome of the cloned dog showed little difference from the genome of the nuclear donor dog in terms of single nucleotide variations, chromosomal instability, and telomere lengths. These findings suggest that cloning by somatic cell nuclear transfer produced an almost identical genome. The whole genome sequence data of donor and cloned dogs can provide a resource for further investigations on epigenetic contributions in phenotypic differences.

  18. Genome sequencing reveals fine scale diversification and reticulation history during speciation in Sus

    PubMed Central

    2013-01-01

    Background Elucidating the process of speciation requires an in-depth understanding of the evolutionary history of the species in question. Studies that rely upon a limited number of genetic loci do not always reveal actual evolutionary history, and often confuse inferences related to phylogeny and speciation. Whole-genome data, however, can overcome this issue by providing a nearly unbiased window into the patterns and processes of speciation. In order to reveal the complexity of the speciation process, we sequenced and analyzed the genomes of 10 wild pigs, representing morphologically or geographically well-defined species and subspecies of the genus Sus from insular and mainland Southeast Asia, and one African common warthog. Results Our data highlight the importance of past cyclical climatic fluctuations in facilitating the dispersal and isolation of populations, thus leading to the diversification of suids in one of the most species-rich regions of the world. Moreover, admixture analyses revealed extensive, intra- and inter-specific gene-flow that explains previous conflicting results obtained from a limited number of loci. We show that these multiple episodes of gene-flow resulted from both natural and human-mediated dispersal. Conclusions Our results demonstrate the importance of past climatic fluctuations and human mediated translocations in driving and complicating the process of speciation in island Southeast Asia. This case study demonstrates that genomics is a powerful tool to decipher the evolutionary history of a genus, and reveals the complexity of the process of speciation. PMID:24070215

  19. Wavelet Analysis of DNA Bending Profiles reveals Structural Constraints on the Evolution of Genomic Sequences.

    PubMed

    Audit, Benjamin; Vaillant, Cédric; Arnéodo, Alain; d'Aubenton-Carafa, Yves; Thermes, Claude

    2004-03-01

    Analyses of genomic DNA sequences have shown in previous works that base pairs are correlated at large distances with scale-invariant statistical properties. We show in the present study that these correlations between nucleotides (letters) result in fact from long-range correlations (LRC) between sequence-dependent DNA structural elements (words) involved in the packaging of DNA in chromatin. Using the wavelet transform technique, we perform a comparative analysis of the DNA text and of the corresponding bending profiles generated with curvature tables based on nucleosome positioning data. This exploration through the optics of the so-called `wavelet transform microscope' reveals a characteristic scale of 100-200 bp that separates two regimes of different LRC. We focus here on the existence of LRC in the small-scale regime (≲ 200 bp). Analysis of genomes in the three kingdoms reveals that this regime is specifically associated to the presence of nucleosomes. Indeed, small scale LRC are observed in eukaryotic genomes and to a less extent in archaeal genomes, in contrast with their absence in eubacterial genomes. Similarly, this regime is observed in eukaryotic but not in bacterial viral DNA genomes. There is one exception for genomes of Poxviruses, the only animal DNA viruses that do not replicate in the cell nucleus and do not present small scale LRC. Furthermore, no small scale LRC are detected in the genomes of all examined RNA viruses, with one exception in the case of retroviruses. Altogether, these results strongly suggest that small-scale LRC are a signature of the nucleosomal structure. Finally, we discuss possible interpretations of these small-scale LRC in terms of the mechanisms that govern the positioning, the stability and the dynamics of the nucleosomes along the DNA chain. This paper is maily devoted to a pedagogical presentation of the theoretical concepts and physical methods which are well suited to perform a statistical analysis of genomic

  20. Genome-wide identification and comparison of legume MLO gene family.

    PubMed

    Rispail, Nicolas; Rubiales, Diego

    2016-09-06

    MLO proteins are highly conserved proteins with seven trans-membrane domains. Specific MLO genes have been linked to plant disease susceptibility. Others are involved in plant reproduction and in root thigmomorphogenesis. Functions of the remaining MLOs are still unknown. Here we performed a genome-wide survey of the MLO family in eight legume species from different clades of the Papillionoideae sub-family. A total of 118 MLO sequences were identified and characterized. Their deduced protein sequences shared the characteristics of MLO proteins. The total number of MLO genes per legume species varied from 13 to 20 depending on the species. Legume MLOs were evenly distributed over their genomes and tended to localize within syntenic blocks conserved across legume genomes. Phylogenetic analysis indicated that these sequences clustered in seven well-defined clades. Comparison of MLO protein sequences revealed 34 clade-specific motifs in the variable regions of the proteins. Comparative analyses of the MLO family between legume species also uncovered several evolutionary differences between the tropical legume species from the Phaseoloid clades and the other legume species. Altogether, this study provides interesting new features on the evolution of the MLO family. It also provides valuable clues to identify additional MLO genes from non-sequenced species.

  1. Insights into the evolution of Yersinia pestis through whole-genome comparison with Yersinia pseudotuberculosis

    SciTech Connect

    Chain, Patrick S. G.; Carniel, E.; Larimer, Frank W; Lamerdin, Jane; Vergez, Lisa; Land, Miriam L; Motin, V. L.; Brubaker, R. R.; Fowler, J.; Hinnebusch, J.; Marceau, M.; Medigue, Claudine; Chenal-Francisque, V.; Souza, B.; Dacheux, D.; Elliott, J. M.; Derbise, A.; Hauser, Loren John; Garcia, Emilio

    2004-09-01

    Yersinia pestis, the causative agent of plague, is a highly uniform clone that diverged recently from the enteric pathogen Yersinia pseudotuberculosis. Despite their close genetic relationship, they differ radically in their pathogenicity and transmission. Here, we report the complete genomic sequence of Y. pseudotuberculosis IP32953 and its use for detailed genome comparisons with available Y. pestis sequences. Analyses of identified differences across a panel of Yersinia isolates from around the world reveal 32 Y. pestis chromosomal genes that, together with the two Y. pestis-specific plasmids, to our knowledge, represent the only new genetic material in Y. pestis acquired since the the divergence from Y. pseudotuberculosis. In contrast, 149 other pseudogenes (doubling the previous estimate) and 317 genes absent from Y. pestis were detected, indicating that as many as 13% of Y. pseudotuberculosis genes no longer function in Y. pestis. Extensive insertion sequence-mediated genome rearrangements and reductive evolution through massive gene loss, resulting in elimination and modification of preexisting gene expression pathways, appear to be more important than acquisition of genes in the evolution of Y. pestis. These results provide a sobering example of how a highly virulent epidemic clone can suddenly emerge from a less virulent, closely related progenitor.

  2. Genome-wide identification and comparison of legume MLO gene family

    PubMed Central

    Rispail, Nicolas; Rubiales, Diego

    2016-01-01

    MLO proteins are highly conserved proteins with seven trans-membrane domains. Specific MLO genes have been linked to plant disease susceptibility. Others are involved in plant reproduction and in root thigmomorphogenesis. Functions of the remaining MLOs are still unknown. Here we performed a genome-wide survey of the MLO family in eight legume species from different clades of the Papillionoideae sub-family. A total of 118 MLO sequences were identified and characterized. Their deduced protein sequences shared the characteristics of MLO proteins. The total number of MLO genes per legume species varied from 13 to 20 depending on the species. Legume MLOs were evenly distributed over their genomes and tended to localize within syntenic blocks conserved across legume genomes. Phylogenetic analysis indicated that these sequences clustered in seven well-defined clades. Comparison of MLO protein sequences revealed 34 clade-specific motifs in the variable regions of the proteins. Comparative analyses of the MLO family between legume species also uncovered several evolutionary differences between the tropical legume species from the Phaseoloid clades and the other legume species. Altogether, this study provides interesting new features on the evolution of the MLO family. It also provides valuable clues to identify additional MLO genes from non-sequenced species. PMID:27596925

  3. Genome-wide single nucleotide polymorphisms reveal population history and adaptive divergence in wild guppies.

    PubMed

    Willing, Eva-Maria; Bentzen, Paul; van Oosterhout, Cock; Hoffmann, Margarete; Cable, Joanne; Breden, Felix; Weigel, Detlef; Dreyer, Christine

    2010-03-01

    Adaptation of guppies (Poecilia reticulata) to contrasting upland and lowland habitats has been extensively studied with respect to behaviour, morphology and life history traits. Yet population history has not been studied at the whole-genome level. Although single nucleotide polymorphisms (SNPs) are the most abundant form of variation in many genomes and consequently very informative for a genome-wide picture of standing natural variation in populations, genome-wide SNP data are rarely available for wild vertebrates. Here we use genetically mapped SNP markers to comprehensively survey genetic variation within and among naturally occurring guppy populations from a wide geographic range in Trinidad and Venezuela. Results from three different clustering methods, Neighbor-net, principal component analysis (PCA) and Bayesian analysis show that the population substructure agrees with geographic separation and largely with previously hypothesized patterns of historical colonization. Within major drainages (Caroni, Oropouche and Northern), populations are genetically similar, but those in different geographic regions are highly divergent from one another, with some indications of ancient shared polymorphisms. Clear genomic signatures of a previous introduction experiment were seen, and we detected additional potential admixture events. Headwater populations were significantly less heterozygous than downstream populations. Pairwise F(ST) values revealed marked differences in allele frequencies among populations from different regions, and also among populations within the same region. F(ST) outlier methods indicated some regions of the genome as being under directional selection. Overall, this study demonstrates the power of a genome-wide SNP data set to inform for studies on natural variation, adaptation and evolution of wild populations.

  4. The genome of a Mesozoic paleovirus reveals the evolution of hepatitis B viruses.

    PubMed

    Suh, Alexander; Brosius, Jürgen; Schmitz, Jürgen; Kriegs, Jan Ole

    2013-01-01

    Paleovirology involves the identification of ancient endogenous viral elements within eukaryotic genomes. The evolutionary origins of the reverse-transcribing hepatitis B viruses, however, remain elusive, due to the small number of endogenized sequences present in host genomes. Here we report a comprehensively dated genomic record of hepatitis B virus endogenizations that spans bird evolution from >82 to <12.1 million years ago. The oldest virus relic extends over a 99% complete hepatitis B virus genome sequence and constitutes the first discovery of a Mesozoic paleovirus genome. We show that Hepadnaviridae are >63 million years older than previously known and provide direct evidence for coexistence of hepatitis B viruses and birds during the Mesozoic and Cenozoic Eras. Finally, phylogenetic analyses and distribution of hepatitis B virus relics suggest that birds potentially are the ancestral hosts of Hepadnaviridae and mammalian hepatitis B viruses probably emerged after a bird-mammal host switch. Our study reveals previously undiscovered and multi-faceted insights into prehistoric hepatitis B virus evolution and provides valuable resources for future studies, such as in-vitro resurrection of Mesozoic hepadnaviruses.

  5. Genomic and physiological analysis reveals versatile metabolic capacity of deep-sea Photobacterium phosphoreum ANT-2200.

    PubMed

    Zhang, Sheng-Da; Santini, Claire-Lise; Zhang, Wei-Jia; Barbe, Valérie; Mangenot, Sophie; Guyomar, Charlotte; Garel, Marc; Chen, Hai-Tao; Li, Xue-Gong; Yin, Qun-Jian; Zhao, Yuan; Armengaud, Jean; Gaillard, Jean-Charles; Martini, Séverine; Pradel, Nathalie; Vidaud, Claude; Alberto, François; Médigue, Claudine; Tamburini, Christian; Wu, Long-Fei

    2016-05-01

    Bacteria of the genus Photobacterium thrive worldwide in oceans and show substantial eco-physiological diversity including free-living, symbiotic and piezophilic life styles. Genomic characteristics underlying this variability across species are poorly understood. Here we carried out genomic and physiological analysis of Photobacterium phosphoreum strain ANT-2200, the first deep-sea luminous bacterium of which the genome has been sequenced. Using optical mapping we updated the genomic data and reassembled it into two chromosomes and a large plasmid. Genomic analysis revealed a versatile energy metabolic potential and physiological analysis confirmed its growth capacity by deriving energy from fermentation of glucose or maltose, by respiration with formate as electron donor and trimethlyamine N-oxide (TMAO), nitrate or fumarate as electron acceptors, or by chemo-organo-heterotrophic growth in rich media. Despite that it was isolated at a site with saturated dissolved oxygen, the ANT-2200 strain possesses four gene clusters coding for typical anaerobic enzymes, the TMAO reductases. Elevated hydrostatic pressure enhances the TMAO reductase activity, mainly due to the increase of isoenzyme TorA1. The high copy number of the TMAO reductase isoenzymes and pressure-enhanced activity might imply a strategy developed by bacteria to adapt to deep-sea habitats where the instant TMAO availability may increase with depth.

  6. De novo sequences of Haloquadratum walsbyi from Lake Tyrrell, Australia, reveal a variable genomic landscape.

    PubMed

    Tully, Benjamin J; Emerson, Joanne B; Andrade, Karen; Brocks, Jochen J; Allen, Eric E; Banfield, Jillian F; Heidelberg, Karla B

    2015-01-01

    Hypersaline systems near salt saturation levels represent an extreme environment, in which organisms grow and survive near the limits of life. One of the abundant members of the microbial communities in hypersaline systems is the square archaeon, Haloquadratum walsbyi. Utilizing a short-read metagenome from Lake Tyrrell, a hypersaline ecosystem in Victoria, Australia, we performed a comparative genomic analysis of H. walsbyi to better understand the extent of variation between strains/subspecies. Results revealed that previously isolated strains/subspecies do not fully describe the complete repertoire of the genomic landscape present in H. walsbyi. Rearrangements, insertions, and deletions were observed for the Lake Tyrrell derived Haloquadratum genomes and were supported by environmental de novo sequences, including shifts in the dominant genomic landscape of the two most abundant strains. Analysis pertaining to halomucins indicated that homologs for this large protein are not a feature common for all species of Haloquadratum. Further, we analyzed ATP-binding cassette transporters (ABC-type transporters) for evidence of niche partitioning between different strains/subspecies. We were able to identify unique and variable transporter subunits from all five genomes analyzed and the de novo environmental sequences, suggesting that differences in nutrient and carbon source acquisition may play a role in maintaining distinct strains/subspecies.

  7. Development and application of a novel genome-wide SNP array reveals domestication history in soybean.

    PubMed

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-02-09

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean.

  8. De Novo Sequences of Haloquadratum walsbyi from Lake Tyrrell, Australia, Reveal a Variable Genomic Landscape

    PubMed Central

    Tully, Benjamin J.; Emerson, Joanne B.; Andrade, Karen; Brocks, Jochen J.; Allen, Eric E.; Banfield, Jillian F.; Heidelberg, Karla B.

    2015-01-01

    Hypersaline systems near salt saturation levels represent an extreme environment, in which organisms grow and survive near the limits of life. One of the abundant members of the microbial communities in hypersaline systems is the square archaeon, Haloquadratum walsbyi. Utilizing a short-read metagenome from Lake Tyrrell, a hypersaline ecosystem in Victoria, Australia, we performed a comparative genomic analysis of H. walsbyi to better understand the extent of variation between strains/subspecies. Results revealed that previously isolated strains/subspecies do not fully describe the complete repertoire of the genomic landscape present in H. walsbyi. Rearrangements, insertions, and deletions were observed for the Lake Tyrrell derived Haloquadratum genomes and were supported by environmental de novo sequences, including shifts in the dominant genomic landscape of the two most abundant strains. Analysis pertaining to halomucins indicated that homologs for this large protein are not a feature common for all species of Haloquadratum. Further, we analyzed ATP-binding cassette transporters (ABC-type transporters) for evidence of niche partitioning between different strains/subspecies. We were able to identify unique and variable transporter subunits from all five genomes analyzed and the de novo environmental sequences, suggesting that differences in nutrient and carbon source acquisition may play a role in maintaining distinct strains/subspecies. PMID:25709557

  9. Comparative Genomics and Transcriptomics Analyses Reveal Divergent Lifestyle Features of Nematode Endoparasitic Fungus Hirsutella minnesotensis

    PubMed Central

    Lai, Yiling; Liu, Keke; Zhang, Xinyu; Zhang, Xiaoling; Li, Kuan; Wang, Niuniu; Shu, Chi; Wu, Yunpeng; Wang, Chengshu; Bushley, Kathryn E.; Xiang, Meichun; Liu, Xingzhong

    2014-01-01

    Hirsutella minnesotensis [Ophiocordycipitaceae (Hypocreales, Ascomycota)] is a dominant endoparasitic fungus by using conidia that adhere to and penetrate the secondary stage juveniles of soybean cyst nematode. Its genome was de novo sequenced and compared with five entomopathogenic fungi in the Hypocreales and three nematode-trapping fungi in the Orbiliales (Ascomycota). The genome of H. minnesotensis is 51.4 Mb and encodes 12,702 genes enriched with transposable elements up to 32%. Phylogenomic analysis revealed that H. minnesotensis was diverged from entomopathogenic fungi in Hypocreales. Genome of H. minnesotensis is similar to those of entomopathogenic fungi to have fewer genes encoding lectins for adhesion and glycoside hydrolases for cellulose degradation, but is different from those of nematode-trapping fungi to possess more genes for protein degradation, signal transduction, and secondary metabolism. Those results indicate that H. minnesotensis has evolved different mechanism for nematode endoparasitism compared with nematode-trapping fungi. Transcriptomics analyses for the time-scale parasitism revealed the upregulations of lectins, secreted proteases and the genes for biosynthesis of secondary metabolites that could be putatively involved in host surface adhesion, cuticle degradation, and host manipulation. Genome and transcriptome analyses provided comprehensive understanding of the evolution and lifestyle of nematode endoparasitism. PMID:25359922

  10. The Genome Sequence of Polymorphum gilvum SL003B-26A1T Reveals Its Genetic Basis for Crude Oil Degradation and Adaptation to the Saline Soil

    PubMed Central

    Nie, Yong; Tang, Yue-Qin; Li, Yan; Chi, Chang-Qiao; Cai, Man; Wu, Xiao-Lei

    2012-01-01

    Polymorphum gilvum SL003B-26A1T is the type strain of a novel species in the recently published novel genus Polymorphum isolated from saline soil contaminated with crude oil. It is capable of using crude oil as the sole carbon and energy source and can adapt to saline soil at a temperature of 45°C. The Polymorphum gilvum genome provides a genetic basis for understanding how the strain could degrade crude oil and adapt to a saline environment. Genome analysis revealed the versatility of the strain for emulsifying crude oil, metabolizing aromatic compounds (a characteristic specific to the Polymorphum gilvum genome in comparison with other known genomes of oil-degrading bacteria), as well as possibly metabolizing n-alkanes through the LadA pathway. In addition, COG analysis revealed Polymorphum gilvum SL003B-26A1T has significantly higher abundances of the proteins responsible for cell motility, lipid transport and metabolism, and secondary metabolite biosynthesis, transport and catabolism than the average levels found in all other genomes sequenced thus far, but lower abundances of the proteins responsible for carbohydrate transport and metabolism, defense mechanisms, and translation than the average levels. These traits support the adaptability of Polymorphum gilvum to a crude oil-contaminated saline environment. The Polymorphum gilvum genome could serve as a platform for further study of oil-degrading microorganisms for bioremediation and microbial-enhanced oil recovery in harsh saline environments. PMID:22359583

  11. Comparative analysis of pepper and tomato reveals euchromatin expansion of pepper genome caused by differential accumulation of Ty3/Gypsy-like elements

    PubMed Central

    2011-01-01

    Background Among the Solanaceae plants, the pepper genome is three times larger than that of tomato. Although the gene repertoire and gene order of both species are well conserved, the cause of the genome-size difference is not known. To determine the causes for the expansion of pepper euchromatic regions, we compared the pepper genome to that of tomato. Results For sequence-level analysis, we generated 35.6 Mb of pepper genomic sequences from euchromatin enriched 1,245 pepper BAC clones. The comparative analysis of orthologous gene-rich regions between both species revealed insertion of transposons exclusively in the pepper sequences, maintaining the gene order and content. The most common type of the transposon found was the LTR retrotransposon. Phylogenetic comparison of the LTR retrotransposons revealed that two groups of Ty3/Gypsy-like elements (Tat and Athila) were overly accumulated in the pepper genome. The FISH analysis of the pepper Tat elements showed a random distribution in heterochromatic and euchromatic regions, whereas the tomato Tat elements showed heterochromatin-preferential accumulation. Conclusions Compared to tomato pepper euchromatin doubled its size by differential accumulation of a specific group of Ty3/Gypsy-like elements. Our results could provide an insight on the mechanism of genome evolution in the Solanaceae family. PMID:21276256

  12. The genome sequence of Polymorphum gilvum SL003B-26A1(T) reveals its genetic basis for crude oil degradation and adaptation to the saline soil.

    PubMed

    Nie, Yong; Tang, Yue-Qin; Li, Yan; Chi, Chang-Qiao; Cai, Man; Wu, Xiao-Lei

    2012-01-01

    Polymorphum gilvum SL003B-26A1(T) is the type strain of a novel species in the recently published novel genus Polymorphum isolated from saline soil contaminated with crude oil. It is capable of using crude oil as the sole carbon and energy source and can adapt to saline soil at a temperature of 45°C. The Polymorphum gilvum genome provides a genetic basis for understanding how the strain could degrade crude oil and adapt to a saline environment. Genome analysis revealed the versatility of the strain for emulsifying crude oil, metabolizing aromatic compounds (a characteristic specific to the Polymorphum gilvum genome in comparison with other known genomes of oil-degrading bacteria), as well as possibly metabolizing n-alkanes through the LadA pathway. In addition, COG analysis revealed Polymorphum gilvum SL003B-26A1(T) has significantly higher abundances of the proteins responsible for cell motility, lipid transport and metabolism, and secondary metabolite biosynthesis, transport and catabolism than the average levels found in all other genomes sequenced thus far, but lower abundances of the proteins responsible for carbohydrate transport and metabolism, defense mechanisms, and translation than the average levels. These traits support the adaptability of Polymorphum gilvum to a crude oil-contaminated saline environment. The Polymorphum gilvum genome could serve as a platform for further study of oil-degrading microorganisms for bioremediation and microbial-enhanced oil recovery in harsh saline environments.

  13. Comparative analysis of the domestic cat genome reveals genetic signatures underlying feline biology and domestication

    PubMed Central

    Li, Gang; Gandolfi, Barbara; Khan, Razib; Aken, Bronwen L.; Searle, Steven M. J.; Minx, Patrick; Hillier, LaDeana W.; Koboldt, Daniel C.; Davis, Brian W.; Driscoll, Carlos A.; Barr, Christina S.; Blackistone, Kevin; Quilez, Javier; Lorente-Galdos, Belen; Marques-Bonet, Tomas; Alkan, Can; Thomas, Gregg W. C.; Hahn, Matthew W.; Menotti-Raymond, Marilyn; O’Brien, Stephen J.; Wilson, Richard K.; Lyons, Leslie A.; Murphy, William J.; Warren, Wesley C.

    2014-01-01

    Little is known about the genetic changes that distinguish domestic cat populations from their wild progenitors. Here we describe a high-quality domestic cat reference genome assembly and comparative inferences made with other cat breeds, wildcats, and other mammals. Based upon these comparisons, we identified positively selected genes enriched for genes involved in lipid metabolism that underpin adaptations to a hypercarnivorous diet. We also found positive selection signals within genes underlying sensory processes, especially those affecting vision and hearing in the carnivore lineage. We observed an evolutionary tradeoff between functional olfactory and vomeronasal receptor gene repertoires in the cat and dog genomes, with an expansion of the feline chemosensory system for detecting pheromones at the expense of odorant detection. Genomic regions harboring signatures of natural selection that distinguish domestic cats from their wild congeners are enriched in neural crest-related genes associated with behavior and reward in mouse models, as predicted by the domestication syndrome hypothesis. Our description of a previously unidentified allele for the gloving pigmentation pattern found in the Birman breed supports the hypothesis that cat breeds experienced strong selection on specific mutations drawn from random bred populations. Collectively, these findings provide insight into how the process of domestication altered the ancestral wildcat genome and build a resource for future disease mapping and phylogenomic studies across all members of the Felidae. PMID:25385592

  14. The genome of the polar eukaryotic microalga Coccomyxa subellipsoidea reveals traits of cold adaptation

    SciTech Connect

    Blanc, Guillaume; Agarkova, Irina; Grimwood, Jane; Kuo, Alan; Brueggeman, Andrew; Dunigan, David D.; Gurnon, James; Ladunga, Istvan; Lindquist, Erika; Lucas, Susan; Pangilinan, Jasmyn; Proschold, Thomas; Salamov, Asaf; Schmutz, Jeremy; Weeks, Donald; Tamada, Takashi; Lomsadze, Alexandre; Borodovsky, Mark; Claverie, Jean-Michel; Grigoriev, Igor V.; Van Etten, James L.

    2012-02-13

    Background Little is known about the mechanisms of adaptation of life to the extreme environmental conditions encountered in polar regions. Here we present the genome sequence of a unicellular green alga from the division chlorophyta, Coccomyxa subellipsoidea C-169, which we will hereafter refer to as C-169. This is the first eukaryotic microorganism from a polar environment to have its genome sequenced. Results The 48.8 Mb genome contained in 20 chromosomes exhibits significant synteny conservation with the chromosomes of its relatives Chlorella variabilis and Chlamydomonas reinhardtii. The order of the genes is highly reshuffled within synteny blocks, suggesting that intra-chromosomal rearrangements were more prevalent than inter-chromosomal rearrangements. Remarkably, Zepp retrotransposons occur in clusters of nested elements with strictly one cluster per chromosome probably residing at the centromere. Several protein families overrepresented in C. subellipsoidae include proteins involved in lipid metabolism, transporters, cellulose synthases and short alcohol dehydrogenases. Conversely, C-169 lacks proteins that exist in all other sequenced chlorophytes, including components of the glycosyl phosphatidyl inositol anchoring system, pyruvate phosphate dikinase and the photosystem 1 reaction center subunit N (PsaN). Conclusions We suggest that some of these gene losses and gains could have contributed to adaptation to low temperatures. Comparison of these genomic features with the adaptive strategies of psychrophilic microbes suggests that prokaryotes and eukaryotes followed comparable evolutionary routes to adapt to cold environments.

  15. Comparative analysis of the domestic cat genome reveals genetic signatures underlying feline biology and domestication.

    PubMed

    Montague, Michael J; Li, Gang; Gandolfi, Barbara; Khan, Razib; Aken, Bronwen L; Searle, Steven M J; Minx, Patrick; Hillier, LaDeana W; Koboldt, Daniel C; Davis, Brian W; Driscoll, Carlos A; Barr, Christina S; Blackistone, Kevin; Quilez, Javier; Lorente-Galdos, Belen; Marques-Bonet, Tomas; Alkan, Can; Thomas, Gregg W C; Hahn, Matthew W; Menotti-Raymond, Marilyn; O'Brien, Stephen J; Wilson, Richard K; Lyons, Leslie A; Murphy, William J; Warren, Wesley C

    2014-12-02

    Little is known about the genetic changes that distinguish domestic cat populations from their wild progenitors. Here we describe a high-quality domestic cat reference genome assembly and comparative inferences made with other cat breeds, wildcats, and other mammals. Based upon these comparisons, we identified positively selected genes enriched for genes involved in lipid metabolism that underpin adaptations to a hypercarnivorous diet. We also found positive selection signals within genes underlying sensory processes, especially those affecting vision and hearing in the carnivore lineage. We observed an evolutionary tradeoff between functional olfactory and vomeronasal receptor gene repertoires in the cat and dog genomes, with an expansion of the feline chemosensory system for detecting pheromones at the expense of odorant detection. Genomic regions harboring signatures of natural selection that distinguish domestic cats from their wild congeners are enriched in neural crest-related genes associated with behavior and reward in mouse models, as predicted by the domestication syndrome hypothesis. Our description of a previously unidentified allele for the gloving pigmentation pattern found in the Birman breed supports the hypothesis that cat breeds experienced strong selection on specific mutations drawn from random bred populations. Collectively, these findings provide insight into how the process of domestication altered the ancestral wildcat genome and build a resource for future disease mapping and phylogenomic studies across all members of the Felidae.

  16. Comparative Genomics of Oral Isolates of Streptococcus mutans by in silico Genome Subtraction Does Not Reveal Accessory DNA Associated with Severe Early Childhood Caries

    PubMed Central

    Argimón, Silvia; Konganti, Kranti; Chen, Hao; Alekseyenko, Alexander V.; Brown, Stuart; Caufield, Page W.

    2014-01-01

    Comparative genomics is a popular method for the identification of microbial virulence determinants, especially since the sequencing of a large number of whole bacterial genomes from pathogenic and non-pathogenic strains has become relatively inexpensive. The bioinformatics pipelines for comparative genomics usually include gene prediction and annotation and can require significant computer power. To circumvent this, we developed a rapid method for genome-scale in silico subtractive hybridization, based on blastn and independent of feature identification and annotation. Whole genome comparisons by in silico genome subtraction were performed to identify genetic loci specific to Streptococcus mutans strains associated with severe early childhood caries (S-ECC), compared to strains isolated from caries-free (CF) children. The genome similarity of the 20 S. mutans strains included in this study, calculated by Simrank k-mer sharing, ranged from 79.5 to 90.9%, confirming this is a genetically heterogeneous group of strains. We identified strain-specific genetic elements in 19 strains, with sizes ranging from 200 bp to 39 kb. These elements contained protein-coding regions with functions mostly associated with mobile DNA. We did not, however, identify any genetic loci consistently associated with dental caries, i.e., shared by all the S-ECC strains and absent in the CF strains. Conversely, we did not identify any genetic loci specific with the healthy group. Comparison of previously published genomes from pathogenic and carriage strains of Neisseria meningitidis with our in silico genome subtraction yielded the same set of genes specific to the pathogenic strains, thus validating our method. Our results suggest that S. mutans strains derived from caries active or caries free dentitions cannot be differentiated based on the presence or absence of specific genetic elements. Our in silico genome subtraction method is available as the Microbial Genome Comparison (MGC) tool

  17. Comparative genomics of oral isolates of Streptococcus mutans by in silico genome subtraction does not reveal accessory DNA associated with severe early childhood caries.

    PubMed

    Argimón, Silvia; Konganti, Kranti; Chen, Hao; Alekseyenko, Alexander V; Brown, Stuart; Caufield, Page W

    2014-01-01

    Comparative genomics is a popular method for the identification of microbial virulence determinants, especially since the sequencing of a large number of whole bacterial genomes from pathogenic and non-pathogenic strains has become relatively inexpensive. The bioinformatics pipelines for comparative genomics usually include gene prediction and annotation and can require significant computer power. To circumvent this, we developed a rapid method for genome-scale in silico subtractive hybridization, based on blastn and independent of feature identification and annotation. Whole genome comparisons by in silico genome subtraction were performed to identify genetic loci specific to Streptococcus mutans strains associated with severe early childhood caries (S-ECC), compared to strains isolated from caries-free (CF) children. The genome similarity of the 20 S. mutans strains included in this study, calculated by Simrank k-mer sharing, ranged from 79.5% to 90.9%, confirming this is a genetically heterogeneous group of strains. We identified strain-specific genetic elements in 19 strains, with sizes ranging from 200 to 39 kb. These elements contained protein-coding regions with functions mostly associated with mobile DNA. We did not, however, identify any genetic loci consistently associated with dental caries, i.e., shared by all the S-ECC strains and absent in the CF strains. Conversely, we did not identify any genetic loci specific with the healthy group. Comparison of previously published genomes from pathogenic and carriage strains of Neisseria meningitidis with our in silico genome subtraction yielded the same set of genes specific to the pathogenic strains, thus validating our method. Our results suggest that S. mutans strains derived from caries active or caries free dentitions cannot be differentiated based on the presence or absence of specific genetic elements. Our in silico genome subtraction method is available as the Microbial Genome Comparison (MGC) tool

  18. Complete genome sequence analysis of novel human bocavirus reveals genetic recombination between human bocavirus 2 and human bocavirus 4.

    PubMed

    Khamrin, Pattara; Okitsu, Shoko; Ushijima, Hiroshi; Maneekarn, Niwat

    2013-07-01

    Epidemiological surveillance of human bocavirus (HBoV) was conducted on fecal specimens collected from hospitalized children with diarrhea in Chiang Mai, Thailand in 2011. By partial sequence analysis of VP1 gene, an unusual strain of HBoV (CMH-S011-11), was initially identified as HBoV4. The complete genome sequence of CMH-S011-11 was performed and analyzed further to clarify whether it was a recombinant strain or a new HBoV variant. Analysis of complete genome sequence revealed that the coding sequence starting from NS1, NP1 to VP1/VP2 was 4795 nucleotides long. Interestingly, the nucleotide sequence of NS1 gene of CMH-S011-11 was most closely related to the HBoV2 reference strains detected in Pakistan, which contradicted to the initial genotyping result of the partial VP1 region in the previous study. In addition, comparison of NP1 nucleotide sequence of CMH-S011-11 with those of other HBoV1-4 reference strains also revealed a high level of sequence identity with HBoV2. On the other hand, nucleotide sequence of VP1/VP2 gene of CMH-S011-11 was most closely related to those of HBoV4 reference strains detected in Nigeria. The overall full-length sequence analysis revealed that this CMH-S011-11 was grouped within HBoV4 species, but located in a separate branch from other HBoV4 prototype strains. Recombination analysis revealed that CMH-S011-11 was the result of recombination between HBoV2 and HBoV4 strains with the break point located near the start codon of VP2.

  19. Genome resolved analysis of a premature infant gut microbial community reveals a Varibaculum cambriense genome and a shift towards fermentation-based metabolism during the third week of life

    PubMed Central

    2013-01-01

    Background The premature infant gut has low individual but high inter-individual microbial diversity compared with adults. Based on prior 16S rRNA gene surveys, many species from this environment are expected to be similar to those previously detected in the human microbiota. However, the level of genomic novelty and metabolic variation of strains found in the infant gut remains relatively unexplored. Results To study the stability and function of early microbial colonizers of the premature infant gut, nine stool samples were taken during the third week of life of a premature male infant delivered via Caesarean section. Metagenomic sequences were assembled and binned into near-complete and partial genomes, enabling strain-level genomic analysis of the microbial community. We reconstructed eleven near-complete and six partial bacterial genomes representative of the key members of the microbial community. Twelve of these genomes share >90% putative ortholog amino acid identity with reference genomes. Manual curation of the assembly of one particularly novel genome resulted in the first essentially complete genome sequence (in three pieces, the order of which could not be determined due to a repeat) for Varibaculum cambriense (strain Dora), a medically relevant species that has been implicated in abscess formation. During the period studied, the microbial community undergoes a compositional shift, in which obligate anaerobes (fermenters) overtake Escherichia coli as the most abundant species. Other species remain stable, probably due to their ability to either respire anaerobically or grow by fermentation, and their capacity to tolerate fluctuating levels of oxygen. Metabolic predictions for V. cambriense suggest that, like other members of the microbial community, this organism is able to process various sugar substrates and make use of multiple different electron acceptors during anaerobic respiration. Genome comparisons within the family Actinomycetaceae reveal

  20. Whole-Genome Sequencing Reveals Diverse Models of Structural Variations in Esophageal Squamous Cell Carcinoma.

    PubMed

    Cheng, Caixia; Zhou, Yong; Li, Hongyi; Xiong, Teng; Li, Shuaicheng; Bi, Yanghui; Kong, Pengzhou; Wang, Fang; Cui, Heyang; Li, Yaoping; Fang, Xiaodong; Yan, Ting; Li, Yike; Wang, Juan; Yang, Bin; Zhang, Ling; Jia, Zhiwu; Song, Bin; Hu, Xiaoling; Yang, Jie; Qiu, Haile; Zhang, Gehong; Liu, Jing; Xu, Enwei; Shi, Ruyi; Zhang, Yanyan; Liu, Haiyan; He, Chanting; Zhao, Zhenxiang; Qian, Yu; Rong, Ruizhou; Han, Zhiwei; Zhang, Yanlin; Luo, Wen; Wang, Jiaqian; Peng, Shaoliang; Yang, Xukui; Li, Xiangchun; Li, Lin; Fang, Hu; Liu, Xingmin; Ma, Li; Chen, Yunqing; Guo, Shiping; Chen, Xing; Xi, Yanfeng; Li, Guodong; Liang, Jianfang; Yang, Xiaofeng; Guo, Jiansheng; Jia, JunMei; Li, Qingshan; Cheng, Xiaolong; Zhan, Qimin; Cui, Yongping

    2016-02-04

    Comprehensive identification of somatic structural variations (SVs) and understanding their mutational mechanisms in cancer might contribute to understanding biological differences and help to identify new therapeutic targets. Unfortunately, characterization of complex SVs across the whole genome and the mutational mechanisms underlying esophageal squamous cell carcinoma (ESCC) is largely unclear. To define a comprehensive catalog of somatic SVs, affected target genes, and their underlying mechanisms in ESCC, we re-analyzed whole-genome sequencing (WGS) data from 31 ESCCs using Meerkat algorithm to predict somatic SVs and Patchwork to determine copy-number changes. We found deletions and translocations with NHEJ and alt-EJ signature as the dominant SV types, and 16% of deletions were complex deletions. SVs frequently led to disruption of cancer-associated genes (e.g., CDKN2A and NOTCH1) with different mutational mechanisms. Moreover, chromothripsis, kataegis, and breakage-fusion-bridge (BFB) were identified as contributing to locally mis-arranged chromosomes that occurred in 55% of ESCCs. These genomic catastrophes led to amplification of oncogene through chromothripsis-derived double-minute chromosome formation (e.g., FGFR1 and LETM2) or BFB-affected chromosomes (e.g., CCND1, EGFR, ERBB2, MMPs, and MYC), with approximately 30% of ESCCs harboring BFB-derived CCND1 amplification. Furthermore, analyses of copy-number alterations reveal high frequency of whole-genome duplication (WGD) and recurrent focal amplification of CDCA7 that might act as a potential oncogene in ESCC. Our findings reveal molecular defects such as chromothripsis and BFB in malignant transformation of ESCCs and demonstrate diverse models of SVs-derived target genes in ESCCs. These genome-wide SV profiles and their underlying mechanisms provide preventive, diagnostic, and therapeutic implications for ESCCs.

  1. Whole-Genome Sequencing Reveals Diverse Models of Structural Variations in Esophageal Squamous Cell Carcinoma

    PubMed Central

    Cheng, Caixia; Zhou, Yong; Li, Hongyi; Xiong, Teng; Li, Shuaicheng; Bi, Yanghui; Kong, Pengzhou; Wang, Fang; Cui, Heyang; Li, Yaoping; Fang, Xiaodong; Yan, Ting; Li, Yike; Wang, Juan; Yang, Bin; Zhang, Ling; Jia, Zhiwu; Song, Bin; Hu, Xiaoling; Yang, Jie; Qiu, Haile; Zhang, Gehong; Liu, Jing; Xu, Enwei; Shi, Ruyi; Zhang, Yanyan; Liu, Haiyan; He, Chanting; Zhao, Zhenxiang; Qian, Yu; Rong, Ruizhou; Han, Zhiwei; Zhang, Yanlin; Luo, Wen; Wang, Jiaqian; Peng, Shaoliang; Yang, Xukui; Li, Xiangchun; Li, Lin; Fang, Hu; Liu, Xingmin; Ma, Li; Chen, Yunqing; Guo, Shiping; Chen, Xing; Xi, Yanfeng; Li, Guodong; Liang, Jianfang; Yang, Xiaofeng; Guo, Jiansheng; Jia, JunMei; Li, Qingshan; Cheng, Xiaolong; Zhan, Qimin; Cui, Yongping

    2016-01-01

    Comprehensive identification of somatic structural variations (SVs) and understanding their mutational mechanisms in cancer might contribute to understanding biological differences and help to identify new therapeutic targets. Unfortunately, characterization of complex SVs across the whole genome and the mutational mechanisms underlying esophageal squamous cell carcinoma (ESCC) is largely unclear. To define a comprehensive catalog of somatic SVs, affected target genes, and their underlying mechanisms in ESCC, we re-analyzed whole-genome sequencing (WGS) data from 31 ESCCs using Meerkat algorithm to predict somatic SVs and Patchwork to determine copy-number changes. We found deletions and translocations with NHEJ and alt-EJ signature as the dominant SV types, and 16% of deletions were complex deletions. SVs frequently led to disruption of cancer-associated genes (e.g., CDKN2A and NOTCH1) with different mutational mechanisms. Moreover, chromothripsis, kataegis, and breakage-fusion-bridge (BFB) were identified as contributing to locally mis-arranged chromosomes that occurred in 55% of ESCCs. These genomic catastrophes led to amplification of oncogene through chromothripsis-derived double-minute chromosome formation (e.g., FGFR1 and LETM2) or BFB-affected chromosomes (e.g., CCND1, EGFR, ERBB2, MMPs, and MYC), with approximately 30% of ESCCs harboring BFB-derived CCND1 amplification. Furthermore, analyses of copy-number alterations reveal high frequency of whole-genome duplication (WGD) and recurrent focal amplification of CDCA7 that might act as a potential oncogene in ESCC. Our findings reveal molecular defects such as chromothripsis and BFB in malignant transformation of ESCCs and demonstrate diverse models of SVs-derived target genes in ESCCs. These genome-wide SV profiles and their underlying mechanisms provide preventive, diagnostic, and therapeutic implications for ESCCs. PMID:26833333

  2. Genomic analysis reveals Lactobacillus sanfranciscensis as stable element in traditional sourdoughs

    PubMed Central

    2011-01-01

    Sourdough has played a significant role in human nutrition and culture for thousands of years and is still of eminent importance for human diet and the bakery industry. Lactobacillus sanfranciscensis is the predominant key bacterium in traditionally fermented sourdoughs. The genome of L. sanfranciscensis TMW 1.1304 isolated from an industrial sourdough fermentation was sequenced with a combined Sanger/454-pyrosequencing approach followed by gap closing by walking on fosmids. The sequencing data revealed a circular chromosomal sequence of 1,298,316 bp and two additional plasmids, pLS1 and pLS2, with sizes of 58,739 bp and 18,715 bp, which are predicted to encode 1,437, 63 and 19 orfs, respectively. The overall GC content of the chromosome is 34.71%. Several specific features appear to contribute to the ability of L. sanfranciscensis to outcompete other bacteria in the fermentation. L. sanfranciscensis contains the smallest genome within the lactobacilli and the highest density of ribosomal RNA operons per Mbp genome among all known genomes of free-living bacteria, which is important for the rapid growth characteristics of the organism. A high frequency of gene inactivation and elimination indicates a process of reductive evolution. The biosynthetic capacity for amino acids scarcely availably in cereals and exopolysaccharides reveal the molecular basis for an autochtonous sourdough organism with potential for further exploitation in functional foods. The presence of two CRISPR/cas loci versus a high number of transposable elements suggests recalcitrance to gene intrusion and high intrinsic genome plasticity. PMID:21995419

  3. Genomes of Gardnerella Strains Reveal an Abundance of Prophages within the Bladder Microbiome

    PubMed Central

    Malki, Kema; Shapiro, Jason W.; Price, Travis K.; Hilt, Evann E.; Thomas-White, Krystal; Sircar, Trina; Rosenfeld, Amy B.; Kuffel, Gina; Zilliox, Michael J.; Wolfe, Alan J.; Putonti, Catherine

    2016-01-01

    Bacterial surveys of the vaginal and bladder human microbiota have revealed an abundance of many similar bacterial taxa. As the bladder was once thought to be sterile, the complex interactions between microbes within the bladder have yet to be characterized. To initiate this process, we have begun sequencing isolates, including the clinically relevant genus Gardnerella. Herein, we present the genomic sequences of four Gardnerella strains isolated from the bladders of women with symptoms of urgency urinary incontinence; these are the first Gardnerella genomes produced from this niche. Congruent to genomic characterization of Gardnerella isolates from the reproductive tract, isolates from the bladder reveal a large pangenome, as well as evidence of high frequency horizontal gene transfer. Prophage gene sequences were found to be abundant amongst the strains isolated from the bladder, as well as amongst publicly available Gardnerella genomes from the vagina and endometrium, motivating an in depth examination of these sequences. Amongst the 39 Gardnerella strains examined here, there were more than 400 annotated prophage gene sequences that we could cluster into 95 homologous groups; 49 of these groups were unique to a single strain. While many of these prophages exhibited no sequence similarity to any lytic phage genome, estimation of the rate of phage acquisition suggests both vertical and horizontal acquisition. Furthermore, bioinformatic evidence indicates that prophage acquisition is ongoing within both vaginal and bladder Gardnerella populations. The abundance of prophage sequences within the strains examined here suggests that phages could play an important role in the species’ evolutionary history and in its interactions within the complex communities found in the female urinary and reproductive tracts. PMID:27861551

  4. Evolution of Carbapenem-Resistant Acinetobacter baumannii Revealed through Whole-Genome Sequencing and Comparative Genomic Analysis

    PubMed Central

    Li, Henan; Liu, Fei; Zhang, Yawei; Wang, Xiaojuan; Zhao, Chunjiang; Chen, Hongbin; Zhang, Feifei; Zhu, Baoli

    2014-01-01

    Acinetobacter baumannii is a globally important nosocomial pathogen characterized by an evolving multidrug resistance. A total of 35 representative clinical A. baumannii strains isolated from 13 hospitals in nine cities in China from 1999 to 2011, including 32 carbapenem-resistant and 3 carbapenem-susceptible A. baumannii strains, were selected for whole-genome sequencing and comparative genomic analysis. Phylogenetic analysis revealed that the earliest strain, strain 1999BJAB11, and two strains isolated in Zhejiang Province in 2004 were the founder strains of carbapenem-resistant A. baumannii. Ten types of AbaR resistance islands were identified, and a previously unreported AbaR island, which comprised a two-component response regulator, resistance-related proteins, and RND efflux system proteins, was identified in two strains isolated in Zhejiang in 2004. Multiple transposons or insertion sequences (ISs) existed in each strain, and these gradually tended to diversify with evolution. Some of these IS elements or transposons were the first to be reported, and most of them were mainly found in strains from two provinces. Genome feature analysis illustrated diversified resistance genes, surface polysaccharides, and a restriction-modification system, even in strains that were phylogenetically and epidemiologically very closely related. IS-mediated deletions were identified in the type VI secretion system region, the csuE region, and core lipooligosaccharide (LOS) loci. Recombination occurred in the heme utilization region, and intrinsic resistance genes (blaADC and blaOXA-51-like variants) and three novel blaOXA-51-like variants (blaOXA-424, blaOXA-425, and blaOXA-426) were identified. Our results could improve the understanding of the evolutionary processes that contribute to the emergence of carbapenem-resistant A. baumannii strains and help elucidate the molecular evolutionary mechanism in A. baumannii. PMID:25487793

  5. Population structure and comparative genome hybridization of European flor yeast reveal a unique group of Saccharomyces cerevisiae strains with few gene duplications in their genome.

    PubMed

    Legras, Jean-Luc; Erny, Claude; Charpentier, Claudine

    2014-01-01

    Wine biological aging is a wine making process used to produce specific beverages in several countries in Europe, including Spain, Italy, France, and Hungary. This process involves the formation of a velum at the surface of the wine. Here, we present the first large scale comparison of all European flor strains involved in this process. We inferred the population structure of these European flor strains from their microsatellite genotype diversity and analyzed their ploidy. We show that almost all of these flor strains belong to the same cluster and are diploid, except for a few Spanish strains. Comparison of the array hybridization profile of six flor strains originating from these four countries, with that of three wine strains did not reveal any large segmental amplification. Nonetheless, some genes, including YKL221W/MCH2 and YKL222C, were amplified in the genome of four out of six flor strains. Finally, we correlated ICR1 ncRNA and FLO11 polymorphisms with flor yeast population structure, and associate the presence of wild type ICR1 and a long Flo11p with thin velum formation in a cluster of Jura strains. These results provide new insight into the diversity of flor yeast and show that combinations of different adaptive changes can lead to an increase of hydrophobicity and affect velum formation.

  6. Population Structure and Comparative Genome Hybridization of European Flor Yeast Reveal a Unique Group of Saccharomyces cerevisiae Strains with Few Gene Duplications in Their Genome

    PubMed Central

    Legras, Jean-Luc; Erny, Claude; Charpentier, Claudine

    2014-01-01

    Wine biological aging is a wine making process used to produce specific beverages in several countries in Europe, including Spain, Italy, France, and Hungary. This process involves the formation of a velum at the surface of the wine. Here, we present the first large scale comparison of all European flor strains involved in this process. We inferred the population structure of these European flor strains from their microsatellite genotype diversity and analyzed their ploidy. We show that almost all of these flor strains belong to the same cluster and are diploid, except for a few Spanish strains. Comparison of the array hybridization profile of six flor strains originating from these four countries, with that of three wine strains did not reveal any large segmental amplification. Nonetheless, some genes, including YKL221W/MCH2 and YKL222C, were amplified in the genome of four out of six flor strains. Finally, we correlated ICR1 ncRNA and FLO11 polymorphisms with flor yeast population structure, and associate the presence of wild type ICR1 and a long Flo11p with thin velum formation in a cluster of Jura strains. These results provide new insight into the diversity of flor yeast and show that combinations of different adaptive changes can lead to an increase of hydrophobicity and affect velum formation. PMID:25272156

  7. Genome analysis of canine astroviruses reveals genetic heterogeneity and suggests possible inter-species transmission.

    PubMed

    Mihalov-Kovács, Eszter; Martella, Vito; Lanave, Gianvito; Bodnar, Livia; Fehér, Enikő; Marton, Szilvia; Kemenesi, Gábor; Jakab, Ferenc; Bányai, Krisztián

    2017-03-15

    Canine astrovirus RNA was detected in the stools of 17/63 (26.9%) samples, using either a broadly reactive consensus RT-PCR for astroviruses or random RT-PCR coupled with massive deep sequencing. The complete or nearly complete genome sequence of five canine astroviruses was reconstructed that allowed mapping the genome organization and to investigate the genetic diversity of these viruses. The genome was about 6.6kb in length and contained three open reading frames (ORFs) flanked by a 5' UTR, and a 3' UTR plus a poly-A tail. ORF1a and ORF1b overlapped by 43 nucleotides while the ORF2 overlapped by 8 nucleotides with the 3' end of ORF1b. Upon genome comparison, four strains (HUN/2012/2, HUN/2012/6, HUN/2012/115, and HUN/2012/135) were more related genetically to each other and to UK canine astroviruses (88-96% nt identity), whilst strain HUN/2012/126 was more divergent (75-76% nt identity). In the ORF1b and ORF2, strains HUN/2012/2, HUN/2012/6, and HUN/2012/135 were related genetically to other canine astroviruses identified formerly in Europe and China, whereas strain HUN/2012/126 was related genetically to a divergent canine astrovirus strain, ITA/2010/Zoid. For one canine astrovirus, HUN/2012/8, only a 3.2kb portion of the genome, at the 3' end, could be determined. Interestingly, this strain possessed unique genetic signatures (including a longer ORF1b/ORF2 overlap and a longer 3'UTR) and it was divergent in both ORF1b and ORF2 from all other canine astroviruses, with the highest nucleotide sequence identity (68% and 63%, respectively) to a mink astrovirus, thus suggesting a possible event of interspecies transmission. The genetic heterogeneity of canine astroviruses may pose a challenge for the diagnostics and for future prophylaxis strategies.

  8. Gene tree discordance of wild and cultivated Asian rice deciphered by genome-wide sequence comparison.

    PubMed

    Yang, Ching-chia; Sakai, Hiroaki; Numa, Hisataka; Itoh, Takeshi

    2011-05-15

    Although a large number of genes are expected to correctly solve a phylogenetic relationship, inconsistent gene tree topologies have been observed. This conflicting evidence in gene tree topologies, known as gene tree discordance, becomes increasingly important as advanced sequencing technologies produce an enormous amount of sequence information for phylogenomic studies among closely related species. Here, we aim to characterize the gene tree discordance of the Asian cultivated rice Oryza sativa and its progenitor, O. rufipogon, which will be an ideal case study of gene tree discordance. Using genome and cDNA sequences of O. sativa and O. rufipogon, we have conducted the first in-depth analyses of gene tree discordance in Asian rice. Our comparison of full-length cDNA sequences of O. rufipogon with the genome sequences of the japonica and indica cultivars of O. sativa revealed that 60% of the gene trees showed a topology consistent with the expected one, whereas the remaining genes supported significantly different topologies. Moreover, the proportions of the topologies deviated significantly from expectation, suggesting at least one hybridization event between the two subgroups of O. sativa, japonica and indica. In fact, a genome-wide alignment between japonica and indica indicated that significant portions of the indica genome are derived from japonica. In addition, literature concerning the pedigree of the indica cultivar strongly supported the hybridization hypothesis. Our molecular evolutionary analyses deciphered complicated evolutionary processes in closely related species. They also demonstrated the importance of gene tree discordance in the era of high-speed DNA sequencing.

  9. Advances in the translational genomics of neuroblastoma: From improving risk stratification and revealing novel biology to identifying actionable genomic alterations.

    PubMed

    Bosse, Kristopher R; Maris, John M

    2016-01-01

    Neuroblastoma is an embryonal malignancy that commonly affects young children and is remarkably heterogenous in its malignant potential. Recently, the genetic basis of neuroblastoma has come into focus and not only has catalyzed a more comprehensive understanding of neuroblastoma tumorigenesis but also has revealed novel oncogenic vulnerabilities that are being therapeutically leveraged. Neuroblastoma is a model pediatric solid tumor in its use of recurrent genomic alterations, such as high-level MYCN (v-myc avian myelocytomatosis viral oncogene neuroblastoma-derived homolog) amplification, for risk stratification. Given the relative paucity of recurrent, activating, somatic point mutations or gene fusions in primary neuroblastoma tumors studied at initial diagnosis, innovative treatment approaches beyond small molecules targeting mutated or dysregulated kinases will be required moving forward to achieve noticeable improvements in overall patient survival. However, the clonally acquired, oncogenic aberrations in relapsed neuroblastomas are currently being defined and may offer an opportunity to improve patient outcomes with molecularly targeted therapy directed toward aberrantly regulated pathways in relapsed disease. This review summarizes the current state of knowledge about neuroblastoma genetics and genomics, highlighting the improved prognostication and potential therapeutic opportunities that have arisen from recent advances in understanding germline predisposition, recurrent segmental chromosomal alterations, somatic point mutations and translocations, and clonal evolution in relapsed neuroblastoma.

  10. Comparison and quantitative verification of mapping algorithms for whole genome bisulfite sequencing

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Coupling bisulfite conversion with next-generation sequencing (Bisulfite-seq) enables genome-wide measurement of DNA methylation, but poses unique challenges for mapping. However, despite a proliferation of Bisulfite-seq mapping tools, no systematic comparison of their genomic coverage and quantitat...

  11. Sequence analysis reveals genomic factors affecting EST-SSR primer performance and polymorphism.

    PubMed

    Chen, Chunxian; Bock, Clive H; Beckman, Tom G

    2014-12-01

    This study was to explore genomic factors affecting the performance and polymorphism of 340 randomly selected EST-SSR (expressed sequence tag-simple sequence repeat) primers through BLAST of primer sequences to a reference genome. Genotyping showed 111 failed and 229 succeeded. The failed types included "no peaks" (NP, 69 primers), "weak peaks" (WP, 30), and "multiple peaks" (MP, 12). The successful types were divided into HM (homozygous between two selected parents, 78 primers) and HT (heterozygous at least in one parent, 151 primers). The BLAST revealed primer alignment status, genomic amplicon size (GAS), and genomic and expressed amplicon size difference (ASD). The alignment status was categorized as: "no hits found" (NHF); "multiple partial alignments" (MPA); "single partial alignment" (SPA); "multiple full alignments" (MFA); and "single full alignment" (SFA). NHF and partial alignment (PA) mainly resulted from discrepant nucleotides in contig-derived primers. The ASD separated 247 non-NHF primers into: "deletion", "same size", "insertion", "intron (GAS ≤500)", "intron (GAS >500)", and "error" categories. Most SFA primers were successful. About 88 % "error", 53 % NHF primers, and 47 % "intron (GAS >500)" failed. The "deletion" and "insertion" primers had the higher HT rates, and the "same size" had the highest HM rate. Optimized primer selection criteria are discussed.

  12. A Korarchael Genome Reveals Insights into the Evolution of the Archaea

    SciTech Connect

    Lapidus, Alla; Elkins, James G.; Podar, Mircea; Graham, David E.; Makarova, Kira S.; Wolf, Yuri; Randau, Lennart; Hedlund, Brian P.; Brochier-Armanet, Celine; Kunin, Victor; Anderson, Iain; Lapidus, Alla; Goltsman, Eugene; Barry, Kerrie; Koonin, Eugene V.; Hugenholtz, Phil; Kyrpides, Nikos; Wanner, Gerhard; Richardson, Paul; Keller, Martin; Stetter, Karl O.

    2008-01-07

    The candidate division Korarchaeota comprises a group of uncultivated microorganisms that, by their small subunit rRNA phylogeny, may have diverged early from the major archaeal phyla Crenarchaeota and Euryarchaeota. Here, we report the initial characterization of a member of the Korarchaeota with the proposed name, ?Candidatus Korarchaeum cryptofilum,? which exhibits an ultrathin filamentous morphology. To investigate possible ancestral relationships between deep-branching Korarchaeota and other phyla, we used whole-genome shotgun sequencing to construct a complete composite korarchaeal genome from enriched cells. The genome was assembled into a single contig 1.59 Mb in length with a G + C content of 49percent. Of the 1,617 predicted protein-coding genes, 1,382 (85percent) could be assigned to a revised set of archaeal Clusters of Orthologous Groups (COGs). The predicted gene functions suggest that the organism relies on a simple mode of peptide fermentation for carbon and energy and lacks the ability to synthesize de novo purines, CoA, and several other cofactors. Phylogenetic analyses based on conserved single genes and concatenated protein sequences positioned the korarchaeote as a deep archaeal lineage with an apparent affinity to the Crenarchaeota. However, the predicted gene content revealed that several conserved cellular systems, such as cell division, DNA replication, and tRNA maturation, resemble the counterparts in the Euryarchaeota. In light of the known composition of archaeal genomes, the Korarchaeota might have retained a set of cellular features that represents the ancestral archaeal form.

  13. A korarchaeal genome reveals insights into the evolution of the Archaea

    SciTech Connect

    Anderson, Iain J; Elkins, James G.; Podar, Mircea; Graham, David E.; Makarova, Kira S.; Wolf, Yuri; Randau, Lennart; Hedlund, Brian P.; Brochier-Armanet, Celine; Kunin, Victor; Anderson, Iain; Lapidus, Alla; Goltsman, Eugene; Barry, Kerrie; Koonin, Eugene V.; Hugenholtz, Phil; Kyrpides, Nikos; Wanner, Gerhard; Richardson, Paul; Keller, Martin; Stetter, Karl O.

    2008-06-05

    The candidate division Korarchaeota comprises a group of uncultivated microorganisms that, by their small subunit rRNA phylogeny, may have diverged early from the major archaeal phyla Crenarchaeota and Euryarchaeota. Here, we report the initial characterization of a member of the Korarchaeota with the proposed name,"Candidatus Korarchaeum cryptofilum," which exhibits an ultrathin filamentous morphology. To investigate possible ancestral relationships between deep-branching Korarchaeota and other phyla, we used whole-genome shotgun sequencing to construct a complete composite korarchaeal genome from enriched cells. The genome was assembled into a single contig 1.59 Mb in length with a G + C content of 49percent. Of the 1,617 predicted protein-coding genes, 1,382 (85percent) could be assigned to a revised set of archaeal Clusters of Orthologous Groups (COGs). The predicted gene functions suggest that the organism relies on a simple mode of peptide fermentation for carbon and energy and lacks the ability to synthesize de novo purines, CoA, and several other cofactors. Phylogenetic analyses based on conserved single genes and concatenated protein sequences positioned the korarchaeote as a deep archaeal lineage with an apparent affinity to the Crenarchaeota. However, the predicted gene content revealed that several conserved cellular systems, such as cell division, DNA replication, and tRNA maturation, resemble the counterparts in the Euryarchaeota. In light of the known composition of archaeal genomes, the Korarchaeota might have retained a set of cellular features that represents the ancestral archaeal form.

  14. Unique Features of a Japanese ‘Candidatus Liberibacter asiaticus’ Strain Revealed by Whole Genome Sequencing

    PubMed Central

    Katoh, Hiroshi; Miyata, Shin-ichi; Inoue, Hiromitsu; Iwanami, Toru

    2014-01-01

    Citrus greening (huanglongbing) is the most destructive disease of citrus worldwide. It is spread by citrus psyllids and is associated with phloem-limited bacteria of three species of α-Proteobacteria, namely, ‘Candidatus Liberibacter asiaticus’, ‘Ca. L. americanus’, and ‘Ca. L. africanus’. Recent findings suggested that some Japanese strains lack the bacteriophage-type DNA polymerase region (DNA pol), in contrast to the Floridian psy62 strain. The whole genome sequence of the pol-negative ‘Ca. L. asiaticus’ Japanese isolate Ishi-1 was determined by metagenomic analysis of DNA extracted from ‘Ca. L. asiaticus’-infected psyllids and leaf midribs. The 1.19-Mb genome has an average 36.32% GC content. Annotation revealed 13 operons encoding rRNA and 44 tRNA genes, but no typical bacterial pathogenesis-related genes were located within the genome, similar to the Floridian psy62 and Chinese gxpsy. In contrast to other ‘Ca. L. asiaticus’ strains, the genome of the Japanese Ishi-1 strain lacks a prophage-related region. PMID:25180586

  15. A GENOME-WIDE LINKAGE AND ASSOCIATION SCAN REVEALS NOVEL LOCI FOR AUTISM

    PubMed Central

    Weiss, Lauren A.; Arking, Dan E.

    2009-01-01

    Summary Although autism is a highly heritable neurodevelopmental disorder, attempts to identify specific susceptibility genes have thus far met with limited success 1. Genome-wide association studies (GWAS) using half a million or more markers, particularly those with very large sample sizes achieved through meta-analysis, have shown great success in mapping genes for other complex genetic traits (http://www.genome.gov/26525384). Consequently, we initiated a linkage and association mapping study using half a million genome-wide SNPs in a common set of 1,031 multiplex autism families (1,553 affected offspring). We identified regions of suggestive and significant linkage on chromosomes 6q27 and 20p13, respectively. Initial analysis did not yield genome-wide significant associations; however, genotyping of top hits in additional families revealed a SNP on chromosome 5p15 (between SEMA5A and TAS2R1) that was significantly associated with autism (P = 2 × 10−7). We also demonstrated that expression of SEMA5A is reduced in brains from autistic patients, further implicating SEMA5A as an autism susceptibility gene. The linkage regions reported here provide targets for rare variation screening while the discovery of a single novel association demonstrates the action of common variants. PMID:19812673

  16. Whole genome sequence of Staphylococcus saprophyticus reveals the pathogenesis of uncomplicated urinary tract infection.

    PubMed

    Kuroda, Makoto; Yamashita, Atsushi; Hirakawa, Hideki; Kumano, Miyuki; Morikawa, Kazuya; Higashide, Masato; Maruyama, Atsushi; Inose, Yumiko; Matoba, Kimio; Toh, Hidehiro; Kuhara, Satoru; Hattori, Masahira; Ohta, Toshiko

    2005-09-13

    Staphylococcus saprophyticus is a uropathogenic Staphylococcus frequently isolated from young female outpatients presenting with uncomplicated urinary tract infections. We sequenced the whole genome of S. saprophyticus type strain ATCC 15305, which harbors a circular chromosome of 2,516,575 bp with 2,446 ORFs and two plasmids. Comparative genomic analyses with the strains of two other species, Staphylococcus aureus and Staphylococcus epidermidis, as well as experimental data, revealed the following characteristics of the S. saprophyticus genome. S. saprophyticus does not possess any virulence factors found in S. aureus, such as coagulase, enterotoxins, exoenzymes, and extracellular matrix-binding proteins, although it does have a remarkable paralog expansion of transport systems related to highly variable ion contents in the urinary environment. A further unique feature is that only a single ORF is predictable as a cell wall-anchored protein, and it shows positive hemagglutination and adherence to human bladder cell associated with initial colonization in the urinary tract. It also shows significantly high urease activity in S. saprophyticus. The uropathogenicity of S. saprophyticus can be attributed to its genome that is needed for its survival in the human urinary tract by means of novel cell wall-anchored adhesin and redundant uro-adaptive transport systems, together with urease.

  17. Chromosome-specific sequencing reveals an extensive dispensable genome component in wheat

    PubMed Central

    Liu, Miao; Stiller, Jiri; Holušová, Kateřina; Vrána, Jan; Liu, Dengcai; Doležel, Jaroslav; Liu, Chunji

    2016-01-01

    The hexaploid wheat genotype Chinese Spring (CS) has been used worldwide as the reference base for wheat genetics and genomics, and significant resources have been used by the international community to generate a reference wheat genome based on this genotype. By sequencing flow-sorted 3B chromosome from a hexaploid wheat genotype CRNIL1A and comparing the obtained sequences with those available for CS, we detected that a large number of sequences in the former were missing in the latter. If the distribution of such sequences in the hexaploid wheat genome is random, CRNILA sequences missing in CS could be as much as 159.3 Mb even if only fragments of 50 bp or longer were considered. Analysing RNA sequences available in the public domains also revealed that dispensable genes are common in hexaploid wheat. Together with those extensive intra- and interchromosomal rearrangements in CS, the existence of such dispensable genes is another factor highlighting potential issues with the use of reference genomes in various studies. Strong deviation in distributions of these dispensable sequences among genotypes with different geographical origins provided the first evidence indicating that they could be associated with adaptation in wheat. PMID:27821854

  18. A map of rice genome variation reveals the origin of cultivated rice.

    PubMed

    Huang, Xuehui; Kurata, Nori; Wei, Xinghua; Wang, Zi-Xuan; Wang, Ahong; Zhao, Qiang; Zhao, Yan; Liu, Kunyan; Lu, Hengyun; Li, Wenjun; Guo, Yunli; Lu, Yiqi; Zhou, Congcong; Fan, Danlin; Weng, Qijun; Zhu, Chuanrang; Huang, Tao; Zhang, Lei; Wang, Yongchun; Feng, Lei; Furuumi, Hiroyasu; Kubo, Takahiko; Miyabayashi, Toshie; Yuan, Xiaoping; Xu, Qun; Dong, Guojun; Zhan, Qilin; Li, Canyang; Fujiyama, Asao; Toyoda, Atsushi; Lu, Tingting; Feng, Qi; Qian, Qian; Li, Jiayang; Han, Bin

    2012-10-25

    Crop domestications are long-term selection experiments that have greatly advanced human civilization. The domestication of cultivated rice (Oryza sativa L.) ranks as one of the most important developments in history. However, its origins and domestication processes are controversial and have long been debated. Here we generate genome sequences from 446 geographically diverse accessions of the wild rice species Oryza rufipogon, the immediate ancestral progenitor of cultivated rice, and from 1,083 cultivated indica and japonica varieties to construct a comprehensive map of rice genome variation. In the search for signatures of selection, we identify 55 selective sweeps that have occurred during domestication. In-depth analyses of the domestication sweeps and genome-wide patterns reveal that Oryza sativa japonica rice was first domesticated from a specific population of O. rufipogon around the middle area of the Pearl River in southern China, and that Oryza sativa indica rice was subsequently developed from crosses between japonica rice and local wild rice as the initial cultivars spread into South East and South Asia. The domestication-associated traits are analysed through high-resolution genetic mapping. This study provides an important resource for rice breeding and an effective genomics approach for crop domestication research.

  19. Genetic variation architecture of mitochondrial genome reveals the differentiation in Korean landrace and weedy rice.

    PubMed

    Tong, Wei; He, Qiang; Park, Yong-Jin

    2017-03-03

    Mitochondrial genome variations have been detected despite the overall conservation of this gene content, which has been valuable for plant population genetics and evolutionary studies. Here, we describe mitochondrial variation architecture and our performance of a phylogenetic dissection of Korean landrace and weedy rice. A total of 4,717 variations across the mitochondrial genome were identified adjunct with 10 wild rice. Genetic diversity assessment revealed that wild rice has higher nucleotide diversity than landrace and/or weedy, and landrace rice has higher diversity than weedy rice. Genetic distance was suggestive of a high level of breeding between landrace and weedy rice, and the landrace showing a closer association with wild rice than weedy rice. Population structure and principal component analyses showed no obvious difference in the genetic backgrounds of landrace and weedy rice in mitochondrial genome level. Phylogenetic, population split, and haplotype network evaluations were suggestive of independent origins of the indica and japonica varieties. The origin of weedy rice is supposed to be more likely from cultivated rice rather than from wild rice in mitochondrial genome level.

  20. Genetic variation architecture of mitochondrial genome reveals the differentiation in Korean landrace and weedy rice

    PubMed Central

    Tong, Wei; He, Qiang; Park, Yong-Jin

    2017-01-01

    Mitochondrial genome variations have been detected despite the overall conservation of this gene content, which has been valuable for plant population genetics and evolutionary studies. Here, we describe mitochondrial variation architecture and our performance of a phylogenetic dissection of Korean landrace and weedy rice. A total of 4,717 variations across the mitochondrial genome were identified adjunct with 10 wild rice. Genetic diversity assessment revealed that wild rice has higher nucleotide diversity than landrace and/or weedy, and landrace rice has higher diversity than weedy rice. Genetic distance was suggestive of a high level of breeding between landrace and weedy rice, and the landrace showing a closer association with wild rice than weedy rice. Population structure and principal component analyses showed no obvious difference in the genetic backgrounds of landrace and weedy rice in mitochondrial genome level. Phylogenetic, population split, and haplotype network evaluations were suggestive of independent origins of the indica and japonica varieties. The origin of weedy rice is supposed to be more likely from cultivated rice rather than from wild rice in mitochondrial genome level. PMID:28256554

  1. Single Nucleus Genome Sequencing Reveals High Similarity among Nuclei of an Endomycorrhizal Fungus

    PubMed Central

    Zhang, Zhonghua; Ivanov, Sergey; Saunders, Diane G. O.; Mu, Desheng; Pang, Erli; Cao, Huifen; Cha, Hwangho; Lin, Tao; Zhou, Qian; Shang, Yi; Li, Ying; Sharma, Trupti; van Velzen, Robin; de Ruijter, Norbert; Aanen, Duur K.; Win, Joe; Kamoun, Sophien; Bisseling, Ton; Geurts, René; Huang, Sanwen

    2014-01-01

    Nuclei of arbuscular endomycorrhizal fungi have been described as highly diverse due to their asexual nature and absence of a single cell stage with only one nucleus. This has raised fundamental questions concerning speciation, selection and transmission of the genetic make-up to next generations. Although this concept has become textbook knowledge, it is only based on studying a few loci, including 45S rDNA. To provide a more comprehensive insight into the genetic makeup of arbuscular endomycorrhizal fungi, we applied de novo genome sequencing of individual nuclei of Rhizophagus irregularis. This revealed a surprisingly low level of polymorphism between nuclei. In contrast, within a nucleus, the 45S rDNA repeat unit turned out to be highly diverged. This finding demystifies a long-lasting hypothesis on the complex genetic makeup of arbuscular endomycorrhizal fungi. Subsequent genome assembly resulted in the first draft reference genome sequence of an arbuscular endomycorrhizal fungus. Its length is 141 Mbps, representing over 27,000 protein-coding gene models. We used the genomic sequence to reinvestigate the phylogenetic relationships of Rhizophagus irregularis with other fungal phyla. This unambiguously demonstrated that Glomeromycota are more closely related to Mucoromycotina than to its postulated sister Dikarya. PMID:24415955

  2. Comparative Analysis of 35 Basidiomycete Genomes Reveals Diversity and Uniqueness of the Phylum

    SciTech Connect

    Riley, Robert; Salamov, Asaf; Otillar, Robert; Fagnan, Kirsten; Boussau, Bastien; Brown, Daren; Henrissat, Bernard; Levasseur, Anthony; Held, Benjamin; Nagy, Laszlo; Floudas, Dimitris; Morin, Emmanuelle; Manning, Gerard; Baker, Scott; Martin, Francis; Blanchette, Robert; Hibbett, David; Grigoriev, Igor V.

    2013-03-11

    Fungi of the phylum Basidiomycota (basidiomycetes), make up some 37percent of the described fungi, and are important in forestry, agriculture, medicine, and bioenergy. This diverse phylum includes symbionts, pathogens, and saprobes including wood decaying fungi. To better understand the diversity of this phylum we compared the genomes of 35 basidiomycete fungi including 6 newly sequenced genomes. The genomes of basidiomycetes span extremes of genome size, gene number, and repeat content. A phylogenetic tree of Basidiomycota was generated using the Phyldog software, which uses all available protein sequence data to simultaneously infer gene and species trees. Analysis of core genes reveals that some 48percent of basidiomycete proteins are unique to the phylum with nearly half of those (22percent) comprising proteins found in only one organism. Phylogenetic patterns of plant biomass-degrading genes suggest a continuum rather than a sharp dichotomy between the white rot and brown rot modes of wood decay among the members of Agaricomycotina subphylum. There is a correlation of the profile of certain gene families to nutritional mode in Agaricomycotina. Based on phylogenetically-informed PCA analysis of such profiles, we predict that that Botryobasidium botryosum and Jaapia argillacea have properties similar to white rot species, although neither has liginolytic class II fungal peroxidases. Furthermore, we find that both fungi exhibit wood decay with white rot-like characteristics in growth assays. Analysis of the rate of discovery of proteins with no or few homologs suggests the high value of continued sequencing of basidiomycete fungi.

  3. Draft genome comparison of representatives of the three dominant genotype groups of dairy Bacillus licheniformis strains.

    PubMed

    Dhakal, Rajat; Seale, R Brent; Deeth, Hilton C; Craven, Heather; Turner, Mark S

    2014-06-01

    The spore-forming bacterium Bacillus licheniformis is a common contaminant of milk and milk products. Strains of this species isolated from dairy products can be differentiated into three major groups, namely, G, F1, and F2, using random amplification of polymorphic DNA (RAPD) analysis; however, little is known about the genomic differences between these groups and the identity of the fragments that make up their RAPD profiles. In this work we obtained high-quality draft genomes of representative strains from each of the three RAPD groups (designated strain G-1, strain F1-1, and strain F2-1) and compared them to each other and to B. licheniformis ATCC 14580 and Bacillus subtilis 168. Whole-genome comparison and multilocus sequence typing revealed that strain G-1 contains significant sequence variability and belongs to a lineage distinct from the group F strains. Strain G-1 was found to contain genes coding for a type I restriction modification system, urease production, and bacitracin synthesis, as well as the 8-kbp plasmid pFL7, and these genes were not present in strains F1-1 and F2-1. In agreement with this, all isolates of group G, but no group F isolates, were found to possess urease activity and antimicrobial activity against Micrococcus. Identification of RAPD band sequences revealed that differences in the RAPD profiles were due to differences in gene lengths, 3' ends of predicted primer binding sites, or gene presence or absence. This work provides a greater understanding of the phylogenetic and phenotypic differences observed within the B. licheniformis species.

  4. Draft Genome Comparison of Representatives of the Three Dominant Genotype Groups of Dairy Bacillus licheniformis Strains

    PubMed Central

    Dhakal, Rajat; Seale, R. Brent; Deeth, Hilton C.; Craven, Heather

    2014-01-01

    The spore-forming bacterium Bacillus licheniformis is a common contaminant of milk and milk products. Strains of this species isolated from dairy products can be differentiated into three major groups, namely, G, F1, and F2, using random amplification of polymorphic DNA (RAPD) analysis; however, little is known about the genomic differences between these groups and the identity of the fragments that make up their RAPD profiles. In this work we obtained high-quality draft genomes of representative strains from each of the three RAPD groups (designated strain G-1, strain F1-1, and strain F2-1) and compared them to each other and to B. licheniformis ATCC 14580 and Bacillus subtilis 168. Whole-genome comparison and multilocus sequence typing revealed that strain G-1 contains significant sequence variability and belongs to a lineage distinct from the group F strains. Strain G-1 was found to contain genes coding for a type I restriction modification system, urease production, and bacitracin synthesis, as well as the 8-kbp plasmid pFL7, and these genes were not present in strains F1-1 and F2-1. In agreement with this, all isolates of group G, but no group F isolates, were found to possess urease activity and antimicrobial activity against Micrococcus. Identification of RAPD band sequences revealed that differences in the RAPD profiles were due to differences in gene lengths, 3′ ends of predicted primer binding sites, or gene presence or absence. This work provides a greater understanding of the phylogenetic and phenotypic differences observed within the B. licheniformis species. PMID:24657871

  5. Genomic Analysis and Comparison of Two Gonorrhea Outbreaks

    PubMed Central

    Dordel, Janina; Whittles, Lilith K.; Collins, Caitlin; Bilek, Nicole; Bishop, Cynthia J.; White, Peter J.; Aanensen, David M.; Bentley, Stephen D.; Spratt, Brian G.

    2016-01-01

    ABSTRACT Gonorrhea is a sexually transmitted disease causing growing concern, with a substantial increase in reported incidence over the past few years in the United Kingdom and rising levels of resistance to a wide range of antibiotics. Understanding its epidemiology is therefore of major biomedical importance, not only on a population scale but also at the level of direct transmission. However, the molecular typing techniques traditionally used for gonorrhea infections do not provide sufficient resolution to investigate such fine-scale patterns. Here we sequenced the genomes of 237 isolates from two local collections of isolates from Sheffield and London, each of which was resolved into a single type using traditional methods. The two data sets were selected to have different epidemiological properties: the Sheffield data were collected over 6 years from a predominantly heterosexual population, whereas the London data were gathered within half a year and strongly associated with men who have sex with men. Based on contact tracing information between individuals in Sheffield, we found that transmission is associated with a median time to most recent common ancestor of 3.4 months, with an upper bound of 8 months, which we used as a criterion to identify likely transmission links in both data sets. In London, we found that transmission happened predominantly between individuals of similar age, sexual orientation, and location and also with the same HIV serostatus, which may reflect serosorting and associated risk behaviors. Comparison of the two data sets suggests that the London epidemic involved about ten times more cases than the Sheffield outbreak. PMID:27353752

  6. Evolution and phylogeny of the mud shrimps (Crustacea: Decapoda) revealed from complete mitochondrial genomes

    PubMed Central

    2012-01-01

    Background The evolutionary history and relationships of the mud shrimps (Crustacea: Decapoda: Gebiidea and Axiidea) are contentious, with previous attempts revealing mixed results. The mud shrimps were once classified in the infraorder Thalassinidea. Recent molecular phylogenetic analyses, however, suggest separation of the group into two individual infraorders, Gebiidea and Axiidea. Mitochondrial (mt) genome sequence and structure can be especially powerful in resolving higher systematic relationships that may offer new insights into the phylogeny of the mud shrimps and the other decapod infraorders, and test the hypothesis of dividing the mud shrimps into two infraorders. Results We present the complete mitochondrial genome sequences of five mud shrimps, Austinogebia edulis, Upogebia major, Thalassina kelanang (Gebiidea), Nihonotrypaea thermophilus and Neaxius glyptocercus (Axiidea). All five genomes encode a standard set of 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and a putative control region. Except for T. kelanang, mud shrimp mitochondrial genomes exhibited rearrangements and novel patterns compared to the pancrustacean ground pattern. Each of the two Gebiidea species (A. edulis and U. major) and two Axiidea species (N. glyptocercus and N. thermophiles) share unique gene order specific to their infraorders and analyses further suggest these two derived gene orders have evolved independently. Phylogenetic analyses based on the concatenated nucleotide and amino acid sequences of 13 protein-coding genes indicate the possible polyphyly of mud shrimps, supporting the division of the group into two infraorders. However, the infraordinal relationships among the Gebiidea and Axiidea, and other reptants are poorly resolved. The inclusion of mt genome from more taxa, in particular the reptant infraorders Polychelida and Glypheidea is required in further analysis. Conclusions Phylogenetic analyses on the mt genome sequences and the

  7. Genome-wide investigation reveals high evolutionary rates in annual model plants

    PubMed Central

    2010-01-01

    Background Rates of molecular evolution vary widely among species. While significant deviations from molecular clock have been found in many taxa, effects of life histories on molecular evolution are not fully understood. In plants, annual/perennial life history traits have long been suspected to influence the evolutionary rates at the molecular level. To date, however, the number of genes investigated on this subject is limited and the conclusions are mixed. To evaluate the possible heterogeneity in evolutionary rates between annual and perennial plants at the genomic level, we investigated 85 nuclear housekeeping genes, 10 non-housekeeping families, and 34 chloroplast genes using the genomic data from model plants including Arabidopsis thaliana and Medicago truncatula for annuals and grape (Vitis vinifera) and popular (Populus trichocarpa) for perennials. Results According to the cross-comparisons among the four species, 74-82% of the nuclear genes and 71-97% of the chloroplast genes suggested higher rates of molecular evolution in the two annuals than those in the two perennials. The significant heterogeneity in evolutionary rate between annuals and perennials was consistently found both in nonsynonymous sites and synonymous sites. While a linear correlation of evolutionary rates in orthologous genes between species was observed in nonsynonymous sites, the correlation was weak or invisible in synonymous sites. This tendency was clearer in nuclear genes than in chloroplast genes, in which the overall evolutionary rate was small. The slope of the regression line was consistently lower than unity, further confirming the higher evolutionary rate in annuals at the genomic level. Conclusions The higher evolutionary rate in annuals than in perennials appears to be a universal phenomenon both in nuclear and chloroplast genomes in the four dicot model plants we investigated. Therefore, such heterogeneity in evolutionary rate should result from factors that have genome

  8. Genomic Sequence Comparisons, 1987-2003 Final Report

    SciTech Connect

    George M. Church

    2004-07-29

    This project was to develop new DNA sequencing and RNA and protein quantitation methods and related genome annotation tools. The project began in 1987 with the development of multiplex sequencing (published in Science in 1988), and one of the first automated sequencing methods. This lead to the first commercial genome sequence in 1994 and to the establishment of the main commercial participants (GTC then Agencourt) in the public DOE/NIH genome project. In collaboration with GTC we contributed to one of the first complete DOE genome sequences, in 1997, that of Methanobacterium thermoautotropicum, a species of great relevance to energy-rich gas production.

  9. Correction: Comparative analysis of fungal genomes reveals different plant cell wall degrading capacity in fungi

    PubMed Central

    2014-01-01

    Abstract The version of this article published in BMC Genomics 2013, 14: 274, contains 9 unpublished genomes (Botryobasidium botryosum, Gymnopus luxurians, Hypholoma sublateritium, Jaapia argillacea, Hebeloma cylindrosporum, Conidiobolus coronatus, Laccaria amethystina, Paxillus involutus, and P. rubicundulus) downloaded from JGI website. In this correction, we removed these genomes after discussion with editors and data producers whom we should have contacted before downloading these genomes. Removing these data did not alter the principle results and conclusions of our original work. The relevant Figures 1, 2, 3, 4 and 6; and Table 1 have been revised. Additional files 1, 3, 4, and 5 were also revised. We would like to apologize for any confusion or inconvenience this may have caused. Background Fungi produce a variety of carbohydrate activity enzymes (CAZymes) for the degradation of plant polysaccharide materials to facilitate infection and/or gain nutrition. Identifying and comparing CAZymes from fungi with different nutritional modes or infection mechanisms may provide information for better understanding of their life styles and infection models. To date, over hundreds of fungal genomes are publicly available. However, a systematic comparative analysis of fungal CAZymes across the entire fungal kingdom has not been reported. Results In this study, we systemically identified glycoside hydrolases (GHs), polysaccharide lyases (PLs), carbohydrate esterases (CEs), and glycosyltransferases (GTs) as well as carbohydrate-binding modules (CBMs) in the predicted proteomes of 94 representative fungi from Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota. Comparative analysis of these CAZymes that play major roles in plant polysaccharide degradation revealed that fungi exhibit tremendous diversity in the number and variety of CAZymes. Among them, some families of GHs and CEs are the most prevalent CAZymes that are distributed in all of the fungi analyzed

  10. Analysis of segmental duplications reveals a distinct pattern of continuation-of-synteny between human and mouse genomes.

    PubMed

    Mehan, Michael R; Almonte, Maricel; Slaten, Erin; Freimer, Nelson B; Rao, P Nagesh; Ophoff, Roel A

    2007-03-01

    About 5% of the human genome consists of large-scale duplicated segments of almost identical sequences. Segmental duplications (SDs) have been proposed to be involved in non-allelic homologous recombination leading to recurrent genomic variation and disease. It has also been suggested that these SDs are associated with syntenic rearrangements that have shaped the human genome. We have analyzed 14 members of a single family of closely related SDs in the human genome, some of which are associated with common inversion polymorphisms at chromosomes 8p23 and 4p16. Comparative analysis with the mouse genome revealed syntenic inversions for these two human polymorphic loci. In addition, 12 of the 14 SDs, while absent in the mouse genome, occur at the breaks of synteny; suggesting a non-random involvement of these sequences in genome evolution. Furthermore, we observed a syntenic familial relationship between 8 and 12 breakpoint-loci, where broken synteny that ends at one family member resumes at another, even across different chromosomes. Subsequent genome-wide assessment revealed that this relationship, which we named continuation-of-synteny, is not limited to the 8p23 family and occurs 46 times in the human genome with high frequency at specific chromosomes. Our analysis supports a non-random breakage model of genomic evolution with an active involvement of segmental duplications for specific regions of the human genome.

  11. The Genome Sequence of Methanohalophilus mahii SLP T Reveals Differences in the Energy Metabolism among Members of the Methanosarcinaceae Inhabiting Freshwater and Saline Environments

    DOE PAGES

    Spring, Stefan; Scheuner, Carmen; Lapidus, Alla; ...

    2010-01-01

    Methanohalophilus mahii is the type species of the genus Methanohalophilus , which currently comprises three distinct species with validly published names. Mhp. mahii represents moderately halophilic methanogenic archaea with a strictly methylotrophic metabolism. The type strain SLP T was isolated from hypersaline sediments collected from the southern arm of Great Salt Lake, Utah. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,012,424 bp genome is a single replicon with 2032 protein-coding and 63 RNA genes and part of the Genomic Encyclopedia of Bacteria and Archaea project. A comparison of themore » reconstructed energy metabolism in the halophilic species Mhp. mahii with other representatives of the Methanosarcinaceae reveals some interesting differences to freshwater species.« less

  12. The Genome Sequence of Methanohalophilus mahii SLPT Reveals Differences in the Energy Metabolism among Members of the Methanosarcinaceae Inhabiting Freshwater and Saline Environments

    SciTech Connect

    Spring, Stefan; Scheuner, Carmen; Lapidus, Alla L.; Lucas, Susan; Glavina Del Rio, Tijana; Tice, Hope; Copeland, A; Cheng, Jan-Fang; Chen, Feng; Nolan, Matt; Saunders, Elizabeth H; Pitluck, Sam; Liolios, Konstantinos; Ivanova, N; Mavromatis, K; Lykidis, A; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam L; Hauser, Loren John; Chang, Yun-Juan; Jeffries, Cynthia; Goodwin, Lynne A.; Detter, J. Chris; Brettin, Thomas S; Rohde, Manfred; Goker, Markus; Woyke, Tanja; Bristow, James; Eisen, Jonathan; Markowitz, Victor; Hugenholtz, Philip; Kyrpides, Nikos C; Klenk, Hans-Peter

    2010-01-01

    Methanohalophilus mahii is the type species of the genus Methanohalophilus, which currently comprises three distinct species with validly published names. Mhp. mahii represents moderately halophilic methanogenic archaea with a strictly methylotrophic metabolism. The type strain SLPT was isolated from hypersaline sediments collected from the southern arm of Great Salt Lake, Utah. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,012,424 bp genome is a single replicon with 2032 protein-coding and 63 RNA genes and part of the Genomic Encyclopedia of Bacteria and Archaea project. A comparison of the reconstructed energy metabolism in the halophilic species Mhp. mahii with other representatives of the Methanosarcinaceae reveals some interesting differences to freshwater species.

  13. The Genome Sequence of Methanohalophilus mahii SLPT Reveals Differences in the Energy Metabolism among Members of the Methanosarcinaceae Inhabiting Freshwater and Saline Environments

    SciTech Connect

    Spring, Stefan; Scheuner, Carmen; Lapidus, Alla L.; Lucas, Susan; Glavina Del Rio, Tijana; Tice, Hope; Copeland, A; Cheng, Jan-Fang; Chen, Feng; Nolan, Matt; Saunders, Elizabeth H; Pitluck, Samuel; Liolios, Konstantinos; Ivanova, N; Mavromatis, K; Lykidis, A; Pati, Amrita; Chen, Amy; Palaniappan, Krishna; Land, Miriam L; Hauser, Loren John; Chang, Yun-Juan; Jeffries, Cynthia D; Goodwin, Lynne A.; Detter, J. Chris; Brettin, Thomas S; Rohde, Manfred; Goker, Markus; Woyke, Tanja; Bristow, James; Eisen, Jonathan; Markowitz, Victor; Hugenholtz, Philip; Kyrpidis, Nikos C; Klenk, Hans-Peter

    2010-12-01

    Methanohalophilus mahii is the type species of the genus Methanohalophilus, which currently comprises three distinct species with validly published names. Mhp. mahii represents moderately halophilic methanogenic archaea with a strictly methylotrophic metabolism. The type strain SLPT was isolated from hypersaline sediments collected from the southern arm of Great Salt Lake, Utah. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,012,424 bp genome is a single replicon with 2032 protein-coding and 63 RNA genes and part of the Genomic Encyclopedia of Bacteria and Archaea project. A comparison of the reconstructed energy metabolism in the halophilic species Mhp. mahii with other representatives of the Methanosarcinaceae reveals some interesting differences to freshwater species.

  14. Identification of Ohnolog Genes Originating from Whole Genome Duplication in Early Vertebrates, Based on Synteny Comparison across Multiple Genomes.

    PubMed

    Singh, Param Priya; Arora, Jatin; Isambert, Hervé

    2015-07-01

    Whole genome duplications (WGD) have now been firmly established in all major eukaryotic kingdoms. In particular, all vertebrates descend from two rounds of WGDs, that occurred in their jawless ancestor some 500 MY ago. Paralogs retained from WGD, also coined 'ohnologs' after Susumu Ohno, have been shown to be typically associated with development, signaling and gene regulation. Ohnologs, which amount to about 20 to 35% of genes in the human genome, have also been shown to be prone to dominant deleterious mutations and frequently implicated in cancer and genetic diseases. Hence, identifying ohnologs is central to better understand the evolution of vertebrates and their susceptibility to genetic diseases. Early computational analyses to identify vertebrate ohnologs relied on content-based synteny comparisons between the human genome and a single invertebrate outgroup genome or within the human genome itself. These approaches are thus limited by lineage specific rearrangements in individual genomes. We report, in this study, the identification of vertebrate ohnologs based on the quantitative assessment and integration of synteny conservation between six amniote vertebrates and six invertebrate outgroups. Such a synteny comparison across multiple genomes is shown to enhance the statistical power of ohnolog identification in vertebrates compared to earlier approaches, by overcoming lineage specific genome rearrangements. Ohnolog gene families can be browsed and downloaded for three statistical confidence levels or recompiled for specific, user-defined, significance criteria at http://ohnologs.curie.fr/. In the light of the importance of WGD on the genetic makeup of vertebrates, our analysis provides a useful resource for researchers interested in gaining further insights on vertebrate evolution and genetic diseases.

  15. Analysis of Adaptive Evolution in Lyssavirus Genomes Reveals Pervasive Diversifying Selection during Species Diversification

    PubMed Central

    Voloch, Carolina M.; Capellão, Renata T.; Mello, Beatriz; Schrago, Carlos G.

    2014-01-01

    Lyssavirus is a diverse genus of viruses that infect a variety of mammalian hosts, typically causing encephalitis. The evolution of this lineage, particularly the rabies virus, has been a focus of research because of the extensive occurrence of cross-species transmission, and the distinctive geographical patterns present throughout the diversification of these viruses. Although numerous studies have examined pattern-related questions concerning Lyssavirus evolution, analyses of the evolutionary processes acting on Lyssavirus diversification are scarce. To clarify the relevance of positive natural selection in Lyssavirus diversification, we conducted a comprehensive scan for episodic diversifying selection across all lineages and codon sites of the five coding regions in lyssavirus genomes. Although the genomes of these viruses are generally conserved, the glycoprotein (G), RNA-dependent RNA polymerase (L) and polymerase (P) genes were frequently targets of adaptive evolution during the diversification of the genus. Adaptive evolution is particularly manifest in the glycoprotein gene, which was inferred to have experienced the highest density of positively selected codon sites along branches. Substitutions in the L gene were found to be associated with the early diversification of phylogroups. A comparison between the number of positively selected sites inferred along the branches of RABV population branches and Lyssavirus intespecies branches suggested that the occurrence of positive selection was similar on the five coding regions of the genome in both groups. PMID:25415197

  16. The components of the Daphnia pulex immune system as revealed by complete genome sequencing

    PubMed Central

    McTaggart, Seanna J; Conlon, Claire; Colbourne, John K; Blaxter, Mark L; Little, Tom J

    2009-01-01

    Background Branchiopod crustaceans in the genus Daphnia are key model organisms for investigating interactions between genes and the environment. One major theme of research on Daphnia species has been the evolution of resistance to pathogens and parasites, but lack of knowledge of the Daphnia immune system has limited the study of immune responses. Here we provide a survey of the immune-related genome of D. pulex, derived from the newly completed genome sequence. Genes likely to be involved in innate immune responses were identified by comparison to homologues from other arthropods. For each candidate, the gene model was refined, and we conducted an analysis of sequence divergence from homologues from other taxa. Results and conclusion We found that some immune pathways, in particular the TOLL pathway, are fairly well conserved between insects and Daphnia, while other elements, in particular antimicrobial peptides, could not be recovered from the genome sequence. We also found considerable variation in gene family copy number when comparing Daphnia to insects and present phylogenetic analyses to shed light on the evolution of a range of conserved immune gene families. PMID:19386092

  17. Evolution of genome organizations of squirrels (Sciuridae) revealed by cross-species chromosome painting.

    PubMed

    Li, Tangliang; O'Brien, Patricia C M; Biltueva, Larisa; Fu, Beiyuan; Wang, Jinhuan; Nie, Wenhui; Ferguson-Smith, Malcolm A; Graphodatsky, Alexander S; Yang, Fengtang

    2004-01-01

    With complete sets of chromosome-specific painting probes derived from flow-sorted chromosomes of human and grey squirrel (Sciurus carolinensis), the whole genome homologies between human and representatives of tree squirrels (Sciurus carolinensis, Callosciurus erythraeus), flying squirrels (Petaurista albiventer) and chipmunks (Tamias sibiricus) have been defined by cross-species chromosome painting. The results show that, unlike the highly rearranged karyotypes of mouse and rat, the karyotypes of squirrels are highly conserved. Two methods have been used to reconstruct the genome phylogeny of squirrels with the laboratory rabbit (Oryctolagus cuniculus) as the out-group: (1) phylogenetic analysis by parsimony using chromosomal characters identified by comparative cytogenetic approaches; (2) mapping the genome rearrangements onto recently published sequence-based molecular trees. Our chromosome painting results, in combination with molecular data, show that flying squirrels are phylogenetically close to New World tree squirrels. Chromosome painting and G-banding comparisons place chipmunks (Tamias sibiricus ), with a derived karyotype, outside the clade comprising tree and flying squirrels. The superorder Glires (orde Rodentia + order Lagomorpha) is firmly supported by two conserved syntenic associations between human chromosomes 1 and 10p homologues, and between 9 and 11 homologues.

  18. Genomic analysis reveals versatile heterotrophic capacity of a potentially symbiotic sulfur-oxidizing bacterium in sponge.

    PubMed

    Tian, Ren-Mao; Wang, Yong; Bougouffa, Salim; Gao, Zhao-Ming; Cai, Lin; Bajic, Vladimir; Qian, Pei-Yuan

    2014-11-01

    Sulfur-reducing bacteria (SRB) and sulfur-oxidizing bacteria (SOB) play essential roles in marine sponges. However, the detailed characteristics and physiology of the bacteria are largely unknown. Here, we present and analyse the first genome of sponge-associated SOB using a recently developed metagenomic binning strategy. The loss of transposase and virulence-associated genes and the maintenance of the ancient polyphosphate glucokinase gene suggested a stabilized SOB genome that might have coevolved with the ancient host during establishment of their association. Exclusive distribution in sponge, bacterial detoxification for the host (sulfide oxidation) and the enrichment for symbiotic characteristics (genes-encoding ankyrin) in the SOB genome supported the bacterial role as an intercellular symbiont. Despite possessing complete autotrophic sulfur oxidation pathways, the bacterium developed a much more versatile capacity for carbohydrate uptake and metabolism, in comparison with its closest relatives (Thioalkalivibrio) and to other representative autotrophs from the same order (Chromatiales). The ability to perform both autotrophic and heterotrophic metabolism likely results from the unstable supply of reduced sulfur in the sponge and is considered critical for the sponge-SOB consortium. Our study provides insights into SOB of sponge-specific clade with thioautotrophic and versatile heterotrophic metabolism relevant to its roles in the micro-environment of the sponge body.

  19. Genomic comparison of closely related Giant Viruses supports an accordion-like model of evolution.

    PubMed

    Filée, Jonathan

    2015-01-01

    Genome gigantism occurs so far in Phycodnaviridae and Mimiviridae (order Megavirales). Origin and evolution of these Giant Viruses (GVs) remain open questions. Interestingly, availability of a collection of closely related GV genomes enabling genomic comparisons offer the opportunity to better understand the different evolutionary forces acting on these genomes. Whole genome alignment for five groups of viruses belonging to the Mimiviridae and Phycodnaviridae families show that there is no trend of genome expansion or general tendency of genome contraction. Instead, GV genomes accumulated genomic mutations over the time with gene gains compensating the different losses. In addition, each lineage displays specific patterns of genome evolution. Mimiviridae (megaviruses and mimiviruses) and Chlorella Phycodnaviruses evolved mainly by duplications and losses of genes belonging to large paralogous families (including movements of diverse mobiles genetic elements), whereas Micromonas and Ostreococcus Phycodnaviruses derive most of their genetic novelties thought lateral gene transfers. Taken together, these data support an accordion-like model of evolution in which GV genomes have undergone successive steps of gene gain and gene loss, accrediting the hypothesis that genome gigantism appears early, before the diversification of the different GV lineages.

  20. Genomic comparison of closely related Giant Viruses supports an accordion-like model of evolution

    PubMed Central

    Filée, Jonathan

    2015-01-01

    Genome gigantism occurs so far in Phycodnaviridae and Mimiviridae (order Megavirales). Origin and evolution of these Giant Viruses (GVs) remain open questions. Interestingly, availability of a collection of closely related GV genomes enabling genomic comparisons offer the opportunity to better understand the different evolutionary forces acting on these genomes. Whole genome alignment for five groups of viruses belonging to the Mimiviridae and Phycodnaviridae families show that there is no trend of genome expansion or general tendency of genome contraction. Instead, GV genomes accumulated genomic mutations over the time with gene gains compensating the different losses. In addition, each lineage displays specific patterns of genome evolution. Mimiviridae (megaviruses and mimiviruses) and Chlorella Phycodnaviruses evolved mainly by duplications and losses of genes belonging to large paralogous families (including movements of diverse mobiles genetic elements), whereas Micromonas and Ostreococcus Phycodnaviruses derive most of their genetic novelties thought lateral gene transfers. Taken together, these data support an accordion-like model of evolution in which GV genomes have undergone successive steps of gene gain and gene loss, accrediting the hypothesis that genome gigantism appears early, before the diversification of the different GV lineages. PMID:26136734

  1. Oil Accumulation by the Oleaginous Diatom Fistulifera solaris as Revealed by the Genome and Transcriptome

    PubMed Central

    Veluchamy, Alaguraj; Tanaka, Michihiro; Abida, Heni; Maréchal, Eric; Bowler, Chris; Muto, Masaki; Sunaga, Yoshihiko; Tanaka, Masayoshi; Taniguchi, Takeaki; Fukuda, Yorikane; Nemoto, Michiko; Matsumoto, Mitsufumi; Wong, Pui Shan; Aburatani, Sachiyo; Fujibuchi, Wataru

    2015-01-01

    Oleaginous photosynthetic organisms such as microalgae are promising sources for biofuel production through the generation of carbon-neutral sustainable energy. However, the metabolic mechanisms driving high-rate lipid production in these oleaginous organisms remain unclear, thus impeding efforts to improve productivity through genetic modifications. We analyzed the genome and transcriptome of the oleaginous diatom Fistulifera solaris JPCC DA0580. Next-generation sequencing technology provided evidence of an allodiploid genome structure, suggesting unorthodox molecular evolutionary and genetic regulatory systems for reinforcing metabolic efficiencies. Although major metabolic pathways were shared with nonoleaginous diatoms, transcriptome analysis revealed unique expression patterns, such as concomitant upregulation of fatty acid/triacylglycerol biosynthesis and fatty acid degradation (β-oxidation) in concert with ATP production. This peculiar pattern of gene expression may account for the simultaneous growth and oil accumulation phenotype and may inspire novel biofuel production technology based on this oleaginous microalga. PMID:25634988

  2. The complete genome sequence of Chromobacterium violaceum reveals remarkable and exploitable bacterial adaptability

    PubMed Central

    2003-01-01

    Chromobacterium violaceum is one of millions of species of free-living microorganisms that populate the soil and water in the extant areas of tropical biodiversity around the world. Its complete genome sequence reveals (i) extensive alternative pathways for energy generation, (ii) ≈500 ORFs for transport-related proteins, (iii) complex and extensive systems for stress adaptation and motility, and (iv) widespread utilization of quorum sensing for control of inducible systems, all of which underpin the versatility and adaptability of the organism. The genome also contains extensive but incomplete arrays of ORFs coding for proteins associated with mammalian pathogenicity, possibly involved in the occasional but often fatal cases of human C. violaceum infection. There is, in addition, a series of previously unknown but important enzymes and secondary metabolites including paraquat-inducible proteins, drug and heavy-metal-resistance proteins, multiple chitinases, and proteins for the detoxification of xenobiotics that may have biotechnological applications. PMID:14500782

  3. Genomic analysis of hybrid rice varieties reveals numerous superior alleles that contribute to heterosis.

    PubMed

    Huang, Xuehui; Yang, Shihua; Gong, Junyi; Zhao, Yan; Feng, Qi; Gong, Hao; Li, Wenjun; Zhan, Qilin; Cheng, Benyi; Xia, Junhui; Chen, Neng; Hao, Zhongna; Liu, Kunyan; Zhu, Chuanrang; Huang, Tao; Zhao, Qiang; Zhang, Lei; Fan, Danlin; Zhou, Congcong; Lu, Yiqi; Weng, Qijun; Wang, Zi-Xuan; Li, Jiayang; Han, Bin

    2015-02-05

    Exploitation of heterosis is one of the most important applications of genetics in agriculture. However, the genetic mechanisms of heterosis are only partly understood, and a global view of heterosis from a representative number of hybrid combinations is lacking. Here we develop an integrated genomic approach to construct a genome map for 1,495 elite hybrid rice varieties and their inbred parental lines. We investigate 38 agronomic traits and identify 130 associated loci. In-depth analyses of the effects of heterozygous genotypes reveal that there are only a few loci with strong overdominance effects in hybrids, but a strong correlation is observed between the yield and the number of superior alleles. While most parental inbred lines have only a small number of superior alleles, high-yielding hybrid varieties have several. We conclude that the accumulation of numerous rare superior alleles with positive dominance is an important contributor to the heterotic phenomena.

  4. Bifidobacterium asteroides PRL2011 Genome Analysis Reveals Clues for Colonization of the Insect Gut

    PubMed Central

    Bottacini, Francesca; Milani, Christian; Turroni, Francesca; Sánchez, Borja; Foroni, Elena; Duranti, Sabrina; Serafini, Fausta; Viappiani, Alice; Strati, Francesco; Ferrarini, Alberto; Delledonne, Massimo; Henrissat, Bernard; Coutinho, Pedro; Fitzgerald, Gerald F.; Margolles, Abelardo; van Sinderen, Douwe; Ventura, Marco

    2012-01-01

    Bifidobacteria are known as anaerobic/microaerophilic and fermentative microorganisms, which commonly inhabit the gastrointestinal tract of various animals and insects. Analysis of the 2,167,301 bp genome of Bifidobacterium asteroides PRL2011, a strain isolated from the hindgut of Apis mellifera var. ligustica, commonly known as the honey bee, revealed its predicted capability for respiratory metabolism. Conservation of the latter gene clusters in various B. asteroides strains enforces the notion that respiration is a common metabolic feature of this ancient bifidobacterial species, which has been lost in currently known mammal-derived Bifidobacterium species. In fact, phylogenomic based analyses suggested an ancient origin of B. asteroides and indicates it as an ancestor of the genus Bifidobacterium. Furthermore, the B. asteroides PRL2011 genome encodes various enzymes for coping with toxic products that arise as a result of oxygen-mediated respiration. PMID:23028506

  5. The Chlamydomonas Genome Reveals the Evolution of Key Animal and Plant Functions

    SciTech Connect

    Merchant, Sabeeha S

    2007-04-09

    Chlamydomonas reinhardtii is a unicellular green alga whose lineage diverged from land plants over 1 billion years ago. It is a model system for studying chloroplast-based photosynthesis, as well as the structure, assembly, and function of eukaryotic flagella (cilia), which were inherited from the common ancestor of plants and animals, but lost in land plants. We sequenced the 120-megabase nuclear genome of Chlamydomonas and performed comparative phylogenomic analyses, identifying genes encoding uncharacterized proteins that are likely associated with the function and biogenesis of chloroplasts or eukaryotic flagella. Analyses of the Chlamydomonas genome advance our understanding of the ancestral eukaryotic cell, reveal previously unknown genes associated with photosynthetic and flagellar functions, and establish links between ciliopathy and the composition and function of flagella.

  6. Ancient mitochondrial genome reveals trace of prehistoric migration in the east Pamir by pastoralists.

    PubMed

    Ning, Chao; Gao, Shizhu; Deng, Boping; Zheng, Hongxiang; Wei, Dong; Lv, Haoze; Li, Hongjie; Song, Li; Wu, Yong; Zhou, Hui; Cui, Yinqiu

    2016-02-01

    The complete mitochondrial genome of one 700-year-old individual found in Tashkurgan, Xinjiang was target enriched and sequenced in order to shed light on the population history of Tashkurgan and determine the phylogenetic relationship of haplogroup U5a. The ancient sample was assigned to a subclade of haplogroup U5a2a1, which is defined by two rare and stable transversions at 16114A and 13928C. Phylogenetic analysis shows a distribution pattern for U5a2a that is indicative of an origin in the Volga-Ural region and exhibits a clear eastward geographical expansion that correlates with the pastoral culture also entering the Eurasian steppe. The haplogroup U5a2a present in the ancient Tashkurgan individual reveals prehistoric migration in the East Pamir by pastoralists. This study shows that studying an ancient mitochondrial genome is a useful approach for studying the evolutionary process and population history of Eastern Pamir.

  7. In vivo binding of PRDM9 reveals interactions with noncanonical genomic sites

    PubMed Central

    Grey, Corinne; Clément, Julie A.J.; Buard, Jérôme; Leblanc, Benjamin; Gut, Ivo; Gut, Marta; Duret, Laurent

    2017-01-01

    In mouse and human meiosis, DNA double-strand breaks (DSBs) initiate homologous recombination and occur at specific sites called hotspots. The localization of these sites is determined by the sequence-specific DNA binding domain of the PRDM9 histone methyl transferase. Here, we performed an extensive analysis of PRDM9 binding in mouse spermatocytes. Unexpectedly, we identified a noncanonical recruitment of PRDM9 to sites that lack recombination activity and the PRDM9 binding consensus motif. These sites include gene promoters, where PRDM9 is recruited in a DSB-dependent manner. Another subset reveals DSB-independent interactions between PRDM9 and genomic sites, such as the binding sites for the insulator protein CTCF. We propose that these DSB-independent sites result from interactions between hotspot-bound PRDM9 and genomic sequences located on the chromosome axis. PMID:28336543

  8. In vivo binding of PRDM9 reveals interactions with noncanonical genomic sites.

    PubMed

    Grey, Corinne; Clément, Julie A J; Buard, Jérôme; Leblanc, Benjamin; Gut, Ivo; Gut, Marta; Duret, Laurent; de Massy, Bernard

    2017-04-01

    In mouse and human meiosis, DNA double-strand breaks (DSBs) initiate homologous recombination and occur at specific sites called hotspots. The localization of these sites is determined by the sequence-specific DNA binding domain of the PRDM9 histone methyl transferase. Here, we performed an extensive analysis of PRDM9 binding in mouse spermatocytes. Unexpectedly, we identified a noncanonical recruitment of PRDM9 to sites that lack recombination activity and the PRDM9 binding consensus motif. These sites include gene promoters, where PRDM9 is recruited in a DSB-dependent manner. Another subset reveals DSB-independent interactions between PRDM9 and genomic sites, such as the binding sites for the insulator protein CTCF. We propose that these DSB-independent sites result from interactions between hotspot-bound PRDM9 and genomic sequences located on the chromosome axis.

  9. Genomes of cryptic chimpanzee Plasmodium species reveal key evolutionary events leading to human malaria.

    PubMed

    Sundararaman, Sesh A; Plenderleith, Lindsey J; Liu, Weimin; Loy, Dorothy E; Learn, Gerald H; Li, Yingying; Shaw, Katharina S; Ayouba, Ahidjo; Peeters, Martine; Speede, Sheri; Shaw, George M; Bushman, Frederic D; Brisson, Dustin; Rayner, Julian C; Sharp, Paul M; Hahn, Beatrice H

    2016-03-22

    African apes harbour at least six Plasmodium species of the subgenus Laverania, one of which gave rise to human Plasmodium falciparum. Here we use a selective amplification strategy to sequence the genome of chimpanzee parasites classified as Plasmodium reichenowi and Plasmodium gaboni based on the subgenomic fragments. Genome-wide analyses show that these parasites indeed represent distinct species, with no evidence of cross-species mating. Both P. reichenowi and P. gaboni are 10-fold more diverse than P. falciparum, indicating a very recent origin of the human parasite. We also find a remarkable Laverania-specific expansion of a multigene family involved in erythrocyte remodelling, and show that a short region on chromosome 4, which encodes two essential invasion genes, was horizontally transferred into a recent P. falciparum ancestor. Our results validate the selective amplification strategy for characterizing cryptic pathogen species, and reveal evolutionary events that likely predisposed the precursor of P. falciparum to colonize humans.

  10. Genomes of cryptic chimpanzee Plasmodium species reveal key evolutionary events leading to human malaria

    PubMed Central

    Sundararaman, Sesh A.; Plenderleith, Lindsey J.; Liu, Weimin; Loy, Dorothy E.; Learn, Gerald H.; Li, Yingying; Shaw, Katharina S.; Ayouba, Ahidjo; Peeters, Martine; Speede, Sheri; Shaw, George M.; Bushman, Frederic D.; Brisson, Dustin; Rayner, Julian C.; Sharp, Paul M.; Hahn, Beatrice H.

    2016-01-01

    African apes harbour at least six Plasmodium species of the subgenus Laverania, one of which gave rise to human Plasmodium falciparum. Here we use a selective amplification strategy to sequence the genome of chimpanzee parasites classified as Plasmodium reichenowi and Plasmodium gaboni based on the subgenomic fragments. Genome-wide analyses show that these parasites indeed represent distinct species, with no evidence of cross-species mating. Both P. reichenowi and P. gaboni are 10-fold more diverse than P. falciparum, indicating a very recent origin of the human parasite. We also find a remarkable Laverania-specific expansion of a multigene family involved in erythrocyte remodelling, and show that a short region on chromosome 4, which encodes two essential invasion genes, was horizontally transferred into a recent P. falciparum ancestor. Our results validate the selective amplification strategy for characterizing cryptic pathogen species, and reveal evolutionary events that likely predisposed the precursor of P. falciparum to colonize humans. PMID:27002652

  11. Comparative Genome Analysis Reveals Metabolic Versatility and Environmental Adaptations of Sulfobacillus thermosulfidooxidans Strain ST

    PubMed Central

    Guo, Xue; Yin, Huaqun; Liang, Yili; Hu, Qi; Zhou, Xishu; Xiao, Yunhua; Ma, Liyuan; Zhang, Xian; Qiu, Guanzhou; Liu, Xueduan

    2014-01-01

    The genus Sulfobacillus is a cohort of mildly thermophilic or thermotolerant acidophiles within the phylum Firmicutes and requires extremely acidic environments and hypersalinity for optimal growth. However, our understanding of them is still preliminary partly because few genome sequences are available. Here, the draft genome of Sulfobacillus thermosulfidooxidans strain ST was deciphered to obtain a comprehensive insight into the genetic content and to understand the cellular mechanisms necessary for its survival. Furthermore, the expressions of key genes related with iron and sulfur oxidation were verified by semi-quantitative RT-PCR analysis. The draft genome sequence of Sulfobacillus thermosulfidooxidans strain ST, which encodes 3225 predicted coding genes on a total length of 3,333,554 bp and a 48.35% G+C, revealed the high degree of heterogeneity with other Sulfobacillus species. The presence of numerous transposases, genomic islands and complete CRISPR/Cas defence systems testifies to its dynamic evolution consistent with the genome heterogeneity. As expected, S. thermosulfidooxidans encodes a suit of conserved enzymes required for the oxidation of inorganic sulfur compounds (ISCs). The model of sulfur oxidation in S. thermosulfidooxidans was proposed, which showed some different characteristics from the sulfur oxidation of Gram-negative A. ferrooxidans. Sulfur oxygenase reductase and heterodisulfide reductase were suggested to play important roles in the sulfur oxidation. Although the iron oxidation ability was observed, some key proteins cannot be identified in S. thermosulfidooxidans. Unexpectedly, a predicted sulfocyanin is proposed to transfer electrons in the iron oxidation. Furthermore, its carbon metabolism is rather flexible, can perform the transformation of pentose through the oxidative and non-oxidative pentose phosphate pathways and has the ability to take up small organic compounds. It encodes a multitude of heavy metal resistance systems to

  12. Comparative genome analysis reveals metabolic versatility and environmental adaptations of Sulfobacillus thermosulfidooxidans strain ST.

    PubMed

    Guo, Xue; Yin, Huaqun; Liang, Yili; Hu, Qi; Zhou, Xishu; Xiao, Yunhua; Ma, Liyuan; Zhang, Xian; Qiu, Guanzhou; Liu, Xueduan

    2014-01-01

    The genus Sulfobacillus is a cohort of mildly thermophilic or thermotolerant acidophiles within the phylum Firmicutes and requires extremely acidic environments and hypersalinity for optimal growth. However, our understanding of them is still preliminary partly because few genome sequences are available. Here, the draft genome of Sulfobacillus thermosulfidooxidans strain ST was deciphered to obtain a comprehensive insight into the genetic content and to understand the cellular mechanisms necessary for its survival. Furthermore, the expressions of key genes related with iron and sulfur oxidation were verified by semi-quantitative RT-PCR analysis. The draft genome sequence of Sulfobacillus thermosulfidooxidans strain ST, which encodes 3225 predicted coding genes on a total length of 3,333,554 bp and a 48.35% G+C, revealed the high degree of heterogeneity with other Sulfobacillus species. The presence of numerous transposases, genomic islands and complete CRISPR/Cas defence systems testifies to its dynamic evolution consistent with the genome heterogeneity. As expected, S. thermosulfidooxidans encodes a suit of conserved enzymes required for the oxidation of inorganic sulfur compounds (ISCs). The model of sulfur oxidation in S. thermosulfidooxidans was proposed, which showed some different characteristics from the sulfur oxidation of Gram-negative A. ferrooxidans. Sulfur oxygenase reductase and heterodisulfide reductase were suggested to play important roles in the sulfur oxidation. Although the iron oxidation ability was observed, some key proteins cannot be identified in S. thermosulfidooxidans. Unexpectedly, a predicted sulfocyanin is proposed to transfer electrons in the iron oxidation. Furthermore, its carbon metabolism is rather flexible, can perform the transformation of pentose through the oxidative and non-oxidative pentose phosphate pathways and has the ability to take up small organic compounds. It encodes a multitude of heavy metal resistance systems to

  13. Breakpoint profiling of 64 cancer genomes reveals numerous complex rearrangements spawned by homology-independent mechanisms

    PubMed Central

    Malhotra, Ankit; Lindberg, Michael; Faust, Gregory G.; Leibowitz, Mitchell L.; Clark, Royden A.; Layer, Ryan M.; Quinlan, Aaron R.; Hall, Ira M.

    2013-01-01

    Tumor genomes are generally thought to evolve through a gradual accumulation of mutations, but the observation that extraordinarily complex rearrangements can arise through single mutational events suggests that evolution may be accelerated by punctuated changes in genome architecture. To assess the prevalence and origins of complex genomic rearrangements (CGRs), we mapped 6179 somatic structural variation breakpoints in 64 cancer genomes from seven tumor types and screened for clusters of three or more interconnected breakpoints. We find that complex breakpoint clusters are extremely common: 154 clusters comprise 25% of all somatic breakpoints, and 75% of tumors exhibit at least one complex cluster. Based on copy number state profiling, 63% of breakpoint clusters are consistent with being CGRs that arose through a single mutational event. CGRs have diverse architectures including focal breakpoint clusters, large-scale rearrangements joining clusters from one or more chromosomes, and staggeringly complex chromothripsis events. Notably, chromothripsis has a significantly higher incidence in glioblastoma samples (39%) relative to other tumor types (9%). Chromothripsis breakpoints also show significantly elevated intra-tumor allele frequencies relative to simple SVs, which indicates that they arise early during tumorigenesis or confer selective advantage. Finally, assembly and analysis of 4002 somatic and 6982 germline breakpoint sequences reveal that somatic breakpoints show significantly less microhomology and fewer templated insertions than germline breakpoints, and this effect is stronger at CGRs than at simple variants. These results are inconsistent with replication-based models of CGR genesis and strongly argue that nonhomologous repair of concurrently arising DNA double-strand breaks is the predominant mechanism underlying complex cancer genome rearrangements. PMID:23410887

  14. Comparative Genomics of a Plant-Pathogenic Fungus, Pyrenophora tritici-repentis, Reveals Transduplication and the Impact of Repeat Elements on Pathogenicity and Population Divergence

    PubMed Central

    Manning, Viola A.; Pandelova, Iovanna; Dhillon, Braham; Wilhelm, Larry J.; Goodwin, Stephen B.; Berlin, Aaron M.; Figueroa, Melania; Freitag, Michael; Hane, James K.; Henrissat, Bernard; Holman, Wade H.; Kodira, Chinnappa D.; Martin, Joel; Oliver, Richard P.; Robbertse, Barbara; Schackwitz, Wendy; Schwartz, David C.; Spatafora, Joseph W.; Turgeon, B. Gillian; Yandava, Chandri; Young, Sarah; Zhou, Shiguo; Zeng, Qiandong; Grigoriev, Igor V.; Ma, Li-Jun; Ciuffetti, Lynda M.

    2013-01-01

    Pyrenophora tritici-repentis is a necrotrophic fungus causal to the disease tan spot of wheat, whose contribution to crop loss has increased significantly during the last few decades. Pathogenicity by this fungus is attributed to the production of host-selective toxins (HST), which are recognized by their host in a genotype-specific manner. To better understand the mechanisms that have led to the increase in disease incidence related to this pathogen, we sequenced the genomes of three P. tritici-repentis isolates. A pathogenic isolate that produces two known HSTs was used to assemble a reference nuclear genome of approximately 40 Mb composed of 11 chromosomes that encode 12,141 predicted genes. Comparison of the reference genome with those of a pathogenic isolate that produces a third HST, and a nonpathogenic isolate, showed the nonpathogen genome to be more diverged than those of the two pathogens. Examination of gene-coding regions has provided candidate pathogen-specific proteins and revealed gene families that may play a role in a necrotrophic lifestyle. Analysis of transposable elements suggests that their presence in the genome of pathogenic isolates contributes to the creation of novel genes, effector diversification, possible horizontal gene transfer events, identified copy number variation, and the first example of transduplication by DNA transposable elements in fungi. Overall, comparative analysis of these genomes provides evidence that pathogenicity in this species arose through an influx of transposable elements, which created a genetically flexible landscape that can easily respond to environmental changes. PMID:23316438

  15. Comparative Genomics of a Plant-Pathogenic Fungus, Pyrenophora tritici-repentis, Reveals Transduplication and the Impact of Repeat Elements on Pathogenicity and Population Divergence

    SciTech Connect

    Manning, Viola A.; Pandelova, Iovanna; Dhillon, Braham; Wilhelm, Larry J.; Goodwin, Stephen B.; Berlin, Aaron M.; Figueroa, Melania; Freitag, Michael; Hane, James K.; Henrissat, Bernard; Holman, Wade H.; Kodira, Chinnappa D.; Martin, Joel; Oliver, Richard P.; Robbertse, Barbara; Schackwitz, Wendy; Schwartz, David C.; Spatafora, Joseph W.; Turgeon, B. Gillian; Yandava, Chandri; Young, Sarah; Zhou, Shiguo; Zeng, Qiandong; Grigoriev, Igor V.; Ma, Li-Jun; Ciuffetti, Lynda M.

    2012-08-16

    Pyrenophora tritici-repentis is a necrotrophic fungus causal to the disease tan spot of wheat, whose contribution to crop loss has increased significantly during the last few decades. Pathogenicity by this fungus is attributed to the production of host-selective toxins (HST), which are recognized by their host in a genotype-specific manner. To better understand the mechanisms that have led to the increase in disease incidence related to this pathogen, we sequenced the genomes of three P. tritici-repentis isolates. A pathogenic isolate that produces two known HSTs was used to assemble a reference nuclear genome of approximately 40 Mb composed of 11 chromosomes that encode 12,141 predicted genes. Comparison of the reference genome with those of a pathogenic isolate that produces a third HST, and a nonpathogenic isolate, showed the nonpathogen genome to be more diverged than those of the two pathogens. Examination of gene-coding regions has provided candidate pathogen-specific proteins and revealed gene families that may play a role in a necrotrophic lifestyle. Analysis of transposable elements suggests that their presence in the genome of pathogenic isolates contributes to the creation of novel genes, effector diversification, possible horizontal gene transfer events, identified copy number variation, and the first example of transduplication by DNA transposable elements in fungi. Overall, comparative analysis of these genomes provides evidence that pathogenicity in this species arose through an influx of transposable elements, which created a genetically flexible landscape that can easily respond to environmental changes.

  16. In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae

    PubMed Central

    Macas, Jiří; Novák, Petr; Pellicer, Jaume; Čížková, Jana; Koblížková, Andrea; Neumann, Pavel; Fuková, Iva; Doležel, Jaroslav; Kelly, Laura J.; Leitch, Ilia J.

    2015-01-01

    The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55–83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes. PMID:26606051

  17. The genome of Pelobacter carbinolicus reveals surprising metabolic capabilities and physiological features

    PubMed Central

    2012-01-01

    Background The bacterium Pelobacter carbinolicus is able to grow by fermentation, syntrophic hydrogen/formate transfer, or electron transfer to sulfur from short-chain alcohols, hydrogen or formate; it does not oxidize acetate and is not known to ferment any sugars or grow autotrophically. The genome of P. carbinolicus was sequenced in order to understand its metabolic capabilities and physiological features in comparison with its relatives, acetate-oxidizing Geobacter species. Results Pathways were predicted for catabolism of known substrates: 2,3-butanediol, acetoin, glycerol, 1,2-ethanediol, ethanolamine, choline and ethanol. Multiple isozymes of 2,3-butanediol dehydrogenase, ATP synthase and [FeFe]-hydrogenase were differentiated and assigned roles according to their structural properties and genomic contexts. The absence of asparagine synthetase and the presence of a mutant tRNA for asparagine encoded among RNA-active enzymes suggest that P. carbinolicus may make asparaginyl-tRNA in a novel way. Catabolic glutamate dehydrogenases were discovered, implying that the tricarboxylic acid (TCA) cycle can function catabolically. A phosphotransferase system for uptake of sugars was discovered, along with enzymes that function in 2,3-butanediol production. Pyruvate:ferredoxin/flavodoxin oxidoreductase was identified as a potential bottleneck in both the supply of oxaloacetate for oxidation of acetate by the TCA cycle and the connection of glycolysis to production of ethanol. The P. carbinolicus genome was found to encode autotransporters and various appendages, including three proteins with similarity to the geopilin of electroconductive nanowires. Conclusions Several surprising metabolic capabilities and physiological features were predicted from the genome of P. carbinolicus, suggesting that it is more versatile than anticipated. PMID:23227809

  18. The genome of Pelobacter carbinolicus reveals surprising metabolic capabilities and physiological features

    SciTech Connect

    Aklujkar, Muktak; Haveman, Shelley; DiDonatoJr, Raymond; Chertkov, Olga; Han, Cliff; Land, Miriam L; Brown, Peter; Lovley, Derek

    2012-01-01

    Background: The bacterium Pelobacter carbinolicus is able to grow by fermentation, syntrophic hydrogen/formate transfer, or electron transfer to sulfur from short-chain alcohols, hydrogen or formate; it does not oxidize acetate and is not known to ferment any sugars or grow autotrophically. The genome of P. carbinolicus was sequenced in order to understand its metabolic capabilities and physiological features in comparison with its relatives, acetate-oxidizing Geobacter species. Results: Pathways were predicted for catabolism of known substrates: 2,3-butanediol, acetoin, glycerol, 1,2-ethanediol, ethanolamine, choline and ethanol. Multiple isozymes of 2,3-butanediol dehydrogenase, ATP synthase and [FeFe]-hydrogenase were differentiated and assigned roles according to their structural properties and genomic contexts. The absence of asparagine synthetase and the presence of a mutant tRNA for asparagine encoded among RNA-active enzymes suggest that P. carbinolicus may make asparaginyl-tRNA in a novel way. Catabolic glutamate dehydrogenases were discovered, implying that the tricarboxylic acid (TCA) cycle can function catabolically. A phosphotransferase system for uptake of sugars was discovered, along with enzymes that function in 2,3-butanediol production. Pyruvate: ferredoxin/flavodoxin oxidoreductase was identified as a potential bottleneck in both the supply of oxaloacetate for oxidation of acetate by the TCA cycle and the connection of glycolysis to production of ethanol. The P. carbinolicus genome was found to encode autotransporters and various appendages, including three proteins with similarity to the geopilin of electroconductive nanowires. Conclusions: Several surprising metabolic capabilities and physiological features were predicted from the genome of P. carbinolicus, suggesting that it is more versatile than anticipated.

  19. Characterization and Phylogenetic Analysis of the Mitochondrial Genome of Shiraia bambusicola Reveals Special Features in the Order of Pleosporales

    PubMed Central

    Shen, Xiao-Ye; Li, Tong; Chen, Shuang; Fan, Li; Gao, Jian; Hou, Cheng-Lin

    2015-01-01

    Shiraia bambusicola P. Henn. is a pathogenic fungus of bamboo, and its fruiting bodies are regarded as folk medicine. We determined and analyzed its complete mitochondrial DNA sequence (circular DNA molecule of 39,030 bp, G + C content of 25.19%). It contains the typical genes encoding proteins involved in electron transport and coupled oxidative phosphorylation (nad1-6 and nad4L, cob and cox1-3), one ATP synthase subunit (atp6), 4 hypothetical proteins, and two genes for large and small rRNAs (rnl and rns). There is a set of 32 tRNA genes comprising all 20 amino acids, and these genes are evenly distributed on the two strands. Phylogenetic analyses based on concatenated mitochondrial proteins indicated that S. bambusicola clustered with members of the order Pleosporales, which is in agreement with previous results. The gene arrangements of Dothideomycetes species contained three regions of gene orders partitioned in their mitochondrial genomes, including block 1 (nad6-atp6), block 2 (nad1-cox3) and block 3 (genes around rns). S. bambusicola displayed unique special features that differed from the other Pleosporales species, especially in the coding regions around rns (trnR-trnY). Moreover, a comparison of gene orders in mitochondrial genomes from Pezizomycotina revealed that although all encoded regions are located on the same strand in most Pezizomycotina mtDNAs, genes from Dothideomycetes species had different orientations, as well as diverse positions and colocalization of genes (such as cox3, cox1-cox2 and nad2–nad3); these distinctions were regarded as class-specific features. Interestingly, two incomplete copies of the atp6 gene were found on different strands of the mitogenomic DNA, a finding that has not been observed in the other analyzed fungal species. In our study, mitochondrial genomes from Dothideomycetes species were comprehensively analyzed for the first time, including many species that have not appeared in previous reports. PMID:25790308

  20. A Quantitative Profiling Tool for Diverse Genomic Data Types Reveals Potential Associations between Chromatin and Pre-mRNA Processing.

    PubMed

    Kremsky, Isaac; Bellora, Nicolás; Eyras, Eduardo

    2015-01-01

    High-throughput sequencing, and genome-based datasets in general, are often represented as profiles centered at reference points to study the association of protein binding and other signals to particular regulatory mechanisms. Although these profiles often provide compelling evidence of these associations, they do not provide a quantitative assessment of the enrichment, which makes the comparison between signals and conditions difficult. In addition, a number of biases can confound profiles, but are rarely accounted for in the tools currently available. We present a novel computational method, ProfileSeq, for the quantitative assessment of biological profiles to provide an exact, nonparametric test that specific regions of the test profile have higher or lower signal densities than a control set. The method is applicable to high-throughput sequencing data (ChIP-Seq, GRO-Seq, CLIP-Seq, etc.) and to genome-based datasets (motifs, etc.). We validate ProfileSeq by recovering and providing a quantitative assessment of several results reported before in the literature using independent datasets. We show that input signal and mappability have confounding effects on the profile results, but that normalizing the signal by input reads can eliminate these biases while preserving the biological signal. Moreover, we apply ProfileSeq to ChIP-Seq data for transcription factors, as well as for motif and CLIP-Seq data for splicing factors. In all examples considered, the profiles were robust to biases in mappability of sequencing reads. Furthermore, analyses performed with ProfileSeq reveal a number of putative relationships between transcription factor binding to DNA and splicing factor binding to pre-mRNA, adding to the growing body of evidence relating chromatin and pre-mRNA processing. ProfileSeq provides a robust way to quantify genome-wide coordinate-based signal. Software and documentation are freely available for academic use at https://bitbucket.org/regulatorygenomicsupf/profileseq/.

  1. RNA Profiles of Porcine Embryos during Genome Activation Reveal Complex Metabolic Switch Sensitive to In Vitro Conditions

    PubMed Central

    Østrup, Olga; Olbricht, Gayla; Østrup, Esben; Hyttel, Poul; Collas, Philippe; Cabot, Ryan

    2013-01-01

    Fertilization is followed by complex changes in cytoplasmic composition and extensive chromatin reprogramming which results in the abundant activation of totipotent embryonic genome at embryonic genome activation (EGA). While chromatin reprogramming has been widely studied in several species, only a handful of reports characterize changing transcriptome profiles and resulting metabolic changes in cleavage stage embryos. The aims of the current study were to investigate RNA profiles of in vivo developed (ivv) and in vitro produced (ivt) porcine embryos before (2-cell stage) and after (late 4-cell stage) EGA and determine major metabolic changes that regulate totipotency. The period before EGA was dominated by transcripts responsible for cell cycle regulation, mitosis, RNA translation and processing (including ribosomal machinery), protein catabolism, and chromatin remodelling. Following EGA an increase in the abundance of transcripts involved in transcription, translation, DNA metabolism, histone and chromatin modification, as well as protein catabolism was detected. The further analysis of members of overlapping GO terms revealed that despite that comparable cellular processes are taking place before and after EGA (RNA splicing, protein catabolism), different metabolic pathways are involved. This strongly suggests that a complex metabolic switch accompanies EGA. In vitro conditions significantly altered RNA profiles before EGA, and the character of these changes indicates that they originate from oocyte and are imposed either before oocyte aspiration or during in vitro maturation. IVT embryos have altered content of apoptotic factors, cell cycle regulation factors and spindle components, and transcription factors, which all may contribute to reduced developmental competence of embryos produced in vitro. Overall, our data are in good accordance with previously published, genome-wide profiling data in other species. Moreover, comparison with mouse and human embryos

  2. Comparative genomics of four closely related Clostridium perfringens bacteriophages reveals variable rates of evolution within a core genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Biotechnological uses of bacteriophage gene products as alternatives to conventional antibiotics will require a thorough understanding of their genomic context. We sequenced and analyzed the genomes of four closely related phages isolated from Clostridium perfringens, an important agricu...

  3. Scanning the landscape of genome architecture of non-O1 and non-O139 Vibrio cholerae by whole genome mapping reveals extensive population genetic diversity.

    PubMed

    Chapman, Carol; Henry, Matthew; Bishop-Lilly, Kimberly A; Awosika, Joy; Briska, Adam; Ptashkin, Ryan N; Wagner, Trevor; Rajanna, Chythanya; Tsang, Hsinyi; Johnson, Shannon L; Mokashi, Vishwesh P; Chain, Patrick S G; Sozhamannan, Shanmuga

    2015-01-01

    Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, ordered restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks.

  4. Scanning the landscape of genome architecture of non-O1 and non-O139 Vibrio cholerae by whole genome mapping reveals extensive population genetic diversity

    DOE PAGES

    Chapman, Carol; Henry, Matthew; Bishop-Lilly, Kimberly A.; ...

    2015-03-20

    Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haiti cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera and the lack of genomic sequences of such strains from around the world. Here we address these gaps by scanning a global collection of V. cholerae strains as a first step towards understanding the population genetic diversity and epidemic potential of non O1/non-O139 strains. Whole Genome Mapping (Optical Mapping) based bar coding produces a high resolution, orderedmore » restriction map, depicting a complete view of the unique chromosomal architecture of an organism. To assess the genomic diversity of non-O1/non-O139 V. cholerae, we applied a Whole Genome Mapping strategy on a well-defined and geographically and temporally diverse strain collection, the Sakazaki serogroup type strains. Whole Genome Map data on 91 of the 206 serogroup type strains support the hypothesis that V. cholerae has an unprecedented genetic and genomic structural diversity. Interestingly, we discovered chromosomal fusions in two unusual strains that possess a single chromosome instead of the two chromosomes usually found in V. cholerae. We also found pervasive chromosomal rearrangements such as duplications and indels in many strains. The majority of Vibrio genome sequences currently in public databases are unfinished draft sequences. The Whole Genome Mapping approach presented here enables rapid screening of large strain collections to capture genomic complexities that would not have been otherwise revealed by unfinished draft genome sequencing and thus aids in assembling and finishing draft sequences of complex genomes. Furthermore, Whole Genome Mapping allows for prediction of novel V. cholerae non-O1/non-O139 strains that may have the potential to cause future cholera outbreaks.« less

  5. Comparison of eight methods of genomic DNA extraction from babassu.

    PubMed

    Viana, J P G; Borges, A N C; Lopes, A C A; Gomes, R L F; Britto, F B; Lima, P S C; Valente, S E S

    2015-12-22

    Babassu (Orbignya phalerata Martius) is one of the most important palms in Brazil because of the largest morphological variation, wide geographic distribution, and high socio-economic importance. The diversity present in babassu germplasm should be protected against loss to ensure their use with high productivity. Study of the available variability in populations of babassu is necessary to develop conservation strategies. The study of genetic variability can be conducted using molecular markers and many of these studies require significant quantity of high-quality DNA. The present study aimed to effect comparison among eight DNA extraction methods in case of O. phalerata. The quality and concentration of nucleic acids were analyzed by spectrophotometry and integrity of DNA was ascertained by agarose gel electrophoresis. The spectrophotometry revealed that some methods resulted in high levels of concentration of nucleic acids, in which values of the ratio A260/280 and A260/230 were outside the range of purity. The agarose gel electrophoresis established the concentration and integrity of DNA. The methods of Murray and Thompson (1980) and Ferreira and Grattapaglia (1998) did not result in satisfactory quantities of DNA. Conversely, the method proposed by Khanuja et al. (1999) resulted in DNA of adequate quality and quantity that could be satisfactorily used for amplification reactions performed with two ISSR primers.

  6. Representational difference analysis reveals genomic differences between Q. robur and Q. suber: implications for the study of genome evolution in the genus Quercus.

    PubMed

    Zoldos, V; Siljak-Yakovlev, S; Papes, D; Sarr, A; Panaud, O

    2001-04-01

    Very similar genome sizes, similar karyotypes and heterochromatin organisation, and identical number/position of ribosomal loci characterise the common oak (Q. robur) and the cork oak (Q. suber), two distantly related oak species. Representational Difference Analysis (RDA) was used to subtract the genome of Q. suber from the genome of Q. robur in order to search for genome differentiation. A library of 400 clones (bearing RDA fragments) representing genome differences between the two species was obtained. Seven Q. robur-specific DNA sequences were analysed with respect to their molecular and chromosome organisation. All belong to the dispersed repetitive component of the genome, as revealed by Southern hybridisation and in situ hybridisation. They are present in the Q. robur genome in between 100 and 700 copies, and are distributed along the length of almost all chromosomes. A search for homologies between RDA fragments and sequences in Genbank revealed similarities of all RDA fragments with known retrotransposons. The RDA fragments were also tested for their presence/absence in the genomes of six additional oak species belonging to different phylogenetic groups, in order to examine the evolutionary dynamics of these DNA sequences.

  7. Comparison of larval and adult Drosophila astrocytes reveals stage-specific gene expression profiles.

    PubMed

    Huang, Yanmei; Ng, Fanny S; Jackson, F Rob

    2015-02-04

    The analysis of adult astrocyte glial cells has revealed a remarkable heterogeneity with regard to morphology, molecular signature, and physiology. A key question in glial biology is how such heterogeneity arises during brain development. One approach to this question is to identify genes with differential astrocyte expression during development; certain genes expressed later in neural development may contribute to astrocyte differentiation. We have utilized the Drosophila model and Translating Ribosome Affinity Purification (TRAP)-RNA-seq methods to derive the genome-wide expression profile of Drosophila larval astrocyte-like cells (hereafter referred to as astrocytes) for the first time. These studies identified hundreds of larval astrocyte-enriched genes that encode proteins important for metabolism, energy production, and protein synthesis, consistent with the known role of astrocytes in the metabolic support of neurons. Comparison of the larval profile with that observed for adults has identified genes with astrocyte-enriched expression specific to adulthood. These include genes important for metabolism and energy production, translation, chromatin modification, protein glycosylation, neuropeptide signaling, immune responses, vesicle-mediated trafficking or secretion, and the regulation of behavior. Among these functional classes, the expression of genes important for chromatin modification and vesicle-mediated trafficking or secretion is overrepresented in adult astrocytes based on Gene Ontology analysis. Certain genes with selective adult enrichment may mediate functions specific to this stage or may be important for the differentiation or maintenance of adult astrocytes, with the latter perhaps contributing to population heterogeneity.

  8. New Implications on Genomic Adaptation Derived from the Helicobacter pylori Genome Comparison

    PubMed Central

    Lara-Ramírez, Edgar Eduardo; Segura-Cabrera, Aldo; Guo, Xianwu; Yu, Gongxin; García-Pérez, Carlos Armando; Rodríguez-Pérez, Mario A.

    2011-01-01

    Background Helicobacter pylori has a reduced genome and lives in a tough environment for long-term persistence. It evolved with its particular characteristics for biological adaptation. Because several H. pylori genome sequences are available, comparative analysis could help to better understand genomic adaptation of this particular bacterium. Principal Findings We analyzed nine H. pylori genomes with emphasis on microevolution from a different perspective. Inversion was an important factor to shape the genome structure. Illegitimate recombination not only led to genomic inversion but also inverted fragment duplication, both of which contributed to the creation of new genes and gene family, and further, homological recombination contributed to events of inversion. Based on the information of genomic rearrangement, the first genome scaffold structure of H. pylori last common ancestor was produced. The core genome consists of 1186 genes, of which 22 genes could particularly adapt to human stomach niche. H. pylori contains high proportion of pseudogenes whose genesis was principally caused by homopolynucleotide (HPN) mutations. Such mutations are reversible and facilitate the control of gene expression through the change of DNA structure. The reversible mutations and a quasi-panmictic feature could allow such genes or gene fragments frequently transferred within or between populations. Hence, pseudogenes could be a reservoir of adaptation materials and the HPN mutations could be favorable to H. pylori adaptation, leading to HPN accumulation on the genomes, which corresponds to a special feature of Helicobacter species: extremely high HPN composition of genome. Conclusion Our research demonstrated that both genome content and structure of H. pylori have been highly adapted to its particular life style. PMID:21387011

  9. The smallest known genomes of multicellular and toxic cyanobacteria: comparison, minimal gene sets for linked traits and the evolutionary implications.

    PubMed

    Stucken, Karina; John, Uwe; Cembella, Allan; Murillo, Alejandro A; Soto-Liebe, Katia; Fuentes-Valdés, Juan J; Friedel, Maik; Plominsky, Alvaro M; Vásquez, Mónica; Glöckner, Gernot

    2010-02-16

    Cyanobacterial morphology is diverse, ranging from unicellular spheres or rods to multicellular structures such as colonies and filaments. Multicellular species represent an evolutionary strategy to differentiate and compartmentalize certain metabolic functions for reproduction and nitrogen (N(2)) fixation into specialized cell types (e.g. akinetes, heterocysts and diazocytes). Only a few filamentous, differentiated cyanobacterial species, with genome sizes over 5 Mb, have been sequenced. We sequenced the genomes of two strains of closely related filamentous cyanobacterial species to yield further insights into the molecular basis of the traits of N(2) fixation, filament formation and cell differentiation. Cylindrospermopsis raciborskii CS-505 is a cylindrospermopsin-producing strain from Australia, whereas Raphidiopsis brookii D9 from Brazil synthesizes neurotoxins associated with paralytic shellfish poisoning (PSP). Despite their different morphology, toxin composition and disjunct geographical distribution, these strains form a monophyletic group. With genome sizes of approximately 3.9 (CS-505) and 3.2 (D9) Mb, these are the smallest genomes described for free-living filamentous cyanobacteria. We observed remarkable gene order conservation (synteny) between these genomes despite the difference in repetitive element content, which accounts for most of the genome size difference between them. We show here that the strains share a specific set of 2539 genes with >90% average nucleotide identity. The fact that the CS-505 and D9 genomes are small and streamlined compared to those of other filamentous cyanobacterial species and the lack of the ability for heterocyst formation in strain D9 allowed us to define a core set of genes responsible for each trait in filamentous species. We presume that in strain D9 the ability to form proper heterocysts was secondarily lost together with N(2) fixation capacity. Further comparisons to all available cyanobacterial genomes

  10. The Smallest Known Genomes of Multicellular and Toxic Cyanobacteria: Comparison, Minimal Gene Sets for Linked Traits and the Evolutionary Implications

    PubMed Central

    Stucken, Karina; John, Uwe; Cembella, Allan; Murillo, Alejandro A.; Soto-Liebe, Katia; Fuentes-Valdés, Juan J.; Friedel, Maik; Plominsky, Alvaro M.; Vásquez, Mónica; Glöckner, Gernot

    2010-01-01

    Cyanobacterial morphology is diverse, ranging from unicellular spheres or rods to multicellular structures such as colonies and filaments. Multicellular species represent an evolutionary strategy to differentiate and compartmentalize certain metabolic functions for reproduction and nitrogen (N2) fixation into specialized cell types (e.g. akinetes, heterocysts and diazocytes). Only a few filamentous, differentiated cyanobacterial species, with genome sizes over 5 Mb, have been sequenced. We sequenced the genomes of two strains of closely related filamentous cyanobacterial species to yield further insights into the molecular basis of the traits of N2 fixation, filament formation and cell differentiation. Cylindrospermopsis raciborskii CS-505 is a cylindrospermopsin-producing strain from Australia, whereas Raphidiopsis brookii D9 from Brazil synthesizes neurotoxins associated with paralytic shellfish poisoning (PSP). Despite their different morphology, toxin composition and disjunct geographical distribution, these strains form a monophyletic group. With genome sizes of approximately 3.9 (CS-505) and 3.2 (D9) Mb, these are the smallest genomes described for free-living filamentous cyanobacteria. We observed remarkable gene order conservation (synteny) between these genomes despite the difference in repetitive element content, which accounts for most of the genome size difference between them. We show here that the strains share a specific set of 2539 genes with >90% average nucleotide identity. The fact that the CS-505 and D9 genomes are small and streamlined compared to those of other filamentous cyanobacterial species and the lack of the ability for heterocyst formation in strain D9 allowed us to define a core set of genes responsible for each trait in filamentous species. We presume that in strain D9 the ability to form proper heterocysts was secondarily lost together with N2 fixation capacity. Further comparisons to all available cyanobacterial genomes covering

  11. Genome-wide sequencing data reveals virulence factors implicated in banana Xanthomonas wilt.

    PubMed

    Studholme, David J; Kemen, Eric; MacLean, Daniel; Schornack, Sebastian; Aritua, Valente; Thwaites, Richard; Grant, Murray; Smith, Julian; Jones, Jonathan D G

    2010-09-01

    Banana Xanthomonas wilt is a newly emerging disease that is currently threatening the livelihoods of millions of farmers in East Africa. The causative agent is Xanthomonas campestris pathovar musacearum (Xcm), but previous work suggests that this pathogen is much more closely related to species Xanthomonas vasicola than to X. campestris. We have generated draft genome sequences for a banana-pathogenic strain of Xcm isolated in Uganda and for a very closely related strain of X. vasicola pathovar vasculorum, originally isolated from sugarcane, that is nonpathogenic on banana. The draft sequences revealed overlapping but distinct repertoires of candidate virulence effectors in the two strains. Both strains encode homologues of the Pseudomonas syringae effectors HopW, HopAF1 and RipT from Ralstonia solanacearum. The banana-pathogenic and non-banana-pathogenic strains also differed with respect to lipopolysaccharide synthesis and type-IV pili, and in at least several thousand single-nucleotide polymorphisms in the core conserved genome. We found evidence of horizontal transfer between X. vasicola and very distantly related bacteria, including members of other divisions of the Proteobacteria. The availability of these draft genomes will be an invaluable tool for further studies aimed at understanding and combating this important disease.

  12. Infectious diseases of marine molluscs and host responses as revealed by genomic tools

    PubMed Central

    Ford, Susan E.

    2016-01-01

    More and more infectious diseases affect marine molluscs. Some diseases have impacted commercial species including MSX and Dermo of the eastern oyster, QPX of hard clams, withering syndrome of abalone and ostreid herpesvirus 1 (OsHV-1) infections of many molluscs. Although the exact transmission mechanisms are not well understood, human activities and associated environmental changes often correlate with increased disease prevalence. For instance, hatcheries and large-scale aquaculture create high host densities, which, along with increasing ocean temperature, might have contributed to OsHV-1 epizootics in scallops and oysters. A key to understanding linkages between the environment and disease is to understand how the environment affects the host immune system. Although we might be tempted to downplay the role of immunity in invertebrates, recent advances in genomics have provided insights into host and parasite genomes and revealed surprisingly sophisticated innate immune systems in molluscs. All major innate immune pathways are found in molluscs with many immune receptors, regulators and effectors expanded. The expanded gene families provide great diversity and complexity in innate immune response, which may be key to mollusc's defence against diverse pathogens in the absence of adaptive immunity. Further advances in host and parasite genomics should improve our understanding of genetic variation in parasite virulence and host disease resistance. PMID:26880838

  13. Comparative Genomic Analysis Reveals Organization, Function and Evolution of ars Genes in Pantoea spp.

    PubMed Central

    Wang, Liying; Wang, Jin; Jing, Chuanyong

    2017-01-01

    Numerous genes are involved in various strategies to resist toxic arsenic (As). However, the As resistance strategy in genus Pantoea is poorly understood. In this study, a comparative genome analysis of 23 Pantoea genomes was conducted. Two vertical genetic arsC-like genes without any contribution to As resistance were found to exist in the 23 Pantoea strains. Besides the two arsC-like genes, As resistance gene clusters arsRBC or arsRBCH were found in 15 Pantoea genomes. These ars clusters were found to be acquired by horizontal gene transfer (HGT) from sources related to Franconibacter helveticus, Serratia marcescens, and Citrobacter freundii. During the history of evolution, the ars clusters were acquired more than once in some species, and were lost in some strains, producing strains without As resistance capability. This study revealed the organization, distribution and the complex evolutionary history of As resistance genes in Pantoea spp.. The insights gained in this study improved our understanding on the As resistance strategy of Pantoea spp. and its roles in the biogeochemical cycling of As. PMID:28377759

  14. Determinants of spontaneous mutation in the bacterium Escherichia coli as revealed by whole-genome sequencing

    PubMed Central

    Foster, Patricia L.; Lee, Heewook; Popodi, Ellen; Townes, Jesse P.; Tang, Haixu

    2015-01-01

    A complete understanding of evolutionary processes requires that factors determining spontaneous mutation rates and spectra be identified and characterized. Using mutation accumulation followed by whole-genome sequencing, we found that the mutation rates of three widely diverged commensal Escherichia coli strains differ only by about 50%, suggesting that a rate of 1–2 × 10−3 mutations per generation per genome is common for this bacterium. Four major forces are postulated to contribute to spontaneous mutations: intrinsic DNA polymerase errors, endogenously induced DNA damage, DNA damage caused by exogenous agents, and the activities of error-prone polymerases. To determine the relative importance of these factors, we studied 11 strains, each defective for a major DNA repair pathway. The striking result was that only loss of the ability to prevent or repair oxidative DNA damage significantly impacted mutation rates or spectra. These results suggest that, with the exception of oxidative damage, endogenously induced DNA damage does not perturb the overall accuracy of DNA replication in normally growing cells and that repair pathways may exist primarily to defend against exogenously induced DNA damage. The thousands of mutations caused by oxidative damage recovered across the entire genome revealed strong local-sequence biases of these mutations. Specifically, we found that the identity of the 3′ base can affect the mutability of a purine by oxidative damage by as much as eightfold. PMID:26460006

  15. The genome sequencing of an albino Western lowland gorilla reveals inbreeding in the wild

    PubMed Central

    2013-01-01

    Background The only known albino gorilla, named Snowflake, was a male wild born individual from Equatorial Guinea who lived at the Barcelona Zoo for almost 40 years. He was diagnosed with non-syndromic oculocutaneous albinism, i.e. white hair, light eyes, pink skin, photophobia and reduced visual acuity. Despite previous efforts to explain the genetic cause, this is still unknown. Here, we study the genetic cause of his albinism and making use of whole genome sequencing data we find a higher inbreeding coefficient compared to other gorillas. Results We successfully identified the causal genetic variant for Snowflake’s albinism, a non-synonymous single nucleotide variant located in a transmembrane region of SLC45A2. This transporter is known to be involved in oculocutaneous albinism type 4 (OCA4) in humans. We provide experimental evidence that shows that this amino acid replacement alters the membrane spanning capability of this transmembrane region. Finally, we provide a comprehensive study of genome-wide patterns of autozygogosity revealing that Snowflake’s parents were related, being this the first report of inbreeding in a wild born Western lowland gorilla. Conclusions In this study we demonstrate how the use of whole genome sequencing can be extended to link genotype and phenotype in non-model organisms and it can be a powerful tool in conservation genetics (e.g., inbreeding and genetic diversity) with the expected decrease in sequencing cost. PMID:23721540

  16. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    SciTech Connect

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMcahon, Katherine D.; Mamlstrom, Rex R.

    2014-05-12

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ecotype model? of diversification, but not previously observed in natural populations.

  17. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    SciTech Connect

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMahon, Katherine D.; Malmstrom, Rex R.

    2014-06-18

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ‘ecotype model’ of diversification, but not previously observed in natural populations.

  18. Infectious diseases of marine molluscs and host responses as revealed by genomic tools.

    PubMed

    Guo, Ximing; Ford, Susan E

    2016-03-05

    More and more infectious diseases affect marine molluscs. Some diseases have impacted commercial species including MSX and Dermo of the eastern oyster, QPX of hard clams, withering syndrome of abalone and ostreid herpesvirus 1 (OsHV-1) infections of many molluscs. Although the exact transmission mechanisms are not well understood, human activities and associated environmental changes often correlate with increased disease prevalence. For instance, hatcheries and large-scale aquaculture create high host densities, which, along with increasing ocean temperature, might have contributed to OsHV-1 epizootics in scallops and oysters. A key to understanding linkages between the environment and disease is to understand how the environment affects the host immune system. Although we might be tempted to downplay the role of immunity in invertebrates, recent advances in genomics have provided insights into host and parasite genomes and revealed surprisingly sophisticated innate immune systems in molluscs. All major innate immune pathways are found in molluscs with many immune receptors, regulators and effectors expanded. The expanded gene families provide great diversity and complexity in innate immune response, which may be key to mollusc's defence against diverse pathogens in the absence of adaptive immunity. Further advances in host and parasite genomics should improve our understanding of genetic variation in parasite virulence and host disease resistance.

  19. Nitrosopumilus maritimus genome reveals unique mechanisms for nitrification and autotrophy in globally distributed marine crenarchaea

    PubMed Central

    Walker, C. B.; de la Torre, J. R.; Klotz, M. G.; Urakawa, H.; Pinel, N.; Arp, D. J.; Brochier-Armanet, C.; Chain, P. S. G.; Chan, P. P.; Gollabgir, A.; Hemp, J.; Hügler, M.; Karr, E. A.; Könneke, M.; Lawton, T. J.; Lowe, T.; Martens-Habbena, W.; Sayavedra-Soto, L. A.; Lang, D.; Sievert, S. M.; Rosenzweig, A. C.; Manning, G.; Stahl, D. A.

    2010-01-01

    Ammonia-oxidizing archaea are ubiquitous in marine and terrestrial environments and now thought to be significant contributors to carbon and nitrogen cycling. The isolation of Candidatus “Nitrosopumilus maritimus” strain SCM1 provided the opportunity for linking its chemolithotrophic physiology with a genomic inventory of the globally distributed archaea. Here we report the 1,645,259-bp closed genome of strain SCM1, revealing highly copper-dependent systems for ammonia oxidation and electron transport that are distinctly different from known ammonia-oxidizing bacteria. Consistent with in situ isotopic studies of marine archaea, the genome sequence indicates N. maritimus grows autotrophically using a variant of the 3-hydroxypropionate/4-hydroxybutryrate pathway for carbon assimilation, while maintaining limited capacity for assimilation of organic carbon. This unique instance of archaeal biosynthesis of the osmoprotectant ectoine and an unprecedented enrichment of multicopper oxidases, thioredoxin-like proteins, and transcriptional regulators points to an organism responsive to environmental cues and adapted to handling reactive copper and nitrogen species that likely derive from its distinctive biochemistry. The conservation of N. maritimus gene content and organization within marine metagenomes indicates that the unique physiology of these specialized oligophiles may play a significant role in the biogeochemical cycles of carbon and nitrogen. PMID:20421470

  20. The draft genome of Tibetan hulless barley reveals adaptive patterns to the high stressful Tibetan Plateau

    PubMed Central

    Zeng, Xingquan; Long, Hai; Wang, Zhuo; Zhao, Shancen; Tang, Yawei; Huang, Zhiyong; Wang, Yulin; Xu, Qijun; Mao, Likai; Deng, Guangbing; Yao, Xiaoming; Li, Xiangfeng; Bai, Lijun; Yuan, Hongjun; Pan, Zhifen; Liu, Renjian; Chen, Xin; WangMu, QiMei; Chen, Ming; Yu, Lili; Liang, Junjun; DunZhu, DaWa; Zheng, Yuan; Yu, Shuiyang; LuoBu, ZhaXi; Guang, Xuanmin; Li, Jiang; Deng, Cao; Hu, Wushu; Chen, Chunhai; TaBa, XiongNu; Gao, Liyun; Lv, Xiaodan; Abu, Yuval Ben; Fang, Xiaodong; Nevo, Eviatar; Yu, Maoqun; Wang, Jun; Tashi, Nyima

    2015-01-01

    The Tibetan hulless barley (Hordeum vulgare L. var. nudum), also called “Qingke” in Chinese and “Ne” in Tibetan, is the staple food for Tibetans and an important livestock feed in the Tibetan Plateau. The diploid nature and adaptation to diverse environments of the highland give it unique resources for genetic research and crop improvement. Here we produced a 3.89-Gb draft assembly of Tibetan hulless barley with 36,151 predicted protein-coding genes. Comparative analyses revealed the divergence times and synteny between barley and other representative Poaceae genomes. The expansion of the gene family related to stress responses was found in Tibetan hulless barley. Resequencing of 10 barley accessions uncovered high levels of genetic variation in Tibetan wild barley and genetic divergence between Tibetan and non-Tibetan barley genomes. Selective sweep analyses demonstrate adaptive correlations of genes under selection with extensive environmental variables. Our results not only construct a genomic framework for crop improvement but also provide evolutionary insights of highland adaptation of Tibetan hulless barley. PMID:25583503

  1. Genome scan for nonadditive heterotic trait loci reveals mainly underdominant effects in Saccharomyces cerevisiae.

    PubMed

    Laiba, Efrat; Glikaite, Ilana; Levy, Yael; Pasternak, Zohar; Fridman, Eyal

    2016-04-01

    The overdominant model of heterosis explains the superior phenotype of hybrids by synergistic allelic interaction within heterozygous loci. To map such genetic variation in yeast, we used a population doubling time dataset of Saccharomyces cerevisiae 16 × 16 diallel and searched for major contributing heterotic trait loci (HTL). Heterosis was observed for the majority of hybrids, as they surpassed their best parent growth rate. However, most of the local heterozygous loci identified by genome scan were surprisingly underdominant, i.e., reduced growth. We speculated that in these loci adverse effects on growth resulted from incompatible allelic interactions. To test this assumption, we eliminated these allelic interactions by creating hybrids with local hemizygosity for the underdominant HTLs, as well as for control random loci. Growth of hybrids was indeed elevated for most hemizygous to HTL genes but not for control genes, hence validating the results of our genome scan. Assessing the consequences of local heterozygosity by reciprocal hemizygosity and allele replacement assays revealed the influence of genetic background on the underdominant effects of HTLs. Overall, this genome-wide study on a multi-parental hybrid population provides a strong argument against single gene overdominance as a major contributor to heterosis, and favors the dominance complementation model.

  2. The king cobra genome reveals dynamic gene evolution and adaptation in the snake venom system.

    PubMed

    Vonk, Freek J; Casewell, Nicholas R; Henkel, Christiaan V; Heimberg, Alysha M; Jansen, Hans J; McCleary, Ryan J R; Kerkkamp, Harald M E; Vos, Rutger A; Guerreiro, Isabel; Calvete, Juan J; Wüster, Wolfgang; Woods, Anthony E; Logan, Jessica M; Harrison, Robert A; Castoe, Todd A; de Koning, A P Jason; Pollock, David D; Yandell, Mark; Calderon, Diego; Renjifo, Camila; Currier, Rachel B; Salgado, David; Pla, Davinia; Sanz, Libia; Hyder, Asad S; Ribeiro, José M C; Arntzen, Jan W; van den Thillart, Guido E E J M; Boetzer, Marten; Pirovano, Walter; Dirks, Ron P; Spaink, Herman P; Duboule, Denis; McGlinn, Edwina; Kini, R Manjunatha; Richardson, Michael K

    2013-12-17

    Snakes are limbless predators, and many species use venom to help overpower relatively large, agile prey. Snake venoms are complex protein mixtures encoded by several multilocus gene families that function synergistically to cause incapacitation. To examine venom evolution, we sequenced and interrogated the genome of a venomous snake, the king cobra (Ophiophagus hannah), and compared it, together with our unique transcriptome, microRNA, and proteome datasets from this species, with data from other vertebrates. In contrast to the platypus, the only other venomous vertebrate with a sequenced genome, we find that snake toxin genes evolve through several distinct co-option mechanisms and exhibit surprisingly variable levels of gene duplication and directional selection that correlate with their functional importance in prey capture. The enigmatic accessory venom gland shows a very different pattern of toxin gene expression from the main venom gland and seems to have recruited toxin-like lectin genes repeatedly for new nontoxic functions. In addition, tissue-specific microRNA analyses suggested the co-option of core genetic regulatory components of the venom secretory system from a pancreatic origin. Although the king cobra is limbless, we recovered coding sequences for all Hox genes involved in amniote limb development, with the exception of Hoxd12. Our results provide a unique view of the origin and evolution of snake venom and reveal multiple genome-level adaptive responses to natural selection in this complex biological weapon system. More generally, they provide insight into mechanisms of protein evolution under strong selection.

  3. Structural genomics reveals EVE as a new ASCH/PUA-related domain.

    PubMed

    Bertonati, Claudia; Punta, Marco; Fischer, Markus; Yachdav, Guy; Forouhar, Farhad; Zhou, Weihong; Kuzin, Alexander P; Seetharaman, Jayaraman; Abashidze, Mariam; Ramelot, Theresa A; Kennedy, Michael A; Cort, John R; Belachew, Adam; Hunt, John F; Tong, Liang; Montelione, Gaetano T; Rost, Burkhard

    2009-05-15

    We report on several proteins recently solved by structural genomics consortia, in particular by the Northeast Structural Genomics consortium (NESG). The proteins considered in this study differ substantially in their sequences but they share a similar structural core, characterized by a pseudobarrel five-stranded beta sheet. This core corresponds to the PUA domain-like architecture in the SCOP database. By connecting sequence information with structural knowledge, we characterize a new subgroup of these proteins that we propose to be distinctly different from previously described PUA domain-like domains such as PUA proper or ASCH. We refer to these newly defined domains as EVE. Although EVE may have retained the ability of PUA domains to bind RNA, the available experimental and computational data suggests that both the details of its molecular function and its cellular function differ from those of other PUA domain-like domains. This study of EVE and its relatives illustrates how the combination of structure and genomics creates new insights by connecting a cornucopia of structures that map to the same evolutionary potential. Primary sequence information alone would have not been sufficient to reveal these evolutionary links.

  4. The genome of the seagrass Zostera marina reveals angiosperm adaptation to the sea.

    PubMed

    Olsen, Jeanine L; Rouzé, Pierre; Verhelst, Bram; Lin, Yao-Cheng; Bayer, Till; Collen, Jonas; Dattolo, Emanuela; De Paoli, Emanuele; Dittami, Simon; Maumus, Florian; Michel, Gurvan; Kersting, Anna; Lauritano, Chiara; Lohaus, Rolf; Töpel, Mats; Tonon, Thierry; Vanneste, Kevin; Amirebrahimi, Mojgan; Brakel, Janina; Boström, Christoffer; Chovatia, Mansi; Grimwood, Jane; Jenkins, Jerry W; Jueterbock, Alexander; Mraz, Amy; Stam, Wytze T; Tice, Hope; Bornberg-Bauer, Erich; Green, Pamela J; Pearson, Gareth A; Procaccini, Gabriele; Duarte, Carlos M; Schmutz, Jeremy; Reusch, Thorsten B H; Van de Peer, Yves

    2016-02-18

    Seagrasses colonized the sea on at least three independent occasions to form the basis of one of the most productive and widespread coastal ecosystems on the planet. Here we report the genome of Zostera marina (L.), the first, to our knowledge, marine angiosperm to be fully sequenced. This reveals unique insights into the genomic losses and gains involved in achieving the structural and physiological adaptations required for its marine lifestyle, arguably the most severe habitat shift ever accomplished by flowering plants. Key angiosperm innovations that were lost include the entire repertoire of stomatal genes, genes involved in the synthesis of terpenoids and ethylene signalling, and genes for ultraviolet protection and phytochromes for far-red sensing. Seagrasses have also regained functions enabling them to adjust to full salinity. Their cell walls contain all of the polysaccharides typical of land plants, but also contain polyanionic, low-methylated pectins and sulfated galactans, a feature shared with the cell walls of all macroalgae and that is important for ion homoeostasis, nutrient uptake and O2/CO2 exchange through leaf epidermal cells. The Z. marina genome resource will markedly advance a wide range of functional ecological studies from adaptation of marine ecosystems under climate warming, to unravelling the mechanisms of osmoregulation under high salinities that may further inform our understanding of the evolution of salt tolerance in crop plants.

  5. Peltaster fructicola genome reveals evolution from an invasive phytopathogen to an ectophytic parasite

    PubMed Central

    Xu, Chao; Chen, Huan; Gleason, Mark L.; Xu, Jin-Rong; Liu, Huiquan; Zhang, Rong; Sun, Guangyu

    2016-01-01

    Sooty blotch and flyspeck (SBFS) fungi are unconventional plant pathogens that cause economic losses by blemishing the surface appearance of infected fruit. Here, we introduce the 18.14-Mb genome of Peltaster fructicola, one of the most prevalent SBFS species on apple. This undersized assembly contains only 8,334 predicted protein-coding genes and a very small repertoire of repetitive elements. Phylogenomics and comparative genomics revealed that P. fructicola had undergone a reductive evolution, during which the numbers of orphan genes and genes involved in plant cell wall degradation, secondary metabolism, and secreted peptidases and effectors were drastically reduced. In contrast, the genes controlling 1,8-dihydroxynaphthalene (DHN)-melanin biosynthesis and appressorium-mediated penetration were retained substantially. Additionally, microscopic examination of the surfaces of infected apple indicated for the first time that P. fructicola can not only dissolve epicuticular waxes but also partially penetrate the cuticle proper. Our findings indicate that genome contraction, characterized mainly by the massive loss of pathogenicity-related genes, has played an important role in the evolution of P. fructicola (and by implication other SBFS species) from a plant-penetrating ancestor to a non-invasive ectophyte, displaying a novel form of trophic interaction between plants and fungi. PMID:26964666

  6. Phylogeny of a Genomically Diverse Group of Elymus (Poaceae) Allopolyploids Reveals Multiple Levels of Reticulation

    PubMed Central

    Mason-Gamer, Roberta J.

    2013-01-01

    The grass tribe Triticeae (=Hordeeae) comprises only about 300 species, but it is well known for the economically important crop plants wheat, barley, and rye. The group is also recognized as a fascinating example of evolutionary complexity, with a history shaped by numerous events of auto- and allopolyploidy and apparent introgression involving diploids and polyploids. The genus Elymus comprises a heterogeneous collection of allopolyploid genome combinations, all of which include at least one set of homoeologs, designated St, derived from Pseudoroegneria. The current analysis includes a geographically and genomically diverse collection of 21 tetraploid Elymus species, and a single hexaploid species. Diploid and polyploid relationships were estimated using four molecular data sets, including one that combines two regions of the chloroplast genome, and three from unlinked nuclear genes: phosphoenolpyruvate carboxylase, β-amylase, and granule-bound starch synthase I. Four gene trees were generated using maximum likelihood, and the phylogenetic placement of the polyploid sequences reveals extensive reticulation beyond allopolyploidy alone. The trees were interpreted with reference to numerous phenomena known to complicate allopolyploid phylogenies, and introgression was identified as a major factor in their history. The work illustrates the interpretation of complicated phylogenetic results through the sequential consideration of numerous possible explanations, and the results highlight the value of careful inspection of multiple independent molecular phylogenetic estimates, with particular focus on the differences among them. PMID:24302986

  7. Nearly finished genomes produced using gel microdroplet culturing reveal substantial intraspecies genomic diversity within the human microbiome.

    PubMed

    Fitzsimons, Michael S; Novotny, Mark; Lo, Chien-Chi; Dichosa, Armand E K; Yee-Greenbaum, Joyclyn L; Snook, Jeremy P; Gu, Wei; Chertkov, Olga; Davenport, Karen W; McMurry, Kim; Reitenga, Krista G; Daughton, Ashlynn R; He, Jian; Johnson, Shannon L; Gleasner, Cheryl D; Wills, Patti L; Parson-Quintana, Beverly; Chain, Patrick S; Detter, John C; Lasken, Roger S; Han, Cliff S

    2013-05-01

    The majority of microbial genomic diversity remains unexplored. This is largely due to our inability to culture most microorganisms in isolation, which is a prerequisite for traditional genome sequencing. Single-cell sequencing has allowed researchers to circumvent this limitation. DNA is amplified directly from a single cell using the whole-genome amplification technique of multiple displacement amplification (MDA). However, MDA from a single chromosome copy suffers from amplification bias and a large loss of specificity from even very small amounts of DNA contamination, which makes assembling a genome difficult and completely finishing a genome impossible except in extraordinary circumstances. Gel microdrop cultivation allows culturing of a diverse microbial community and provides hundreds to thousands of genetically identical cells as input for an MDA reaction. We demonstrate the utility of this approach by comparing sequencing results of gel microdroplets and single cells following MDA. Bias is reduced in the MDA reaction and genome sequencing, and assembly is greatly improved when using gel microdroplets. We acquired multiple near-complete genomes for two bacterial species from human oral and stool microbiome samples. A significant amount of genome diversity, including single nucleotide polymorphisms and genome recombination, is discovered. Gel microdroplets offer a powerful and high-throughput technology for assembling whole genomes from complex samples and for probing the pan-genome of naturally occurring populations.

  8. Genome-based metabolic mapping and 13C flux analysis reveal systematic properties of an oleaginous microalga Chlorella protothecoides.

    PubMed

    Wu, Chao; Xiong, Wei; Dai, Junbiao; Wu, Qingyu

    2015-02-01

    Integrated and genome-based flux balance analysis, metabolomics, and (13)C-label profiling of phototrophic and heterotrophic metabolism in Chlorella protothecoides, an oleaginous green alga for biofuel. The green alga Chlorella protothecoides, capable of autotrophic and heterotrophic growth with rapid lipid synthesis, is a promising candidate for biofuel production. Based on the newly available genome knowledge of the alga, we reconstructed the compartmentalized metabolic network consisting of 272 metabolic reactions, 270 enzymes, and 461 encoding genes and simulated the growth in different cultivation conditions with flux balance analysis. Phenotype-phase plane analysis shows conditions achieving theoretical maximum of the biomass and corresponding fatty acid-producing rate for phototrophic cells (the ratio of photon uptake rate to CO2 uptake rate equals 8.4) and heterotrophic ones (the glucose uptake rate to O2 consumption rate reaches 2.4), respectively. Isotope-assisted liquid chromatography-mass spectrometry/mass spectrometry reveals higher metabolite concentrations in the glycolytic pathway and the tricarboxylic acid cycle in heterotrophic cells compared with autotrophic cells. We also observed enhanced levels of ATP, nicotinamide adenine dinucleotide (phosphate), reduced, acetyl-Coenzyme A, and malonyl-Coenzyme A in heterotrophic cells consistently, consistent with a strong activity of lipid synthesis. To profile the flux map in experimental conditions, we applied nonstationary (13)C metabolic flux analysis as a complementing strategy to flux balance analysis. The result reveals negligible photorespiratory fluxes and a metabolically low active tricarboxylic acid cycle in phototrophic C. protothecoides. In comparison, high throughput of amphibolic reactions and the tricarboxylic acid cycle with no glyoxylate shunt activities were measured for heterotrophic cells. Taken together, the metabolic network modeling assisted by experimental metabolomics and (13)C

  9. Genome-Based Metabolic Mapping and 13C Flux Analysis Reveal Systematic Properties of an Oleaginous Microalga Chlorella protothecoides

    DOE PAGES

    Wu, Chao; Xiong, Wei; Dai, Junbiao; ...

    2014-12-15

    We report that integrated and genome-based flux balance analysis, metabolomics, and 13C-label profiling of phototrophic and heterotrophic metabolism in Chlorella protothecoides, an oleaginous green alga for biofuel. The green alga Chlorella protothecoides, capable of autotrophic and heterotrophic growth with rapid lipid synthesis, is a promising candidate for biofuel production. Based on the newly available genome knowledge of the alga, we reconstructed the compartmentalized metabolic network consisting of 272 metabolic reactions, 270 enzymes, and 461 encoding genes and simulated the growth in different cultivation conditions with flux balance analysis. Phenotype-phase plane analysis shows conditions achieving theoretical maximum of the biomass andmore » corresponding fatty acid-producing rate for phototrophic cells (the ratio of photon uptake rate to CO2 uptake rate equals 8.4) and heterotrophic ones (the glucose uptake rate to O2 consumption rate reaches 2.4), respectively. Isotope-assisted liquid chromatography-mass spectrometry/mass spectrometry reveals higher metabolite concentrations in the glycolytic pathway and the tricarboxylic acid cycle in heterotrophic cells compared with autotrophic cells. We also observed enhanced levels of ATP, nicotinamide adenine dinucleotide (phosphate), reduced, acetyl-Coenzyme A, and malonyl-Coenzyme A in heterotrophic cells consistently, consistent with a strong activity of lipid synthesis. To profile the flux map in experimental conditions, we applied nonstationary 13C metabolic flux analysis as a complementing strategy to flux balance analysis. We found that the result reveals negligible photorespiratory fluxes and a metabolically low active tricarboxylic acid cycle in phototrophic C. protothecoides. In comparison, high throughput of amphibolic reactions and the tricarboxylic acid cycle with no glyoxylate shunt activities were measured for heterotrophic cells. Lastly, taken together, the metabolic network modeling assisted

  10. Biosynthesis of Antibiotic Leucinostatins in Bio-control Fungus Purpureocillium lilacinum and Their Inhibition on Phytophthora Revealed by Genome Mining

    PubMed Central

    Li, Erfeng; Mao, Zhenchuan; Ling, Jian; Yang, Yuhong; Yin, Wen-Bing; Xie, Bingyan

    2016-01-01

    Purpureocillium lilacinum of Ophiocordycipitaceae is one of the most promising and commercialized agents for controlling plant parasitic nematodes, as well as other insects and plant pathogens. However, how the fungus functions at the molecular level remains unknown. Here, we sequenced two isolates (PLBJ-1 and PLFJ-1) of P. lilacinum from different places Beijing and Fujian. Genomic analysis showed high synteny of the two isolates, and the phylogenetic analysis indicated they were most related to the insect pathogen Tolypocladium inflatum. A comparison with other species revealed that this fungus was enriched in carbohydrate-active enzymes (CAZymes), proteases and pathogenesis related genes. Whole genome search revealed a rich repertoire of secondary metabolites (SMs) encoding genes. The non-ribosomal peptide synthetase LcsA, which is comprised of ten C-A-PCP modules, was identified as the core biosynthetic gene of lipopeptide leucinostatins, which was specific to P. lilacinum and T. ophioglossoides, as confirmed by phylogenetic analysis. Furthermore, gene expression level was analyzed when PLBJ-1 was grown in leucinostatin-inducing and non-inducing medium, and 20 genes involved in the biosynthesis of leucionostatins were identified. Disruption mutants allowed us to propose a putative biosynthetic pathway of leucinostatin A. Moreover, overexpression of the transcription factor lcsF increased the production (1.5-fold) of leucinostatins A and B compared to wild type. Bioassays explored a new bioactivity of leucinostatins and P. lilacinum: inhibiting the growth of Phytophthora infestans and P. capsici. These results contribute to our understanding of the biosynthetic mechanism of leucinostatins and may allow us to utilize P. lilacinum better as bio-control agent. PMID:27416025

  11. Polyploid genome of Camelina sativa revealed by isolation of fatty acid synthesis genes

    PubMed Central

    2010-01-01

    Background Camelina sativa, an oilseed crop in the Brassicaceae family, has inspired renewed interest due to its potential for biofuels applications. Little is understood of the nature of the C. sativa genome, however. A study was undertaken to characterize two genes in the fatty acid biosynthesis pathway, fatty acid desaturase (FAD) 2 and fatty acid elongase (FAE) 1, which revealed unexpected complexity in the C. sativa genome. Results In C. sativa, Southern analysis indicates the presence of three copies of both FAD2 and FAE1 as well as LFY, a known single copy gene in other species. All three copies of both CsFAD2 and CsFAE1 are expressed in developing seeds, and sequence alignments show that previously described conserved sites are present, suggesting that all three copies of both genes could be functional. The regions downstream of CsFAD2 and upstream of CsFAE1 demonstrate co-linearity with the Arabidopsis genome. In addition, three expressed haplotypes were observed for six predicted single-copy genes in 454 sequencing analysis and results from flow cytometry indicate that the DNA content of C. sativa is approximately three-fold that of diploid Camelina relatives. Phylogenetic analyses further support a history of duplication and indicate that C. sativa and C. microcarpa might share a parental genome. Conclusions There is compelling evidence for triplication of the C. sativa genome, including a larger chromosome number and three-fold larger measured genome size than other Camelina relatives, three isolated copies of FAD2, FAE1, and the KCS17-FAE1 intergenic region, and three expressed haplotypes observed for six predicted single-copy genes. Based on these results, we propose that C. sativa be considered an allohexaploid. The characterization of fatty acid synthesis pathway genes will allow for the future manipulation of oil composition of this emerging biofuel crop; however, targeted manipulations of oil composition and general development of C. sativa should

  12. Comparative genomic analysis reveals 2-oxoacid dehydrogenase complex lipoylation correlation with aerobiosis in archaea.

    PubMed

    Borziak, Kirill; Posner, Mareike G; Upadhyay, Abhishek; Danson, Michael J; Bagby, Stefan; Dorus, Steve

    2014-01-01

    , the extension of comparative genomic pathway profiling to broader metabolic and homeostasis networks should be useful in revealing characteristics from metagenomic datasets related to adaptations to diverse environments.

  13. Single-cell genomics reveal low recombination frequencies in freshwater bacteria of the SAR11 clade

    PubMed Central

    2013-01-01

    Background The SAR11 group of Alphaproteobacteria is highly abundant in the oceans. It contains a recently diverged freshwater clade, which offers the opportunity to compare adaptations to salt- and freshwaters in a monophyletic bacterial group. However, there are no cultivated members of the freshwater SAR11 group and no genomes have been sequenced yet. Results We isolated ten single SAR11 cells from three freshwater lakes and sequenced and assembled their genomes. A phylogeny based on 57 proteins indicates that the cells are organized into distinct microclusters. We show that the freshwater genomes have evolved primarily by the accumulation of nucleotide substitutions and that they have among the lowest ratio of recombination to mutation estimated for bacteria. In contrast, members of the marine SAR11 clade have one of the highest ratios. Additional metagenome reads from six lakes confirm low recombination frequencies for the genome overall and reveal lake-specific variations in microcluster abundances. We identify hypervariable regions with gene contents broadly similar to those in the hypervariable regions of the marine isolates, containing genes putatively coding for cell surface molecules. Conclusions We conclude that recombination rates differ dramatically in phylogenetic sister groups of the SAR11 clade adapted to freshwater and marine ecosystems. The results suggest that the transition from marine to freshwater systems has purged diversity and resulted in reduced opportunities for recombination with divergent members of the clade. The low recombination frequencies of the LD12 clade resemble the low genetic divergence of host-restricted pathogens that have recently shifted to a new host. PMID:24286338

  14. Genome Analysis of Two Pseudonocardia Phylotypes Associated with Acromyrmex Leafcutter Ants Reveals Their Biosynthetic Potential.

    PubMed

    Holmes, Neil A; Innocent, Tabitha M; Heine, Daniel; Bassam, Mahmoud Al; Worsley, Sarah F; Trottmann, Felix; Patrick, Elaine H; Yu, Douglas W; Murrell, J C; Schiøtt, Morten; Wilkinson, Barrie; Boomsma, Jacobus J; Hutchings, Matthew I

    2016-01-01

    The attine ants of South and Central America are ancient farmers, having evolved a symbiosis with a fungal food crop >50 million years ago. The most evolutionarily derived attines are the Atta and Acromyrmex leafcutter ants, which harvest fresh leaves to feed their fungus. Acromyrmex and many other attines vertically transmit a mutualistic strain of Pseudonocardia and use antifungal compounds made by these bacteria to protect their fungal partner against co-evolved fungal pathogens of the genus Escovopsis. Pseudonocardia mutualists associated with the attines Apterostigma dentigerum and Trachymyrmex cornetzi make novel cyclic depsipeptide compounds called gerumycins, while a mutualist strain isolated from derived Acromyrmex octospinosus makes an unusual polyene antifungal called nystatin P1. The novelty of these antimicrobials suggests there is merit in exploring secondary metabolites of Pseudonocardia on a genome-wide scale. Here, we report a genomic analysis of the Pseudonocardia phylotypes Ps1 and Ps2 that are consistently associated with Acromyrmex ants collected in Gamboa, Panama. These were previously distinguished solely on the basis of 16S rRNA gene sequencing but genome sequencing of five Ps1 and five Ps2 strains revealed that the phylotypes are distinct species and each encodes between 11 and 15 secondary metabolite biosynthetic gene clusters (BGCs). There are signature BGCs for Ps1 and Ps2 strains and some that are conserved in both. Ps1 strains all contain BGCs encoding nystatin P1-like antifungals, while the Ps2 strains encode novel nystatin-like molecules. Strains show variations in the arrangement of these BGCs that resemble those seen in gerumycin gene clusters. Genome analyses and invasion assays support our hypothesis that vertically transmitted Ps1 and Ps2 strains have antibacterial activity that could help shape the cuticular microbiome. Thus, our work defines the Pseudonocardia species associated with Acromyrmex ants and supports the hypothesis

  15. Genome Analysis of Two Pseudonocardia Phylotypes Associated with Acromyrmex Leafcutter Ants Reveals Their Biosynthetic Potential

    PubMed Central

    Holmes, Neil A.; Innocent, Tabitha M.; Heine, Daniel; Bassam, Mahmoud Al; Worsley, Sarah F.; Trottmann, Felix; Patrick, Elaine H.; Yu, Douglas W.; Murrell, J. C.; Schiøtt, Morten; Wilkinson, Barrie; Boomsma, Jacobus J.; Hutchings, Matthew I.

    2016-01-01

    The attine ants of South and Central America are ancient farmers, having evolved a symbiosis with a fungal food crop >50 million years ago. The most evolutionarily derived attines are the Atta and Acromyrmex leafcutter ants, which harvest fresh leaves to feed their fungus. Acromyrmex and many other attines vertically transmit a mutualistic strain of Pseudonocardia and use antifungal compounds made by these bacteria to protect their fungal partner against co-evolved fungal pathogens of the genus Escovopsis. Pseudonocardia mutualists associated with the attines Apterostigma dentigerum and Trachymyrmex cornetzi make novel cyclic depsipeptide compounds called gerumycins, while a mutualist strain isolated from derived Acromyrmex octospinosus makes an unusual polyene antifungal called nystatin P1. The novelty of these antimicrobials suggests there is merit in exploring secondary metabolites of Pseudonocardia on a genome-wide scale. Here, we report a genomic analysis of the Pseudonocardia phylotypes Ps1 and Ps2 that are consistently associated with Acromyrmex ants collected in Gamboa, Panama. These were previously distinguished solely on the basis of 16S rRNA gene sequencing but genome sequencing of five Ps1 and five Ps2 strains revealed that the phylotypes are distinct species and each encodes between 11 and 15 secondary metabolite biosynthetic gene clusters (BGCs). There are signature BGCs for Ps1 and Ps2 strains and some that are conserved in both. Ps1 strains all contain BGCs encoding nystatin P1-like antifungals, while the Ps2 strains encode novel nystatin-like molecules. Strains show variations in the arrangement of these BGCs that resemble those seen in gerumycin gene clusters. Genome analyses and invasion assays support our hypothesis that vertically trans