Science.gov

Sample records for bacterial genome scale

  1. In the fast lane: large-scale bacterial genome engineering.

    PubMed

    Fehér, Tamás; Burland, Valerie; Pósfai, György

    2012-07-31

    The last few years have witnessed rapid progress in bacterial genome engineering. The long-established, standard ways of DNA synthesis, modification, transfer into living cells, and incorporation into genomes have given way to more effective, large-scale, robust genome modification protocols. Expansion of these engineering capabilities is due to several factors. Key advances include: (i) progress in oligonucleotide synthesis and in vitro and in vivo assembly methods, (ii) optimization of recombineering techniques, (iii) introduction of parallel, large-scale, combinatorial, and automated genome modification procedures, and (iv) rapid identification of the modifications by barcode-based analysis and sequencing. Combination of the brute force of these techniques with sophisticated bioinformatic design and modeling opens up new avenues for the analysis of gene functions and cellular network interactions, but also in engineering more effective producer strains. This review presents a summary of recent technological advances in bacterial genome engineering.

  2. Genome-scale models of bacterial metabolism: reconstruction and applications

    PubMed Central

    Durot, Maxime; Bourguignon, Pierre-Yves; Schachter, Vincent

    2009-01-01

    Genome-scale metabolic models bridge the gap between genome-derived biochemical information and metabolic phenotypes in a principled manner, providing a solid interpretative framework for experimental data related to metabolic states, and enabling simple in silico experiments with whole-cell metabolism. Models have been reconstructed for almost 20 bacterial species, so far mainly through expert curation efforts integrating information from the literature with genome annotation. A wide variety of computational methods exploiting metabolic models have been developed and applied to bacteria, yielding valuable insights into bacterial metabolism and evolution, and providing a sound basis for computer-assisted design in metabolic engineering. Recent advances in computational systems biology and high-throughput experimental technologies pave the way for the systematic reconstruction of metabolic models from genomes of new species, and a corresponding expansion of the scope of their applications. In this review, we provide an introduction to the key ideas of metabolic modeling, survey the methods, and resources that enable model reconstruction and refinement, and chart applications to the investigation of global properties of metabolic systems, the interpretation of experimental results, and the re-engineering of their biochemical capabilities. PMID:19067749

  3. Bacterial Genome Instability

    PubMed Central

    Darmon, Elise

    2014-01-01

    SUMMARY Bacterial genomes are remarkably stable from one generation to the next but are plastic on an evolutionary time scale, substantially shaped by horizontal gene transfer, genome rearrangement, and the activities of mobile DNA elements. This implies the existence of a delicate balance between the maintenance of genome stability and the tolerance of genome instability. In this review, we describe the specialized genetic elements and the endogenous processes that contribute to genome instability. We then discuss the consequences of genome instability at the physiological level, where cells have harnessed instability to mediate phase and antigenic variation, and at the evolutionary level, where horizontal gene transfer has played an important role. Indeed, this ability to share DNA sequences has played a major part in the evolution of life on Earth. The evolutionary plasticity of bacterial genomes, coupled with the vast numbers of bacteria on the planet, substantially limits our ability to control disease. PMID:24600039

  4. Genome scale patterns of supercoiling in a bacterial chromosome

    PubMed Central

    Lal, Avantika; Dhar, Amlanjyoti; Trostel, Andrei; Kouzine, Fedor; Seshasayee, Aswin S. N.; Adhya, Sankar

    2016-01-01

    DNA in bacterial cells primarily exists in a negatively supercoiled state. The extent of supercoiling differs between regions of the chromosome, changes in response to external conditions and regulates gene expression. Here we report the use of trimethylpsoralen intercalation to map the extent of supercoiling across the Escherichia coli chromosome during exponential and stationary growth phases. We find that stationary phase E. coli cells display a gradient of negative supercoiling, with the terminus being more negatively supercoiled than the origin of replication, and that such a gradient is absent in exponentially growing cells. This stationary phase pattern is correlated with the binding of the nucleoid-associated protein HU, and we show that it is lost in an HU deletion strain. We suggest that HU establishes higher supercoiling near the terminus of the chromosome during stationary phase, whereas during exponential growth DNA gyrase and/or transcription equalizes supercoiling across the chromosome. PMID:27025941

  5. Targeted Large-Scale Deletion of Bacterial Genomes Using CRISPR-Nickases.

    PubMed

    Standage-Beier, Kylie; Zhang, Qi; Wang, Xiao

    2015-11-20

    Programmable CRISPR-Cas systems have augmented our ability to produce precise genome manipulations. Here we demonstrate and characterize the ability of CRISPR-Cas derived nickases to direct targeted recombination of both small and large genomic regions flanked by repetitive elements in Escherichia coli. While CRISPR directed double-stranded DNA breaks are highly lethal in many bacteria, we show that CRISPR-guided nickase systems can be programmed to make precise, nonlethal, single-stranded incisions in targeted genomic regions. This induces recombination events and leads to targeted deletion. We demonstrate that dual-targeted nicking enables deletion of 36 and 97 Kb of the genome. Furthermore, multiplex targeting enables deletion of 133 Kb, accounting for approximately 3% of the entire E. coli genome. This technology provides a framework for methods to manipulate bacterial genomes using CRISPR-nickase systems. We envision this system working synergistically with preexisting bacterial genome engineering methods.

  6. Biofilm Formation Mechanisms of Pseudomonas aeruginosa Predicted via Genome-Scale Kinetic Models of Bacterial Metabolism

    PubMed Central

    Vital-Lopez, Francisco G.; Reifman, Jaques; Wallqvist, Anders

    2015-01-01

    A hallmark of Pseudomonas aeruginosa is its ability to establish biofilm-based infections that are difficult to eradicate. Biofilms are less susceptible to host inflammatory and immune responses and have higher antibiotic tolerance than free-living planktonic cells. Developing treatments against biofilms requires an understanding of bacterial biofilm-specific physiological traits. Research efforts have started to elucidate the intricate mechanisms underlying biofilm development. However, many aspects of these mechanisms are still poorly understood. Here, we addressed questions regarding biofilm metabolism using a genome-scale kinetic model of the P. aeruginosa metabolic network and gene expression profiles. Specifically, we computed metabolite concentration differences between known mutants with altered biofilm formation and the wild-type strain to predict drug targets against P. aeruginosa biofilms. We also simulated the altered metabolism driven by gene expression changes between biofilm and stationary growth-phase planktonic cultures. Our analysis suggests that the synthesis of important biofilm-related molecules, such as the quorum-sensing molecule Pseudomonas quinolone signal and the exopolysaccharide Psl, is regulated not only through the expression of genes in their own synthesis pathway, but also through the biofilm-specific expression of genes in pathways competing for precursors to these molecules. Finally, we investigated why mutants defective in anthranilate degradation have an impaired ability to form biofilms. Alternative to a previous hypothesis that this biofilm reduction is caused by a decrease in energy production, we proposed that the dysregulation of the synthesis of secondary metabolites derived from anthranilate and chorismate is what impaired the biofilms of these mutants. Notably, these insights generated through our kinetic model-based approach are not accessible from previous constraint-based model analyses of P. aeruginosa biofilm

  7. Biofilm Formation Mechanisms of Pseudomonas aeruginosa Predicted via Genome-Scale Kinetic Models of Bacterial Metabolism.

    PubMed

    Vital-Lopez, Francisco G; Reifman, Jaques; Wallqvist, Anders

    2015-10-01

    A hallmark of Pseudomonas aeruginosa is its ability to establish biofilm-based infections that are difficult to eradicate. Biofilms are less susceptible to host inflammatory and immune responses and have higher antibiotic tolerance than free-living planktonic cells. Developing treatments against biofilms requires an understanding of bacterial biofilm-specific physiological traits. Research efforts have started to elucidate the intricate mechanisms underlying biofilm development. However, many aspects of these mechanisms are still poorly understood. Here, we addressed questions regarding biofilm metabolism using a genome-scale kinetic model of the P. aeruginosa metabolic network and gene expression profiles. Specifically, we computed metabolite concentration differences between known mutants with altered biofilm formation and the wild-type strain to predict drug targets against P. aeruginosa biofilms. We also simulated the altered metabolism driven by gene expression changes between biofilm and stationary growth-phase planktonic cultures. Our analysis suggests that the synthesis of important biofilm-related molecules, such as the quorum-sensing molecule Pseudomonas quinolone signal and the exopolysaccharide Psl, is regulated not only through the expression of genes in their own synthesis pathway, but also through the biofilm-specific expression of genes in pathways competing for precursors to these molecules. Finally, we investigated why mutants defective in anthranilate degradation have an impaired ability to form biofilms. Alternative to a previous hypothesis that this biofilm reduction is caused by a decrease in energy production, we proposed that the dysregulation of the synthesis of secondary metabolites derived from anthranilate and chorismate is what impaired the biofilms of these mutants. Notably, these insights generated through our kinetic model-based approach are not accessible from previous constraint-based model analyses of P. aeruginosa biofilm

  8. Bacterial genome annotation.

    PubMed

    Beckloff, Nicholas; Starkenburg, Shawn; Freitas, Tracey; Chain, Patrick

    2012-01-01

    Annotation of prokaryotic sequences can be separated into structural and functional annotation. Structural annotation is dependent on algorithmic interrogation of experimental evidence to discover the physical characteristics of a gene. This is done in an effort to construct accurate gene models, so understanding function or evolution of genes among organisms is not impeded. Functional annotation is dependent on sequence similarity to other known genes or proteins in an effort to assess the function of the gene. Combining structural and functional annotation across genomes in a comparative manner promotes higher levels of accurate annotation as well as an advanced understanding of genome evolution. As the availability of bacterial sequences increases and annotation methods improve, the value of comparative annotation will increase.

  9. The large-scale blast score ratio (LS-BSR) pipeline: a method to rapidly compare genetic content between bacterial genomes.

    PubMed

    Sahl, Jason W; Caporaso, J Gregory; Rasko, David A; Keim, Paul

    2014-01-01

    Background. As whole genome sequence data from bacterial isolates becomes cheaper to generate, computational methods are needed to correlate sequence data with biological observations. Here we present the large-scale BLAST score ratio (LS-BSR) pipeline, which rapidly compares the genetic content of hundreds to thousands of bacterial genomes, and returns a matrix that describes the relatedness of all coding sequences (CDSs) in all genomes surveyed. This matrix can be easily parsed in order to identify genetic relationships between bacterial genomes. Although pipelines have been published that group peptides by sequence similarity, no other software performs the rapid, large-scale, full-genome comparative analyses carried out by LS-BSR. Results. To demonstrate the utility of the method, the LS-BSR pipeline was tested on 96 Escherichia coli and Shigella genomes; the pipeline ran in 163 min using 16 processors, which is a greater than 7-fold speedup compared to using a single processor. The BSR values for each CDS, which indicate a relative level of relatedness, were then mapped to each genome on an independent core genome single nucleotide polymorphism (SNP) based phylogeny. Comparisons were then used to identify clade specific CDS markers and validate the LS-BSR pipeline based on molecular markers that delineate between classical E. coli pathogenic variant (pathovar) designations. Scalability tests demonstrated that the LS-BSR pipeline can process 1,000 E. coli genomes in 27-57 h, depending upon the alignment method, using 16 processors. Conclusions. LS-BSR is an open-source, parallel implementation of the BSR algorithm, enabling rapid comparison of the genetic content of large numbers of genomes. The results of the pipeline can be used to identify specific markers between user-defined phylogenetic groups, and to identify the loss and/or acquisition of genetic information between bacterial isolates. Taxa-specific genetic markers can then be translated into clinical

  10. Large scale genomic analysis shows no evidence for pathogen adaptation between the blood and cerebrospinal fluid niches during bacterial meningitis

    PubMed Central

    Lees, John A.; Kremer, Philip H. C.; Manso, Ana S.; Croucher, Nicholas J.; Ferwerda, Bart; Serón, Mercedes Valls; Oggioni, Marco R.; Parkhill, Julian; Brouwer, Matthijs C.; van der Ende, Arie; van de Beek, Diederik

    2017-01-01

    Recent studies have provided evidence for rapid pathogen genome diversification, some of which could potentially affect the course of disease. We have previously described such variation seen between isolates infecting the blood and cerebrospinal fluid (CSF) of a single patient during a case of bacterial meningitis. Here, we performed whole-genome sequencing of paired isolates from the blood and CSF of 869 meningitis patients to determine whether such variation frequently occurs between these two niches in cases of bacterial meningitis. Using a combination of reference-free variant calling approaches, we show that no genetic adaptation occurs in either invaded niche during bacterial meningitis for two major pathogen species, Streptococcus pneumoniae and Neisseria meningitidis. This study therefore shows that the bacteria capable of causing meningitis are already able to do this upon entering the blood, and no further sequence change is necessary to cross the blood–brain barrier. Our findings place the focus back on bacterial evolution between nasopharyngeal carriage and invasion, or diversity of the host, as likely mechanisms for determining invasiveness. PMID:28348877

  11. Insights from twenty years of bacterial genome sequencing

    SciTech Connect

    Land, Miriam L; Hauser, Loren John; Jun, Se Ran; Nookaew, Intawat; Leuze, Michael Rex; Ahn, Tae-Hyuk; Karpinets, Tatiana V; Lund, Ole; Kora, Guruprasad H; Wassenaar, Trudy; Poudel, Suresh; Ussery, David W

    2015-01-01

    Since the first two complete bacterial genome sequences were published in 1995, the science of bacteria has dramatically changed. Using third-generation DNA sequencing, it is possible to completely sequence a bacterial genome in a few hours and identify some types of methylation sites along the genome as well. Sequencing of bacterial genome sequences is now a standard procedure, and the information from tens of thousands of bacterial genomes has had a major impact on our views of the bacterial world. In this review, we explore a series of questions to highlight some insights that comparative genomics has produced. To date, there are genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. However, the distribution is quite skewed towards a few phyla that contain model organisms. But the breadth is continuing to improve, with projects dedicated to filling in less characterized taxonomic groups. The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system provides bacteria with immunity against viruses, which outnumber bacteria by tenfold. How fast can we go? Second-generation sequencing has produced a large number of draft genomes (close to 90 % of bacterial genomes in GenBank are currently not complete); third-generation sequencing can potentially produce a finished genome in a few hours, and at the same time provide methlylation sites along the entire chromosome. The diversity of bacterial communities is extensive as is evident from the genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. Genome sequencing can help in classifying an organism, and in the case where multiple genomes of the same species are available, it is possible to calculate the pan- and core genomes; comparison of more than 2000 Escherichia coli genomes finds an E. coli core genome of about 3100 gene families and a total of about 89,000 different gene families. Why do we care about bacterial genome

  12. Insights from 20 years of bacterial genome sequencing.

    PubMed

    Land, Miriam; Hauser, Loren; Jun, Se-Ran; Nookaew, Intawat; Leuze, Michael R; Ahn, Tae-Hyuk; Karpinets, Tatiana; Lund, Ole; Kora, Guruprased; Wassenaar, Trudy; Poudel, Suresh; Ussery, David W

    2015-03-01

    Since the first two complete bacterial genome sequences were published in 1995, the science of bacteria has dramatically changed. Using third-generation DNA sequencing, it is possible to completely sequence a bacterial genome in a few hours and identify some types of methylation sites along the genome as well. Sequencing of bacterial genome sequences is now a standard procedure, and the information from tens of thousands of bacterial genomes has had a major impact on our views of the bacterial world. In this review, we explore a series of questions to highlight some insights that comparative genomics has produced. To date, there are genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. However, the distribution is quite skewed towards a few phyla that contain model organisms. But the breadth is continuing to improve, with projects dedicated to filling in less characterized taxonomic groups. The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system provides bacteria with immunity against viruses, which outnumber bacteria by tenfold. How fast can we go? Second-generation sequencing has produced a large number of draft genomes (close to 90 % of bacterial genomes in GenBank are currently not complete); third-generation sequencing can potentially produce a finished genome in a few hours, and at the same time provide methlylation sites along the entire chromosome. The diversity of bacterial communities is extensive as is evident from the genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. Genome sequencing can help in classifying an organism, and in the case where multiple genomes of the same species are available, it is possible to calculate the pan- and core genomes; comparison of more than 2000 Escherichia coli genomes finds an E. coli core genome of about 3100 gene families and a total of about 89,000 different gene families. Why do we care about bacterial genome

  13. Correlation between genome reduction and bacterial growth.

    PubMed

    Kurokawa, Masaomi; Seno, Shigeto; Matsuda, Hideo; Ying, Bei-Wen

    2016-12-01

    Genome reduction by removing dispensable genomic sequences in bacteria is commonly used in both fundamental and applied studies to determine the minimal genetic requirements for a living system or to develop highly efficient bioreactors. Nevertheless, whether and how the accumulative loss of dispensable genomic sequences disturbs bacterial growth remains unclear. To investigate the relationship between genome reduction and growth, a series of Escherichia coli strains carrying genomes reduced in a stepwise manner were used. Intensive growth analyses revealed that the accumulation of multiple genomic deletions caused decreases in the exponential growth rate and the saturated cell density in a deletion-length-dependent manner as well as gradual changes in the patterns of growth dynamics, regardless of the growth media. Accordingly, a perspective growth model linking genome evolution to genome engineering was proposed. This study provides the first demonstration of a quantitative connection between genomic sequence and bacterial growth, indicating that growth rate is potentially associated with dispensable genomic sequences.

  14. Genome dynamics in major bacterial pathogens

    PubMed Central

    Ambur, Ole Herman; Davidsen, Tonje; Frye, Stephan A; Balasingham, Seetha V; Lagesen, Karin; Rognes, Torbjørn; Tønjum, Tone

    2009-01-01

    Pathogenic bacteria continuously encounter multiple forms of stress in their hostile environments, which leads to DNA damage. With the new insight into biology offered by genome sequences, the elucidation of the gene content encoding proteins provides clues toward understanding the microbial lifestyle related to habitat and niche. Campylobacter jejuni, Haemophilus influenzae, Helicobacter pylori, Mycobacterium tuberculosis, the pathogenic Neisseria, Streptococcus pneumoniae, Streptococcus pyogenes and Staphylococcus aureus are major human pathogens causing detrimental morbidity and mortality at a global scale. An algorithm for the clustering of orthologs was established in order to identify whether orthologs of selected genes were present or absent in the genomes of the pathogenic bacteria under study. Based on the known genes for the various functions and their orthologs in selected pathogenic bacteria, an overview of the presence of the different types of genes was created. In this context, we focus on selected processes enabling genome dynamics in these particular pathogens, namely DNA repair, recombination and horizontal gene transfer. An understanding of the precise molecular functions of the enzymes participating in DNA metabolism and their importance in the maintenance of bacterial genome integrity has also, in recent years, indicated a future role for these enzymes as targets for therapeutic intervention. PMID:19396949

  15. The bacterial essence of tiny symbiont genomes

    PubMed Central

    McCutcheon, John P.

    2010-01-01

    Bacterial genomes vary in size over two orders of magnitude. The Mycoplasma genitalium genome has traditionally defined the extreme small end of this spectrum, and has therefore heavily informed theoretical and experimental work aimed at determining the minimal gene content necessary to support cellular life. Recent genomic data from insect symbionts has revealed bacterial genomes that are incredibly small—two to four times smaller than M. genitalium—and these tiny genomes have raised questions about the limits of genome reduction and have blurred the once-clear distinction between autonomous cellular life and highly integrated organelle. New data from various systems with symbiotic bacterial or archaeal partners have begun to shed light on how these bacteria may function with such small gene sets, but major mechanistic questions remain. PMID:20044299

  16. Bacterial Communities: Interactions to Scale

    PubMed Central

    Stubbendieck, Reed M.; Vargas-Bautista, Carol; Straight, Paul D.

    2016-01-01

    In the environment, bacteria live in complex multispecies communities. These communities span in scale from small, multicellular aggregates to billions or trillions of cells within the gastrointestinal tract of animals. The dynamics of bacterial communities are determined by pairwise interactions that occur between different species in the community. Though interactions occur between a few cells at a time, the outcomes of these interchanges have ramifications that ripple through many orders of magnitude, and ultimately affect the macroscopic world including the health of host organisms. In this review we cover how bacterial competition influences the structures of bacterial communities. We also emphasize methods and insights garnered from culture-dependent pairwise interaction studies, metagenomic analyses, and modeling experiments. Finally, we argue that the integration of multiple approaches will be instrumental to future understanding of the underlying dynamics of bacterial communities. PMID:27551280

  17. Ten years of bacterial genome sequencing: comparative-genomics-based discoveries.

    PubMed

    Binnewies, Tim T; Motro, Yair; Hallin, Peter F; Lund, Ole; Dunn, David; La, Tom; Hampson, David J; Bellgard, Matthew; Wassenaar, Trudy M; Ussery, David W

    2006-07-01

    It has been more than 10 years since the first bacterial genome sequence was published. Hundreds of bacterial genome sequences are now available for comparative genomics, and searching a given protein against more than a thousand genomes will soon be possible. The subject of this review will address a relatively straightforward question: "What have we learned from this vast amount of new genomic data?" Perhaps one of the most important lessons has been that genetic diversity, at the level of large-scale variation amongst even genomes of the same species, is far greater than was thought. The classical textbook view of evolution relying on the relatively slow accumulation of mutational events at the level of individual bases scattered throughout the genome has changed. One of the most obvious conclusions from examining the sequences from several hundred bacterial genomes is the enormous amount of diversity--even in different genomes from the same bacterial species. This diversity is generated by a variety of mechanisms, including mobile genetic elements and bacteriophages. An examination of the 20 Escherichia coli genomes sequenced so far dramatically illustrates this, with the genome size ranging from 4.6 to 5.5 Mbp; much of the variation appears to be of phage origin. This review also addresses mobile genetic elements, including pathogenicity islands and the structure of transposable elements. There are at least 20 different methods available to compare bacterial genomes. Metagenomics offers the chance to study genomic sequences found in ecosystems, including genomes of species that are difficult to culture. It has become clear that a genome sequence represents more than just a collection of gene sequences for an organism and that information concerning the environment and growth conditions for the organism are important for interpretation of the genomic data. The newly proposed Minimal Information about a Genome Sequence standard has been developed to obtain this

  18. Freshwater bacterial lifestyles inferred from comparative genomics.

    PubMed

    Livermore, Joshua A; Emrich, Scott J; Tan, John; Jones, Stuart E

    2014-03-01

    While micro-organisms actively mediate and participate in freshwater ecosystem services, we know little about freshwater microbial genetic diversity. Genome sequences are available for many bacteria from the human microbiome and the ocean (over 800 and 200, respectively), but only two freshwater genomes are currently available: the streamlined genomes of Polynucleobacter necessarius ssp. asymbioticus and the Actinobacterium AcI-B1. Here, we sequenced and analysed draft genomes of eight phylogentically diverse freshwater bacteria exhibiting a range of lifestyle characteristics. Comparative genomics of these bacteria reveals putative freshwater bacterial lifestyles based on differences in predicted growth rate, capability to respond to environmental stimuli and diversity of useable carbon substrates. Our conceptual model based on these genomic characteristics provides a foundation on which further ecophysiological and genomic studies can be built. In addition, these genomes greatly expand the diversity of existing genomic context for future studies on the ecology and genetics of freshwater bacteria.

  19. Value of a newly sequenced bacterial genome

    PubMed Central

    Barbosa, Eudes GV; Aburjaile, Flavia F; Ramos, Rommel TJ; Carneiro, Adriana R; Le Loir, Yves; Baumbach, Jan; Miyoshi, Anderson; Silva, Artur; Azevedo, Vasco

    2014-01-01

    Next-generation sequencing (NGS) technologies have made high-throughput sequencing available to medium- and small-size laboratories, culminating in a tidal wave of genomic information. The quantity of sequenced bacterial genomes has not only brought excitement to the field of genomics but also heightened expectations that NGS would boost antibacterial discovery and vaccine development. Although many possible drug and vaccine targets have been discovered, the success rate of genome-based analysis has remained below expectations. Furthermore, NGS has had consequences for genome quality, resulting in an exponential increase in draft (partial data) genome deposits in public databases. If no further interests are expressed for a particular bacterial genome, it is more likely that the sequencing of its genome will be limited to a draft stage, and the painstaking tasks of completing the sequencing of its genome and annotation will not be undertaken. It is important to know what is lost when we settle for a draft genome and to determine the “scientific value” of a newly sequenced genome. This review addresses the expected impact of newly sequenced genomes on antibacterial discovery and vaccinology. Also, it discusses the factors that could be leading to the increase in the number of draft deposits and the consequent loss of relevant biological information. PMID:24921006

  20. Dynamics of genome rearrangement in bacterial populations.

    PubMed

    Darling, Aaron E; Miklós, István; Ragan, Mark A

    2008-07-18

    characterization of genome arrangement evolution in a bacterial population evolving outside laboratory conditions. Insight into the process of genomic rearrangement may further the understanding of pathogen population dynamics and selection on the architecture of circular bacterial chromosomes.

  1. One Bacterial Cell, One Complete Genome

    SciTech Connect

    Woyke, Tanja; Tighe, Damon; Mavrommatis, Konstantinos; Clum, Alicia; Copeland, Alex; Schackwitz, Wendy; Lapidus, Alla; Wu, Dongying; McCutcheon, John P.; McDonald, Bradon R.; Moran, Nancy A.; Bristow, James; Cheng, Jan-Fang

    2010-04-26

    While the bulk of the finished microbial genomes sequenced to date are derived from cultured bacterial and archaeal representatives, the vast majority of microorganisms elude current culturing attempts, severely limiting the ability to recover complete or even partial genomes from these environmental species. Single cell genomics is a novel culture-independent approach, which enables access to the genetic material of an individual cell. No single cell genome has to our knowledge been closed and finished to date. Here we report the completed genome from an uncultured single cell of Candidatus Sulcia muelleri DMIN. Digital PCR on single symbiont cells isolated from the bacteriome of the green sharpshooter Draeculacephala minerva bacteriome allowed us to assess that this bacteria is polyploid with genome copies ranging from approximately 200?900 per cell, making it a most suitable target for single cell finishing efforts. For single cell shotgun sequencing, an individual Sulcia cell was isolated and whole genome amplified by multiple displacement amplification (MDA). Sanger-based finishing methods allowed us to close the genome. To verify the correctness of our single cell genome and exclude MDA-derived artifacts, we independently shotgun sequenced and assembled the Sulcia genome from pooled bacteriomes using a metagenomic approach, yielding a nearly identical genome. Four variations we detected appear to be genuine biological differences between the two samples. Comparison of the single cell genome with bacteriome metagenomic sequence data detected two single nucleotide polymorphisms (SNPs), indicating extremely low genetic diversity within a Sulcia population. This study demonstrates the power of single cell genomics to generate a complete, high quality, non-composite reference genome within an environmental sample, which can be used for population genetic analyzes.

  2. The layout of a bacterial genome.

    PubMed

    Képès, François; Jester, Brian C; Lepage, Thibaut; Rafiei, Nafiseh; Rosu, Bianca; Junier, Ivan

    2012-07-16

    Recently the mismatch between our newly acquired capacity to synthetize DNA at genome scale, and our low capacity to design ab initio a functional genome has become conspicuous. This essay gathers a variety of constraints that globally shape natural genomes, with a focus on eubacteria. These constraints originate from chromosome replication (leading/lagging strand asymmetry; gene dosage gradient from origin to terminus; collisions with the transcription complexes), from biased codon usage, from noise control in gene expression, and from genome layout for co-functional genes. On the basis of this analysis, lessons are drawn for full genome design.

  3. Genome Calligrapher: A Web Tool for Refactoring Bacterial Genome Sequences for de Novo DNA Synthesis.

    PubMed

    Christen, Matthias; Deutsch, Samuel; Christen, Beat

    2015-08-21

    Recent advances in synthetic biology have resulted in an increasing demand for the de novo synthesis of large-scale DNA constructs. Any process improvement that enables fast and cost-effective streamlining of digitized genetic information into fabricable DNA sequences holds great promise to study, mine, and engineer genomes. Here, we present Genome Calligrapher, a computer-aided design web tool intended for whole genome refactoring of bacterial chromosomes for de novo DNA synthesis. By applying a neutral recoding algorithm, Genome Calligrapher optimizes GC content and removes obstructive DNA features known to interfere with the synthesis of double-stranded DNA and the higher order assembly into large DNA constructs. Subsequent bioinformatics analysis revealed that synthesis constraints are prevalent among bacterial genomes. However, a low level of codon replacement is sufficient for refactoring bacterial genomes into easy-to-synthesize DNA sequences. To test the algorithm, 168 kb of synthetic DNA comprising approximately 20 percent of the synthetic essential genome of the cell-cycle bacterium Caulobacter crescentus was streamlined and then ordered from a commercial supplier of low-cost de novo DNA synthesis. The successful assembly into eight 20 kb segments indicates that Genome Calligrapher algorithm can be efficiently used to refactor difficult-to-synthesize DNA. Genome Calligrapher is broadly applicable to recode biosynthetic pathways, DNA sequences, and whole bacterial genomes, thus offering new opportunities to use synthetic biology tools to explore the functionality of microbial diversity. The Genome Calligrapher web tool can be accessed at https://christenlab.ethz.ch/GenomeCalligrapher  .

  4. Bacterial genome remodeling through bacteriophage recombination.

    PubMed

    Menouni, Rachid; Hutinet, Geoffrey; Petit, Marie-Agnès; Ansaldi, Mireille

    2015-01-01

    Bacteriophages co-exist and co-evolve with their hosts in natural environments. Virulent phages lyse infected cells through lytic cycles, whereas temperate phages often remain dormant and can undergo lysogenic or lytic cycles. In their lysogenic state, prophages are actually part of the host genome and replicate passively in rhythm with host division. However, prophages are far from being passive residents: they can modify or bring new properties to their host. In this review, we focus on two important phage-encoded recombination mechanisms, i.e. site-specific recombination and homologous recombination, and how they remodel bacterial genomes.

  5. Dominant short repeated sequences in bacterial genomes.

    PubMed

    Avershina, Ekaterina; Rudi, Knut

    2015-03-01

    We use a novel multidimensional searching approach to present the first exhaustive search for all possible repeated sequences in 166 genomes selected to cover the bacterial domain. We found an overrepresentation of repeated sequences in all but one of the genomes. The most prevalent repeats by far were related to interspaced short palindromic repeats (CRISPRs)—conferring bacterial adaptive immunity. We identified a deep branching clade of thermophilic Firmicutes containing the highest number of CRISPR repeats. We also identified a high prevalence of tandem repeated heptamers. In addition, we identified GC-rich repeats that could potentially be involved in recombination events. Finally, we identified repeats in a 16322 amino acid mega protein (involved in biofilm formation) and inverted repeats flanking miniature transposable elements (MITEs). In conclusion, the exhaustive search for repeated sequences identified new elements and distribution of these, which has implications for understanding both the ecology and evolution of bacteria.

  6. New perspectives from genomic analyses of bacterial infectious agents.

    PubMed

    Goldstone, R J; Smith, D G E

    2016-04-01

    Recent advances in the technologies for genomic sequencing and systems for handling and processing sequencing data have transformed bacterial genomics into a near-routine approach for both small- and large-scale investigations of infectious agents. Nonetheless, the application of genomics - especially largerscale studies - to animal infectious agents lags behind its application to human pathogens, despite the growing importance of many animal species as food sources. Assiduously conducted genomic studies offer major benefits, not merely by providing a detailed understanding of infectious agents but also through the exploitation of such findings to enable more accurate diagnosis, high-resolution typing and the development of improved interventions. The use of genomics for these and other purposes is likely to grow in future years and it must be anticipated that investigation and characterisation of important animal infectious agents will also gain considerable benefits. Using mainly animal pathogens as examples - including several infectious agents listed by the World Organisation for Animal Health - this paper provides a concise summary of some recent purposes and developments in bacterial genomics analysis.

  7. Genomic perspectives on the evolution and spread of bacterial pathogens.

    PubMed

    Bentley, Stephen D; Parkhill, Julian

    2015-12-22

    Since the first complete sequencing of a free-living organism, Haemophilus influenzae, genomics has been used to probe both the biology of bacterial pathogens and their evolution. Single-genome approaches provided information on the repertoire of virulence determinants and host-interaction factors, and, along with comparative analyses, allowed the proposal of hypotheses to explain the evolution of many of these traits. These analyses suggested many bacterial pathogens to be of relatively recent origin and identified genome degradation as a key aspect of host adaptation. The advent of very-high-throughput sequencing has allowed for detailed phylogenetic analysis of many important pathogens, revealing patterns of global and local spread, and recent evolution in response to pressure from therapeutics and the human immune system. Such analyses have shown that bacteria can evolve and transmit very rapidly, with emerging clones showing adaptation and global spread over years or decades. The resolution achieved with whole-genome sequencing has shown considerable benefits in clinical microbiology, enabling accurate outbreak tracking within hospitals and across continents. Continued large-scale sequencing promises many further insights into genetic determinants of drug resistance, virulence and transmission in bacterial pathogens.

  8. Genomic perspectives on the evolution and spread of bacterial pathogens

    PubMed Central

    Bentley, Stephen D.

    2015-01-01

    Since the first complete sequencing of a free-living organism, Haemophilus influenzae, genomics has been used to probe both the biology of bacterial pathogens and their evolution. Single-genome approaches provided information on the repertoire of virulence determinants and host-interaction factors, and, along with comparative analyses, allowed the proposal of hypotheses to explain the evolution of many of these traits. These analyses suggested many bacterial pathogens to be of relatively recent origin and identified genome degradation as a key aspect of host adaptation. The advent of very-high-throughput sequencing has allowed for detailed phylogenetic analysis of many important pathogens, revealing patterns of global and local spread, and recent evolution in response to pressure from therapeutics and the human immune system. Such analyses have shown that bacteria can evolve and transmit very rapidly, with emerging clones showing adaptation and global spread over years or decades. The resolution achieved with whole-genome sequencing has shown considerable benefits in clinical microbiology, enabling accurate outbreak tracking within hospitals and across continents. Continued large-scale sequencing promises many further insights into genetic determinants of drug resistance, virulence and transmission in bacterial pathogens. PMID:26702036

  9. Finishing bacterial genome assemblies with Mix

    PubMed Central

    2013-01-01

    Motivation Among challenges that hamper reaping the benefits of genome assembly are both unfinished assemblies and the ensuing experimental costs. First, numerous software solutions for genome de novo assembly are available, each having its advantages and drawbacks, without clear guidelines as to how to choose among them. Second, these solutions produce draft assemblies that often require a resource intensive finishing phase. Methods In this paper we address these two aspects by developing Mix , a tool that mixes two or more draft assemblies, without relying on a reference genome and having the goal to reduce contig fragmentation and thus speed-up genome finishing. The proposed algorithm builds an extension graph where vertices represent extremities of contigs and edges represent existing alignments between these extremities. These alignment edges are used for contig extension. The resulting output assembly corresponds to a set of paths in the extension graph that maximizes the cumulative contig length. Results We evaluate the performance of Mix on bacterial NGS data from the GAGE-B study and apply it to newly sequenced Mycoplasma genomes. Resulting final assemblies demonstrate a significant improvement in the overall assembly quality. In particular, Mix is consistent by providing better overall quality results even when the choice is guided solely by standard assembly statistics, as is the case for de novo projects. Availability Mix is implemented in Python and is available at https://github.com/cbib/MIX, novel data for our Mycoplasma study is available at http://services.cbib.u-bordeaux2.fr/mix/. PMID:24564706

  10. Transforming clinical microbiology with bacterial genome sequencing.

    PubMed

    Didelot, Xavier; Bowden, Rory; Wilson, Daniel J; Peto, Tim E A; Crook, Derrick W

    2012-09-01

    Whole-genome sequencing of bacteria has recently emerged as a cost-effective and convenient approach for addressing many microbiological questions. Here, we review the current status of clinical microbiology and how it has already begun to be transformed by using next-generation sequencing. We focus on three essential tasks: identifying the species of an isolate, testing its properties, such as resistance to antibiotics and virulence, and monitoring the emergence and spread of bacterial pathogens. We predict that the application of next-generation sequencing will soon be sufficiently fast, accurate and cheap to be used in routine clinical microbiology practice, where it could replace many complex current techniques with a single, more efficient workflow.

  11. Transforming clinical microbiology with bacterial genome sequencing

    PubMed Central

    2016-01-01

    Whole genome sequencing of bacteria has recently emerged as a cost-effective and convenient approach for addressing many microbiological questions. Here we review the current status of clinical microbiology and how it has already begun to be transformed by the use of next-generation sequencing. We focus on three essential tasks: identifying the species of an isolate, testing its properties such as resistance to antibiotics and virulence, and monitoring the emergence and spread of bacterial pathogens. The application of next-generation sequencing will soon be sufficiently fast, accurate and cheap to be used in routine clinical microbiology practice, where it could replace many complex current techniques with a single, more efficient workflow. PMID:22868263

  12. From bacterial genome to functionality; case bifidobacteria.

    PubMed

    Ventura, Marco; O'Connell-Motherway, Mary; Leahy, Sinead; Moreno-Munoz, Jose Antonio; Fitzgerald, Gerald F; van Sinderen, Douwe

    2007-11-30

    The availability of complete bacterial genome sequences has significantly furthered our understanding of the genetics, physiology and biochemistry of the microorganisms in question, particularly those that have commercially important applications. Bifidobacteria are among such microorganisms, as they constitute mammalian commensals of biotechnological significance due to their perceived role in maintaining a balanced gastrointestinal (GIT) microflora. Bifidobacteria are therefore frequently used as health-promoting or probiotic components in functional food products. A fundamental understanding of the metabolic activities employed by these commensal bacteria, in particular their capability to utilize a wide range of complex oligosaccharides, can reveal ways to provide in vivo growth advantages relative to other competing gut bacteria or pathogens. Furthermore, an in depth analysis of adaptive responses to nutritional or environmental stresses may provide methodologies to retain viability and improve functionality during commercial preparation, storage and delivery of the probiotic organism.

  13. Gene calling and bacterial genome annotation with BG7.

    PubMed

    Tobes, Raquel; Pareja-Tobes, Pablo; Manrique, Marina; Pareja-Tobes, Eduardo; Kovach, Evdokim; Alekhin, Alexey; Pareja, Eduardo

    2015-01-01

    New massive sequencing technologies are providing many bacterial genome sequences from diverse taxa but a refined annotation of these genomes is crucial for obtaining scientific findings and new knowledge. Thus, bacterial genome annotation has emerged as a key point to investigate in bacteria. Any efficient tool designed specifically to annotate bacterial genomes sequenced with massively parallel technologies has to consider the specific features of bacterial genomes (absence of introns and scarcity of nonprotein-coding sequence) and of next-generation sequencing (NGS) technologies (presence of errors and not perfectly assembled genomes). These features make it convenient to focus on coding regions and, hence, on protein sequences that are the elements directly related with biological functions. In this chapter we describe how to annotate bacterial genomes with BG7, an open-source tool based on a protein-centered gene calling/annotation paradigm. BG7 is specifically designed for the annotation of bacterial genomes sequenced with NGS. This tool is sequence error tolerant maintaining their capabilities for the annotation of highly fragmented genomes or for annotating mixed sequences coming from several genomes (as those obtained through metagenomics samples). BG7 has been designed with scalability as a requirement, with a computing infrastructure completely based on cloud computing (Amazon Web Services).

  14. Genomic and Transcriptomic Analyses of Foodborne Bacterial Pathogens

    NASA Astrophysics Data System (ADS)

    Zhang, Wei; Dudley, Edward G.; Wade, Joseph T.

    DNA microarrays (often interchangeably called DNA chips or DNA arrays) are among the most popular analytical tools for high-throughput comparative genomic and transcriptomic analyses of foodborne bacterial pathogens. A typical DNA microarray contains hundreds to millions of small DNA probes that are chemically attached (or "printed") onto the surface of a microscopic glass slide. Depending on the specific "printing" and probe synthesis technologies for different microarray platforms, such DNA probes can be PCR amplicons or in situ synthesized short oligonucleotides. DNA microarray technologies have revolutionized the way that we investigate the biology of foodborne bacterial pathogens. The major advantage of these technologies is that DNA microarrays allow comparison of subtle genomic or transcriptomic variations between two bacterial samples, such as genomic variations between two different bacterial strains or transcriptomic alterations of same bacterial strain under two different treatments. Some applications of comparative genomic hybridization microarrays and global gene expression microarrays have been covered in previous chapters of this book.

  15. Genome engineering and gene expression control for bacterial strain development.

    PubMed

    Song, Chan Woo; Lee, Joungmin; Lee, Sang Yup

    2015-01-01

    In recent years, a number of techniques and tools have been developed for genome engineering and gene expression control to achieve desired phenotypes of various bacteria. Here we review and discuss the recent advances in bacterial genome manipulation and gene expression control techniques, and their actual uses with accompanying examples. Genome engineering has been commonly performed based on homologous recombination. During such genome manipulation, the counterselection systems employing SacB or nucleases have mainly been used for the efficient selection of desired engineered strains. The recombineering technology enables simple and more rapid manipulation of the bacterial genome. The group II intron-mediated genome engineering technology is another option for some bacteria that are difficult to be engineered by homologous recombination. Due to the increasing demands on high-throughput screening of bacterial strains having the desired phenotypes, several multiplex genome engineering techniques have recently been developed and validated in some bacteria. Another approach to achieve desired bacterial phenotypes is the repression of target gene expression without the modification of genome sequences. This can be performed by expressing antisense RNA, small regulatory RNA, or CRISPR RNA to repress target gene expression at the transcriptional or translational level. All of these techniques allow efficient and rapid development and screening of bacterial strains having desired phenotypes, and more advanced techniques are expected to be seen.

  16. Holotransformations of bacterial colonies and genome cybernetics

    NASA Astrophysics Data System (ADS)

    Ben-Jacob, Eshel; Tenenbaum, Adam; Shochet, Ofer; Avidan, Orna

    1994-01-01

    We present a study of colony transformations during growth of Bacillus subtilis under adverse environmental conditions. It is a continuation of our pilot study of “Adaptive self-organization during growth of bacterial colonies” (Physica A 187 (1992) 378). First we identify and describe the transformations pathway, i.e. the excitation of the branching modes from Bacillus subtilis 168 (grown under diffusion limited conditions) and the phase transformations between the tip-splitting phase (phase T) and the chiral phase (phase C) which belong to the same mode. This pathway shows the evolution of complexity as the bacteria are exposed to adverse growth conditions. We present the morphology diagram of phases T and C as a function of agar concentration and pepton level. As expected, the growth of phase T is ramified (fractal-like or DLA-like) at low pepton level (about 1 g/1) and turns compact at high pepton level (about 10 g/1). The growth of phase C is also ramified at low pepton level and turns denser and finally compact as the pepton level increases. Generally speaking, the colonies develop more complex patterns and higher micro-level organization for more adverse environments. We use the growth velocity as a response function to describe the growth. At low agar concentration (and low pepton level) phase C grows faster than phase T, and for a high agar concentration (about 2%) phase T grows faster. We observe colony transformations between the two phases (phase transformations). They are found to be consistent with the “fastest growing morphology” selection principle adopted from azoic systems. The transformations are always from the slower phase to the faster one. Hence, we observe T→ C transformations at low agar concentrations and C→ T transformations at high agar concentrations. We have observed both localized and extended transformations. Usually, the transformations are localized for more adverse growth conditions, and extended for growth conditions

  17. Harnessing CRISPR-Cas systems for bacterial genome editing.

    PubMed

    Selle, Kurt; Barrangou, Rodolphe

    2015-04-01

    Manipulation of genomic sequences facilitates the identification and characterization of key genetic determinants in the investigation of biological processes. Genome editing via clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated (Cas) constitutes a next-generation method for programmable and high-throughput functional genomics. CRISPR-Cas systems are readily reprogrammed to induce sequence-specific DNA breaks at target loci, resulting in fixed mutations via host-dependent DNA repair mechanisms. Although bacterial genome editing is a relatively unexplored and underrepresented application of CRISPR-Cas systems, recent studies provide valuable insights for the widespread future implementation of this technology. This review summarizes recent progress in bacterial genome editing and identifies fundamental genetic and phenotypic outcomes of CRISPR targeting in bacteria, in the context of tool development, genome homeostasis, and DNA repair.

  18. LATERAL GENE TRANSFER AND THE HISTORY OF BACTERIAL GENOMES

    SciTech Connect

    Howard Ochman

    2006-02-22

    The aims of this research were to elucidate the role and extent of lateral transfer in the differentiation of bacterial strains and species, and to assess the impact of gene transfer on the evolution of bacterial genomes. The ultimate goal of the project is to examine the dynamics of a core set of protein-coding genes (i.e., those that are distributed universally among Bacteria) by developing conserved primers that would allow their amplification and sequencing in any bacterial taxa. In addition, we adopted a bioinformatic approach to elucidate the extent of lateral gene transfer in sequenced genome.

  19. Identifying characteristic scales in the human genome

    NASA Astrophysics Data System (ADS)

    Carpena, P.; Bernaola-Galván, P.; Coronado, A. V.; Hackenberg, M.; Oliver, J. L.

    2007-03-01

    The scale-free, long-range correlations detected in DNA sequences contrast with characteristic lengths of genomic elements, being particularly incompatible with the isochores (long, homogeneous DNA segments). By computing the local behavior of the scaling exponent α of detrended fluctuation analysis (DFA), we discriminate between sequences with and without true scaling, and we find that no single scaling exists in the human genome. Instead, human chromosomes show a common compositional structure with two characteristic scales, the large one corresponding to the isochores and the other to small and medium scale genomic elements.

  20. Correcting Inconsistencies and Errors in Bacterial Genome Metadata Using an Automated Curation Tool in Excel (AutoCurE).

    PubMed

    Schmedes, Sarah E; King, Jonathan L; Budowle, Bruce

    2015-01-01

    Whole-genome data are invaluable for large-scale comparative genomic studies. Current sequencing technologies have made it feasible to sequence entire bacterial genomes with relative ease and time with a substantially reduced cost per nucleotide, hence cost per genome. More than 3,000 bacterial genomes have been sequenced and are available at the finished status. Publically available genomes can be readily downloaded; however, there are challenges to verify the specific supporting data contained within the download and to identify errors and inconsistencies that may be present within the organizational data content and metadata. AutoCurE, an automated tool for bacterial genome database curation in Excel, was developed to facilitate local database curation of supporting data that accompany downloaded genomes from the National Center for Biotechnology Information. AutoCurE provides an automated approach to curate local genomic databases by flagging inconsistencies or errors by comparing the downloaded supporting data to the genome reports to verify genome name, RefSeq accession numbers, the presence of archaea, BioProject/UIDs, and sequence file descriptions. Flags are generated for nine metadata fields if there are inconsistencies between the downloaded genomes and genomes reports and if erroneous or missing data are evident. AutoCurE is an easy-to-use tool for local database curation for large-scale genome data prior to downstream analyses.

  1. Ensembl Genomes 2013: scaling up access to genome-wide data

    PubMed Central

    Kersey, Paul Julian; Allen, James E.; Christensen, Mikkel; Davis, Paul; Falin, Lee J.; Grabmueller, Christoph; Hughes, Daniel Seth Toney; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Langridge, Nicholas; McDowall, Mark D.; Maheswari, Uma; Maslen, Gareth; Nuhn, Michael; Ong, Chuang Kee; Paulini, Michael; Pedro, Helder; Toneva, Iliana; Tuli, Mary Ann; Walts, Brandon; Williams, Gareth; Wilson, Derek; Youens-Clark, Ken; Monaco, Marcela K.; Stein, Joshua; Wei, Xuehong; Ware, Doreen; Bolser, Daniel M.; Howe, Kevin Lee; Kulesha, Eugene; Lawson, Daniel; Staines, Daniel Michael

    2014-01-01

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species. The project exploits and extends technologies for genome annotation, analysis and dissemination, developed in the context of the vertebrate-focused Ensembl project, and provides a complementary set of resources for non-vertebrate species through a consistent set of programmatic and interactive interfaces. These provide access to data including reference sequence, gene models, transcriptional data, polymorphisms and comparative analysis. This article provides an update to the previous publications about the resource, with a focus on recent developments. These include the addition of important new genomes (and related data sets) including crop plants, vectors of human disease and eukaryotic pathogens. In addition, the resource has scaled up its representation of bacterial genomes, and now includes the genomes of over 9000 bacteria. Specific extensions to the web and programmatic interfaces have been developed to support users in navigating these large data sets. Looking forward, analytic tools to allow targeted selection of data for visualization and download are likely to become increasingly important in future as the number of available genomes increases within all domains of life, and some of the challenges faced in representing bacterial data are likely to become commonplace for eukaryotes in future. PMID:24163254

  2. Ecological and Temporal Constraints in the Evolution of Bacterial Genomes

    PubMed Central

    Boto, Luis; Martínez, Jose Luis

    2011-01-01

    Studies on the experimental evolution of microorganisms, on their in vivo evolution (mainly in the case of bacteria producing chronic infections), as well as the availability of multiple full genomic sequences, are placing bacteria in the playground of evolutionary studies. In the present article we review the differential contribution to the evolution of bacterial genomes that processes such as gene modification, gene acquisition and gene loss may have when bacteria colonize different habitats that present characteristic ecological features. In particular, we review how the different processes contribute to evolution in microbial communities, in free-living bacteria or in bacteria living in isolation. In addition, we discuss the temporal constraints in the evolution of bacterial genomes, considering bacterial evolution from the perspective of processes of short-sighted evolution and punctual acquisition of evolutionary novelties followed by long stasis periods. PMID:24710293

  3. Bacterial Cellular Engineering by Genome Editing and Gene Silencing

    PubMed Central

    Nakashima, Nobutaka; Miyazaki, Kentaro

    2014-01-01

    Genome editing is an important technology for bacterial cellular engineering, which is commonly conducted by homologous recombination-based procedures, including gene knockout (disruption), knock-in (insertion), and allelic exchange. In addition, some new recombination-independent approaches have emerged that utilize catalytic RNAs, artificial nucleases, nucleic acid analogs, and peptide nucleic acids. Apart from these methods, which directly modify the genomic structure, an alternative approach is to conditionally modify the gene expression profile at the posttranscriptional level without altering the genomes. This is performed by expressing antisense RNAs to knock down (silence) target mRNAs in vivo. This review describes the features and recent advances on methods used in genomic engineering and silencing technologies that are advantageously used for bacterial cellular engineering. PMID:24552876

  4. Bacterial Recombineering: Genome Engineering via Phage-Based Homologous Recombination.

    PubMed

    Pines, Gur; Freed, Emily F; Winkler, James D; Gill, Ryan T

    2015-11-20

    The ability to specifically modify bacterial genomes in a precise and efficient manner is highly desired in various fields, ranging from molecular genetics to metabolic engineering and synthetic biology. Much has changed from the initial realization that phage-derived genes may be employed for such tasks to today, where recombineering enables complex genetic edits within a genome or a population. Here, we review the major developments leading to recombineering becoming the method of choice for in situ bacterial genome editing while highlighting the various applications of recombineering in pushing the boundaries of synthetic biology. We also present the current understanding of the mechanism of recombineering. Finally, we discuss in detail issues surrounding recombineering efficiency and future directions for recombineering-based genome editing.

  5. Applications of genome-scale metabolic reconstructions

    PubMed Central

    Oberhardt, Matthew A; Palsson, Bernhard Ø; Papin, Jason A

    2009-01-01

    The availability and utility of genome-scale metabolic reconstructions have exploded since the first genome-scale reconstruction was published a decade ago. Reconstructions have now been built for a wide variety of organisms, and have been used toward five major ends: (1) contextualization of high-throughput data, (2) guidance of metabolic engineering, (3) directing hypothesis-driven discovery, (4) interrogation of multi-species relationships, and (5) network property discovery. In this review, we examine the many uses and future directions of genome-scale metabolic reconstructions, and we highlight trends and opportunities in the field that will make the greatest impact on many fields of biology. PMID:19888215

  6. Genome Sequence of Bacterial Interference Strain Staphylococcus aureus 502A.

    PubMed

    Parker, Dane; Narechania, Apurva; Sebra, Robert; Deikus, Gintaras; Larussa, Samuel; Ryan, Chanelle; Smith, Hannah; Prince, Alice; Mathema, Barun; Ratner, Adam J; Kreiswirth, Barry; Planet, Paul J

    2014-04-10

    Staphylococcus aureus 502A was a strain used in bacterial interference programs during the 1960s and early 1970s. Infants were deliberately colonized with 502A with the goal of preventing colonization with more invasive strains. We present the completed genome sequence of this organism.

  7. The influence of the accessory genome on bacterial pathogen evolution

    PubMed Central

    Vinatzer, Boris; Arnold, Dawn L; Dorus, Steve; Murillo, Jesús

    2011-01-01

    Bacterial pathogens exhibit significant variation in their genomic content of virulence factors. This reflects the abundance of strategies pathogens evolved to infect host organisms by suppressing host immunity. Molecular arms-races have been a strong driving force for the evolution of pathogenicity, with pathogens often encoding overlapping or redundant functions, such as type III protein secretion effectors and hosts encoding ever more sophisticated immune systems. The pathogens' frequent exposure to other microbes, either in their host or in the environment, provides opportunities for the acquisition or interchange of mobile genetic elements. These DNA elements accessorize the core genome and can play major roles in shaping genome structure and altering the complement of virulence factors. Here, we review the different mobile genetic elements focusing on the more recent discoveries and highlighting their role in shaping bacterial pathogen evolution. PMID:22016845

  8. Genes but Not Genomes Reveal Bacterial Domestication of Lactococcus Lactis

    PubMed Central

    Passerini, Delphine; Beltramo, Charlotte; Coddeville, Michele; Quentin, Yves; Ritzenthaler, Paul

    2010-01-01

    Background The population structure and diversity of Lactococcus lactis subsp. lactis, a major industrial bacterium involved in milk fermentation, was determined at both gene and genome level. Seventy-six lactococcal isolates of various origins were studied by different genotyping methods and thirty-six strains displaying unique macrorestriction fingerprints were analyzed by a new multilocus sequence typing (MLST) scheme. This gene-based analysis was compared to genomic characteristics determined by pulsed-field gel electrophoresis (PFGE). Methodology/Principal Findings The MLST analysis revealed that L. lactis subsp. lactis is essentially clonal with infrequent intra- and intergenic recombination; also, despite its taxonomical classification as a subspecies, it displays a genetic diversity as substantial as that within several other bacterial species. Genome-based analysis revealed a genome size variability of 20%, a value typical of bacteria inhabiting different ecological niches, and that suggests a large pan-genome for this subspecies. However, the genomic characteristics (macrorestriction pattern, genome or chromosome size, plasmid content) did not correlate to the MLST-based phylogeny, with strains from the same sequence type (ST) differing by up to 230 kb in genome size. Conclusion/Significance The gene-based phylogeny was not fully consistent with the traditional classification into dairy and non-dairy strains but supported a new classification based on ecological separation between “environmental” strains, the main contributors to the genetic diversity within the subspecies, and “domesticated” strains, subject to recent genetic bottlenecks. Comparison between gene- and genome-based analyses revealed little relationship between core and dispensable genome phylogenies, indicating that clonal diversification and phenotypic variability of the “domesticated” strains essentially arose through substantial genomic flux within the dispensable genome

  9. Genome scale engineering techniques for metabolic engineering.

    PubMed

    Liu, Rongming; Bassalo, Marcelo C; Zeitoun, Ramsey I; Gill, Ryan T

    2015-11-01

    Metabolic engineering has expanded from a focus on designs requiring a small number of genetic modifications to increasingly complex designs driven by advances in genome-scale engineering technologies. Metabolic engineering has been generally defined by the use of iterative cycles of rational genome modifications, strain analysis and characterization, and a synthesis step that fuels additional hypothesis generation. This cycle mirrors the Design-Build-Test-Learn cycle followed throughout various engineering fields that has recently become a defining aspect of synthetic biology. This review will attempt to summarize recent genome-scale design, build, test, and learn technologies and relate their use to a range of metabolic engineering applications.

  10. Genome-scale rates of evolutionary change in bacteria

    PubMed Central

    Duchêne, Sebastian; Holt, Kathryn E.; Weill, François-Xavier; Le Hello, Simon; Hawkey, Jane; Edwards, David J.; Fourment, Mathieu

    2016-01-01

    Estimating the rates at which bacterial genomes evolve is critical to understanding major evolutionary and ecological processes such as disease emergence, long-term host–pathogen associations and short-term transmission patterns. The surge in bacterial genomic data sets provides a new opportunity to estimate these rates and reveal the factors that shape bacterial evolutionary dynamics. For many organisms estimates of evolutionary rate display an inverse association with the time-scale over which the data are sampled. However, this relationship remains unexplored in bacteria due to the difficulty in estimating genome-wide evolutionary rates, which are impacted by the extent of temporal structure in the data and the prevalence of recombination. We collected 36 whole genome sequence data sets from 16 species of bacterial pathogens to systematically estimate and compare their evolutionary rates and assess the extent of temporal structure in the absence of recombination. The majority (28/36) of data sets possessed sufficient clock-like structure to robustly estimate evolutionary rates. However, in some species reliable estimates were not possible even with ‘ancient DNA’ data sampled over many centuries, suggesting that they evolve very slowly or that they display extensive rate variation among lineages. The robustly estimated evolutionary rates spanned several orders of magnitude, from approximately 10−5 to 10−8 nucleotide substitutions per site year−1. This variation was negatively associated with sampling time, with this relationship best described by an exponential decay curve. To avoid potential estimation biases, such time-dependency should be considered when inferring evolutionary time-scales in bacteria. PMID:28348834

  11. Synthetic genomics and the construction of a synthetic bacterial cell.

    PubMed

    Glass, John I

    2012-01-01

    The first synthetic cellular organism was created in 2010 and based on a very small, very simple bacterium called Mycoplasma mycoides. The bacterium was called synthetic because its DNA genome was chemically synthesized rather than replicated from an existing template DNA, as occurs in all other known cellular life on Earth. The experiment was undertaken in order to develop a system that would allow creation of a minimal bacterial cell that could lead to a better understand of the first principles of cellular life. The effort resulted in new synthetic genomics techniques called genome assembly and genome transplantation. The ability of scientists to design and build bacteria opens new possibilities for creating microbes to solve human problems.

  12. Genix: a new online automated pipeline for bacterial genome annotation.

    PubMed

    Kremer, Frederico Schmitt; Eslabão, Marcus Redü; Dellagostin, Odir Antônio; Pinto, Luciano da Silva

    2016-12-01

    Next-generation sequencing has significantly reduced the cost of genome-sequencing projects, resulting in an expressive increase in the availability of genomic data in public databases. The cheaper and easier is to sequence new genomes, the more accurate the annotation steps have to be to avoid both the loss of information and the accumulation of erroneous features that may affect the accuracy of further analysis. In the case of bacteria genomes, a range of web annotation software has been developed; however, many applications have yet to incorporate the steps required to improve their result, including the removal of false-positive/spurious and a more complete identification of non-coding features. We present Genix, a new web-based bacterial genome annotation pipeline. A comparison of the results generated by Genix for four reference genomes against those generated by other annotation tools indicated that our pipeline is able to provide results that are closer to the reference genome annotation, with a smaller amount of false-positive proteins and missing functional annotated proteins. Additionally, the metrics obtained by Genix were slightly better than those obtained by Prokka, a state-of-art standalone annotation system. Our results indicate that Genix is a useful tool that is able to provide a more refined result, and may be a user-friendly way to obtain high-quality results.

  13. Breakthroughs in field-scale bacterial transport

    SciTech Connect

    Balkwill, D; Chen, J; Deflaun, Mary; Dobbs, F; Dong, H; Fredrickson, Jim K. ); Fuller, M; Green, M ); Ginn, T; Griffin, T; Holben, W; Hubbard, S; Johnson, W; Long, Philip E. ); Mailloux, B; Majer, E; Mcinerney, M; Murray, Christopher J. ); Onstott, T; Phelps, T; Scheibe, Timothy D. ); Swift, D; White, D; Wobber, F

    2001-06-01

    This article summarizes a bioaugmentation research project undertaken by a DOE-sponsored, multidisciplinary research team at a field site near Oyster, Virginia. The overall purpose of the ongoing project is to evaluate the relative importance of hydrogeological and geochemical heterogeneities in controlling bacterial transpor, and to develop an approach for quantitative prediction of bacterial transport needed to design optimal bioremediation strategies.

  14. Developing insights into the mechanisms of evolution of bacterial pathogens from whole-genome sequences

    PubMed Central

    Bentley, Stephen D

    2014-01-01

    Evolution of bacterial pathogen populations has been detected in a variety of ways including phenotypic tests, such as metabolic activity, reaction to antisera and drug resistance and genotypic tests that measure variation in chromosome structure, repetitive loci and individual gene sequences. While informative, these methods only capture a small subset of the total variation and, therefore, have limited resolution. Advances in sequencing technologies have made it feasible to capture whole-genome sequence variation for each sample under study, providing the potential to detect all changes at all positions in the genome from single nucleotide changes to large-scale insertions and deletions. In this review, we focus on recent work that has applied this powerful new approach and summarize some of the advances that this has brought in our understanding of the details of how bacterial pathogens evolve. PMID:23075447

  15. Unveiling Bacterial Interactions through Multidimensional Scaling and Dynamics Modeling

    PubMed Central

    Dorado-Morales, Pedro; Vilanova, Cristina; P. Garay, Carlos; Martí, Jose Manuel; Porcar, Manuel

    2015-01-01

    We propose a new strategy to identify and visualize bacterial consortia by conducting replicated culturing of environmental samples coupled with high-throughput sequencing and multidimensional scaling analysis, followed by identification of bacteria-bacteria correlations and interactions. We conducted a proof of concept assay with pine-tree resin-based media in ten replicates, which allowed detecting and visualizing dynamical bacterial associations in the form of statistically significant and yet biologically relevant bacterial consortia. PMID:26671778

  16. Elucidation of operon structures across closely related bacterial genomes.

    PubMed

    Zhou, Chuan; Ma, Qin; Li, Guojun

    2014-01-01

    About half of the protein-coding genes in prokaryotic genomes are organized into operons to facilitate co-regulation during transcription. With the evolution of genomes, operon structures are undergoing changes which could coordinate diverse gene expression patterns in response to various stimuli during the life cycle of a bacterial cell. Here we developed a graph-based model to elucidate the diversity of operon structures across a set of closely related bacterial genomes. In the constructed graph, each node represents one orthologous gene group (OGG) and a pair of nodes will be connected if any two genes, from the corresponding two OGGs respectively, are located in the same operon as immediate neighbors in any of the considered genomes. Through identifying the connected components in the above graph, we found that genes in a connected component are likely to be functionally related and these identified components tend to form treelike topology, such as paths and stars, corresponding to different biological mechanisms in transcriptional regulation as follows. Specifically, (i) a path-structure component integrates genes encoding a protein complex, such as ribosome; and (ii) a star-structure component not only groups related genes together, but also reflects the key functional roles of the central node of this component, such as the ABC transporter with a transporter permease and substrate-binding proteins surrounding it. Most interestingly, the genes from organisms with highly diverse living environments, i.e., biomass degraders and animal pathogens of clostridia in our study, can be clearly classified into different topological groups on some connected components.

  17. Mining Bacterial Genomes for Secondary Metabolite Gene Clusters.

    PubMed

    Adamek, Martina; Spohn, Marius; Stegmann, Evi; Ziemert, Nadine

    2017-01-01

    With the emergence of bacterial resistance against frequently used antibiotics, novel antibacterial compounds are urgently needed. Traditional bioactivity-guided drug discovery strategies involve laborious screening efforts and display high rediscovery rates. With the progress in next generation sequencing methods and the knowledge that the majority of antibiotics in clinical use are produced as secondary metabolites by bacteria, mining bacterial genomes for secondary metabolites with antimicrobial activity is a promising approach, which can guide a more time and cost-effective identification of novel compounds. However, what sounds easy to accomplish, comes with several challenges. To date, several tools for the prediction of secondary metabolite gene clusters are available, some of which are based on the detection of signature genes, while others are searching for specific patterns in gene content or regulation.Apart from the mere identification of gene clusters, several other factors such as determining cluster boundaries and assessing the novelty of the detected cluster are important. For this purpose, comparison of the predicted secondary metabolite genes with different cluster and compound databases is necessary. Furthermore, it is advisable to classify detected clusters into gene cluster families. So far, there is no standardized procedure for genome mining; however, different approaches to overcome all of these challenges exist and are addressed in this chapter. We give practical guidance on the workflow for secondary metabolite gene cluster identification, which includes the determination of gene cluster boundaries, addresses problems occurring with the use of draft genomes, and gives an outlook on the different methods for gene cluster classification. Based on comprehensible examples a protocol is set, which should enable the readers to mine their own genome data for interesting secondary metabolites.

  18. Reconstruction of a Bacterial Genome from DNA Cassettes

    SciTech Connect

    Christopher Dupont; John Glass; Laura Sheahan; Shibu Yooseph; Lisa Zeigler Allen; Mathangi Thiagarajan; Andrew Allen; Robert Friedman; J. Craig Venter

    2011-12-31

    This basic research program comprised two major areas: (1) acquisition and analysis of marine microbial metagenomic data and development of genomic analysis tools for broad, external community use; (2) development of a minimal bacterial genome. Our Marine Metagenomic Diversity effort generated and analyzed shotgun sequencing data from microbial communities sampled from over 250 sites around the world. About 40% of the 26 Gbp of sequence data has been made publicly available to date with a complete release anticipated in six months. Our results and those mining the deposited data have revealed a vast diversity of genes coding for critical metabolic processes whose phylogenetic and geographic distributions will enable a deeper understanding of carbon and nutrient cycling, microbial ecology, and rapid rate evolutionary processes such as horizontal gene transfer by viruses and plasmids. A global assembly of the generated dataset resulted in a massive set (5Gbp) of genome fragments that provide context to the majority of the generated data that originated from uncultivated organisms. Our Synthetic Biology team has made significant progress towards the goal of synthesizing a minimal mycoplasma genome that will have all of the machinery for independent life. This project, once completed, will provide fundamentally new knowledge about requirements for microbial life and help to lay a basic research foundation for developing microbiological approaches to bioenergy.

  19. Use of whole genome DNA spectrograms in bacterial classification.

    PubMed

    Kubicova, Vladimira; Provaznik, Ivo

    2016-02-01

    A spectrogram reflects the arrangement of nucleotides through the whole chromosome or genome. Our previous study suggested that the spectrogram of whole genome DNA sequences is a suitable tool for the determination of relationships among bacteria. Related bacteria have similar spectrograms, and similarity in spectrograms was measured using a color layout descriptor. Several parameters, such as the mapping of four bases into a spectrogram, the number of considered elements in the color layout descriptor, the color model of the image and the building tree method, can be changed. This study addresses the use of parameter selection to ensure the best classification results. The quality of the classification was measured by Matthew's correlation coefficient (MCC). The proposed method with optimal parameters (called SpectCMP-Spectrogram CoMParison method) achieved an average MCC of 0.73 at the phylum level. The SpectCMP method was also tested at the order level; the average MCC in the classification of class Gammaproteobacteria was 0.76. The success of a classification with respect to the correct phyla was compared to three methods that are used in bacterial phylogeny: the CVTree method, OGTree method and moment vector method. The results show that the SpectCMP method can be used in bacterial classification at various taxonomic levels.

  20. The bacterial species definition in the genomic era

    PubMed Central

    Konstantinidis, Konstantinos T; Ramette, Alban; Tiedje, James M

    2006-01-01

    The bacterial species definition, despite its eminent practical significance for identification, diagnosis, quarantine and diversity surveys, remains a very difficult issue to advance. Genomics now offers novel insights into intra-species diversity and the potential for emergence of a more soundly based system. Although we share the excitement, we argue that it is premature for a universal change to the definition because current knowledge is based on too few phylogenetic groups and too few samples of natural populations. Our analysis of five important bacterial groups suggests, however, that more stringent standards for species may be justifiable when a solid understanding of gene content and ecological distinctiveness becomes available. Our analysis also reveals what is actually encompassed in a species according to the current standards, in terms of whole-genome sequence and gene-content diversity, and shows that this does not correspond to coherent clusters for the environmental Burkholderia and Shewanella genera examined. In contrast, the obligatory pathogens, which have a very restricted ecological niche, do exhibit clusters. Therefore, the idea of biologically meaningful clusters of diversity that applies to most eukaryotes may not be universally applicable in the microbial world, or if such clusters exist, they may be found at different levels of distinction. PMID:17062412

  1. Lateral gene transfer, bacterial genome evolution, and the Anthropocene.

    PubMed

    Gillings, Michael R

    2017-02-01

    Lateral gene transfer (LGT) has significantly influenced bacterial evolution since the origins of life. It helped bacteria generate flexible, mosaic genomes and enables individual cells to rapidly acquire adaptive phenotypes. In turn, this allowed bacteria to mount strong defenses against human attempts to control their growth. The widespread dissemination of genes conferring resistance to antimicrobial agents has precipitated a crisis for modern medicine. Our actions can promote increased rates of LGT and also provide selective forces to fix such events in bacterial populations. For instance, the use of selective agents induces the bacterial SOS response, which stimulates LGT. We create hotspots for lateral transfer, such as wastewater systems, hospitals, and animal production facilities. Conduits of gene transfer between humans and animals ensure rapid dissemination of recent transfer events, as does modern transport and globalization. As resistance to antibacterial compounds becomes universal, there is likely to be increasing selection pressure for phenotypes with adverse consequences for human welfare, such as enhanced virulence, pathogenicity, and transmission. Improved understanding of the ecology of LGT could help us devise strategies to control this fundamental evolutionary process.

  2. Reductive genome evolution at both ends of the bacterial population size spectrum.

    PubMed

    Batut, Bérénice; Knibbe, Carole; Marais, Gabriel; Daubin, Vincent

    2014-12-01

    Bacterial genomes show substantial variations in size. The smallest bacterial genomes are those of endocellular symbionts of eukaryotic hosts, which have undergone massive genome reduction and show patterns that are consistent with the degenerative processes that are predicted to occur in species with small effective population sizes. However, similar genome reduction is found in some free-living marine cyanobacteria that are characterized by extremely large populations. In this Opinion article, we discuss the different hypotheses that have been proposed to account for this reductive genome evolution at both ends of the bacterial population size spectrum.

  3. Comparative genomics of Mortierella elongata and its bacterial endosymbiont Mycoavidus cysteinexigens: Comparative genomics of Mortierella elongata

    DOE PAGES

    Uehling, J.; Gryganskyi, A.; Hameed, K.; ...

    2017-01-01

    Endosymbiosis of bacteria by eukaryotes is a defining feature of cellular evolution. In addition to well-known bacterial origins for mitochondria and chloroplasts, multiple origins of bacterial endosymbiosis are known within the cells of diverse animals, plants and fungi. Early-diverging lineages of terrestrial fungi harbor endosymbiotic bacteria belonging to the Burkholderiaceae. Furthermore, we sequenced the metagenome of the soil-inhabiting fungus Mortierella elongata and assembled the complete circular chromosome of its endosymbiont, Mycoavidus cysteinexigens, which we place within a lineage of endofungal symbionts that are sister clade to Burkholderia. The genome of M. elongata strain AG77 features a core set of primarymore » metabolic pathways for degradation of simple carbohydrates and lipid biosynthesis, while the M. cysteinexigens (AG77) genome is reduced in size and function. Experiments using antibiotics to cure the endobacterium from the host demonstrate that the fungal host metabolism is highly modulated by presence/ absence of M. cysteinexigens. In independent comparative phylogenomic analyses of fungal and bacterial genomes we find that they are consistent with an ancient origin for M. elongata M. cysteinexigens symbiosis, most likely over 350 million years ago and concomitant with the terrestrialization of Earth and diversification of land fungi and plants.« less

  4. [Genetics and genomics for the study of bacterial resistance].

    PubMed

    Garza-Ramos, Ulises; Silva-Sánchez, Jesús; Martínez-Romero, Esperanza

    2009-01-01

    Bacterial resistance is a public health problem causing high rates of morbidity and mortality in hospital settings. To the extent that different antibiotics are used, bacteria resistant to multiple drugs are selected. The development of new molecular genomic and proteomic tools such as real-time PCR, DNA pyrosequencing, mass spectrometry, DNA microarrays, and bioinformatics allow for more in-depth knowledge about the physiology and structure of bacteria and mechanisms involved in antibiotic resistance. These studies identify new targets for drugs and design specific antibiotics to provide more accurate treatments to combat infections caused by bacteria. Using these techniques, it will also be possible to rapidly identify genes that confer resistance to antibiotics, and to identify complex genetic structures, such as integrons that are involved in the spread of genes that confer multidrug-resistance.

  5. Drivers of bacterial β-diversity depend on spatial scale

    PubMed Central

    Martiny, Jennifer B. H.; Eisen, Jonathan A.; Penn, Kevin; Allison, Steven D.; Horner-Devine, M. Claire

    2011-01-01

    The factors driving β-diversity (variation in community composition) yield insights into the maintenance of biodiversity on the planet. Here we tested whether the mechanisms that underlie bacterial β-diversity vary over centimeters to continental spatial scales by comparing the composition of ammonia-oxidizing bacteria communities in salt marsh sediments. As observed in studies of macroorganisms, the drivers of salt marsh bacterial β-diversity depend on spatial scale. In contrast to macroorganism studies, however, we found no evidence of evolutionary diversification of ammonia-oxidizing bacteria taxa at the continental scale, despite an overall relationship between geographic distance and community similarity. Our data are consistent with the idea that dispersal limitation at local scales can contribute to β-diversity, even though the 16S rRNA genes of the relatively common taxa are globally distributed. These results highlight the importance of considering multiple spatial scales for understanding microbial biogeography. PMID:21518859

  6. Sequencing of Bacterial Genomes: Principles and Insights into Pathogenesis and Development of Antibiotics

    PubMed Central

    Donkor, Eric S.

    2013-01-01

    The impact of bacterial diseases on public health has become enormous, and is partly due to the increasing trend of antibiotic resistance displayed by bacterial pathogens. Sequencing of bacterial genomes has significantly improved our understanding about the biology of many bacterial pathogens as well as identification of novel antibiotic targets. Since the advent of genome sequencing two decades ago, about 1,800 bacterial genomes have been fully sequenced and these include important aetiological agents such as Streptococcus pneumoniae, Mycobacterium tuberculosis, Escherichia coli O157:H7, Vibrio cholerae, Clostridium difficile and Staphylococcus aureus. Very recently, there has been an explosion of bacterial genome data and is due to the development of next generation sequencing technologies, which are evolving so rapidly. Indeed, the field of microbial genomics is advancing at a very fast rate and it is difficult for researchers to be abreast with the new developments. This highlights the need for regular updates in microbial genomics through comprehensive reviews. This review paper seeks to provide an update on bacterial genome sequencing generally, and to analyze insights gained from sequencing in two areas, including bacterial pathogenesis and the development of antibiotics. PMID:24705262

  7. Bacterial genomic epidemiology, from local outbreak characterization to species-history reconstruction

    PubMed Central

    Gaiarsa, Stefano; De Marco, Leone; Comandatore, Francesco; Marone, Piero; Bandi, Claudio; Sassera, Davide

    2015-01-01

    Bacteriology has embraced the next-generation sequencing revolution, swiftly moving from the time of single genome sequencing to the age of genomic epidemiology. Hundreds and now even thousands of genomes are being sequenced for single bacterial species, allowing unprecedented levels of resolution and insight in the evolution and epidemic diffusion of the main bacterial pathogens. Here, we present a review of some of the most recent and groundbreaking studies in this field. PMID:26878934

  8. [Genome-wide non-sequencing strategies for bacterial genome comparison: the necessity and an analysis of the variable bacterial world].

    PubMed

    Sverdlov, E D

    2003-01-01

    A tremendous success in bacterial genome sequencing has been achieved during the recent years; it resulted in making available, for analysis, multiple sequences of different bacterial genomes, including such pathogens as causative agents of syphilis, typhus, and tuberculosis as well as such organisms like archaebacterias living under extreme conditions. A comparative analysis of bacterial genomes leads to conclusions, which have a general biological value, and, in particular, to the conclusions about mechanisms and evolution rate as well as about the variability of genomes and interrelation between organisms and their habitat. On the other hand, the analysis reveals specific features of separate bacterial species responsible for their pathogenicity and ability to avoid the destruction of the host immune system as well as for adaptation to exist within a certain ecological niche. However, the variability of bacterial genomes is so high that methods, which enable to evaluate the variability without full genome sequencing, are needed to depict adequately the evolution and ecological characteristics of the prokaryotic world and to develop new effective therapeutics and diagnostic tools. The survey covers two approaches to such comparative analysis, i.e. DNA arrays and subtractive hybridization. The advantages and disadvantages of each approach are discussed and the necessity in a new approach combining the positive features of the two mentioned approaches is substantiated.

  9. Phylogeny Inference of Closely Related Bacterial Genomes: Combining the Features of Both Overlapping Genes and Collinear Genomic Regions

    PubMed Central

    Zhang, Yan-Cong; Lin, Kui

    2015-01-01

    Overlapping genes (OGs) represent one type of widespread genomic feature in bacterial genomes and have been used as rare genomic markers in phylogeny inference of closely related bacterial species. However, the inference may experience a decrease in performance for phylogenomic analysis of too closely or too distantly related genomes. Another drawback of OGs as phylogenetic markers is that they usually take little account of the effects of genomic rearrangement on the similarity estimation, such as intra-chromosome/genome translocations, horizontal gene transfer, and gene losses. To explore such effects on the accuracy of phylogeny reconstruction, we combine phylogenetic signals of OGs with collinear genomic regions, here called locally collinear blocks (LCBs). By putting these together, we refine our previous metric of pairwise similarity between two closely related bacterial genomes. As a case study, we used this new method to reconstruct the phylogenies of 88 Enterobacteriale genomes of the class Gammaproteobacteria. Our results demonstrated that the topological accuracy of the inferred phylogeny was improved when both OGs and LCBs were simultaneously considered, suggesting that combining these two phylogenetic markers may reduce, to some extent, the influence of gene loss on phylogeny inference. Such phylogenomic studies, we believe, will help us to explore a more effective approach to increasing the robustness of phylogeny reconstruction of closely related bacterial organisms. PMID:26715828

  10. Bacterial genome replication at subzero temperatures in permafrost

    PubMed Central

    Tuorto, Steven J; Darias, Phillip; McGuinness, Lora R; Panikov, Nicolai; Zhang, Tingjun; Häggblom, Max M; Kerkhof, Lee J

    2014-01-01

    Microbial metabolic activity occurs at subzero temperatures in permafrost, an environment representing ∼25% of the global soil organic matter. Although much of the observed subzero microbial activity may be due to basal metabolism or macromolecular repair, there is also ample evidence for cellular growth. Unfortunately, most metabolic measurements or culture-based laboratory experiments cannot elucidate the specific microorganisms responsible for metabolic activities in native permafrost, nor, can bulk approaches determine whether different members of the microbial community modulate their responses as a function of changing subzero temperatures. Here, we report on the use of stable isotope probing with 13C-acetate to demonstrate bacterial genome replication in Alaskan permafrost at temperatures of 0 to −20 °C. We found that the majority (80%) of operational taxonomic units detected in permafrost microcosms were active and could synthesize 13C-labeled DNA when supplemented with 13C-acetate at temperatures of 0 to −20 °C during a 6-month incubation. The data indicated that some members of the bacterial community were active across all of the experimental temperatures, whereas many others only synthesized DNA within a narrow subzero temperature range. Phylogenetic analysis of 13C-labeled 16S rRNA genes revealed that the subzero active bacteria were members of the Acidobacteria, Actinobacteria, Chloroflexi, Gemmatimonadetes and Proteobacteria phyla and were distantly related to currently cultivated psychrophiles. These results imply that small subzero temperature changes may lead to changes in the active microbial community, which could have consequences for biogeochemical cycling in permanently frozen systems. PMID:23985750

  11. What Makes a Bacterial Species Pathogenic?:Comparative Genomic Analysis of the Genus Leptospira

    PubMed Central

    Fouts, Derrick E.; Matthias, Michael A.; Adhikarla, Haritha; Adler, Ben; Amorim-Santos, Luciane; Berg, Douglas E.; Bulach, Dieter; Buschiazzo, Alejandro; Chang, Yung-Fu; Galloway, Renee L.; Haake, David A.; Haft, Daniel H.; Hartskeerl, Rudy; Ko, Albert I.; Levett, Paul N.; Matsunaga, James; Mechaly, Ariel E.; Monk, Jonathan M.; Nascimento, Ana L. T.; Nelson, Karen E.; Palsson, Bernhard; Peacock, Sharon J.; Picardeau, Mathieu; Ricaldi, Jessica N.; Thaipandungpanit, Janjira; Wunder, Elsio A.; Yang, X. Frank; Zhang, Jun-Jie; Vinetz, Joseph M.

    2016-01-01

    Leptospirosis, caused by spirochetes of the genus Leptospira, is a globally widespread, neglected and emerging zoonotic disease. While whole genome analysis of individual pathogenic, intermediately pathogenic and saprophytic Leptospira species has been reported, comprehensive cross-species genomic comparison of all known species of infectious and non-infectious Leptospira, with the goal of identifying genes related to pathogenesis and mammalian host adaptation, remains a key gap in the field. Infectious Leptospira, comprised of pathogenic and intermediately pathogenic Leptospira, evolutionarily diverged from non-infectious, saprophytic Leptospira, as demonstrated by the following computational biology analyses: 1) the definitive taxonomy and evolutionary relatedness among all known Leptospira species; 2) genomically-predicted metabolic reconstructions that indicate novel adaptation of infectious Leptospira to mammals, including sialic acid biosynthesis, pathogen-specific porphyrin metabolism and the first-time demonstration of cobalamin (B12) autotrophy as a bacterial virulence factor; 3) CRISPR/Cas systems demonstrated only to be present in pathogenic Leptospira, suggesting a potential mechanism for this clade’s refractoriness to gene targeting; 4) finding Leptospira pathogen-specific specialized protein secretion systems; 5) novel virulence-related genes/gene families such as the Virulence Modifying (VM) (PF07598 paralogs) proteins and pathogen-specific adhesins; 6) discovery of novel, pathogen-specific protein modification and secretion mechanisms including unique lipoprotein signal peptide motifs, Sec-independent twin arginine protein secretion motifs, and the absence of certain canonical signal recognition particle proteins from all Leptospira; and 7) and demonstration of infectious Leptospira-specific signal-responsive gene expression, motility and chemotaxis systems. By identifying large scale changes in infectious (pathogenic and intermediately pathogenic

  12. Scale-Invariant Correlations in Dynamic Bacterial Clusters

    NASA Astrophysics Data System (ADS)

    Chen, Xiao; Dong, Xu; Be'er, Avraham; Swinney, Harry L.; Zhang, H. P.

    2012-04-01

    In Bacillus subtilis colonies, motile bacteria move collectively, spontaneously forming dynamic clusters. These bacterial clusters share similarities with other systems exhibiting polarized collective motion, such as bird flocks or fish schools. Here we study experimentally how velocity and orientation fluctuations within clusters are spatially correlated. For a range of cell density and cluster size, the correlation length is shown to be 30% of the spatial size of clusters, and the correlation functions collapse onto a master curve after rescaling the separation with correlation length. Our results demonstrate that correlations of velocity and orientation fluctuations are scale invariant in dynamic bacterial clusters.

  13. Identification of protein secretion systems in bacterial genomes.

    PubMed

    Abby, Sophie S; Cury, Jean; Guglielmini, Julien; Néron, Bertrand; Touchon, Marie; Rocha, Eduardo P C

    2016-03-16

    Bacteria with two cell membranes (diderms) have evolved complex systems for protein secretion. These systems were extensively studied in some model bacteria, but the characterisation of their diversity has lagged behind due to lack of standard annotation tools. We built online and standalone computational tools to accurately predict protein secretion systems and related appendages in bacteria with LPS-containing outer membranes. They consist of models describing the systems' components and genetic organization to be used with MacSyFinder to search for T1SS-T6SS, T9SS, flagella, Type IV pili and Tad pili. We identified ~10,000 candidate systems in bacterial genomes, where T1SS and T5SS were by far the most abundant and widespread. All these data are made available in a public database. The recently described T6SS(iii) and T9SS were restricted to Bacteroidetes, and T6SS(ii) to Francisella. The T2SS, T3SS, and T4SS were frequently encoded in single-copy in one locus, whereas most T1SS were encoded in two loci. The secretion systems of diderm Firmicutes were similar to those found in other diderms. Novel systems may remain to be discovered, since some clades of environmental bacteria lacked all known protein secretion systems. Our models can be fully customized, which should facilitate the identification of novel systems.

  14. Metabolomic Functional Analysis of Bacterial Genomes: Final Report

    SciTech Connect

    Arp, Daniel J; Sayavedra-Soto, Luis A

    2008-01-01

    The availability of the complete DNA sequence of the bacterial genome of Nitrosomonas europaea offered the opportunity for unprecedented and detailed investigations of function. We studied the function of genes involved in carbohydrate and Fe metabolism. N. europaea has genes for the synthesis and degradation of glycogen and sucrose but cannot grow on substrates other than ammonia and CO2. Granules of glycogen were detected in whole cells by electron microscopy and quantified in cell-free extracts by enzymatic methods. The cellular glycogen and sucrose content varied depending on the composition of the growth medium and cellular growth stage. N. europaea also depends heavily on iron for metabolism of ammonia, is particularly interesting since it lacks genes for siderophore production, and has genes with only low similarity to known iron reductases, yet grows relatively well in medium containing low Fe. By comparing the transcriptomes of cells grown in iron-replete medium versus iron-limited medium, 247 genes were identified as differentially expressed. Mutant strains deficient in genes for sucrose, glycogen and iron metabolism were created and are being used to further our understanding of ammonia oxidizing bacteria.

  15. Mechanical Genomics Identifies Diverse Modulators of Bacterial Cell Stiffness.

    PubMed

    Auer, George K; Lee, Timothy K; Rajendram, Manohary; Cesar, Spencer; Miguel, Amanda; Huang, Kerwyn Casey; Weibel, Douglas B

    2016-06-22

    Bacteria must maintain mechanical integrity to withstand the large osmotic pressure differential across the cell membrane and wall. Although maintaining mechanical integrity is critical for proper cellular function, a fact exploited by prominent cell-wall-targeting antibiotics, the proteins that contribute to cellular mechanics remain unidentified. Here, we describe a high-throughput optical method for quantifying cell stiffness and apply this technique to a genome-wide collection of ∼4,000 Escherichia coli mutants. We identify genes with roles in diverse functional processes spanning cell-wall synthesis, energy production, and DNA replication and repair that significantly change cell stiffness when deleted. We observe that proteins with biochemically redundant roles in cell-wall synthesis exhibit different stiffness defects when deleted. Correlating our data with chemical screens reveals that reducing membrane potential generally increases cell stiffness. In total, our work demonstrates that bacterial cell stiffness is a property of both the cell wall and broader cell physiology and lays the groundwork for future systematic studies of mechanoregulation.

  16. Simplified DGS procedure for large-scale genome structural study.

    PubMed

    Jung, Yong-Chul; Xu, Jia; Chen, Jun; Kim, Yeong; Winchester, David; Wang, San Ming

    2009-11-01

    Ditag genome scanning (DGS) uses next-generation DNA sequencing to sequence the ends of ditag fragments produced by restriction enzymes. These sequences are compared to known genome sequences to determine their structure. In order to use DGS for large-scale genome structural studies, we have substantially revised the original protocol by replacing the in vivo genomic DNA cloning with in vitro adaptor ligation, eliminating the ditag concatemerization steps, and replacing the 454 sequencer with Solexa or SOLiD sequencers for ditag sequence collection. This revised protocol further increases genome coverage and resolution and allows DGS to be used to analyze multiple genomes simultaneously.

  17. Bacterial Genomics Reveal the Complex Epidemiology of an Emerging Pathogen in Arctic and Boreal Ungulates.

    PubMed

    Forde, Taya L; Orsel, Karin; Zadoks, Ruth N; Biek, Roman; Adams, Layne G; Checkley, Sylvia L; Davison, Tracy; De Buck, Jeroen; Dumond, Mathieu; Elkin, Brett T; Finnegan, Laura; Macbeth, Bryan J; Nelson, Cait; Niptanatiak, Amanda; Sather, Shane; Schwantje, Helen M; van der Meer, Frank; Kutz, Susan J

    2016-01-01

    Northern ecosystems are currently experiencing unprecedented ecological change, largely driven by a rapidly changing climate. Pathogen range expansion, and emergence and altered patterns of infectious disease, are increasingly reported in wildlife at high latitudes. Understanding the causes and consequences of shifting pathogen diversity and host-pathogen interactions in these ecosystems is important for wildlife conservation, and for indigenous populations that depend on wildlife. Among the key questions are whether disease events are associated with endemic or recently introduced pathogens, and whether emerging strains are spreading throughout the region. In this study, we used a phylogenomic approach to address these questions of pathogen endemicity and spread for Erysipelothrix rhusiopathiae, an opportunistic multi-host bacterial pathogen associated with recent mortalities in arctic and boreal ungulate populations in North America. We isolated E. rhusiopathiae from carcasses associated with large-scale die-offs of muskoxen in the Canadian Arctic Archipelago, and from contemporaneous mortality events and/or population declines among muskoxen in northwestern Alaska and caribou and moose in western Canada. Bacterial genomic diversity differed markedly among these locations; minimal divergence was present among isolates from muskoxen in the Canadian Arctic, while in caribou and moose populations, strains from highly divergent clades were isolated from the same location, or even from within a single carcass. These results indicate that mortalities among northern ungulates are not associated with a single emerging strain of E. rhusiopathiae, and that alternate hypotheses need to be explored. Our study illustrates the value and limitations of bacterial genomic data for discriminating between ecological hypotheses of disease emergence, and highlights the importance of studying emerging pathogens within the broader context of environmental and host factors.

  18. Bacterial genomics reveal the complex epidemiology of an emerging pathogen in arctic and boreal ungulates

    USGS Publications Warehouse

    Forde, Taya L.; Orsel, Karin; Zadoks, Ruth N.; Biek, Roman; Adams, Layne G.; Checkley, Sylvia L.; Davison, Tracy; De Buck, Jeroen; Dumond, Mathieu; Elkin, Brett T.; Finnegan, Laura; Macbeth, Bryan J.; Nelson, Cait; Niptanatiak, Amanda; Sather, Shane; Schwantje, Helen M.; van der Meer, Frank; Kutz, Susan J.

    2016-01-01

    Northern ecosystems are currently experiencing unprecedented ecological change, largely driven by a rapidly changing climate. Pathogen range expansion, and emergence and altered patterns of infectious disease, are increasingly reported in wildlife at high latitudes. Understanding the causes and consequences of shifting pathogen diversity and host-pathogen interactions in these ecosystems is important for wildlife conservation, and for indigenous populations that depend on wildlife. Among the key questions are whether disease events are associated with endemic or recently introduced pathogens, and whether emerging strains are spreading throughout the region. In this study, we used a phylogenomic approach to address these questions of pathogen endemicity and spread for Erysipelothrix rhusiopathiae, an opportunistic multi-host bacterial pathogen associated with recent mortalities in arctic and boreal ungulate populations in North America. We isolated E. rhusiopathiae from carcasses associated with large-scale die-offs of muskoxen in the Canadian Arctic Archipelago, and from contemporaneous mortality events and/or population declines among muskoxen in northwestern Alaska and caribou and moose in western Canada. Bacterial genomic diversity differed markedly among these locations; minimal divergence was present among isolates from muskoxen in the Canadian Arctic, while in caribou and moose populations, strains from highly divergent clades were isolated from the same location, or even from within a single carcass. These results indicate that mortalities among northern ungulates are not associated with a single emerging strain of E. rhusiopathiae, and that alternate hypotheses need to be explored. Our study illustrates the value and limitations of bacterial genomic data for discriminating between ecological hypotheses of disease emergence, and highlights the importance of studying emerging pathogens within the broader context of environmental and host factors.

  19. Bacterial Genomics Reveal the Complex Epidemiology of an Emerging Pathogen in Arctic and Boreal Ungulates

    PubMed Central

    Forde, Taya L.; Orsel, Karin; Zadoks, Ruth N.; Biek, Roman; Adams, Layne G.; Checkley, Sylvia L.; Davison, Tracy; De Buck, Jeroen; Dumond, Mathieu; Elkin, Brett T.; Finnegan, Laura; Macbeth, Bryan J.; Nelson, Cait; Niptanatiak, Amanda; Sather, Shane; Schwantje, Helen M.; van der Meer, Frank; Kutz, Susan J.

    2016-01-01

    Northern ecosystems are currently experiencing unprecedented ecological change, largely driven by a rapidly changing climate. Pathogen range expansion, and emergence and altered patterns of infectious disease, are increasingly reported in wildlife at high latitudes. Understanding the causes and consequences of shifting pathogen diversity and host-pathogen interactions in these ecosystems is important for wildlife conservation, and for indigenous populations that depend on wildlife. Among the key questions are whether disease events are associated with endemic or recently introduced pathogens, and whether emerging strains are spreading throughout the region. In this study, we used a phylogenomic approach to address these questions of pathogen endemicity and spread for Erysipelothrix rhusiopathiae, an opportunistic multi-host bacterial pathogen associated with recent mortalities in arctic and boreal ungulate populations in North America. We isolated E. rhusiopathiae from carcasses associated with large-scale die-offs of muskoxen in the Canadian Arctic Archipelago, and from contemporaneous mortality events and/or population declines among muskoxen in northwestern Alaska and caribou and moose in western Canada. Bacterial genomic diversity differed markedly among these locations; minimal divergence was present among isolates from muskoxen in the Canadian Arctic, while in caribou and moose populations, strains from highly divergent clades were isolated from the same location, or even from within a single carcass. These results indicate that mortalities among northern ungulates are not associated with a single emerging strain of E. rhusiopathiae, and that alternate hypotheses need to be explored. Our study illustrates the value and limitations of bacterial genomic data for discriminating between ecological hypotheses of disease emergence, and highlights the importance of studying emerging pathogens within the broader context of environmental and host factors. PMID

  20. A profile-based method for identifying functional divergence of orthologous genes in bacterial genomes

    PubMed Central

    Wheeler, Nicole E.; Barquist, Lars; Kingsley, Robert A.; Gardner, Paul P.

    2016-01-01

    Motivation: Next generation sequencing technologies have provided us with a wealth of information on genetic variation, but predicting the functional significance of this variation is a difficult task. While many comparative genomics studies have focused on gene flux and large scale changes, relatively little attention has been paid to quantifying the effects of single nucleotide polymorphisms and indels on protein function, particularly in bacterial genomics. Results: We present a hidden Markov model based approach we call delta-bitscore (DBS) for identifying orthologous proteins that have diverged at the amino acid sequence level in a way that is likely to impact biological function. We benchmark this approach with several widely used datasets and apply it to a proof-of-concept study of orthologous proteomes in an investigation of host adaptation in Salmonella enterica. We highlight the value of the method in identifying functional divergence of genes, and suggest that this tool may be a better approach than the commonly used dN/dS metric for identifying functionally significant genetic changes occurring in recently diverged organisms. Availability and Implementation: A program implementing DBS for pairwise genome comparisons is freely available at: https://github.com/UCanCompBio/deltaBS. Contact: nicole.wheeler@pg.canterbury.ac.nz or lars.barquist@uni-wuerzburg.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27503221

  1. A Bacterial Analysis Platform: An Integrated System for Analysing Bacterial Whole Genome Sequencing Data for Clinical Diagnostics and Surveillance

    PubMed Central

    Ahrenfeldt, Johanne; Cisneros, Jose Luis Bellod; Jurtz, Vanessa; Larsen, Mette Voldby; Hasman, Henrik; Aarestrup, Frank Møller; Lund, Ole

    2016-01-01

    Recent advances in whole genome sequencing have made the technology available for routine use in microbiological laboratories. However, a major obstacle for using this technology is the availability of simple and automatic bioinformatics tools. Based on previously published and already available web-based tools we developed a single pipeline for batch uploading of whole genome sequencing data from multiple bacterial isolates. The pipeline will automatically identify the bacterial species and, if applicable, assemble the genome, identify the multilocus sequence type, plasmids, virulence genes and antimicrobial resistance genes. A short printable report for each sample will be provided and an Excel spreadsheet containing all the metadata and a summary of the results for all submitted samples can be downloaded. The pipeline was benchmarked using datasets previously used to test the individual services. The reported results enable a rapid overview of the major results, and comparing that to the previously found results showed that the platform is reliable and able to correctly predict the species and find most of the expected genes automatically. In conclusion, a combined bioinformatics platform was developed and made publicly available, providing easy-to-use automated analysis of bacterial whole genome sequencing data. The platform may be of immediate relevance as a guide for investigators using whole genome sequencing for clinical diagnostics and surveillance. The platform is freely available at: https://cge.cbs.dtu.dk/services/CGEpipeline-1.1 and it is the intention that it will continue to be expanded with new features as these become available. PMID:27327771

  2. Modeling cancer metabolism on a genome scale

    PubMed Central

    Yizhak, Keren; Chaneton, Barbara; Gottlieb, Eyal; Ruppin, Eytan

    2015-01-01

    Cancer cells have fundamentally altered cellular metabolism that is associated with their tumorigenicity and malignancy. In addition to the widely studied Warburg effect, several new key metabolic alterations in cancer have been established over the last decade, leading to the recognition that altered tumor metabolism is one of the hallmarks of cancer. Deciphering the full scope and functional implications of the dysregulated metabolism in cancer requires both the advancement of a variety of omics measurements and the advancement of computational approaches for the analysis and contextualization of the accumulated data. Encouragingly, while the metabolic network is highly interconnected and complex, it is at the same time probably the best characterized cellular network. Following, this review discusses the challenges that genome-scale modeling of cancer metabolism has been facing. We survey several recent studies demonstrating the first strides that have been done, testifying to the value of this approach in portraying a network-level view of the cancer metabolism and in identifying novel drug targets and biomarkers. Finally, we outline a few new steps that may further advance this field. PMID:26130389

  3. Modeling cancer metabolism on a genome scale.

    PubMed

    Yizhak, Keren; Chaneton, Barbara; Gottlieb, Eyal; Ruppin, Eytan

    2015-06-30

    Cancer cells have fundamentally altered cellular metabolism that is associated with their tumorigenicity and malignancy. In addition to the widely studied Warburg effect, several new key metabolic alterations in cancer have been established over the last decade, leading to the recognition that altered tumor metabolism is one of the hallmarks of cancer. Deciphering the full scope and functional implications of the dysregulated metabolism in cancer requires both the advancement of a variety of omics measurements and the advancement of computational approaches for the analysis and contextualization of the accumulated data. Encouragingly, while the metabolic network is highly interconnected and complex, it is at the same time probably the best characterized cellular network. Following, this review discusses the challenges that genome-scale modeling of cancer metabolism has been facing. We survey several recent studies demonstrating the first strides that have been done, testifying to the value of this approach in portraying a network-level view of the cancer metabolism and in identifying novel drug targets and biomarkers. Finally, we outline a few new steps that may further advance this field.

  4. BactoGeNIE: A large-scale comparative genome visualization for big displays

    DOE PAGES

    Aurisano, Jillian; Reda, Khairi; Johnson, Andrew; ...

    2015-08-13

    The volume of complete bacterial genome sequence data available to comparative genomics researchers is rapidly increasing. However, visualizations in comparative genomics--which aim to enable analysis tasks across collections of genomes--suffer from visual scalability issues. While large, multi-tiled and high-resolution displays have the potential to address scalability issues, new approaches are needed to take advantage of such environments, in order to enable the effective visual analysis of large genomics datasets. In this paper, we present Bacterial Gene Neighborhood Investigation Environment, or BactoGeNIE, a novel and visually scalable design for comparative gene neighborhood analysis on large display environments. We evaluate BactoGeNIE throughmore » a case study on close to 700 draft Escherichia coli genomes, and present lessons learned from our design process. In conclusion, BactoGeNIE accommodates comparative tasks over substantially larger collections of neighborhoods than existing tools and explicitly addresses visual scalability. Given current trends in data generation, scalable designs of this type may inform visualization design for large-scale comparative research problems in genomics.« less

  5. BactoGeNIE: A large-scale comparative genome visualization for big displays

    SciTech Connect

    Aurisano, Jillian; Reda, Khairi; Johnson, Andrew; Marai, Elisabeta G.; Leigh, Jason

    2015-08-13

    The volume of complete bacterial genome sequence data available to comparative genomics researchers is rapidly increasing. However, visualizations in comparative genomics--which aim to enable analysis tasks across collections of genomes--suffer from visual scalability issues. While large, multi-tiled and high-resolution displays have the potential to address scalability issues, new approaches are needed to take advantage of such environments, in order to enable the effective visual analysis of large genomics datasets. In this paper, we present Bacterial Gene Neighborhood Investigation Environment, or BactoGeNIE, a novel and visually scalable design for comparative gene neighborhood analysis on large display environments. We evaluate BactoGeNIE through a case study on close to 700 draft Escherichia coli genomes, and present lessons learned from our design process. In conclusion, BactoGeNIE accommodates comparative tasks over substantially larger collections of neighborhoods than existing tools and explicitly addresses visual scalability. Given current trends in data generation, scalable designs of this type may inform visualization design for large-scale comparative research problems in genomics.

  6. Facile, High Quality Sequencing of Bacterial Genomes from Small Amounts of DNA

    PubMed Central

    Vuyisich, Momchilo; Arefin, Ayesha; Davenport, Karen; Feng, Shihai; Gleasner, Cheryl; McMurry, Kim; Parson-Quintana, Beverly; Price, Jennifer; Scholz, Matthew; Chain, Patrick

    2014-01-01

    Sequencing bacterial genomes has traditionally required large amounts of genomic DNA (~1 μg). There have been few studies to determine the effects of the input DNA amount or library preparation method on the quality of sequencing data. Several new commercially available library preparation methods enable shotgun sequencing from as little as 1 ng of input DNA. In this study, we evaluated the NEBNext Ultra library preparation reagents for sequencing bacterial genomes. We have evaluated the utility of NEBNext Ultra for resequencing and de novo assembly of four bacterial genomes and compared its performance with the TruSeq library preparation kit. The NEBNext Ultra reagents enable high quality resequencing and de novo assembly of a variety of bacterial genomes when using 100 ng of input genomic DNA. For the two most challenging genomes (Burkholderia spp.), which have the highest GC content and are the longest, we also show that the quality of both resequencing and de novo assembly is not decreased when only 10 ng of input genomic DNA is used. PMID:25478564

  7. Bacterial DNA Sifted from the Trichoplax adhaerens (Animalia: Placozoa) Genome Project Reveals a Putative Rickettsial Endosymbiont

    PubMed Central

    Driscoll, Timothy; Gillespie, Joseph J.; Nordberg, Eric K.; Azad, Abdu F.; Sobral, Bruno W.

    2013-01-01

    Eukaryotic genome sequencing projects often yield bacterial DNA sequences, data typically considered as microbial contamination. However, these sequences may also indicate either symbiont genes or lateral gene transfer (LGT) to host genomes. These bacterial sequences can provide clues about eukaryote–microbe interactions. Here, we used the genome of the primitive animal Trichoplax adhaerens (Metazoa: Placozoa), which is known to harbor an uncharacterized Gram-negative endosymbiont, to search for the presence of bacterial DNA sequences. Bioinformatic and phylogenomic analyses of extracted data from the genome assembly (181 bacterial coding sequences [CDS]) and trace read archive (16S rDNA) revealed a dominant proteobacterial profile strongly skewed to Rickettsiales (Alphaproteobacteria) genomes. By way of phylogenetic analysis of 16S rDNA and 113 proteins conserved across proteobacterial genomes, as well as identification of 27 rickettsial signature genes, we propose a Rickettsiales endosymbiont of T. adhaerens (RETA). The majority (93%) of the identified bacterial CDS belongs to small scaffolds containing prokaryotic-like genes; however, 12 CDS were identified on large scaffolds comprised of eukaryotic-like genes, suggesting that T. adhaerens might have recently acquired bacterial genes. These putative LGTs may coincide with the placozoan’s aquatic niche and symbiosis with RETA. This work underscores the rich, and relatively untapped, resource of eukaryotic genome projects for harboring data pertinent to host–microbial interactions. The nature of unknown (or poorly characterized) bacterial species may only emerge via analysis of host genome sequencing projects, particularly if these species are resistant to cell culturing, as are many obligate intracellular microbes. Our work provides methodological insight for such an approach. PMID:23475938

  8. Ensembl Genomes 2013: scaling up access to genome-wide data

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species. The project exploits and extends technologies for genome annotation, analysis and dissemination, developed in the context of the vertebrate-focused Ensembl project, and provi...

  9. Construction and analysis of Siberian tiger bacterial artificial chromosome library with approximately 6.5-fold genome equivalent coverage.

    PubMed

    Liu, Changqing; Bai, Chunyu; Guo, Yu; Liu, Dan; Lu, Taofeng; Li, Xiangchen; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

    2014-03-07

    Bacterial artificial chromosome (BAC) libraries are extremely valuable for the genome-wide genetic dissection of complex organisms. The Siberian tiger, one of the most well-known wild primitive carnivores in China, is an endangered animal. In order to promote research on its genome, a high-redundancy BAC library of the Siberian tiger was constructed and characterized. The library is divided into two sub-libraries prepared from blood cells and two sub-libraries prepared from fibroblasts. This BAC library contains 153,600 individually archived clones; for PCR-based screening of the library, BACs were placed into 40 superpools of 10 × 384-deep well microplates. The average insert size of BAC clones was estimated to be 116.5 kb, representing approximately 6.46 genome equivalents of the haploid genome and affording a 98.86% statistical probability of obtaining at least one clone containing a unique DNA sequence. Screening the library with 19 microsatellite markers and a SRY sequence revealed that each of these markers were present in the library; the average number of positive clones per marker was 6.74 (range 2 to 12), consistent with 6.46 coverage of the tiger genome. Additionally, we identified 72 microsatellite markers that could potentially be used as genetic markers. This BAC library will serve as a valuable resource for physical mapping, comparative genomic study and large-scale genome sequencing in the tiger.

  10. [Homologous recombination among bacterial genomes: the measurement and identification].

    PubMed

    Xianwei, Yang; Ruifu, Yang; Yujun, Cui

    2016-02-01

    Homologous recombination is one of important sources in shaping the bacterial population diversity, which disrupts the clonal relationship among different lineages through horizontal transferring of DNA-segments. As consequence of blurring the vertical inheritance signals, the homologous recombination raises difficulties in phylogenetic analysis and reconstruction of population structure. Here we discuss the impacts of homologous recombination in inferring phylogenetic relationship among bacterial isolates, and summarize the tools and models separately used in recombination measurement and identification. We also highlight the merits and drawbacks of various approaches, aiming to assist in the practical application for the analysis of homologous recombination in bacterial evolution research.

  11. Segmentation of genomic DNA through entropic divergence: Power laws and scaling

    NASA Astrophysics Data System (ADS)

    Azad, Rajeev K.; Bernaola-Galván, Pedro; Ramaswamy, Ramakrishna; Rao, J. Subba

    2002-05-01

    Genomic DNA is fragmented into segments using the Jensen-Shannon divergence. Use of this criterion results in the fragments being entropically homogeneous to within a predefined level of statistical significance. Application of this procedure is made to complete genomes of organisms from archaebacteria, eubacteria, and eukaryotes. The distribution of fragment lengths in bacterial and primitive eukaryotic DNAs shows two distinct regimes of power-law scaling. The characteristic length separating these two regimes appears to be an intrinsic property of the sequence rather than a finite-size artifact, and is independent of the significance level used in segmenting a given genome. Fragment length distributions obtained in the segmentation of the genomes of more highly evolved eukaryotes do not have such distinct regimes of power-law behavior.

  12. Construction and characterization of bacterial artificial chromosomes (BACs) containing herpes simplex virus full-length genomes.

    PubMed

    Nagel, Claus-Henning; Pohlmann, Anja; Sodeik, Beate

    2014-01-01

    Bacterial artificial chromosomes (BACs) are suitable vectors not only to maintain the large genomes of herpesviruses in Escherichia coli but also to enable the traceless introduction of any mutation using modern tools of bacterial genetics. To clone a herpes simplex virus genome, a BAC replication origin is first introduced into the viral genome by homologous recombination in eukaryotic host cells. As part of their nuclear replication cycle, genomes of herpesviruses circularize and these replication intermediates are then used to transform bacteria. After cloning, the integrity of the recombinant viral genomes is confirmed by restriction length polymorphism analysis and sequencing. The BACs may then be used to design virus mutants. Upon transfection into eukaryotic cells new herpesvirus strains harboring the desired mutations can be recovered and used for experiments in cultured cells as well as in animal infection models.

  13. Phages and the Evolution of Bacterial Pathogens: from Genomic Rearrangements to Lysogenic Conversion

    PubMed Central

    Brüssow, Harald; Canchaya, Carlos; Hardt, Wolf-Dietrich

    2004-01-01

    Comparative genomics demonstrated that the chromosomes from bacteria and their viruses (bacteriophages) are coevolving. This process is most evident for bacterial pathogens where the majority contain prophages or phage remnants integrated into the bacterial DNA. Many prophages from bacterial pathogens encode virulence factors. Two situations can be distinguished: Vibrio cholerae, Shiga toxin-producing Escherichia coli, Corynebacterium diphtheriae, and Clostridium botulinum depend on a specific prophage-encoded toxin for causing a specific disease, whereas Staphylococcus aureus, Streptococcus pyogenes, and Salmonella enterica serovar Typhimurium harbor a multitude of prophages and each phage-encoded virulence or fitness factor makes an incremental contribution to the fitness of the lysogen. These prophages behave like “swarms” of related prophages. Prophage diversification seems to be fueled by the frequent transfer of phage material by recombination with superinfecting phages, resident prophages, or occasional acquisition of other mobile DNA elements or bacterial chromosomal genes. Prophages also contribute to the diversification of the bacterial genome architecture. In many cases, they actually represent a large fraction of the strain-specific DNA sequences. In addition, they can serve as anchoring points for genome inversions. The current review presents the available genomics and biological data on prophages from bacterial pathogens in an evolutionary framework. PMID:15353570

  14. A highly precise and portable genome engineering method allows comparison of mutational effects across bacterial species

    PubMed Central

    Nyerges, Ákos; Csörgő, Bálint; Nagy, István; Bálint, Balázs; Bihari, Péter; Lázár, Viktória; Apjok, Gábor; Umenhoffer, Kinga; Bogos, Balázs; Pósfai, György; Pál, Csaba

    2016-01-01

    Currently available tools for multiplex bacterial genome engineering are optimized for a few laboratory model strains, demand extensive prior modification of the host strain, and lead to the accumulation of numerous off-target modifications. Building on prior development of multiplex automated genome engineering (MAGE), our work addresses these problems in a single framework. Using a dominant-negative mutant protein of the methyl-directed mismatch repair (MMR) system, we achieved a transient suppression of DNA repair in Escherichia coli, which is necessary for efficient oligonucleotide integration. By integrating all necessary components into a broad-host vector, we developed a new workflow we term pORTMAGE. It allows efficient modification of multiple loci, without any observable off-target mutagenesis and prior modification of the host genome. Because of the conserved nature of the bacterial MMR system, pORTMAGE simultaneously allows genome editing and mutant library generation in other biotechnologically and clinically relevant bacterial species. Finally, we applied pORTMAGE to study a set of antibiotic resistance-conferring mutations in Salmonella enterica and E. coli. Despite over 100 million y of divergence between the two species, mutational effects remained generally conserved. In sum, a single transformation of a pORTMAGE plasmid allows bacterial species of interest to become an efficient host for genome engineering. These advances pave the way toward biotechnological and therapeutic applications. Finally, pORTMAGE allows systematic comparison of mutational effects and epistasis across a wide range of bacterial species. PMID:26884157

  15. A highly precise and portable genome engineering method allows comparison of mutational effects across bacterial species.

    PubMed

    Nyerges, Ákos; Csörgő, Bálint; Nagy, István; Bálint, Balázs; Bihari, Péter; Lázár, Viktória; Apjok, Gábor; Umenhoffer, Kinga; Bogos, Balázs; Pósfai, György; Pál, Csaba

    2016-03-01

    Currently available tools for multiplex bacterial genome engineering are optimized for a few laboratory model strains, demand extensive prior modification of the host strain, and lead to the accumulation of numerous off-target modifications. Building on prior development of multiplex automated genome engineering (MAGE), our work addresses these problems in a single framework. Using a dominant-negative mutant protein of the methyl-directed mismatch repair (MMR) system, we achieved a transient suppression of DNA repair in Escherichia coli, which is necessary for efficient oligonucleotide integration. By integrating all necessary components into a broad-host vector, we developed a new workflow we term pORTMAGE. It allows efficient modification of multiple loci, without any observable off-target mutagenesis and prior modification of the host genome. Because of the conserved nature of the bacterial MMR system, pORTMAGE simultaneously allows genome editing and mutant library generation in other biotechnologically and clinically relevant bacterial species. Finally, we applied pORTMAGE to study a set of antibiotic resistance-conferring mutations in Salmonella enterica and E. coli. Despite over 100 million y of divergence between the two species, mutational effects remained generally conserved. In sum, a single transformation of a pORTMAGE plasmid allows bacterial species of interest to become an efficient host for genome engineering. These advances pave the way toward biotechnological and therapeutic applications. Finally, pORTMAGE allows systematic comparison of mutational effects and epistasis across a wide range of bacterial species.

  16. Functional Genome Mining for Metabolites Encoded by Large Gene Clusters through Heterologous Expression of a Whole-Genome Bacterial Artificial Chromosome Library in Streptomyces spp.

    PubMed Central

    Xu, Min; Wang, Yemin; Zhao, Zhilong; Gao, Guixi; Huang, Sheng-Xiong; Kang, Qianjin; He, Xinyi; Lin, Shuangjun; Pang, Xiuhua; Deng, Zixin

    2016-01-01

    ABSTRACT Genome sequencing projects in the last decade revealed numerous cryptic biosynthetic pathways for unknown secondary metabolites in microbes, revitalizing drug discovery from microbial metabolites by approaches called genome mining. In this work, we developed a heterologous expression and functional screening approach for genome mining from genomic bacterial artificial chromosome (BAC) libraries in Streptomyces spp. We demonstrate mining from a strain of Streptomyces rochei, which is known to produce streptothricins and borrelidin, by expressing its BAC library in the surrogate host Streptomyces lividans SBT5, and screening for antimicrobial activity. In addition to the successful capture of the streptothricin and borrelidin biosynthetic gene clusters, we discovered two novel linear lipopeptides and their corresponding biosynthetic gene cluster, as well as a novel cryptic gene cluster for an unknown antibiotic from S. rochei. This high-throughput functional genome mining approach can be easily applied to other streptomycetes, and it is very suitable for the large-scale screening of genomic BAC libraries for bioactive natural products and the corresponding biosynthetic pathways. IMPORTANCE Microbial genomes encode numerous cryptic biosynthetic gene clusters for unknown small metabolites with potential biological activities. Several genome mining approaches have been developed to activate and bring these cryptic metabolites to biological tests for future drug discovery. Previous sequence-guided procedures relied on bioinformatic analysis to predict potentially interesting biosynthetic gene clusters. In this study, we describe an efficient approach based on heterologous expression and functional screening of a whole-genome library for the mining of bioactive metabolites from Streptomyces. The usefulness of this function-driven approach was demonstrated by the capture of four large biosynthetic gene clusters for metabolites of various chemical types, including

  17. Quantitative assessment of insertion sequence impact on bacterial genome architecture

    PubMed Central

    Bishop, Brian; Wright, Meredith S.

    2016-01-01

    Insertion sequence (IS) elements are important mediators of genome plasticity and can lead to phenotypic changes with evolutionary significance. In multidrug-resistant Acinetobacter baumannii and Klebsiella pneumoniae, IS elements have contributed significantly to the mobilization of genes that encode resistance to antimicrobial drugs. A systematic analysis of IS elements is needed for a more comprehensive understanding of their evolutionary impact. We developed a computational approach (ISseeker) to annotate IS elements in draft genome assemblies and applied the method to analysis of IS elements in all publicly available A. baumannii(>1000) and K. pneumoniae(>800) genome sequences, in a phylogenetic context. Most IS elements in A. baumanniigenomes are species-specific ISAba elements, whereas K. pneumoniaegenomes contain significant numbers of both ISKpn elements and elements that are found throughout the Enterobacteriaceae. A. baumanniigenomes have a higher density of IS elements than K. pneumoniae, averaging ~33 vs ~27 copies per genome. In K. pneumoniae, several insertion sites are shared by most genomes in the ST258 clade, whereas in A. baumannii, different IS elements are abundant in different phylogenetic groups, even among closely related Global Clone 2 strains. IS elements differ in the distribution of insertion locations relative to genes, with some more likely to disrupt genes and others predominantly in intergenic regions. Several genes and intergenic regions had multiple independent insertion events, suggesting that those events may confer a selective advantage. Genome- and taxon-wide characterization of insertion locations revealed that IS elements have been active contributors to genome diversity in both species.

  18. Reductive Evolution of Bacterial Genome in Insect Gut Environment

    PubMed Central

    Nikoh, Naruo; Hosokawa, Takahiro; Oshima, Kenshiro; Hattori, Masahira; Fukatsu, Takema

    2011-01-01

    Obligate endocellular symbiotic bacteria of insects and other organisms generally exhibit drastic genome reduction. Recently, it was shown that symbiotic gut bacteria of some stinkbugs also have remarkably reduced genomes. Here, we report the complete genome sequence of such a gut bacterium Ishikawaella capsulata of the plataspid stinkbug Megacopta punctatissima. Gene repertoire and evolutionary patterns, including AT richness and elevated evolutionary rate, of the 745,590 bp genome were strikingly similar to those of obligate γ-proteobacterial endocellular insect symbionts like Buchnera in aphids and Wigglesworthia in tsetse flies. Ishikawaella was suggested to supply essential amino acids for the plant-sucking stinkbug as Buchnera does for the host aphid. Although Buchnera is phylogenetically closer to Wigglesworthia than to Ishikawaella, in terms of gene repertoire Buchnera was similar to Ishikawaella rather than to Wigglesworthia, providing a possible case of genome-level convergence of gene content. Meanwhile, several notable differences were identified between the genomes of Ishikawaella and Buchnera, including retention of TCA cycle genes and lack of flagellum-related genes in Ishikawaella, which may reflect their adaptation to distinct symbiotic habitats. Unexpectedly, Ishikawaella retained fewer genes related to cell wall synthesis and lipid metabolism than many endocellular insect symbionts. The plasmid of Ishikawaella encoded genes for arginine metabolism and oxalate detoxification, suggesting the possibility of additional Ishikawaella roles similar to those of human gut bacteria. Our data highlight strikingly similar evolutionary patterns that are shared between the extracellular and endocellular insect symbiont genomes. PMID:21737395

  19. A reference pan-genome approach to comparative bacterial genomics: identification of novel epidemiological markers in pathogenic Campylobacter.

    PubMed

    Méric, Guillaume; Yahara, Koji; Mageiros, Leonardos; Pascoe, Ben; Maiden, Martin C J; Jolley, Keith A; Sheppard, Samuel K

    2014-01-01

    The increasing availability of hundreds of whole bacterial genomes provides opportunities for enhanced understanding of the genes and alleles responsible for clinically important phenotypes and how they evolved. However, it is a significant challenge to develop easy-to-use and scalable methods for characterizing these large and complex data and relating it to disease epidemiology. Existing approaches typically focus on either homologous sequence variation in genes that are shared by all isolates, or non-homologous sequence variation--focusing on genes that are differentially present in the population. Here we present a comparative genomics approach that simultaneously approximates core and accessory genome variation in pathogen populations and apply it to pathogenic species in the genus Campylobacter. A total of 7 published Campylobacter jejuni and Campylobacter coli genomes were selected to represent diversity across these species, and a list of all loci that were present at least once was compiled. After filtering duplicates a 7-isolate reference pan-genome, of 3,933 loci, was defined. A core genome of 1,035 genes was ubiquitous in the sample accounting for 59% of the genes in each isolate (average genome size of 1.68 Mb). The accessory genome contained 2,792 genes. A Campylobacter population sample of 192 genomes was screened for the presence of reference pan-genome loci with gene presence defined as a BLAST match of ≥ 70% identity over ≥ 50% of the locus length--aligned using MUSCLE on a gene-by-gene basis. A total of 21 genes were present only in C. coli and 27 only in C. jejuni, providing information about functional differences associated with species and novel epidemiological markers for population genomic analyses. Homologs of these genes were found in several of the genomes used to define the pan-genome and, therefore, would not have been identified using a single reference strain approach.

  20. Identification of large-scale genomic variation in cancer genomes using in silico reference models

    PubMed Central

    Killcoyne, Sarah; del Sol, Antonio

    2016-01-01

    Identifying large-scale structural variation in cancer genomes continues to be a challenge to researchers. Current methods rely on genome alignments based on a reference that can be a poor fit to highly variant and complex tumor genomes. To address this challenge we developed a method that uses available breakpoint information to generate models of structural variations. We use these models as references to align previously unmapped and discordant reads from a genome. By using these models to align unmapped reads, we show that our method can help to identify large-scale variations that have been previously missed. PMID:26264669

  1. The FUN of identifying gene function in bacterial pathogens; insights from Salmonella functional genomics.

    PubMed

    Hammarlöf, Disa L; Canals, Rocío; Hinton, Jay C D

    2013-10-01

    The availability of thousands of genome sequences of bacterial pathogens poses a particular challenge because each genome contains hundreds of genes of unknown function (FUN). How can we easily discover which FUN genes encode important virulence factors? One solution is to combine two different functional genomic approaches. First, transcriptomics identifies bacterial FUN genes that show differential expression during the process of mammalian infection. Second, global mutagenesis identifies individual FUN genes that the pathogen requires to cause disease. The intersection of these datasets can reveal a small set of candidate genes most likely to encode novel virulence attributes. We demonstrate this approach with the Salmonella infection model, and propose that a similar strategy could be used for other bacterial pathogens.

  2. Evidence of codon usage in the nearest neighbor spacing distribution of bases in bacterial genomes

    NASA Astrophysics Data System (ADS)

    Higareda, M. F.; Geiger, O.; Mendoza, L.; Méndez-Sánchez, R. A.

    2012-02-01

    Statistical analysis of whole genomic sequences usually assumes a homogeneous nucleotide density throughout the genome, an assumption that has been proved incorrect for several organisms since the nucleotide density is only locally homogeneous. To avoid giving a single numerical value to this variable property, we propose the use of spectral statistics, which characterizes the density of nucleotides as a function of its position in the genome. We show that the cumulative density of bases in bacterial genomes can be separated into an average (or secular) plus a fluctuating part. Bacterial genomes can be divided into two groups according to the qualitative description of their secular part: linear and piecewise linear. These two groups of genomes show different properties when their nucleotide spacing distribution is studied. In order to analyze genomes having a variable nucleotide density, statistically, the use of unfolding is necessary, i.e., to get a separation between the secular part and the fluctuations. The unfolding allows an adequate comparison with the statistical properties of other genomes. With this methodology, four genomes were analyzed Burkholderia, Bacillus, Clostridium and Corynebacterium. Interestingly, the nearest neighbor spacing distributions or detrended distance distributions are very similar for species within the same genus but they are very different for species from different genera. This difference can be attributed to the difference in the codon usage.

  3. The Bacterial Origins of the CRISPR Genome-Editing Revolution.

    PubMed

    Sontheimer, Erik J; Barrangou, Rodolphe

    2015-07-01

    Like most of the tools that enable modern life science research, the recent genome-editing revolution has its biological roots in the world of bacteria and archaea. Clustered, regularly interspaced, short palindromic repeats (CRISPR) loci are found in the genomes of many bacteria and most archaea, and underlie an adaptive immune system that protects the host cell against invasive nucleic acids such as viral genomes. In recent years, engineered versions of these systems have enabled efficient DNA targeting in living cells from dozens of species (including humans and other eukaryotes), and the exploitation of the resulting endogenous DNA repair pathways has provided a route to fast, easy, and affordable genome editing. In only three years after RNA-guided DNA cleavage was first harnessed, the ability to edit genomes via simple, user-defined RNA sequences has already revolutionized nearly all areas of biological science. CRISPR-based technologies are now poised to similarly revolutionize many facets of clinical medicine, and even promise to advance the long-term goal of directly editing genomic sequences of patients with inherited disease. In this review, we describe the biological and mechanistic basis for these remarkable immune systems, and how their engineered derivatives are revolutionizing basic and clinical research.

  4. Inferring Ancestral Recombination Graphs from Bacterial Genomic Data

    PubMed Central

    Vaughan, Timothy G.; Welch, David; Drummond, Alexei J.; Biggs, Patrick J.; George, Tessy; French, Nigel P.

    2017-01-01

    Homologous recombination is a central feature of bacterial evolution, yet it confounds traditional phylogenetic methods. While a number of methods specific to bacterial evolution have been developed, none of these permit joint inference of a bacterial recombination graph and associated parameters. In this article, we present a new method which addresses this shortcoming. Our method uses a novel Markov chain Monte Carlo algorithm to perform phylogenetic inference under the ClonalOrigin model. We demonstrate the utility of our method by applying it to ribosomal multilocus sequence typing data sequenced from pathogenic and nonpathogenic Escherichia coli serotype O157 and O26 isolates collected in rural New Zealand. The method is implemented as an open source BEAST 2 package, Bacter, which is available via the project web page at http://tgvaughan.github.io/bacter. PMID:28007885

  5. Draft Genomes, Phylogenetic Reconstruction, and Comparative Genomics of Two Novel Cohabiting Bacterial Symbionts Isolated from Frankliniella occidentalis

    PubMed Central

    Facey, Paul D.; Méric, Guillaume; Hitchings, Matthew D.; Pachebat, Justin A.; Hegarty, Matt J.; Chen, Xiaorui; Morgan, Laura V.A.; Hoeppner, James E.; Whitten, Miranda M.A.; Kirk, William D.J.; Dyson, Paul J.; Sheppard, Sam K.; Sol, Ricardo Del

    2015-01-01

    Obligate bacterial symbionts are widespread in many invertebrates, where they are often confined to specialized host cells and are transmitted directly from mother to progeny. Increasing numbers of these bacteria are being characterized but questions remain about their population structure and evolution. Here we take a comparative genomics approach to investigate two prominent bacterial symbionts (BFo1 and BFo2) isolated from geographically separated populations of western flower thrips, Frankliniella occidentalis. Our multifaceted approach to classifying these symbionts includes concatenated multilocus sequence analysis (MLSA) phylogenies, ribosomal multilocus sequence typing (rMLST), construction of whole-genome phylogenies, and in-depth genomic comparisons. We showed that the BFo1 genome clusters more closely to species in the genus Erwinia, and is a putative close relative to Erwinia aphidicola. BFo1 is also likely to have shared a common ancestor with Erwinia pyrifoliae/Erwinia amylovora and the nonpathogenic Erwinia tasmaniensis and genetic traits similar to Erwinia billingiae. The BFo1 genome contained virulence factors found in the genus Erwinia but represented a divergent lineage. In contrast, we showed that BFo2 belongs within the Enterobacteriales but does not group closely with any currently known bacterial species. Concatenated MLSA phylogenies indicate that it may have shared a common ancestor to the Erwinia and Pantoea genera, and based on the clustering of rMLST genes, it was most closely related to Pantoea ananatis but represented a divergent lineage. We reconstructed a core genome of a putative common ancestor of Erwinia and Pantoea and compared this with the genomes of BFo bacteria. BFo2 possessed none of the virulence determinants that were omnipresent in the Erwinia and Pantoea genera. Taken together, these data are consistent with BFo2 representing a highly novel species that maybe related to known Pantoea. PMID:26185096

  6. Scaling, crumpled wires, and genome packing in virions

    NASA Astrophysics Data System (ADS)

    de Holanda, V. H.; Gomes, M. A. F.

    2016-12-01

    The packing of a genome in virions is a topic of intense current interest in biology and biological physics. The area is dominated by allometric scaling relations that connect, e.g., the length of the encapsulated genome and the size of the corresponding virion capsid. Here we report scaling laws obtained from extensive experiments of packing of a macroscopic wire within rigid three-dimensional spherical and nonspherical cavities that can shed light on the details of the genome packing in virions. We show that these results obtained with crumpled wires are comparable to those from a large compilation of biological data from several classes of virions.

  7. Genome-scale resources for Thermoanaerobacterium saccharolyticum

    DOE PAGES

    Currie, Devin H.; Raman, Babu; Gowen, Christopher M.; ...

    2015-06-26

    Thermoanaerobacterium saccharolyticum is a hemicellulose-degrading thermophilic anaerobe that was previously engineered to produce ethanol at high yield. For this research, a major project was undertaken to develop this organism into an industrial biocatalyst, but the lack of genome information and resources were recognized early on as a key limitation.

  8. Toward Complete Bacterial Genome Sequencing Through the Combined Use of Multiple Next-Generation Sequencing Platforms.

    PubMed

    Jeong, Haeyoung; Lee, Dae-Hee; Ryu, Choong-Min; Park, Seung-Hwan

    2016-01-01

    PacBio's long-read sequencing technologies can be successfully used for a complete bacterial genome assembly using recently developed non-hybrid assemblers in the absence of secondgeneration, high-quality short reads. However, standardized procedures that take into account multiple pre-existing second-generation sequencing platforms are scarce. In addition to Illumina HiSeq and Ion Torrent PGM-based genome sequencing results derived from previous studies, we generated further sequencing data, including from the PacBio RS II platform, and applied various bioinformatics tools to obtain complete genome assemblies for five bacterial strains. Our approach revealed that the hierarchical genome assembly process (HGAP) non-hybrid assembler resulted in nearly complete assemblies at a moderate coverage of ~75x, but that different versions produced non-compatible results requiring post processing. The other two platforms further improved the PacBio assembly through scaffolding and a final error correction.

  9. A Markovian analysis of bacterial genome sequence constraints

    PubMed Central

    Skewes, Aaron D.

    2013-01-01

    The arrangement of nucleotides within a bacterial chromosome is influenced by numerous factors. The degeneracy of the third codon within each reading frame allows some flexibility of nucleotide selection; however, the third nucleotide in the triplet of each codon is at least partly determined by the preceding two. This is most evident in organisms with a strong G + C bias, as the degenerate codon must contribute disproportionately to maintaining that bias. Therefore, a correlation exists between the first two nucleotides and the third in all open reading frames. If the arrangement of nucleotides in a bacterial chromosome is represented as a Markov process, we would expect that the correlation would be completely captured by a second-order Markov model and an increase in the order of the model (e.g., third-, fourth-…order) would not capture any additional uncertainty in the process. In this manuscript, we present the results of a comprehensive study of the Markov property that exists in the DNA sequences of 906 bacterial chromosomes. All of the 906 bacterial chromosomes studied exhibit a statistically significant Markov property that extends beyond second-order, and therefore cannot be fully explained by codon usage. An unrooted tree containing all 906 bacterial chromosomes based on their transition probability matrices of third-order shares ∼25% similarity to a tree based on sequence homologies of 16S rRNA sequences. This congruence to the 16S rRNA tree is greater than for trees based on lower-order models (e.g., second-order), and higher-order models result in diminishing improvements in congruence. A nucleotide correlation most likely exists within every bacterial chromosome that extends past three nucleotides. This correlation places significant limits on the number of nucleotide sequences that can represent probable bacterial chromosomes. Transition matrix usage is largely conserved by taxa, indicating that this property is likely inherited, however some

  10. A CRISPR-Cas9 Assisted Non-Homologous End-Joining Strategy for One-step Engineering of Bacterial Genome

    PubMed Central

    Su, Tianyuan; Liu, Fapeng; Gu, Pengfei; Jin, Haiying; Chang, Yizhao; Wang, Qian; Liang, Quanfeng; Qi, Qingsheng

    2016-01-01

    Homologous recombination-mediated genome engineering has been broadly applied in prokaryotes with high efficiency and accuracy. However, this method is limited in realizing larger-scale genome editing with numerous genes or large DNA fragments because of the relatively complicated procedure for DNA editing template construction. Here, we describe a CRISPR-Cas9 assisted non-homologous end-joining (CA-NHEJ) strategy for the rapid and efficient inactivation of bacterial gene (s) in a homologous recombination-independent manner and without the use of selective marker. Our study show that CA-NHEJ can be used to delete large chromosomal DNA fragments in a single step that does not require homologous DNA template. It is thus a novel and powerful tool for bacterial genomes reducing and possesses the potential for accelerating the genome evolution. PMID:27883076

  11. DNA phosphorothioation is widespread and quantized in bacterial genomes

    PubMed Central

    Wang, Lianrong; Chen, Shi; Vergin, Kevin L.; Giovannoni, Stephen J.; Chan, Simon W.; DeMott, Michael S.; Taghizadeh, Koli; Cordero, Otto X.; Cutler, Michael; Timberlake, Sonia; Alm, Eric J.; Polz, Martin F.; Pinhassi, Jarone; Deng, Zixin; Dedon, Peter C.

    2011-01-01

    Phosphorothioate (PT) modification of DNA, with sulfur replacing a nonbridging phosphate oxygen, was recently discovered as a product of the dnd genes found in bacteria and archaea. Given our limited understanding of the biological function of PT modifications, including sequence context, genomic frequencies, and relationships to the diversity of dnd gene clusters, we undertook a quantitative study of PT modifications in prokaryotic genomes using a liquid chromatography-coupled tandem quadrupole mass spectrometry approach. The results revealed a diversity of unique PT sequence contexts and three discrete genomic frequencies in a wide range of bacteria. Metagenomic analyses of PT modifications revealed unique ecological distributions, and a phylogenetic comparison of dnd genes and PT sequence contexts strongly supports the horizontal transfer of dnd genes. These results are consistent with the involvement of PT modifications in a type of restriction-modification system with wide distribution in prokaryotes. PMID:21285367

  12. GeneOrder3.0: Software for comparing the order of genes in pairs of small bacterial genomes

    PubMed Central

    Celamkoti, Srikanth; Kundeti, Sashidhara; Purkayastha, Anjan; Mazumder, Raja; Buck, Charles; Seto, Donald

    2004-01-01

    Background An increasing number of whole viral and bacterial genomes are being sequenced and deposited in public databases. In parallel to the mounting interest in whole genomes, the number of whole genome analyses software tools is also increasing. GeneOrder was originally developed to provide an analysis of genes between two genomes, allowing visualization of gene order and synteny comparisons of any small genomes. It was originally developed for comparing virus, mitochondrion and chloroplast genomes. This is now extended to small bacterial genomes of sizes less than 2 Mb. Results GeneOrder3.0 has been developed and validated successfully on several small bacterial genomes (ca. 580 kb to 1.83 Mb) archived in the NCBI GenBank database. It is an updated web-based "on-the-fly" computational tool allowing gene order and synteny comparisons of any two small bacterial genomes. Analyses of several bacterial genomes show that a large amount of gene and genome re-arrangement occurs, as seen with earlier DNA software tools. This can be displayed at the protein level using GeneOrder3.0. Whole genome alignments of genes are presented in both a table and a dot plot. This allows the detection of evolutionary more distant relationships since protein sequences are more conserved than DNA sequences. Conclusions GeneOrder3.0 allows researchers to perform comparative analysis of gene order and synteny in genomes of sizes up to 2 Mb "on-the-fly." Availability: and . PMID:15128433

  13. Large Scale Bacterial Colony Screening of Diversified FRET Biosensors

    PubMed Central

    Litzlbauer, Julia; Schifferer, Martina; Ng, David; Fabritius, Arne; Thestrup, Thomas; Griesbeck, Oliver

    2015-01-01

    Biosensors based on Förster Resonance Energy Transfer (FRET) between fluorescent protein mutants have started to revolutionize physiology and biochemistry. However, many types of FRET biosensors show relatively small FRET changes, making measurements with these probes challenging when used under sub-optimal experimental conditions. Thus, a major effort in the field currently lies in designing new optimization strategies for these types of sensors. Here we describe procedures for optimizing FRET changes by large scale screening of mutant biosensor libraries in bacterial colonies. We describe optimization of biosensor expression, permeabilization of bacteria, software tools for analysis, and screening conditions. The procedures reported here may help in improving FRET changes in multiple suitable classes of biosensors. PMID:26061878

  14. A hybrid approach for the automated finishing of bacterial genomes.

    PubMed

    Bashir, Ali; Klammer, Aaron A; Robins, William P; Chin, Chen-Shan; Webster, Dale; Paxinos, Ellen; Hsu, David; Ashby, Meredith; Wang, Susana; Peluso, Paul; Sebra, Robert; Sorenson, Jon; Bullard, James; Yen, Jackie; Valdovino, Marie; Mollova, Emilia; Luong, Khai; Lin, Steven; LaMay, Brianna; Joshi, Amruta; Rowe, Lori; Frace, Michael; Tarr, Cheryl L; Turnsek, Maryann; Davis, Brigid M; Kasarskis, Andrew; Mekalanos, John J; Waldor, Matthew K; Schadt, Eric E

    2012-07-01

    Advances in DNA sequencing technology have improved our ability to characterize most genomic diversity. However, accurate resolution of large structural events is challenging because of the short read lengths of second-generation technologies. Third-generation sequencing technologies, which can yield longer multikilobase reads, have the potential to address limitations associated with genome assembly. Here we combine sequencing data from second- and third-generation DNA sequencing technologies to assemble the two-chromosome genome of a recent Haitian cholera outbreak strain into two nearly finished contigs at >99.9% accuracy. Complex regions with clinically relevant structure were completely resolved. In separate control assemblies on experimental and simulated data for the canonical N16961 cholera reference strain, we obtained 14 scaffolds of greater than 1 kb for the experimental data and 8 scaffolds of greater than 1 kb for the simulated data, which allowed us to correct several errors in contigs assembled from the short-read data alone. This work provides a blueprint for the next generation of rapid microbial identification and full-genome assembly.

  15. Genome Sequences of Nine Gram-Negative Vaginal Bacterial Isolates

    PubMed Central

    Deitzler, Grace E.; Ruiz, Maria J.; Lu, Wendy; Weimer, Cory; Park, SoEun; Robinson, Lloyd S.; Hallsworth-Pepin, Kymberlie; Wollam, Aye; Mitreva, Makedonka

    2016-01-01

    The vagina is home to a wide variety of bacteria that have great potential to impact human health. Here, we announce reference strains (now available through BEI Resources) and draft genome sequences for 9 Gram-negative vaginal isolates from the taxa Citrobacter, Klebsiella, Fusobacterium, Proteus, and Prevotella. PMID:27688330

  16. The Extent of Genome Flux and Its Role in the Differentiation of Bacterial Lineages

    PubMed Central

    Nowell, Reuben W.; Green, Sarah; Laue, Bridget E.; Sharp, Paul M.

    2014-01-01

    Horizontal gene transfer (HGT) and gene loss are key processes in bacterial evolution. However, the role of gene gain and loss in the emergence and maintenance of ecologically differentiated bacterial populations remains an open question. Here, we use whole-genome sequence data to quantify gene gain and loss for 27 lineages of the plant-associated bacterium Pseudomonas syringae. We apply an extensive error-control procedure that accounts for errors in draft genome data and greatly improves the accuracy of patterns of gene occurrence among these genomes. We demonstrate a history of extensive genome fluctuation for this species and show that individual lineages could have acquired thousands of genes in the same period in which a 1% amino acid divergence accrues in the core genome. Elucidating the dynamics of genome fluctuation reveals the rapid turnover of gained genes, such that the majority of recently gained genes are quickly lost. Despite high observed rates of fluctuation, a phylogeny inferred from patterns of gene occurrence is similar to a phylogeny based on amino acid replacements within the core genome. Furthermore, the core genome phylogeny suggests that P. syringae should be considered a number of distinct species, with levels of divergence at least equivalent to those between recognized bacterial species. Gained genes are transferred from a variety of sources, reflecting the depth and diversity of the potential gene pool available via HGT. Overall, our results provide further insights into the evolutionary dynamics of genome fluctuation and implicate HGT as a major factor contributing to the diversification of P. syringae lineages. PMID:24923323

  17. The OME Framework for genome-scale systems biology

    SciTech Connect

    Palsson, Bernhard O.; Ebrahim, Ali; Federowicz, Steve

    2014-12-19

    The life sciences are undergoing continuous and accelerating integration with computational and engineering sciences. The biology that many in the field have been trained on may be hardly recognizable in ten to twenty years. One of the major drivers for this transformation is the blistering pace of advancements in DNA sequencing and synthesis. These advances have resulted in unprecedented amounts of new data, information, and knowledge. Many software tools have been developed to deal with aspects of this transformation and each is sorely needed [1-3]. However, few of these tools have been forced to deal with the full complexity of genome-scale models along with high throughput genome- scale data. This particular situation represents a unique challenge, as it is simultaneously necessary to deal with the vast breadth of genome-scale models and the dizzying depth of high-throughput datasets. It has been observed time and again that as the pace of data generation continues to accelerate, the pace of analysis significantly lags behind [4]. It is also evident that, given the plethora of databases and software efforts [5-12], it is still a significant challenge to work with genome-scale metabolic models, let alone next-generation whole cell models [13-15]. We work at the forefront of model creation and systems scale data generation [16-18]. The OME Framework was borne out of a practical need to enable genome-scale modeling and data analysis under a unified framework to drive the next generation of genome-scale biological models. Here we present the OME Framework. It exists as a set of Python classes. However, we want to emphasize the importance of the underlying design as an addition to the discussions on specifications of a digital cell. A great deal of work and valuable progress has been made by a number of communities [13, 19-24] towards interchange formats and implementations designed to achieve similar goals. While many software tools exist for handling genome-scale

  18. GFinisher: a new strategy to refine and finish bacterial genome assemblies

    NASA Astrophysics Data System (ADS)

    Guizelini, Dieval; Raittz, Roberto T.; Cruz, Leonardo M.; Souza, Emanuel M.; Steffens, Maria B. R.; Pedrosa, Fabio O.

    2016-10-01

    Despite the development in DNA sequencing technology, improving the number and the length of reads, the process of reconstruction of complete genome sequences, the so called genome assembly, is still complex. Only 13% of the prokaryotic genome sequencing projects have been completed. Draft genome sequences deposited in public databases are fragmented in contigs and may lack the full gene complement. The aim of the present work is to identify assembly errors and improve the assembly process of bacterial genomes. The biological patterns observed in genomic sequences and the application of a priori information can allow the identification of misassembled regions, and the reorganization and improvement of the overall de novo genome assembly. GFinisher starts generating a Fuzzy GC skew graphs for each contig in an assembly and follows breaking down the contigs in critical points in order to reassemble and close them using jFGap. This has been successfully applied to dataset from 96 genome assemblies, decreasing the number of contigs by up to 86%. GFinisher can easily optimize assemblies of prokaryotic draft genomes and can be used to improve the assembly programs based on nucleotide sequence patterns in the genome. The software and source code are available at http://gfinisher.sourceforge.net/.

  19. GFinisher: a new strategy to refine and finish bacterial genome assemblies

    PubMed Central

    Guizelini, Dieval; Raittz, Roberto T.; Cruz, Leonardo M.; Souza, Emanuel M.; Steffens, Maria B. R.; Pedrosa, Fabio O.

    2016-01-01

    Despite the development in DNA sequencing technology, improving the number and the length of reads, the process of reconstruction of complete genome sequences, the so called genome assembly, is still complex. Only 13% of the prokaryotic genome sequencing projects have been completed. Draft genome sequences deposited in public databases are fragmented in contigs and may lack the full gene complement. The aim of the present work is to identify assembly errors and improve the assembly process of bacterial genomes. The biological patterns observed in genomic sequences and the application of a priori information can allow the identification of misassembled regions, and the reorganization and improvement of the overall de novo genome assembly. GFinisher starts generating a Fuzzy GC skew graphs for each contig in an assembly and follows breaking down the contigs in critical points in order to reassemble and close them using jFGap. This has been successfully applied to dataset from 96 genome assemblies, decreasing the number of contigs by up to 86%. GFinisher can easily optimize assemblies of prokaryotic draft genomes and can be used to improve the assembly programs based on nucleotide sequence patterns in the genome. The software and source code are available at http://gfinisher.sourceforge.net/. PMID:27721396

  20. GFinisher: a new strategy to refine and finish bacterial genome assemblies.

    PubMed

    Guizelini, Dieval; Raittz, Roberto T; Cruz, Leonardo M; Souza, Emanuel M; Steffens, Maria B R; Pedrosa, Fabio O

    2016-10-10

    Despite the development in DNA sequencing technology, improving the number and the length of reads, the process of reconstruction of complete genome sequences, the so called genome assembly, is still complex. Only 13% of the prokaryotic genome sequencing projects have been completed. Draft genome sequences deposited in public databases are fragmented in contigs and may lack the full gene complement. The aim of the present work is to identify assembly errors and improve the assembly process of bacterial genomes. The biological patterns observed in genomic sequences and the application of a priori information can allow the identification of misassembled regions, and the reorganization and improvement of the overall de novo genome assembly. GFinisher starts generating a Fuzzy GC skew graphs for each contig in an assembly and follows breaking down the contigs in critical points in order to reassemble and close them using jFGap. This has been successfully applied to dataset from 96 genome assemblies, decreasing the number of contigs by up to 86%. GFinisher can easily optimize assemblies of prokaryotic draft genomes and can be used to improve the assembly programs based on nucleotide sequence patterns in the genome. The software and source code are available at http://gfinisher.sourceforge.net/.

  1. Draft genome sequence of XANTHOMONAS ARBORICOLA strain 3004, causal agent of bacterial disease on barley

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We report here the annotated genome sequence of XANTHOMONAS ARBORICOLA str. 3004, a Gram-negative phytopathogenic bacteria that includes several pathovars characterized by virulence specificity. Strain 3004 was isolated from barley leaves with symptoms of streak (bacterial blight) and also can infec...

  2. Complete Genome Sequence of the Intracellular Bacterial Symbiont TC1 in the Anaerobic Ciliate Trimyema compressum.

    PubMed

    Shinzato, Naoya; Aoyama, Hiroaki; Saitoh, Seikoh; Nikoh, Naruo; Nakano, Kazuma; Shimoji, Makiko; Shinzato, Misuzu; Satou, Kazuhito; Teruya, Kuniko; Hirano, Takashi; Yamada, Takanori; Nobu, Masaru K; Tamaki, Hideyuki; Shirai, Yumi; Park, Sanghwa; Narihiro, Takashi; Liu, Wen-Tso; Kamagata, Yoichi

    2016-09-22

    A free-living ciliate, Trimyema compressum, found in anoxic freshwater environments harbors methanogenic archaea and a bacterial symbiont named TC1 in its cytoplasm. Here, we report the complete genome sequence of the TC1 symbiont, consisting of a 1.59-Mb chromosome and a 35.8-kb plasmid, which was determined using the PacBio RSII sequencer.

  3. Impact of small repeat sequences on bacterial genome evolution.

    PubMed

    Delihas, Nicholas

    2011-01-01

    Intergenic regions of prokaryotic genomes carry multiple copies of terminal inverted repeat (TIR) sequences, the nonautonomous miniature inverted-repeat transposable element (MITE). In addition, there are the repetitive extragenic palindromic (REP) sequences that fold into a small stem loop rich in G-C bonding. And the clustered regularly interspaced short palindromic repeats (CRISPRs) display similar small stem loops but are an integral part of a complex genetic element. Other classes of repeats such as the REP2 element do not have TIRs but show other signatures. With the current availability of a large number of whole-genome sequences, many new repeat elements have been discovered. These sequences display diverse properties. Some show an intimate linkage to integrons, and at least one encodes a small RNA. Many repeats are found fused with chromosomal open reading frames, and some are located within protein coding sequences. Small repeat units appear to work hand in hand with the transcriptional and/or post-transcriptional apparatus of the cell. Functionally, they are multifaceted, and this can range from the control of gene expression, the facilitation of host/pathogen interactions, or stimulation of the mammalian immune system. The CRISPR complex displays dramatic functions such as an acquired immune system that defends against invading viruses and plasmids. Evolutionarily, mobile repeat elements may have influenced a cycle of active versus inactive genes in ancestral organisms, and some repeats are concentrated in regions of the chromosome where there is significant genomic plasticity. Changes in the abundance of genomic repeats during the evolution of an organism may have resulted in a benefit to the cell or posed a disadvantage, and some present day species may reflect a purification process. The diverse structure, eclectic functions, and evolutionary aspects of repeat elements are described.

  4. Distribution of repetitive DNA sequences in eubacteria and application to fingerprinting of bacterial genomes.

    PubMed Central

    Versalovic, J; Koeuth, T; Lupski, J R

    1991-01-01

    Dispersed repetitive DNA sequences have been described recently in eubacteria. To assess the distribution and evolutionary conservation of two distinct prokaryotic repetitive elements, consensus oligonucleotides were used in polymerase chain reaction [PCR] amplification and slot blot hybridization experiments with genomic DNA from diverse eubacterial species. Oligonucleotides matching Repetitive Extragenic Palindromic [REP] elements and Enterobacterial Repetitive Intergenic Consensus [ERIC] sequences were synthesized and tested as opposing PCR primers in the amplification of eubacterial genomic DNA. REP and ERIC consensus oligonucleotides produced clearly resolvable bands by agarose gel electrophoresis following PCR amplification. These band patterns provided unambiguous DNA fingerprints of different eubacterial species and strains. Both REP and ERIC probes hybridized preferentially to genomic DNA from Gram-negative enteric bacteria and related species. Widespread distribution of these repetitive DNA elements in the genomes of various microorganisms should enable rapid identification of bacterial species and strains, and be useful for the analysis of prokaryotic genomes. Images PMID:1762913

  5. Construction of a llama bacterial artificial chromosome library with approximately 9-fold genome equivalent coverage.

    PubMed

    Airmet, K W; Hinckley, J D; Tree, L T; Moss, M; Blumell, S; Ulicny, K; Gustafson, A K; Weed, M; Theodosis, R; Lehnardt, M; Genho, J; Stevens, M R; Kooyman, D L

    2012-01-01

    The Ilama is an important agricultural livestock in much of South America. The llama is increasing in popularity in the United States as a companion animal. Little work has been done to improve llama production using modern technology. A paucity of information is available regarding the llama genome. We report the construction of a llama bacterial artificial chromosome (BAC) library of about 196,224 clones in the vector pECBAC1. Using flow cytometry and bovine, human, mouse, and chicken as controls, we determined the llama genome size to be 2.4 × 10⁹ bp. The average insert size of the library is 137.8 kb corresponding to approximately 9-fold genome coverage. Further studies are needed to further characterize the library and llama genome. We anticipate that this new library will help facilitate future genomic studies in the llama.

  6. Reconstructing genome-scale metabolic models with merlin

    PubMed Central

    Dias, Oscar; Rocha, Miguel; Ferreira, Eugénio C.; Rocha, Isabel

    2015-01-01

    The Metabolic Models Reconstruction Using Genome-Scale Information (merlin) tool is a user-friendly Java application that aids the reconstruction of genome-scale metabolic models for any organism that has its genome sequenced. It performs the major steps of the reconstruction process, including the functional genomic annotation of the whole genome and subsequent construction of the portfolio of reactions. Moreover, merlin includes tools for the identification and annotation of genes encoding transport proteins, generating the transport reactions for those carriers. It also performs the compartmentalisation of the model, predicting the organelle localisation of the proteins encoded in the genome and thus the localisation of the metabolites involved in the reactions promoted by such enzymes. The gene-proteins-reactions (GPR) associations are automatically generated and included in the model. Finally, merlin expedites the transition from genomic data to draft metabolic models reconstructions exported in the SBML standard format, allowing the user to have a preliminary view of the biochemical network, which can be manually curated within the environment provided by merlin. PMID:25845595

  7. Reconstructing genome-scale metabolic models with merlin.

    PubMed

    Dias, Oscar; Rocha, Miguel; Ferreira, Eugénio C; Rocha, Isabel

    2015-04-30

    The Metabolic Models Reconstruction Using Genome-Scale Information (merlin) tool is a user-friendly Java application that aids the reconstruction of genome-scale metabolic models for any organism that has its genome sequenced. It performs the major steps of the reconstruction process, including the functional genomic annotation of the whole genome and subsequent construction of the portfolio of reactions. Moreover, merlin includes tools for the identification and annotation of genes encoding transport proteins, generating the transport reactions for those carriers. It also performs the compartmentalisation of the model, predicting the organelle localisation of the proteins encoded in the genome and thus the localisation of the metabolites involved in the reactions promoted by such enzymes. The gene-proteins-reactions (GPR) associations are automatically generated and included in the model. Finally, merlin expedites the transition from genomic data to draft metabolic models reconstructions exported in the SBML standard format, allowing the user to have a preliminary view of the biochemical network, which can be manually curated within the environment provided by merlin.

  8. Bacterial genome reduction using the progressive clustering of deletions via yeast sexual cycling

    PubMed Central

    Assad-Garcia, Nacyra; Kostylev, Maxim; Noskov, Vladimir N.; Wise, Kim S.; Karas, Bogumil J.; Stam, Jason; Montague, Michael G.; Hanly, Timothy J.; Enriquez, Nico J.; Ramon, Adi; Goldgof, Gregory M.; Richter, R. Alexander; Vashee, Sanjay; Chuang, Ray-Yuan; Winzeler, Elizabeth A.; Hutchison, Clyde A.; Gibson, Daniel G.; Smith, Hamilton O.; Glass, John I.; Venter, J. Craig

    2015-01-01

    The availability of genetically tractable organisms with simple genomes is critical for the rapid, systems-level understanding of basic biological processes. Mycoplasma bacteria, with the smallest known genomes among free-living cellular organisms, are ideal models for this purpose, but the natural versions of these cells have genome complexities still too great to offer a comprehensive view of a fundamental life form. Here we describe an efficient method for reducing genomes from these organisms by identifying individually deletable regions using transposon mutagenesis and progressively clustering deleted genomic segments using meiotic recombination between the bacterial genomes harbored in yeast. Mycoplasmal genomes subjected to this process and transplanted into recipient cells yielded two mycoplasma strains. The first simultaneously lacked eight singly deletable regions of the genome, representing a total of 91 genes and ∼10% of the original genome. The second strain lacked seven of the eight regions, representing 84 genes. Growth assay data revealed an absence of genetic interactions among the 91 genes under tested conditions. Despite predicted effects of the deletions on sugar metabolism and the proteome, growth rates were unaffected by the gene deletions in the seven-deletion strain. These results support the feasibility of using single-gene disruption data to design and construct viable genomes lacking multiple genes, paving the way toward genome minimization. The progressive clustering method is expected to be effective for the reorganization of any mega-sized DNA molecules cloned in yeast, facilitating the construction of designer genomes in microbes as well as genomic fragments for genetic engineering of higher eukaryotes. PMID:25654978

  9. Bacterial genome reduction using the progressive clustering of deletions via yeast sexual cycling

    SciTech Connect

    Suzuki, Yo; Assad-Garcia, Nacyra; Kostylev, Maxim; Noskov, Vladimir N.; Wise, Kim S.; Karas, Bogumil J.; Stam, Jason; Montague, Michael G.; Hanly, Timothy J.; Enriquez, Nico J.; Ramon, Adi; Goldgof, Gregory M.; Richter, R. Alexander; Vashee, Sanjay; Chuang, Ray-Yuan; Winzeler, Elizabeth A.; Hutchison, Clyde A.; Gibson, Daniel G.; Smith, Hamilton O.; Glass, John I.; Venter, J. Craig

    2015-02-05

    The availability of genetically tractable organisms with simple genomes is critical for the rapid, systems-level understanding of basic biological processes. Mycoplasma bacteria, with the smallest known genomes among free-living cellular organisms, are ideal models for this purpose, but the natural versions of these cells have genome complexities still too great to offer a comprehensive view of a fundamental life form. Here in this paper we describe an efficient method for reducing genomes from these organisms by identifying individually deletable regions using transposon mutagenesis and progressively clustering deleted genomic segments using meiotic recombination between the bacterial genomes harbored in yeast. Mycoplasmal genomes subjected to this process and transplanted into recipient cells yielded two mycoplasma strains. The first simultaneously lacked eight singly deletable regions of the genome, representing a total of 91 genes and ~10%of the original genome. The second strain lacked seven of the eight regions, representing 84 genes. Growth assay data revealed an absence of genetic interactions among the 91 genes under tested conditions. Despite predicted effects of the deletions on sugar metabolism and the proteome, growth rates were unaffected by the gene deletions in the seven-deletion strain. These results support the feasibility of using single-gene disruption data to design and construct viable genomes lacking multiple genes, paving the way toward genome minimization. The progressive clustering method is expected to be effective for the reorganization of any mega-sized DNA molecules cloned in yeast, facilitating the construction of designer genomes in microbes as well as genomic fragments for genetic engineering of higher eukaryotes.

  10. Bacterial genome reduction using the progressive clustering of deletions via yeast sexual cycling

    DOE PAGES

    Suzuki, Yo; Assad-Garcia, Nacyra; Kostylev, Maxim; ...

    2015-02-05

    The availability of genetically tractable organisms with simple genomes is critical for the rapid, systems-level understanding of basic biological processes. Mycoplasma bacteria, with the smallest known genomes among free-living cellular organisms, are ideal models for this purpose, but the natural versions of these cells have genome complexities still too great to offer a comprehensive view of a fundamental life form. Here in this paper we describe an efficient method for reducing genomes from these organisms by identifying individually deletable regions using transposon mutagenesis and progressively clustering deleted genomic segments using meiotic recombination between the bacterial genomes harbored in yeast. Mycoplasmalmore » genomes subjected to this process and transplanted into recipient cells yielded two mycoplasma strains. The first simultaneously lacked eight singly deletable regions of the genome, representing a total of 91 genes and ~10%of the original genome. The second strain lacked seven of the eight regions, representing 84 genes. Growth assay data revealed an absence of genetic interactions among the 91 genes under tested conditions. Despite predicted effects of the deletions on sugar metabolism and the proteome, growth rates were unaffected by the gene deletions in the seven-deletion strain. These results support the feasibility of using single-gene disruption data to design and construct viable genomes lacking multiple genes, paving the way toward genome minimization. The progressive clustering method is expected to be effective for the reorganization of any mega-sized DNA molecules cloned in yeast, facilitating the construction of designer genomes in microbes as well as genomic fragments for genetic engineering of higher eukaryotes.« less

  11. Genome-based microbial ecology of anammox granules in a full-scale wastewater treatment system

    PubMed Central

    Speth, Daan R.; in 't Zandt, Michiel H.; Guerrero-Cruz, Simon; Dutilh, Bas E.; Jetten, Mike S. M.

    2016-01-01

    Partial-nitritation anammox (PNA) is a novel wastewater treatment procedure for energy-efficient ammonium removal. Here we use genome-resolved metagenomics to build a genome-based ecological model of the microbial community in a full-scale PNA reactor. Sludge from the bioreactor examined here is used to seed reactors in wastewater treatment plants around the world; however, the role of most of its microbial community in ammonium removal remains unknown. Our analysis yielded 23 near-complete draft genomes that together represent the majority of the microbial community. We assign these genomes to distinct anaerobic and aerobic microbial communities. In the aerobic community, nitrifying organisms and heterotrophs predominate. In the anaerobic community, widespread potential for partial denitrification suggests a nitrite loop increases treatment efficiency. Of our genomes, 19 have no previously cultivated or sequenced close relatives and six belong to bacterial phyla without any cultivated members, including the most complete Omnitrophica (formerly OP3) genome to date. PMID:27029554

  12. From genes to genomes: universal scale-invariant properties of microbial chromosome organisation.

    PubMed

    Audit, Benjamin; Ouzounis, Christos A

    2003-09-19

    The availability of complete genome sequences for a large variety of organisms is a major advance in understanding genome structure and function. One attribute of genome structure is chromosome organisation in terms of gene localisation and orientation. For example, bacterial operons, i.e. clusters of co-oriented genes that form transcription units, enable functionally related genes to be expressed simultaneously. The description of genome organisation was pioneered with the study of the distribution of genes of the Escherichia coli partial genetic map before the full genome sequence was known. Deploying powerful techniques from circular statistics and signal processing, we revisit the issue of gene localisation and orientation using 89 complete microbial chromosomes from the eubacterial and archaeal domains. We demonstrate that there is no characteristic size pertinent to the description of chromosome structure, e.g. there does not exist any single length appropriate to describe gene clustering. Our results show that, for all 89 chromosomes, gene positions and gene orientations share a common form of scale-invariant correlations known as "long-range correlations" that we can reveal for distances from the gene length, up to the chromosome size. This observation indicates that genes tend to assemble and to co-orient over any scale of observation greater than a few kilobases. This unexpected property of chromosome structure can be portrayed as an operon-like organisation at all scales and implies that a complete scale range extending over more than three orders of magnitudes of chromosome segment lengths is necessary to properly describe prokaryotic genome organisation. We propose that this pattern results from the effects of the superhelical context on gene expression coupled with the structure and dynamics of the nucleoid, possibly accommodating the diverse gene expression profiles needed during the different stages of cellular life.

  13. The Distributed Genome Hypothesis as a Rubric for Understanding Evolution in situ During Chronic Bacterial Biofilm Infectious Processes

    PubMed Central

    Ehrlich, Garth D.; Ahmed, Azad; Earl, Josh; Hiller, N. Luisa; Costerton, J. William; Stoodley, Paul; Post, J. Christopher; DeMeo, Patrick; Hu, Fen Ze

    2010-01-01

    Most chronic infectious disease processes associated with bacteria are characterized by the formation of a biofilm which provides for bacterial attachment to the host tissue or implanted medical device. The biofilm protects the bacteria from the host’s adaptive immune response, as well as predation by phagocytic cells. However, the most insidious aspect of biofilm biology from the host’s point of view is that the biofilm provides an ideal setting for bacterial horizontal gene transfer (HGT). HGT provides for large-scale genome content changes in situ during the chronic infectious process. Obviously, for HGT processes to result in the reassortment of alleles and genes among bacterial strains the infection must be polyclonal (polymicrobial) in nature. In this review we marshal the evidence that all of the factors are present in biofilm infections to support HGT which results in the ongoing production of novel strains with unique combinations of genic characters and that the continual production of large numbers of novel, but related bacterial strains leads to persistence. This concept of an infecting population of bacteria undergoing mutagenesis to produce a ‘cloud’ of similar strains to confuse and overwhelm the host’s immune system parallels genetic diversity stratagies employed by viral and parasitic pathogens. PMID:20618850

  14. Recombination produces coherent bacterial species clusters in both core and accessory genomes

    PubMed Central

    Croucher, Nicholas J.; Gutmann, Michael U.; Corander, Jukka; Hanage, William P.

    2015-01-01

    Background: Population samples show bacterial genomes can be divided into a core of ubiquitous genes and accessory genes that are present in a fraction of isolates. The ecological significance of this variation in gene content remains unclear. However, microbiologists agree that a bacterial species should be ‘genomically coherent’, even though there is no consensus on how this should be determined. Results: We use a parsimonious model combining diversification in both the core and accessory genome, including mutation, homologous recombination (HR) and horizontal gene transfer (HGT) introducing new loci, to produce a population of interacting clusters of strains with varying genome content. New loci introduced by HGT may then be transferred on by HR. The model fits well to a systematic population sample of 616 pneumococcal genomes, capturing the major features of the population structure with parameter values that agree well with empirical estimates. Conclusions: The model does not include explicit selection on individual genes, suggesting that crude comparisons of gene content may be a poor predictor of ecological function. We identify a clearly divergent subpopulation of pneumococci that are inconsistent with the model and may be considered genomically incoherent with the rest of the population. These strains have a distinct disease tropism and may be rationally defined as a separate species. We also find deviations from the model that may be explained by recent population bottlenecks or spatial structure.

  15. Pervasive, genome-wide positive selection leading to functional divergence in the bacterial genus Campylobacter

    PubMed Central

    Lefébure, Tristan; Stanhope, Michael J.

    2009-01-01

    An open question in bacterial genomics is the role that adaptive evolution of the core genome plays in diversification and adaptation of bacterial species, and how this might differ between groups of bacteria occupying different environmental circumstances. The genus Campylobacter encompasses several important human and animal enteric pathogens, with genome sequence data available for eight species. We estimate the Campylobacter core genome at 647 genes, with 92.5% of the nonrecombinant core genome loci under positive selection in at least one lineage and the same gene frequently under positive selection in multiple lineages. Tests are provided that reject recombination, saturation, and variation in codon usage bias as factors contributing to this high level of selection. We suggest this genome-wide adaptive evolution may result from a Red Queen macroevolutionary dynamic, in which species are involved in competition for resources within the mammalian and/or vertebrate gastrointestinal tract. Much reduced levels of positive selection evident in Streptococcus, as reported by the authors in an earlier work, may be a consequence of these taxa inhabiting less species-rich habitats, and more unique niches. Despite many common loci under positive selection in multiple Campylobacter lineages, we found no evidence for molecular adaptive convergence at the level of the same or adjacent codons, or even protein domains. Taken collectively, these results describe the diversification of a bacterial genus that involves pervasive natural selection pressure across virtually the entire genome, with this adaptation occurring in different ways in different lineages, despite the species tendency toward a common gastrointestinal habitat. PMID:19304960

  16. Territorial Polymers and Large Scale Genome Organization

    NASA Astrophysics Data System (ADS)

    Grosberg, Alexander

    2012-02-01

    Chromatin fiber in interphase nucleus represents effectively a very long polymer packed in a restricted volume. Although polymer models of chromatin organization were considered, most of them disregard the fact that DNA has to stay not too entangled in order to function properly. One polymer model with no entanglements is the melt of unknotted unconcatenated rings. Extensive simulations indicate that rings in the melt at large length (monomer numbers) N approach the compact state, with gyration radius scaling as N^1/3, suggesting every ring being compact and segregated from the surrounding rings. The segregation is consistent with the known phenomenon of chromosome territories. Surface exponent β (describing the number of contacts between neighboring rings scaling as N^β) appears only slightly below unity, β 0.95. This suggests that the loop factor (probability to meet for two monomers linear distance s apart) should decay as s^-γ, where γ= 2 - β is slightly above one. The later result is consistent with HiC data on real human interphase chromosomes, and does not contradict to the older FISH data. The dynamics of rings in the melt indicates that the motion of one ring remains subdiffusive on the time scale well above the stress relaxation time.

  17. Genomic analyses of bacterial porin-cytochrome gene clusters

    DOE PAGES

    Shi, Liang; Fredrickson, James K.; Zachara, John M.

    2014-11-26

    In this study, the porin-cytochrome (Pcc) protein complex is responsible for trans-outer membrane electron transfer during extracellular reduction of Fe(III) by the dissimilatory metal-reducing bacterium Geobacter sulfurreducens PCA. The identified and characterized Pcc complex of G. sulfurreducens PCA consists of a porin-like outer-membrane protein, a periplasmic 8-heme c type cytochrome (c-Cyt) and an outer-membrane 12-heme c-Cyt, and the genes encoding the Pcc proteins are clustered in the same regions of genome (i.e., the pcc gene clusters) of G. sulfurreducens PCA. A survey of additionally microbial genomes has identified the pcc gene clusters in all sequenced Geobacter spp. and other bacteriamore » from six different phyla, including Anaeromyxobacter dehalogenans 2CP-1, A. dehalogenans 2CP-C, Anaeromyxobacter sp. K, Candidatus Kuenenia stuttgartiensis, Denitrovibrio acetiphilus DSM 12809, Desulfurispirillum indicum S5, Desulfurivibrio alkaliphilus AHT2, Desulfurobacterium thermolithotrophum DSM 11699, Desulfuromonas acetoxidans DSM 684, Ignavibacterium album JCM 16511, and Thermovibrio ammonificans HB-1. The numbers of genes in the pcc gene clusters vary, ranging from two to nine. Similar to the metal-reducing (Mtr) gene clusters of other Fe(III)-reducing bacteria, such as Shewanella spp., additional genes that encode putative c-Cyts with predicted cellular localizations at the cytoplasmic membrane, periplasm and outer membrane often associate with the pcc gene clusters. This suggests that the Pcc-associated c-Cyts may be part of the pathways for extracellular electron transfer reactions. The presence of pcc gene clusters in the microorganisms that do not reduce solid-phase Fe(III) and Mn(IV) oxides, such as D. alkaliphilus AHT2 and I. album JCM 16511, also suggests that some of the pcc gene clusters may be involved in extracellular electron transfer reactions with the substrates other than Fe(III) and Mn(IV) oxides.« less

  18. Endozoicomonas genomes reveal functional adaptation and plasticity in bacterial strains symbiotically associated with diverse marine hosts

    PubMed Central

    Neave, Matthew J.; Michell, Craig T.; Apprill, Amy; Voolstra, Christian R.

    2017-01-01

    Endozoicomonas bacteria are globally distributed and often abundantly associated with diverse marine hosts including reef-building corals, yet their function remains unknown. In this study we generated novel Endozoicomonas genomes from single cells and metagenomes obtained directly from the corals Stylophora pistillata, Pocillopora verrucosa, and Acropora humilis. We then compared these culture-independent genomes to existing genomes of bacterial isolates acquired from a sponge, sea slug, and coral to examine the functional landscape of this enigmatic genus. Sequencing and analysis of single cells and metagenomes resulted in four novel genomes with 60–76% and 81–90% genome completeness, respectively. These data also confirmed that Endozoicomonas genomes are large and are not streamlined for an obligate endosymbiotic lifestyle, implying that they have free-living stages. All genomes show an enrichment of genes associated with carbon sugar transport and utilization and protein secretion, potentially indicating that Endozoicomonas contribute to the cycling of carbohydrates and the provision of proteins to their respective hosts. Importantly, besides these commonalities, the genomes showed evidence for differential functional specificity and diversification, including genes for the production of amino acids. Given this metabolic diversity of Endozoicomonas we propose that different genotypes play disparate roles and have diversified in concert with their hosts. PMID:28094347

  19. Large-scale investigation of genomic markers for severe periodontitis.

    PubMed

    Suzuki, Asami; Ji, Guijin; Numabe, Yukihiro; Ishii, Keisuke; Muramatsu, Masaaki; Kamoi, Kyuichi

    2004-09-01

    The purpose of the present study was to investigate the genomic markers for periodontitis, using large-scale single-nucleotide polymorphism (SNP) association studies comparing healthy volunteers and patients with periodontitis. Genomic DNA was obtained from 19 healthy volunteers and 22 patients with severe periodontitis, all of whom were Japanese. The subjects were genotyped at 637 SNPs in 244 genes on a large scale, using the TaqMan polymerase chain reaction (PCR) system. Statistically significant differences in allele and genotype frequencies were analyzed with Fisher's exact test. We found statistically significant differences (P < 0.01) between the healthy volunteers and patients with severe periodontitis in the following genes; gonadotropin-releasing hormone 1 (GNRH1), phosphatidylinositol 3-kinase regulatory 1 (PIK3R1), dipeptidylpeptidase 4 (DPP4), fibrinogen-like 2 (FGL2), and calcitonin receptor (CALCR). These results suggest that SNPs in the GNRH1, PIK3R1, DPP4, FGL2, and CALCR genes are genomic markers for severe periodontitis. Our findings indicate the necessity of analyzing SNPs in genes on a large scale (i.e., genome-wide approach), to identify genomic markers for periodontitis.

  20. Genome-scale engineering for systems and synthetic biology

    PubMed Central

    Esvelt, Kevin M; Wang, Harris H

    2013-01-01

    Genome-modification technologies enable the rational engineering and perturbation of biological systems. Historically, these methods have been limited to gene insertions or mutations at random or at a few pre-defined locations across the genome. The handful of methods capable of targeted gene editing suffered from low efficiencies, significant labor costs, or both. Recent advances have dramatically expanded our ability to engineer cells in a directed and combinatorial manner. Here, we review current technologies and methodologies for genome-scale engineering, discuss the prospects for extending efficient genome modification to new hosts, and explore the implications of continued advances toward the development of flexibly programmable chasses, novel biochemistries, and safer organismal and ecological engineering. PMID:23340847

  1. Genome-scale engineering for systems and synthetic biology.

    PubMed

    Esvelt, Kevin M; Wang, Harris H

    2013-01-01

    Genome-modification technologies enable the rational engineering and perturbation of biological systems. Historically, these methods have been limited to gene insertions or mutations at random or at a few pre-defined locations across the genome. The handful of methods capable of targeted gene editing suffered from low efficiencies, significant labor costs, or both. Recent advances have dramatically expanded our ability to engineer cells in a directed and combinatorial manner. Here, we review current technologies and methodologies for genome-scale engineering, discuss the prospects for extending efficient genome modification to new hosts, and explore the implications of continued advances toward the development of flexibly programmable chasses, novel biochemistries, and safer organismal and ecological engineering.

  2. CRISPR-Cas: From the Bacterial Adaptive Immune System to a Versatile Tool for Genome Engineering.

    PubMed

    Kirchner, Marion; Schneider, Sabine

    2015-11-09

    The field of biology has been revolutionized by the recent advancement of an adaptive bacterial immune system as a universal genome engineering tool. Bacteria and archaea use repetitive genomic elements termed clustered regularly interspaced short palindromic repeats (CRISPR) in combination with an RNA-guided nuclease (CRISPR-associated nuclease: Cas) to target and destroy invading DNA. By choosing the appropriate sequence of the guide RNA, this two-component system can be used to efficiently modify, target, and edit genomic loci of interest in plants, insects, fungi, mammalian cells, and whole organisms. This has opened up new frontiers in genome engineering, including the potential to treat or cure human genetic disorders. Now the potential risks as well as the ethical, social, and legal implications of this powerful new technique move into the limelight.

  3. Genomic analyses of bacterial porin-cytochrome gene clusters

    SciTech Connect

    Shi, Liang; Fredrickson, James K.; Zachara, John M.

    2014-11-26

    In this study, the porin-cytochrome (Pcc) protein complex is responsible for trans-outer membrane electron transfer during extracellular reduction of Fe(III) by the dissimilatory metal-reducing bacterium Geobacter sulfurreducens PCA. The identified and characterized Pcc complex of G. sulfurreducens PCA consists of a porin-like outer-membrane protein, a periplasmic 8-heme c type cytochrome (c-Cyt) and an outer-membrane 12-heme c-Cyt, and the genes encoding the Pcc proteins are clustered in the same regions of genome (i.e., the pcc gene clusters) of G. sulfurreducens PCA. A survey of additionally microbial genomes has identified the pcc gene clusters in all sequenced Geobacter spp. and other bacteria from six different phyla, including Anaeromyxobacter dehalogenans 2CP-1, A. dehalogenans 2CP-C, Anaeromyxobacter sp. K, Candidatus Kuenenia stuttgartiensis, Denitrovibrio acetiphilus DSM 12809, Desulfurispirillum indicum S5, Desulfurivibrio alkaliphilus AHT2, Desulfurobacterium thermolithotrophum DSM 11699, Desulfuromonas acetoxidans DSM 684, Ignavibacterium album JCM 16511, and Thermovibrio ammonificans HB-1. The numbers of genes in the pcc gene clusters vary, ranging from two to nine. Similar to the metal-reducing (Mtr) gene clusters of other Fe(III)-reducing bacteria, such as Shewanella spp., additional genes that encode putative c-Cyts with predicted cellular localizations at the cytoplasmic membrane, periplasm and outer membrane often associate with the pcc gene clusters. This suggests that the Pcc-associated c-Cyts may be part of the pathways for extracellular electron transfer reactions. The presence of pcc gene clusters in the microorganisms that do not reduce solid-phase Fe(III) and Mn(IV) oxides, such as D. alkaliphilus AHT2 and I. album JCM 16511, also suggests that some of the pcc gene clusters may be involved in extracellular

  4. Whole Genome Sequence Analysis of Pig Respiratory Bacterial Pathogens with Elevated Minimum Inhibitory Concentrations for Macrolides.

    PubMed

    Dayao, Denise Ann Estarez; Seddon, Jennifer M; Gibson, Justine S; Blackall, Patrick J; Turni, Conny

    2016-10-01

    Macrolides are often used to treat and control bacterial pathogens causing respiratory disease in pigs. This study analyzed the whole genome sequences of one clinical isolate of Actinobacillus pleuropneumoniae, Haemophilus parasuis, Pasteurella multocida, and Bordetella bronchiseptica, all isolated from Australian pigs to identify the mechanism underlying the elevated minimum inhibitory concentrations (MICs) for erythromycin, tilmicosin, or tulathromycin. The H. parasuis assembled genome had a nucleotide transition at position 2059 (A to G) in the six copies of the 23S rRNA gene. This mutation has previously been associated with macrolide resistance but this is the first reported mechanism associated with elevated macrolide MICs in H. parasuis. There was no known macrolide resistance mechanism identified in the other three bacterial genomes. However, strA and sul2, aminoglycoside and sulfonamide resistance genes, respectively, were detected in one contiguous sequence (contig 1) of A. pleuropneumoniae assembled genome. This contig was identical to plasmids previously identified in Pasteurellaceae. This study has provided one possible explanation of elevated MICs to macrolides in H. parasuis. Further studies are necessary to clarify the mechanism causing the unexplained macrolide resistance in other Australian pig respiratory pathogens including the role of efflux systems, which were detected in all analyzed genomes.

  5. Genome resequencing in Populus: Revealing large-scale genome variation and implications on specialized-trait genomics

    SciTech Connect

    Muchero, Wellington; Labbe, Jessy L; Priya, Ranjan; DiFazio, Steven P; Tuskan, Gerald A

    2014-01-01

    To date, Populus ranks among a few plant species with a complete genome sequence and other highly developed genomic resources. With the first genome sequence among all tree species, Populus has been adopted as a suitable model organism for genomic studies in trees. However, far from being just a model species, Populus is a key renewable economic resource that plays a significant role in providing raw materials for the biofuel and pulp and paper industries. Therefore, aside from leading frontiers of basic tree molecular biology and ecological research, Populus leads frontiers in addressing global economic challenges related to fuel and fiber production. The latter fact suggests that research aimed at improving quality and quantity of Populus as a raw material will likely drive the pursuit of more targeted and deeper research in order to unlock the economic potential tied in molecular biology processes that drive this tree species. Advances in genome sequence-driven technologies, such as resequencing individual genotypes, which in turn facilitates large scale SNP discovery and identification of large scale polymorphisms are key determinants of future success in these initiatives. In this treatise we discuss implications of genome sequence-enable technologies on Populus genomic and genetic studies of complex and specialized-traits.

  6. Unraveling the 3D genome: genomics tools for multi-scale exploration

    PubMed Central

    Risca, Viviana I.; Greenleaf, William J.

    2015-01-01

    A decade of rapid method development has begun to yield exciting insights into the three-dimensional architecture of the metazoan genome and the roles it may play in regulating transcription. We review here core methods and new tools in the modern genomicist’s toolbox at three length scales, ranging from single base pair to megabase scale chromosomal domains, and discuss the emerging picture of the 3D genome that these tools have revealed. Blind spots remain, especially at intermediate length scales spanning a few nucleosomes, but thanks in part to new technologies that permit targeted alteration of chromatin states and time-resolved studies, the next decade holds great promise for hypothesis-driven research into the mechanisms that drive genome architecture and transcriptional regulation. PMID:25887733

  7. Identification of a Bacteria-Specific Binding Protein from the Sequenced Bacterial Genome.

    PubMed

    Kong, Minsuk; Ryu, Sangryeol

    2016-01-01

    Novel and specific recognition elements are of central importance in the development of a pathogen detection method. Here, we describe a simple method for identifying the cell-wall binding domain (CBD) from a sequenced bacterial genome employing homology search for phage lysin genes. A putative CBD (CPF369_CBD) was identified from a genome of Clostridium perfringens type strain ATCC 13124, and its function was studied with the CBDGFP fusion protein recombinantly expressed in Escherichia coli. Fluorescence microscopy showed the specific binding of the fusion protein to C. perfringens cells, which demonstrates the potential of this method for the identification of novel bioprobes for specific detection of pathogenic bacteria.

  8. Compaction of bacterial genomic DNA: clarifying the concepts.

    PubMed

    Joyeux, Marc

    2015-09-30

    The unconstrained genomic DNA of bacteria forms a coil, whose volume exceeds 1000 times the volume of the cell. Since prokaryotes lack a membrane-bound nucleus, in sharp contrast with eukaryotes, the DNA may consequently be expected to occupy the whole available volume when constrained to fit in the cell. Still, it has been known for more than half a century that the DNA is localized in a well-defined region of the cell, called the nucleoid, which occupies only 15% to 25% of the total volume. Although this problem has focused the attention of many scientists in recent decades, there is still no certainty concerning the mechanism that enables such a dramatic compaction. The goal of this Topical Review is to take stock of our knowledge on this question by listing all possible compaction mechanisms with the proclaimed desire to clarify the physical principles they are based upon and discuss them in the light of experimental results and the results of simulations based on coarse-grained models. In particular, the fundamental differences between ψ-condensation and segregative phase separation and between the condensation by small and long polycations are highlighted. This review suggests that the importance of certain mechanisms, like supercoiling and the architectural properties of DNA-bridging and DNA-bending nucleoid proteins, may have been overestimated, whereas other mechanisms, like segregative phase separation and the self-association of nucleoid proteins, as well as the possible role of the synergy of two or more mechanisms, may conversely deserve more attention.

  9. Convergence of DNA methylation and phosphorothioation epigenetics in bacterial genomes.

    PubMed

    Chen, Chao; Wang, Lianrong; Chen, Si; Wu, Xiaolin; Gu, Meijia; Chen, Xi; Jiang, Susu; Wang, Yunfu; Deng, Zixin; Dedon, Peter C; Chen, Shi

    2017-04-11

    Explosive growth in the study of microbial epigenetics has revealed a diversity of chemical structures and biological functions of DNA modifications in restriction-modification (R-M) and basic genetic processes. Here, we describe the discovery of shared consensus sequences for two seemingly unrelated DNA modification systems, (6m)A methylation and phosphorothioation (PT), in which sulfur replaces a nonbridging oxygen in the DNA backbone. Mass spectrometric analysis of DNA from Escherichia coli B7A and Salmonella enterica serovar Cerro 87, strains possessing PT-based R-M genes, revealed d(GPS(6m)A) dinucleotides in the GPS(6m)AAC consensus representing ∼5% of the 1,100 to 1,300 PT-modified d(GPSA) motifs per genome, with (6m)A arising from a yet-to-be-identified methyltransferase. To further explore PT and (6m)A in another consensus sequence, GPS(6m)ATC, we engineered a strain of E. coli HST04 to express Dnd genes from Hahella chejuensis KCTC2396 (PT in GPSATC) and Dam methyltransferase from E. coli DH10B ((6m)A in G(6m)ATC). Based on this model, in vitro studies revealed reduced Dam activity in GPSATC-containing oligonucleotides whereas single-molecule real-time sequencing of HST04 DNA revealed (6m)A in all 2,058 GPSATC sites (5% of 37,698 total GATC sites). This model system also revealed temperature-sensitive restriction by DndFGH in KCTC2396 and B7A, which was exploited to discover that (6m)A can substitute for PT to confer resistance to restriction by the DndFGH system. These results point to complex but unappreciated interactions between DNA modification systems and raise the possibility of coevolution of interacting systems to facilitate the function of each.

  10. Diversification of bacterial genome content through distinct mechanisms over different timescales

    PubMed Central

    Croucher, Nicholas J.; Coupland, Paul G.; Stevenson, Abbie E.; Callendrello, Alanna; Bentley, Stephen D.; Hanage, William P.

    2014-01-01

    Bacterial populations often consist of multiple co-circulating lineages. Determining how such population structures arise requires understanding what drives bacterial diversification. Using 616 systematically sampled genomes, we show that Streptococcus pneumoniae lineages are typically characterized by combinations of infrequently transferred stable genomic islands: those moving primarily through transformation, along with integrative and conjugative elements and phage-related chromosomal islands. The only lineage containing extensive unique sequence corresponds to a set of atypical unencapsulated isolates that may represent a distinct species. However, prophage content is highly variable even within lineages, suggesting frequent horizontal transmission that would necessitate rapidly diversifying anti-phage mechanisms to prevent these viruses sweeping through populations. Correspondingly, two loci encoding Type I restriction-modification systems able to change their specificity over short timescales through intragenomic recombination are ubiquitous across the collection. Hence short-term pneumococcal variation is characterized by movement of phage and intragenomic rearrangements, with the slower transfer of stable loci distinguishing lineages. PMID:25407023

  11. The CRISPR-Cas system - from bacterial immunity to genome engineering.

    PubMed

    Czarnek, Maria; Bereta, Joanna

    2016-09-01

    Precise and efficient genome modifications present a great value in attempts to comprehend the roles of particular genes and other genetic elements in biological processes as well as in various pathologies. In recent years novel methods of genome modification known as genome editing, which utilize so called "programmable" nucleases, came into use. A true revolution in genome editing has been brought about by the introduction of the CRISP-Cas (clustered regularly interspaced short palindromic repeats-CRISPR associated) system, in which one of such nucleases, i.e. Cas9, plays a major role. This system is based on the elements of the bacterial and archaeal mechanism responsible for acquired immunity against phage infections and transfer of foreign genetic material. Microorganisms incorporate fragments of foreign DNA into CRISPR loci present in their genomes, which enables fast recognition and elimination of future infections. There are several types of CRISPR-Cas systems among prokaryotes but only elements of CRISPR type II are employed in genome engineering. CRISPR-Cas type II utilizes small RNA molecules (crRNA and tracrRNA) to precisely direct the effector nuclease - Cas9 - to a specific site in the genome, i.e. to the sequence complementary to crRNA. Cas9 may be used to: (i) introduce stable changes into genomes e.g. in the process of generation of knock-out and knock-in animals and cell lines, (ii) activate or silence the expression of a gene of interest, and (iii) visualize specific sites in genomes of living cells. The CRISPR-Cas-based tools have been successfully employed for generation of animal and cell models of a number of diseases, e.g. specific types of cancer. In the future, the genome editing by programmable nucleases may find wide application in medicine e.g. in the therapies of certain diseases of genetic origin and in the therapy of HIV-infected patients.

  12. Genome scale transcriptomics of baculovirus-insect interactions.

    PubMed

    Nguyen, Quan; Nielsen, Lars K; Reid, Steven

    2013-11-12

    Baculovirus-insect cell technologies are applied in the production of complex proteins, veterinary and human vaccines, gene delivery vectors' and biopesticides. Better understanding of how baculoviruses and insect cells interact would facilitate baculovirus-based production. While complete genomic sequences are available for over 58 baculovirus species, little insect genomic information is known. The release of the Bombyx mori and Plutella xylostella genomes, the accumulation of EST sequences for several Lepidopteran species, and especially the availability of two genome-scale analysis tools, namely oligonucleotide microarrays and next generation sequencing (NGS), have facilitated expression studies to generate a rich picture of insect gene responses to baculovirus infections. This review presents current knowledge on the interaction dynamics of the baculovirus-insect system' which is relatively well studied in relation to nucleocapsid transportation, apoptosis, and heat shock responses, but is still poorly understood regarding responses involved in pro-survival pathways, DNA damage pathways, protein degradation, translation, signaling pathways, RNAi pathways, and importantly metabolic pathways for energy, nucleotide and amino acid production. We discuss how the two genome-scale transcriptomic tools can be applied for studying such pathways and suggest that proteomics and metabolomics can produce complementary findings to transcriptomic studies.

  13. Large-scale production of magnetic nanoparticles using bacterial fermentation.

    PubMed

    Moon, Ji-Won; Rawn, Claudia J; Rondinone, Adam J; Love, Lonnie J; Roh, Yul; Everett, S Michelle; Lauf, Robert J; Phelps, Tommy J

    2010-10-01

    Production of both nano-sized particles of crystalline pure phase magnetite and magnetite substituted with Co, Ni, Cr, Mn, Zn or the rare earths for some of the Fe has been demonstrated using microbial processes. This microbial production of magnetic nanoparticles can be achieved in large quantities and at low cost. In these experiments, over 1 kg (wet weight) of Zn-substituted magnetite (nominal composition of Zn(0.6)Fe(2.4)O4) was recovered from 30 l fermentations. Transmission electron microscopy (TEM) was used to confirm that the extracellular magnetites exhibited good mono-dispersity. TEM results also showed a highly reproducible particle size and corroborated average crystallite size (ACS) of 13.1 ± 0.8 nm determined through X-ray diffraction (N = 7) at a 99% confidence level. Based on scale-up experiments performed using a 35-l reactor, the increase in ACS reproducibility may be attributed to a combination of factors including an increase of electron donor input, availability of divalent substitution metal ions and fewer ferrous ions in the case of substituted magnetite, and increased reactor volume overcoming differences in each batch. Commercial nanometer sized magnetite (25-50 nm) may cost $500/kg. However, microbial processes are potentially capable of producing 5-90 nm pure or substituted magnetites at a fraction of the cost of traditional chemical synthesis. While there are numerous approaches for the synthesis of nanoparticles, bacterial fermentation of magnetite or metal-substituted magnetite may represent an advantageous manufacturing technology with respect to yield, reproducibility and scalable synthesis with low costs at low energy input.

  14. Scalable Parameter Estimation for Genome-Scale Biochemical Reaction Networks

    PubMed Central

    Kaltenbacher, Barbara; Hasenauer, Jan

    2017-01-01

    Mechanistic mathematical modeling of biochemical reaction networks using ordinary differential equation (ODE) models has improved our understanding of small- and medium-scale biological processes. While the same should in principle hold for large- and genome-scale processes, the computational methods for the analysis of ODE models which describe hundreds or thousands of biochemical species and reactions are missing so far. While individual simulations are feasible, the inference of the model parameters from experimental data is computationally too intensive. In this manuscript, we evaluate adjoint sensitivity analysis for parameter estimation in large scale biochemical reaction networks. We present the approach for time-discrete measurement and compare it to state-of-the-art methods used in systems and computational biology. Our comparison reveals a significantly improved computational efficiency and a superior scalability of adjoint sensitivity analysis. The computational complexity is effectively independent of the number of parameters, enabling the analysis of large- and genome-scale models. Our study of a comprehensive kinetic model of ErbB signaling shows that parameter estimation using adjoint sensitivity analysis requires a fraction of the computation time of established methods. The proposed method will facilitate mechanistic modeling of genome-scale cellular processes, as required in the age of omics. PMID:28114351

  15. Genome-scale metabolic representation of Amycolatopsis balhimycina.

    PubMed

    Vongsangnak, Wanwipa; Figueiredo, Luís Filipe; Förster, Jochen; Weber, Tilmann; Thykaer, Jette; Stegmann, Evi; Wohlleben, Wolfgang; Nielsen, Jens

    2012-07-01

    Infection caused by methicillin-resistant Staphylococcus aureus (MRSA) is an increasing societal problem. Typically, glycopeptide antibiotics are used in the treatment of these infections. The most comprehensively studied glycopeptide antibiotic biosynthetic pathway is that of balhimycin biosynthesis in Amycolatopsis balhimycina. The balhimycin yield obtained by A. balhimycina is, however, low and there is therefore a need to improve balhimycin production. In this study, we performed genome sequencing, assembly and annotation analysis of A. balhimycina and further used these annotated data to reconstruct a genome-scale metabolic model for the organism. Here we generated an almost complete A. balhimycina genome sequence comprising 10,562,587 base pairs assembled into 2,153 contigs. The high GC-genome (∼ 69%) includes 8,585 open reading frames (ORFs). We used our integrative toolbox called SEQTOR for functional annotation and then integrated annotated data with biochemical and physiological information available for this organism to reconstruct a genome-scale metabolic model of A. balhimycina. The resulting metabolic model contains 583 ORFs as protein encoding genes (7% of the predicted 8,585 ORFs), 407 EC numbers, 647 metabolites and 1,363 metabolic reactions. During the analysis of the metabolic model, linear, quadratic and evolutionary programming algorithms using flux balance analysis (FBA), minimization of metabolic adjustment (MOMA), and OptGene, respectively were applied as well as phenotypic behavior and improved balhimycin production were simulated. The A. balhimycina model shows a good agreement between in silico data and experimental data and also identifies key reactions associated with increased balhimycin production. The reconstruction of the genome-scale metabolic model of A. balhimycina serves as a basis for physiological characterization. The model allows a rational design of engineering strategies for increasing balhimycin production in A

  16. Automation on the generation of genome-scale metabolic models.

    PubMed

    Reyes, R; Gamermann, D; Montagud, A; Fuente, D; Triana, J; Urchueguía, J F; de Córdoba, P Fernández

    2012-12-01

    Nowadays, the reconstruction of genome-scale metabolic models is a nonautomatized and interactive process based on decision making. This lengthy process usually requires a full year of one person's work in order to satisfactory collect, analyze, and validate the list of all metabolic reactions present in a specific organism. In order to write this list, one manually has to go through a huge amount of genomic, metabolomic, and physiological information. Currently, there is no optimal algorithm that allows one to automatically go through all this information and generate the models taking into account probabilistic criteria of unicity and completeness that a biologist would consider. This work presents the automation of a methodology for the reconstruction of genome-scale metabolic models for any organism. The methodology that follows is the automatized version of the steps implemented manually for the reconstruction of the genome-scale metabolic model of a photosynthetic organism, Synechocystis sp. PCC6803. The steps for the reconstruction are implemented in a computational platform (COPABI) that generates the models from the probabilistic algorithms that have been developed. For validation of the developed algorithm robustness, the metabolic models of several organisms generated by the platform have been studied together with published models that have been manually curated. Network properties of the models, like connectivity and average shortest mean path of the different models, have been compared and analyzed.

  17. Large-scale data mining pilot project in human genome

    SciTech Connect

    Musick, R.; Fidelis, R.; Slezak, T.

    1997-05-01

    This whitepaper briefly describes a new, aggressive effort in large- scale data Livermore National Labs. The implications of `large- scale` will be clarified Section. In the short term, this effort will focus on several @ssion-critical questions of Genome project. We will adapt current data mining techniques to the Genome domain, to quantify the accuracy of inference results, and lay the groundwork for a more extensive effort in large-scale data mining. A major aspect of the approach is that we will be fully-staffed data warehousing effort in the human Genome area. The long term goal is strong applications- oriented research program in large-@e data mining. The tools, skill set gained will be directly applicable to a wide spectrum of tasks involving a for large spatial and multidimensional data. This includes applications in ensuring non-proliferation, stockpile stewardship, enabling Global Ecology (Materials Database Industrial Ecology), advancing the Biosciences (Human Genome Project), and supporting data for others (Battlefield Management, Health Care).

  18. Rapid single-colony whole-genome sequencing of bacterial pathogens

    PubMed Central

    Köser, Claudio U.; Fraser, Louise J.; Ioannou, Avgousta; Becq, Jennifer; Ellington, Matthew J.; Holden, Matthew T. G.; Reuter, Sandra; Török, M. Estée; Bentley, Stephen D.; Parkhill, Julian; Gormley, Niall A.; Smith, Geoffrey P.; Peacock, Sharon J.

    2014-01-01

    Objectives As a result of the introduction of rapid benchtop sequencers, the time required to subculture a bacterial pathogen to extract sufficient DNA for library preparation can now exceed the time to sequence said DNA. We have eliminated this rate-limiting step by developing a protocol to generate DNA libraries for whole-genome sequencing directly from single bacterial colonies grown on primary culture plates. Methods We developed our protocol using single colonies of 17 bacterial pathogens responsible for severe human infection that were grown using standard diagnostic media and incubation conditions. We then applied this method to four clinical scenarios that currently require time-consuming reference laboratory tests: full identification and genotyping of salmonellae; identification of blaNDM-1, a highly transmissible carbapenemase resistance gene, in Klebsiella pneumoniae; detection of genes encoding staphylococcal toxins associated with specific disease syndromes; and monitoring of vaccine targets to detect vaccine escape in Neisseria meningitidis. Results We validated our single-colony whole-genome sequencing protocol for all 40 combinations of pathogen and selective, non-selective or indicator media tested in this study. Moreover, we demonstrated the clinical value of this method compared with current reference laboratory tests. Conclusions This advance will facilitate the implementation of whole-genome sequencing into diagnostic and public health microbiology. PMID:24370932

  19. bcgTree: automatized phylogenetic tree building from bacterial core genomes.

    PubMed

    Ankenbrand, Markus J; Keller, Alexander

    2016-10-01

    The need for multi-gene analyses in scientific fields such as phylogenetics and DNA barcoding has increased in recent years. In particular, these approaches are increasingly important for differentiating bacterial species, where reliance on the standard 16S rDNA marker can result in poor resolution. Additionally, the assembly of bacterial genomes has become a standard task due to advances in next-generation sequencing technologies. We created a bioinformatic pipeline, bcgTree, which uses assembled bacterial genomes either from databases or own sequencing results from the user to reconstruct their phylogenetic history. The pipeline automatically extracts 107 essential single-copy core genes, found in a majority of bacteria, using hidden Markov models and performs a partitioned maximum-likelihood analysis. Here, we describe the workflow of bcgTree and, as a proof-of-concept, its usefulness in resolving the phylogeny of 293 publically available bacterial strains of the genus Lactobacillus. We also evaluate its performance in both low- and high-level taxonomy test sets. The tool is freely available at github ( https://github.com/iimog/bcgTree ) and our institutional homepage ( http://www.dna-analytics.biozentrum.uni-wuerzburg.de ).

  20. Comparative Analysis of Lacinutrix Genomes and Their Association with Bacterial Habitat

    PubMed Central

    Lee, Yung Mi; Kim, Mi-Kyeong; Ahn, Do Hwan; Kim, Han-Woo; Park, Hyun; Shin, Seung Chul

    2016-01-01

    The genus Lacinutrix, which belongs to the family Flavobacteriaceae, consists of seven bacterial species that were mainly isolated from marine life and sediments. As most bacteria in the family Flavobacteriaceae favor aerobic conditions, the seven bacterial species in the genus Lacinutrix also showed aerobic growth. We selected four monophyletic bacterial species living in a polar environment. Two of these species were isolated from sediment and two types were isolated from algae. In a comparative analysis, we investigated how these different environments were related to genomic features of these four species in the genus Lacinutrix. We found that the gene sets for glycolysis, the Krebs cycle, and oxidative phosphorylation were conserved in these four type strains. However, the presence of nitrous oxide reductase for denitrification and the absence of essential components related to thiamin biosynthesis for aerobic respiration were only found in isolates from sediment. Elevated bacterial metabolism on the surface of marine sediments might limit the oxygen penetration into sediment, and such an environment might affect the genomes of bacteria isolated from these habitats. PMID:26882010

  1. An evaluation of Comparative Genome Sequencing (CGS) by comparing two previously-sequenced bacterial genomes

    PubMed Central

    Herring, Christopher D; Palsson, Bernhard Ø

    2007-01-01

    Background With the development of new technology, it has recently become practical to resequence the genome of a bacterium after experimental manipulation. It is critical though to know the accuracy of the technique used, and to establish confidence that all of the mutations were detected. Results In order to evaluate the accuracy of genome resequencing using the microarray-based Comparative Genome Sequencing service provided by Nimblegen Systems Inc., we resequenced the E. coli strain W3110 Kohara using MG1655 as a reference, both of which have been completely sequenced using traditional sequencing methods. CGS detected 7 of 8 small sequence differences, one large deletion, and 9 of 12 IS element insertions present in W3110, but did not detect a large chromosomal inversion. In addition, we confirmed that CGS also detected 2 SNPs, one deletion and 7 IS element insertions that are not present in the genome sequence, which we attribute to changes that occurred after the creation of the W3110 lambda clone library. The false positive rate for SNPs was one per 244 Kb of genome sequence. Conclusion CGS is an effective way to detect multiple mutations present in one bacterium relative to another, and while highly cost-effective, is prone to certain errors. Mutations occurring in repeated sequences or in sequences with a high degree of secondary structure may go undetected. It is also critical to follow up on regions of interest in which SNPs were not called because they often indicate deletions or IS element insertions. PMID:17697331

  2. Evaluation of genome-enabled selection for bacterial cold water disease resistance using progeny performance data in Rainbow Trout: Insights on genotyping methods and genomic prediction models

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Bacterial cold water disease (BCWD) causes significant economic losses in salmonid aquaculture, and traditional family-based breeding programs aimed at improving BCWD resistance have been limited to exploiting only between-family variation. We used genomic selection (GS) models to predict genomic br...

  3. Identification and analysis of integrons and cassette arrays in bacterial genomes

    PubMed Central

    Cury, Jean; Jové, Thomas; Touchon, Marie; Néron, Bertrand; Rocha, Eduardo PC

    2016-01-01

    Integrons recombine gene arrays and favor the spread of antibiotic resistance. Their broader roles in bacterial adaptation remain mysterious, partly due to lack of computational tools. We made a program – IntegronFinder – to identify integrons with high accuracy and sensitivity. IntegronFinder is available as a standalone program and as a web application. It searches for attC sites using covariance models, for integron-integrases using HMM profiles, and for other features (promoters, attI site) using pattern matching. We searched for integrons, integron-integrases lacking attC sites, and clusters of attC sites lacking a neighboring integron-integrase in bacterial genomes. All these elements are especially frequent in genomes of intermediate size. They are missing in some key phyla, such as α-Proteobacteria, which might reflect selection against cell lineages that acquire integrons. The similarity between attC sites is proportional to the number of cassettes in the integron, and is particularly low in clusters of attC sites lacking integron-integrases. The latter are unexpectedly abundant in genomes lacking integron-integrases or their remains, and have a large novel pool of cassettes lacking homologs in the databases. They might represent an evolutionary step between the acquisition of genes within integrons and their stabilization in the new genome. PMID:27130947

  4. Identification and analysis of integrons and cassette arrays in bacterial genomes.

    PubMed

    Cury, Jean; Jové, Thomas; Touchon, Marie; Néron, Bertrand; Rocha, Eduardo Pc

    2016-06-02

    Integrons recombine gene arrays and favor the spread of antibiotic resistance. Their broader roles in bacterial adaptation remain mysterious, partly due to lack of computational tools. We made a program - IntegronFinder - to identify integrons with high accuracy and sensitivity. IntegronFinder is available as a standalone program and as a web application. It searches for attC sites using covariance models, for integron-integrases using HMM profiles, and for other features (promoters, attI site) using pattern matching. We searched for integrons, integron-integrases lacking attC sites, and clusters of attC sites lacking a neighboring integron-integrase in bacterial genomes. All these elements are especially frequent in genomes of intermediate size. They are missing in some key phyla, such as α-Proteobacteria, which might reflect selection against cell lineages that acquire integrons. The similarity between attC sites is proportional to the number of cassettes in the integron, and is particularly low in clusters of attC sites lacking integron-integrases. The latter are unexpectedly abundant in genomes lacking integron-integrases or their remains, and have a large novel pool of cassettes lacking homologs in the databases. They might represent an evolutionary step between the acquisition of genes within integrons and their stabilization in the new genome.

  5. Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees and Supermatrices

    PubMed Central

    Lang, Jenna Morgan; Darling, Aaron E.; Eisen, Jonathan A.

    2013-01-01

    Over 3000 microbial (bacterial and archaeal) genomes have been made publically available to date, providing an unprecedented opportunity to examine evolutionary genomic trends and offering valuable reference data for a variety of other studies such as metagenomics. The utility of these genome sequences is greatly enhanced when we have an understanding of how they are phylogenetically related to each other. Therefore, we here describe our efforts to reconstruct the phylogeny of all available bacterial and archaeal genomes. We identified 24, single-copy, ubiquitous genes suitable for this phylogenetic analysis. We used two approaches to combine the data for the 24 genes. First, we concatenated alignments of all genes into a single alignment from which a Maximum Likelihood (ML) tree was inferred using RAxML. Second, we used a relatively new approach to combining gene data, Bayesian Concordance Analysis (BCA), as implemented in the BUCKy software, in which the results of 24 single-gene phylogenetic analyses are used to generate a “primary concordance” tree. A comparison of the concatenated ML tree and the primary concordance (BUCKy) tree reveals that the two approaches give similar results, relative to a phylogenetic tree inferred from the 16S rRNA gene. After comparing the results and the methods used, we conclude that the current best approach for generating a single phylogenetic tree, suitable for use as a reference phylogeny for comparative analyses, is to perform a maximum likelihood analysis of a concatenated alignment of conserved, single-copy genes. PMID:23638103

  6. Bacterial Genomic Data Analysis in the Next-Generation Sequencing Era.

    PubMed

    Orsini, Massimiliano; Cuccuru, Gianmauro; Uva, Paolo; Fotia, Giorgio

    2016-01-01

    Bacterial genome sequencing is now an affordable choice for many laboratories for applications in research, diagnostic, and clinical microbiology. Nowadays, an overabundance of tools is available for genomic data analysis. However, tools differ for algorithms, languages, hardware requirements, and user interface, and combining them as it is necessary for sequence data interpretation often requires (bio)informatics skills which can be difficult to find in many laboratories. In addition, multiple data sources, as well as exceedingly large dataset sizes, and increasingly computational complexity further challenge the accessibility, reproducibility, and transparency of the entire process. In this chapter we will cover the main bioinformatics steps required for a complete bacterial genome analysis using next-generation sequencing data, from the raw sequence data to assembled and annotated genomes. All the tools described are available in the Orione framework ( http://orione.crs4.it ), which uniquely combines in a transparent way the most used open source bioinformatics tools for microbiology, allowing microbiologist without any specific hardware or informatics skill to conduct data-intensive computational analyses from quality control to microbial gene annotation.

  7. Applying Shannon's information theory to bacterial and phage genomes and metagenomes

    PubMed Central

    Akhter, Sajia; Bailey, Barbara A.; Salamon, Peter; Aziz, Ramy K.; Edwards, Robert A.

    2013-01-01

    All sequence data contain inherent information that can be measured by Shannon's uncertainty theory. Such measurement is valuable in evaluating large data sets, such as metagenomic libraries, to prioritize their analysis and annotation, thus saving computational resources. Here, Shannon's index of complete phage and bacterial genomes was examined. The information content of a genome was found to be highly dependent on the genome length, GC content, and sequence word size. In metagenomic sequences, the amount of information correlated with the number of matches found by comparison to sequence databases. A sequence with more information (higher uncertainty) has a higher probability of being significantly similar to other sequences in the database. Measuring uncertainty may be used for rapid screening for sequences with matches in available database, prioritizing computational resources, and indicating which sequences with no known similarities are likely to be important for more detailed analysis. PMID:23301154

  8. Applying Shannon's information theory to bacterial and phage genomes and metagenomes

    NASA Astrophysics Data System (ADS)

    Akhter, Sajia; Bailey, Barbara A.; Salamon, Peter; Aziz, Ramy K.; Edwards, Robert A.

    2013-01-01

    All sequence data contain inherent information that can be measured by Shannon's uncertainty theory. Such measurement is valuable in evaluating large data sets, such as metagenomic libraries, to prioritize their analysis and annotation, thus saving computational resources. Here, Shannon's index of complete phage and bacterial genomes was examined. The information content of a genome was found to be highly dependent on the genome length, GC content, and sequence word size. In metagenomic sequences, the amount of information correlated with the number of matches found by comparison to sequence databases. A sequence with more information (higher uncertainty) has a higher probability of being significantly similar to other sequences in the database. Measuring uncertainty may be used for rapid screening for sequences with matches in available database, prioritizing computational resources, and indicating which sequences with no known similarities are likely to be important for more detailed analysis.

  9. ClonalFrameML: Efficient Inference of Recombination in Whole Bacterial Genomes

    PubMed Central

    Didelot, Xavier; Wilson, Daniel J.

    2015-01-01

    Recombination is an important evolutionary force in bacteria, but it remains challenging to reconstruct the imports that occurred in the ancestry of a genomic sample. Here we present ClonalFrameML, which uses maximum likelihood inference to simultaneously detect recombination in bacterial genomes and account for it in phylogenetic reconstruction. ClonalFrameML can analyse hundreds of genomes in a matter of hours, and we demonstrate its usefulness on simulated and real datasets. We find evidence for recombination hotspots associated with mobile elements in Clostridium difficile ST6 and a previously undescribed 310kb chromosomal replacement in Staphylococcus aureus ST582. ClonalFrameML is freely available at http://clonalframeml.googlecode.com/. PMID:25675341

  10. MEMOSys: Bioinformatics platform for genome-scale metabolic models

    PubMed Central

    2011-01-01

    Background Recent advances in genomic sequencing have enabled the use of genome sequencing in standard biological and biotechnological research projects. The challenge is how to integrate the large amount of data in order to gain novel biological insights. One way to leverage sequence data is to use genome-scale metabolic models. We have therefore designed and implemented a bioinformatics platform which supports the development of such metabolic models. Results MEMOSys (MEtabolic MOdel research and development System) is a versatile platform for the management, storage, and development of genome-scale metabolic models. It supports the development of new models by providing a built-in version control system which offers access to the complete developmental history. Moreover, the integrated web board, the authorization system, and the definition of user roles allow collaborations across departments and institutions. Research on existing models is facilitated by a search system, references to external databases, and a feature-rich comparison mechanism. MEMOSys provides customizable data exchange mechanisms using the SBML format to enable analysis in external tools. The web application is based on the Java EE framework and offers an intuitive user interface. It currently contains six annotated microbial metabolic models. Conclusions We have developed a web-based system designed to provide researchers a novel application facilitating the management and development of metabolic models. The system is freely available at http://www.icbi.at/MEMOSys. PMID:21276275

  11. Metabolic Complementarity and Genomics of the Dual Bacterial Symbiosis of Sharpshooters

    PubMed Central

    Wu, Dongying; Daugherty, Sean C; Van Aken, Susan E; Pai, Grace H; Watkins, Kisha L; Khouri, Hoda; Tallon, Luke J; Zaborsky, Jennifer M; Dunbar, Helen E; Tran, Phat L; Moran, Nancy A

    2006-01-01

    Mutualistic intracellular symbiosis between bacteria and insects is a widespread phenomenon that has contributed to the global success of insects. The symbionts, by provisioning nutrients lacking from diets, allow various insects to occupy or dominate ecological niches that might otherwise be unavailable. One such insect is the glassy-winged sharpshooter (Homalodisca coagulata), which feeds on xylem fluid, a diet exceptionally poor in organic nutrients. Phylogenetic studies based on rRNA have shown two types of bacterial symbionts to be coevolving with sharpshooters: the gamma-proteobacterium Baumannia cicadellinicola and the Bacteroidetes species Sulcia muelleri. We report here the sequencing and analysis of the 686,192–base pair genome of B. cicadellinicola and approximately 150 kilobase pairs of the small genome of S. muelleri, both isolated from H. coagulata. Our study, which to our knowledge is the first genomic analysis of an obligate symbiosis involving multiple partners, suggests striking complementarity in the biosynthetic capabilities of the two symbionts: B. cicadellinicola devotes a substantial portion of its genome to the biosynthesis of vitamins and cofactors required by animals and lacks most amino acid biosynthetic pathways, whereas S. muelleri apparently produces most or all of the essential amino acids needed by its host. This finding, along with other results of our genome analysis, suggests the existence of metabolic codependency among the two unrelated endosymbionts and their insect host. This dual symbiosis provides a model case for studying correlated genome evolution and genome reduction involving multiple organisms in an intimate, obligate mutualistic relationship. In addition, our analysis provides insight for the first time into the differences in symbionts between insects (e.g., aphids) that feed on phloem versus those like H. coagulata that feed on xylem. Finally, the genomes of these two symbionts provide potential targets for

  12. Meta-analysis of general bacterial subclades in whole-genome phylogenies using tree topology profiling.

    PubMed

    Meinel, Thomas; Krause, Antje

    2012-01-01

    In the last two decades, a large number of whole-genome phylogenies have been inferred to reconstruct the Tree of Life (ToL). Underlying data models range from gene or functionality content in species to phylogenetic gene family trees and multiple sequence alignments of concatenated protein sequences. Diversity in data models together with the use of different tree reconstruction techniques, disruptive biological effects and the steadily increasing number of genomes have led to a huge diversity in published phylogenies. Comparison of those and, moreover, identification of the impact of inference properties (underlying data model, inference technique) on particular reconstructions is almost impossible. In this work, we introduce tree topology profiling as a method to compare already published whole-genome phylogenies. This method requires visual determination of the particular topology in a drawn whole-genome phylogeny for a set of particular bacterial clans. For each clan, neighborhoods to other bacteria are collected into a catalogue of generalized alternative topologies. Particular topology alternatives found for an ordered list of bacterial clans reveal a topology profile that represents the analyzed phylogeny. To simulate the inhomogeneity of published gene content phylogenies we generate a set of seven phylogenies using different inference techniques and the SYSTERS-PhyloMatrix data model. After tree topology profiling on in total 54 selected published and newly inferred phylogenies, we separate artefactual from biologically meaningful phylogenies and associate particular inference results (phylogenies) with inference background (inference techniques as well as data models). Topological relationships of particular bacterial species groups are presented. With this work we introduce tree topology profiling into the scientific field of comparative phylogenomics.

  13. Minimum Core Genome Sequence Typing of Bacterial Pathogens: a Unified Approach for Clinical and Public Health Microbiology

    PubMed Central

    Chen, Chen; Zhang, Wen; Zheng, Han; Lan, Ruiting; Wang, Haiyin; Du, Pengcheng; Bai, Xuemei; Ji, Shaobo; Meng, Qiong; Jin, Dong; Liu, Kai; Jing, Huaiqi; Ye, Changyun; Gao, George F.; Wang, Lei; Gottschalk, Marcelo

    2013-01-01

    Bacterial pathogens impose a heavy health burden worldwide. In the new era of high-throughput sequencing and online bioinformatics, real-time genome typing of infecting agents, and in particular those with potential severe clinical outcomes, holds promise for guiding clinical care to limit the detrimental effects of infections and to prevent potential local or global outbreaks. Here, we sequenced and compared 85 isolates of Streptococcus suis, a zoonotic human and swine pathogen, wherein we analyzed 32 recognized serotypes and 75 sequence types representing the diversity of the species and the human clinical isolates with high public health significance. We found that 1,077 of the 2,469 genes are shared by all isolates. Excluding 201 common but mobile genes, 876 genes were defined as the minimum core genome (MCG) of the species. Of 190,894 single-nucleotide polymorphisms (SNPs) identified, 58,501 were located in the MCG genes and were referred to as MCG SNPs. A population structure analysis of these MCG SNPs classified the 85 isolates into seven MCG groups, of which MCG group 1 includes all isolates from human infections and outbreaks. Our MCG typing system for S. suis provided a clear separation of groups containing human-associated isolates from those containing animal-associated isolates. It also separated the group containing outbreak isolates, including those causing life-threatening streptococcal toxic shock-like syndrome, from sporadic or less severe meningitis or bacteremia-only isolates. The typing system facilitates the application of genome data to the fields of clinical medicine and epidemiology and to the surveillance of S. suis. The MCG groups may also be used as the taxonomical units of S. suis to define bacterial subpopulations with the potential to cause severe clinical infections and large-scale outbreaks. PMID:23720795

  14. Minimum core genome sequence typing of bacterial pathogens: a unified approach for clinical and public health microbiology.

    PubMed

    Chen, Chen; Zhang, Wen; Zheng, Han; Lan, Ruiting; Wang, Haiyin; Du, Pengcheng; Bai, Xuemei; Ji, Shaobo; Meng, Qiong; Jin, Dong; Liu, Kai; Jing, Huaiqi; Ye, Changyun; Gao, George F; Wang, Lei; Gottschalk, Marcelo; Xu, Jianguo

    2013-08-01

    Bacterial pathogens impose a heavy health burden worldwide. In the new era of high-throughput sequencing and online bioinformatics, real-time genome typing of infecting agents, and in particular those with potential severe clinical outcomes, holds promise for guiding clinical care to limit the detrimental effects of infections and to prevent potential local or global outbreaks. Here, we sequenced and compared 85 isolates of Streptococcus suis, a zoonotic human and swine pathogen, wherein we analyzed 32 recognized serotypes and 75 sequence types representing the diversity of the species and the human clinical isolates with high public health significance. We found that 1,077 of the 2,469 genes are shared by all isolates. Excluding 201 common but mobile genes, 876 genes were defined as the minimum core genome (MCG) of the species. Of 190,894 single-nucleotide polymorphisms (SNPs) identified, 58,501 were located in the MCG genes and were referred to as MCG SNPs. A population structure analysis of these MCG SNPs classified the 85 isolates into seven MCG groups, of which MCG group 1 includes all isolates from human infections and outbreaks. Our MCG typing system for S. suis provided a clear separation of groups containing human-associated isolates from those containing animal-associated isolates. It also separated the group containing outbreak isolates, including those causing life-threatening streptococcal toxic shock-like syndrome, from sporadic or less severe meningitis or bacteremia-only isolates. The typing system facilitates the application of genome data to the fields of clinical medicine and epidemiology and to the surveillance of S. suis. The MCG groups may also be used as the taxonomical units of S. suis to define bacterial subpopulations with the potential to cause severe clinical infections and large-scale outbreaks.

  15. Correlation Between Heterogeneous Bacterial Attachment Rate Coefficients and Hydraulic Conductivity and Impacts on Field-Scale Bacterial Transport

    SciTech Connect

    Scheibe, Timothy D.

    2002-10-28

    In granular porous media, bacterial transport is often modeled using the advection-dispersion transport equation, modified to account for interactions between the bacteria and grain surfaces (attachment and detachment) using a linear kinetic reaction model. In this paper we examine the relationships among the parameters of the above model in the context of bacterial transport for bioaugmentation. In this context, we wish to quantify the distance to which significant concentrations of bacteria can be transported, as well as the uniformity with which they can be distributed within the subsurface. Because kinetic detachment rates (Kr) are typically much smaller than corresponding attachment rates (Kf), the attachment rate exerts primary control on the distance of bacterial transport. Hydraulic conductivity (K) also plays a significant role because of its direct relationship to the advective velocity and its typically high degree of spatial variability at field scales. Because Kf is related to the velocity, grain size, and porosity of the medium, as is K, we expect that there exists correlation between these two parameters. Previous investigators have assumed a form of correlation between Kf and ln(K) based in part on reparameterization of clean-bed filtration equations in terms of published relations between grain size, effective porosity, and ln(K). The hypotheses examined here are that (1) field-scale relationships between K and Kf can be developed by combining a number of theoretical and empirical results in the context of a heterogeneous aquifer flow model (following a similar approach to previous investigators with some extensions), and (2) correlation between K and Kf will enhance the distance of field-scale bacterial transport in granular aquifers. We test these hypotheses using detailed numerical models and observations of field-scale bacterial transport in a shallow sandy aquifer within the South Oyster Site near Oyster, Virginia, USA.

  16. Units of plasticity in bacterial genomes: new insight from the comparative genomics of two bacteria interacting with invertebrates, Photorhabdus and Xenorhabdus

    PubMed Central

    2010-01-01

    Background Flexible genomes facilitate bacterial evolution and are classically organized into polymorphic strain-specific segments called regions of genomic plasticity (RGPs). Using a new web tool, RGPFinder, we investigated plasticity units in bacterial genomes, by exhaustive description of the RGPs in two Photorhabdus and two Xenorhabdus strains, belonging to the Enterobacteriaceae and interacting with invertebrates (insects and nematodes). Results RGPs account for about 60% of the genome in each of the four genomes studied. We classified RGPs into genomic islands (GIs), prophages and two new classes of RGP without the features of classical mobile genetic elements (MGEs) but harboring genes encoding enzymes catalyzing DNA recombination (RGPmob), or with no remarkable feature (RGPnone). These new classes accounted for most of the RGPs and are probably hypervariable regions, ancient MGEs with degraded mobilization machinery or non canonical MGEs for which the mobility mechanism has yet to be described. We provide evidence that not only the GIs and the prophages, but also RGPmob and RGPnone, have a mosaic structure consisting of modules. A module is a block of genes, 0.5 to 60 kb in length, displaying a conserved genomic organization among the different Enterobacteriaceae. Modules are functional units involved in host/environment interactions (22-31%), metabolism (22-27%), intracellular or intercellular DNA mobility (13-30%), drug resistance (4-5%) and antibiotic synthesis (3-6%). Finally, in silico comparisons and PCR multiplex analysis indicated that these modules served as plasticity units within the bacterial genome during genome speciation and as deletion units in clonal variants of Photorhabdus. Conclusions This led us to consider the modules, rather than the entire RGP, as the true unit of plasticity in bacterial genomes, during both short-term and long-term genome evolution. PMID:20950463

  17. Complete Genome Sequence of Ichthyobacterium seriolicida JBKA-6T, Isolated from Yellowtail (Seriola quinqueradiata) Affected by Bacterial Hemolytic Jaundice

    PubMed Central

    Nakamura, Yoji; Matsuyama, Tomomasa; Sakai, Takamitsu; Shigenobu, Yuya; Sugaya, Takuma; Yasuike, Motoshige; Fujiwara, Atushi; Kondo, Hidehiro; Hirono, Ikuo; Fukuda, Yutaka; Nakayasu, Chihaya

    2017-01-01

    ABSTRACT Ichthyobacterium seriolicida is a fish bacterial pathogen that causes hemolytic jaundice in farmed yellowtail in Japan. To understand more about the characteristics of this bacterium, we determined its complete genome sequence. Two hemolysin genes which may be important for its pathogenicity were identified in the I. seriolicida genome. PMID:28183761

  18. Genomic divergences among cattle, dog and human estimated from large-scale alignments of genomic sequences

    PubMed Central

    Liu, George E; Matukumalli, Lakshmi K; Sonstegard, Tad S; Shade, Larry L; Van Tassell, Curtis P

    2006-01-01

    Background Approximately 11 Mb of finished high quality genomic sequences were sampled from cattle, dog and human to estimate genomic divergences and their regional variation among these lineages. Results Optimal three-way multi-species global sequence alignments for 84 cattle clones or loci (each >50 kb of genomic sequence) were constructed using the human and dog genome assemblies as references. Genomic divergences and substitution rates were examined for each clone and for various sequence classes under different functional constraints. Analysis of these alignments revealed that the overall genomic divergences are relatively constant (0.32–0.37 change/site) for pairwise comparisons among cattle, dog and human; however substitution rates vary across genomic regions and among different sequence classes. A neutral mutation rate (2.0–2.2 × 10(-9) change/site/year) was derived from ancestral repetitive sequences, whereas the substitution rate in coding sequences (1.1 × 10(-9) change/site/year) was approximately half of the overall rate (1.9–2.0 × 10(-9) change/site/year). Relative rate tests also indicated that cattle have a significantly faster rate of substitution as compared to dog and that this difference is about 6%. Conclusion This analysis provides a large-scale and unbiased assessment of genomic divergences and regional variation of substitution rates among cattle, dog and human. It is expected that these data will serve as a baseline for future mammalian molecular evolution studies. PMID:16759380

  19. Conserved gene clusters in bacterial genomes provide further support for the primacy of RNA

    NASA Technical Reports Server (NTRS)

    Siefert, J. L.; Martin, K. A.; Abdi, F.; Widger, W. R.; Fox, G. E.

    1997-01-01

    Five complete bacterial genome sequences have been released to the scientific community. These include four (eu)Bacteria, Haemophilus influenzae, Mycoplasma genitalium, M. pneumoniae, and Synechocystis PCC 6803, as well as one Archaeon, Methanococcus jannaschii. Features of organization shared by these genomes are likely to have arisen very early in the history of the bacteria and thus can be expected to provide further insight into the nature of early ancestors. Results of a genome comparison of these five organisms confirm earlier observations that gene order is remarkably unpreserved. There are, nevertheless, at least 16 clusters of two or more genes whose order remains the same among the four (eu)Bacteria and these are presumed to reflect conserved elements of coordinated gene expression that require gene proximity. Eight of these gene orders are essentially conserved in the Archaea as well. Many of these clusters are known to be regulated by RNA-level mechanisms in Escherichia coli, which supports the earlier suggestion that this type of regulation of gene expression may have arisen very early. We conclude that although the last common ancestor may have had a DNA genome, it likely was preceded by progenotes with an RNA genome.

  20. An empirical strategy for characterizing bacterial proteomes across species in the absence of genomic sequences.

    PubMed

    Turse, Joshua E; Marshall, Matthew J; Fredrickson, James K; Lipton, Mary S; Callister, Stephen J

    2010-11-12

    Global protein identification through current proteomics methods typically depends on the availability of sequenced genomes. In spite of increasingly high throughput sequencing technologies, this information is not available for every microorganism and rarely available for entire microbial communities. Nevertheless, the protein-level homology that exists between related bacteria makes it possible to extract biological information from the proteome of an organism or microbial community by using the genomic sequences of a near neighbor organism. Here, we demonstrate a trans-organism search strategy for determining the extent to which near-neighbor genome sequences can be applied to identify proteins in unsequenced environmental isolates. In proof of concept testing, we found that within a CLUSTAL W distance of 0.089, near-neighbor genomes successfully identified a high percentage of proteins within an organism. Application of this strategy to characterize environmental bacterial isolates lacking sequenced genomes, but having 16S rDNA sequence similarity to Shewanella resulted in the identification of 300-500 proteins in each strain. The majority of identified pathways mapped to core processes, as well as to processes unique to the Shewanellae, in particular to the presence of c-type cytochromes. Examples of core functional categories include energy metabolism, protein and nucleotide synthesis and cofactor biosynthesis, allowing classification of bacteria by observation of conserved processes. Additionally, within these core functionalities, we observed proteins involved in the alternative lactate utilization pathway, recently described in Shewanella.

  1. In-Yeast Engineering of a Bacterial Genome Using CRISPR/Cas9.

    PubMed

    Tsarmpopoulos, Iason; Gourgues, Géraldine; Blanchard, Alain; Vashee, Sanjay; Jores, Joerg; Lartigue, Carole; Sirand-Pugnet, Pascal

    2016-01-15

    One remarkable achievement in synthetic biology was the reconstruction of mycoplasma genomes and their cloning in yeast where they can be modified using available genetic tools. Recently, CRISPR/Cas9 editing tools were developed for yeast mutagenesis. Here, we report their adaptation for the engineering of bacterial genomes cloned in yeast. A seamless deletion of the mycoplasma glycerol-3-phosphate oxidase-encoding gene (glpO) was achieved without selection in one step, using 90 nt paired oligonucleotides as templates to drive recombination. Screening of the resulting clones revealed that more than 20% contained the desired deletion. After manipulation, the overall integrity of the cloned mycoplasma genome was verified by multiplex PCR and PFGE. Finally, the edited genome was back-transplanted into a mycoplasma recipient cell. In accordance with the deletion of glpO, the mutant mycoplasma was affected in the production of H2O2. This work paves the way to high-throughput manipulation of natural or synthetic genomes in yeast.

  2. Rapid phylogenetic analysis of large samples of recombinant bacterial whole genome sequences using Gubbins

    PubMed Central

    Croucher, Nicholas J.; Page, Andrew J.; Connor, Thomas R.; Delaney, Aidan J.; Keane, Jacqueline A.; Bentley, Stephen D.; Parkhill, Julian; Harris, Simon R.

    2015-01-01

    The emergence of new sequencing technologies has facilitated the use of bacterial whole genome alignments for evolutionary studies and outbreak analyses. These datasets, of increasing size, often include examples of multiple different mechanisms of horizontal sequence transfer resulting in substantial alterations to prokaryotic chromosomes. The impact of these processes demands rapid and flexible approaches able to account for recombination when reconstructing isolates’ recent diversification. Gubbins is an iterative algorithm that uses spatial scanning statistics to identify loci containing elevated densities of base substitutions suggestive of horizontal sequence transfer while concurrently constructing a maximum likelihood phylogeny based on the putative point mutations outside these regions of high sequence diversity. Simulations demonstrate the algorithm generates highly accurate reconstructions under realistically parameterized models of bacterial evolution, and achieves convergence in only a few hours on alignments of hundreds of bacterial genome sequences. Gubbins is appropriate for reconstructing the recent evolutionary history of a variety of haploid genotype alignments, as it makes no assumptions about the underlying mechanism of recombination. The software is freely available for download at github.com/sanger-pathogens/Gubbins, implemented in Python and C and supported on Linux and Mac OS X. PMID:25414349

  3. Bacterial genospecies that are not ecologically coherent: population genomics of Rhizobium leguminosarum

    PubMed Central

    Kumar, Nitin; Lad, Ganesh; Giuntini, Elisa; Kaye, Maria E.; Udomwong, Piyachat; Shamsani, N. Jannah; Young, J. Peter W.; Bailly, Xavier

    2015-01-01

    Biological species may remain distinct because of genetic isolation or ecological adaptation, but these two aspects do not always coincide. To establish the nature of the species boundary within a local bacterial population, we characterized a sympatric population of the bacterium Rhizobium leguminosarum by genomic sequencing of 72 isolates. Although all strains have 16S rRNA typical of R. leguminosarum, they fall into five genospecies by the criterion of average nucleotide identity (ANI). Many genes, on plasmids as well as the chromosome, support this division: recombination of core genes has been largely within genospecies. Nevertheless, variation in ecological properties, including symbiotic host range and carbon-source utilization, cuts across these genospecies, so that none of these phenotypes is diagnostic of genospecies. This phenotypic variation is conferred by mobile genes. The genospecies meet the Mayr criteria for biological species in respect of their core genes, but do not correspond to coherent ecological groups, so periodic selection may not be effective in purging variation within them. The population structure is incompatible with traditional ‘polyphasic taxonomy′ that requires bacterial species to have both phylogenetic coherence and distinctive phenotypes. More generally, genomics has revealed that many bacterial species share adaptive modules by horizontal gene transfer, and we envisage a more consistent taxonomic framework that explicitly recognizes this. Significant phenotypes should be recognized as ‘biovars' within species that are defined by core gene phylogeny. PMID:25589577

  4. Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populations

    DOE PAGES

    Bendall, Matthew L.; Stevens, Sarah L.R.; Chan, Leong-Keat; ...

    2016-01-08

    Multiple models describe the formation and evolution of distinct microbial phylogenetic groups. These evolutionary models make different predictions regarding how adaptive alleles spread through populations and how genetic diversity is maintained. Processes predicted by competing evolutionary models, for example, genome-wide selective sweeps vs gene-specific sweeps, could be captured in natural populations using time-series metagenomics if the approach were applied over a sufficiently long time frame. Direct observations of either process would help resolve how distinct microbial groups evolve. Using a 9-year metagenomic study of a freshwater lake (2005–2013), we explore changes in single-nucleotide polymorphism (SNP) frequencies and patterns of genemore » gain and loss in 30 bacterial populations. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied by >1000-fold among populations. SNP allele frequencies also changed dramatically over time within some populations. Interestingly, nearly all SNP variants were slowly purged over several years from one population of green sulfur bacteria, while at the same time multiple genes either swept through or were lost from this population. Furthermore, these patterns were consistent with a genome-wide selective sweep in progress, a process predicted by the ‘ecotype model’ of speciation but not previously observed in nature. In contrast, other populations contained large, SNP-free genomic regions that appear to have swept independently through the populations prior to the study without purging diversity elsewhere in the genome. Finally, evidence for both genome-wide and gene-specific sweeps suggests that different models of bacterial speciation may apply to different populations coexisting in the same environment.« less

  5. Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populations

    PubMed Central

    Bendall, Matthew L; Stevens, Sarah LR; Chan, Leong-Keat; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Froula, Jeff; Kang, Dongwan; Tringe, Susannah G; Bertilsson, Stefan; Moran, Mary A; Shade, Ashley; Newton, Ryan J; McMahon, Katherine D; Malmstrom, Rex R

    2016-01-01

    Multiple models describe the formation and evolution of distinct microbial phylogenetic groups. These evolutionary models make different predictions regarding how adaptive alleles spread through populations and how genetic diversity is maintained. Processes predicted by competing evolutionary models, for example, genome-wide selective sweeps vs gene-specific sweeps, could be captured in natural populations using time-series metagenomics if the approach were applied over a sufficiently long time frame. Direct observations of either process would help resolve how distinct microbial groups evolve. Here, from a 9-year metagenomic study of a freshwater lake (2005–2013), we explore changes in single-nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in 30 bacterial populations. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied by >1000-fold among populations. SNP allele frequencies also changed dramatically over time within some populations. Interestingly, nearly all SNP variants were slowly purged over several years from one population of green sulfur bacteria, while at the same time multiple genes either swept through or were lost from this population. These patterns were consistent with a genome-wide selective sweep in progress, a process predicted by the ‘ecotype model' of speciation but not previously observed in nature. In contrast, other populations contained large, SNP-free genomic regions that appear to have swept independently through the populations prior to the study without purging diversity elsewhere in the genome. Evidence for both genome-wide and gene-specific sweeps suggests that different models of bacterial speciation may apply to different populations coexisting in the same environment. PMID:26744812

  6. A genomic perspective on a new bacterial genus and species from the Alcaligenaceae family, Basilea psittacipulmonis

    PubMed Central

    2014-01-01

    Background A novel Gram-negative, non-haemolytic, non-motile, rod-shaped bacterium was discovered in the lungs of a dead parakeet (Melopsittacus undulatus) that was kept in captivity in a petshop in Basel, Switzerland. The organism is described with a chemotaxonomic profile and the nearly complete genome sequence obtained through the assembly of short sequence reads. Results Genome sequence analysis and characterization of respiratory quinones, fatty acids, polar lipids, and biochemical phenotype is presented here. Comparison of gene sequences revealed that the most similar species is Pelistega europaea, with BLAST identities of only 93% to the 16S rDNA gene, 76% identity to the rpoB gene, and a similar GC content (~43%) as the organism isolated from the parakeet, DSM 24701 (40%). The closest full genome sequences are those of Bordetella spp. and Taylorella spp. High-throughput sequencing reads from the Illumina-Solexa platform were assembled with the Edena de novo assembler to form 195 contigs comprising the ~2 Mb genome. Genome annotation with RAST, construction of phylogenetic trees with the 16S rDNA (rrs) gene sequence and the rpoB gene, and phylogenetic placement using other highly conserved marker genes with ML Tree all suggest that the bacterial species belongs to the Alcaligenaceae family. Analysis of samples from cages with healthy parakeets suggested that the newly discovered bacterial species is not widespread in parakeet living quarters. Conclusions Classification of this organism in the current taxonomy system requires the formation of a new genus and species. We designate the new genus Basilea and the new species psittacipulmonis. The type strain of Basilea psittacipulmonis is DSM 24701 (= CIP 110308 T, 16S rDNA gene sequence Genbank accession number JX412111 and GI 406042063). PMID:24581117

  7. Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populations

    SciTech Connect

    Bendall, Matthew L.; Stevens, Sarah L.R.; Chan, Leong-Keat; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Froula, Jeff; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary A.; Shade, Ashley; Newton, Ryan J.; McMahon, Katherine D.; Malmstrom, Rex R.

    2016-01-08

    Multiple models describe the formation and evolution of distinct microbial phylogenetic groups. These evolutionary models make different predictions regarding how adaptive alleles spread through populations and how genetic diversity is maintained. Processes predicted by competing evolutionary models, for example, genome-wide selective sweeps vs gene-specific sweeps, could be captured in natural populations using time-series metagenomics if the approach were applied over a sufficiently long time frame. Direct observations of either process would help resolve how distinct microbial groups evolve. Using a 9-year metagenomic study of a freshwater lake (2005–2013), we explore changes in single-nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in 30 bacterial populations. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied by >1000-fold among populations. SNP allele frequencies also changed dramatically over time within some populations. Interestingly, nearly all SNP variants were slowly purged over several years from one population of green sulfur bacteria, while at the same time multiple genes either swept through or were lost from this population. Furthermore, these patterns were consistent with a genome-wide selective sweep in progress, a process predicted by the ‘ecotype model’ of speciation but not previously observed in nature. In contrast, other populations contained large, SNP-free genomic regions that appear to have swept independently through the populations prior to the study without purging diversity elsewhere in the genome. Finally, evidence for both genome-wide and gene-specific sweeps suggests that different models of bacterial speciation may apply to different populations coexisting in the same environment.

  8. Genomic Analysis of Caldithrix abyssi, the Thermophilic Anaerobic Bacterium of the Novel Bacterial Phylum Calditrichaeota

    PubMed Central

    Kublanov, Ilya V.; Sigalova, Olga M.; Gavrilov, Sergey N.; Lebedinsky, Alexander V.; Rinke, Christian; Kovaleva, Olga; Chernyh, Nikolai A.; Ivanova, Natalia; Daum, Chris; Reddy, T.B.K.; Klenk, Hans-Peter; Spring, Stefan; Göker, Markus; Reva, Oleg N.; Miroshnichenko, Margarita L.; Kyrpides, Nikos C.; Woyke, Tanja; Gelfand, Mikhail S.; Bonch-Osmolovskaya, Elizaveta A.

    2017-01-01

    The genome of Caldithrix abyssi, the first cultivated representative of a phylum-level bacterial lineage, was sequenced within the framework of Genomic Encyclopedia of Bacteria and Archaea (GEBA) project. The genomic analysis revealed mechanisms allowing this anaerobic bacterium to ferment peptides or to implement nitrate reduction with acetate or molecular hydrogen as electron donors. The genome encoded five different [NiFe]- and [FeFe]-hydrogenases, one of which, group 1 [NiFe]-hydrogenase, is presumably involved in lithoheterotrophic growth, three other produce H2 during fermentation, and one is apparently bidirectional. The ability to reduce nitrate is determined by a nitrate reductase of the Nap family, while nitrite reduction to ammonia is presumably catalyzed by an octaheme cytochrome c nitrite reductase εHao. The genome contained genes of respiratory polysulfide/thiosulfate reductase, however, elemental sulfur and thiosulfate were not used as the electron acceptors for anaerobic respiration with acetate or H2, probably due to the lack of the gene of the maturation protein. Nevertheless, elemental sulfur and thiosulfate stimulated growth on fermentable substrates (peptides), being reduced to sulfide, most probably through the action of the cytoplasmic sulfide dehydrogenase and/or NAD(P)-dependent [NiFe]-hydrogenase (sulfhydrogenase) encoded by the genome. Surprisingly, the genome of this anaerobic microorganism encoded all genes for cytochrome c oxidase, however, its maturation machinery seems to be non-operational due to genomic rearrangements of supplementary genes. Despite the fact that sugars were not among the substrates reported when C. abyssi was first described, our genomic analysis revealed multiple genes of glycoside hydrolases, and some of them were predicted to be secreted. This finding aided in bringing out four carbohydrates that supported the growth of C. abyssi: starch, cellobiose, glucomannan and xyloglucan. The genomic analysis

  9. Reliability and applications of statistical methods based on oligonucleotide frequencies in bacterial and archaeal genomes

    PubMed Central

    Bohlin, Jon; Skjerve, Eystein; Ussery, David W

    2008-01-01

    Background The increasing number of sequenced prokaryotic genomes contains a wealth of genomic data that needs to be effectively analysed. A set of statistical tools exists for such analysis, but their strengths and weaknesses have not been fully explored. The statistical methods we are concerned with here are mainly used to examine similarities between archaeal and bacterial DNA from different genomes. These methods compare observed genomic frequencies of fixed-sized oligonucleotides with expected values, which can be determined by genomic nucleotide content, smaller oligonucleotide frequencies, or be based on specific statistical distributions. Advantages with these statistical methods include measurements of phylogenetic relationship with relatively small pieces of DNA sampled from almost anywhere within genomes, detection of foreign/conserved DNA, and homology searches. Our aim was to explore the reliability and best suited applications for some popular methods, which include relative oligonucleotide frequencies (ROF), di- to hexanucleotide zero'th order Markov methods (ZOM) and 2.order Markov chain Method (MCM). Tests were performed on distant homology searches with large DNA sequences, detection of foreign/conserved DNA, and plasmid-host similarity comparisons. Additionally, the reliability of the methods was tested by comparing both real and random genomic DNA. Results Our findings show that the optimal method is context dependent. ROFs were best suited for distant homology searches, whilst the hexanucleotide ZOM and MCM measures were more reliable measures in terms of phylogeny. The dinucleotide ZOM method produced high correlation values when used to compare real genomes to an artificially constructed random genome with similar %GC, and should therefore be used with care. The tetranucleotide ZOM measure was a good measure to detect horizontally transferred regions, and when used to compare the phylogenetic relationships between plasmids and hosts

  10. Genomic Analysis of Caldithrix abyssi, the Thermophilic Anaerobic Bacterium of the Novel Bacterial Phylum Calditrichaeota.

    PubMed

    Kublanov, Ilya V; Sigalova, Olga M; Gavrilov, Sergey N; Lebedinsky, Alexander V; Rinke, Christian; Kovaleva, Olga; Chernyh, Nikolai A; Ivanova, Natalia; Daum, Chris; Reddy, T B K; Klenk, Hans-Peter; Spring, Stefan; Göker, Markus; Reva, Oleg N; Miroshnichenko, Margarita L; Kyrpides, Nikos C; Woyke, Tanja; Gelfand, Mikhail S; Bonch-Osmolovskaya, Elizaveta A

    2017-01-01

    The genome of Caldithrix abyssi, the first cultivated representative of a phylum-level bacterial lineage, was sequenced within the framework of Genomic Encyclopedia of Bacteria and Archaea (GEBA) project. The genomic analysis revealed mechanisms allowing this anaerobic bacterium to ferment peptides or to implement nitrate reduction with acetate or molecular hydrogen as electron donors. The genome encoded five different [NiFe]- and [FeFe]-hydrogenases, one of which, group 1 [NiFe]-hydrogenase, is presumably involved in lithoheterotrophic growth, three other produce H2 during fermentation, and one is apparently bidirectional. The ability to reduce nitrate is determined by a nitrate reductase of the Nap family, while nitrite reduction to ammonia is presumably catalyzed by an octaheme cytochrome c nitrite reductase εHao. The genome contained genes of respiratory polysulfide/thiosulfate reductase, however, elemental sulfur and thiosulfate were not used as the electron acceptors for anaerobic respiration with acetate or H2, probably due to the lack of the gene of the maturation protein. Nevertheless, elemental sulfur and thiosulfate stimulated growth on fermentable substrates (peptides), being reduced to sulfide, most probably through the action of the cytoplasmic sulfide dehydrogenase and/or NAD(P)-dependent [NiFe]-hydrogenase (sulfhydrogenase) encoded by the genome. Surprisingly, the genome of this anaerobic microorganism encoded all genes for cytochrome c oxidase, however, its maturation machinery seems to be non-operational due to genomic rearrangements of supplementary genes. Despite the fact that sugars were not among the substrates reported when C. abyssi was first described, our genomic analysis revealed multiple genes of glycoside hydrolases, and some of them were predicted to be secreted. This finding aided in bringing out four carbohydrates that supported the growth of C. abyssi: starch, cellobiose, glucomannan and xyloglucan. The genomic analysis

  11. Quantitative analysis of correlation between AT and GC biases among bacterial genomes

    PubMed Central

    Zhang, Ge

    2017-01-01

    Due to different replication mechanisms between the leading and lagging strands, nucleotide composition asymmetries widely exist in bacterial genomes. A general consideration reveals that the leading strand is enriched in Guanine (G) and Thymine (T), and the lagging strand shows richness in Adenine (A) and Cytosine (C). However, some bacteria like Bacillus subtilis have been discovered composing more A than T in the leading strand. To investigate the difference, we analyze the nucleotide asymmetry from the aspect of AT and GC bias correlations. In this study, we propose a windowless method, the Z-curve Correlation Coefficient (ZCC) index, based on the Z-curve method, and analyzed more than 2000 bacterial genomes. We find that the majority of bacteria reveal negative correlations between AT and GC biases, while most genomes in Firmicutes and Tenericutes have positive ZCC indexes. The presence of PolC, purine asymmetry and stronger genes preference in the leading strand are not confined to Firmicutes, but also likely to happen in other phyla dominated by positive ZCC indexes. This method also provides a new insight into other relevant features like aerobism, and can be applied to analyze the correlation between RY (Purine and Pyrimidine) and MK (Amino and Keto) bias and so on. PMID:28158313

  12. Protein Mis-Termination Initiates Genetic Diseases, Cancers, and Restricts Bacterial Genome Expansion.

    PubMed

    Wong, Tit-Yee; Schwartzbach, Steve D

    2015-01-01

    Protein termination is an important cellular process. Protein termination relies on the stop-codons in the mRNA interacting properly with the releasing factors on the ribosome. One third of inherited diseases, including cancers, are associated with the mutation of the stop-codons. Many pathogens and viruses are able to manipulate their stop-codons to express their virulence. The influence of stop-codons is not limited to the primary reading frame of the genes. Stop-codons in the second and third reading frames are referred as premature stop signals (PSC). Stop-codons and PSCs together are collectively referred as stop-signals. The ratios of the stop-signals (referred as translation stop-signals ratio or TSSR) of genetically related bacteria, despite their great differences in gene contents, are much alike. This nearly identical Genomic-TSSR value of genetically related bacteria may suggest that bacterial genome expansion is limited by their unique stop-signals bias. We review the protein termination process and the different types of stop-codon mutation in plants, animals, microbes, and viruses, with special emphasis on the role of PSCs in directing bacterial evolution in their natural environments. Knowing the limit of genomic boundary could facilitate the formulation of new strategies in controlling the spread of diseases and combat antibiotic-resistant bacteria.

  13. Comparison of bacterial diversity in full scale anammox bioreactors operated under different conditions.

    PubMed

    Gonzalez-Martinez, Alejandro; Osorio, Francisco; Morillo, Jose A; Rodriguez-Sanchez, Alejandro; Gonzalez-Lopez, Jesus; Abbas, Ben A; van Loosdrecht, Mark C M

    2015-01-01

    Bacterial community structure of full-scale anammox bioreactor is still mainly unknown. It has never been analyzed whether different anammox bioreactor configurations might result in the development of different bacterial community structures among these systems. In this work, the bacterial community structure of six full-scale autotrophic nitrogen removal bioreactors located in The Netherlands and China operating under three different technologies and with different influent wastewater characteristics was studied by the means of pyrotag sequencing evaluation of the bacterial assemblage yielded a great diversity in all systems. The most represented phyla were the Bacteroidetes and the Proteobacteria, followed by the Planctomycetes. 14 OTUs were shared by all bioreactors, but none of them belonged to the Brocadiales order. Statistical analysis at OTU level showed that differences in the microbial communities were high, and that the main driver of the bacterial assemblage composition was different for the distinct phyla identified in the six bioreactors, depending on bioreactor technology or influent wastewater characteristics.

  14. Bacterial delivery of large intact genomic-DNA-containing BACs into mammalian cells.

    PubMed

    Cheung, Wing; Kotzamanis, George; Abdulrazzak, Hassan; Goussard, Sylvie; Kaname, Tadashi; Kotsinas, Athanassios; Gorgoulis, Vassilis G; Grillot-Courvalin, Catherine; Huxley, Clare

    2012-01-01

    Efficient delivery of large intact vectors into mammalian cells remains problematical. Here we evaluate delivery by bacterial invasion of two large BACs of more than 150 kb in size into various cells. First, we determined the effect of several drugs on bacterial delivery of a small plasmid into different cell lines. Most drugs tested resulted in a marginal increase of the overall efficiency of delivery in only some cell lines, except the lysosomotropic drug chloroquine, which was found to increase the efficiency of delivery by 6-fold in B16F10 cells. Bacterial invasion was found to be significantly advantageous compared with lipofection in delivering large intact BACs into mouse cells, resulting in 100% of clones containing intact DNA. Furthermore, evaluation of expression of the human hypoxanthine phosphoribosyltransferase (HPRT) gene from its genomic locus, which was present in one of the BACs, showed that single copy integrations of the HPRT-containing BAC had occurred in mouse B16F10 cells and that expression of HPRT from each human copy was 0.33 times as much as from each endogenous mouse copy. These data provide new evidence that bacterial delivery is a convenient and efficient method to transfer large intact therapeutic genes into mammalian cells.

  15. Genome-scale evidence of the nematode-arthropod clade

    PubMed Central

    Dopazo, Hernán; Dopazo, Joaquín

    2005-01-01

    Background The issue of whether coelomates form a single clade, the Coelomata, or whether all animals that moult an exoskeleton (such as the coelomate arthropods and the pseudocoelomate nematodes) form a distinct clade, the Ecdysozoa, is the most puzzling issue in animal systematics and a major open-ended subject in evolutionary biology. Previous single-gene and genome-scale analyses designed to resolve the issue have produced contradictory results. Here we present the first genome-scale phylogenetic evidence that strongly supports the Ecdysozoa hypothesis. Results Through the most extensive phylogenetic analysis carried out to date, the complete genomes of 11 eukaryotic species have been analyzed in order to find homologous sequences derived from 18 human chromosomes. Phylogenetic analysis of datasets showing an increased adjustment to equal evolutionary rates between nematode and arthropod sequences produced a gradual change from support for Coelomata to support for Ecdysozoa. Transition between topologies occurred when fast-evolving sequences of Caenorhabditis elegans were removed. When chordate, nematode and arthropod sequences were constrained to fit equal evolutionary rates, the Ecdysozoa topology was statistically accepted whereas Coelomata was rejected. Conclusions The reliability of a monophyletic group clustering arthropods and nematodes was unequivocally accepted in datasets where traces of the long-branch attraction effect were removed. This is the first phylogenomic evidence to strongly support the 'moulting clade' hypothesis. PMID:15892869

  16. Bacterial Genes in the Aphid Genome: Absence of Functional Gene Transfer from Buchnera to Its Host

    PubMed Central

    Nikoh, Naruo; McCutcheon, John P.; Kudo, Toshiaki; Miyagishima, Shin-ya; Moran, Nancy A.; Nakabachi, Atsushi

    2010-01-01

    Genome reduction is typical of obligate symbionts. In cellular organelles, this reduction partly reflects transfer of ancestral bacterial genes to the host genome, but little is known about gene transfer in other obligate symbioses. Aphids harbor anciently acquired obligate mutualists, Buchnera aphidicola (Gammaproteobacteria), which have highly reduced genomes (420–650 kb), raising the possibility of gene transfer from ancestral Buchnera to the aphid genome. In addition, aphids often harbor other bacteria that also are potential sources of transferred genes. Previous limited sampling of genes expressed in bacteriocytes, the specialized cells that harbor Buchnera, revealed that aphids acquired at least two genes from bacteria. The newly sequenced genome of the pea aphid, Acyrthosiphon pisum, presents the first opportunity for a complete inventory of genes transferred from bacteria to the host genome in the context of an ancient obligate symbiosis. Computational screening of the entire A. pisum genome, followed by phylogenetic and experimental analyses, provided strong support for the transfer of 12 genes or gene fragments from bacteria to the aphid genome: three LD–carboxypeptidases (LdcA1, LdcA2,ψLdcA), five rare lipoprotein As (RlpA1-5), N-acetylmuramoyl-L-alanine amidase (AmiD), 1,4-beta-N-acetylmuramidase (bLys), DNA polymerase III alpha chain (ψDnaE), and ATP synthase delta chain (ψAtpH). Buchnera was the apparent source of two highly truncated pseudogenes (ψDnaE and ψAtpH). Most other transferred genes were closely related to genes from relatives of Wolbachia (Alphaproteobacteria). At least eight of the transferred genes (LdcA1, AmiD, RlpA1-5, bLys) appear to be functional, and expression of seven (LdcA1, AmiD, RlpA1-5) are highly upregulated in bacteriocytes. The LdcAs and RlpAs appear to have been duplicated after transfer. Our results excluded the hypothesis that genome reduction in Buchnera has been accompanied by gene transfer to the host

  17. 13C metabolic flux analysis at a genome-scale.

    PubMed

    Gopalakrishnan, Saratram; Maranas, Costas D

    2015-11-01

    Metabolic models used in 13C metabolic flux analysis generally include a limited number of reactions primarily from central metabolism. They typically omit degradation pathways, complete cofactor balances, and atom transition contributions for reactions outside central metabolism. This study addresses the impact on prediction fidelity of scaling-up mapping models to a genome-scale. The core mapping model employed in this study accounts for (75 reactions and 65 metabolites) primarily from central metabolism. The genome-scale metabolic mapping model (GSMM) (697 reaction and 595 metabolites) is constructed using as a basis the iAF1260 model upon eliminating reactions guaranteed not to carry flux based on growth and fermentation data for a minimal glucose growth medium. Labeling data for 17 amino acid fragments obtained from cells fed with glucose labeled at the second carbon was used to obtain fluxes and ranges. Metabolic fluxes and confidence intervals are estimated, for both core and genome-scale mapping models, by minimizing the sum of square of differences between predicted and experimentally measured labeling patterns using the EMU decomposition algorithm. Overall, we find that both topology and estimated values of the metabolic fluxes remain largely consistent between core and GSM model. Stepping up to a genome-scale mapping model leads to wider flux inference ranges for 20 key reactions present in the core model. The glycolysis flux range doubles due to the possibility of active gluconeogenesis, the TCA flux range expanded by 80% due to the availability of a bypass through arginine consistent with labeling data, and the transhydrogenase reaction flux was essentially unresolved due to the presence of as many as five routes for the inter-conversion of NADPH to NADH afforded by the genome-scale model. By globally accounting for ATP demands in the GSMM model the unused ATP decreased drastically with the lower bound matching the maintenance ATP requirement. A non

  18. Genomics reveals historic and contemporary transmission dynamics of a bacterial disease among wildlife and livestock

    USGS Publications Warehouse

    Kamath, Pauline L.; Foster, Jeffrey T.; Drees, Kevin P.; Luikart, Gordon; Quance, Christine; Anderson, Neil J.; Clarke, P. Ryan; Cole, Eric K.; Drew, Mark L.; Edwards, William H.; Rhyan, Jack C.; Treanor, John J.; Wallen, Rick L.; White, Patrick J.; Robbe-Austerman, Suelee; Cross, Paul C.

    2016-01-01

    Whole-genome sequencing has provided fundamental insights into infectious disease epidemiology, but has rarely been used for examining transmission dynamics of a bacterial pathogen in wildlife. In the Greater Yellowstone Ecosystem (GYE), outbreaks of brucellosis have increased in cattle along with rising seroprevalence in elk. Here we use a genomic approach to examine Brucella abortus evolution, cross-species transmission and spatial spread in the GYE. We find that brucellosis was introduced into wildlife in this region at least five times. The diffusion rate varies among Brucella lineages (B3 to 8 km per year) and over time. We also estimate 12 host transitions from bison to elk, and 5 from elk to bison. Our results support the notion that free-ranging elk are currently a self-sustaining brucellosis reservoir and the source of livestock infections, and that control measures in bison are unlikely to affect the dynamics of unrelated strains circulating in nearby elk populations.

  19. Genomics reveals historic and contemporary transmission dynamics of a bacterial disease among wildlife and livestock.

    PubMed

    Kamath, Pauline L; Foster, Jeffrey T; Drees, Kevin P; Luikart, Gordon; Quance, Christine; Anderson, Neil J; Clarke, P Ryan; Cole, Eric K; Drew, Mark L; Edwards, William H; Rhyan, Jack C; Treanor, John J; Wallen, Rick L; White, Patrick J; Robbe-Austerman, Suelee; Cross, Paul C

    2016-05-11

    Whole-genome sequencing has provided fundamental insights into infectious disease epidemiology, but has rarely been used for examining transmission dynamics of a bacterial pathogen in wildlife. In the Greater Yellowstone Ecosystem (GYE), outbreaks of brucellosis have increased in cattle along with rising seroprevalence in elk. Here we use a genomic approach to examine Brucella abortus evolution, cross-species transmission and spatial spread in the GYE. We find that brucellosis was introduced into wildlife in this region at least five times. The diffusion rate varies among Brucella lineages (∼3 to 8 km per year) and over time. We also estimate 12 host transitions from bison to elk, and 5 from elk to bison. Our results support the notion that free-ranging elk are currently a self-sustaining brucellosis reservoir and the source of livestock infections, and that control measures in bison are unlikely to affect the dynamics of unrelated strains circulating in nearby elk populations.

  20. Genomics reveals historic and contemporary transmission dynamics of a bacterial disease among wildlife and livestock

    PubMed Central

    Kamath, Pauline L.; Foster, Jeffrey T.; Drees, Kevin P.; Luikart, Gordon; Quance, Christine; Anderson, Neil J.; Clarke, P. Ryan; Cole, Eric K.; Drew, Mark L.; Edwards, William H.; Rhyan, Jack C.; Treanor, John J.; Wallen, Rick L.; White, Patrick J.; Robbe-Austerman, Suelee; Cross, Paul C.

    2016-01-01

    Whole-genome sequencing has provided fundamental insights into infectious disease epidemiology, but has rarely been used for examining transmission dynamics of a bacterial pathogen in wildlife. In the Greater Yellowstone Ecosystem (GYE), outbreaks of brucellosis have increased in cattle along with rising seroprevalence in elk. Here we use a genomic approach to examine Brucella abortus evolution, cross-species transmission and spatial spread in the GYE. We find that brucellosis was introduced into wildlife in this region at least five times. The diffusion rate varies among Brucella lineages (∼3 to 8 km per year) and over time. We also estimate 12 host transitions from bison to elk, and 5 from elk to bison. Our results support the notion that free-ranging elk are currently a self-sustaining brucellosis reservoir and the source of livestock infections, and that control measures in bison are unlikely to affect the dynamics of unrelated strains circulating in nearby elk populations. PMID:27165544

  1. "Candidatus Cloacamonas acidaminovorans": genome sequence reconstruction provides a first glimpse of a new bacterial division.

    PubMed

    Pelletier, Eric; Kreimeyer, Annett; Bocs, Stéphanie; Rouy, Zoé; Gyapay, Gábor; Chouari, Rakia; Rivière, Delphine; Ganesan, Akila; Daegelen, Patrick; Sghir, Abdelghani; Cohen, Georges N; Médigue, Claudine; Weissenbach, Jean; Le Paslier, Denis

    2008-04-01

    Many microorganisms live in anaerobic environments. Most of these microorganisms have not yet been cultivated. Here, we present, from a metagenomic analysis of an anaerobic digester of a municipal wastewater treatment plant, a reconstruction of the complete genome of a bacterium belonging to the WWE1 candidate division. In silico proteome analysis indicated that this bacterium might derive most of its carbon and energy from the fermentation of amino acids, and hence, it was provisionally classified as "Candidatus Cloacamonas acidaminovorans." "Candidatus Cloacamonas acidaminovorans" is probably a syntrophic bacterium that is present in many anaerobic digesters. This report highlights how environmental sequence data might provide genomic and functional information about a new bacterial clade whose members are involved in anaerobic digestion.

  2. Correlation Between Bacterial Attachment Rate Coefficients and Hydraulic Conductivity and its Effect on Field-Scale Bacterial Transport

    SciTech Connect

    Scheibe, Timothy D.; Dong, Hailiang; Xie, YuLong

    2007-06-01

    It has been widely observed in field experiments that the apparent rate of bacterial attachment, particularly as parameterized by the collision efficiency in filtration-based models, decreases with transport distance (i.e., exhibits scale-dependency). This effect has previously been attributed to microbial heterogeneity; that is, variability in cell-surface properties within a single monoclonal population. We demonstrate that this effect could also be interpreted as a field-scale manifestation of local-scale correlation between physical heterogeneity (hydraulic conductivity variability) and reaction heterogeneity (attachment rate coefficient variability). A field-scale model of bacterial transport developed for the South Oyster field research site located near Oyster, Virginia, and observations from field experiments performed at that site, are used as the basis for this study. Three-dimensional Monte Carlo simulations of bacterial transport were performed under four alternative scenarios: 1) homogeneous hydraulic conductivity (K) and attachment rate coefficient (Kf), 2) heterogeneous K, homogeneous Kf, 3) heterogeneous K and Kf with local correlation based on empirical and theoretical relationships, and 4) heterogeneous K and Kf without local correlation. The results of the 3D simulations were analyzed using 1D model approximations following conventional methods of field data analysis. An apparent decrease with transport distance of effective collision efficiency was observed only in the case where the local properties were both heterogeneous and correlated. This effect was observed despite the fact that the local collision efficiency was specified as a constant in the 3D model, and can therefore be interpreted as a scale effect associated with the local correlated heterogeneity as manifested at the field scale.

  3. Evolution of Intra-specific Regulatory Networks in a Multipartite Bacterial Genome.

    PubMed

    Galardini, Marco; Brilli, Matteo; Spini, Giulia; Rossi, Matteo; Roncaglia, Bianca; Bani, Alessia; Chiancianesi, Manuela; Moretto, Marco; Engelen, Kristof; Bacci, Giovanni; Pini, Francesco; Biondi, Emanuele G; Bazzicalupo, Marco; Mengoni, Alessio

    2015-09-01

    Reconstruction of the regulatory network is an important step in understanding how organisms control the expression of gene products and therefore phenotypes. Recent studies have pointed out the importance of regulatory network plasticity in bacterial adaptation and evolution. The evolution of such networks within and outside the species boundary is however still obscure. Sinorhizobium meliloti is an ideal species for such study, having three large replicons, many genomes available and a significant knowledge of its transcription factors (TF). Each replicon has a specific functional and evolutionary mark; which might also emerge from the analysis of their regulatory signatures. Here we have studied the plasticity of the regulatory network within and outside the S. meliloti species, looking for the presence of 41 TFs binding motifs in 51 strains and 5 related rhizobial species. We have detected a preference of several TFs for one of the three replicons, and the function of regulated genes was found to be in accordance with the overall replicon functional signature: house-keeping functions for the chromosome, metabolism for the chromid, symbiosis for the megaplasmid. This therefore suggests a replicon-specific wiring of the regulatory network in the S. meliloti species. At the same time a significant part of the predicted regulatory network is shared between the chromosome and the chromid, thus adding an additional layer by which the chromid integrates itself in the core genome. Furthermore, the regulatory network distance was found to be correlated with both promoter regions and accessory genome evolution inside the species, indicating that both pangenome compartments are involved in the regulatory network evolution. We also observed that genes which are not included in the species regulatory network are more likely to belong to the accessory genome, indicating that regulatory interactions should also be considered to predict gene conservation in bacterial

  4. ARG-ANNOT, a New Bioinformatic Tool To Discover Antibiotic Resistance Genes in Bacterial Genomes

    PubMed Central

    Gupta, Sushim Kumar; Padmanabhan, Babu Roshan; Diene, Seydina M.; Lopez-Rojas, Rafael; Kempf, Marie; Landraud, Luce

    2014-01-01

    ARG-ANNOT (Antibiotic Resistance Gene-ANNOTation) is a new bioinformatic tool that was created to detect existing and putative new antibiotic resistance (AR) genes in bacterial genomes. ARG-ANNOT uses a local BLAST program in Bio-Edit software that allows the user to analyze sequences without a Web interface. All AR genetic determinants were collected from published works and online resources; nucleotide and protein sequences were retrieved from the NCBI GenBank database. After building a database that includes 1,689 antibiotic resistance genes, the software was tested in a blind manner using 100 random sequences selected from the database to verify that the sensitivity and specificity were at 100% even when partial sequences were queried. Notably, BLAST analysis results obtained using the rmtF gene sequence (a new aminoglycoside-modifying enzyme gene sequence that is not included in the database) as a query revealed that the tool was able to link this sequence to short sequences (17 to 40 bp) found in other genes of the rmt family with significant E values. Finally, the analysis of 178 Acinetobacter baumannii and 20 Staphylococcus aureus genomes allowed the detection of a significantly higher number of AR genes than the Resfinder gene analyzer and 11 point mutations in target genes known to be associated with AR. The average time for the analysis of a genome was 3.35 ± 0.13 min. We have created a concise database for BLAST using a Bio-Edit interface that can detect AR genetic determinants in bacterial genomes and can rapidly and easily discover putative new AR genetic determinants. PMID:24145532

  5. Multiplex sequencing of bacterial artificial chromosomes for assembling complex plant genomes.

    PubMed

    Beier, Sebastian; Himmelbach, Axel; Schmutzer, Thomas; Felder, Marius; Taudien, Stefan; Mayer, Klaus F X; Platzer, Matthias; Stein, Nils; Scholz, Uwe; Mascher, Martin

    2016-07-01

    Hierarchical shotgun sequencing remains the method of choice for assembling high-quality reference sequences of complex plant genomes. The efficient exploitation of current high-throughput technologies and powerful computational facilities for large-insert clone sequencing necessitates the sequencing and assembly of a large number of clones in parallel. We developed a multiplexed pipeline for shotgun sequencing and assembling individual bacterial artificial chromosomes (BACs) using the Illumina sequencing platform. We illustrate our approach by sequencing 668 barley BACs (Hordeum vulgare L.) in a single Illumina HiSeq 2000 lane. Using a newly designed parallelized computational pipeline, we obtained sequence assemblies of individual BACs that consist, on average, of eight sequence scaffolds and represent >98% of the genomic inserts. Our BAC assemblies are clearly superior to a whole-genome shotgun assembly regarding contiguity, completeness and the representation of the gene space. Our methods may be employed to rapidly obtain high-quality assemblies of a large number of clones to assemble map-based reference sequences of plant and animal species with complex genomes by sequencing along a minimum tiling path.

  6. Single-molecule approach to bacterial genomic comparisons via optical mapping.

    SciTech Connect

    Zhou, Shiguo; Kile, A.; Bechner, M.; Kvikstad, E.; Deng, W.; Wei, J.; Severin, J.; Runnheim, R.; Churas, C.; Forrest, D.; Dimalanta, E.; Lamers, C.; Burland, V.; Blattner, F. R.; Schwartz, David C.

    2004-01-01

    Modern comparative genomics has been established, in part, by the sequencing and annotation of a broad range of microbial species. To gain further insights, new sequencing efforts are now dealing with the variety of strains or isolates that gives a species definition and range; however, this number vastly outstrips our ability to sequence them. Given the availability of a large number of microbial species, new whole genome approaches must be developed to fully leverage this information at the level of strain diversity that maximize discovery. Here, we describe how optical mapping, a single-molecule system, was used to identify and annotate chromosomal alterations between bacterial strains represented by several species. Since whole-genome optical maps are ordered restriction maps, sequenced strains of Shigella flexneri serotype 2a (2457T and 301), Yersinia pestis (CO 92 and KIM), and Escherichia coli were aligned as maps to identify regions of homology and to further characterize them as possible insertions, deletions, inversions, or translocations. Importantly, an unsequenced Shigella flexneri strain (serotype Y strain AMC[328Y]) was optically mapped and aligned with two sequenced ones to reveal one novel locus implicated in serotype conversion and several other loci containing insertion sequence elements or phage-related gene insertions. Our results suggest that genomic rearrangements and chromosomal breakpoints are readily identified and annotated against a prototypic sequenced strain by using the tools of optical mapping.

  7. Pangolin genomes and the evolution of mammalian scales and immunity.

    PubMed

    Choo, Siew Woh; Rayko, Mike; Tan, Tze King; Hari, Ranjeev; Komissarov, Aleksey; Wee, Wei Yee; Yurchenko, Andrey A; Kliver, Sergey; Tamazian, Gaik; Antunes, Agostinho; Wilson, Richard K; Warren, Wesley C; Koepfli, Klaus-Peter; Minx, Patrick; Krasheninnikova, Ksenia; Kotze, Antoinette; Dalton, Desire L; Vermaak, Elaine; Paterson, Ian C; Dobrynin, Pavel; Sitam, Frankie Thomas; Rovie-Ryan, Jeffrine J; Johnson, Warren E; Yusoff, Aini Mohamed; Luo, Shu-Jin; Karuppannan, Kayal Vizi; Fang, Gang; Zheng, Deyou; Gerstein, Mark B; Lipovich, Leonard; O'Brien, Stephen J; Wong, Guat Jah

    2016-10-01

    Pangolins, unique mammals with scales over most of their body, no teeth, poor vision, and an acute olfactory system, comprise the only placental order (Pholidota) without a whole-genome map. To investigate pangolin biology and evolution, we developed genome assemblies of the Malayan (Manis javanica) and Chinese (M. pentadactyla) pangolins. Strikingly, we found that interferon epsilon (IFNE), exclusively expressed in epithelial cells and important in skin and mucosal immunity, is pseudogenized in all African and Asian pangolin species that we examined, perhaps impacting resistance to infection. We propose that scale development was an innovation that provided protection against injuries or stress and reduced pangolin vulnerability to infection. Further evidence of specialized adaptations was evident from positively selected genes involving immunity-related pathways, inflammation, energy storage and metabolism, muscular and nervous systems, and scale/hair development. Olfactory receptor gene families are significantly expanded in pangolins, reflecting their well-developed olfaction system. This study provides insights into mammalian adaptation and functional diversification, new research tools and questions, and perhaps a new natural IFNE-deficient animal model for studying mammalian immunity.

  8. Pangolin genomes and the evolution of mammalian scales and immunity

    PubMed Central

    Rayko, Mike; Tan, Tze King; Hari, Ranjeev; Komissarov, Aleksey; Wee, Wei Yee; Yurchenko, Andrey A.; Kliver, Sergey; Tamazian, Gaik; Antunes, Agostinho; Wilson, Richard K.; Warren, Wesley C.; Koepfli, Klaus-Peter; Minx, Patrick; Krasheninnikova, Ksenia; Kotze, Antoinette; Dalton, Desire L.; Vermaak, Elaine; Paterson, Ian C.; Dobrynin, Pavel; Sitam, Frankie Thomas; Rovie-Ryan, Jeffrine J.; Johnson, Warren E.; Yusoff, Aini Mohamed; Luo, Shu-Jin; Karuppannan, Kayal Vizi; Fang, Gang; Zheng, Deyou; Gerstein, Mark B.; Lipovich, Leonard; O'Brien, Stephen J.; Wong, Guat Jah

    2016-01-01

    Pangolins, unique mammals with scales over most of their body, no teeth, poor vision, and an acute olfactory system, comprise the only placental order (Pholidota) without a whole-genome map. To investigate pangolin biology and evolution, we developed genome assemblies of the Malayan (Manis javanica) and Chinese (M. pentadactyla) pangolins. Strikingly, we found that interferon epsilon (IFNE), exclusively expressed in epithelial cells and important in skin and mucosal immunity, is pseudogenized in all African and Asian pangolin species that we examined, perhaps impacting resistance to infection. We propose that scale development was an innovation that provided protection against injuries or stress and reduced pangolin vulnerability to infection. Further evidence of specialized adaptations was evident from positively selected genes involving immunity-related pathways, inflammation, energy storage and metabolism, muscular and nervous systems, and scale/hair development. Olfactory receptor gene families are significantly expanded in pangolins, reflecting their well-developed olfaction system. This study provides insights into mammalian adaptation and functional diversification, new research tools and questions, and perhaps a new natural IFNE-deficient animal model for studying mammalian immunity. PMID:27510566

  9. Rapid pair-wise synteny analysis of large bacterial genomes using web-based GeneOrder4.0

    PubMed Central

    2010-01-01

    Background The growing whole genome sequence databases necessitate the development of user-friendly software tools to mine these data. Web-based tools are particularly useful to wet-bench biologists as they enable platform-independent analysis of sequence data, without having to perform complex programming tasks and software compiling. Findings GeneOrder4.0 is a web-based "on-the-fly" synteny and gene order analysis tool for comparative bacterial genomics (ca. 8 Mb). It enables the visualization of synteny by plotting protein similarity scores between two genomes and it also provides visual annotation of "hypothetical" proteins from older archived genomes based on more recent annotations. Conclusions The web-based software tool GeneOrder4.0 is a user-friendly application that has been updated to allow the rapid analysis of synteny and gene order in large bacterial genomes. It is developed with the wet-bench researcher in mind. PMID:20178631

  10. Large-Scale Sequencing: The Future of Genomic Sciences Colloquium

    SciTech Connect

    Margaret Riley; Merry Buckley

    2009-01-01

    Genetic sequencing and the various molecular techniques it has enabled have revolutionized the field of microbiology. Examining and comparing the genetic sequences borne by microbes - including bacteria, archaea, viruses, and microbial eukaryotes - provides researchers insights into the processes microbes carry out, their pathogenic traits, and new ways to use microorganisms in medicine and manufacturing. Until recently, sequencing entire microbial genomes has been laborious and expensive, and the decision to sequence the genome of an organism was made on a case-by-case basis by individual researchers and funding agencies. Now, thanks to new technologies, the cost and effort of sequencing is within reach for even the smallest facilities, and the ability to sequence the genomes of a significant fraction of microbial life may be possible. The availability of numerous microbial genomes will enable unprecedented insights into microbial evolution, function, and physiology. However, the current ad hoc approach to gathering sequence data has resulted in an unbalanced and highly biased sampling of microbial diversity. A well-coordinated, large-scale effort to target the breadth and depth of microbial diversity would result in the greatest impact. The American Academy of Microbiology convened a colloquium to discuss the scientific benefits of engaging in a large-scale, taxonomically-based sequencing project. A group of individuals with expertise in microbiology, genomics, informatics, ecology, and evolution deliberated on the issues inherent in such an effort and generated a set of specific recommendations for how best to proceed. The vast majority of microbes are presently uncultured and, thus, pose significant challenges to such a taxonomically-based approach to sampling genome diversity. However, we have yet to even scratch the surface of the genomic diversity among cultured microbes. A coordinated sequencing effort of cultured organisms is an appropriate place to begin

  11. CodaChrome: a tool for the visualization of proteome conservation across all fully sequenced bacterial genomes

    PubMed Central

    2014-01-01

    Background The relationships between bacterial genomes are complicated by rampant horizontal gene transfer, varied selection pressures, acquisition of new genes, loss of genes, and divergence of genes, even in closely related lineages. As more and more bacterial genomes are sequenced, organizing and interpreting the incredible amount of relational information that connects them becomes increasingly difficult. Results We have developed CodaChrome (http://www.sourceforge.com/p/codachrome), a one-versus-all proteome comparison tool that allows the user to visually investigate the relationship between a bacterial proteome of interest and the proteomes encoded by every other bacterial genome recorded in GenBank in a massive interactive heat map. This tool has allowed us to rapidly identify the most highly conserved proteins encoded in the bacterial pan-genome, fast-clock genes useful for subtyping of bacterial species, the evolutionary history of an indel in the Sphingobium lineage, and an example of horizontal gene transfer from a member of the genus Enterococcus to a recent ancestor of Helicobacter pylori. Conclusion CodaChrome is a user-friendly and powerful tool for simultaneously visualizing relationships between thousands of proteomes. PMID:24460813

  12. Genome-scale constraint-based modeling of Geobacter metallireducens

    PubMed Central

    Sun, Jun; Sayyar, Bahareh; Butler, Jessica E; Pharkya, Priti; Fahland, Tom R; Famili, Iman; Schilling, Christophe H; Lovley, Derek R; Mahadevan, Radhakrishnan

    2009-01-01

    Background Geobacter metallireducens was the first organism that can be grown in pure culture to completely oxidize organic compounds with Fe(III) oxide serving as electron acceptor. Geobacter species, including G. sulfurreducens and G. metallireducens, are used for bioremediation and electricity generation from waste organic matter and renewable biomass. The constraint-based modeling approach enables the development of genome-scale in silico models that can predict the behavior of complex biological systems and their responses to the environments. Such a modeling approach was applied to provide physiological and ecological insights on the metabolism of G. metallireducens. Results The genome-scale metabolic model of G. metallireducens was constructed to include 747 genes and 697 reactions. Compared to the G. sulfurreducens model, the G. metallireducens metabolic model contains 118 unique reactions that reflect many of G. metallireducens' specific metabolic capabilities. Detailed examination of the G. metallireducens model suggests that its central metabolism contains several energy-inefficient reactions that are not present in the G. sulfurreducens model. Experimental biomass yield of G. metallireducens growing on pyruvate was lower than the predicted optimal biomass yield. Microarray data of G. metallireducens growing with benzoate and acetate indicated that genes encoding these energy-inefficient reactions were up-regulated by benzoate. These results suggested that the energy-inefficient reactions were likely turned off during G. metallireducens growth with acetate for optimal biomass yield, but were up-regulated during growth with complex electron donors such as benzoate for rapid energy generation. Furthermore, several computational modeling approaches were applied to accelerate G. metallireducens research. For example, growth of G. metallireducens with different electron donors and electron acceptors were studied using the genome-scale metabolic model, which

  13. Large-scale evaluation of experimentally determined DNA G+C contents with whole genome sequences of prokaryotes.

    PubMed

    Kim, Mincheol; Park, Sang-Cheol; Baek, Inwoo; Chun, Jongsik

    2015-03-01

    Historically, DNA G+C content has played a critical role in the description of bacterial and archaeal species. Despite its importance in prokaryote taxonomy, its accuracy has been questioned due to methodological heterogeneity and measurement errors of conventional methods. Here we investigated the extent of accuracy of experimentally determined DNA G+C contents by comparing the reference values calculated from whole genome sequences. The large-scale comparison revealed that G+C contents determined by high-performance liquid chromatography and buoyant density centrifugation methods were more similar to the genome-derived reference values than those generated by thermal denaturation method. However, there was a substantial degree of discrepancy in DNA G+C contents between values obtained by conventional methods and genome-derived reference values. The majority of the differences between them fell out of the acceptable range (i.e. 1 mol% G+C content difference) for species delimitation of prokaryotes. In contrast, when average nucleotide identity (ANI) was correlated to G+C difference among genomes, most G+C difference was confined to less than 1% within species. Therefore, erroneous conventional methods are not meaningful in the description of bacterial and archaeal species. For taxonomic purposes, DNA G+C content should be determined by calculating directly from high-quality genome sequences with at least 16× or higher sequencing depth of coverage.

  14. Group-theoretic models of the inversion process in bacterial genomes.

    PubMed

    Egri-Nagy, Attila; Gebhardt, Volker; Tanaka, Mark M; Francis, Andrew R

    2014-07-01

    The variation in genome arrangements among bacterial taxa is largely due to the process of inversion. Recent studies indicate that not all inversions are equally probable, suggesting, for instance, that shorter inversions are more frequent than longer, and those that move the terminus of replication are less probable than those that do not. Current methods for establishing the inversion distance between two bacterial genomes are unable to incorporate such information. In this paper we suggest a group-theoretic framework that in principle can take these constraints into account. In particular, we show that by lifting the problem from circular permutations to the affine symmetric group, the inversion distance can be found in polynomial time for a model in which inversions are restricted to acting on two regions. This requires the proof of new results in group theory, and suggests a vein of new combinatorial problems concerning permutation groups on which group theorists will be needed to collaborate with biologists. We apply the new method to inferring distances and phylogenies for published Yersinia pestis data.

  15. Statistical Analysis of Hurst Exponents of Essential/Nonessential Genes in 33 Bacterial Genomes

    PubMed Central

    Liu, Xiao; Wang, Baojin; Xu, Luo

    2015-01-01

    Methods for identifying essential genes currently depend predominantly on biochemical experiments. However, there is demand for improved computational methods for determining gene essentiality. In this study, we used the Hurst exponent, a characteristic parameter to describe long-range correlation in DNA, and analyzed its distribution in 33 bacterial genomes. In most genomes (31 out of 33) the significance levels of the Hurst exponents of the essential genes were significantly higher than for the corresponding full-gene-set, whereas the significance levels of the Hurst exponents of the nonessential genes remained unchanged or increased only slightly. All of the Hurst exponents of essential genes followed a normal distribution, with one exception. We therefore propose that the distribution feature of Hurst exponents of essential genes can be used as a classification index for essential gene prediction in bacteria. For computer-aided design in the field of synthetic biology, this feature can build a restraint for pre- or post-design checking of bacterial essential genes. Moreover, considering the relationship between gene essentiality and evolution, the Hurst exponents could be used as a descriptive parameter related to evolutionary level, or be added to the annotation of each gene. PMID:26067107

  16. Gain and Loss of Phototrophic Genes Revealed by Comparison of Two Citromicrobium Bacterial Genomes

    PubMed Central

    Zheng, Qiang; Zhang, Rui; Fogg, Paul C. M.; Beatty, J. Thomas; Wang, Yu; Jiao, Nianzhi

    2012-01-01

    Proteobacteria are thought to have diverged from a phototrophic ancestor, according to the scattered distribution of phototrophy throughout the proteobacterial clade, and so the occurrence of numerous closely related phototrophic and chemotrophic microorganisms may be the result of the loss of genes for phototrophy. A widespread form of bacterial phototrophy is based on the photochemical reaction center, encoded by puf and puh operons that typically are in a ‘photosynthesis gene cluster’ (abbreviated as the PGC) with pigment biosynthesis genes. Comparison of two closely related Citromicrobial genomes (98.1% sequence identity of complete 16S rRNA genes), Citromicrobium sp. JL354, which contains two copies of reaction center genes, and Citromicrobium strain JLT1363, which is chemotrophic, revealed evidence for the loss of phototrophic genes. However, evidence of horizontal gene transfer was found in these two bacterial genomes. An incomplete PGC (pufLMC-puhCBA) in strain JL354 was located within an integrating conjugative element, which indicates a potential mechanism for the horizontal transfer of genes for phototrophy. PMID:22558224

  17. Design and application of genome-scale reconstructed metabolic models.

    PubMed

    Rocha, Isabel; Förster, Jochen; Nielsen, Jens

    2008-01-01

    In this chapter, the process for the reconstruction of genome-scale metabolic networks is described, and some of the main applications of such models are illustrated. The reconstruction process can be viewed as an iterative process where information obtained from several sources is combined to construct a preliminary set of reactions and constraints. This involves steps such as genome annotation; identification of the reactions from the annotated genome sequence and available literature; determination of the reaction stoichiometry; definition of compartmentation and assignment of localization; determination of the biomass composition; measurement, calculation, or fitting of energy requirements; and definition of additional constraints. The reaction and constraint sets, after debugging, may be integrated into a stoichiometric model that can be used for simulation using tools such as Flux Balance Analysis (Section 3.8). From the flux distributions obtained, physiologic parameters such as growth yields or minimal medium components can be calculated, and their distance from similar experimental data provides a basis from where the model may need to be improved.

  18. Genome-scale metabolic models: reconstruction and analysis.

    PubMed

    Baart, Gino J E; Martens, Dirk E

    2012-01-01

    Metabolism can be defined as the complete set of chemical reactions that occur in living organisms in order to maintain life. Enzymes are the main players in this process as they are responsible for catalyzing the chemical reactions. The enzyme-reaction relationships can be used for the reconstruction of a network of reactions, which leads to a metabolic model of metabolism. A genome-scale metabolic network of chemical reactions that take place inside a living organism is primarily reconstructed from the information that is present in its genome and the literature and involves steps such as functional annotation of the genome, identification of the associated reactions and determination of their stoichiometry, assignment of localization, determination of the biomass composition, estimation of energy requirements, and definition of model constraints. This information can be integrated into a stoichiometric model of metabolism that can be used for detailed analysis of the metabolic potential of the organism using constraint-based modeling approaches and hence is valuable in understanding its metabolic capabilities.

  19. Mechanism for DNA transposons to generate introns on genomic scales.

    PubMed

    Huff, Jason T; Zilberman, Daniel; Roy, Scott W

    2016-10-27

    The discovery of introns four decades ago was one of the most unexpected findings in molecular biology. Introns are sequences interrupting genes that must be removed as part of messenger RNA production. Genome sequencing projects have shown that most eukaryotic genes contain at least one intron, and frequently many. Comparison of these genomes reveals a history of long evolutionary periods during which few introns were gained, punctuated by episodes of rapid, extensive gain. However, although several detailed mechanisms for such episodic intron generation have been proposed, none has been empirically supported on a genomic scale. Here we show how short, non-autonomous DNA transposons independently generated hundreds to thousands of introns in the prasinophyte Micromonas pusilla and the pelagophyte Aureococcus anophagefferens. Each transposon carries one splice site. The other splice site is co-opted from the gene sequence that is duplicated upon transposon insertion, allowing perfect splicing out of the RNA. The distributions of sequences that can be co-opted are biased with respect to codons, and phasing of transposon-generated introns is similarly biased. These transposons insert between pre-existing nucleosomes, so that multiple nearby insertions generate nucleosome-sized intervening segments. Thus, transposon insertion and sequence co-option may explain the intron phase biases and prevalence of nucleosome-sized exons observed in eukaryotes. Overall, the two independent examples of proliferating elements illustrate a general DNA transposon mechanism that can plausibly account for episodes of rapid, extensive intron gain during eukaryotic evolution.

  20. EFFECT OF BACTERIAL SULFATE REDUCTION ON IRON-CORROSION SCALES

    EPA Science Inventory

    Iron-sulfur geochemistry is important in many natural and engineered environments including drinking water systems. In the anaerobic environment beneath scales of corroding iron drinking water distribution system pipes, sulfate reducing bacteria (SRB) produce sulfide from natura...

  1. Merging Chemical Ecology with Bacterial Genome Mining for Secondary Metabolite Discovery

    PubMed Central

    Vizcaino, Maria I.; Guo, Xun; Crawford, Jason M.

    2013-01-01

    The integration of chemical ecology and bacterial genome mining can enhance the discovery of structurally diverse natural products in functional contexts. By examining bacterial secondary metabolism in the framework of its ecological niche, insights can be drawn for the upregulation of orphan biosynthetic pathways and the enhancement of enzyme substrate supply to illuminate new secondary metabolic pathways that would otherwise be silent or undetected under typical laboratory cultivation conditions. Access to these new natural products (i.e., the chemotypes) facilitates experimental genotype-to-phenotype linkages. Here, we describe select functional natural products produced by Xenorhabdus and Photorhabdus bacteria, with experimentally linked biosynthetic gene clusters, as illustrative examples of synergy between chemical ecology and bacterial genome mining in connecting genotypes to phenotypes through chemotype characterization. These Gammaproteobacteria share a mutualistic relationship with nematodes and a pathogenic relationship with insects, and in select cases, humans. The natural products encoded by these bacteria distinguish their interactions with animal hosts and other microorganisms in their multipartite symbiotic lifestyles. Though both genera have similar lifestyles, their genetic, chemical, and physiological attributes are distinct. Both undergo phenotypic variation and produce a profuse number of bioactive secondary metabolites. We provide further detail in the context of regulation, production, processing, and function of these genetically encoded small molecules with respect to their roles in mutualism and pathogenicity. These collective insights more widely promote the discovery of atypical orphan biosynthetic pathways encoding novel small molecules in symbiotic systems, which could open new avenues for investigating and exploiting microbial chemical signaling in host-bacteria interactions. PMID:24127069

  2. Predicting novel pathways in genome-scale metabolic networks.

    PubMed

    Schuster, Stefan; de Figueiredo, Luís F; Kaleta, Christoph

    2010-10-01

    Elementary-modes analysis has become a well-established theoretical tool in metabolic pathway analysis. It allows one to decompose complex metabolic networks into the smallest functional entities, which can be interpreted as biochemical pathways. This analysis has, in medium-size metabolic networks, led to the successful theoretical prediction of hitherto unknown pathways. For illustration, we discuss the example of the phosphoenolpyruvate-glyoxylate cycle in Escherichia coli. Elementary-modes analysis meets with the problem of combinatorial explosion in the number of pathways with increasing system size, which has hampered scaling it up to genome-wide models. We present a novel approach to overcoming this obstacle. That approach is based on elementary flux patterns, which are defined as sets of reactions representing the basic routes through a particular subsystem that are compatible with admissible fluxes in a (possibly) much larger metabolic network. The subsystem can be made up by reactions in which we are interested in, for example, reactions producing a certain metabolite. This allows one to predict novel metabolic pathways in genome-scale networks.

  3. Perspectives on the Transition From Bacterial Phytopathogen Genomics Studies to Applications Enhancing Disease Management: From Promise to Practice.

    PubMed

    Sundin, George W; Wang, Nian; Charkowski, Amy O; Castiblanco, Luisa F; Jia, Hongge; Zhao, Youfu

    2016-10-01

    The advent of genomics has advanced science into a new era, providing a plethora of "toys" for researchers in many related and disparate fields. Genomics has also spawned many new fields, including proteomics and metabolomics, furthering our ability to gain a more comprehensive view of individual organisms and of interacting organisms. Genomic information of both bacterial pathogens and their hosts has provided the critical starting point in understanding the molecular bases of how pathogens disrupt host cells to cause disease. In addition, knowledge of the complete genome sequence of the pathogen provides a potentially broad slate of targets for the development of novel virulence inhibitors that are desperately needed for disease management. Regarding plant bacterial pathogens and disease management, the potential for utilizing genomics resources in the development of durable resistance is enhanced because of developing technologies that enable targeted modification of the host. Here, we summarize the role of genomics studies in furthering efforts to manage bacterial plant diseases and highlight novel genomics-enabled strategies heading down this path.

  4. Genome Analysis of a Zygomycete Fungus Choanephora cucurbitarum Elucidates Necrotrophic Features Including Bacterial Genes Related to Plant Colonization

    PubMed Central

    Min, Byoungnam; Park, Ji-Hyun; Park, Hongjae; Shin, Hyeon-Dong; Choi, In-Geol

    2017-01-01

    A zygomycete fungus, Choanephora cucurbitarum is a plant pathogen that causes blossom rot in cucurbits and other plants. Here we report the genome sequence of Choanephora cucurbitarum KUS-F28377 isolated from squash. The assembled genome has a size of 29.1 Mbp and 11,977 protein-coding genes. The genome analysis indicated that C. cucurbitarum may employ a plant pathogenic mechanism similar to that of bacterial plant pathogens. The genome contained 11 genes with a Streptomyces subtilisin inhibitor-like domain, which plays an important role in the defense against plant immunity. This domain has been found only in bacterial genomes. Carbohydrate active enzyme analysis detected 312 CAZymes in this genome where carbohydrate esterase family 6, rarely found in dikaryotic fungal genomes, was comparatively enriched. The comparative genome analysis showed that the genes related to sexual communication such as the biosynthesis of β-carotene and trisporic acid were conserved and diverged during the evolution of zygomycete genomes. Overall, these findings will help us to understand how zygomycetes are associated with plants. PMID:28091548

  5. First genomic insights into members of a candidate bacterial phylum responsible for wastewater bulking.

    PubMed

    Sekiguchi, Yuji; Ohashi, Akiko; Parks, Donovan H; Yamauchi, Toshihiro; Tyson, Gene W; Hugenholtz, Philip

    2015-01-01

    Filamentous cells belonging to the candidate bacterial phylum KSB3 were previously identified as the causative agent of fatal filament overgrowth (bulking) in a high-rate industrial anaerobic wastewater treatment bioreactor. Here, we obtained near complete genomes from two KSB3 populations in the bioreactor, including the dominant bulking filament, using differential coverage binning of metagenomic data. Fluorescence in situ hybridization with 16S rRNA-targeted probes specific for the two populations confirmed that both are filamentous organisms. Genome-based metabolic reconstruction and microscopic observation of the KSB3 filaments in the presence of sugar gradients indicate that both filament types are Gram-negative, strictly anaerobic fermenters capable of non-flagellar based gliding motility, and have a strikingly large number of sensory and response regulator genes. We propose that the KSB3 filaments are highly sensitive to their surroundings and that cellular processes, including those causing bulking, are controlled by external stimuli. The obtained genomes lay the foundation for a more detailed understanding of environmental cues used by KSB3 filaments, which may lead to more robust treatment options to prevent bulking.

  6. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    SciTech Connect

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMcahon, Katherine D.; Mamlstrom, Rex R.

    2014-05-12

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ecotype model? of diversification, but not previously observed in natural populations.

  7. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    SciTech Connect

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Foster, Brian; Kang, Dongwan; Tringe, Susannah G.; Bertilsson, Stefan; Moran, Mary Ann; Shade, Ashley; Newton, Ryan J.; Stevens, Sarah; McMahon, Katherine D.; Malmstrom, Rex R.

    2014-06-18

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ‘ecotype model’ of diversification, but not previously observed in natural populations.

  8. Genomic evolution of bacterial populations under coselection by antibiotics and phage.

    PubMed

    Cairns, Johannes; Frickel, Jens; Jalasvuori, Matti; Hiltunen, Teppo; Becks, Lutz

    2016-12-15

    Bacteria live in dynamic systems where selection pressures can alter rapidly, forcing adaptation to the prevailing conditions. In particular, bacteriophages and antibiotics of anthropogenic origin are major bacterial stressors in many environments. We previously observed that populations of the bacterium Pseudomonas fluorescens SBW25 exposed to the lytic bacteriophage SBW25Φ2 and a noninhibitive concentration of the antibiotic streptomycin (coselection) achieved higher levels of phage resistance compared to populations exposed to the phage alone. In addition, the phage became extinct under coselection while remaining present in the phage alone environment. Further, phenotypic tests indicated that these observations might be associated with increased mutation rate under coselection. In this study, we examined the genetic causes behind these phenotypes by whole-genome sequencing clones isolated from the end of the experiments. We were able to identify genetic factors likely responsible for streptomycin resistance, phage resistance and hypermutable (mutator) phenotypes. This constitutes genomic evidence in support of the observation that while the presence of phage did not affect antibiotic resistance, the presence of antibiotic affected phage resistance. We had previously hypothesized an association between mutators and elevated levels of phage resistance under coselection. However, our evidence regarding the mechanism was inconclusive, as although with phage mutators were only found under coselection, additional genomic evidence was lacking and phage resistance was also observed in nonmutators under coselection. More generally, our study provides novel insights into evolution between univariate and multivariate selection (here two stressors), as well as the potential role of hypermutability in natural communities.

  9. Birth of a W sex chromosome by horizontal transfer of Wolbachia bacterial symbiont genome

    PubMed Central

    Leclercq, Sébastien; Thézé, Julien; Chebbi, Mohamed Amine; Giraud, Isabelle; Moumen, Bouziane; Ernenwein, Lise; Grève, Pierre; Cordaux, Richard

    2016-01-01

    Sex determination is a fundamental developmental pathway governing male and female differentiation, with profound implications for morphology, reproductive strategies, and behavior. In animals, sex differences between males and females are generally determined by genetic factors carried by sex chromosomes. Sex chromosomes are remarkably variable in origin and can differ even between closely related species, indicating that transitions occur frequently and independently in different groups of organisms. The evolutionary causes underlying sex chromosome turnover are poorly understood, however. Here we provide evidence indicating that Wolbachia bacterial endosymbionts triggered the evolution of new sex chromosomes in the common pillbug Armadillidium vulgare. We identified a 3-Mb insert of a feminizing Wolbachia genome that was recently transferred into the pillbug nuclear genome. The Wolbachia insert shows perfect linkage to the female sex, occurs in a male genetic background (i.e., lacking the ancestral W female sex chromosome), and is hemizygous. Our results support the conclusion that the Wolbachia insert is now acting as a female sex-determining region in pillbugs, and that the chromosome carrying the insert is a new W sex chromosome. Thus, bacteria-to-animal horizontal genome transfer represents a remarkable mechanism underpinning the birth of sex chromosomes. We conclude that sex ratio distorters, such as Wolbachia endosymbionts, can be powerful agents of evolutionary transitions in sex determination systems in animals. PMID:27930295

  10. Genomic comparisons of a bacterial lineage that inhabits both marine and terrestrial deep subsurface systems

    PubMed Central

    Glavina del Rio, Tijana; Tringe, Susannah G.; Stepanauskas, Ramunas

    2017-01-01

    It is generally accepted that diverse, poorly characterized microorganisms reside deep within Earth’s crust. One such lineage of deep subsurface-dwelling bacteria is an uncultivated member of the Firmicutes phylum that can dominate molecular surveys from both marine and continental rock fracture fluids, sometimes forming the sole member of a single-species microbiome. Here, we reconstructed a genome from basalt-hosted fluids of the deep subseafloor along the eastern Juan de Fuca Ridge flank and used a phylogenomic analysis to show that, despite vast differences in geographic origin and habitat, it forms a monophyletic clade with the terrestrial deep subsurface genome of “Candidatus Desulforudis audaxviator” MP104C. While a limited number of differences were observed between the marine genome of “Candidatus Desulfopertinax cowenii” modA32 and its terrestrial relative that may be of potential adaptive importance, here it is revealed that the two are remarkably similar thermophiles possessing the genetic capacity for motility, sporulation, hydrogenotrophy, chemoorganotrophy, dissimilatory sulfate reduction, and the ability to fix inorganic carbon via the Wood-Ljungdahl pathway for chemoautotrophic growth. Our results provide insights into the genetic repertoire within marine and terrestrial members of a bacterial lineage that is widespread in the global deep subsurface biosphere, and provides a natural means to investigate adaptations specific to these two environments.

  11. Search and Validation of Short Genome-Wide Biomarkers for Bacterial Biological Phylogenies

    NASA Astrophysics Data System (ADS)

    Garzon, Max H.; Wong, Tit-Yee

    We continue the exploration of DNA-based indexing as a universal coordinate system in DNA spaces to characterize very large groups (families, genera, and even phylla) of organisms on a uniform biomarker reference system, a comprehensive "Atlas of Life", as it is or as it could be on earth. We provide a second confirmation that DNA noncrosshybridizing (nxh) sets can be successfully applied to infer ab-initio phylogenetic trees by providing a method to measure distances among entire genomes indexed by sets of short oligonucleotides selected so as to minimize crosshybridization. These phylogenies are solidly established and well accepted in bacterial biology, albeit done by analyses of relatively small segments of highly conserved rybozomic DNA. Second, it is further demonstrated that DNA indexing does provide novel and principled genome-wide predictions into the phylogenesis of organisms hitherto inaccessible by current methods, such as a prediction of the origin of the Salmonella plasmid 50 as being acquired horizontally, likely from some bacteria somewhat related to Yesinia. We conclude with some discussion about the scalability and potential of this method to develop a comprehensive tree of life based on genome-wide methods.

  12. First genomic insights into members of a candidate bacterial phylum responsible for wastewater bulking

    PubMed Central

    Ohashi, Akiko; Parks, Donovan H.; Yamauchi, Toshihiro; Tyson, Gene W.

    2015-01-01

    Filamentous cells belonging to the candidate bacterial phylum KSB3 were previously identified as the causative agent of fatal filament overgrowth (bulking) in a high-rate industrial anaerobic wastewater treatment bioreactor. Here, we obtained near complete genomes from two KSB3 populations in the bioreactor, including the dominant bulking filament, using differential coverage binning of metagenomic data. Fluorescence in situ hybridization with 16S rRNA-targeted probes specific for the two populations confirmed that both are filamentous organisms. Genome-based metabolic reconstruction and microscopic observation of the KSB3 filaments in the presence of sugar gradients indicate that both filament types are Gram-negative, strictly anaerobic fermenters capable of non-flagellar based gliding motility, and have a strikingly large number of sensory and response regulator genes. We propose that the KSB3 filaments are highly sensitive to their surroundings and that cellular processes, including those causing bulking, are controlled by external stimuli. The obtained genomes lay the foundation for a more detailed understanding of environmental cues used by KSB3 filaments, which may lead to more robust treatment options to prevent bulking. PMID:25650158

  13. The ClinSeq Project: Piloting large-scale genome sequencing for research in genomic medicine

    PubMed Central

    Biesecker, Leslie G.; Mullikin, James C.; Facio, Flavia M.; Turner, Clesson; Cherukuri, Praveen F.; Blakesley, Robert W.; Bouffard, Gerard G.; Chines, Peter S.; Cruz, Pedro; Hansen, Nancy F.; Teer, Jamie K.; Maskeri, Baishali; Young, Alice C.; Manolio, Teri A.; Wilson, Alexander F.; Finkel, Toren; Hwang, Paul; Arai, Andrew; Remaley, Alan T.; Sachdev, Vandana; Shamburek, Robert; Cannon, Richard O.; Green, Eric D.

    2009-01-01

    ClinSeq is a pilot project to investigate the use of whole-genome sequencing as a tool for clinical research. By piloting the acquisition of large amounts of DNA sequence data from individual human subjects, we are fostering the development of hypothesis-generating approaches for performing research in genomic medicine, including the exploration of issues related to the genetic architecture of disease, implementation of genomic technology, informed consent, disclosure of genetic information, and archiving, analyzing, and displaying sequence data. In the initial phase of ClinSeq, we are enrolling roughly 1000 participants; the evaluation of each includes obtaining a detailed family and medical history, as well as a clinical evaluation. The participants are being consented broadly for research on many traits and for whole-genome sequencing. Initially, Sanger-based sequencing of 300–400 genes thought to be relevant to atherosclerosis is being performed, with the resulting data analyzed for rare, high-penetrance variants associated with specific clinical traits. The participants are also being consented to allow the contact of family members for additional studies of sequence variants to explore their potential association with specific phenotypes. Here, we present the general considerations in designing ClinSeq, preliminary results based on the generation of an initial 826 Mb of sequence data, the findings for several genes that serve as positive controls for the project, and our views about the potential implications of ClinSeq. The early experiences with ClinSeq illustrate how large-scale medical sequencing can be a practical, productive, and critical component of research in genomic medicine. PMID:19602640

  14. Spatial Scales of Bacterial Diversity in Cold-Water Coral Reef Ecosystems

    PubMed Central

    Schöttner, Sandra; Wild, Christian; Hoffmann, Friederike; Boetius, Antje; Ramette, Alban

    2012-01-01

    Background Cold-water coral reef ecosystems are recognized as biodiversity hotspots in the deep sea, but insights into their associated bacterial communities are still limited. Deciphering principle patterns of bacterial community variation over multiple spatial scales may however prove critical for a better understanding of factors contributing to cold-water coral reef stability and functioning. Methodology/Principal Findings Bacterial community structure, as determined by Automated Ribosomal Intergenic Spacer Analysis (ARISA), was investigated with respect to (i) microbial habitat type and (ii) coral species and color, as well as the three spatial components (iii) geomorphologic reef zoning, (iv) reef boundary, and (v) reef location. Communities revealed fundamental differences between coral-generated (branch surface, mucus) and ambient microbial habitats (seawater, sediments). This habitat specificity appeared pivotal for determining bacterial community shifts over all other study levels investigated. Coral-derived surfaces showed species-specific patterns, differing significantly between Lophelia pertusa and Madrepora oculata, but not between L. pertusa color types. Within the reef center, no community distinction corresponded to geomorphologic reef zoning for both coral-generated and ambient microbial habitats. Beyond the reef center, however, bacterial communities varied considerably from local to regional scales, with marked shifts toward the reef periphery as well as between different in- and offshore reef sites, suggesting significant biogeographic imprinting but weak microbe-host specificity. Conclusions/Significance This study presents the first multi-scale survey of bacterial diversity in cold-water coral reefs, spanning a total of five observational levels including three spatial scales. It demonstrates that bacterial communities in cold-water coral reefs are structured by multiple factors acting at different spatial scales, which has fundamental

  15. Genome Sequence of a Copper-Resistant Strain of Acidovorax citrulli Causing Bacterial Fruit Blotch of Melons.

    PubMed

    Wang, Tielin; Yang, Yuwen; Zhao, Tingchang

    2015-04-23

    Bacterial fruit blotch (BFB) of melons is a seed-borne disease caused by Acidovorax citrulli. We determined the draft genome of A. citrulli Tw6. The strain was isolated from a watermelon collected from Beijing, China. The A. citrulli Tw6 genome contains 5,080,614 bp and has a G+C content of 68.7 mol%.

  16. Genome-scale Proteome Quantification by DEEP SEQ Mass Spectrometry

    PubMed Central

    Zhou, Feng; Lu, Yu; Ficarro, Scott B.; Adelmant, Guillaume; Jiang, Wenyu; Luckey, C. John; Marto, Jarrod A.

    2013-01-01

    Advances in chemistry and massively parallel detection underlie DNA sequencing platforms that are poised for application in personalized medicine. In stark contrast, systematic generation of protein-level data lags well-behind genomics in virtually every aspect: depth of coverage, throughput, ease of sample preparation, and experimental time. Here, to bridge this gap, we develop an approach based on simple detergent lysis and single-enzyme digest, extreme, orthogonal separation of peptides, and true nanoflow LC-MS/MS that provides high peak capacity and ionization efficiency. This automated, deep efficient peptide sequencing and quantification (DEEP SEQ) mass spectrometry platform provides genome-scale proteome coverage equivalent to RNA-seq ribosomal profiling and accurate quantification for multiplexed isotope labels. In a model of the embryonic to epiblast transition in murine stem cells, we unambiguously quantify 11,352 gene products that span 70% of Swiss-Prot and capture protein regulation across the full detectable range of high-throughput gene expression and protein translation. PMID:23863870

  17. Next-generation genome-scale models for metabolic engineering.

    PubMed

    King, Zachary A; Lloyd, Colton J; Feist, Adam M; Palsson, Bernhard O

    2015-12-01

    Constraint-based reconstruction and analysis (COBRA) methods have become widely used tools for metabolic engineering in both academic and industrial laboratories. By employing a genome-scale in silico representation of the metabolic network of a host organism, COBRA methods can be used to predict optimal genetic modifications that improve the rate and yield of chemical production. A new generation of COBRA models and methods is now being developed--encompassing many biological processes and simulation strategies-and next-generation models enable new types of predictions. Here, three key examples of applying COBRA methods to strain optimization are presented and discussed. Then, an outlook is provided on the next generation of COBRA models and the new types of predictions they will enable for systems metabolic engineering.

  18. Dynamic bacterial communities on reverse-osmosis membranes in a full-scale desalination plant.

    PubMed

    Manes, C-L de O; West, N; Rapenne, S; Lebaron, P

    2011-01-01

    To better understand biofouling of seawater reverse osmosis (SWRO) membranes, bacterial diversity was characterized in the intake water, in subsequently pretreated water and on SWRO membranes from a full-scale desalination plant (FSDP) during a 9 month period. 16S rRNA gene fingerprinting and sequencing revealed that bacterial communities in the water samples and on the SWRO membranes were very different. For the different sampling dates, the bacterial diversity of the active and the total bacterial fractions of the water samples remained relatively stable over the sampling period whereas the bacterial community structure on the four SWRO membrane samples was significantly different. The richness and evenness of the SWRO membrane bacterial communities increased with usage time with an increase in the Shannon diversity index of 2.2 to 3.7. In the oldest SWRO membrane (330 days), no single operational taxonomic unit (OTU) dominated and the majority of the OTUs fell into the Alphaproteobacteria or the Planctomycetes. In striking contrast, a Betaproteobacteria OTU affiliated to the genus Ideonella was dominant and exclusively found in the membrane used for the shortest time (10 days). This suggests that bacteria belonging to this genus could be one of the primary colonizers of the SWRO membrane. Knowledge of the dominant bacterial species on SWRO membranes and their dynamics should help guide culture studies for physiological characterization of biofilm forming species.

  19. Distinct soil bacterial communities along a small-scale elevational gradient in alpine tundra.

    PubMed

    Shen, Congcong; Ni, Yingying; Liang, Wenju; Wang, Jianjun; Chu, Haiyan

    2015-01-01

    The elevational diversity pattern for microorganisms has received great attention recently but is still understudied, and phylogenetic relatedness is rarely studied for microbial elevational distributions. Using a bar-coded pyrosequencing technique, we examined the biodiversity patterns for soil bacterial communities of tundra ecosystem along 2000-2500 m elevations on Changbai Mountain in China. Bacterial taxonomic richness displayed a linear decreasing trend with increasing elevation. Phylogenetic diversity and mean nearest taxon distance (MNTD) exhibited a unimodal pattern with elevation. Bacterial communities were more phylogenetically clustered than expected by chance at all elevations based on the standardized effect size of MNTD metric. The bacterial communities differed dramatically among elevations, and the community composition was significantly correlated with soil total carbon (TC), total nitrogen, C:N ratio, and dissolved organic carbon. Multiple ordinary least squares regression analysis showed that the observed biodiversity patterns strongly correlated with soil TC and C:N ratio. Taken together, this is the first time that a significant bacterial diversity pattern has been observed across a small-scale elevational gradient. Our results indicated that soil carbon and nitrogen contents were the critical environmental factors affecting bacterial elevational distribution in Changbai Mountain tundra. This suggested that ecological niche-based environmental filtering processes related to soil carbon and nitrogen contents could play a dominant role in structuring bacterial communities along the elevational gradient.

  20. Predicting effects of structural stress in a genome-reduced model bacterial metabolism

    NASA Astrophysics Data System (ADS)

    Güell, Oriol; Sagués, Francesc; Serrano, M. Ángeles

    2012-08-01

    Mycoplasma pneumoniae is a human pathogen recently proposed as a genome-reduced model for bacterial systems biology. Here, we study the response of its metabolic network to different forms of structural stress, including removal of individual and pairs of reactions and knockout of genes and clusters of co-expressed genes. Our results reveal a network architecture as robust as that of other model bacteria regarding multiple failures, although less robust against individual reaction inactivation. Interestingly, metabolite motifs associated to reactions can predict the propagation of inactivation cascades and damage amplification effects arising in double knockouts. We also detect a significant correlation between gene essentiality and damages produced by single gene knockouts, and find that genes controlling high-damage reactions tend to be expressed independently of each other, a functional switch mechanism that, simultaneously, acts as a genetic firewall to protect metabolism. Prediction of failure propagation is crucial for metabolic engineering or disease treatment.

  1. Improved variation calling via an iterative backbone remapping and local assembly method for bacterial genomes

    PubMed Central

    Tae, Hongseok; Settlage, Robert E.; Shallom, Shamira; Bavarva, Jasmin H.; Preston, Dale; Hawkins, Gregory N.; Adams, L. Garry; Garner, Harold R.

    2012-01-01

    Sequencing data analysis remains limiting and problematic, especially for low complexity repeat sequences and transposon elements due to inherent sequencing errors and short sequence read lengths. We have developed a program, ReviSeq, which uses a hybrid method comprised of iterative remapping and local assembly upon a bacterial sequence backbone. Application of this method to six Brucella suis field isolates compared to the newly revised Brucella suis 1330 reference genome identified on average 13, 15, 19 and 9 more variants per sample than STAMPY/SAMtools, BWA/SAMtools, iCORN and BWA/PINDEL pipelines, and excluded on average 4, 2, 3 and 19 variants per sample, respectively. In total, using this iterative approach, we identified on average 87 variants including SNVs, short INDELs and long INDELs per strain when compared to the reference. Our program outperforms other methods especially for long INDEL calling. The program is available at http://reviseq.sourceforge.net. PMID:22967795

  2. Predicting effects of structural stress in a genome-reduced model bacterial metabolism

    PubMed Central

    Güell, Oriol; Sagués, Francesc; Serrano, M. Ángeles

    2012-01-01

    Mycoplasma pneumoniae is a human pathogen recently proposed as a genome-reduced model for bacterial systems biology. Here, we study the response of its metabolic network to different forms of structural stress, including removal of individual and pairs of reactions and knockout of genes and clusters of co-expressed genes. Our results reveal a network architecture as robust as that of other model bacteria regarding multiple failures, although less robust against individual reaction inactivation. Interestingly, metabolite motifs associated to reactions can predict the propagation of inactivation cascades and damage amplification effects arising in double knockouts. We also detect a significant correlation between gene essentiality and damages produced by single gene knockouts, and find that genes controlling high-damage reactions tend to be expressed independently of each other, a functional switch mechanism that, simultaneously, acts as a genetic firewall to protect metabolism. Prediction of failure propagation is crucial for metabolic engineering or disease treatment. PMID:22934134

  3. Predicting effects of structural stress in a genome-reduced model bacterial metabolism.

    PubMed

    Güell, Oriol; Sagués, Francesc; Serrano, M Ángeles

    2012-01-01

    Mycoplasma pneumoniae is a human pathogen recently proposed as a genome-reduced model for bacterial systems biology. Here, we study the response of its metabolic network to different forms of structural stress, including removal of individual and pairs of reactions and knockout of genes and clusters of co-expressed genes. Our results reveal a network architecture as robust as that of other model bacteria regarding multiple failures, although less robust against individual reaction inactivation. Interestingly, metabolite motifs associated to reactions can predict the propagation of inactivation cascades and damage amplification effects arising in double knockouts. We also detect a significant correlation between gene essentiality and damages produced by single gene knockouts, and find that genes controlling high-damage reactions tend to be expressed independently of each other, a functional switch mechanism that, simultaneously, acts as a genetic firewall to protect metabolism. Prediction of failure propagation is crucial for metabolic engineering or disease treatment.

  4. Genome-Scale Reconstruction of the Human Astrocyte Metabolic Network

    PubMed Central

    Martín-Jiménez, Cynthia A.; Salazar-Barreto, Diego; Barreto, George E.; González, Janneth

    2017-01-01

    Astrocytes are the most abundant cells of the central nervous system; they have a predominant role in maintaining brain metabolism. In this sense, abnormal metabolic states have been found in different neuropathological diseases. Determination of metabolic states of astrocytes is difficult to model using current experimental approaches given the high number of reactions and metabolites present. Thus, genome-scale metabolic networks derived from transcriptomic data can be used as a framework to elucidate how astrocytes modulate human brain metabolic states during normal conditions and in neurodegenerative diseases. We performed a Genome-Scale Reconstruction of the Human Astrocyte Metabolic Network with the purpose of elucidating a significant portion of the metabolic map of the astrocyte. This is the first global high-quality, manually curated metabolic reconstruction network of a human astrocyte. It includes 5,007 metabolites and 5,659 reactions distributed among 8 cell compartments, (extracellular, cytoplasm, mitochondria, endoplasmic reticle, Golgi apparatus, lysosome, peroxisome and nucleus). Using the reconstructed network, the metabolic capabilities of human astrocytes were calculated and compared both in normal and ischemic conditions. We identified reactions activated in these two states, which can be useful for understanding the astrocytic pathways that are affected during brain disease. Additionally, we also showed that the obtained flux distributions in the model, are in accordance with literature-based findings. Up to date, this is the most complete representation of the human astrocyte in terms of inclusion of genes, proteins, reactions and metabolic pathways, being a useful guide for in-silico analysis of several metabolic behaviors of the astrocyte during normal and pathologic states. PMID:28243200

  5. Genome-Scale Reconstruction of the Human Astrocyte Metabolic Network.

    PubMed

    Martín-Jiménez, Cynthia A; Salazar-Barreto, Diego; Barreto, George E; González, Janneth

    2017-01-01

    Astrocytes are the most abundant cells of the central nervous system; they have a predominant role in maintaining brain metabolism. In this sense, abnormal metabolic states have been found in different neuropathological diseases. Determination of metabolic states of astrocytes is difficult to model using current experimental approaches given the high number of reactions and metabolites present. Thus, genome-scale metabolic networks derived from transcriptomic data can be used as a framework to elucidate how astrocytes modulate human brain metabolic states during normal conditions and in neurodegenerative diseases. We performed a Genome-Scale Reconstruction of the Human Astrocyte Metabolic Network with the purpose of elucidating a significant portion of the metabolic map of the astrocyte. This is the first global high-quality, manually curated metabolic reconstruction network of a human astrocyte. It includes 5,007 metabolites and 5,659 reactions distributed among 8 cell compartments, (extracellular, cytoplasm, mitochondria, endoplasmic reticle, Golgi apparatus, lysosome, peroxisome and nucleus). Using the reconstructed network, the metabolic capabilities of human astrocytes were calculated and compared both in normal and ischemic conditions. We identified reactions activated in these two states, which can be useful for understanding the astrocytic pathways that are affected during brain disease. Additionally, we also showed that the obtained flux distributions in the model, are in accordance with literature-based findings. Up to date, this is the most complete representation of the human astrocyte in terms of inclusion of genes, proteins, reactions and metabolic pathways, being a useful guide for in-silico analysis of several metabolic behaviors of the astrocyte during normal and pathologic states.

  6. Distinguishing bacterial pathogens of potato using a genome-wide microarray approach.

    PubMed

    Aittamaa, M; Somervuo, P; Pirhonen, M; Mattinen, L; Nissinen, R; Auvinen, P; Valkonen, J P T

    2008-09-01

    A set of 9676 probes was designed for the most harmful bacterial pathogens of potato and tested in a microarray format. Gene-specific probes could be designed for all genes of Pectobacterium atrosepticum, c. 50% of the genes of Streptomyces scabies and c. 30% of the genes of Clavibacter michiganensis ssp. sepedonicus utilizing the whole-genome sequence information available. For Streptomyces turgidiscabies, 226 probes were designed according to the sequences of a pathogenicity island containing important virulence genes. In addition, probes were designed for the virulence-associated nip (necrosis-inducing protein) genes of P. atrosepticum, P. carotovorum and Dickeya dadantii and for the intergenic spacer (IGS) sequences of the 16S-23S rRNA gene region. Ralstonia solanacearum was not included in the study, because it is a quarantine organism and is not presently found in Finland, but a few probes were also designed for this species. The probes contained on average 40 target-specific nucleotides and were synthesized on the array in situ, organized as eight sub-arrays with an identical set of probes which could be used for hybridization with different samples. All bacteria were readily distinguished using a single channel system for signal detection. Nearly all of the c. 1000 probes designed for C. michiganensis ssp. sepedonicus, c. 50% and 40% of the c. 4000 probes designed for the genes of S. scabies and P. atrosepticum, respectively, and over 100 probes for S. turgidiscabies showed significant signals only with the respective species. P. atrosepticum, P. carotovorum and Dickeya strains were all detected with 110 common probes. By contrast, the strains of these species were found to differ in their signal profiles. Probes targeting the IGS region and nip genes could be used to place strains of Dickeya to two groups, which correlated with differences in virulence. Taken together, the approach of using a custom-designed, genome-wide microarray provided a robust means

  7. Engineering the largest RNA virus genome as an infectious bacterial artificial chromosome

    PubMed Central

    Almazán, Fernando; González, José M.; Pénzes, Zoltan; Izeta, Ander; Calvo, Enrique; Plana-Durán, Juan; Enjuanes, Luis

    2000-01-01

    The construction of cDNA clones encoding large-size RNA molecules of biological interest, like coronavirus genomes, which are among the largest mature RNA molecules known to biology, has been hampered by the instability of those cDNAs in bacteria. Herein, we show that the application of two strategies, cloning of the cDNAs into a bacterial artificial chromosome and nuclear expression of RNAs that are typically produced within the cytoplasm, is useful for the engineering of large RNA molecules. A cDNA encoding an infectious coronavirus RNA genome has been cloned as a bacterial artificial chromosome. The rescued coronavirus conserved all of the genetic markers introduced throughout the sequence and showed a standard mRNA pattern and the antigenic characteristics expected for the synthetic virus. The cDNA was transcribed within the nucleus, and the RNA translocated to the cytoplasm. Interestingly, the recovered virus had essentially the same sequence as the original one, and no splicing was observed. The cDNA was derived from an attenuated isolate that replicates exclusively in the respiratory tract of swine. During the engineering of the infectious cDNA, the spike gene of the virus was replaced by the spike gene of an enteric isolate. The synthetic virus replicated abundantly in the enteric tract and was fully virulent, demonstrating that the tropism and virulence of the recovered coronavirus can be modified. This demonstration opens up the possibility of employing this infectious cDNA as a vector for vaccine development in human, porcine, canine, and feline species susceptible to group 1 coronaviruses. PMID:10805807

  8. Spatial scales of bacterial community diversity at cold seeps (Eastern Mediterranean Sea)

    PubMed Central

    Pop Ristova, Petra; Wenzhöfer, Frank; Ramette, Alban; Felden, Janine; Boetius, Antje

    2015-01-01

    Cold seeps are highly productive, fragmented marine ecosystems that form at the seafloor around hydrocarbon emission pathways. The products of microbial utilization of methane and other hydrocarbons fuel rich chemosynthetic communities at these sites, with much higher respiration rates compared with the surrounding deep-sea floor. Yet little is known as to the richness, composition and spatial scaling of bacterial communities of cold seeps compared with non-seep communities. Here we assessed the bacterial diversity across nine different cold seeps in the Eastern Mediterranean deep-sea and surrounding seafloor areas. Community similarity analyses were carried out based on automated ribosomal intergenic spacer analysis (ARISA) fingerprinting and high-throughput 454 tag sequencing and were combined with in situ and ex situ geochemical analyses across spatial scales of a few tens of meters to hundreds of kilometers. Seep communities were dominated by Deltaproteobacteria, Epsilonproteobacteria and Gammaproteobacteria and shared, on average, 36% of bacterial types (ARISA OTUs (operational taxonomic units)) with communities from nearby non-seep deep-sea sediments. Bacterial communities of seeps were significantly different from those of non-seep sediments. Within cold seep regions on spatial scales of only tens to hundreds of meters, the bacterial communities differed considerably, sharing <50% of types at the ARISA OTU level. Their variations reflected differences in porewater sulfide concentrations from anaerobic degradation of hydrocarbons. This study shows that cold seep ecosystems contribute substantially to the microbial diversity of the deep-sea. PMID:25500510

  9. Spatial scales of bacterial community diversity at cold seeps (Eastern Mediterranean Sea).

    PubMed

    Pop Ristova, Petra; Wenzhöfer, Frank; Ramette, Alban; Felden, Janine; Boetius, Antje

    2015-06-01

    Cold seeps are highly productive, fragmented marine ecosystems that form at the seafloor around hydrocarbon emission pathways. The products of microbial utilization of methane and other hydrocarbons fuel rich chemosynthetic communities at these sites, with much higher respiration rates compared with the surrounding deep-sea floor. Yet little is known as to the richness, composition and spatial scaling of bacterial communities of cold seeps compared with non-seep communities. Here we assessed the bacterial diversity across nine different cold seeps in the Eastern Mediterranean deep-sea and surrounding seafloor areas. Community similarity analyses were carried out based on automated ribosomal intergenic spacer analysis (ARISA) fingerprinting and high-throughput 454 tag sequencing and were combined with in situ and ex situ geochemical analyses across spatial scales of a few tens of meters to hundreds of kilometers. Seep communities were dominated by Deltaproteobacteria, Epsilonproteobacteria and Gammaproteobacteria and shared, on average, 36% of bacterial types (ARISA OTUs (operational taxonomic units)) with communities from nearby non-seep deep-sea sediments. Bacterial communities of seeps were significantly different from those of non-seep sediments. Within cold seep regions on spatial scales of only tens to hundreds of meters, the bacterial communities differed considerably, sharing <50% of types at the ARISA OTU level. Their variations reflected differences in porewater sulfide concentrations from anaerobic degradation of hydrocarbons. This study shows that cold seep ecosystems contribute substantially to the microbial diversity of the deep-sea.

  10. Large-Scale Coherence in Concentrated Bacterial Suspensions

    NASA Astrophysics Data System (ADS)

    Cisneros, Luis; Dombrowski, Christopher; Goldstein, Raymond; Kessler, John

    2004-03-01

    When the normally randomly swimming bacteria Bacillus subtilis become sufficiently concentrated, there emerges a transition to new dynamics. This phase is characterized by spontaneous local ordering, collective velocities substantially greater than the swimming speed of individual organisms, and very long range fluid dynamical phenomena. Among them are evanescent swirls, reminscent of the vortex streets that normally occur at high Reynolds numbers. We shall present analysis of these coherent structures using particle imaging velocimetry (PIV), summarized by correlation functions of various types. We shall also present a key physical mechanism underlying these phenomena, and a mathematical model that can account for many observed features. Beyond the remarkable new physics underpinning these patterns, we demonstrate its significance for microbial ecology in relation to mixing at the molecular scale, and possibly to quorum sensing and formation of biofilms.

  11. Expression of lysozymes from Erwinia amylovora phages and Erwinia genomes and inhibition by a bacterial protein.

    PubMed

    Müller, Ina; Gernold, Marina; Schneider, Bernd; Geider, Klaus

    2012-01-01

    Genes coding for lysozyme-inhibiting proteins (Ivy) were cloned from the chromosomes of the plant pathogens Erwinia amylovora and Erwinia pyrifoliae. The product interfered not only with activity of hen egg white lysozyme, but also with an enzyme from E. amylovora phage ΦEa1h. We have expressed lysozyme genes from the genomes of three Erwinia species in Escherichia coli. The lysozymes expressed from genes of the E. amylovora phages ΦEa104 and ΦEa116, Erwinia chromosomes and Arabidopsis thaliana were not affected by Ivy. The enzyme from bacteriophage ΦEa1h was fused at the N- or C-terminus to other peptides. Compared to the intact lysozyme, a His-tag reduced its lytic activity about 10-fold and larger fusion proteins abolished activity completely. Specific protease cleavage restored lysozyme activity of a GST-fusion. The bacteriophage-encoded lysozymes were more active than the enzymes from bacterial chromosomes. Viral lyz genes were inserted into a broad-host range vector, and transfer to E. amylovora inhibited cell growth. Inserted in the yeast Pichia pastoris, the ΦEa1h-lysozyme was secreted and also inhibited by Ivy. Here we describe expression of unrelated cloned 'silent' lyz genes from Erwinia chromosomes and a novel interference of bacterial Ivy proteins with a viral lysozyme.

  12. Limitations to estimating bacterial cross-speciestransmission using genetic and genomic markers: inferencesfrom simulation modeling

    USGS Publications Warehouse

    Julio Andre, Benavides; Cross, Paul C.; Luikart, Gordon; Scott, Creel

    2014-01-01

    Cross-species transmission (CST) of bacterial pathogens has major implications for human health, livestock, and wildlife management because it determines whether control actions in one species may have subsequent effects on other potential host species. The study of bacterial transmission has benefitted from methods measuring two types of genetic variation: variable number of tandem repeats (VNTRs) and single nucleotide polymorphisms (SNPs). However, it is unclear whether these data can distinguish between different epidemiological scenarios. We used a simulation model with two host species and known transmission rates (within and between species) to evaluate the utility of these markers for inferring CST. We found that CST estimates are biased for a wide range of parameters when based on VNTRs and a most parsimonious reconstructed phylogeny. However, estimations of CST rates lower than 5% can be achieved with relatively low bias using as low as 250 SNPs. CST estimates are sensitive to several parameters, including the number of mutations accumulated since introduction, stochasticity, the genetic difference of strains introduced, and the sampling effort. Our results suggest that, even with whole-genome sequences, unbiased estimates of CST will be difficult when sampling is limited, mutation rates are low, or for pathogens that were recently introduced.

  13. Selection for Unequal Densities of Sigma70 Promoter-like Signalsin Different Regions of Large Bacterial Genomes

    SciTech Connect

    Huerta, Araceli M.; Francino, M. Pilar; Morett, Enrique; Collado-Vides, Julio

    2006-03-01

    The evolutionary processes operating in the DNA regions that participate in the regulation of gene expression are poorly understood. In Escherichia coli, we have established a sequence pattern that distinguishes regulatory from nonregulatory regions. The density of promoter-like sequences, that are recognizable by RNA polymerase and may function as potential promoters, is high within regulatory regions, in contrast to coding regions and regions located between convergently-transcribed genes. Moreover, functional promoter sites identified experimentally are often found in the subregions of highest density of promoter-like signals, even when individual sites with higher binding affinity for RNA polymerase exist elsewhere within the regulatory region. In order to investigate the generality of this pattern, we have used position weight matrices describing the -35 and -10 promoter boxes of E. coli to search for these motifs in 43 additional genomes belonging to most established bacterial phyla, after specific calibration of the matrices according to the base composition of the noncoding regions of each genome. We have found that all bacterial species analyzed contain similar promoter-like motifs, and that, in most cases, these motifs follow the same genomic distribution observed in E. coli. Differential densities between regulatory and nonregulatory regions are detectable in most bacterial genomes, with the exception of those that have experienced evolutionary extreme genome reduction. Thus, the phylogenetic distribution of this pattern mirrors that of genes and other genomic features that require weak selection to be effective in order to persist. On this basis, we suggest that the loss of differential densities in the reduced genomes of host-restricted pathogens and symbionts is the outcome of a process of genome degradation resulting from the decreased efficiency of purifying selection in highly structured small populations. This implies that the differential

  14. A peptide identification-free, genome sequence-independent shotgun proteomics workflow for strain-level bacterial differentiation.

    PubMed

    Shao, Wenguang; Zhang, Min; Lam, Henry; Lau, Stanley C K

    2015-09-23

    Shotgun proteomics is an emerging tool for bacterial identification and differentiation. However, the identification of the mass spectra of peptides to genome-derived peptide sequences remains a key issue that limits the use of shotgun proteomics to bacteria with genome sequences available. In this proof-of-concept study, we report a novel bacterial fingerprinting method that enjoys the resolving power and accuracy of mass spectrometry without the burden of peptide identification (i.e. genome sequence-independent). This method uses a similarity-clustering algorithm to search for mass spectra that are derived from the same peptide and merge them into a unique consensus spectrum as the basis to generate proteomic fingerprints of bacterial isolates. In comparison to a traditional peptide identification-based shotgun proteomics workflow and a PCR-based DNA fingerprinting method targeting the repetitive extragenic palindromes elements in bacterial genomes, the novel method generated fingerprints that were richer in information and more discriminative in differentiating E. coli isolates by their animal sources. The novel method is readily deployable to any cultivable bacteria, and may be used for several fields of study such as environmental microbiology, applied microbiology, and clinical microbiology.

  15. Weighted ssGBLUP improves genomic selection accuracy for bacterial cold water disease resistance in a rainbow trout population

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The objective of this study was to compare methods for genomic evaluation in a Rainbow Trout (Oncorhynchus mykiss) population for survival when challenged by Flavobacterium psychrophilum, the causative agent of bacterial cold water disease (BCWD). The used methods were: 1)regular ssGBLUP that assume...

  16. A peptide identification-free, genome sequence-independent shotgun proteomics workflow for strain-level bacterial differentiation

    PubMed Central

    Shao, Wenguang; Zhang, Min; Lam, Henry; Lau, Stanley C. K.

    2015-01-01

    Shotgun proteomics is an emerging tool for bacterial identification and differentiation. However, the identification of the mass spectra of peptides to genome-derived peptide sequences remains a key issue that limits the use of shotgun proteomics to bacteria with genome sequences available. In this proof-of-concept study, we report a novel bacterial fingerprinting method that enjoys the resolving power and accuracy of mass spectrometry without the burden of peptide identification (i.e. genome sequence-independent). This method uses a similarity-clustering algorithm to search for mass spectra that are derived from the same peptide and merge them into a unique consensus spectrum as the basis to generate proteomic fingerprints of bacterial isolates. In comparison to a traditional peptide identification-based shotgun proteomics workflow and a PCR-based DNA fingerprinting method targeting the repetitive extragenic palindromes elements in bacterial genomes, the novel method generated fingerprints that were richer in information and more discriminative in differentiating E. coli isolates by their animal sources. The novel method is readily deployable to any cultivable bacteria, and may be used for several fields of study such as environmental microbiology, applied microbiology, and clinical microbiology. PMID:26395646

  17. Complete Genome Sequence of Gluconacetobacter hansenii Strain NQ5 (ATCC 53582), an Efficient Producer of Bacterial Cellulose

    PubMed Central

    Pfeffer, Sarah; Mehta, Kalpa

    2016-01-01

    This study reports the release of the complete nucleotide sequence of Gluconacetobacter hansenii strain NQ5 (ATCC 53582). This strain was isolated by R. Malcolm Brown, Jr. in a sugar mill in North Queensland, Australia, and is an efficient producer of bacterial cellulose. The elucidation of the genome will contribute to the study of the molecular mechanisms necessary for cellulose biosynthesis. PMID:27516505

  18. Scaffolding bacterial genomes and probing host-virus interactions in gut microbiome by proximity ligation (chromosome capture) assay

    PubMed Central

    Marbouty, Martial; Baudry, Lyam; Cournac, Axel; Koszul, Romain

    2017-01-01

    The biochemical activities of microbial communities, or microbiomes, are essential parts of environmental and animal ecosystems. The dynamics, balance, and effects of these communities are strongly influenced by phages present in the population. Being able to characterize bacterium-phage relationships is therefore essential to investigate these ecosystems to the full extent of their complexity. However, this task is currently limited by (i) the ability to characterize complete bacterial and viral genomes from a complex mix of species and (ii) the difficulty to assign phage sequences to their bacterial hosts. We show that both limitations can be circumvented using meta3C, an experimental and computational approach that exploits the physical contacts between DNA molecules to infer their proximity. In a single experiment, dozens of bacterial and phage genomes present in a complex mouse gut microbiota were assembled and scaffolded de novo. The phage genomes were then assigned to their putative bacterial hosts according to the physical contacts between the different DNA molecules, opening new perspectives for a comprehensive picture of the genomic structure of the gut flora. Therefore, this work holds far-reaching implications for human health studies aiming to bridge the virome to the microbiome. PMID:28232956

  19. Identifying all moiety conservation laws in genome-scale metabolic networks.

    PubMed

    De Martino, Andrea; De Martino, Daniele; Mulet, Roberto; Pagnani, Andrea

    2014-01-01

    The stoichiometry of a metabolic network gives rise to a set of conservation laws for the aggregate level of specific pools of metabolites, which, on one hand, pose dynamical constraints that cross-link the variations of metabolite concentrations and, on the other, provide key insight into a cell's metabolic production capabilities. When the conserved quantity identifies with a chemical moiety, extracting all such conservation laws from the stoichiometry amounts to finding all non-negative integer solutions of a linear system, a programming problem known to be NP-hard. We present an efficient strategy to compute the complete set of integer conservation laws of a genome-scale stoichiometric matrix, also providing a certificate for correctness and maximality of the solution. Our method is deployed for the analysis of moiety conservation relationships in two large-scale reconstructions of the metabolism of the bacterium E. coli, in six tissue-specific human metabolic networks, and, finally, in the human reactome as a whole, revealing that bacterial metabolism could be evolutionarily designed to cover broader production spectra than human metabolism. Convergence to the full set of moiety conservation laws in each case is achieved in extremely reduced computing times. In addition, we uncover a scaling relation that links the size of the independent pool basis to the number of metabolites, for which we present an analytical explanation.

  20. Accomplishments in genome-scale in silico modeling for industrial and medical biotechnology

    PubMed Central

    Milne, Caroline B.; Kim, Pan-Jun; Eddy, James A.; Price, Nathan D.

    2011-01-01

    Driven by advancements in high-throughput biological technologies and the growing number of sequenced genomes, the construction of in silico models at the genome scale has provided powerful tools to investigate a vast array of biological systems and applications. Here, we review comprehensively the uses of such models in industrial and medical biotechnology, including biofuel generation, food production, and drug development. While the use of in silico models is still in its early stages for delivering to industry, significant initial successes have been achieved. For the cases presented here, genome-scale models predict engineering strategies to enhance properties of interest in an organism or to inhibit harmful mechanisms of pathogens or in disease. Going forward, genome-scale in silico models promise to extend their application and analysis scope to become a transformative tool in biotechnology. As such, genome-scale models can provide a basis for rational genome-scale engineering and synthetic biology. PMID:19946878

  1. SigmoID: a user-friendly tool for improving bacterial genome annotation through analysis of transcription control signals

    PubMed Central

    Damienikan, Aliaksandr U.

    2016-01-01

    The majority of bacterial genome annotations are currently automated and based on a ‘gene by gene’ approach. Regulatory signals and operon structures are rarely taken into account which often results in incomplete and even incorrect gene function assignments. Here we present SigmoID, a cross-platform (OS X, Linux and Windows) open-source application aiming at simplifying the identification of transcription regulatory sites (promoters, transcription factor binding sites and terminators) in bacterial genomes and providing assistance in correcting annotations in accordance with regulatory information. SigmoID combines a user-friendly graphical interface to well known command line tools with a genome browser for visualising regulatory elements in genomic context. Integrated access to online databases with regulatory information (RegPrecise and RegulonDB) and web-based search engines speeds up genome analysis and simplifies correction of genome annotation. We demonstrate some features of SigmoID by constructing a series of regulatory protein binding site profiles for two groups of bacteria: Soft Rot Enterobacteriaceae (Pectobacterium and Dickeya spp.) and Pseudomonas spp. Furthermore, we inferred over 900 transcription factor binding sites and alternative sigma factor promoters in the annotated genome of Pectobacterium atrosepticum. These regulatory signals control putative transcription units covering about 40% of the P. atrosepticum chromosome. Reviewing the annotation in cases where it didn’t fit with regulatory information allowed us to correct product and gene names for over 300 loci. PMID:27257541

  2. SigmoID: a user-friendly tool for improving bacterial genome annotation through analysis of transcription control signals.

    PubMed

    Nikolaichik, Yevgeny; Damienikan, Aliaksandr U

    2016-01-01

    The majority of bacterial genome annotations are currently automated and based on a 'gene by gene' approach. Regulatory signals and operon structures are rarely taken into account which often results in incomplete and even incorrect gene function assignments. Here we present SigmoID, a cross-platform (OS X, Linux and Windows) open-source application aiming at simplifying the identification of transcription regulatory sites (promoters, transcription factor binding sites and terminators) in bacterial genomes and providing assistance in correcting annotations in accordance with regulatory information. SigmoID combines a user-friendly graphical interface to well known command line tools with a genome browser for visualising regulatory elements in genomic context. Integrated access to online databases with regulatory information (RegPrecise and RegulonDB) and web-based search engines speeds up genome analysis and simplifies correction of genome annotation. We demonstrate some features of SigmoID by constructing a series of regulatory protein binding site profiles for two groups of bacteria: Soft Rot Enterobacteriaceae (Pectobacterium and Dickeya spp.) and Pseudomonas spp. Furthermore, we inferred over 900 transcription factor binding sites and alternative sigma factor promoters in the annotated genome of Pectobacterium atrosepticum. These regulatory signals control putative transcription units covering about 40% of the P. atrosepticum chromosome. Reviewing the annotation in cases where it didn't fit with regulatory information allowed us to correct product and gene names for over 300 loci.

  3. Complete Genome Sequence and Immunoproteomic Analyses of the Bacterial Fish Pathogen Streptococcus parauberis▿†

    PubMed Central

    Nho, Seong Won; Hikima, Jun-ichi; Cha, In Seok; Park, Seong Bin; Jang, Ho Bin; del Castillo, Carmelo S.; Kondo, Hidehiro; Hirono, Ikuo; Aoki, Takashi; Jung, Tae Sung

    2011-01-01

    Although Streptococcus parauberis is known as a bacterial pathogen associated with bovine udder mastitis, it has recently become one of the major causative agents of olive flounder (Paralichthys olivaceus) streptococcosis in northeast Asia, causing massive mortality resulting in severe economic losses. S. parauberis contains two serotypes, and it is likely that capsular polysaccharide antigens serve to differentiate the serotypes. In the present study, the complete genome sequence of S. parauberis (serotype I) was determined using the GS-FLX system to investigate its phylogeny, virulence factors, and antigenic proteins. S. parauberis possesses a single chromosome of 2,143,887 bp containing 1,868 predicted coding sequences (CDSs), with an average GC content of 35.6%. Whole-genome dot plot analysis and phylogenetic analysis of a 60-kDa chaperonin-encoding gene and the glyceraldehyde-3-phosphate dehydrogenase (GAPDH)-encoding gene showed that the strain was evolutionarily closely related to Streptococcus uberis. S. parauberis antigenic proteins were analyzed using an immunoproteomic technique. Twenty-one antigenic protein spots were identified in S. parauberis, by reaction with an antiserum obtained from S. parauberis-challenged olive flounder. This work provides the foundation needed to understand more clearly the relationship between pathogen and host and develops new approaches toward prophylactic and therapeutic strategies to deal with streptococcosis in fish. The work also provides a better understanding of the physiology and evolution of a significant representative of the Streptococcaceae. PMID:21531805

  4. Bacterial Artificial Chromosomes: A Functional Genomics Tool for the Study of Positive-strand RNA Viruses

    PubMed Central

    Yun, Sang-Im; Song, Byung-Hak; Kim, Jin-Kyoung; Lee, Young-Min

    2015-01-01

    Reverse genetics, an approach to rescue infectious virus entirely from a cloned cDNA, has revolutionized the field of positive-strand RNA viruses, whose genomes have the same polarity as cellular mRNA. The cDNA-based reverse genetics system is a seminal method that enables direct manipulation of the viral genomic RNA, thereby generating recombinant viruses for molecular and genetic studies of both viral RNA elements and gene products in viral replication and pathogenesis. It also provides a valuable platform that allows the development of genetically defined vaccines and viral vectors for the delivery of foreign genes. For many positive-strand RNA viruses such as Japanese encephalitis virus (JEV), however, the cloned cDNAs are unstable, posing a major obstacle to the construction and propagation of the functional cDNA. Here, the present report describes the strategic considerations in creating and amplifying a genetically stable full-length infectious JEV cDNA as a bacterial artificial chromosome (BAC) using the following general experimental procedures: viral RNA isolation, cDNA synthesis, cDNA subcloning and modification, assembly of a full-length cDNA, cDNA linearization, in vitro RNA synthesis, and virus recovery. This protocol provides a general methodology applicable to cloning full-length cDNA for a range of positive-strand RNA viruses, particularly those with a genome of >10 kb in length, into a BAC vector, from which infectious RNAs can be transcribed in vitro with a bacteriophage RNA polymerase. PMID:26780115

  5. LLNL Genomic Assessment: Viral and Bacterial Sequencing Needs for TMTI, Task 1.4.2 Report

    SciTech Connect

    Slezak, T; Borucki, M; Lam, M; Lenhoff, R; Vitalis, E

    2010-01-26

    Good progress has been made on both bacterial and viral sequencing by the TMTI centers. While access to appropriate samples is a limiting factor to throughput, excellent progress has been made with respect to getting agreements in place with key sources of relevant materials. Sharing of sequenced genomes funded by TMTI has been extremely limited to date. The April 2010 exercise should force a resolution to this, but additional managerial pressures may be needed to ensure that rapid sharing of TMTI-funded sequencing occurs, regardless of collaborator constraints concerning ultimate publication(s). Policies to permit TMTI-internal rapid sharing of sequenced genomes should be written into all TMTI agreements with collaborators now being negotiated. TMTI needs to establish a Web-based system for tracking samples destined for sequencing. This includes metadata on sample origins and contributor, information on sample shipment/receipt, prioritization by TMTI, assignment to one or more sequencing centers (including possible TMTI-sponsored sequencing at a contributor site), and status history of the sample sequencing effort. While this system could be a component of the AFRL system, it is not part of any current development effort. Policy and standardized procedures are needed to ensure appropriate verification of all TMTI samples prior to the investment in sequencing. PCR, arrays, and classical biochemical tests are examples of potential verification methods. Verification is needed to detect miss-labeled, degraded, mixed or contaminated samples. Regular QC exercises are needed to ensure that the TMTI-funded centers are meeting all standards for producing quality genomic sequence data.

  6. Bacterial Artificial Chromosomes: A Functional Genomics Tool for the Study of Positive-strand RNA Viruses.

    PubMed

    Yun, Sang-Im; Song, Byung-Hak; Kim, Jin-Kyoung; Lee, Young-Min

    2015-12-29

    Reverse genetics, an approach to rescue infectious virus entirely from a cloned cDNA, has revolutionized the field of positive-strand RNA viruses, whose genomes have the same polarity as cellular mRNA. The cDNA-based reverse genetics system is a seminal method that enables direct manipulation of the viral genomic RNA, thereby generating recombinant viruses for molecular and genetic studies of both viral RNA elements and gene products in viral replication and pathogenesis. It also provides a valuable platform that allows the development of genetically defined vaccines and viral vectors for the delivery of foreign genes. For many positive-strand RNA viruses such as Japanese encephalitis virus (JEV), however, the cloned cDNAs are unstable, posing a major obstacle to the construction and propagation of the functional cDNA. Here, the present report describes the strategic considerations in creating and amplifying a genetically stable full-length infectious JEV cDNA as a bacterial artificial chromosome (BAC) using the following general experimental procedures: viral RNA isolation, cDNA synthesis, cDNA subcloning and modification, assembly of a full-length cDNA, cDNA linearization, in vitro RNA synthesis, and virus recovery. This protocol provides a general methodology applicable to cloning full-length cDNA for a range of positive-strand RNA viruses, particularly those with a genome of >10 kb in length, into a BAC vector, from which infectious RNAs can be transcribed in vitro with a bacteriophage RNA polymerase.

  7. A sensitive, support-vector-machine method for the detection of horizontal gene transfers in viral, archaeal and bacterial genomes.

    PubMed

    Tsirigos, Aristotelis; Rigoutsos, Isidore

    2005-01-01

    In earlier work, we introduced and discussed a generalized computational framework for identifying horizontal transfers. This framework relied on a gene's nucleotide composition, obviated the need for knowledge of codon boundaries and database searches, and was shown to perform very well across a wide range of archaeal and bacterial genomes when compared with previously published approaches, such as Codon Adaptation Index and C + G content. Nonetheless, two considerations remained outstanding: we wanted to further increase the sensitivity of detecting horizontal transfers and also to be able to apply the method to increasingly smaller genomes. In the discussion that follows, we present such a method, Wn-SVM, and show that it exhibits a very significant improvement in sensitivity compared with earlier approaches. Wn-SVM uses a one-class support-vector machine and can learn using rather small training sets. This property makes Wn-SVM particularly suitable for studying small-size genomes, similar to those of viruses, as well as the typically larger archaeal and bacterial genomes. We show experimentally that the new method results in a superior performance across a wide range of organisms and that it improves even upon our own earlier method by an average of 10% across all examined genomes. As a small-genome case study, we analyze the genome of the human cytomegalovirus and demonstrate that Wn-SVM correctly identifies regions that are known to be conserved and prototypical of all beta-herpesvirinae, regions that are known to have been acquired horizontally from the human host and, finally, regions that had not up to now been suspected to be horizontally transferred. Atypical region predictions for many eukaryotic viruses, including the alpha-, beta- and gamma-herpesvirinae, and 123 archaeal and bacterial genomes, have been made available online at http://cbcsrv.watson.ibm.com/HGT_SVM/.

  8. SnpFilt: A pipeline for reference-free assembly-based identification of SNPs in bacterial genomes.

    PubMed

    Chan, Carmen H S; Octavia, Sophie; Sintchenko, Vitali; Lan, Ruiting

    2016-12-01

    De novo assembly of bacterial genomes from next-generation sequencing (NGS) data allows a reference-free discovery of single nucleotide polymorphisms (SNP). However, substantial rates of errors in genomes assembled by this approach remain a major barrier for the reference-free analysis of genome variations in medically important bacteria. The aim of this report was to improve the quality of SNP identification in bacterial genomes without closely related references. We developed a bioinformatics pipeline (SnpFilt) that constructs an assembly using SPAdes and then removes unreliable regions based on the quality and coverage of re-aligned reads at neighbouring regions. The performance of the pipeline was compared against reference-based SNP calling for Illumina HiSeq, MiSeq and NextSeq reads from a range of bacterial pathogens including Salmonella, which is one of the most common causes of food-borne disease. The SnpFilt pipeline removed all false SNP in all test NGS datasets consisting of paired-end Illumina reads. We also showed that for reliable and complete SNP calls, at least 40-fold coverage is required. Analysis of bacterial isolates associated with epidemiologically confirmed outbreaks using the SnpFilt pipeline produced results consistent with previously published findings. The SnpFilt pipeline improves the quality of de-novo assembly and precision of SNP calling in bacterial genomes by removal of regions of the assembly that may potentially contain assembly errors. SnpFilt is available from https://github.com/LanLab/SnpFilt.

  9. Continuing Evolution of Burkholderia mallei Through Genome Reduction and Large-Scale Rearrangements

    DTIC Science & Technology

    2010-01-22

    in Materials and Methods. b NRPS, nonribosomal peptide synthase ; PKS, polyketide synthase ; RND, resistance nodulation-division like pump. Losada et al...genomics, genome erosion, bacterial virulence. ª The Author(s) 2010. Published by Oxford University Press on behalf of the Society for Molecular Biology...creativecommons.org/licenses/by-nc/ 2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original

  10. Characterizing acetogenic metabolism using a genome-scale metabolic reconstruction of Clostridium ljungdahlii

    SciTech Connect

    Nagarajan, H; Sahin, M; Nogales, J; Latif, H; Lovley, DR; Ebrahim, A; Zengler, K

    2013-11-25

    Background: The metabolic capabilities of acetogens to ferment a wide range of sugars, to grow autotrophically on H-2/CO2, and more importantly on synthesis gas (H-2/CO/CO2) make them very attractive candidates as production hosts for biofuels and biocommodities. Acetogenic metabolism is considered one of the earliest modes of bacterial metabolism. A thorough understanding of various factors governing the metabolism, in particular energy conservation mechanisms, is critical for metabolic engineering of acetogens for targeted production of desired chemicals. Results: Here, we present the genome-scale metabolic network of Clostridium ljungdahlii, the first such model for an acetogen. This genome-scale model (iHN637) consisting of 637 genes, 785 reactions, and 698 metabolites captures all the major central metabolic and biosynthetic pathways, in particular pathways involved in carbon fixation and energy conservation. A combination of metabolic modeling, with physiological and transcriptomic data provided insights into autotrophic metabolism as well as aided the characterization of a nitrate reduction pathway in C. ljungdahlii. Analysis of the iHN637 metabolic model revealed that flavin based electron bifurcation played a key role in energy conservation during autotrophic growth and helped identify genes for some of the critical steps in this mechanism. Conclusions: iHN637 represents a predictive model that recapitulates experimental data, and provides valuable insights into the metabolic response of C. ljungdahlii to genetic perturbations under various growth conditions. Thus, the model will be instrumental in guiding metabolic engineering of C. ljungdahlii for the industrial production of biocommodities and biofuels.

  11. Characterizing acetogenic metabolism using a genome-scale metabolic reconstruction of Clostridium ljungdahlii

    PubMed Central

    2013-01-01

    Background The metabolic capabilities of acetogens to ferment a wide range of sugars, to grow autotrophically on H2/CO2, and more importantly on synthesis gas (H2/CO/CO2) make them very attractive candidates as production hosts for biofuels and biocommodities. Acetogenic metabolism is considered one of the earliest modes of bacterial metabolism. A thorough understanding of various factors governing the metabolism, in particular energy conservation mechanisms, is critical for metabolic engineering of acetogens for targeted production of desired chemicals. Results Here, we present the genome-scale metabolic network of Clostridium ljungdahlii, the first such model for an acetogen. This genome-scale model (iHN637) consisting of 637 genes, 785 reactions, and 698 metabolites captures all the major central metabolic and biosynthetic pathways, in particular pathways involved in carbon fixation and energy conservation. A combination of metabolic modeling, with physiological and transcriptomic data provided insights into autotrophic metabolism as well as aided the characterization of a nitrate reduction pathway in C. ljungdahlii. Analysis of the iHN637 metabolic model revealed that flavin based electron bifurcation played a key role in energy conservation during autotrophic growth and helped identify genes for some of the critical steps in this mechanism. Conclusions iHN637 represents a predictive model that recapitulates experimental data, and provides valuable insights into the metabolic response of C. ljungdahlii to genetic perturbations under various growth conditions. Thus, the model will be instrumental in guiding metabolic engineering of C. ljungdahlii for the industrial production of biocommodities and biofuels. PMID:24274140

  12. Ultrastructural and molecular characterization of a bacterial symbiosis in the ecologically important scale insect family Coelostomidiidae.

    PubMed

    Dhami, Manpreet K; Turner, Adrian P; Deines, Peter; Beggs, Jacqueline R; Taylor, Michael W

    2012-09-01

    Scale insects are important ecologically and as agricultural pests. The majority of scale insect taxa feed exclusively on plant phloem sap, which is carbon rich but deficient in essential amino acids. This suggests that, as seen in the related aphids and psyllids, scale insect nutrition might also depend upon bacterial symbionts, yet very little is known about scale insect-bacteria symbioses. We report here the first identification and molecular characterization of symbiotic bacteria associated with the New Zealand giant scale Coelostomidia wairoensis, using fluorescence in situ hybridization (FISH), transmission electron microscopy (TEM) and 16S rRNA gene-based analysis. Dissection and FISH confirmed the location of the bacteria in large, paired, multilobate organs in the abdominal region of the insect. TEM indicated that the dominant pleomorphic bacteria were confined to bacteriocytes in the sheath-enclosed bacteriome. Phylogenetic analysis revealed the presence of three distinct bacterial types, the bacteriome-associated B-symbiont (Bacteroidetes), an Erwinia-related symbiont (Gammaproteobacteria) and Wolbachia sp. (Alphaproteobacteria). This study extends the current knowledge of scale insect symbionts and is the first microbiological investigation of the ecologically important coelostomidiid scales.

  13. Strategies used for genetically modifying bacterial genome: ite-directed mutagenesis, gene inactivation, and gene over-expression*

    PubMed Central

    Xu, Jian-zhong; Zhang, Wei-guo

    2016-01-01

    With the availability of the whole genome sequence of Escherichia coli or Corynebacterium glutamicum, strategies for directed DNA manipulation have developed rapidly. DNA manipulation plays an important role in understanding the function of genes and in constructing novel engineering bacteria according to requirement. DNA manipulation involves modifying the autologous genes and expressing the heterogenous genes. Two alternative approaches, using electroporation linear DNA or recombinant suicide plasmid, allow a wide variety of DNA manipulation. However, the over-expression of the desired gene is generally executed via plasmid-mediation. The current review summarizes the common strategies used for genetically modifying E. coli and C. glutamicum genomes, and discusses the technical problem of multi-layered DNA manipulation. Strategies for gene over-expression via integrating into genome are proposed. This review is intended to be an accessible introduction to DNA manipulation within the bacterial genome for novices and a source of the latest experimental information for experienced investigators. PMID:26834010

  14. Segmental duplications within the Glycine max genome revealed by fluorescence in situ hybridization of bacterial artificial chromosomes.

    PubMed

    Pagel, Janice; Walling, Jason G; Young, Nevin D; Shoemaker, Randy C; Jackson, Scott A

    2004-08-01

    Soybean (Glycine max L. Merr.) is presumed to be an ancient polyploid based on chromosome number and multiple RFLP fragments in genetic mapping. Direct cytogenetic observation of duplicated regions within the soybean genome has not heretofore been reported. Employing fluorescence in situ hybridization (FISH) of genetically anchored bacterial artificial chromosomes (BACs) in soybean, we were able to observe that the distal ends of molecular linkage group E had duplicated regions on linkage groups A2 and B2. Further, using fiber-FISH, it was possible to measure the molecular size and organization of one of the duplicated regions. As FISH did not require repetitive DNA for blocking fluorescence signals, we assume that the 200-kb genome region is relatively low in repetitive sequences. This observation, along with the observation that the BACs are located in distal euchromatin regions, has implications for genome structure/evolution and the approach used to sequence the soybean genome.

  15. Comparative genomics of Mortierella elongata and its bacterial endosymbiont Mycoavidus cysteinexigens: Comparative genomics of Mortierella elongata

    SciTech Connect

    Uehling, J.; Gryganskyi, A.; Hameed, K.; Tschaplinski, T.; Misztal, P. K.; Wu, S.; Desirò, A.; Vande Pol, N.; Du, Z.; Zienkiewicz, A.; Zienkiewicz, K.; Morin, E.; Tisserant, E.; Splivallo, R.; Hainaut, M.; Kuo, A.; Yan, J.; Lipzen, A.; Nolan, M.; LaButti, K.; Barry, K.; Goldstein, A. H.; Labbé, J.; Schadt, C.; Tuskan, G.; Grigoriev, I.; Martin, F.; Vilgalys, R.; Bonito, G.

    2017-01-01

    Endosymbiosis of bacteria by eukaryotes is a defining feature of cellular evolution. In addition to well-known bacterial origins for mitochondria and chloroplasts, multiple origins of bacterial endosymbiosis are known within the cells of diverse animals, plants and fungi. Early-diverging lineages of terrestrial fungi harbor endosymbiotic bacteria belonging to the Burkholderiaceae. Furthermore, we sequenced the metagenome of the soil-inhabiting fungus Mortierella elongata and assembled the complete circular chromosome of its endosymbiont, Mycoavidus cysteinexigens, which we place within a lineage of endofungal symbionts that are sister clade to Burkholderia. The genome of M. elongata strain AG77 features a core set of primary metabolic pathways for degradation of simple carbohydrates and lipid biosynthesis, while the M. cysteinexigens (AG77) genome is reduced in size and function. Experiments using antibiotics to cure the endobacterium from the host demonstrate that the fungal host metabolism is highly modulated by presence/ absence of M. cysteinexigens. In independent comparative phylogenomic analyses of fungal and bacterial genomes we find that they are consistent with an ancient origin for M. elongata M. cysteinexigens symbiosis, most likely over 350 million years ago and concomitant with the terrestrialization of Earth and diversification of land fungi and plants.

  16. Genome-scale genetic engineering in Escherichia coli.

    PubMed

    Jeong, Jaehwan; Cho, Namjin; Jung, Daehee; Bang, Duhee

    2013-11-01

    Genome engineering has been developed to create useful strains for biological studies and industrial uses. However, a continuous challenge remained in the field: technical limitations in high-throughput screening and precise manipulation of strains. Today, technical improvements have made genome engineering more rapid and efficient. This review introduces recent advances in genome engineering technologies applied to Escherichia coli as well as multiplex automated genome engineering (MAGE), a recent technique proposed as a powerful toolkit due to its straightforward process, rapid experimental procedures, and highly efficient properties.

  17. Temporal scaling of bacterial taxa is influenced by both stochastic and deterministic ecological factors.

    PubMed

    van der Gast, Christopher J; Ager, Duane; Lilley, Andrew K

    2008-06-01

    Microorganisms operate at a range of spatial and temporal scales acting as key drivers of ecosystem properties. Therefore, many key questions in microbial ecology require the consideration of both spatial and temporal scales. Spatial scaling, in particular the species-area relationship (SAR), has a long history in ecology and has recently been addressed in microbial ecology. However, the temporal analogue of the SAR, the species-time relationship, has received far less attention even in the science of general ecology. Here we focus upon the role of temporal scaling in microbial ecological patterns by coupling molecular characterization of bacterial communities in discrete island (bioreactor) systems with a macroecological approach. Our findings showed that the temporal scaling exponent (slope), and therefore taxa turnover of the bacterial taxa-time relationship decreased as selective pressure (industrial wastewater concentration) increased. Also, as the concentration of industrial wastewater increased across the bioreactors, we observed a gradual switch from stochastic community assembly to more deterministic (niche)-based considerations. The identification of broad-scale statistical patterns is particularly relevant to microbial ecology, as it is frequently difficult to identify individual species or their functions. In this study, we identify wide-reaching statistical patterns of diversity and show that they are shaped by the prevalent underlying ecological factors.

  18. Genome sequence and plasmid transformation of the model high-yield bacterial cellulose producer Gluconacetobacter hansenii ATCC 53582

    PubMed Central

    Florea, Michael; Reeve, Benjamin; Abbott, James; Freemont, Paul S.; Ellis, Tom

    2016-01-01

    Bacterial cellulose is a strong, highly pure form of cellulose that is used in a range of applications in industry, consumer goods and medicine. Gluconacetobacter hansenii ATCC 53582 is one of the highest reported bacterial cellulose producing strains and has been used as a model organism in numerous studies of bacterial cellulose production and studies aiming to increased cellulose productivity. Here we present a high-quality draft genome sequence for G. hansenii ATCC 53582 and find that in addition to the previously described cellulose synthase operon, ATCC 53582 contains two additional cellulose synthase operons and several previously undescribed genes associated with cellulose production. In parallel, we also develop optimized protocols and identify plasmid backbones suitable for transformation of ATCC 53582, albeit with low efficiencies. Together, these results provide important information for further studies into cellulose synthesis and for future studies aiming to genetically engineer G. hansenii ATCC 53582 for increased cellulose productivity. PMID:27010592

  19. Genome sequence and plasmid transformation of the model high-yield bacterial cellulose producer Gluconacetobacter hansenii ATCC 53582

    NASA Astrophysics Data System (ADS)

    Florea, Michael; Reeve, Benjamin; Abbott, James; Freemont, Paul S.; Ellis, Tom

    2016-03-01

    Bacterial cellulose is a strong, highly pure form of cellulose that is used in a range of applications in industry, consumer goods and medicine. Gluconacetobacter hansenii ATCC 53582 is one of the highest reported bacterial cellulose producing strains and has been used as a model organism in numerous studies of bacterial cellulose production and studies aiming to increased cellulose productivity. Here we present a high-quality draft genome sequence for G. hansenii ATCC 53582 and find that in addition to the previously described cellulose synthase operon, ATCC 53582 contains two additional cellulose synthase operons and several previously undescribed genes associated with cellulose production. In parallel, we also develop optimized protocols and identify plasmid backbones suitable for transformation of ATCC 53582, albeit with low efficiencies. Together, these results provide important information for further studies into cellulose synthesis and for future studies aiming to genetically engineer G. hansenii ATCC 53582 for increased cellulose productivity.

  20. Genome sequence and plasmid transformation of the model high-yield bacterial cellulose producer Gluconacetobacter hansenii ATCC 53582.

    PubMed

    Florea, Michael; Reeve, Benjamin; Abbott, James; Freemont, Paul S; Ellis, Tom

    2016-03-24

    Bacterial cellulose is a strong, highly pure form of cellulose that is used in a range of applications in industry, consumer goods and medicine. Gluconacetobacter hansenii ATCC 53582 is one of the highest reported bacterial cellulose producing strains and has been used as a model organism in numerous studies of bacterial cellulose production and studies aiming to increased cellulose productivity. Here we present a high-quality draft genome sequence for G. hansenii ATCC 53582 and find that in addition to the previously described cellulose synthase operon, ATCC 53582 contains two additional cellulose synthase operons and several previously undescribed genes associated with cellulose production. In parallel, we also develop optimized protocols and identify plasmid backbones suitable for transformation of ATCC 53582, albeit with low efficiencies. Together, these results provide important information for further studies into cellulose synthesis and for future studies aiming to genetically engineer G. hansenii ATCC 53582 for increased cellulose productivity.

  1. FN-Identify: Novel Restriction Enzymes-Based Method for Bacterial Identification in Absence of Genome Sequencing.

    PubMed

    Awad, Mohamed; Ouda, Osama; El-Refy, Ali; El-Feky, Fawzy A; Mosa, Kareem A; Helmy, Mohamed

    2015-01-01

    Sequencing and restriction analysis of genes like 16S rRNA and HSP60 are intensively used for molecular identification in the microbial communities. With aid of the rapid progress in bioinformatics, genome sequencing became the method of choice for bacterial identification. However, the genome sequencing technology is still out of reach in the developing countries. In this paper, we propose FN-Identify, a sequencing-free method for bacterial identification. FN-Identify exploits the gene sequences data available in GenBank and other databases and the two algorithms that we developed, CreateScheme and GeneIdentify, to create a restriction enzyme-based identification scheme. FN-Identify was tested using three different and diverse bacterial populations (members of Lactobacillus, Pseudomonas, and Mycobacterium groups) in an in silico analysis using restriction enzymes and sequences of 16S rRNA gene. The analysis of the restriction maps of the members of three groups using the fragment numbers information only or along with fragments sizes successfully identified all of the members of the three groups using a minimum of four and maximum of eight restriction enzymes. Our results demonstrate the utility and accuracy of FN-Identify method and its two algorithms as an alternative method that uses the standard microbiology laboratories techniques when the genome sequencing is not available.

  2. Genome-wide association analysis identifies resistance loci for bacterial blight in a diverse collection of indica rice germplasm.

    PubMed

    Zhang, Fan; Wu, Zhi-Chao; Wang, Ming-Ming; Zhang, Fan; Dingkuhn, Michael; Xu, Jian-Long; Zhou, Yong-Li; Li, Zhi-Kang

    2017-01-01

    Bacterial blight, which is caused by Xanthomonas oryzae pv. oryzae (Xoo), is one of the most devastating rice diseases worldwide. The development and use of disease-resistant cultivars have been the most effective strategy to control bacterial blight. Identifying the genes mediating bacterial blight resistance is a prerequisite for breeding cultivars with broad-spectrum and durable resistance. We herein describe a genome-wide association study involving 172 diverse Oryza sativa ssp. indica accessions to identify loci influencing the resistance to representative strains of six Xoo races. Twelve resistance loci containing 121 significantly associated signals were identified using 317,894 single nucleotide polymorphisms, which explained 13.3-59.9% of the variability in lesion length caused by Xoo races P1, P6, and P9a. Two hotspot regions (L11 and L12) were located within or nearby two cloned R genes (xa25 and Xa26) and one fine-mapped R gene (Xa4). Our results confirmed the relatively high resolution of genome-wide association studies. Moreover, we detected novel significant associations on chromosomes 2, 3, and 6-10. Haplotype analyses of xa25, the Xa26 paralog (MRKc; LOC_Os11g47290), and a Xa4 candidate gene (LOC_11g46870) revealed differences in bacterial blight resistance among indica subgroups. These differences were responsible for the observed variations in lesion lengths resulting from infections by Xoo races P1 and P9a. Our findings may be relevant for future studies involving bacterial blight resistance gene cloning, and provide insights into the genetic basis for bacterial blight resistance in indica rice, which may be useful for knowledge-based crop improvement.

  3. Genome-wide association analysis identifies resistance loci for bacterial blight in a diverse collection of indica rice germplasm

    PubMed Central

    Wang, Ming-Ming; Zhang, Fan; Dingkuhn, Michael; Xu, Jian-Long; Zhou, Yong-Li; Li, Zhi-Kang

    2017-01-01

    Bacterial blight, which is caused by Xanthomonas oryzae pv. oryzae (Xoo), is one of the most devastating rice diseases worldwide. The development and use of disease-resistant cultivars have been the most effective strategy to control bacterial blight. Identifying the genes mediating bacterial blight resistance is a prerequisite for breeding cultivars with broad-spectrum and durable resistance. We herein describe a genome-wide association study involving 172 diverse Oryza sativa ssp. indica accessions to identify loci influencing the resistance to representative strains of six Xoo races. Twelve resistance loci containing 121 significantly associated signals were identified using 317,894 single nucleotide polymorphisms, which explained 13.3–59.9% of the variability in lesion length caused by Xoo races P1, P6, and P9a. Two hotspot regions (L11 and L12) were located within or nearby two cloned R genes (xa25 and Xa26) and one fine-mapped R gene (Xa4). Our results confirmed the relatively high resolution of genome-wide association studies. Moreover, we detected novel significant associations on chromosomes 2, 3, and 6–10. Haplotype analyses of xa25, the Xa26 paralog (MRKc; LOC_Os11g47290), and a Xa4 candidate gene (LOC_11g46870) revealed differences in bacterial blight resistance among indica subgroups. These differences were responsible for the observed variations in lesion lengths resulting from infections by Xoo races P1 and P9a. Our findings may be relevant for future studies involving bacterial blight resistance gene cloning, and provide insights into the genetic basis for bacterial blight resistance in indica rice, which may be useful for knowledge-based crop improvement. PMID:28355306

  4. The 19 Genomes of Drosophila: A BAC Library Resource for Genus-Wide and Genome-Scale Comparative Evolutionary Research

    PubMed Central

    Song, Xiang; Goicoechea, Jose Luis; Ammiraju, Jetty S. S.; Luo, Meizhong; He, Ruifeng; Lin, Jinke; Lee, So-Jeong; Sisneros, Nicholas; Watts, Tom; Kudrna, David A.; Golser, Wolfgang; Ashley, Elizabeth; Collura, Kristi; Braidotti, Michele; Yu, Yeisoo; Matzkin, Luciano M.; McAllister, Bryant F.; Markow, Therese Ann; Wing, Rod A.

    2011-01-01

    The genus Drosophila has been the subject of intense comparative phylogenomics characterization to provide insights into genome evolution under diverse biological and ecological contexts and to functionally annotate the Drosophila melanogaster genome, a model system for animal and insect genetics. Recent sequencing of 11 additional Drosophila species from various divergence points of the genus is a first step in this direction. However, to fully reap the benefits of this resource, the Drosophila community is faced with two critical needs: i.e., the expansion of genomic resources from a much broader range of phylogenetic diversity and the development of additional resources to aid in finishing the existing draft genomes. To address these needs, we report the first synthesis of a comprehensive set of bacterial artificial chromosome (BAC) resources for 19 Drosophila species from all three subgenera. Ten libraries were derived from the exact source used to generate 10 of the 12 draft genomes, while the rest were generated from a strategically selected set of species on the basis of salient ecological and life history features and their phylogenetic positions. The majority of the new species have at least one sequenced reference genome for immediate comparative benefit. This 19-BAC library set was rigorously characterized and shown to have large insert sizes (125–168 kb), low nonrecombinant clone content (0.3–5.3%), and deep coverage (9.1–42.9×). Further, we demonstrated the utility of this BAC resource for generating physical maps of targeted loci, refining draft sequence assemblies and identifying potential genomic rearrangements across the phylogeny. PMID:21321134

  5. The 19 genomes of Drosophila: a BAC library resource for genus-wide and genome-scale comparative evolutionary research.

    PubMed

    Song, Xiang; Goicoechea, Jose Luis; Ammiraju, Jetty S S; Luo, Meizhong; He, Ruifeng; Lin, Jinke; Lee, So-Jeong; Sisneros, Nicholas; Watts, Tom; Kudrna, David A; Golser, Wolfgang; Ashley, Elizabeth; Collura, Kristi; Braidotti, Michele; Yu, Yeisoo; Matzkin, Luciano M; McAllister, Bryant F; Markow, Therese Ann; Wing, Rod A

    2011-04-01

    The genus Drosophila has been the subject of intense comparative phylogenomics characterization to provide insights into genome evolution under diverse biological and ecological contexts and to functionally annotate the Drosophila melanogaster genome, a model system for animal and insect genetics. Recent sequencing of 11 additional Drosophila species from various divergence points of the genus is a first step in this direction. However, to fully reap the benefits of this resource, the Drosophila community is faced with two critical needs: i.e., the expansion of genomic resources from a much broader range of phylogenetic diversity and the development of additional resources to aid in finishing the existing draft genomes. To address these needs, we report the first synthesis of a comprehensive set of bacterial artificial chromosome (BAC) resources for 19 Drosophila species from all three subgenera. Ten libraries were derived from the exact source used to generate 10 of the 12 draft genomes, while the rest were generated from a strategically selected set of species on the basis of salient ecological and life history features and their phylogenetic positions. The majority of the new species have at least one sequenced reference genome for immediate comparative benefit. This 19-BAC library set was rigorously characterized and shown to have large insert sizes (125-168 kb), low nonrecombinant clone content (0.3-5.3%), and deep coverage (9.1-42.9×). Further, we demonstrated the utility of this BAC resource for generating physical maps of targeted loci, refining draft sequence assemblies and identifying potential genomic rearrangements across the phylogeny.

  6. Comparative genome-scale modelling of Staphylococcus aureus strains identifies strain-specific metabolic capabilities linked to pathogenicity

    PubMed Central

    Bosi, Emanuele; Monk, Jonathan M.; Aziz, Ramy K.; Fondi, Marco; Nizet, Victor; Palsson, Bernhard Ø.

    2016-01-01

    Staphylococcus aureus is a preeminent bacterial pathogen capable of colonizing diverse ecological niches within its human host. We describe here the pangenome of S. aureus based on analysis of genome sequences from 64 strains of S. aureus spanning a range of ecological niches, host types, and antibiotic resistance profiles. Based on this set, S. aureus is expected to have an open pangenome composed of 7,411 genes and a core genome composed of 1,441 genes. Metabolism was highly conserved in this core genome; however, differences were identified in amino acid and nucleotide biosynthesis pathways between the strains. Genome-scale models (GEMs) of metabolism were constructed for the 64 strains of S. aureus. These GEMs enabled a systems approach to characterizing the core metabolic and panmetabolic capabilities of the S. aureus species. All models were predicted to be auxotrophic for the vitamins niacin (vitamin B3) and thiamin (vitamin B1), whereas strain-specific auxotrophies were predicted for riboflavin (vitamin B2), guanosine, leucine, methionine, and cysteine, among others. GEMs were used to systematically analyze growth capabilities in more than 300 different growth-supporting environments. The results identified metabolic capabilities linked to pathogenic traits and virulence acquisitions. Such traits can be used to differentiate strains responsible for mild vs. severe infections and preference for hosts (e.g., animals vs. humans). Genome-scale analysis of multiple strains of a species can thus be used to identify metabolic determinants of virulence and increase our understanding of why certain strains of this deadly pathogen have spread rapidly throughout the world. PMID:27286824

  7. Millennial-scale ocean acidification and late Quaternary decline of cryptic bacterial crusts in tropical reefs.

    PubMed

    Riding, R; Liang, L; Braga, J C

    2014-09-01

    Ocean acidification by atmospheric carbon dioxide has increased almost continuously since the last glacial maximum (LGM), 21,000 years ago. It is expected to impair tropical reef development, but effects on reefs at the present day and in the recent past have proved difficult to evaluate. We present evidence that acidification has already significantly reduced the formation of calcified bacterial crusts in tropical reefs. Unlike major reef builders such as coralline algae and corals that more closely control their calcification, bacterial calcification is very sensitive to ambient changes in carbonate chemistry. Bacterial crusts in reef cavities have declined in thickness over the past 14,000 years with largest reduction occurring 12,000-10,000 years ago. We interpret this as an early effect of deglacial ocean acidification on reef calcification and infer that similar crusts were likely to have been thicker when seawater carbonate saturation was increased during earlier glacial intervals, and thinner during interglacials. These changes in crust thickness could have substantially affected reef development over glacial cycles, as rigid crusts significantly strengthen framework and their reduction would have increased the susceptibility of reefs to biological and physical erosion. Bacterial crust decline reveals previously unrecognized millennial-scale acidification effects on tropical reefs. This directs attention to the role of crusts in reef formation and the ability of bioinduced calcification to reflect changes in seawater chemistry. It also provides a long-term context for assessing anticipated anthropogenic effects.

  8. High-throughput generation, optimization and analysis of genome-scale metabolic models.

    SciTech Connect

    Henry, C. S.; DeJongh, M.; Best, A. A.; Frybarger, P. M.; Linsay, B.; Stevens, R. L.

    2010-09-01

    Genome-scale metabolic models have proven to be valuable for predicting organism phenotypes from genotypes. Yet efforts to develop new models are failing to keep pace with genome sequencing. To address this problem, we introduce the Model SEED, a web-based resource for high-throughput generation, optimization and analysis of genome-scale metabolic models. The Model SEED integrates existing methods and introduces techniques to automate nearly every step of this process, taking {approx}48 h to reconstruct a metabolic model from an assembled genome sequence. We apply this resource to generate 130 genome-scale metabolic models representing a taxonomically diverse set of bacteria. Twenty-two of the models were validated against available gene essentiality and Biolog data, with the average model accuracy determined to be 66% before optimization and 87% after optimization.

  9. Nonviral Genome Editing Based on a Polymer-Derivatized CRISPR Nanocomplex for Targeting Bacterial Pathogens and Antibiotic Resistance.

    PubMed

    Kang, Yoo Kyung; Kwon, Kyu; Ryu, Jea Sung; Lee, Ha Neul; Park, Chankyu; Chung, Hyun Jung

    2017-02-27

    The overuse of antibiotics plays a major role in the emergence and spread of multidrug-resistant bacteria. A molecularly targeted, specific treatment method for bacterial pathogens can prevent this problem by reducing the selective pressure during microbial growth. Herein, we introduce a nonviral treatment strategy delivering genome editing material for targeting antibacterial resistance. We apply the CRISPR-Cas9 system, which has been recognized as an innovative tool for highly specific and efficient genome engineering in different organisms, as the delivery cargo. We utilize polymer-derivatized Cas9, by direct covalent modification of the protein with cationic polymer, for subsequent complexation with single-guide RNA targeting antibiotic resistance. We show that nanosized CRISPR complexes (= Cr-Nanocomplex) were successfully formed, while maintaining the functional activity of Cas9 endonuclease to induce double-strand DNA cleavage. We also demonstrate that the Cr-Nanocomplex designed to target mecA-the major gene involved in methicillin resistance-can be efficiently delivered into Methicillin-resistant Staphylococcus aureus (MRSA), and allow the editing of the bacterial genome with much higher efficiency compared to using native Cas9 complexes or conventional lipid-based formulations. The present study shows for the first time that a covalently modified CRISPR system allows nonviral, therapeutic genome editing, and can be potentially applied as a target specific antimicrobial.

  10. Genome-wide survey of codons under diversifying selection in a highly recombining bacterial species, Helicobacter pylori

    PubMed Central

    Yahara, Koji; Furuta, Yoshikazu; Morimoto, Shinpei; Kikutake, Chie; Komukai, Sho; Matelska, Dorota; Dunin-Horkawicz, Stanisław; Bujnicki, Janusz M.; Uchiyama, Ikuo; Kobayashi, Ichizo

    2016-01-01

    Selection has been a central issue in biology in eukaryotes as well as prokaryotes. Inference of selection in recombining bacterial species, compared with clonal ones, has been a challenge. It is not known how codons under diversifying selection are distributed along the chromosome or among functional categories or how frequently such codons are subject to mutual homologous recombination. Here, we explored these questions by analysing genes present in >90% among 29 genomes of Helicobacter pylori, one of the bacterial species with the highest mutation and recombination rates. By a method for recombining sequences, we identified codons under diversifying selection (dN/dS > 1), which were widely distributed and accounted for ∼0.2% of all the codons of the genome. The codons were enriched in genes of host interaction/cell surface and genome maintenance (DNA replication, recombination, repair, and restriction modification system). The encoded amino acid residues were sometimes found adjacent to critical catalytic/binding residues in protein structures. Furthermore, by estimating the intensity of homologous recombination at a single nucleotide level, we found that these codons appear to be more frequently subject to recombination. We expect that the present study provides a new approach to population genomics of selection in recombining prokaryotes. PMID:26961370

  11. Co-evolution of genomic islands and their bacterial hosts revealed through phylogenetic analyses of 17 groups of homologous genomic islands.

    PubMed

    Guo, F-B; Wei, W; Wang, X L; Lin, H; Ding, H; Huang, J; Rao, N

    2012-10-15

    Horizontal gene transfer is an important mechanism for the evolution of microbial genomes, and many horizontal gene transfer events are facilitated by genomic islands (GIs). Until now, few reports have provided evidence for the co-evolution of horizontally transferred genes and their hosts. We obtained 17 groups of homologous GIs, all of which appear in 8 or more bacterial strains of the same species or genus. Using phylogenetic analyses, we found that the topological structure of a distance tree based on the proteins of each group of homologous GIs was consistent with that based on the complete proteomes of the hosts. This result clearly indicates that GIs and their bacterial hosts have co-evolved. In addition to presenting and providing evidence for a novel concept, i.e., the co-evolution of GIs and their bacterial hosts, we also describe a new and interesting detail for the phylogenetic analysis of horizontally transferred genes: consistent phylogenetic trees can be obtained by focusing on homologous GIs despite the commonly accepted theory that the phylogenies of horizontally transferred sequences and host organisms should be inconsistent.

  12. Large-scale computational and statistical analyses of high transcription potentialities in 32 prokaryotic genomes

    PubMed Central

    Sinoquet, Christine; Demey, Sylvain; Braun, Frédérique

    2008-01-01

    This article compares 32 bacterial genomes with respect to their high transcription potentialities. The σ70 promoter has been widely studied for Escherichia coli model and a consensus is known. Since transcriptional regulations are known to compensate for promoter weakness (i.e. when the promoter similarity with regard to the consensus is rather low), predicting functional promoters is a hard task. Instead, the research work presented here comes within the scope of investigating potentially high ORF expression, in relation with three criteria: (i) high similarity to the σ70 consensus (namely, the consensus variant appropriate for each genome), (ii) transcription strength reinforcement through a supplementary binding site—the upstream promoter (UP) element—and (iii) enhancement through an optimal Shine-Dalgarno (SD) sequence. We show that in the AT-rich Firmicutes’ genomes, frequencies of potentially strong σ70-like promoters are exceptionally high. Besides, though they contain a low number of strong promoters (SPs), some genomes may show a high proportion of promoters harbouring an UP element. Putative SPs of lesser quality are more frequently associated with an UP element than putative strong promoters of better quality. A meaningful difference is statistically ascertained when comparing bacterial genomes with similarly AT-rich genomes generated at random; the difference is the highest for Firmicutes. Comparing some Firmicutes genomes with similarly AT-rich Proteobacteria genomes, we confirm the Firmicutes specificity. We show that this specificity is neither explained by AT-bias nor genome size bias; neither does it originate in the abundance of optimal SD sequences, a typical and significant feature of Firmicutes more thoroughly analysed in our study. PMID:18440978

  13. Comparative Genomics Analysis and Phenotypic Characterization of Shewanella putrefaciens W3-18-1: Anaerobic Respiration, Bacterial Microcompartments, and Lateral Flagella

    SciTech Connect

    Qiu, D.; Tu, Q.; He, Zhili; Zhou, Jizhong

    2010-05-17

    Respiratory versatility and psychrophily are the hallmarks of Shewanella. The ability to utilize a wide range of electron acceptors for respiration is due to the large number of c-type cytochrome genes present in the genome of Shewanella strains. More recently the dissimilatory metal reduction of Shewanella species has been extensively and intensively studied for potential applications in the bioremediation of radioactive wastes of groundwater and subsurface environments. Multiple Shewanella genome sequences are now available in the public databases (Fredrickson et al., 2008). Most of the sequenced Shewanella strains were isolated from marine environments and this genus was believed to be of marine origin (Hau and Gralnick, 2007). However, the well-characterized model strain, S. oneidensis MR-1, was isolated from the freshwater lake sediment of Lake Oneida, New York (Myers and Nealson, 1988) and similar bacteria have also been isolated from other freshwater environments (Venkateswaran et al., 1999). Here we comparatively analyzed the genome sequence and physiological characteristics of S. putrefaciens W3-18-1 and S. oneidensis MR-1, isolated from the marine and freshwater lake sediments, respectively. The anaerobic respirations, carbon source utilization, and cell motility have been experimentally investigated. Large scale horizontal gene transfers have been revealed and the genetic divergence between these two strains was considered to be critical to the bacterial adaptation to specific habitats, freshwater or marine sediments.

  14. Efficient PAHs biodegradation by a bacterial consortium at flask and bioreactor scale.

    PubMed

    Moscoso, F; Teijiz, I; Deive, F J; Sanromán, M A

    2012-09-01

    In this work, the biodegradation of three polycyclic aromatic hydrocarbons (PAHs) such as Phenanthrene (PHE), Pyrene (PYR) and Benzo[a]anthracene (BaA) has been investigated. A bacterial consortium consisting of two strains was used for the first time based on preliminary promising biodegradation data. They were tentatively identified as Staphylococcus warneri and Bacillus pumilus. Degradation values higher than 85% were obtained for each single PAH when operating at flask scale, whereas minimum levels of 90% of PAHs removal were obtained after just 3 days of cultivation at bioreactor scale. The operation in cometabolic conditions led to maximum levels about 75% and 100% at flask and bioreactor scale, respectively. All the experimental data were analyzed in the light of logistic and Luedeking and Piret type models, with the purpose to better characterize the biodegradation process by S. warneri and B. pumilus. Finally, the metabolic pathway followed to degrade each PAH was ascertained.

  15. Decoding Synteny Blocks and Large-Scale Duplications in Mammalian and Plant Genomes

    NASA Astrophysics Data System (ADS)

    Peng, Qian; Alekseyev, Max A.; Tesler, Glenn; Pevzner, Pavel A.

    The existing synteny block reconstruction algorithms use anchors (e.g., orthologous genes) shared over all genomes to construct the synteny blocks for multiple genomes. This approach, while efficient for a few genomes, cannot be scaled to address the need to construct synteny blocks in many mammalian genomes that are currently being sequenced. The problem is that the number of anchors shared among all genomes quickly decreases with the increase in the number of genomes. Another problem is that many genomes (plant genomes in particular) had extensive duplications, which makes decoding of genomic architecture and rearrangement analysis in plants difficult. The existing synteny block generation algorithms in plants do not address the issue of generating non-overlapping synteny blocks suitable for analyzing rearrangements and evolution history of duplications. We present a new algorithm based on the A-Bruijn graph framework that overcomes these difficulties and provides a unified approach to synteny block reconstruction for multiple genomes, and for genomes with large duplications.

  16. Large-scale genomic analysis of ovarian carcinomas.

    PubMed

    Gorringe, Kylie L; Campbell, Ian G

    2009-04-01

    Epithelial ovarian cancers are typified by frequent genomic aberrations that have been difficult to unravel. Recently, high-resolution array technologies have provided the first glimpse of the remarkable complexity of these aberrations with some ovarian cancers containing hundreds of copy number breakpoints, micro-deletions and amplifications. Many of these alterations contain cancer-related genes suggesting that the majority is disease-associated and not just the product of random genomic instability. Future developments such as next-generation sequencing and integrated analysis of data from multiple array platforms on large numbers of samples are poised to revolutionize our understanding of this complex disease.

  17. Draft Genome Sequence of Criibacterium bergeronii gen. nov., sp. nov., Strain CCRI-22567T, Isolated from a Vaginal Sample from a Woman with Bacterial Vaginosis

    PubMed Central

    Maheux, Andrée F.; Bérubé, Ève; Boudreau, Dominique K.; Raymond, Frédéric; Corbeil, Jacques; Roy, Paul H.

    2016-01-01

    Criibacterium bergeronii gen. nov., sp. nov., CCRI-22567 is the type strain of the new genus Criibacterium. The strain was isolated from a woman with bacterial vaginosis. The genome assembly comprised 2,384,460 bp, with 34.4% G+C content. This is the first genome announcement of a strain belonging to the genus Criibacterium. PMID:27587833

  18. Draft Genome Sequence of Criibacterium bergeronii gen. nov., sp. nov., Strain CCRI-22567T, Isolated from a Vaginal Sample from a Woman with Bacterial Vaginosis.

    PubMed

    Maheux, Andrée F; Bérubé, Ève; Boudreau, Dominique K; Raymond, Frédéric; Corbeil, Jacques; Roy, Paul H; Boissinot, Maurice; Omar, Rabeea F

    2016-09-01

    Criibacterium bergeronii gen. nov., sp. nov., CCRI-22567 is the type strain of the new genus Criibacterium The strain was isolated from a woman with bacterial vaginosis. The genome assembly comprised 2,384,460 bp, with 34.4% G+C content. This is the first genome announcement of a strain belonging to the genus Criibacterium.

  19. Genomic Encyclopedia of Bacterial and Archaeal Type Strains, Phase III: the genomes of soil and plant-associated and newly described type strains.

    PubMed

    Whitman, William B; Woyke, Tanja; Klenk, Hans-Peter; Zhou, Yuguang; Lilburn, Timothy G; Beck, Brian J; De Vos, Paul; Vandamme, Peter; Eisen, Jonathan A; Garrity, George; Hugenholtz, Philip; Kyrpides, Nikos C

    2015-01-01

    The Genomic Encyclopedia of Bacteria and Archaea (GEBA) project was launched by the JGI in 2007 as a pilot project to sequence about 250 bacterial and archaeal genomes of elevated phylogenetic diversity. Herein, we propose to extend this approach to type strains of prokaryotes associated with soil or plants and their close relatives as well as type strains from newly described species. Understanding the microbiology of soil and plants is critical to many DOE mission areas, such as biofuel production from biomass, biogeochemistry, and carbon cycling. We are also targeting type strains of novel species while they are being described. Since 2006, about 630 new species have been described per year, many of which are closely aligned to DOE areas of interest in soil, agriculture, degradation of pollutants, biofuel production, biogeochemical transformation, and biodiversity.

  20. Pore-scale simulation of microbial growth using a genome-scale metabolic model: Implications for Darcy-scale reactive transport

    SciTech Connect

    Tartakovsky, Guzel D.; Tartakovsky, Alexandre M.; Scheibe, Timothy D.; Fang, Yilin; Mahadevan, Radhakrishnan; Lovley, Derek R.

    2013-09-07

    Recent advances in microbiology have enabled the quantitative simulation of microbial metabolism and growth based on genome-scale characterization of metabolic pathways and fluxes. We have incorporated a genome-scale metabolic model of the iron-reducing bacteria Geobacter sulfurreducens into a pore-scale simulation of microbial growth based on coupling of iron reduction to oxidation of a soluble electron donor (acetate). In our model, fluid flow and solute transport is governed by a combination of the Navier-Stokes and advection-diffusion-reaction equations. Microbial growth occurs only on the surface of soil grains where solid-phase mineral iron oxides are available. Mass fluxes of chemical species associated with microbial growth are described by the genome-scale microbial model, implemented using a constraint-based metabolic model, and provide the Robin-type boundary condition for the advection-diffusion equation at soil grain surfaces. Conventional models of microbially-mediated subsurface reactions use a lumped reaction model that does not consider individual microbial reaction pathways, and describe reactions rates using empirically-derived rate formulations such as the Monod-type kinetics. We have used our pore-scale model to explore the relationship between genome-scale metabolic models and Monod-type formulations, and to assess the manifestation of pore-scale variability (microenvironments) in terms of apparent Darcy-scale microbial reaction rates. The genome-scale model predicted lower biomass yield, and different stoichiometry for iron consumption, in comparisonto prior Monod formulations based on energetics considerations. We were able to fit an equivalent Monod model, by modifying the reaction stoichiometry and biomass yield coefficient, that could effectively match results of the genome-scale simulation of microbial behaviors under excess nutrient conditions, but predictions of the fitted Monod model deviated from those of the genome-scale model under

  1. Pore-scale simulation of microbial growth using a genome-scale metabolic model: Implications for Darcy-scale reactive transport

    NASA Astrophysics Data System (ADS)

    Scheibe, T. D.; Tartakovsky, G.; Tartakovsky, A. M.; Fang, Y.; Mahadevan, R.; Lovley, D. R.

    2012-12-01

    Recent advances in microbiology have enabled the quantitative simulation of microbial metabolism and growth based on genome-scale characterization of metabolic pathways and fluxes. We have incorporated a genome-scale metabolic model of the iron-reducing bacteria Geobacter sulfurreducens into a pore-scale simulation of microbial growth based on coupling of iron reduction to oxidation of a soluble electron donor (acetate). In our model, fluid flow and solute transport is governed by a combination of the Navier-Stokes and advection-diffusion-reaction equations. Microbial growth occurs only on the surface of soil grains where solid-phase mineral iron oxides are available. Mass fluxes of chemical species associated with microbial growth are described by the genome-scale microbial model, implemented using a constraint-based metabolic model, and provide the Robin-type boundary condition for the advection-diffusion equation at soil grain surfaces. Conventional models of microbially-mediated subsurface reactions use a lumped reaction model that does not consider individual microbial reaction pathways, and describe reactions rates using empirically-derived rate formulations such as the Monod-type kinetics. We have used our pore-scale model to explore the relationship between genome-scale metabolic models and Monod-type formulations, and to assess the manifestation of pore-scale variability (microenvironments) in terms of apparent Darcy-scale microbial reaction rates. The genome-scale model predicted lower biomass yield, and different stoichiometry for iron consumption, in comparison to prior Monod formulations based on energetics considerations. We were able to fit an equivalent Monod model, by modifying the reaction stoichiometry and biomass yield coefficient, that could effectively match results of the genome-scale simulation of microbial behaviors under excess nutrient conditions, but predictions of the fitted Monod model deviated from those of the genome-scale model

  2. Pore-scale simulation of microbial growth using a genome-scale metabolic model: Implications for Darcy-scale reactive transport

    NASA Astrophysics Data System (ADS)

    Tartakovsky, G. D.; Tartakovsky, A. M.; Scheibe, T. D.; Fang, Y.; Mahadevan, R.; Lovley, D. R.

    2013-09-01

    Recent advances in microbiology have enabled the quantitative simulation of microbial metabolism and growth based on genome-scale characterization of metabolic pathways and fluxes. We have incorporated a genome-scale metabolic model of the iron-reducing bacteria Geobacter sulfurreducens into a pore-scale simulation of microbial growth based on coupling of iron reduction to oxidation of a soluble electron donor (acetate). In our model, fluid flow and solute transport is governed by a combination of the Navier-Stokes and advection-diffusion-reaction equations. Microbial growth occurs only on the surface of soil grains where solid-phase mineral iron oxides are available. Mass fluxes of chemical species associated with microbial growth are described by the genome-scale microbial model, implemented using a constraint-based metabolic model, and provide the Robin-type boundary condition for the advection-diffusion equation at soil grain surfaces. Conventional models of microbially-mediated subsurface reactions use a lumped reaction model that does not consider individual microbial reaction pathways, and describe reactions rates using empirically-derived rate formulations such as the Monod-type kinetics. We have used our pore-scale model to explore the relationship between genome-scale metabolic models and Monod-type formulations, and to assess the manifestation of pore-scale variability (microenvironments) in terms of apparent Darcy-scale microbial reaction rates. The genome-scale model predicted lower biomass yield, and different stoichiometry for iron consumption, in comparison to prior Monod formulations based on energetics considerations. We were able to fit an equivalent Monod model, by modifying the reaction stoichiometry and biomass yield coefficient, that could effectively match results of the genome-scale simulation of microbial behaviors under excess nutrient conditions, but predictions of the fitted Monod model deviated from those of the genome-scale model

  3. Genome-scale functional profiling of the mammalian AP-1 signaling pathway

    PubMed Central

    Chanda, Sumit K.; White, Suhaila; Orth, Anthony P.; Reisdorph, Richard; Miraglia, Loren; Thomas, Russell S.; DeJesus, Paul; Mason, Daniel E.; Huang, Qihong; Vega, Raquel; Yu, De-Hua; Nelson, Christian G.; Smith, Brendan M.; Terry, Robert; Linford, Alicia S.; Yu, Yang; Chirn, Gung-wei; Song, Chuanzheng; Labow, Mark A.; Cohen, Dalia; King, Frederick J.; Peters, Eric C.; Schultz, Peter G.; Vogt, Peter K.; Hogenesch, John B.; Caldwell, Jeremy S.

    2003-01-01

    Large-scale functional genomics approaches are fundamental to the characterization of mammalian transcriptomes annotated by genome sequencing projects. Although current high-throughput strategies systematically survey either transcriptional or biochemical networks, analogous genome-scale investigations that analyze gene function in mammalian cells have yet to be fully realized. Through transient overexpression analysis, we describe the parallel interrogation of ≈20,000 sequence annotated genes in cancer-related signaling pathways. For experimental validation of these genome data, we apply an integrative strategy to characterize previously unreported effectors of activator protein-1 (AP-1) mediated growth and mitogenic response pathways. These studies identify the ADP-ribosylation factor GTPase-activating protein Centaurin α1 and a Tudor domain-containing hypothetical protein as putative AP-1 regulatory oncogenes. These results provide insight into the composition of the AP-1 signaling machinery and validate this approach as a tractable platform for genome-wide functional analysis. PMID:14514886

  4. SearchDOGS Bacteria, Software That Provides Automated Identification of Potentially Missed Genes in Annotated Bacterial Genomes

    PubMed Central

    ÓhÉigeartaigh, Seán S.; Armisén, David; Byrne, Kevin P.

    2014-01-01

    We report the development of SearchDOGS Bacteria, software to automatically detect missing genes in annotated bacterial genomes by combining BLAST searches with comparative genomics. Having successfully applied the approach to yeast genomes, we redeveloped SearchDOGS to function as a standalone, downloadable package, requiring only a set of GenBank annotation files as input. The software automatically generates a homology structure using reciprocal BLAST and a synteny-based method; this is followed by a scan of the entire genome of each species for unannotated genes. Results are provided in a HTML interface, providing coordinates, BLAST results, syntenic location, omega values (Ka/Ks, where Ks is the number of synonymous substitutions per synonymous site and Ka is the number of nonsynonymous substitutions per nonsynonymous site) for protein conservation estimates, and other information for each candidate gene. Using SearchDOGS Bacteria, we identified 155 gene candidates in the Shigella boydii sb227 genome, including 56 candidates of length < 60 codons. SearchDOGS Bacteria has two major advantages over currently available annotation software. First, it outperforms current methods in terms of sensitivity and is highly effective at identifying small or highly diverged genes. Second, as a freely downloadable package, it can be used with unpublished or confidential data. PMID:24659774

  5. Deciphering Cyanide-Degrading Potential of Bacterial Community Associated with the Coking Wastewater Treatment Plant with a Novel Draft Genome.

    PubMed

    Wang, Zhiping; Liu, Lili; Guo, Feng; Zhang, Tong

    2015-10-01

    Biotreatment processes fed with coking wastewater often encounter insufficient removal of pollutants, such as ammonia, phenols, and polycyclic aromatic hydrocarbons (PAHs), especially for cyanides. However, only a limited number of bacterial species in pure cultures have been confirmed to metabolize cyanides, which hinders the improvement of these processes. In this study, a microbial community of activated sludge enriched in a coking wastewater treatment plant was analyzed using 454 pyrosequencing and Illumina sequencing to characterize the potential cyanide-degrading bacteria. According to the classification of these pyro-tags, targeting V3/V4 regions of 16S rRNA gene, half of them were assigned to the family Xanthomonadaceae, implying that Xanthomonadaceae bacteria are well-adapted to coking wastewater. A nearly complete draft genome of the dominant bacterium was reconstructed from metagenome of this community to explore cyanide metabolism based on analysis of the genome. The assembled 16S rRNA gene from this draft genome showed that this bacterium was a novel species of Thermomonas within Xanthomonadaceae, which was further verified by comparative genomics. The annotation using KEGG and Pfam identified genes related to cyanide metabolism, including genes responsible for the iron-harvesting system, cyanide-insensitive terminal oxidase, cyanide hydrolase/nitrilase, and thiosulfate:cyanide transferase. Phylogenetic analysis showed that these genes had homologs in previously identified genomes of bacteria within Xanthomonadaceae and even presented similar gene cassettes, thus implying an inherent cyanide-decomposing potential. The findings of this study expand our knowledge about the bacterial degradation of cyanide compounds and will be helpful in the remediation of cyanides contamination.

  6. An empirical strategy for characterizing bacterial proteomes across species in the absence of genomic sequences

    SciTech Connect

    Turse, Joshua E.; Marshall, Matthew J.; Fredrickson, Jim K.; Lipton, Mary S.; Callister, Stephen J.

    2010-11-12

    Current methods in proteomics are dependent on the availability of sequenced genomes to identify proteins. However, genomic sequences are not always available for bacteria or microbial communities, even with high throughput sequencing technology becoming more readily available. Nevertheless, the homology that exists between related bacteria makes possible the extraction of meaningful biological information from an organism’s, or community’s proteome using the genomic sequence of a near neighbor. Here, a cross-organism search strategy was used to look at the amount of proteomics information obtainable with relative genetic distance from a near neighbor organism and to identify proteins in the proteome of minimally characterized environmental isolates. We conclude that closely related organisms with sequenced genomes, can be used to characterize proteomes of organisms with unsequenced genomes. In general, a cross-organism search strategy demonstrates the first step to use of sequences genomes to evaluate the proteomes of environmental bacteria and microbial communities that have no sequenced genome

  7. A genome-wide association study identifies a horizontally transferred bacterial surface adhesin gene associated with antimicrobial resistant strains

    PubMed Central

    Suzuki, Masato; Shibayama, Keigo; Yahara, Koji

    2016-01-01

    Carbapenems are a class of last-resort antibiotics; thus, the increase in bacterial carbapenem-resistance is a serious public health threat. Acinetobacter baumannii is one of the microorganisms that can acquire carbapenem-resistance; it causes severe nosocomial infection, and is notoriously difficult to control in hospitals. Recently, a machine-learning approach was first used to analyze the genome sequences of hundreds of susceptible and resistant A. baumannii strains, including those carrying commonly acquired resistant mechanisms, to build a classifier that can predict strain resistance. A complementary approach is to explore novel genetic elements that could be associated with the antimicrobial resistance of strains, independent of known mechanisms. Therefore, we carefully selected A. baumannii strains, spanning various genotypes, from public genome databases, and conducted the first genome-wide association study (GWAS) of carbapenem resistance. We employed a recently developed method, capable of identifying any kind of genetic variation and accounting for bacterial population structure, and evaluated its effectiveness. Our study identified a surface adhesin gene that had been horizontally transferred to an ancestral branch of A. baumannii, as well as a specific region of that gene that appeared to accumulate multiple individual variations across the different branches of carbapenem-resistant A. baumannii strains. PMID:27892531

  8. Accomplishments in genome-scale in silico modeling for industrial and medical biotechnology.

    PubMed

    Milne, Caroline B; Kim, Pan-Jun; Eddy, James A; Price, Nathan D

    2009-12-01

    Driven by advancements in high-throughput biological technologies and the growing number of sequenced genomes, the construction of in silico models at the genome scale has provided powerful tools to investigate a vast array of biological systems and applications. Here, we review comprehensively the uses of such models in industrial and medical biotechnology, including biofuel generation, food production, and drug development. While the use of in silico models is still in its early stages for delivering to industry, significant initial successes have been achieved. For the cases presented here, genome-scale models predict engineering strategies to enhance properties of interest in an organism or to inhibit harmful mechanisms of pathogens. Going forward, genome-scale in silico models promise to extend their application and analysis scope to become a trans-formative tool in biotechnology.

  9. Genomic Survey of Pathogenicity Determinants and VNTR Markers in the Cassava Bacterial Pathogen Xanthomonas axonopodis pv. Manihotis Strain CIO151

    PubMed Central

    Arrieta-Ortiz, Mario L.; Rodríguez-R, Luis M.; Pérez-Quintero, Álvaro L.; Poulin, Lucie; Díaz, Ana C.; Arias Rojas, Nathalia; Trujillo, Cesar; Restrepo Benavides, Mariana; Bart, Rebecca; Boch, Jens; Boureau, Tristan; Darrasse, Armelle; David, Perrine; Dugé de Bernonville, Thomas; Fontanilla, Paula; Gagnevin, Lionel; Guérin, Fabien; Jacques, Marie-Agnès; Lauber, Emmanuelle; Lefeuvre, Pierre; Medina, Cesar; Medina, Edgar; Montenegro, Nathaly; Muñoz Bodnar, Alejandra; Noël, Laurent D.; Ortiz Quiñones, Juan F.; Osorio, Daniela; Pardo, Carolina; Patil, Prabhu B.; Poussier, Stéphane; Pruvost, Olivier; Robène-Soustrade, Isabelle; Ryan, Robert P.; Tabima, Javier; Urrego Morales, Oscar G.; Vernière, Christian; Carrere, Sébastien; Verdier, Valérie; Szurek, Boris; Restrepo, Silvia; López, Camilo

    2013-01-01

    Xanthomonas axonopodis pv. manihotis (Xam) is the causal agent of bacterial blight of cassava, which is among the main components of human diet in Africa and South America. Current information about the molecular pathogenicity factors involved in the infection process of this organism is limited. Previous studies in other bacteria in this genus suggest that advanced draft genome sequences are valuable resources for molecular studies on their interaction with plants and could provide valuable tools for diagnostics and detection. Here we have generated the first manually annotated high-quality draft genome sequence of Xam strain CIO151. Its genomic structure is similar to that of other xanthomonads, especially Xanthomonas euvesicatoria and Xanthomonas citri pv. citri species. Several putative pathogenicity factors were identified, including type III effectors, cell wall-degrading enzymes and clusters encoding protein secretion systems. Specific characteristics in this genome include changes in the xanthomonadin cluster that could explain the lack of typical yellow color in all strains of this pathovar and the presence of 50 regions in the genome with atypical nucleotide composition. The genome sequence was used to predict and evaluate 22 variable number of tandem repeat (VNTR) loci that were subsequently demonstrated as polymorphic in representative Xam strains. Our results demonstrate that Xanthomonas axonopodis pv. manihotis strain CIO151 possesses ten clusters of pathogenicity factors conserved within the genus Xanthomonas. We report 126 genes that are potentially unique to Xam, as well as potential horizontal transfer events in the history of the genome. The relation of these regions with virulence and pathogenicity could explain several aspects of the biology of this pathogen, including its ability to colonize both vascular and non-vascular tissues of cassava plants. A set of 16 robust, polymorphic VNTR loci will be useful to develop a multi-locus VNTR analysis

  10. Genomic survey of pathogenicity determinants and VNTR markers in the cassava bacterial pathogen Xanthomonas axonopodis pv. Manihotis strain CIO151.

    PubMed

    Arrieta-Ortiz, Mario L; Rodríguez-R, Luis M; Pérez-Quintero, Álvaro L; Poulin, Lucie; Díaz, Ana C; Arias Rojas, Nathalia; Trujillo, Cesar; Restrepo Benavides, Mariana; Bart, Rebecca; Boch, Jens; Boureau, Tristan; Darrasse, Armelle; David, Perrine; Dugé de Bernonville, Thomas; Fontanilla, Paula; Gagnevin, Lionel; Guérin, Fabien; Jacques, Marie-Agnès; Lauber, Emmanuelle; Lefeuvre, Pierre; Medina, Cesar; Medina, Edgar; Montenegro, Nathaly; Muñoz Bodnar, Alejandra; Noël, Laurent D; Ortiz Quiñones, Juan F; Osorio, Daniela; Pardo, Carolina; Patil, Prabhu B; Poussier, Stéphane; Pruvost, Olivier; Robène-Soustrade, Isabelle; Ryan, Robert P; Tabima, Javier; Urrego Morales, Oscar G; Vernière, Christian; Carrere, Sébastien; Verdier, Valérie; Szurek, Boris; Restrepo, Silvia; López, Camilo; Koebnik, Ralf; Bernal, Adriana

    2013-01-01

    Xanthomonas axonopodis pv. manihotis (Xam) is the causal agent of bacterial blight of cassava, which is among the main components of human diet in Africa and South America. Current information about the molecular pathogenicity factors involved in the infection process of this organism is limited. Previous studies in other bacteria in this genus suggest that advanced draft genome sequences are valuable resources for molecular studies on their interaction with plants and could provide valuable tools for diagnostics and detection. Here we have generated the first manually annotated high-quality draft genome sequence of Xam strain CIO151. Its genomic structure is similar to that of other xanthomonads, especially Xanthomonas euvesicatoria and Xanthomonas citri pv. citri species. Several putative pathogenicity factors were identified, including type III effectors, cell wall-degrading enzymes and clusters encoding protein secretion systems. Specific characteristics in this genome include changes in the xanthomonadin cluster that could explain the lack of typical yellow color in all strains of this pathovar and the presence of 50 regions in the genome with atypical nucleotide composition. The genome sequence was used to predict and evaluate 22 variable number of tandem repeat (VNTR) loci that were subsequently demonstrated as polymorphic in representative Xam strains. Our results demonstrate that Xanthomonas axonopodis pv. manihotis strain CIO151 possesses ten clusters of pathogenicity factors conserved within the genus Xanthomonas. We report 126 genes that are potentially unique to Xam, as well as potential horizontal transfer events in the history of the genome. The relation of these regions with virulence and pathogenicity could explain several aspects of the biology of this pathogen, including its ability to colonize both vascular and non-vascular tissues of cassava plants. A set of 16 robust, polymorphic VNTR loci will be useful to develop a multi-locus VNTR analysis

  11. LLNL Genomic Assessment: Viral and Bacterial Sequencing Needs for TMTI, Tier 1 Report

    SciTech Connect

    Slezak, T; Borucki, M; Lenhoff, R; Vitalis, E

    2009-09-29

    The Lawrence Livermore National Lab Bioinformatics group has recently taken on a role in DTRA's Transformation Medical Technologies Initiative (TMTI). The high-level goal of TMTI is to accelerate the development of broad-spectrum countermeasures. To achieve those goals, TMTI has a near term need to obtain more sequence information across a large range of pathogens, near neighbors, and across a broad geographical and host range. Our role in this project is to research available sequence data for the organisms of interest and identify critical microbial sequence and knowledge gaps that need to be filled to meet TMTI objectives. This effort includes: (1) assessing current genomic sequence for each agent including phylogenetic and geographical diversity, host range, date of isolation range, virulence, sequence availability of key near neighbors, and other characteristics; (2) identifying Subject Matter Experts (SME's) and potential holders of isolate collections, contacting appropriate SME's with known expertise and isolate collections to obtain information on isolate availability and specific recommendations; (3) identifying sequence as well as knowledge gaps (eg virulence, host range, and antibiotic resistance determinants); (4) providing specific recommendations as to the most valuable strains to be placed on the DTRA sequencing queue. We acknowledge that criteria for prioritization of isolates for sequencing falls into two categories aligning with priority queues 1 and 2 as described in the summary. (Priority queue 0 relates to DTRA operational isolates whose availability is not predictable in advance.) 1. Selection of isolates that appear to have likelihood to provide information on virulence and antibiotic resistance. This will include sequence of known virulent strains. Particularly valuable would be virulent strains that have genetically similar yet avirulent, or non human transmissible, counterparts that can be used for comparison to help identify key

  12. The channel catfish genome sequence provides insights into the evolution of scale formation in teleosts

    PubMed Central

    Liu, Zhanjiang; Liu, Shikai; Yao, Jun; Bao, Lisui; Zhang, Jiaren; Li, Yun; Jiang, Chen; Sun, Luyang; Wang, Ruijia; Zhang, Yu; Zhou, Tao; Zeng, Qifan; Fu, Qiang; Gao, Sen; Li, Ning; Koren, Sergey; Jiang, Yanliang; Zimin, Aleksey; Xu, Peng; Phillippy, Adam M.; Geng, Xin; Song, Lin; Sun, Fanyue; Li, Chao; Wang, Xiaozhu; Chen, Ailu; Jin, Yulin; Yuan, Zihao; Yang, Yujia; Tan, Suxu; Peatman, Eric; Lu, Jianguo; Qin, Zhenkui; Dunham, Rex; Li, Zhaoxia; Sonstegard, Tad; Feng, Jianbin; Danzmann, Roy G.; Schroeder, Steven; Scheffler, Brian; Duke, Mary V.; Ballard, Linda; Kucuktas, Huseyin; Kaltenboeck, Ludmilla; Liu, Haixia; Armbruster, Jonathan; Xie, Yangjie; Kirby, Mona L.; Tian, Yi; Flanagan, Mary Elizabeth; Mu, Weijie; Waldbieser, Geoffrey C.

    2016-01-01

    Catfish represent 12% of teleost or 6.3% of all vertebrate species, and are of enormous economic value. Here we report a high-quality reference genome sequence of channel catfish (Ictalurus punctatus), the major aquaculture species in the US. The reference genome sequence was validated by genetic mapping of 54,000 SNPs, and annotated with 26,661 predicted protein-coding genes. Through comparative analysis of genomes and transcriptomes of scaled and scaleless fish and scale regeneration experiments, we address the genomic basis for the most striking physical characteristic of catfish, the evolutionary loss of scales and provide evidence that lack of secretory calcium-binding phosphoproteins accounts for the evolutionary loss of scales in catfish. The channel catfish reference genome sequence, along with two additional genome sequences and transcriptomes of scaled catfishes, provide crucial resources for evolutionary and biological studies. This work also demonstrates the power of comparative subtraction of candidate genes for traits of structural significance. PMID:27249958

  13. Genome-Scale Metabolic Modeling in the Simulation of Field-Scale Uranium Bioremediation

    NASA Astrophysics Data System (ADS)

    Yabusaki, S.; Wilkins, M.; Fang, Y.; Williams, K. H.; Waichler, S.; Long, P. E.

    2015-12-01

    Coupled variably saturated flow and biogeochemical reactive transport modeling is used to improve understanding of the processes, properties, and conditions controlling uranium bio-immobilization in a field experiment where uranium-contaminated groundwater was amended with acetate and bicarbonate. The acetate stimulates indigenous microorganisms that catalyze metal reduction, including the conversion of aqueous U(VI) to solid-phase U(IV), which effectively removes uranium from solution. The initiation of the bicarbonate amendment prior to biostimulation was designed to promote U(VI) desorption that would increase the aqueous U(VI) available for bioreduction. The three-dimensional simulations were able to largely reproduce the timing and magnitude of the physical, chemical and biological responses to the acetate and bicarbonate amendment in the context of changing water table elevation and gradient. A time series of groundwater proteomic samples exhibited correlations between the most abundant Geobacter metallireducens proteins and the genome-scale metabolic model-predicted fluxes of intra-cellular reactions associated with each of those proteins. The desorption of U(VI) induced by the bicarbonate amendment led to initially higher rates of bioreduction compared to locations with minimal bicarbonate exposure. After bicarbonate amendment ceased, bioreduction continued at these locations whereas U(VI) sorption was the dominant removal mechanism at the bicarbonate-impacted sites.

  14. Cycling Transcriptional Networks Optimize Energy Utilization on a Genome Scale.

    PubMed

    Wang, Guang-Zhong; Hickey, Stephanie L; Shi, Lei; Huang, Hung-Chung; Nakashe, Prachi; Koike, Nobuya; Tu, Benjamin P; Takahashi, Joseph S; Konopka, Genevieve

    2015-12-01

    Genes expressing circadian RNA rhythms are enriched for metabolic pathways, but the adaptive significance of cyclic gene expression remains unclear. We estimated the genome-wide synthetic and degradative cost of transcription and translation in three organisms and found that the cost of cycling genes is strikingly higher compared to non-cycling genes. Cycling genes are expressed at high levels and constitute the most costly proteins to synthesize in the genome. We demonstrate that metabolic cycling is accelerated in yeast grown under higher nutrient flux and the number of cycling genes increases ∼40%, which are achieved by increasing the amplitude and not the mean level of gene expression. These results suggest that rhythmic gene expression optimizes the metabolic cost of global gene expression and that highly expressed genes have been selected to be downregulated in a cyclic manner for energy conservation.

  15. Scaling up genome annotation using MAKER and work queue.

    PubMed

    Thrasher, Andrew; Musgrave, Zachary; Kachmarck, Brian; Thain, Douglas; Emrich, Scott

    2014-01-01

    Next generation sequencing technologies have enabled sequencing many genomes. Because of the overall increasing demand and the inherent parallelism available in many required analyses, these bioinformatics applications should ideally run on clusters, clouds and/or grids. We present a modified annotation framework that achieves a speed-up of 45x using 50 workers using a Caenorhabditis japonica test case. We also evaluate these modifications within the Amazon EC2 cloud framework. The underlying genome annotation (MAKER) is parallelised as an MPI application. Our framework enables it to now run without MPI while utilising a wide variety of distributed computing resources. This parallel framework also allows easy explicit data transfer, which helps overcome a major limitation of bioinformatics tools that often rely on shared file systems. Combined, our proposed framework can be used, even during early stages of development, to easily run sequence analysis tools on clusters, grids and clouds.

  16. High-throughput genomic sequencing of cassava bacterial blight strains identifies conserved effectors to target for durable resistance.

    PubMed

    Bart, Rebecca; Cohn, Megan; Kassen, Andrew; McCallum, Emily J; Shybut, Mikel; Petriello, Annalise; Krasileva, Ksenia; Dahlbeck, Douglas; Medina, Cesar; Alicai, Titus; Kumar, Lava; Moreira, Leandro M; Rodrigues Neto, Júlio; Verdier, Valerie; Santana, María Angélica; Kositcharoenkul, Nuttima; Vanderschuren, Hervé; Gruissem, Wilhelm; Bernal, Adriana; Staskawicz, Brian J

    2012-07-10

    Cassava bacterial blight (CBB), incited by Xanthomonas axonopodis pv. manihotis (Xam), is the most important bacterial disease of cassava, a staple food source for millions of people in developing countries. Here we present a widely applicable strategy for elucidating the virulence components of a pathogen population. We report Illumina-based draft genomes for 65 Xam strains and deduce the phylogenetic relatedness of Xam across the areas where cassava is grown. Using an extensive database of effector proteins from animal and plant pathogens, we identify the effector repertoire for each sequenced strain and use a comparative sequence analysis to deduce the least polymorphic of the conserved effectors. These highly conserved effectors have been maintained over 11 countries, three continents, and 70 y of evolution and as such represent ideal targets for developing resistance strategies.

  17. Direct-to-consumer genomics on the scales of autonomy.

    PubMed

    Vayena, Effy

    2015-04-01

    Direct-to-consumer (DTC) genetic services have generated enormous controversy from their first emergence. A dramatic recent manifestation of this is the Food and Drug Administration's (FDA) cease and desist order against 23andMe, the leading provider in the market. Critics have argued for the restrictive regulation of such services, and even their prohibition, on the grounds of the harm they pose to consumers. Their advocates, by contrast, defend them as a means of enhancing the autonomy of those same consumers. Autonomy emerges as a key battle-field in this debate, because many of the 'harm' arguments can be interpreted as identifying threats to autonomy. This paper assesses whether DTC genomic services are a threat to, or instead, an enhancement of, personal autonomy. It deploys Joseph Raz's account of personal autonomy, with its emphasis on choice from a range of valuable options. It then seeks to counter claims that DTC genomics threatens autonomy because it involves manipulation in contravention of consumers' independence or because it does not generate valuable options which can be meaningfully engaged with by consumers. It is stressed that the value of the options generated by DTC genomics should not be judged exclusively from the perspective of medical actionability, but should take into consideration plural utilities. Finally, the paper ends by broaching policy recommendations, suggesting that there is a strong autonomy-based argument for permitting DTC genomic services, and that the key question is the nature of the regulatory conditions under which they should be permitted. The discussion of autonomy in this paper helps illuminate some of these conditions.

  18. Mapping copy number variation by population-scale genome sequencing.

    PubMed

    Mills, Ryan E; Walter, Klaudia; Stewart, Chip; Handsaker, Robert E; Chen, Ken; Alkan, Can; Abyzov, Alexej; Yoon, Seungtai Chris; Ye, Kai; Cheetham, R Keira; Chinwalla, Asif; Conrad, Donald F; Fu, Yutao; Grubert, Fabian; Hajirasouliha, Iman; Hormozdiari, Fereydoun; Iakoucheva, Lilia M; Iqbal, Zamin; Kang, Shuli; Kidd, Jeffrey M; Konkel, Miriam K; Korn, Joshua; Khurana, Ekta; Kural, Deniz; Lam, Hugo Y K; Leng, Jing; Li, Ruiqiang; Li, Yingrui; Lin, Chang-Yun; Luo, Ruibang; Mu, Xinmeng Jasmine; Nemesh, James; Peckham, Heather E; Rausch, Tobias; Scally, Aylwyn; Shi, Xinghua; Stromberg, Michael P; Stütz, Adrian M; Urban, Alexander Eckehart; Walker, Jerilyn A; Wu, Jiantao; Zhang, Yujun; Zhang, Zhengdong D; Batzer, Mark A; Ding, Li; Marth, Gabor T; McVean, Gil; Sebat, Jonathan; Snyder, Michael; Wang, Jun; Ye, Kenny; Eichler, Evan E; Gerstein, Mark B; Hurles, Matthew E; Lee, Charles; McCarroll, Steven A; Korbel, Jan O

    2011-02-03

    Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is, copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications. Most SVs (53%) were mapped to nucleotide resolution, which facilitated analysing their origin and functional impact. We examined numerous whole and partial gene deletions with a genotyping approach and observed a depletion of gene disruptions amongst high frequency deletions. Furthermore, we observed differences in the size spectra of SVs originating from distinct formation mechanisms, and constructed a map of SV hotspots formed by common mechanisms. Our analytical framework and SV map serves as a resource for sequencing-based association studies.

  19. Symmetry and scale orient Min protein patterns in shaped bacterial sculptures

    PubMed Central

    Wu, Fabai; van Schie, Bas G.C.; Keymer, Juan E.; Dekker, Cees

    2016-01-01

    The boundary of a cell defines the shape and scale for its subcellular organisation. However, the effects of the cell’s spatial boundaries as well as the geometry sensing and scale adaptation of intracellular molecular networks remain largely unexplored. Here, we show that living bacterial cells can be ‘sculpted’ into defined shapes, such as squares and rectangles, which are used to explore the spatial adaptation of Min proteins that oscillate pole-to-pole in rod-shape Escherichia coli to assist cell division. In a wide geometric parameter space, ranging from 2x1x1 to 11x6x1 μm3, Min proteins exhibit versatile oscillation patterns, sustaining rotational, longitudinal, diagonal, stripe, and even transversal modes. These patterns are found to directly capture the symmetry and scale of the cell boundary, and the Min concentration gradients scale in adaptation to the cell size within a characteristic length range of 3–6 μm. Numerical simulations reveal that local microscopic Turing kinetics of Min proteins can yield global symmetry selection, gradient scaling, and an adaptive range, when and only when facilitated by the three-dimensional confinement of cell boundary. These findings cannot be explained by previous geometry-sensing models based on the longest distance, membrane area or curvature, and reveal that spatial boundaries can facilitate simple molecular interactions to result in far more versatile functions than previously understood. PMID:26098227

  20. Symmetry and scale orient Min protein patterns in shaped bacterial sculptures

    NASA Astrophysics Data System (ADS)

    Wu, Fabai; van Schie, Bas G. C.; Keymer, Juan E.; Dekker, Cees

    2015-08-01

    The boundary of a cell defines the shape and scale of its subcellular organization. However, the effects of the cell's spatial boundaries as well as the geometry sensing and scale adaptation of intracellular molecular networks remain largely unexplored. Here, we show that living bacterial cells can be ‘sculpted’ into defined shapes, such as squares and rectangles, which are used to explore the spatial adaptation of Min proteins that oscillate pole-to-pole in rod-shaped Escherichia coli to assist cell division. In a wide geometric parameter space, ranging from 2 × 1 × 1 to 11 × 6 × 1 μm3, Min proteins exhibit versatile oscillation patterns, sustaining rotational, longitudinal, diagonal, stripe and even transversal modes. These patterns are found to directly capture the symmetry and scale of the cell boundary, and the Min concentration gradients scale with the cell size within a characteristic length range of 3-6 μm. Numerical simulations reveal that local microscopic Turing kinetics of Min proteins can yield global symmetry selection, gradient scaling and an adaptive range, when and only when facilitated by the three-dimensional confinement of the cell boundary. These findings cannot be explained by previous geometry-sensing models based on the longest distance, membrane area or curvature, and reveal that spatial boundaries can facilitate simple molecular interactions to result in far more versatile functions than previously understood.

  1. Dynamics of bacterial communities before and after distribution in a full-scale drinking water network.

    PubMed

    El-Chakhtoura, Joline; Prest, Emmanuelle; Saikaly, Pascal; van Loosdrecht, Mark; Hammes, Frederik; Vrouwenvelder, Hans

    2015-05-01

    Understanding the biological stability of drinking water distribution systems is imperative in the framework of process control and risk management. The objective of this research was to examine the dynamics of the bacterial community during drinking water distribution at high temporal resolution. Water samples (156 in total) were collected over short time-scales (minutes/hours/days) from the outlet of a treatment plant and a location in its corresponding distribution network. The drinking water is treated by biofiltration and disinfectant residuals are absent during distribution. The community was analyzed by 16S rRNA gene pyrosequencing and flow cytometry as well as conventional, culture-based methods. Despite a random dramatic event (detected with pyrosequencing and flow cytometry but not with plate counts), the bacterial community profile at the two locations did not vary significantly over time. A diverse core microbiome was shared between the two locations (58-65% of the taxa and 86-91% of the sequences) and found to be dependent on the treatment strategy. The bacterial community structure changed during distribution, with greater richness detected in the network and phyla such as Acidobacteria and Gemmatimonadetes becoming abundant. The rare taxa displayed the highest dynamicity, causing the major change during water distribution. This change did not have hygienic implications and is contingent on the sensitivity of the applied methods. The concept of biological stability therefore needs to be revised. Biostability is generally desired in drinking water guidelines but may be difficult to achieve in large-scale complex distribution systems that are inherently dynamic.

  2. TIGER: Toolbox for integrating genome-scale metabolic models, expression data, and transcriptional regulatory networks

    PubMed Central

    2011-01-01

    Background Several methods have been developed for analyzing genome-scale models of metabolism and transcriptional regulation. Many of these methods, such as Flux Balance Analysis, use constrained optimization to predict relationships between metabolic flux and the genes that encode and regulate enzyme activity. Recently, mixed integer programming has been used to encode these gene-protein-reaction (GPR) relationships into a single optimization problem, but these techniques are often of limited generality and lack a tool for automating the conversion of rules to a coupled regulatory/metabolic model. Results We present TIGER, a Toolbox for Integrating Genome-scale Metabolism, Expression, and Regulation. TIGER converts a series of generalized, Boolean or multilevel rules into a set of mixed integer inequalities. The package also includes implementations of existing algorithms to integrate high-throughput expression data with genome-scale models of metabolism and transcriptional regulation. We demonstrate how TIGER automates the coupling of a genome-scale metabolic model with GPR logic and models of transcriptional regulation, thereby serving as a platform for algorithm development and large-scale metabolic analysis. Additionally, we demonstrate how TIGER's algorithms can be used to identify inconsistencies and improve existing models of transcriptional regulation with examples from the reconstructed transcriptional regulatory network of Saccharomyces cerevisiae. Conclusion The TIGER package provides a consistent platform for algorithm development and extending existing genome-scale metabolic models with regulatory networks and high-throughput data. PMID:21943338

  3. On the road to synthetic life: the minimal cell and genome-scale engineering.

    PubMed

    Juhas, Mario

    2016-01-01

    Synthetic biology employs rational engineering principles to build biological systems from the libraries of standard, well characterized biological parts. Biological systems designed and built by synthetic biologists fulfill a plethora of useful purposes, ranging from better healthcare and energy production to biomanufacturing. Recent advancements in the synthesis, assembly and "booting-up" of synthetic genomes and in low and high-throughput genome engineering have paved the way for engineering on the genome-wide scale. One of the key goals of genome engineering is the construction of minimal genomes consisting solely of essential genes (genes indispensable for survival of living organisms). Besides serving as a toolbox to understand the universal principles of life, the cell encoded by minimal genome could be used to build a stringently controlled "cell factory" with a desired phenotype. This review provides an update on recent advances in the genome-scale engineering with particular emphasis on the engineering of minimal genomes. Furthermore, it presents an ongoing discussion to the scientific community for better suitability of minimal or robust cells for industrial applications.

  4. The Genomic Sequence of the Oral Pathobiont Strain NI1060 Reveals Unique Strategies for Bacterial Competition and Pathogenicity

    PubMed Central

    Jiao, Yizu; Hasegawa, Mizuho; Moon, Henry; Núñez, Gabriel; Inohara, Naohiro; Raes, Jeroen

    2016-01-01

    Strain NI1060 is an oral bacterium responsible for periodontitis in a murine ligature-induced disease model. To better understand its pathogenicity, we have determined the complete sequence of its 2,553,982 bp genome. Although closely related to Pasteurella pneumotropica, a pneumonia-associated rodent commensal based on its 16S rRNA, the NI1060 genomic content suggests that they are different species thriving on different energy sources via alternative metabolic pathways. Genomic and phylogenetic analyses showed that strain NI1060 is distinct from the genera currently described in the family Pasteurellaceae, and is likely to represent a novel species. In addition, we found putative virulence genes involved in lipooligosaccharide synthesis, adhesins and bacteriotoxic proteins. These genes are potentially important for host adaption and for the induction of dysbiosis through bacterial competition and pathogenicity. Importantly, strain NI1060 strongly stimulates Nod1, an innate immune receptor, but is defective in two peptidoglycan recycling genes due to a frameshift mutation. The in-depth analysis of its genome thus provides critical insights for the development of NI1060 as a prime model system for infectious disease. PMID:27409077

  5. GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing.

    PubMed

    Wang, Xuewen; Wang, Le

    2016-01-01

    Simple sequence repeats (SSRs), also referred to as microsatellites, are highly variable tandem DNAs that are widely used as genetic markers. The increasing availability of whole-genome and transcript sequences provides information resources for SSR marker development. However, efficient software is required to efficiently identify and display SSR information along with other gene features at a genome scale. We developed novel software package Genome-wide Microsatellite Analyzing Tool Package (GMATA) integrating SSR mining, statistical analysis and plotting, marker design, polymorphism screening and marker transferability, and enabled simultaneously display SSR markers with other genome features. GMATA applies novel strategies for SSR analysis and primer design in large genomes, which allows GMATA to perform faster calculation and provides more accurate results than existing tools. Our package is also capable of processing DNA sequences of any size on a standard computer. GMATA is user friendly, only requires mouse clicks or types inputs on the command line, and is executable in multiple computing platforms. We demonstrated the application of GMATA in plants genomes and reveal a novel distribution pattern of SSRs in 15 grass genomes. The most abundant motifs are dimer GA/TC, the A/T monomer and the GCG/CGC trimer, rather than the rich G/C content in DNA sequence. We also revealed that SSR count is a linear to the chromosome length in fully assembled grass genomes. GMATA represents a powerful application tool that facilitates genomic sequence analyses. GAMTA is freely available at http://sourceforge.net/projects/gmata/?source=navbar.

  6. Effect of sludge age on the bacterial diversity of bench scale sequencing batch reactors.

    PubMed

    Akarsubasi, Alper Tunga; Eyice, Ozge; Miskin, Ian; Head, Ian M; Curtis, Thomas P

    2009-04-15

    Sludge age or mean cell residence time (MCRT) plays a crucial role in design and operation of wastewater treatment plants. The change in performance, for example micropollutant removal, associated with changes in MCRT is often attributed to changes in microbial diversity. We operated four identical laboratory-scale sequencing batch reactors (two test and two control) in parallel for 212 days. Sludge age was decreased gradually (from 10.4to 2.6 days) in experimental reactors whereas it was kept constant (10.4 days) in control reactors. The reactor performance and biomass changed in a manner consistent with our understanding of the effect of sludge age on a reactors performance: the effluent quality and biomass declined with decreasing MCRT. The composition of the bacterial and ammonia-oxidizing bacterial communities in four reactors was analyzed using denaturing gradient gel electrophoresis (DGGE), and similarities in band patterns were measured using the Dice coefficient. The overall similarity between the communities in reactors run at different sludge ages was indistinguishable from the similarity in communities in reactors run at identical sludge ages. This was true for both the general bacterial communities and putative AOB communities. The number of detectable bands in DGGE profiles was also unaffected by sludge age (p approximately 0.5 in both cases). Initially, the detectable diversity of activated sludge communities in all four reactors clustered with time, regardless of their designation or sludge age; however, these clusters were only weakly supported by bootstrap analysis. However, after 135 days, a sludge age specific clustering was observed in the bacterial community but not the putative ammonia-oxidizing bacterial community. The mean self-similarity of each reactor decreased, variance increased, and the number of detectable bands in DGGE profiles decreased over time in all reactors. The changes observed with time are consistent with ecological drift

  7. RNA-guided CRISPR-Cas technologies for genome-scale investigation of disease processes.

    PubMed

    Humphrey, Sean E; Kasinski, Andrea L

    2015-04-02

    From its discovery as an adaptive bacterial and archaea immune system, the clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system has quickly been developed into a powerful and groundbreaking programmable nuclease technology for the global and precise editing of the genome in cells. This system allows for comprehensive unbiased functional studies and is already advancing the field by revealing genes that have previously unknown roles in disease processes. In this review, we examine and compare recently developed CRISPR-Cas platforms for global genome editing and examine the advancements these platforms have made in guide RNA design, guide RNA/Cas9 interaction, on-target specificity, and target sequence selection. We also explore some of the exciting therapeutic potentials of the CRISPR-Cas technology as well as some of the innovative new uses of this technology beyond genome editing.

  8. FDA Bioinformatics Tool for Microbial Genomics Research on Molecular Characterization of Bacterial Foodborne Pathogens Using Microarrays

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Advances in microbial genomics and bioinformatics are offering greater insights into the emergence and spread of foodborne pathogens in outbreak scenarios. The Food and Drug Administration (FDA) has developed the genomics tool ArrayTrackTM, which provides extensive functionalities to man...

  9. Genome Sequences of 12 Bacterial Isolates Obtained from the Urine of Pregnant Women

    PubMed Central

    Weimer, Cory M.; Deitzler, Grace E.; Robinson, Lloyd S.; Park, SoEun; Hallsworth-Pepin, Kymberlie; Wollam, Aye; Mitreva, Makedonka

    2016-01-01

    The presence of bacteria in urine can pose significant risks during pregnancy. However, there are few reference genome strains for many common urinary bacteria. We isolated 12 urinary strains of Streptococcus, Staphylococcus, Citrobacter, Gardnerella, and Lactobacillus. These strains and their genomes are now available to the research community. PMID:27688327

  10. Attenuated Virulence and Genomic Reductive Evolution in the Entomopathogenic Bacterial Symbiont Species, Xenorhabdus poinarii

    PubMed Central

    Ogier, Jean-Claude; Pagès, Sylvie; Bisch, Gaëlle; Chiapello, Hélène; Médigue, Claudine; Rouy, Zoé; Teyssier, Corinne; Vincent, Stéphanie; Tailliez, Patrick; Givaudan, Alain; Gaudriault, Sophie

    2014-01-01

    Bacteria of the genus Xenorhabdus are symbionts of soil entomopathogenic nematodes of the genus Steinernema. This symbiotic association constitutes an insecticidal complex active against a wide range of insect pests. Unlike other Xenorhabdus species, Xenorhabdus poinarii is avirulent when injected into insects in the absence of its nematode host. We sequenced the genome of the X. poinarii strain G6 and the closely related but virulent X. doucetiae strain FRM16. G6 had a smaller genome (500–700 kb smaller) than virulent Xenorhabdus strains and lacked genes encoding potential virulence factors (hemolysins, type 5 secretion systems, enzymes involved in the synthesis of secondary metabolites, and toxin–antitoxin systems). The genomes of all the X. poinarii strains analyzed here had a similar small size. We did not observe the accumulation of pseudogenes, insertion sequences or decrease in coding density usually seen as a sign of genomic erosion driven by genetic drift in host-adapted bacteria. Instead, genome reduction of X. poinarii seems to have been mediated by the excision of genomic blocks from the flexible genome, as reported for the genomes of attenuated free pathogenic bacteria and some facultative mutualistic bacteria growing exclusively within hosts. This evolutionary pathway probably reflects the adaptation of X. poinarii to specific host. PMID:24904010

  11. Draft Genome Sequence of Nocardia jinanensis, an Opportunistic Bacterial Pathogen That Causes Cellulitis.

    PubMed

    Chakrabortti, Alolika; Li, Jinming; Liang, Zhao-Xun

    2016-07-21

    The draft genome sequence of Nocardia jinanensis, an opportunistic pathogen that can cause skin infections, reveals genes that may contribute to the lifestyle and pathogenicity of N. jinanensis The genome also reveals the biosynthetic capacity of N. jinanensis in producing mycolic acids, siderophores, and other polyketide and nonribosomal peptide-derived secondary metabolites.

  12. Whole-Genome Sequencing of Invasion-Resistant Cells Identifies Laminin α2 as a Host Factor for Bacterial Invasion

    PubMed Central

    van Wijk, Xander M.; Döhrmann, Simon; Hallström, Björn M.; Li, Shangzhong; Voldborg, Bjørn G.; Meng, Brandon X.; McKee, Karen K.; van Kuppevelt, Toin H.; Yurchenco, Peter D.; Palsson, Bernhard O.; Lewis, Nathan E.; Nizet, Victor

    2017-01-01

    ABSTRACT To understand the role of glycosaminoglycans in bacterial cellular invasion, xylosyltransferase-deficient mutants of Chinese hamster ovary (CHO) cells were created using clustered regularly interspaced short palindromic repeat (CRISPR) and CRISPR-associated gene 9 (CRISPR-cas9) gene targeting. When these mutants were compared to the pgsA745 cell line, a CHO xylosyltransferase mutant generated previously using chemical mutagenesis, an unexpected result was obtained. Bacterial invasion of pgsA745 cells by group B Streptococcus (GBS), group A Streptococcus, and Staphylococcus aureus was markedly reduced compared to the invasion of wild-type cells, but newly generated CRISPR-cas9 mutants were only resistant to GBS. Invasion of pgsA745 cells was not restored by transfection with xylosyltransferase, suggesting that an additional mutation conferring panresistance to multiple bacteria was present in pgsA745 cells. Whole-genome sequencing and transcriptome sequencing (RNA-Seq) uncovered a deletion in the gene encoding the laminin subunit α2 (Lama2) that eliminated much of domain L4a. Silencing of the long Lama2 isoform in wild-type cells strongly reduced bacterial invasion, whereas transfection with human LAMA2 cDNA significantly enhanced invasion in pgsA745 cells. The addition of exogenous laminin-α2β1γ1/laminin-α2β2γ1 strongly increased bacterial invasion in CHO cells, as well as in human alveolar basal epithelial and human brain microvascular endothelial cells. Thus, the L4a domain in laminin α2 is important for cellular invasion by a number of bacterial pathogens. PMID:28074024

  13. Estimating phylogenetic trees from genome-scale data.

    PubMed

    Liu, Liang; Xi, Zhenxiang; Wu, Shaoyuan; Davis, Charles C; Edwards, Scott V

    2015-12-01

    The heterogeneity of signals in the genomes of diverse organisms poses challenges for traditional phylogenetic analysis. Phylogenetic methods known as "species tree" methods have been proposed to directly address one important source of gene tree heterogeneity, namely the incomplete lineage sorting that occurs when evolving lineages radiate rapidly, resulting in a diversity of gene trees from a single underlying species tree. Here we review theory and empirical examples that help clarify conflicts between species tree and concatenation methods, and misconceptions in the literature about the performance of species tree methods. Considering concatenation as a special case of the multispecies coalescent model helps explain differences in the behavior of the two methods on phylogenomic data sets. Recent work suggests that species tree methods are more robust than concatenation approaches to some of the classic challenges of phylogenetic analysis, including rapidly evolving sites in DNA sequences and long-branch attraction. We show that approaches, such as binning, designed to augment the signal in species tree analyses can distort the distribution of gene trees and are inconsistent. Computationally efficient species tree methods incorporating biological realism are a key to phylogenetic analysis of whole-genome data.

  14. Insights into the strategies used by related group II introns to adapt successfully for the colonisation of a bacterial genome.

    PubMed

    Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco

    2014-01-01

    Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes.

  15. A genomic approach to bacterial taxonomy: an examination and proposed reclassification of species within the genus Neisseria

    PubMed Central

    Jolley, Keith A.; Earle, Sarah G.; Corton, Craig; Bentley, Stephen D.; Parkhill, Julian; Maiden, Martin C. J.

    2012-01-01

    In common with other bacterial taxa, members of the genus Neisseria are classified using a range of phenotypic and biochemical approaches, which are not entirely satisfactory in assigning isolates to species groups. Recently, there has been increasing interest in using nucleotide sequences for bacterial typing and taxonomy, but to date, no broadly accepted alternative to conventional methods is available. Here, the taxonomic relationships of 55 representative members of the genus Neisseria have been analysed using whole-genome sequence data. As genetic material belonging to the accessory genome is widely shared among different taxa but not present in all isolates, this analysis indexed nucleotide sequence variation within sets of genes, specifically protein-coding genes that were present and directly comparable in all isolates. Variation in these genes identified seven species groups, which were robust to the choice of genes and phylogenetic clustering methods used. The groupings were largely, but not completely, congruent with current species designations, with some minor changes in nomenclature and the reassignment of a few isolates necessary. In particular, these data showed that isolates classified as Neisseria polysaccharea are polyphyletic and probably include more than one taxonomically distinct organism. The seven groups could be reliably and rapidly generated with sequence variation within the 53 ribosomal protein subunit (rps) genes, further demonstrating that ribosomal multilocus sequence typing (rMLST) is a practicable and powerful means of characterizing bacteria at all levels, from domain to strain. PMID:22422752

  16. Viability of phenanthrene biodegradation by an isolated bacterial consortium: optimization and scale-up.

    PubMed

    Moscoso, F; Ferreira, L; Deive, F J; Morán, P; Sanromán, M A

    2013-02-01

    In the present work, biodegradation of phenanthrene by a bacterial consortium (LB2), isolated from lab-polluted soils has been investigated. The 16S rRNA gene-based molecular analysis revealed that the bacterial consortium LB2 consisted of two strains showing a very high homology with Staphylococcus warneri and Bacillus pumilus. The optimization of phenanthrene degradation by the consortium LB2, using a central composite face-centered design was carried out taking into account three important parameters such as temperature, pH, and phenanthrene concentration. Near complete phenanthrene degradation was reached by consortium LB2 at the optimal conditions (pH of 7.5 and 37.5 °C) in less than 48 h. Moreover, the efficiency of phenanthrene biodegradation was assessed by using logistic and Luedeking and Piret-type models. Finally, the process was implemented at bench-scale bioreactor and the main degradation routes were identified based on GC-MS data.

  17. Geospatial Resolution of Human and Bacterial Diversity with City-Scale Metagenomics.

    PubMed

    Afshinnekoo, Ebrahim; Meydan, Cem; Chowdhury, Shanin; Jaroudi, Dyala; Boyer, Collin; Bernstein, Nick; Maritz, Julia M; Reeves, Darryl; Gandara, Jorge; Chhangawala, Sagar; Ahsanuddin, Sofia; Simmons, Amber; Nessel, Timothy; Sundaresh, Bharathi; Pereira, Elizabeth; Jorgensen, Ellen; Kolokotronis, Sergios-Orestis; Kirchberger, Nell; Garcia, Isaac; Gandara, David; Dhanraj, Sean; Nawrin, Tanzina; Saletore, Yogesh; Alexander, Noah; Vijay, Priyanka; Hénaff, Elizabeth M; Zumbo, Paul; Walsh, Michael; O'Mullan, Gregory D; Tighe, Scott; Dudley, Joel T; Dunaif, Anya; Ennis, Sean; O'Halloran, Eoghan; Magalhaes, Tiago R; Boone, Braden; Jones, Angela L; Muth, Theodore R; Paolantonio, Katie Schneider; Alter, Elizabeth; Schadt, Eric E; Garbarino, Jeanne; Prill, Robert J; Carlton, Jane M; Levy, Shawn; Mason, Christopher E

    2015-07-29

    The panoply of microorganisms and other species present in our environment influence human health and disease, especially in cities, but have not been profiled with metagenomics at a city-wide scale. We sequenced DNA from surfaces across the entire New York City (NYC) subway system, the Gowanus Canal, and public parks. Nearly half of the DNA (48%) does not match any known organism; identified organisms spanned 1,688 bacterial, viral, archaeal, and eukaryotic taxa, which were enriched for harmless genera associated with skin (e.g., Acinetobacter). Predicted ancestry of human DNA left on subway surfaces can recapitulate U.S. Census demographic data, and bacterial signatures can reveal a station's history, such as marine-associated bacteria in a hurricane-flooded station. Some evidence of pathogens was found (Bacillus anthracis), but a lack of reported cases in NYC suggests that the pathogens represent a normal, urban microbiome. This baseline metagenomic map of NYC could help long-term disease surveillance, bioterrorism threat mitigation, and health management in the built environment of cities.

  18. Strain Dependent Genetic Networks for Antibiotic-Sensitivity in a Bacterial Pathogen with a Large Pan-Genome

    PubMed Central

    van Opijnen, Tim; Bento, José

    2016-01-01

    The interaction between an antibiotic and bacterium is not merely restricted to the drug and its direct target, rather antibiotic induced stress seems to resonate through the bacterium, creating selective pressures that drive the emergence of adaptive mutations not only in the direct target, but in genes involved in many different fundamental processes as well. Surprisingly, it has been shown that adaptive mutations do not necessarily have the same effect in all species, indicating that the genetic background influences how phenotypes are manifested. However, to what extent the genetic background affects the manner in which a bacterium experiences antibiotic stress, and how this stress is processed is unclear. Here we employ the genome-wide tool Tn-Seq to construct daptomycin-sensitivity profiles for two strains of the bacterial pathogen Streptococcus pneumoniae. Remarkably, over half of the genes that are important for dealing with antibiotic-induced stress in one strain are dispensable in another. By confirming over 100 genotype-phenotype relationships, probing potassium-loss, employing genetic interaction mapping as well as temporal gene-expression experiments we reveal genome-wide conditionally important/essential genes, we discover roles for genes with unknown function, and uncover parts of the antibiotic’s mode-of-action. Moreover, by mapping the underlying genomic network for two query genes we encounter little conservation in network connectivity between strains as well as profound differences in regulatory relationships. Our approach uniquely enables genome-wide fitness comparisons across strains, facilitating the discovery that antibiotic responses are complex events that can vary widely between strains, which suggests that in some cases the emergence of resistance could be strain specific and at least for species with a large pan-genome less predictable. PMID:27607357

  19. Strain Dependent Genetic Networks for Antibiotic-Sensitivity in a Bacterial Pathogen with a Large Pan-Genome.

    PubMed

    van Opijnen, Tim; Dedrick, Sandra; Bento, José

    2016-09-01

    The interaction between an antibiotic and bacterium is not merely restricted to the drug and its direct target, rather antibiotic induced stress seems to resonate through the bacterium, creating selective pressures that drive the emergence of adaptive mutations not only in the direct target, but in genes involved in many different fundamental processes as well. Surprisingly, it has been shown that adaptive mutations do not necessarily have the same effect in all species, indicating that the genetic background influences how phenotypes are manifested. However, to what extent the genetic background affects the manner in which a bacterium experiences antibiotic stress, and how this stress is processed is unclear. Here we employ the genome-wide tool Tn-Seq to construct daptomycin-sensitivity profiles for two strains of the bacterial pathogen Streptococcus pneumoniae. Remarkably, over half of the genes that are important for dealing with antibiotic-induced stress in one strain are dispensable in another. By confirming over 100 genotype-phenotype relationships, probing potassium-loss, employing genetic interaction mapping as well as temporal gene-expression experiments we reveal genome-wide conditionally important/essential genes, we discover roles for genes with unknown function, and uncover parts of the antibiotic's mode-of-action. Moreover, by mapping the underlying genomic network for two query genes we encounter little conservation in network connectivity between strains as well as profound differences in regulatory relationships. Our approach uniquely enables genome-wide fitness comparisons across strains, facilitating the discovery that antibiotic responses are complex events that can vary widely between strains, which suggests that in some cases the emergence of resistance could be strain specific and at least for species with a large pan-genome less predictable.

  20. Genomic conflict in scale insects: the causes and consequences of bizarre genetic systems.

    PubMed

    Ross, Laura; Pen, Ido; Shuker, David M

    2010-11-01

    It is now clear that mechanisms of sex determination are extraordinarily labile, with considerable variation across all taxonomic levels. This variation is often expressed through differences in the genetic system (XX-XY, XX-XO, haplodiploidy, and so on). Why there is so much variation in such a seemingly fundamental process has attracted much attention, with recent ideas concentrating on the possible role of genomic conflicts of interest. Here we consider the role of inter- and intra-genomic conflicts in one large insect taxon: the scale insects. Scale insects exhibit a dizzying array of genetic systems, and their biology promotes conflicts of interest over transmission and sex ratio between male- and female-expressed genes, parental- and offspring-expressed genes (both examples of intra-genomic conflict) and between scale insects and their endosymbionts (inter-genomic conflict). We first review the wide range of genetic systems found in scale insects and the possible evolutionary transitions between them. We then outline the theoretical opportunities for genomic conflicts in this group and how these might influence sex determination and sex ratio. We then consider the evidence for these conflicts in the evolution of sex determination in scale insects. Importantly, the evolution of novel genetic systems in scale insects has itself helped create new conflicts of interest, for instance over sex ratio. As a result, a major obstacle to our understanding of the role of conflict in the evolution of sex-determination and genetic systems will be the difficulty in identifying the direction of causal relationships. We conclude by outlining possible experimental and comparative approaches to test more effectively how important genomic conflicts have been.

  1. Draft Genome Sequence of the Murine Bacterial Isolate Lactobacillus murinus EF-1

    PubMed Central

    Fritz, Eric

    2017-01-01

    ABSTRACT   Screening for lysogenic lactobacilli in rat fecal samples has identified Lactobacillus murinus EF-1. Whole-genome sequencing revealed a 2.30-Mb draft genome with 39.6% G+C content and 2,196 open reading frames. PHAST analysis identified three intact prophages of 26.1 kb, 25.4 kb, and 49.6 kb in size. PMID:28336601

  2. Genome Sequence of Acidovorax citrulli Group 1 Strain pslb65 Causing Bacterial Fruit Blotch of Melons.

    PubMed

    Wang, Tielin; Sun, Baixin; Yang, Yuwen; Zhao, Tingchang

    2015-04-23

    Acidovorax citrulli is typed into two groups, mainly based on the host. We determined the draft genome of A. citrulli group 1 strain pslb65. The strain was isolated from melon collected from Xinjiang province, China. The A. citrulli pslb65 genome contains 4,903,443 bp and has a G+C content of 68.8 mol%.

  3. Genome Sequence of Acidovorax citrulli Group 1 Strain pslb65 Causing Bacterial Fruit Blotch of Melons

    PubMed Central

    Wang, Tielin; Sun, Baixin; Yang, Yuwen

    2015-01-01

    Acidovorax citrulli is typed into two groups, mainly based on the host. We determined the draft genome of A. citrulli group 1 strain pslb65. The strain was isolated from melon collected from Xinjiang province, China. The A. citrulli pslb65 genome contains 4,903,443 bp and has a G+C content of 68.8 mol%. PMID:25908136

  4. Draft Genome Sequence of Erwinia tracheiphila, an Economically Important Bacterial Pathogen of Cucurbits.

    PubMed

    Shapiro, Lori R; Scully, Erin D; Roberts, Dana; Straub, Timothy J; Geib, Scott M; Park, Jihye; Stephenson, Andrew G; Salaau Rojas, Erika; Liu, Quin; Beattie, Gwyn; Gleason, Mark; De Moraes, Consuelo M; Mescher, Mark C; Fleischer, Shelby G; Kolter, Roberto; Pierce, Naomi; Zhaxybayeva, Olga

    2015-06-04

    Erwinia tracheiphila is one of the most economically important pathogens of cucumbers, melons, squashes, pumpkins, and gourds in the northeastern and midwestern United States, yet its molecular pathology remains uninvestigated. Here, we report the first draft genome sequence of an E. tracheiphila strain isolated from an infected wild gourd (Cucurbita pepo subsp. texana) plant. The genome assembly consists of 7 contigs and includes a putative plasmid and at least 20 phage and prophage elements.

  5. Draft Genome Sequence of Erwinia tracheiphila, an Economically Important Bacterial Pathogen of Cucurbits

    PubMed Central

    Scully, Erin D.; Roberts, Dana; Straub, Timothy J.; Geib, Scott M.; Park, Jihye; Stephenson, Andrew G.; Salaau Rojas, Erika; Liu, Quin; Beattie, Gwyn; Gleason, Mark; De Moraes, Consuelo M.; Mescher, Mark C.; Fleischer, Shelby G.; Kolter, Roberto; Pierce, Naomi; Zhaxybayeva, Olga

    2015-01-01

    Erwinia tracheiphila is one of the most economically important pathogens of cucumbers, melons, squashes, pumpkins, and gourds in the northeastern and midwestern United States, yet its molecular pathology remains uninvestigated. Here, we report the first draft genome sequence of an E. tracheiphila strain isolated from an infected wild gourd (Cucurbita pepo subsp. texana) plant. The genome assembly consists of 7 contigs and includes a putative plasmid and at least 20 phage and prophage elements. PMID:26044415

  6. A simple model for DNA bridging proteins and bacterial or human genomes: bridging-induced attraction and genome compaction

    NASA Astrophysics Data System (ADS)

    Johnson, J.; Brackley, C. A.; Cook, P. R.; Marenduzzo, D.

    2015-02-01

    We present computer simulations of the phase behaviour of an ensemble of proteins interacting with a polymer, mimicking non-specific binding to a piece of bacterial DNA or eukaryotic chromatin. The proteins can simultaneously bind to the polymer in two or more places to create protein bridges. Despite the lack of any explicit interaction between the proteins or between DNA segments, our simulations confirm previous results showing that when the protein-polymer interaction is sufficiently strong, the proteins come together to form clusters. Furthermore, a sufficiently large concentration of bridging proteins leads to the compaction of the swollen polymer into a globular phase. Here we characterise both the formation of protein clusters and the polymer collapse as a function of protein concentration, protein-polymer affinity and fibre flexibility.

  7. A simple model for DNA bridging proteins and bacterial or human genomes: bridging-induced attraction and genome compaction.

    PubMed

    Johnson, J; Brackley, C A; Cook, P R; Marenduzzo, D

    2015-02-18

    We present computer simulations of the phase behaviour of an ensemble of proteins interacting with a polymer, mimicking non-specific binding to a piece of bacterial DNA or eukaryotic chromatin. The proteins can simultaneously bind to the polymer in two or more places to create protein bridges. Despite the lack of any explicit interaction between the proteins or between DNA segments, our simulations confirm previous results showing that when the protein-polymer interaction is sufficiently strong, the proteins come together to form clusters. Furthermore, a sufficiently large concentration of bridging proteins leads to the compaction of the swollen polymer into a globular phase. Here we characterise both the formation of protein clusters and the polymer collapse as a function of protein concentration, protein-polymer affinity and fibre flexibility.

  8. Genomic epidemiology and global diversity of the emerging bacterial pathogen Elizabethkingia anophelis.

    PubMed

    Breurec, Sebastien; Criscuolo, Alexis; Diancourt, Laure; Rendueles, Olaya; Vandenbogaert, Mathias; Passet, Virginie; Caro, Valérie; Rocha, Eduardo P C; Touchon, Marie; Brisse, Sylvain

    2016-07-27

    Elizabethkingia anophelis is an emerging pathogen involved in human infections and outbreaks in distinct world regions. We investigated the phylogenetic relationships and pathogenesis-associated genomic features of two neonatal meningitis isolates isolated 5 years apart from one hospital in Central African Republic and compared them with Elizabethkingia from other regions and sources. Average nucleotide identity firmly confirmed that E. anophelis, E. meningoseptica and E. miricola represent demarcated genomic species. A core genome multilocus sequence typing scheme, broadly applicable to Elizabethkingia species, was developed and made publicly available (http://bigsdb.pasteur.fr/elizabethkingia). Phylogenetic analysis revealed distinct E. anophelis sublineages and demonstrated high genetic relatedness between the African isolates, compatible with persistence of the strain in the hospital environment. CRISPR spacer variation between the African isolates was mirrored by the presence of a large mobile genetic element. The pan-genome of E. anophelis comprised 6,880 gene families, underlining genomic heterogeneity of this species. African isolates carried unique resistance genes acquired by horizontal transfer. We demonstrated the presence of extensive variation of the capsular polysaccharide synthesis gene cluster in E. anophelis. Our results demonstrate the dynamic evolution of this emerging pathogen and the power of genomic approaches for Elizabethkingia identification, population biology and epidemiology.

  9. Genomic epidemiology and global diversity of the emerging bacterial pathogen Elizabethkingia anophelis

    PubMed Central

    Breurec, Sebastien; Criscuolo, Alexis; Diancourt, Laure; Rendueles, Olaya; Vandenbogaert, Mathias; Passet, Virginie; Caro, Valérie; Rocha, Eduardo P. C.; Touchon, Marie; Brisse, Sylvain

    2016-01-01

    Elizabethkingia anophelis is an emerging pathogen involved in human infections and outbreaks in distinct world regions. We investigated the phylogenetic relationships and pathogenesis-associated genomic features of two neonatal meningitis isolates isolated 5 years apart from one hospital in Central African Republic and compared them with Elizabethkingia from other regions and sources. Average nucleotide identity firmly confirmed that E. anophelis, E. meningoseptica and E. miricola represent demarcated genomic species. A core genome multilocus sequence typing scheme, broadly applicable to Elizabethkingia species, was developed and made publicly available (http://bigsdb.pasteur.fr/elizabethkingia). Phylogenetic analysis revealed distinct E. anophelis sublineages and demonstrated high genetic relatedness between the African isolates, compatible with persistence of the strain in the hospital environment. CRISPR spacer variation between the African isolates was mirrored by the presence of a large mobile genetic element. The pan-genome of E. anophelis comprised 6,880 gene families, underlining genomic heterogeneity of this species. African isolates carried unique resistance genes acquired by horizontal transfer. We demonstrated the presence of extensive variation of the capsular polysaccharide synthesis gene cluster in E. anophelis. Our results demonstrate the dynamic evolution of this emerging pathogen and the power of genomic approaches for Elizabethkingia identification, population biology and epidemiology. PMID:27461509

  10. Long-Term Bacterial Dynamics in a Full-Scale Drinking Water Distribution System

    PubMed Central

    Prest, E. I.; Weissbrodt, D. G.; Hammes, F.; van Loosdrecht, M. C. M.; Vrouwenvelder, J. S.

    2016-01-01

    Large seasonal variations in microbial drinking water quality can occur in distribution networks, but are often not taken into account when evaluating results from short-term water sampling campaigns. Temporal dynamics in bacterial community characteristics were investigated during a two-year drinking water monitoring campaign in a full-scale distribution system operating without detectable disinfectant residual. A total of 368 water samples were collected on a biweekly basis at the water treatment plant (WTP) effluent and at one fixed location in the drinking water distribution network (NET). The samples were analysed for heterotrophic plate counts (HPC), Aeromonas plate counts, adenosine-tri-phosphate (ATP) concentrations, and flow cytometric (FCM) total and intact cell counts (TCC, ICC), water temperature, pH, conductivity, total organic carbon (TOC) and assimilable organic carbon (AOC). Multivariate analysis of the large dataset was performed to explore correlative trends between microbial and environmental parameters. The WTP effluent displayed considerable seasonal variations in TCC (from 90 × 103 cells mL-1 in winter time up to 455 × 103 cells mL-1 in summer time) and in bacterial ATP concentrations (<1–3.6 ng L-1), which were congruent with water temperature variations. These fluctuations were not detected with HPC and Aeromonas counts. The water in the network was predominantly influenced by the characteristics of the WTP effluent. The increase in ICC between the WTP effluent and the network sampling location was small (34 × 103 cells mL-1 on average) compared to seasonal fluctuations in ICC in the WTP effluent. Interestingly, the extent of bacterial growth in the NET was inversely correlated to AOC concentrations in the WTP effluent (Pearson’s correlation factor r = -0.35), and positively correlated with water temperature (r = 0.49). Collecting a large dataset at high frequency over a two year period enabled the characterization of previously

  11. Long-Term Bacterial Dynamics in a Full-Scale Drinking Water Distribution System.

    PubMed

    Prest, E I; Weissbrodt, D G; Hammes, F; van Loosdrecht, M C M; Vrouwenvelder, J S

    2016-01-01

    Large seasonal variations in microbial drinking water quality can occur in distribution networks, but are often not taken into account when evaluating results from short-term water sampling campaigns. Temporal dynamics in bacterial community characteristics were investigated during a two-year drinking water monitoring campaign in a full-scale distribution system operating without detectable disinfectant residual. A total of 368 water samples were collected on a biweekly basis at the water treatment plant (WTP) effluent and at one fixed location in the drinking water distribution network (NET). The samples were analysed for heterotrophic plate counts (HPC), Aeromonas plate counts, adenosine-tri-phosphate (ATP) concentrations, and flow cytometric (FCM) total and intact cell counts (TCC, ICC), water temperature, pH, conductivity, total organic carbon (TOC) and assimilable organic carbon (AOC). Multivariate analysis of the large dataset was performed to explore correlative trends between microbial and environmental parameters. The WTP effluent displayed considerable seasonal variations in TCC (from 90 × 103 cells mL-1 in winter time up to 455 × 103 cells mL-1 in summer time) and in bacterial ATP concentrations (<1-3.6 ng L-1), which were congruent with water temperature variations. These fluctuations were not detected with HPC and Aeromonas counts. The water in the network was predominantly influenced by the characteristics of the WTP effluent. The increase in ICC between the WTP effluent and the network sampling location was small (34 × 103 cells mL-1 on average) compared to seasonal fluctuations in ICC in the WTP effluent. Interestingly, the extent of bacterial growth in the NET was inversely correlated to AOC concentrations in the WTP effluent (Pearson's correlation factor r = -0.35), and positively correlated with water temperature (r = 0.49). Collecting a large dataset at high frequency over a two year period enabled the characterization of previously

  12. Rapid Prototyping of Microbial Cell Factories via Genome-scale Engineering

    PubMed Central

    Si, Tong; Xiao, Han; Zhao, Huimin

    2014-01-01

    Advances in reading, writing and editing genetic materials have greatly expanded our ability to reprogram biological systems at the resolution of a single nucleotide and on the scale of a whole genome. Such capacity has greatly accelerated the cycles of design, build and test to engineer microbes for efficient synthesis of fuels, chemicals and drugs. In this review, we summarize the emerging technologies that have been applied, or are potentially useful for genome-scale engineering in microbial systems. We will focus on the development of high-throughput methodologies, which may accelerate the prototyping of microbial cell factories. PMID:25450192

  13. Rapid prototyping of microbial cell factories via genome-scale engineering.

    PubMed

    Si, Tong; Xiao, Han; Zhao, Huimin

    2015-11-15

    Advances in reading, writing and editing genetic materials have greatly expanded our ability to reprogram biological systems at the resolution of a single nucleotide and on the scale of a whole genome. Such capacity has greatly accelerated the cycles of design, build and test to engineer microbes for efficient synthesis of fuels, chemicals and drugs. In this review, we summarize the emerging technologies that have been applied, or are potentially useful for genome-scale engineering in microbial systems. We will focus on the development of high-throughput methodologies, which may accelerate the prototyping of microbial cell factories.

  14. The influence of nano-scale surface roughness on bacterial adhesion to ultrafine-grained titanium.

    PubMed

    Truong, Vi K; Lapovok, Rimma; Estrin, Yuri S; Rundell, Stuart; Wang, James Y; Fluke, Christopher J; Crawford, Russell J; Ivanova, Elena P

    2010-05-01

    We discuss the effect of extreme grain refinement in the bulk of commercial purity titanium (CP, Grade-2) on bacterial attachment to the mechano-chemically polished surfaces of the material. The ultrafine crystallinity of the bulk was achieved by severe plastic deformation by means of equal channel angular pressing (ECAP). The chemical composition, wettability, surface topography and roughness of titanium surfaces were characterized using X-ray photoelectron spectroscopy (XPS) and water contact angle (WCA) measurements, as well as atomic force microscopy (AFM) with 3D interactive visualization of the titanium surface morphology. It was found that physico-chemical surface characteristics of the as-received and the ECAP-modified CP titanium did not differ in any significant way, while the surface roughness at the nano-scale did. Optical profilometry performed on large scanning areas of approximately 225 mum x 300 mum showed that there was no significant difference between the roughness parameters R(a) and R(q) for surfaces in the two conditions, the overall level of roughness being lower for the ECAP-processed one. By contrast, topographic profile analysis at the nano-scale by AFM did reveal a difference in these parameters. This difference was sensitive to the size of the scanned surface area. A further two surface roughness parameters, skewness (R(skw)) and kurtosis (R(kur)), were also used to describe the morphology of titanium surfaces. It was found that the bacterial strains used in this study as adsorbates, viz. Staphylococcus aureus CIP 65.8 and Pseudomonas aeruginosa ATCC 9025, showed preference for surfaces of ECAP-processed titanium. S. aureus cells were found to have a greater propensity for attachment to surfaces of ECAP-modified titanium, while the attachment of P. aeruginosa, while also showing some preference for the ECAP-processed material, was less sensitive to the ECAP processing.

  15. Bacterial features in the genome of Methanococcus jannaschii in terms of gene composition and biased base composition in ORFs and their surrounding regions.

    PubMed

    Watanabe, H; Gojobori, T; Miura, K; Watanabea, H

    1997-12-31

    As a result of genome projects, the complete nucleotide sequence of the entire genome of an archaeon, Methanococcus jannaschii, was recently determined as well as other complete sequences of bacterial and eucaryal genomes. When all the 1680 predicted protein-coding [corrected] genes of M. jannaschii were classified on the basis of sequence similarity, it was found that this archaeon had a chimeric set of 1016 bacterial-type, 471 eucaryal-type and 193 species- or archaebacteria-specific genes. However, most of the genes predicted to be involved in translation and transcription pathways including RNA genes were of the eucaryal-type with only a few exceptions such as 16S ribosomal RNA and some translation factor-like genes. This appeared curious since previous studies indicated that methanogens have bacterial features in gene organization and expression. To understand the apparent inconsistency between physiological observations and the result of the classification of genes for transcription and translation, we examined the structural relatedness of the genome of M. jannaschii to those of other species. In practice, we compared base compositional patterns in ORFs and their surrounding regions. This made it possible to reveal the relationships among the translation- and transcription-related structures in genomes. In this study, we conducted a statistical test called 'G-test' to evaluate the base biases around the boundaries of ORFs. We then found that M. jannaschii possesses more bacterial features in base biases than eucaryal ones, e.g. strong G biases at the positions corresponding to the Shine-Dalgarno site. This indicates that the few exceptional bacterial genes for translation, such as 16S ribosomal RNA and translation factor-like genes, play crucial roles in the translation pathway in M. jannaschii. The possibility that the genome structure in the last common ancestor of all present species was bacterial is discussed.

  16. Amoebal Endosymbiont Neochlamydia Genome Sequence Illuminates the Bacterial Role in the Defense of the Host Amoebae against Legionella pneumophila

    PubMed Central

    Ishida, Kasumi; Sekizuka, Tsuyoshi; Hayashida, Kyoko; Matsuo, Junji; Takeuchi, Fumihiko; Kuroda, Makoto; Nakamura, Shinji; Yamazaki, Tomohiro; Yoshida, Mitsutaka; Takahashi, Kaori; Nagai, Hiroki; Sugimoto, Chihiro; Yamaguchi, Hiroyuki

    2014-01-01

    Previous work has shown that the obligate intracellular amoebal endosymbiont Neochlamydia S13, an environmental chlamydia strain, has an amoebal infection rate of 100%, but does not cause amoebal lysis and lacks transferability to other host amoebae. The underlying mechanism for these observations remains unknown. In this study, we found that the host amoeba could completely evade Legionella infection. The draft genome sequence of Neochlamydia S13 revealed several defects in essential metabolic pathways, as well as unique molecules with leucine-rich repeats (LRRs) and ankyrin domains, responsible for protein-protein interaction. Neochlamydia S13 lacked an intact tricarboxylic acid cycle and had an incomplete respiratory chain. ADP/ATP translocases, ATP-binding cassette transporters, and secretion systems (types II and III) were well conserved, but no type IV secretion system was found. The number of outer membrane proteins (OmcB, PomS, 76-kDa protein, and OmpW) was limited. Interestingly, genes predicting unique proteins with LRRs (30 genes) or ankyrin domains (one gene) were identified. Furthermore, 33 transposases were found, possibly explaining the drastic genome modification. Taken together, the genomic features of Neochlamydia S13 explain the intimate interaction with the host amoeba to compensate for bacterial metabolic defects, and illuminate the role of the endosymbiont in the defense of the host amoebae against Legionella infection. PMID:24747986

  17. Analysis of five complete genome sequences for members of the class Peribacteria in the recently recognized Peregrinibacteria bacterial phylum.

    PubMed

    Anantharaman, Karthik; Brown, Christopher T; Burstein, David; Castelle, Cindy J; Probst, Alexander J; Thomas, Brian C; Williams, Kenneth H; Banfield, Jillian F

    2016-01-01

    Five closely related populations of bacteria from the Candidate Phylum (CP) Peregrinibacteria, part of the bacterial Candidate Phyla Radiation (CPR), were sampled from filtered groundwater obtained from an aquifer adjacent to the Colorado River near the town of Rifle, CO, USA. Here, we present the first complete genome sequences for organisms from this phylum. These bacteria have small genomes and, unlike most organisms from other lineages in the CPR, have the capacity for nucleotide synthesis. They invest significantly in biosynthesis of cell wall and cell envelope components, including peptidoglycan, isoprenoids via the mevalonate pathway, and a variety of amino sugars including perosamine and rhamnose. The genomes encode an intriguing set of large extracellular proteins, some of which are very cysteine-rich and may function in attachment, possibly to other cells. Strain variation in these proteins is an important source of genotypic variety. Overall, the cell envelope features, combined with the lack of biosynthesis capacities for many required cofactors, fatty acids, and most amino acids point to a symbiotic lifestyle. Phylogenetic analyses indicate that these bacteria likely represent a new class within the Peregrinibacteria phylum, although they ultimately may be recognized as members of a separate phylum. We propose the provisional taxonomic assignment as 'Candidatus Peribacter riflensis', Genus Peribacter, Family Peribacteraceae, Order Peribacterales, Class Peribacteria in the phylum Peregrinibacteria.

  18. Survey of chimeric IStron elements in bacterial genomes: multiple molecular symbioses between group I intron ribozymes and DNA transposons

    PubMed Central

    Tourasse, Nicolas J.; Stabell, Fredrik B.; Kolstø, Anne-Brit

    2014-01-01

    IStrons are chimeric genetic elements composed of a group I intron associated with an insertion sequence (IS). The group I intron is a catalytic RNA providing the IStron with self-splicing ability, which renders IStron insertions harmless to the host genome. The IS element is a DNA transposon conferring mobility, and thus allowing the IStron to spread in genomes. IStrons are therefore a striking example of a molecular symbiosis between unrelated genetic elements endowed with different functions. In this study, we have conducted the first comprehensive survey of IStrons in sequenced genomes that provides insights into the distribution, diversity, origin and evolution of IStrons. We show that IStrons have a restricted phylogenetic distribution limited to two bacterial phyla, the Firmicutes and the Fusobacteria. Nevertheless, diverse IStrons representing two major groups targeting different insertion site motifs were identified. This taken with the finding that while the intron components of all IStrons belong to the same structural class, they are fused to different IS families, indicates that multiple intron–IS symbioses have occurred during evolution. In addition, introns and IS elements related to those that were at the origin of IStrons were also identified. PMID:25324310

  19. Comparative genomics analysis of the companion mechanisms of Bacillus thuringiensis Bc601 and Bacillus endophyticus Hbe603 in bacterial consortium

    PubMed Central

    Jia, Nan; Ding, Ming-Zhu; Gao, Feng; Yuan, Ying-Jin

    2016-01-01

    Bacillus thuringiensis and Bacillus endophyticus both act as the companion bacteria, which cooperate with Ketogulonigenium vulgare in vitamin C two-step fermentation. Two Bacillus species have different morphologies, swarming motility and 2-keto-L-gulonic acid productivities when they co-culture with K. vulgare. Here, we report the complete genome sequencing of B. thuringiensis Bc601 and eight plasmids of B. endophyticus Hbe603, and carry out the comparative genomics analysis. Consequently, B. thuringiensis Bc601, with greater ability of response to the external environment, has been found more two-component system, sporulation coat and peptidoglycan biosynthesis related proteins than B. endophyticus Hbe603, and B. endophyticus Hbe603, with greater ability of nutrients biosynthesis, has been found more alpha-galactosidase, propanoate, glutathione and inositol phosphate metabolism, and amino acid degradation related proteins than B. thuringiensis Bc601. Different ability of swarming motility, response to the external environment and nutrients biosynthesis may reflect different companion mechanisms of two Bacillus species. Comparative genomic analysis of B. endophyticus and B. thuringiensis enables us to further understand the cooperative mechanism with K. vulgare, and facilitate the optimization of bacterial consortium. PMID:27353048

  20. Exploiting Bacterial Whole-Genome Sequencing Data for Evaluation of Diagnostic Assays: Campylobacter Species Identification as a Case Study.

    PubMed

    Jansen van Rensburg, Melissa J; Swift, Craig; Cody, Alison J; Jenkins, Claire; Maiden, Martin C J

    2016-12-01

    The application of whole-genome sequencing (WGS) to problems in clinical microbiology has had a major impact on the field. Clinical laboratories are now using WGS for pathogen identification, antimicrobial susceptibility testing, and epidemiological typing. WGS data also represent a valuable resource for the development and evaluation of molecular diagnostic assays, which continue to play an important role in clinical microbiology. To demonstrate this application of WGS, this study used publicly available genomic data to evaluate a duplex real-time PCR (RT-PCR) assay that targets mapA and ceuE for the detection of Campylobacter jejuni and Campylobacter coli, leading global causes of bacterial gastroenteritis. In silico analyses of mapA and ceuE primer and probe sequences from 1,713 genetically diverse C. jejuni and C. coli genomes, supported by RT-PCR testing, indicated that the assay was robust, with 1,707 (99.7%) isolates correctly identified. The high specificity of the mapA-ceuE assay was the result of interspecies diversity and intraspecies conservation of the target genes in C. jejuni and C. coli Rare instances of a lack of specificity among C. coli isolates were due to introgression in mapA or sequence diversity in ceuE The results of this study illustrate how WGS can be exploited to evaluate molecular diagnostic assays by using publicly available data, online databases, and open-source software.

  1. Analysis of five complete genome sequences for members of the class Peribacteria in the recently recognized Peregrinibacteria bacterial phylum

    SciTech Connect

    Anantharaman, Karthik; Brown, Christopher T.; Burstein, David; Castelle, Cindy J.; Probst, Alexander J.; Thomas, Brian C.; Williams, Kenneth H.; Banfield, Jillian F.

    2016-01-28

    Five closely related populations of bacteria from the Candidate Phylum (CP) Peregrinibacteria, part of the bacterial Candidate Phyla Radiation (CPR), were sampled from filtered groundwater obtained from an aquifer adjacent to the Colorado River near the town of Rifle, CO, USA. Here, we present the first complete genome sequences for organisms from this phylum. These bacteria have small genomes and, unlike most organisms from other lineages in the CPR, have the capacity for nucleotide synthesis. They invest significantly in biosynthesis of cell wall and cell envelope components, including peptidoglycan, isoprenoids via the mevalonate pathway, and a variety of amino sugars including perosamine and rhamnose. The genomes encode an intriguing set of large extracellular proteins, some of which are very cysteine-rich and may function in attachment, possibly to other cells. Strain variation in these proteins is an important source of genotypic variety. Overall, the cell envelope features, combined with the lack of biosynthesis capacities for many required cofactors, fatty acids, and most amino acids point to a symbiotic lifestyle. Furthermore, phylogenetic analyses indicate that these bacteria likely represent a new class within the Peregrinibacteria phylum, although they ultimately may be recognized as members of a separate phylum. In conclusion, we propose the provisional taxonomic assignment as ‘Candidatus Peribacter riflensis’, Genus Peribacter, Family Peribacteraceae, Order Peribacterales, Class Peribacteria in the phylum Peregrinibacteria.

  2. Analysis of five complete genome sequences for members of the class Peribacteria in the recently recognized Peregrinibacteria bacterial phylum

    DOE PAGES

    Anantharaman, Karthik; Brown, Christopher T.; Burstein, David; ...

    2016-01-28

    Five closely related populations of bacteria from the Candidate Phylum (CP) Peregrinibacteria, part of the bacterial Candidate Phyla Radiation (CPR), were sampled from filtered groundwater obtained from an aquifer adjacent to the Colorado River near the town of Rifle, CO, USA. Here, we present the first complete genome sequences for organisms from this phylum. These bacteria have small genomes and, unlike most organisms from other lineages in the CPR, have the capacity for nucleotide synthesis. They invest significantly in biosynthesis of cell wall and cell envelope components, including peptidoglycan, isoprenoids via the mevalonate pathway, and a variety of amino sugarsmore » including perosamine and rhamnose. The genomes encode an intriguing set of large extracellular proteins, some of which are very cysteine-rich and may function in attachment, possibly to other cells. Strain variation in these proteins is an important source of genotypic variety. Overall, the cell envelope features, combined with the lack of biosynthesis capacities for many required cofactors, fatty acids, and most amino acids point to a symbiotic lifestyle. Furthermore, phylogenetic analyses indicate that these bacteria likely represent a new class within the Peregrinibacteria phylum, although they ultimately may be recognized as members of a separate phylum. In conclusion, we propose the provisional taxonomic assignment as ‘Candidatus Peribacter riflensis’, Genus Peribacter, Family Peribacteraceae, Order Peribacterales, Class Peribacteria in the phylum Peregrinibacteria.« less

  3. Exploiting Bacterial Whole-Genome Sequencing Data for Evaluation of Diagnostic Assays: Campylobacter Species Identification as a Case Study

    PubMed Central

    Jansen van Rensburg, Melissa J.; Swift, Craig; Cody, Alison J.; Jenkins, Claire

    2016-01-01

    The application of whole-genome sequencing (WGS) to problems in clinical microbiology has had a major impact on the field. Clinical laboratories are now using WGS for pathogen identification, antimicrobial susceptibility testing, and epidemiological typing. WGS data also represent a valuable resource for the development and evaluation of molecular diagnostic assays, which continue to play an important role in clinical microbiology. To demonstrate this application of WGS, this study used publicly available genomic data to evaluate a duplex real-time PCR (RT-PCR) assay that targets mapA and ceuE for the detection of Campylobacter jejuni and Campylobacter coli, leading global causes of bacterial gastroenteritis. In silico analyses of mapA and ceuE primer and probe sequences from 1,713 genetically diverse C. jejuni and C. coli genomes, supported by RT-PCR testing, indicated that the assay was robust, with 1,707 (99.7%) isolates correctly identified. The high specificity of the mapA-ceuE assay was the result of interspecies diversity and intraspecies conservation of the target genes in C. jejuni and C. coli. Rare instances of a lack of specificity among C. coli isolates were due to introgression in mapA or sequence diversity in ceuE. The results of this study illustrate how WGS can be exploited to evaluate molecular diagnostic assays by using publicly available data, online databases, and open-source software. PMID:27733632

  4. Detection of mixed infection from bacterial whole genome sequence data allows assessment of its role in Clostridium difficile transmission.

    PubMed

    Eyre, David W; Cule, Madeleine L; Griffiths, David; Crook, Derrick W; Peto, Tim E A; Walker, A Sarah; Wilson, Daniel J

    2013-01-01

    Bacterial whole genome sequencing offers the prospect of rapid and high precision investigation of infectious disease outbreaks. Close genetic relationships between microorganisms isolated from different infected cases suggest transmission is a strong possibility, whereas transmission between cases with genetically distinct bacterial isolates can be excluded. However, undetected mixed infections-infection with ≥2 unrelated strains of the same species where only one is sequenced-potentially impairs exclusion of transmission with certainty, and may therefore limit the utility of this technique. We investigated the problem by developing a computationally efficient method for detecting mixed infection without the need for resource-intensive independent sequencing of multiple bacterial colonies. Given the relatively low density of single nucleotide polymorphisms within bacterial sequence data, direct reconstruction of mixed infection haplotypes from current short-read sequence data is not consistently possible. We therefore use a two-step maximum likelihood-based approach, assuming each sample contains up to two infecting strains. We jointly estimate the proportion of the infection arising from the dominant and minor strains, and the sequence divergence between these strains. In cases where mixed infection is confirmed, the dominant and minor haplotypes are then matched to a database of previously sequenced local isolates. We demonstrate the performance of our algorithm with in silico and in vitro mixed infection experiments, and apply it to transmission of an important healthcare-associated pathogen, Clostridium difficile. Using hospital ward movement data in a previously described stochastic transmission model, 15 pairs of cases enriched for likely transmission events associated with mixed infection were selected. Our method identified four previously undetected mixed infections, and a previously undetected transmission event, but no direct transmission between the

  5. Ensembl Genomes 2016: more genomes, more complexity

    PubMed Central

    Kersey, Paul Julian; Allen, James E.; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J.; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J.; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K.; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D.; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello–Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M.; Howe, Kevin L.; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M.

    2016-01-01

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces. PMID:26578574

  6. Ensembl Genomes 2016: more genomes, more complexity.

    PubMed

    Kersey, Paul Julian; Allen, James E; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello-Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M; Howe, Kevin L; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M

    2016-01-04

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces.

  7. Bacterial community structure and variation in a full-scale seawater desalination plant for drinking water production.

    PubMed

    Belila, A; El-Chakhtoura, J; Otaibi, N; Muyzer, G; Gonzalez-Gil, G; Saikaly, P E; van Loosdrecht, M C M; Vrouwenvelder, J S

    2016-05-01

    Microbial processes inevitably play a role in membrane-based desalination plants, mainly recognized as membrane biofouling. We assessed the bacterial community structure and diversity during different treatment steps in a full-scale seawater desalination plant producing 40,000 m(3)/d of drinking water. Water samples were taken over the full treatment train consisting of chlorination, spruce media and cartridge filters, de-chlorination, first and second pass reverse osmosis (RO) membranes and final chlorine dosage for drinking water distribution. The water samples were analyzed for water quality parameters (total bacterial cell number, total organic carbon, conductivity, pH, etc.) and microbial community composition by 16S rRNA gene pyrosequencing. The planktonic microbial community was dominated by Proteobacteria (48.6%) followed by Bacteroidetes (15%), Firmicutes (9.3%) and Cyanobacteria (4.9%). During the pretreatment step, the spruce media filter did not impact the bacterial community composition dominated by Proteobacteria. In contrast, the RO and final chlorination treatment steps reduced the Proteobacterial relative abundance in the produced water where Firmicutes constituted the most dominant bacterial group. Shannon and Chao1 diversity indices showed that bacterial species richness and diversity decreased during the seawater desalination process. The two-stage RO filtration strongly reduced the water conductivity (>99%), TOC concentration (98.5%) and total bacterial cell number (>99%), albeit some bacterial DNA was found in the water after RO filtration. About 0.25% of the total bacterial operational taxonomic units (OTUs) were present in all stages of the desalination plant: the seawater, the RO permeates and the chlorinated drinking water, suggesting that these bacterial strains can survive in different environments such as high/low salt concentration and with/without residual disinfectant. These bacterial strains were not caused by contamination during

  8. Feasibility of Large-Scale Genomic Testing to Facilitate Enrollment Onto Genomically Matched Clinical Trials

    PubMed Central

    Meric-Bernstam, Funda; Brusco, Lauren; Shaw, Kenna; Horombe, Chacha; Kopetz, Scott; Davies, Michael A.; Routbort, Mark; Piha-Paul, Sarina A.; Janku, Filip; Ueno, Naoto; Hong, David; De Groot, John; Ravi, Vinod; Li, Yisheng; Luthra, Raja; Patel, Keyur; Broaddus, Russell; Mendelsohn, John; Mills, Gordon B.

    2015-01-01

    Purpose We report the experience with 2,000 consecutive patients with advanced cancer who underwent testing on a genomic testing protocol, including the frequency of actionable alterations across tumor types, subsequent enrollment onto clinical trials, and the challenges for trial enrollment. Patients and Methods Standardized hotspot mutation analysis was performed in 2,000 patients, using either an 11-gene (251 patients) or a 46- or 50-gene (1,749 patients) multiplex platform. Thirty-five genes were considered potentially actionable based on their potential to be targeted with approved or investigational therapies. Results Seven hundred eighty-nine patients (39%) had at least one mutation in potentially actionable genes. Eighty-three patients (11%) with potentially actionable mutations went on genotype-matched trials targeting these alterations. Of 230 patients with PIK3CA/AKT1/PTEN/BRAF mutations that returned for therapy, 116 (50%) received a genotype-matched drug. Forty patients (17%) were treated on a genotype-selected trial requiring a mutation for eligibility, 16 (7%) were treated on a genotype-relevant trial targeting a genomic alteration without biomarker selection, and 40 (17%) received a genotype-relevant drug off trial. Challenges to trial accrual included patient preference of noninvestigational treatment or local treatment, poor performance status or other reasons for trial ineligibility, lack of trials/slots, and insurance denial. Conclusion Broad implementation of multiplex hotspot testing is feasible; however, only a small portion of patients with actionable alterations were actually enrolled onto genotype-matched trials. Increased awareness of therapeutic implications and access to novel therapeutics are needed to optimally leverage results from broad-based genomic testing. PMID:26014291

  9. From DNA to FBA: How to Build Your Own Genome-Scale Metabolic Model

    PubMed Central

    Cuevas, Daniel A.; Edirisinghe, Janaka; Henry, Chris S.; Overbeek, Ross; O’Connell, Taylor G.; Edwards, Robert A.

    2016-01-01

    Microbiological studies are increasingly relying on in silico methods to perform exploration and rapid analysis of genomic data, and functional genomics studies are supplemented by the new perspectives that genome-scale metabolic models offer. A mathematical model consisting of a microbe’s entire metabolic map can be rapidly determined from whole-genome sequencing and annotating the genomic material encoded in its DNA. Flux-balance analysis (FBA), a linear programming technique that uses metabolic models to predict the phenotypic responses imposed by environmental elements and factors, is the leading method to simulate and manipulate cellular growth in silico. However, the process of creating an accurate model to use in FBA consists of a series of steps involving a multitude of connections between bioinformatics databases, enzyme resources, and metabolic pathways. We present the methodology and procedure to obtain a metabolic model using PyFBA, an extensible Python-based open-source software package aimed to provide a platform where functional annotations are used to build metabolic models (http://linsalrob.github.io/PyFBA). Backed by the Model SEED biochemistry database, PyFBA contains methods to reconstruct a microbe’s metabolic map, run FBA upon different media conditions, and gap-fill its metabolism. The extensibility of PyFBA facilitates novel techniques in creating accurate genome-scale metabolic models. PMID:27379044

  10. Population genomic footprints of fine-scale differentiation between habitats in Mediterranean blue tits.

    PubMed

    Szulkin, M; Gagnaire, P-A; Bierne, N; Charmantier, A

    2016-01-01

    Linking population genetic variation to the spatial heterogeneity of the environment is of fundamental interest to evolutionary biology and ecology, in particular when phenotypic differences between populations are observed at biologically small spatial scales. Here, we applied restriction-site associated DNA sequencing (RAD-Seq) to test whether phenotypically differentiated populations of wild blue tits (Cyanistes caeruleus) breeding in a highly heterogeneous environment exhibit genetic structure related to habitat type. Using 12 106 SNPs in 197 individuals from deciduous and evergreen oak woodlands, we applied complementary population genomic analyses, which revealed that genetic variation is influenced by both geographical distance and habitat type. A fine-scale genetic differentiation supported by genome- and transcriptome-wide analyses was found within Corsica, between two adjacent habitats where blue tits exhibit marked differences in breeding time while nesting < 6 km apart. Using redundancy analysis (RDA), we show that genomic variation remains associated with habitat type when controlling for spatial and temporal effects. Finally, our results suggest that the observed patterns of genomic differentiation were not driven by a small proportion of highly differentiated loci, but rather emerged through a process such as habitat choice, which reduces gene flow between habitats across the entire genome. The pattern of genomic isolation-by-environment closely matches differentiation observed at the phenotypic level, thereby offering significant potential for future inference of phenotype-genotype associations in a heterogeneous environment.

  11. A genome-wide screen of bacterial mutants that enhance dauer formation in C. elegans

    PubMed Central

    Khanna, Amit; Kumar, Jitendra; Vargas, Misha A.; Barrett, LaKisha; Katewa, Subhash; Li, Patrick; McCloskey, Tom; Sharma, Amit; Naudé, Nicole; Nelson, Christopher; Brem, Rachel; Killilea, David W.; Mooney, Sean D.; Gill, Matthew; Kapahi, Pankaj

    2016-01-01

    Molecular pathways involved in dauer formation, an alternate larval stage that allows Caenorhabditis elegans to survive adverse environmental conditions during development, also modulate longevity and metabolism. The decision to proceed with reproductive development or undergo diapause depends on food abundance, population density, and temperature. In recent years, the chemical identities of pheromone signals that modulate dauer entry have been characterized. However, signals derived from bacteria, the major source of nutrients for C. elegans, remain poorly characterized. To systematically identify bacterial components that influence dauer formation and aging in C. elegans, we utilized the individual gene deletion mutants in E. coli (K12). We identified 56 diverse E. coli deletion mutants that enhance dauer formation in an insulin-like receptor mutant (daf-2) background. We describe the mechanism of action of a bacterial mutant cyaA, that is defective in the production of cyclic AMP, which extends lifespan and enhances dauer formation through the modulation of TGF-β (daf-7) signaling in C. elegans. Our results demonstrate the importance of bacterial components in influencing developmental decisions and lifespan in C. elegans. Furthermore, we demonstrate that C. elegans is a useful model to study bacterial-host interactions. PMID:27958277

  12. Impact of donor–recipient phylogenetic distance on bacterial genome transplantation

    PubMed Central

    Labroussaa, Fabien; Lebaudy, Anne; Baby, Vincent; Gourgues, Géraldine; Matteau, Dominick; Vashee, Sanjay; Sirand-Pugnet, Pascal; Rodrigue, Sébastien; Lartigue, Carole

    2016-01-01

    Genome transplantation (GT) allows the installation of purified chromosomes into recipient cells, causing the resulting organisms to adopt the genotype and the phenotype conferred by the donor cells. This key process remains a bottleneck in synthetic biology, especially for genome engineering strategies of intractable and economically important microbial species. So far, this process has only been reported using two closely related bacteria, Mycoplasma mycoides subsp. capri (Mmc) and Mycoplasma capricolum subsp. capricolum (Mcap), and the main factors driving the compatibility between a donor genome and a recipient cell are poorly understood. Here, we investigated the impact of the evolutionary distance between donor and recipient species on the efficiency of GT. Using Mcap as the recipient cell, we successfully transplanted the genome of six bacteria belonging to the Spiroplasma phylogenetic group but including species of two distinct genera. Our results demonstrate that GT efficiency is inversely correlated with the phylogenetic distance between donor and recipient bacteria but also suggest that other species-specific barriers to GT exist. This work constitutes an important step toward understanding the cellular factors governing the GT process in order to better define and eventually extend the existing genome compatibility limit. PMID:27488189

  13. Comparative Genomics between Two Xenorhabdus bovienii Strains Highlights Differential Evolutionary Scenarios within an Entomopathogenic Bacterial Species

    PubMed Central

    Bisch, Gaëlle; Ogier, Jean-Claude; Médigue, Claudine; Rouy, Zoé; Vincent, Stéphanie; Tailliez, Patrick; Givaudan, Alain; Gaudriault, Sophie

    2016-01-01

    Bacteria of the genus Xenorhabdus are symbionts of soil entomopathogenic nematodes of the genus Steinernema. This symbiotic association constitutes an insecticidal complex active against a wide range of insect pests. Within Xenorhabdus bovienii species, the X. bovienii CS03 strain (Xb CS03) is nonvirulent when directly injected into lepidopteran insects, and displays a low virulence when associated with its Steinernema symbiont. The genome of Xb CS03 was sequenced and compared with the genome of a virulent strain, X. bovienii SS-2004 (Xb SS-2004). The genome size and content widely differed between the two strains. Indeed, Xb CS03 had a large genome containing several specific loci involved in the inhibition of competitors, including a few NRPS-PKS loci (nonribosomal peptide synthetases and polyketide synthases) producing antimicrobial molecules. Consistently, Xb CS03 had a greater antimicrobial activity than Xb SS-2004. The Xb CS03 strain contained more pseudogenes than Xb SS-2004. Decay of genes involved in the host invasion and exploitation (toxins, invasins, or extracellular enzymes) was particularly important in Xb CS03. This may provide an explanation for the nonvirulence of the strain when injected into an insect host. We suggest that Xb CS03 and Xb SS-2004 followed divergent evolutionary scenarios to cope with their peculiar life cycle. The fitness strategy of Xb CS03 would involve competitor inhibition, whereas Xb SS-2004 would quickly and efficiently kill the insect host. Hence, Xenorhabdus strains would have widely divergent host exploitation strategies, which impact their genome structure. PMID:26769959

  14. Adaptation in Toxic Environments: Arsenic Genomic Islands in the Bacterial Genus Thiomonas

    PubMed Central

    Freel, Kelle C.; Krueger, Martin C.; Farasin, Julien; Brochier-Armanet, Céline; Barbe, Valérie; Andrès, Jeremy; Cholley, Pierre-Etienne; Dillies, Marie-Agnès; Jagla, Bernd; Koechler, Sandrine; Leva, Yann; Magdelenat, Ghislaine; Plewniak, Frédéric; Proux, Caroline; Coppée, Jean-Yves; Bertin, Philippe N.; Heipieper, Hermann J.; Arsène-Ploetze, Florence

    2015-01-01

    Acid mine drainage (AMD) is a highly toxic environment for most living organisms due to the presence of many lethal elements including arsenic (As). Thiomonas (Tm.) bacteria are found ubiquitously in AMD and can withstand these extreme conditions, in part because they are able to oxidize arsenite. In order to further improve our knowledge concerning the adaptive capacities of these bacteria, we sequenced and assembled the genome of six isolates derived from the Carnoulès AMD, and compared them to the genomes of Tm. arsenitoxydans 3As (isolated from the same site) and Tm. intermedia K12 (isolated from a sewage pipe). A detailed analysis of the Tm. sp. CB2 genome revealed various rearrangements had occurred in comparison to what was observed in 3As and K12 and over 20 genomic islands (GEIs) were found in each of these three genomes. We performed a detailed comparison of the two arsenic-related islands found in CB2, carrying the genes required for arsenite oxidation and As resistance, with those found in K12, 3As, and five other Thiomonas strains also isolated from Carnoulès (CB1, CB3, CB6, ACO3 and ACO7). Our results suggest that these arsenic-related islands have evolved differentially in these closely related Thiomonas strains, leading to divergent capacities to survive in As rich environments. PMID:26422469

  15. BiGG Models: A platform for integrating, standardizing and sharing genome-scale models

    PubMed Central

    King, Zachary A.; Lu, Justin; Dräger, Andreas; Miller, Philip; Federowicz, Stephen; Lerman, Joshua A.; Ebrahim, Ali; Palsson, Bernhard O.; Lewis, Nathan E.

    2016-01-01

    Genome-scale metabolic models are mathematically-structured knowledge bases that can be used to predict metabolic pathway usage and growth phenotypes. Furthermore, they can generate and test hypotheses when integrated with experimental data. To maximize the value of these models, centralized repositories of high-quality models must be established, models must adhere to established standards and model components must be linked to relevant databases. Tools for model visualization further enhance their utility. To meet these needs, we present BiGG Models (http://bigg.ucsd.edu), a completely redesigned Biochemical, Genetic and Genomic knowledge base. BiGG Models contains more than 75 high-quality, manually-curated genome-scale metabolic models. On the website, users can browse, search and visualize models. BiGG Models connects genome-scale models to genome annotations and external databases. Reaction and metabolite identifiers have been standardized across models to conform to community standards and enable rapid comparison across models. Furthermore, BiGG Models provides a comprehensive application programming interface for accessing BiGG Models with modeling and analysis tools. As a resource for highly curated, standardized and accessible models of metabolism, BiGG Models will facilitate diverse systems biology studies and support knowledge-based analysis of diverse experimental data. PMID:26476456

  16. BiGG Models: A platform for integrating, standardizing and sharing genome-scale models

    SciTech Connect

    King, Zachary A.; Lu, Justin; Drager, Andreas; Miller, Philip; Federowicz, Stephen; Lerman, Joshua A.; Ebrahim, Ali; Palsson, Bernhard O.; Lewis, Nathan E.

    2015-10-17

    In this study, genome-scale metabolic models are mathematically structured knowledge bases that can be used to predict metabolic pathway usage and growth phenotypes. Furthermore, they can generate and test hypotheses when integrated with experimental data. To maximize the value of these models, centralized repositories of high-quality models must be established, models must adhere to established standards and model components must be linked to relevant databases. Tools for model visualization further enhance their utility. To meet these needs, we present BiGG Models (http://bigg.ucsd.edu), a completely redesigned Biochemical, Genetic and Genomic knowledge base. BiGG Models contains more than 75 high-quality, manually-curated genome-scale metabolic models. On the website, users can browse, search and visualize models. BiGG Models connects genome-scale models to genome annotations and external databases. Reaction and metabolite identifiers have been standardized across models to conform to community standards and enable rapid comparison across models. Furthermore, BiGG Models provides a comprehensive application programming interface for accessing BiGG Models with modeling and analysis tools. As a resource for highly curated, standardized and accessible models of metabolism, BiGG Models will facilitate diverse systems biology studies and support knowledge-based analysis of diverse experimental data.

  17. Improved bacteriophage genome data is necessary for integrating viral and bacterial ecology.

    PubMed

    Bibby, Kyle

    2014-02-01

    The recent rise in "omics"-enabled approaches has lead to improved understanding in many areas of microbial ecology. However, despite the importance that viruses play in a broad microbial ecology context, viral ecology remains largely not integrated into high-throughput microbial ecology studies. A fundamental hindrance to the integration of viral ecology into omics-enabled microbial ecology studies is the lack of suitable reference bacteriophage genomes in reference databases-currently, only 0.001% of bacteriophage diversity is represented in genome sequence databases. This commentary serves to highlight this issue and to promote bacteriophage genome sequencing as a valuable scientific undertaking to both better understand bacteriophage diversity and move towards a more holistic view of microbial ecology.

  18. Evolution, genomics and epidemiology of Pseudomonas syringae: Challenges in Bacterial Molecular Plant Pathology.

    PubMed

    Baltrus, David A; McCann, Honour C; Guttman, David S

    2017-01-01

    A remarkable shift in our understanding of plant-pathogenic bacteria is underway. Until recently, nearly all research on phytopathogenic bacteria was focused on a small number of model strains, which provided a deep, but narrow, perspective on plant-microbe interactions. Advances in genome sequencing technologies have changed this by enabling the incorporation of much greater diversity into comparative and functional research. We are now moving beyond a typological understanding of a select collection of strains to a more generalized appreciation of the breadth and scope of plant-microbe interactions. The study of natural populations and evolution has particularly benefited from the expansion of genomic data. We are beginning to have a much deeper understanding of the natural genetic diversity, niche breadth, ecological constraints and defining characteristics of phytopathogenic species. Given this expanding genomic and ecological knowledge, we believe the time is ripe to evaluate what we know about the evolutionary dynamics of plant pathogens.

  19. Genomic signatures of obligate host dependence in the luminous bacterial symbiont of a vertebrate.

    PubMed

    Hendry, Tory A; de Wet, Jeffrey R; Dunlap, Paul V

    2014-08-01

    The majority of bacteria engaged in bioluminescent symbiosis are environmentally acquired and facultatively symbiotic. A few enigmatic bioluminescent symbionts have not been successfully cultured, which has led to speculation that they may be obligately dependent on their hosts. Here, we report the draft genome of the uncultured luminous symbiont of an anomalopid flashlight fish, 'Candidatus Photodesmus katoptron'. The genome of the anomalopid symbiont is reduced by 80% compared with close relatives and lacks almost all genes necessary for amino acid synthesis and for metabolism of energy sources other than glucose, supporting obligate dependence on the host for growth. 'Candidatus Photodesmus katoptron' is the first described obligate mutualistic symbiont of a vertebrate. Unlike most other obligate mutualists, the anomalopid symbiont genome has retained complete pathways for chemotaxis and motility as well as most genes involved in cell wall production, consistent with the hypothesis that these bacteria may be transmitted environmentally during an extra-host phase.

  20. Differential Genome Evolution Between Companion Symbionts in an Insect-Bacterial Symbiosis

    PubMed Central

    McCutcheon, John P.; MacDonald, Bradon R.; Romanovicz, Dwight; Moran, Nancy A.

    2014-01-01

    ABSTRACT Obligate symbioses with bacteria allow insects to feed on otherwise unsuitable diets. Some symbionts have extremely reduced genomes and have lost many genes considered to be essential in other bacteria. To understand how symbiont genome degeneration proceeds, we compared the genomes of symbionts in two leafhopper species, Homalodisca vitripennis (glassy-winged sharpshooter [GWSS]) and Graphocephala atropunctata (blue-green sharpshooter [BGSS]) (Hemiptera: Cicadellidae). Each host species is associated with the anciently acquired “Candidatus Sulcia muelleri” (Bacteroidetes) and the more recently acquired “Candidatus Baumannia cicadellinicola” (Gammaproteobacteria). BGSS “Ca. Baumannia” retains 89 genes that are absent from GWSS “Ca. Baumannia”; these underlie central cellular functions, including cell envelope biogenesis, cellular replication, and stress response. In contrast, “Ca. Sulcia” strains differ by only a few genes. Although GWSS “Ca. Baumannia” cells are spherical or pleomorphic (a convergent trait of obligate symbionts), electron microscopy reveals that BGSS “Ca. Baumannia” maintains a rod shape, possibly due to its retention of genes involved in cell envelope biogenesis and integrity. Phylogenomic results suggest that “Ca. Baumannia” is derived from the clade consisting of Sodalis and relatives, a group that has evolved symbiotic associations with numerous insect hosts. Finally, the rates of synonymous and nonsynonymous substitutions are higher in “Ca. Baumannia” than in “Ca. Sulcia,” which may be due to a lower mutation rate in the latter. Taken together, our results suggest that the two “Ca. Baumannia” genomes represent different stages of genome reduction in which many essential functions are being lost and likely compensated by hosts. “Ca. Sulcia” exhibits much greater genome stability and slower sequence evolution, although the mechanisms underlying these differences are poorly understood

  1. A FISH approach for mapping the human genome using Bacterial Artificial Chromosomes (BACs)

    SciTech Connect

    Hubert, R.S.; Chen, X.N.; Mitchell, S.

    1994-09-01

    As the Human Genome Project progresses, large insert cloning vectors such as BACs, P1, and P1 Artificial Chromosomes (PACs) will be required to complement the YAC mapping efforts. The value of the BAC vector for physical mapping lies in the stability of the inserts, the lack of chimerism, the length of inserts (up to 300 kb), the ability to obtain large amounts of pure clone DNA and the ease of BAC manipulation. These features helped us design two approaches for generating physical mapping reagents for human genetic studies. The first approach is a whole genome strategy in which randomly selected BACs are mapped, using FISH, to specific chromosomal bands. To date, 700 BACs have been mapped to single chromosome bands at a resolution of 2-5 Mb in addition to BACs mapped to 14 different centromeres. These BACs represent more than 90 Mb of the genome and include >70% of all human chromosome bands at the 350-band level. These data revealed that >97% of the BACs were non-chimeric and have a genomic distribution covering most gaps in the existing YAC map with excellent coverage of gene-rich regions. In the second approach, we used YACs to identify BACs on chromosome 21. A 1.5 Mb contig between D21S339 and D21S220 nears completion within the Down syndrome congenital heart disease (DS-CHD) region. Seventeen BACs ranging in size from 80 kb to 240 kb were ordered using 14 STSs with FISH confirmation. We have also used 40 YACs spanning 21q to identify, on average, >1 BAC/Mb to provide molecular cytogenetic reagents and anchor points for further mapping. The contig generated on chromosome 21 will be helpful in isolating the genes for DS-CHD. The physical mapping reagents generated using the whole genome approach will provide cytogenetic markers and mapped genomic fragments that will facilitate positional cloning efforts and the identification of genes within most chromosomal bands.

  2. The influences of the recycle process on the bacterial community in a pilot scale microalgae raceway pond.

    PubMed

    Erkelens, Mason; Ball, Andrew S; Lewis, David M

    2014-04-01

    The use of recycled media has been shown to be a necessary step within the lifecycle of microalgal biofuels for economic sustainability and reducing the water footprint. However the impact of the harvesting of microalgae on the bacterial load of the recycled water has yet to be investigated. Within this study PCR-DGGE and real-time PCR was used to evaluate the bacterial community dynamics within the recycled water following harvest and concentration steps for a pilot scale open pond system (120,000L), which was developed for the production of green crude oil from Tetraselmis sp. in hyper saline water. Two stages were used in the harvesting; Stage 1 electroflocculation, and Stage 2 centrifugation. Electroflocculation was shown to have little effect on the bacterial cell concentration. In contrast bacterial diversity and cell concentration within the centrifugation step was greatly reduced.

  3. Generation and Evaluation of a Genome-Scale Metabolic Network Model of Synechococcus elongatus PCC7942.

    PubMed

    Triana, Julián; Montagud, Arnau; Siurana, Maria; Fuente, David; Urchueguía, Arantxa; Gamermann, Daniel; Torres, Javier; Tena, Jose; de Córdoba, Pedro Fernández; Urchueguía, Javier F

    2014-08-20

    The reconstruction of genome-scale metabolic models and their applications represent a great advantage of systems biology. Through their use as metabolic flux simulation models, production of industrially-interesting metabolites can be predicted. Due to the growing number of studies of metabolic models driven by the increasing genomic sequencing projects, it is important to conceptualize steps of reconstruction and analysis. We have focused our work in the cyanobacterium Synechococcus elongatus PCC7942, for which several analyses and insights are unveiled. A comprehensive approach has been used, which can be of interest to lead the process of manual curation and genome-scale metabolic analysis. The final model, iSyf715 includes 851 reactions and 838 metabolites. A biomass equation, which encompasses elementary building blocks to allow cell growth, is also included. The applicability of the model is finally demonstrated by simulating autotrophic growth conditions of Synechococcus elongatus PCC7942.

  4. Generation and Evaluation of a Genome-Scale Metabolic Network Model of Synechococcus elongatus PCC7942

    PubMed Central

    Triana, Julián; Montagud†, Arnau; Siurana, Maria; Fuente, David; Urchueguía, Arantxa; Gamermann, Daniel; Torres, Javier; Tena, Jose; de Córdoba, Pedro Fernández; Urchueguía, Javier F.

    2014-01-01

    The reconstruction of genome-scale metabolic models and their applications represent a great advantage of systems biology. Through their use as metabolic flux simulation models, production of industrially-interesting metabolites can be predicted. Due to the growing number of studies of metabolic models driven by the increasing genomic sequencing projects, it is important to conceptualize steps of reconstruction and analysis. We have focused our work in the cyanobacterium Synechococcus elongatus PCC7942, for which several analyses and insights are unveiled. A comprehensive approach has been used, which can be of interest to lead the process of manual curation and genome-scale metabolic analysis. The final model, iSyf715 includes 851 reactions and 838 metabolites. A biomass equation, which encompasses elementary building blocks to allow cell growth, is also included. The applicability of the model is finally demonstrated by simulating autotrophic growth conditions of Synechococcus elongatus PCC7942. PMID:25141288

  5. Implications of Genome-Based Discrimination between Clostridium botulinum Group I and Clostridium sporogenes Strains for Bacterial Taxonomy

    PubMed Central

    Weigand, Michael R.; Pena-Gonzalez, Angela; Shirey, Timothy B.; Broeker, Robin G.; Ishaq, Maliha K.; Konstantinidis, Konstantinos T.

    2015-01-01

    Taxonomic classification of Clostridium botulinum is based on the production of botulinum neurotoxin (BoNT), while closely related, nontoxic organisms are classified as Clostridium sporogenes. However, this taxonomic organization does not accurately mirror phylogenetic relationships between these species. A phylogenetic reconstruction using 2,016 orthologous genes shared among strains of C. botulinum group I and C. sporogenes clearly separated these two species into discrete clades which showed ∼93% average nucleotide identity (ANI) between them. Clustering of strains based on the presence of variable orthologs revealed 143 C. sporogenes clade-specific genetic signatures, a subset of which were further evaluated for their ability to correctly classify a panel of presumptive C. sporogenes strains by PCR. Genome sequencing of several C. sporogenes strains lacking these signatures confirmed that they clustered with C. botulinum strains in a core genome phylogenetic tree. Our analysis also identified C. botulinum strains that contained C. sporogenes clade-specific signatures and phylogenetically clustered with C. sporogenes strains. The genome sequences of two bont/B2-containing strains belonging to the C. sporogenes clade contained regions with similarity to a bont-bearing plasmid (pCLD), while two different strains belonging to the C. botulinum clade carried bont/B2 on the chromosome. These results indicate that bont/B2 was likely acquired by C. sporogenes strains through horizontal gene transfer. The genome-based classification of these species used to identify candidate genes for the development of rapid assays for molecular identification may be applicable to additional bacterial species that are challenging with respect to their classification. PMID:26048939

  6. Implications of Genome-Based Discrimination between Clostridium botulinum Group I and Clostridium sporogenes Strains for Bacterial Taxonomy.

    PubMed

    Weigand, Michael R; Pena-Gonzalez, Angela; Shirey, Timothy B; Broeker, Robin G; Ishaq, Maliha K; Konstantinidis, Konstantinos T; Raphael, Brian H

    2015-08-15

    Taxonomic classification of Clostridium botulinum is based on the production of botulinum neurotoxin (BoNT), while closely related, nontoxic organisms are classified as Clostridium sporogenes. However, this taxonomic organization does not accurately mirror phylogenetic relationships between these species. A phylogenetic reconstruction using 2,016 orthologous genes shared among strains of C. botulinum group I and C. sporogenes clearly separated these two species into discrete clades which showed ∼93% average nucleotide identity (ANI) between them. Clustering of strains based on the presence of variable orthologs revealed 143 C. sporogenes clade-specific genetic signatures, a subset of which were further evaluated for their ability to correctly classify a panel of presumptive C. sporogenes strains by PCR. Genome sequencing of several C. sporogenes strains lacking these signatures confirmed that they clustered with C. botulinum strains in a core genome phylogenetic tree. Our analysis also identified C. botulinum strains that contained C. sporogenes clade-specific signatures and phylogenetically clustered with C. sporogenes strains. The genome sequences of two bont/B2-containing strains belonging to the C. sporogenes clade contained regions with similarity to a bont-bearing plasmid (pCLD), while two different strains belonging to the C. botulinum clade carried bont/B2 on the chromosome. These results indicate that bont/B2 was likely acquired by C. sporogenes strains through horizontal gene transfer. The genome-based classification of these species used to identify candidate genes for the development of rapid assays for molecular identification may be applicable to additional bacterial species that are challenging with respect to their classification.

  7. All duplicates are not equal: the difference between small-scale and genome duplication

    PubMed Central

    Hakes, Luke; Pinney, John W; Lovell, Simon C; Oliver, Stephen G; Robertson, David L

    2007-01-01

    Background Genes in populations are in constant flux, being gained through duplication and occasionally retained or, more frequently, lost from the genome. In this study we compare pairs of identifiable gene duplicates generated by small-scale (predominantly single-gene) duplications with those created by a large-scale gene duplication event (whole-genome duplication) in the yeast Saccharomyces cerevisiae. Results We find a number of quantifiable differences between these data sets. Whole-genome duplicates tend to exhibit less profound phenotypic effects when deleted, are functionally less divergent, and are associated with a different set of functions than their small-scale duplicate counterparts. At first sight, either of these latter two features could provide a plausible mechanism by which the difference in dispensability might arise. However, we uncover no evidence suggesting that this is the case. We find that the difference in dispensability observed between the two duplicate types is limited to gene products found within protein complexes, and probably results from differences in the relative strength of the evolutionary pressures present following each type of duplication event. Conclusion Genes, and the proteins they specify, originating from small-scale and whole-genome duplication events differ in quantifiable ways. We infer that this is not due to their association with different functional categories; rather, it is a direct result of biases in gene retention. PMID:17916239

  8. Characterization of bacterial community dynamics in a full-scale drinking water treatment plant.

    PubMed

    Li, Cuiping; Ling, Fangqiong; Zhang, Minglu; Liu, Wen-Tso; Li, Yuxian; Liu, Wenjun

    2017-01-01

    Understanding the spatial and temporal dynamics of microbial communities in drinking water systems is vital to securing the microbial safety of drinking water. The objective of this study was to comprehensively characterize the dynamics of microbial biomass and bacterial communities at each step of a full-scale drinking water treatment plant in Beijing, China. Both bulk water and biofilm samples on granular activated carbon (GAC) were collected over 9months. The proportion of cultivable cells decreased during the treatment processes, and this proportion was higher in warm season than cool season, suggesting that treatment processes and water temperature probably had considerable impact on the R2A cultivability of total bacteria. 16s rRNA gene based 454 pyrosequencing analysis of the bacterial community revealed that Proteobacteria predominated in all samples. The GAC biofilm harbored a distinct population with a much higher relative abundance of Acidobacteria than water samples. Principle coordinate analysis and one-way analysis of similarity indicated that the dynamics of the microbial communities in bulk water and biofilm samples were better explained by the treatment processes rather than by sampling time, and distinctive changes of the microbial communities in water occurred after GAC filtration. Furthermore, 20 distinct OTUs contributing most to the dissimilarity among samples of different sampling locations and 6 persistent OTUs present in the entire treatment process flow were identified. Overall, our findings demonstrate the significant effects that treatment processes have on the microbial biomass and community fluctuation and provide implications for further targeted investigation on particular bacteria populations.

  9. Rapid editing and evolution of bacterial genomes using libraries of synthetic DNA.

    PubMed

    Gallagher, Ryan R; Li, Zhe; Lewis, Aaron O; Isaacs, Farren J

    2014-10-01

    Multiplex automated genome engineering (MAGE) is a powerful technology for in vivo genome editing that uses synthetic single-stranded DNA (ssDNA) to introduce targeted modifications directly into the Escherichia coli chromosome. MAGE is a cyclical process that involves transformation of ssDNA (by electroporation) followed by outgrowth, during which bacteriophage homologous recombination proteins mediate annealing of ssDNAs to their genomic targets. By iteratively introducing libraries of mutagenic ssDNAs targeting multiple sites, MAGE can generate combinatorial genetic diversity in a cell population. Alternatively, MAGE can introduce precise mutant alleles at many loci for genome-wide editing or for recoding projects that are not possible with other methods. In recent technological advances, MAGE has been improved by strain modifications and selection techniques that enhance allelic replacement. This protocol describes the manual execution of MAGE wherein each cycle takes ≈ 2.5 h, which, if carried out by two people, allows ≈ 10 continuous cycles of MAGE-based mutagenesis per day.

  10. Germinal transmission of site-specific excised genomic DNA by the bacterial ParA resolvase

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome engineering is an essential tool in research and product development. Behind some of the recent advances in plant gene transfer is the development of site-specific recombination systems that enable the precise manipulation of DNA, e.g. the deletion, integration or translocation of DNA. DNA ...

  11. Finished Genome Sequences of Xanthomonas fragariae, the Cause of Bacterial Angular Leaf Spot of Strawberry

    PubMed Central

    Henry, Peter M.

    2016-01-01

    Xanthomonas fragariae is a foliar pathogen of strawberry that is of significant concern to nursery production of strawberry transplants and field production of strawberry fruit. Long-read sequencing was employed to generate finished genomes for two isolates (each with one chromosome and two plasmids) from symptomatic plants in northern California. PMID:27834715

  12. Draft genome sequence of Erwinia tracheiphila, an economically important bacterial pathogen of cucurbits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Erwinia tracheiphila is one of the most economically important pathogen of cucumbers, melons, squashes, pumpkins, and gourds, in the Northeastern and Midwestern United States, yet the molecular pathology remains uninvestigated. Here we report the first draft genome sequence of an E. tracheiphila str...

  13. Comparative genomics of bacterial and plant folate synthesis and salvage: predictions and validations

    PubMed Central

    de Crécy-Lagard, Valérie; El Yacoubi, Basma; de la Garza, Rocío Díaz; Noiriel, Alexandre; Hanson, Andrew D

    2007-01-01

    Background Folate synthesis and salvage pathways are relatively well known from classical biochemistry and genetics but they have not been subjected to comparative genomic analysis. The availability of genome sequences from hundreds of diverse bacteria, and from Arabidopsis thaliana, enabled such an analysis using the SEED database and its tools. This study reports the results of the analysis and integrates them with new and existing experimental data. Results Based on sequence similarity and the clustering, fusion, and phylogenetic distribution of genes, several functional predictions emerged from this analysis. For bacteria, these included the existence of novel GTP cyclohydrolase I and folylpolyglutamate synthase gene families, and of a trifunctional p-aminobenzoate synthesis gene. For plants and bacteria, the predictions comprised the identities of a 'missing' folate synthesis gene (folQ) and of a folate transporter, and the absence from plants of a folate salvage enzyme. Genetic and biochemical tests bore out these predictions. Conclusion For bacteria, these results demonstrate that much can be learnt from comparative genomics, even for well-explored primary metabolic pathways. For plants, the findings particularly illustrate the potential for rapid functional assignment of unknown genes that have prokaryotic homologs, by analyzing which genes are associated with the latter. More generally, our data indicate how combined genomic analysis of both plants and prokaryotes can be more powerful than isolated examination of either group alone. PMID:17645794

  14. Genome Evolution and Nitrogen Fixation in Bacterial Ectosymbionts of a Protist Inhabiting Wood-Feeding Cockroaches

    PubMed Central

    Carpenter, Kevin J.; Weber, Peter K.; Nalepa, Christine A.; Perlman, Steve J.; Keeling, Patrick J.

    2016-01-01

    ABSTRACT By combining genomics and isotope imaging analysis using high-resolution secondary ion mass spectrometry (NanoSIMS), we examined the function and evolution of Bacteroidales ectosymbionts of the protist Barbulanympha from the hindguts of the wood-eating cockroach Cryptocercus punctulatus. In particular, we investigated the structure of ectosymbiont genomes, which, in contrast to those of endosymbionts, has been little studied to date, and tested the hypothesis that these ectosymbionts fix nitrogen. Unlike with most obligate endosymbionts, genome reduction has not played a major role in the evolution of the Barbulanympha ectosymbionts. Instead, interaction with the external environment has remained important for this symbiont as genes for synthesis of transporters, outer membrane proteins, lipopolysaccharides, and lipoproteins have been retained. The ectosymbiont genome carried two complete operons for nitrogen fixation, a urea transporter, and a urease, indicating the availability of nitrogen as a driving force behind the symbiosis. NanoSIMS analysis of C. punctulatus hindgut symbionts exposed in vivo to 15N2 supports the hypothesis that Barbulanympha ectosymbionts are capable of nitrogen fixation. This genomic and in vivo functional investigation of protist ectosymbionts highlights the diversity of evolutionary forces and trajectories that shape symbiotic interactions. IMPORTANCE The ecological and evolutionary importance of symbioses is increasingly clear, but the overall diversity of symbiotic interactions remains poorly explored. In this study, we investigated the evolution and nitrogen fixation capabilities of ectosymbionts attached to the protist Barbulanympha from the hindgut of the wood-eating cockroach Cryptocercus punctulatus. In addressing genome evolution of protist ectosymbionts, our data suggest that the ecological pressures influencing the evolution of extracellular symbionts clearly differ from intracellular symbionts and organelles. Using

  15. Genome evolution and nitrogen fixation in bacterial ectosymbionts of a protist inhabiting wood-feeding cockroaches

    DOE PAGES

    Tai, Vera; Carpenter, Kevin J.; Weber, Peter K.; ...

    2016-05-27

    By combining genomics and isotope imaging analysis using high-resolution secondary ion mass spectrometry (NanoSIMS), we examined the function and evolution of Bacteroidales ectosymbionts of the protistBarbulanymphafrom the hindguts of the wood-eating cockroachCryptocercus punctulatus. In particular, we investigated the structure of ectosymbiont genomes, which, in contrast to those of endosymbionts, has been little studied to date, and tested the hypothesis that these ectosymbionts fix nitrogen. Unlike with most obligate endosymbionts, genome reduction has not played a major role in the evolution of the Barbulanympha ectosymbionts. Instead, interaction with the external environment has remained important for this symbiont as genes for synthesismore » of transporters, outer membrane proteins, lipopolysaccharides, and lipoproteins have been retained. The ectosymbiont genome carried two complete operons for nitrogen fixation, a urea transporter, and a urease, indicating the availability of nitrogen as a driving force behind the symbiosis. NanoSIMS analysis ofC. punctulatushindgut symbionts exposedin vivoto15N2 supports the hypothesis thatBarbulanymphaectosymbionts are capable of nitrogen fixation. This genomic andin vivofunctional investigation of protist ectosymbionts highlights the diversity of evolutionary forces and trajectories that shape symbiotic interactions. The ecological and evolutionary importance of symbioses is increasingly clear, but the overall diversity of symbiotic interactions remains poorly explored. Here in this study, we investigated the evolution and nitrogen fixation capabilities of ectosymbionts attached to the protist Barbulanympha from the hindgut of the wood-eating cockroach Cryptocercus punctulatus. In addressing genome evolution of protist ectosymbionts, our data suggest that the ecological pressures influencing the evolution of extracellular symbionts clearly differ from intracellular symbionts and organelles. Using NanoSIMS analysis

  16. Genome evolution and nitrogen fixation in bacterial ectosymbionts of a protist inhabiting wood-feeding cockroaches

    SciTech Connect

    Tai, Vera; Carpenter, Kevin J.; Weber, Peter K.; Nalepa, Christine A.; Perlman, Steve J.; Keeling, Patrick J.

    2016-05-27

    By combining genomics and isotope imaging analysis using high-resolution secondary ion mass spectrometry (NanoSIMS), we examined the function and evolution of Bacteroidales ectosymbionts of the protistBarbulanymphafrom the hindguts of the wood-eating cockroachCryptocercus punctulatus. In particular, we investigated the structure of ectosymbiont genomes, which, in contrast to those of endosymbionts, has been little studied to date, and tested the hypothesis that these ectosymbionts fix nitrogen. Unlike with most obligate endosymbionts, genome reduction has not played a major role in the evolution of the Barbulanympha ectosymbionts. Instead, interaction with the external environment has remained important for this symbiont as genes for synthesis of transporters, outer membrane proteins, lipopolysaccharides, and lipoproteins have been retained. The ectosymbiont genome carried two complete operons for nitrogen fixation, a urea transporter, and a urease, indicating the availability of nitrogen as a driving force behind the symbiosis. NanoSIMS analysis ofC. punctulatushindgut symbionts exposedin vivoto15N2 supports the hypothesis thatBarbulanymphaectosymbionts are capable of nitrogen fixation. This genomic andin vivofunctional investigation of protist ectosymbionts highlights the diversity of evolutionary forces and trajectories that shape symbiotic interactions. The ecological and evolutionary importance of symbioses is increasingly clear, but the overall diversity of symbiotic interactions remains poorly explored. Here in this study, we investigated the

  17. GMATA: An Integrated Software Package for Genome-Scale SSR Mining, Marker Development and Viewing

    PubMed Central

    Wang, Xuewen; Wang, Le

    2016-01-01

    Simple sequence repeats (SSRs), also referred to as microsatellites, are highly variable tandem DNAs that are widely used as genetic markers. The increasing availability of whole-genome and transcript sequences provides information resources for SSR marker development. However, efficient software is required to efficiently identify and display SSR information along with other gene features at a genome scale. We developed novel software package Genome-wide Microsatellite Analyzing Tool Package (GMATA) integrating SSR mining, statistical analysis and plotting, marker design, polymorphism screening and marker transferability, and enabled simultaneously display SSR markers with other genome features. GMATA applies novel strategies for SSR analysis and primer design in large genomes, which allows GMATA to perform faster calculation and provides more accurate results than existing tools. Our package is also capable of processing DNA sequences of any size on a standard computer. GMATA is user friendly, only requires mouse clicks or types inputs on the command line, and is executable in multiple computing platforms. We demonstrated the application of GMATA in plants genomes and reveal a novel distribution pattern of SSRs in 15 grass genomes. The most abundant motifs are dimer GA/TC, the A/T monomer and the GCG/CGC trimer, rather than the rich G/C content in DNA sequence. We also revealed that SSR count is a linear to the chromosome length in fully assembled grass genomes. GMATA represents a powerful application tool that facilitates genomic sequence analyses. GAMTA is freely available at http://sourceforge.net/projects/gmata/?source=navbar. PMID:27679641

  18. Genome Segregation and Packaging Machinery in Acanthamoeba polyphaga Mimivirus Is Reminiscent of Bacterial Apparatus

    PubMed Central

    Chelikani, Venkata; Ranjan, Tushar; Zade, Amrutraj; Shukla, Avi

    2014-01-01

    ABSTRACT Genome packaging is a critical step in the virion assembly process. The putative ATP-driven genome packaging motor of Acanthamoeba polyphaga mimivirus (APMV) and other nucleocytoplasmic large DNA viruses (NCLDVs) is a distant ortholog of prokaryotic chromosome segregation motors, such as FtsK and HerA, rather than other viral packaging motors, such as large terminase. Intriguingly, APMV also encodes other components, i.e., three putative serine recombinases and a putative type II topoisomerase, all of which are essential for chromosome segregation in prokaryotes. Based on our analyses of these components and taking the limited available literature into account, here we propose for the first time a model for genome segregation and packaging in APMV that can possibly be extended to NCLDV subfamilies, except perhaps Poxviridae and Ascoviridae. This model might represent a unique variation of the prokaryotic system acquired and contrived by the large DNA viruses of eukaryotes. It is also consistent with previous observations that unicellular eukaryotes, such as amoebae, are melting pots for the advent of chimeric organisms with novel mechanisms. IMPORTANCE Extremely large viruses with DNA genomes infect a wide range of eukaryotes, from human beings to amoebae and from crocodiles to algae. These large DNA viruses, unlike their much smaller cousins, have the capability of making most of the protein components required for their multiplication. Once they infect the cell, these viruses set up viral replication centers, known as viral factories, to carry out their multiplication with very little help from the host. Our sequence analyses show that there is remarkable similarity between prokaryotes (bacteria and archaea) and large DNA viruses, such as mimivirus, vaccinia virus, and pandoravirus, in the way that they process their newly synthesized genetic material to make sure that only one copy of the complete genome is generated and is meticulously placed inside

  19. CRISPR-Cas9: from a bacterial immune system to genome-edited human cells in clinical trials.

    PubMed

    Kick, Leonhard; Kirchner, Marion; Schneider, Sabine

    2017-03-13

    The adaptive bacterial immune system CRISPR-Cas is revolutionising all fields of life science and has opened up new frontiers towards personalised medicine. Since the elucidation of the molecular mechanism of Cas9 from Streptococcus pyogenes in 2012 and its development as a genomic engineering tool, genetic modifications in more than 40 species have been carried out, over 290 patents have been filed worldwide and the first clinical trials using CRISPR-Cas-modified T-cells have recently been started in China and in the US. In this review we summarise current design developments, novel Cas systems and their antagonists, present and potential future applications as well as the ongoing debate on ethical issues, which has arisen through the CRISPR-Cas technology.

  20. A Year of Infection in the Intensive Care Unit: Prospective Whole Genome Sequencing of Bacterial Clinical Isolates Reveals Cryptic Transmissions and Novel Microbiota.

    PubMed

    Roach, David J; Burton, Joshua N; Lee, Choli; Stackhouse, Bethany; Butler-Wu, Susan M; Cookson, Brad T; Shendure, Jay; Salipante, Stephen J

    2015-07-01

    Bacterial whole genome sequencing holds promise as a disruptive technology in clinical microbiology, but it has not yet been applied systematically or comprehensively within a clinical context. Here, over the course of one year, we performed prospective collection and whole genome sequencing of nearly all bacterial isolates obtained from a tertiary care hospital's intensive care units (ICUs). This unbiased collection of 1,229 bacterial genomes from 391 patients enables detailed exploration of several features of clinical pathogens. A sizable fraction of isolates identified as clinically relevant corresponded to previously undescribed species: 12% of isolates assigned a species-level classification by conventional methods actually qualified as distinct, novel genomospecies on the basis of genomic similarity. Pan-genome analysis of the most frequently encountered pathogens in the collection revealed substantial variation in pan-genome size (1,420 to 20,432 genes) and the rate of gene discovery (1 to 152 genes per isolate sequenced). Surprisingly, although potential nosocomial transmission of actively surveilled pathogens was rare, 8.7% of isolates belonged to genomically related clonal lineages that were present among multiple patients, usually with overlapping hospital admissions, and were associated with clinically significant infection in 62% of patients from which they were recovered. Multi-patient clonal lineages were particularly evident in the neonatal care unit, where seven separate Staphylococcus epidermidis clonal lineages were identified, including one lineage associated with bacteremia in 5/9 neonates. Our study highlights key differences in the information made available by conventional microbiological practices versus whole genome sequencing, and motivates the further integration of microbial genome sequencing into routine clinical care.

  1. CyanoGEBA: A Better Understanding of Cynobacterial Diversity through Large-scale Genomics (JGI Seventh Annual User Meeting 2012: Genomics of Energy and Environment)

    ScienceCinema

    Shih, Patrick [Kerfeld Lab, UC Berkeley and JGI

    2016-07-12

    Patrick Shih, representing both the University of California, Berkeley and JGI, gives a talk titled "CyanoGEBA: A Better Understanding of Cynobacterial Diversity through Large-scale Genomics" at the JGI 7th Annual Users Meeting: Genomics of Energy & Environment Meeting on March 22, 2012 in Walnut Creek, California.

  2. Large-scale analysis of the yeast genome by transposon tagging and gene disruption.

    PubMed

    Ross-Macdonald, P; Coelho, P S; Roemer, T; Agarwal, S; Kumar, A; Jansen, R; Cheung, K H; Sheehan, A; Symoniatis, D; Umansky, L; Heidtman, M; Nelson, F K; Iwasaki, H; Hager, K; Gerstein, M; Miller, P; Roeder, G S; Snyder, M

    1999-11-25

    Economical methods by which gene function may be analysed on a genomic scale are relatively scarce. To fill this need, we have developed a transposon-tagging strategy for the genome-wide analysis of disruption phenotypes, gene expression and protein localization, and have applied this method to the large-scale analysis of gene function in the budding yeast Saccharomyces cerevisiae. Here we present the largest collection of defined yeast mutants ever generated within a single genetic background--a collection of over 11,000 strains, each carrying a transposon inserted within a region of the genome expressed during vegetative growth and/or sporulation. These insertions affect nearly 2,000 annotated genes, representing about one-third of the 6,200 predicted genes in the yeast genome. We have used this collection to determine disruption phenotypes for nearly 8,000 strains using 20 different growth conditions; the resulting data sets were clustered to identify groups of functionally related genes. We have also identified over 300 previously non-annotated open reading frames and analysed by indirect immunofluorescence over 1,300 transposon-tagged proteins. In total, our study encompasses over 260,000 data points, constituting the largest functional analysis of the yeast genome ever undertaken.

  3. BiGG Models: A platform for integrating, standardizing and sharing genome-scale models

    DOE PAGES

    King, Zachary A.; Lu, Justin; Drager, Andreas; ...

    2015-10-17

    In this study, genome-scale metabolic models are mathematically structured knowledge bases that can be used to predict metabolic pathway usage and growth phenotypes. Furthermore, they can generate and test hypotheses when integrated with experimental data. To maximize the value of these models, centralized repositories of high-quality models must be established, models must adhere to established standards and model components must be linked to relevant databases. Tools for model visualization further enhance their utility. To meet these needs, we present BiGG Models (http://bigg.ucsd.edu), a completely redesigned Biochemical, Genetic and Genomic knowledge base. BiGG Models contains more than 75 high-quality, manually-curated genome-scalemore » metabolic models. On the website, users can browse, search and visualize models. BiGG Models connects genome-scale models to genome annotations and external databases. Reaction and metabolite identifiers have been standardized across models to conform to community standards and enable rapid comparison across models. Furthermore, BiGG Models provides a comprehensive application programming interface for accessing BiGG Models with modeling and analysis tools. As a resource for highly curated, standardized and accessible models of metabolism, BiGG Models will facilitate diverse systems biology studies and support knowledge-based analysis of diverse experimental data.« less

  4. Biomarker-based classification of bacterial and fungal whole-blood infections in a genome-wide expression study

    PubMed Central

    Dix, Andreas; Hünniger, Kerstin; Weber, Michael; Guthke, Reinhard; Kurzai, Oliver; Linde, Jörg

    2015-01-01

    Sepsis is a clinical syndrome that can be caused by bacteria or fungi. Early knowledge on the nature of the causative agent is a prerequisite for targeted anti-microbial therapy. Besides currently used detection methods like blood culture and PCR-based assays, the analysis of the transcriptional response of the host to infecting organisms holds great promise. In this study, we aim to examine the transcriptional footprint of infections caused by the bacterial pathogens Staphylococcus aureus and Escherichia coli and the fungal pathogens Candida albicans and Aspergillus fumigatus in a human whole-blood model. Moreover, we use the expression information to build a random forest classifier to classify if a sample contains a bacterial, fungal, or mock-infection. After normalizing the transcription intensities using stably expressed reference genes, we filtered the gene set for biomarkers of bacterial or fungal blood infections. This selection is based on differential expression and an additional gene relevance measure. In this way, we identified 38 biomarker genes, including IL6, SOCS3, and IRG1 which were already associated to sepsis by other studies. Using these genes, we trained the classifier and assessed its performance. It yielded a 96% accuracy (sensitivities >93%, specificities >97%) for a 10-fold stratified cross-validation and a 92% accuracy (sensitivities and specificities >83%) for an additional test dataset comprising Cryptococcus neoformans infections. Furthermore, the classifier is robust to Gaussian noise, indicating correct class predictions on datasets of new species. In conclusion, this genome-wide approach demonstrates an effective feature selection process in combination with the construction of a well-performing classification model. Further analyses of genes with pathogen-dependent expression patterns can provide insights into the systemic host responses, which may lead to new anti-microbial therapeutic advances. PMID:25814982

  5. Biomarker-based classification of bacterial and fungal whole-blood infections in a genome-wide expression study.

    PubMed

    Dix, Andreas; Hünniger, Kerstin; Weber, Michael; Guthke, Reinhard; Kurzai, Oliver; Linde, Jörg

    2015-01-01

    Sepsis is a clinical syndrome that can be caused by bacteria or fungi. Early knowledge on the nature of the causative agent is a prerequisite for targeted anti-microbial therapy. Besides currently used detection methods like blood culture and PCR-based assays, the analysis of the transcriptional response of the host to infecting organisms holds great promise. In this study, we aim to examine the transcriptional footprint of infections caused by the bacterial pathogens Staphylococcus aureus and Escherichia coli and the fungal pathogens Candida albicans and Aspergillus fumigatus in a human whole-blood model. Moreover, we use the expression information to build a random forest classifier to classify if a sample contains a bacterial, fungal, or mock-infection. After normalizing the transcription intensities using stably expressed reference genes, we filtered the gene set for biomarkers of bacterial or fungal blood infections. This selection is based on differential expression and an additional gene relevance measure. In this way, we identified 38 biomarker genes, including IL6, SOCS3, and IRG1 which were already associated to sepsis by other studies. Using these genes, we trained the classifier and assessed its performance. It yielded a 96% accuracy (sensitivities >93%, specificities >97%) for a 10-fold stratified cross-validation and a 92% accuracy (sensitivities and specificities >83%) for an additional test dataset comprising Cryptococcus neoformans infections. Furthermore, the classifier is robust to Gaussian noise, indicating correct class predictions on datasets of new species. In conclusion, this genome-wide approach demonstrates an effective feature selection process in combination with the construction of a well-performing classification model. Further analyses of genes with pathogen-dependent expression patterns can provide insights into the systemic host responses, which may lead to new anti-microbial therapeutic advances.

  6. Genome-scale approaches to the epigenetics of common human disease.

    PubMed

    Feinberg, Andrew P

    2010-01-01

    Traditionally, the pathology of human disease has been focused on microscopic examination of affected tissues, chemical and biochemical analysis of biopsy samples, other available samples of convenience, such as blood, and noninvasive or invasive imaging of varying complexity, in order to classify disease and illuminate its mechanistic basis. The molecular age has complemented this armamentarium with gene expression arrays and selective analysis of individual genes. However, we are entering a new era of epigenomic profiling, i.e., genome-scale analysis of cell-heritable nonsequence genetic change, such as DNA methylation. The epigenome offers access to stable measurements of cellular state and to biobanked material for large-scale epidemiological studies. Some of these genome-scale technologies are beginning to be applied to create the new field of epigenetic epidemiology.

  7. Genome-Scale Analysis of Cell-Specific Regulatory Codes Using Nuclear Enzymes.

    PubMed

    Baek, Songjoon; Sung, Myong-Hee

    2016-01-01

    High-throughput sequencing technologies have made it possible for biologists to generate genome-wide profiles of chromatin features at the nucleotide resolution. Enzymes such as nucleases or transposes have been instrumental as a chromatin-probing agent due to their ability to target accessible chromatin for cleavage or insertion. On the scale of a few hundred base pairs, preferential action of the nuclear enzymes on accessible chromatin allows mapping of cell state-specific accessibility in vivo. Such accessible regions contain functionally important regulatory sites, including promoters and enhancers, which undergo active remodeling for cells adapting in a dynamic environment. DNase-seq and the more recent ATAC-seq are two assays that are gaining popularity. Deep sequencing of DNA libraries from these assays, termed genomic footprinting, has been proposed to enable the comprehensive construction of protein occupancy profiles over the genome at the nucleotide level. Recent studies have discovered limitations of genomic footprinting which reduce the scope of detectable proteins. In addition, the identification of putative factors that bind to the observed footprints remains challenging. Despite these caveats, the methodology still presents significant advantages over alternative techniques such as ChIP-seq or FAIRE-seq. Here we describe computational approaches and tools for analysis of chromatin accessibility and genomic footprinting. Proper experimental design and assay-specific data analysis ensure the detection sensitivity and maximize retrievable information. The enzyme-based chromatin profiling approaches represent a powerful and evolving methodology which facilitates our understanding of how the genome is regulated.

  8. [Bacterial infections as seen from the eukaryotic genome: DNA double strand breaks, inflammation and cancer].

    PubMed

    Lemercier, Claudie

    2014-01-01

    An increasing number of studies report that infection by pathogenic bacteria alters the host genome, producing highly hazardous DNA double strand breaks for the eukaryotic cell. Even when DNA repair occurs, it often leaves "scars" on chromosomes that might generate genomic instability at the next cell division. Chronic intestinal inflammation promotes the expansion of genotoxic bacteria in the intestinal microbiote which in turn triggers tumor formation and colon carcinomas. Bacteria act at the level of the host DNA repair machinery. They also highjack the host cell cycle to allow themselves time for replication in an appropriate reservoir. However, except in the case of bacteria carrying the CDT nuclease, the molecular mechanisms responsible for DNA lesions are not well understood, even if reactive oxygen species released during infection make good candidates.

  9. The complete genome sequence of Chromobacterium violaceum reveals remarkable and exploitable bacterial adaptability

    PubMed Central

    2003-01-01

    Chromobacterium violaceum is one of millions of species of free-living microorganisms that populate the soil and water in the extant areas of tropical biodiversity around the world. Its complete genome sequence reveals (i) extensive alternative pathways for energy generation, (ii) ≈500 ORFs for transport-related proteins, (iii) complex and extensive systems for stress adaptation and motility, and (iv) widespread utilization of quorum sensing for control of inducible systems, all of which underpin the versatility and adaptability of the organism. The genome also contains extensive but incomplete arrays of ORFs coding for proteins associated with mammalian pathogenicity, possibly involved in the occasional but often fatal cases of human C. violaceum infection. There is, in addition, a series of previously unknown but important enzymes and secondary metabolites including paraquat-inducible proteins, drug and heavy-metal-resistance proteins, multiple chitinases, and proteins for the detoxification of xenobiotics that may have biotechnological applications. PMID:14500782

  10. Draft Genome Sequence of Two Strains of Xanthomonas arboricola Isolated from Prunus persica Which Are Dissimilar to Strains That Cause Bacterial Spot Disease on Prunus spp.

    PubMed Central

    Garita-Cambronero, Jerson; Palacio-Bielsa, Ana; López, María M.

    2016-01-01

    The draft genome sequences of two strains of Xanthomonas arboricola, isolated from asymptomatic peach trees in Spain, are reported here. These strains are avirulent and do not belong to the same phylogroup as X. arboricola pv. pruni, a causal agent of bacterial spot disease of stone fruits and almonds. PMID:27609931

  11. Estimation of long terminal repeat element content in the Helicoverpa zea genome from high-throughput sequencing of bacterial artificial chromosome pools

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The lepidopteran pest insect, Helicoverpa zea, feeds on cultivated corn and cotton crops in North America where control remains challenging due to evolution of resistance to chemical and transgenic insecticidal toxins, yet few genomic resources are available for this species. A bacterial artificial...

  12. Complete Genome Sequence of Lactobacillus jensenii Strain SNUV360, a Probiotic for Treatment of Bacterial Vaginosis Isolated from the Vagina of a Healthy Korean Woman

    PubMed Central

    Lee, Sunghee; You, Hyun Ju; Kwon, Bomi

    2017-01-01

    ABSTRACT Lactobacillus jensenii SNUV360 is a potential probiotic strain that shows antimicrobial activity for the treatment of bacterial vaginosis. Here, we present the complete genomic sequence of L. jensenii SNUV360, isolated from a vaginal sample from a healthy Korean woman. Analysis of the sequence may provide insight into its functional activity. PMID:28280032

  13. Genomic selection models double the accuracy of predicted breeding values for bacterial cold water disease resistance compared to a traditional pedigree-based model in rainbow trout aquaculture

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Previously we have shown that bacterial cold water disease (BCWD) resistance in rainbow trout can be improved using traditional family-based selection, but progress has been limited to exploiting only between-family genetic variation. Genomic selection (GS) is a new alternative enabling exploitation...

  14. Genome-enabled selection doubles the accuracy of predicted breeding values for bacterial cold water disease resistance compared to traditional family-based selection in rainbow trout aquaculture

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We have shown previously that bacterial cold water disease (BCWD) resistance in rainbow trout can be improved using traditional family-based selection, but progress has been limited to exploiting only between-family genetic variation. Genomic selection (GS) is a new alternative enabling exploitation...

  15. Genome-scale CRISPR-Cas9 knockout screening in human cells.

    PubMed

    Shalem, Ophir; Sanjana, Neville E; Hartenian, Ella; Shi, Xi; Scott, David A; Mikkelsen, Tarjei S; Heckl, Dirk; Ebert, Benjamin L; Root, David E; Doench, John G; Zhang, Feng

    2014-01-03

    The simplicity of programming the CRISPR (clustered regularly interspaced short palindromic repeats)-associated nuclease Cas9 to modify specific genomic loci suggests a new way to interrogate gene function on a genome-wide scale. We show that lentiviral delivery of a genome-scale CRISPR-Cas9 knockout (GeCKO) library targeting 18,080 genes with 64,751 unique guide sequences enables both negative and positive selection screening in human cells. First, we used the GeCKO library to identify genes essential for cell viability in cancer and pluripotent stem cells. Next, in a melanoma model, we screened for genes whose loss is involved in resistance to vemurafenib, a therapeutic RAF inhibitor. Our highest-ranking candidates include previously validated genes NF1 and MED12, as well as novel hits NF2, CUL3, TADA2B, and TADA1. We observe a high level of consistency between independent guide RNAs targeting the same gene and a high rate of hit confirmation, demonstrating the promise of genome-scale screening with Cas9.

  16. Stress-induced Condensation of Bacterial Genomes Results in Re-pairing of Sister Chromosomes

    PubMed Central

    Shechter, Nelia; Zaltzman, Liron; Weiner, Allon; Brumfeld, Vlad; Shimoni, Eyal; Fridmann-Sirkis, Yael; Minsky, Abraham

    2013-01-01

    Genome condensation is increasingly recognized as a generic stress response in bacteria. To better understand the physiological implications of this response, we used fluorescent markers to locate specific sites on Escherichia coli chromosomes following exposure to cytotoxic stress. We find that stress-induced condensation proceeds through a nonrandom, zipper-like convergence of sister chromosomes, which is proposed to rely on the recently demonstrated intrinsic ability of identical double-stranded DNA molecules to specifically identify each other. We further show that this convergence culminates in spatial proximity of homologous sites throughout chromosome arms. We suggest that the resulting apposition of homologous sites can explain how repair of double strand DNA breaks might occur in a mechanism that is independent of the widely accepted yet physiologically improbable genome-wide search for homologous templates. We claim that by inducing genome condensation and orderly convergence of sister chromosomes, diverse stress conditions prime bacteria to effectively cope with severe DNA lesions such as double strand DNA breaks. PMID:23884460

  17. Expansion of Cultured Bacterial Diversity by Large-Scale Dilution-to-Extinction Culturing from a Single Seawater Sample.

    PubMed

    Yang, Seung-Jo; Kang, Ilnam; Cho, Jang-Cheon

    2016-01-01

    High-throughput cultivation (HTC) based on a dilution-to-extinction method has been applied broadly to the cultivation of marine bacterial groups, which has often led to the repeated isolation of abundant lineages such as SAR11 and oligotrophic marine gammaproteobacteria (OMG). In this study, to expand the phylogenetic diversity of HTC isolates, we performed a large-scale HTC with a single surface seawater sample collected from the East Sea, the Western Pacific Ocean. Phylogenetic analyses of the 16S rRNA genes from 847 putative pure cultures demonstrated that some isolates were affiliated with not-yet-cultured clades, including the OPB35 and Puniceicoccaceae marine group of Verrucomicrobia and PS1 of Alphaproteobacteria. In addition, numerous strains were obtained from abundant clades, such as SAR11, marine Roseobacter clade, OMG (e.g., SAR92 and OM60), OM43, and SAR116, thereby increasing the size of available culture resources for representative marine bacterial groups. Comparison between the composition of HTC isolates and the bacterial community structure of the seawater sample used for HTC showed that diverse marine bacterial groups exhibited various growth capabilities under our HTC conditions. The growth response of many bacterial groups, however, was clearly different from that observed with conventional plating methods, as exemplified by numerous isolates of the SAR11 clade and Verrucomicrobia. This study showed that a large number of novel bacterial strains could be obtained by an extensive HTC from even a small number of samples.

  18. Extensive Capsule Locus Variation and Large-Scale Genomic Recombination within the Klebsiella pneumoniae Clonal Group 258.

    PubMed

    Wyres, Kelly L; Gorrie, Claire; Edwards, David J; Wertheim, Heiman F L; Hsu, Li Yang; Van Kinh, Nguyen; Zadoks, Ruth; Baker, Stephen; Holt, Kathryn E

    2015-04-10

    Klebsiella pneumoniae clonal group (CG) 258, comprising sequence types (STs) 258, 11, and closely related variants, is associated with dissemination of the K. pneumoniae carbapenemase (KPC). Hospital outbreaks of KPC CG258 infections have been observed globally and are very difficult to treat. As a consequence, there is renewed interest in alternative infection control measures such as vaccines and phage or depolymerase treatments targeting the K. pneumoniae polysaccharide capsule. To date, 78 immunologically distinct capsule variants have been described in K. pneumoniae. Previous investigations of ST258 and a small number of closely related strains suggested that capsular variation was limited within this clone; only two distinct ST258 capsule polysaccharide synthesis (cps) loci have been identified, both acquired through large-scale recombination events (>50 kb). In contrast to previous studies, we report a comparative genomic analysis of the broader K. pneumoniae CG258 (n = 39). We identified 11 different cps loci within CG258, indicating that capsular switching is actually common within the complex. We observed several insertion sequences (IS) within the cps loci, and show further intraclone diversification of two cps loci through IS activity. Our data also indicate that several large-scale recombination events have shaped the genomes of CG258, and that definition of the complex should be broadened to include ST395 (also reported to harbor KPC). As only the second report of extensive intraclonal cps variation among Gram-negative bacterial species, our findings alter our understanding of the evolution of these organisms and have key implications for the design of control measures targeting K. pneumoniae capsules.

  19. antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences.

    PubMed

    Medema, Marnix H; Blin, Kai; Cimermancic, Peter; de Jager, Victor; Zakrzewski, Piotr; Fischbach, Michael A; Weber, Tilmann; Takano, Eriko; Breitling, Rainer

    2011-07-01

    Bacterial and fungal secondary metabolism is a rich source of novel bioactive compounds with potential pharmaceutical applications as antibiotics, anti-tumor drugs or cholesterol-lowering drugs. To find new drug candidates, microbiologists are increasingly relying on sequencing genomes of a wide variety of microbes. However, rapidly and reliably pinpointing all the potential gene clusters for secondary metabolites in dozens of newly sequenced genomes has been extremely challenging, due to their biochemical heterogeneity, the presence of unknown enzymes and the dispersed nature of the necessary specialized bioinformatics tools and resources. Here, we present antiSMASH (antibiotics & Secondary Metabolite Analysis Shell), the first comprehensive pipeline capable of identifying biosynthetic loci covering the whole range of known secondary metabolite compound classes (polyketides, non-ribosomal peptides, terpenes, aminoglycosides, aminocoumarins, indolocarbazoles, lantibiotics, bacteriocins, nucleosides, beta-lactams, butyrolactones, siderophores, melanins and others). It aligns the identified regions at the gene cluster level to their nearest relatives from a database containing all other known gene clusters, and integrates or cross-links all previously available secondary-metabolite specific gene analysis methods in one interactive view. antiSMASH is available at http://antismash.secondarymetabolites.org.

  20. Niche differentiation of bacterial communities at a millimeter scale in Shark Bay microbial mats

    PubMed Central

    Wong, Hon Lun; Smith, Daniela-Lee; Visscher, Pieter T.; Burns, Brendan P.

    2015-01-01

    Modern microbial mats can provide key insights into early Earth ecosystems, and Shark Bay, Australia, holds one of the best examples of these systems. Identifying the spatial distribution of microorganisms with mat depth facilitates a greater understanding of specific niches and potentially novel microbial interactions. High throughput sequencing coupled with elemental analyses and biogeochemical measurements of two distinct mat types (smooth and pustular) at a millimeter scale were undertaken in the present study. A total of 8,263,982 16S rRNA gene sequences were obtained, which were affiliated to 58 bacterial and candidate phyla. The surface of both mats were dominated by Cyanobacteria, accompanied with known or putative members of Alphaproteobacteria and Bacteroidetes. The deeper anoxic layers of smooth mats were dominated by Chloroflexi, while Alphaproteobacteria dominated the lower layers of pustular mats. In situ microelectrode measurements revealed smooth mats have a steeper profile of O2 and H2S concentrations, as well as higher oxygen production, consumption, and sulfate reduction rates. Specific elements (Mo, Mg, Mn, Fe, V, P) could be correlated with specific mat types and putative phylogenetic groups. Models are proposed for these systems suggesting putative surface anoxic niches, differential nitrogen fixing niches, and those coupled with methane metabolism. PMID:26499760

  1. Bacterial community structure of a lab-scale anammox membrane bioreactor.

    PubMed

    Gonzalez-Martinez, Alejandro; Osorio, F; Rodriguez-Sanchez, Alejandro; Martinez-Toledo, Maria Victoria; Gonzalez-Lopez, Jesus; Lotti, Tommaso; van Loosdrecht, M C M

    2015-01-01

    Autotrophic nitrogen removal technologies have proliferated through the last decade. Among these, a promising one is the membrane bioreactor (MBR) Anammox, which can achieve very high solids retention time and therefore sets a proper environment for the cultivation of anammox bacteria. In this sense, the MBR Anammox is an efficient technology for the treatment of effluents with low organic carbon and high ammonium concentrations once it has been treated under partial nitrification systems. A lab-scale MBR Anammox bioreactor has been built at the Technological University of Delft, The Netherlands and has been proven for efficient nitrogen removal and efficient cultivation of anammox bacteria. In this study, next-generation sequencing techniques have been used for the investigation of the bacterial communities of this MBR Anammox for the first time ever. A strong domination of Candidatus Brocadia bacterium and also the presence of a myriad of other microorganisms that have adapted to this environment were detected, suggesting that the MBR Anammox bioreactor might have a more complex microbial ecosystem that it has been thought. Among these, nitrate-reducing heterotrophs and primary producers, among others, were identified. Definition of the ecological roles of the OTUs identified through metagenomic analysis was discussed.

  2. Disinfection of bacterial biofilms in pilot-scale cooling tower systems.

    PubMed

    Liu, Yang; Zhang, Wei; Sileika, Tadas; Warta, Richard; Cianciotto, Nicholas P; Packman, Aaron I

    2011-04-01

    The impact of continuous chlorination and periodic glutaraldehyde treatment on planktonic and biofilm microbial communities was evaluated in pilot-scale cooling towers operated continuously for 3 months. The system was operated at a flow rate of 10,080 l day(-1). Experiments were performed with a well-defined microbial consortium containing three heterotrophic bacteria: Pseudomonas aeruginosa, Klebsiella pneumoniae and Flavobacterium sp. The persistence of each species was monitored in the recirculating cooling water loop and in biofilms on steel and PVC coupons in the cooling tower basin. The observed bacterial colonization in cooling towers did not follow trends in growth rates observed under batch conditions and, instead, reflected differences in the ability of each organism to remain attached and form biofilms under the high-through flow conditions in cooling towers. Flavobacterium was the dominant organism in the community, while P. aeruginosa and K. pneumoniae did not attach well to either PVC or steel coupons in cooling towers and were not able to persist in biofilms. As a result, the much greater ability of Flavobacterium to adhere to surfaces protected it from disinfection, whereas P. aeruginosa and K. pneumoniae were subject to rapid disinfection in the planktonic state.

  3. Disinfection of bacterial biofilms in pilot-scale cooling tower systems

    PubMed Central

    Liu, Yang; Zhang, Wei; Sileika, Tadas; Warta, Richard; Cianciotto, Nicholas P.; Packman, Aaron I.

    2015-01-01

    The impact of continuous chlorination and periodic glutaraldehyde treatment on planktonic and biofilm microbial communities was evaluated in pilot-scale cooling towers operated continuously for 3 months. The system was operated at a flow rate of 10,080 l day−1. Experiments were performed with a well-defined microbial consortium containing three heterotrophic bacteria: Pseudomonas aeruginosa, Klebsiella pneumoniae and Flavobacterium sp. The persistence of each species was monitored in the recirculating cooling water loop and in biofilms on steel and PVC coupons in the cooling tower basin. The observed bacterial colonization in cooling towers did not follow trends in growth rates observed under batch conditions and, instead, reflected differences in the ability of each organism to remain attached and form biofilms under the high-through flow conditions in cooling towers. Flavobacterium was the dominant organism in the community, while P. aeruginosa and K. pneumoniae did not attach well to either PVC or steel coupons in cooling towers and were not able to persist in biofilms. As a result, the much greater ability of Flavobacterium to adhere to surfaces protected it from disinfection, whereas P. aeruginosa and K. pneumoniae were subject to rapid disinfection in the planktonic state. PMID:21547755

  4. Niche differentiation of bacterial communities at a millimeter scale in Shark Bay microbial mats

    NASA Astrophysics Data System (ADS)

    Wong, Hon Lun; Smith, Daniela-Lee; Visscher, Pieter T.; Burns, Brendan P.

    2015-10-01

    Modern microbial mats can provide key insights into early Earth ecosystems, and Shark Bay, Australia, holds one of the best examples of these systems. Identifying the spatial distribution of microorganisms with mat depth facilitates a greater understanding of specific niches and potentially novel microbial interactions. High throughput sequencing coupled with elemental analyses and biogeochemical measurements of two distinct mat types (smooth and pustular) at a millimeter scale were undertaken in the present study. A total of 8,263,982 16S rRNA gene sequences were obtained, which were affiliated to 58 bacterial and candidate phyla. The surface of both mats were dominated by Cyanobacteria, accompanied with known or putative members of Alphaproteobacteria and Bacteroidetes. The deeper anoxic layers of smooth mats were dominated by Chloroflexi, while Alphaproteobacteria dominated the lower layers of pustular mats. In situ microelectrode measurements revealed smooth mats have a steeper profile of O2 and H2S concentrations, as well as higher oxygen production, consumption, and sulfate reduction rates. Specific elements (Mo, Mg, Mn, Fe, V, P) could be correlated with specific mat types and putative phylogenetic groups. Models are proposed for these systems suggesting putative surface anoxic niches, differential nitrogen fixing niches, and those coupled with methane metabolism.

  5. Weak synchronization and large-scale collective oscillation in dense bacterial suspensions.

    PubMed

    Chen, Chong; Liu, Song; Shi, Xia-Qing; Chaté, Hugues; Wu, Yilin

    2017-02-09

    Collective oscillatory behaviour is ubiquitous in nature, having a vital role in many biological processes from embryogenesis and organ development to pace-making in neuron networks. Elucidating the mechanisms that give rise to synchronization is essential to the understanding of biological self-organization. Collective oscillations in biological multicellular systems often arise from long-range coupling mediated by diffusive chemicals, by electrochemical mechanisms, or by biomechanical interaction between cells and their physical environment. In these examples, the phase of some oscillatory intracellular degree of freedom is synchronized. Here, in contrast, we report the discovery of a weak synchronization mechanism that does not require long-range coupling or inherent oscillation of individual cells. We find that millions of motile cells in dense bacterial suspensions can self-organize into highly robust collective oscillatory motion, while individual cells move in an erratic manner, without obvious periodic motion but with frequent, abrupt and random directional changes. So erratic are individual trajectories that uncovering the collective oscillations of our micrometre-sized cells requires individual velocities to be averaged over tens or hundreds of micrometres. On such large scales, the oscillations appear to be in phase and the mean position of cells typically describes a regular elliptic trajectory. We found that the phase of the oscillations is organized into a centimetre-scale travelling wave. We present a model of noisy self-propelled particles with strictly local interactions that accounts faithfully for our observations, suggesting that self-organized collective oscillatory motion results from spontaneous chiral and rotational symmetry breaking. These findings reveal a previously unseen type of long-range order in active matter systems (those in which energy is spent locally to produce non-random motion). This mechanism of collective oscillation may

  6. Weak synchronization and large-scale collective oscillation in dense bacterial suspensions

    NASA Astrophysics Data System (ADS)

    Chen, Chong; Liu, Song; Shi, Xia-Qing; Chaté, Hugues; Wu, Yilin

    2017-01-01

    Collective oscillatory behaviour is ubiquitous in nature, having a vital role in many biological processes from embryogenesis and organ development to pace-making in neuron networks. Elucidating the mechanisms that give rise to synchronization is essential to the understanding of biological self-organization. Collective oscillations in biological multicellular systems often arise from long-range coupling mediated by diffusive chemicals, by electrochemical mechanisms, or by biomechanical interaction between cells and their physical environment. In these examples, the phase of some oscillatory intracellular degree of freedom is synchronized. Here, in contrast, we report the discovery of a weak synchronization mechanism that does not require long-range coupling or inherent oscillation of individual cells. We find that millions of motile cells in dense bacterial suspensions can self-organize into highly robust collective oscillatory motion, while individual cells move in an erratic manner, without obvious periodic motion but with frequent, abrupt and random directional changes. So erratic are individual trajectories that uncovering the collective oscillations of our micrometre-sized cells requires individual velocities to be averaged over tens or hundreds of micrometres. On such large scales, the oscillations appear to be in phase and the mean position of cells typically describes a regular elliptic trajectory. We found that the phase of the oscillations is organized into a centimetre-scale travelling wave. We present a model of noisy self-propelled particles with strictly local interactions that accounts faithfully for our observations, suggesting that self-organized collective oscillatory motion results from spontaneous chiral and rotational symmetry breaking. These findings reveal a previously unseen type of long-range order in active matter systems (those in which energy is spent locally to produce non-random motion). This mechanism of collective oscillation may

  7. Combined Analysis of Variation in Core, Accessory and Regulatory Genome Regions Provides a Super-Resolution View into the Evolution of Bacterial Populations

    PubMed Central

    McNally, Alan; Oren, Yaara; Kelly, Darren; Sreecharan, Tristan; Vehkala, Minna; Välimäki, Niko; Prentice, Michael B.; Ashour, Amgad; Avram, Oren; Pupko, Tal; Literak, Ivan; Guenther, Sebastian; Schaufler, Katharina; Wieler, Lothar H.; Zhiyong, Zong; Sheppard, Samuel K.; Corander, Jukka

    2016-01-01

    The use of whole-genome phylogenetic analysis has revolutionized our understanding of the evolution and spread of many important bacterial pathogens due to the high resolution view it provides. However, the majority of such analyses do not consider the potential role of accessory genes when inferring evolutionary trajectories. Moreover, the recently discovered importance of the switching of gene regulatory elements suggests that an exhaustive analysis, combining information from core and accessory genes with regulatory elements could provide unparalleled detail of the evolution of a bacterial population. Here we demonstrate this principle by applying it to a worldwide multi-host sample of the important pathogenic E. coli lineage ST131. Our approach reveals the existence of multiple circulating subtypes of the major drug–resistant clade of ST131 and provides the first ever population level evidence of core genome substitutions in gene regulatory regions associated with the acquisition and maintenance of different accessory genome elements. PMID:27618184

  8. Distinct signatures of diversifying selection revealed by genome analysis of respiratory tract and invasive bacterial populations.

    PubMed

    Shea, Patrick R; Beres, Stephen B; Flores, Anthony R; Ewbank, Amy L; Gonzalez-Lugo, Javier H; Martagon-Rosado, Alexandro J; Martinez-Gutierrez, Juan C; Rehman, Hina A; Serrano-Gonzalez, Monica; Fittipaldi, Nahuel; Ayers, Stephen D; Webb, Paul; Willey, Barbara M; Low, Donald E; Musser, James M

    2011-03-22

    Many pathogens colonize different anatomical sites, but the selective pressures contributing to survival in the diverse niches are poorly understood. Group A Streptococcus (GAS) is a human-adapted bacterium that causes a range of infections. Much effort has been expended to dissect the molecular basis of invasive (sterile-site) infections, but little is known about the genomes of strains causing pharyngitis (streptococcal "sore throat"). Additionally, there is essentially nothing known about the genetic relationships between populations of invasive and pharyngitis strains. In particular, it is unclear if invasive strains represent a distinct genetic subpopulation of strains that cause pharyngitis. We compared the genomes of 86 serotype M3 GAS pharyngitis strains with those of 215 invasive M3 strains from the same geographical location. The pharyngitis and invasive groups were highly related to each other and had virtually identical phylogenetic structures, indicating they belong to the same genetic pool. Despite the overall high degree of genetic similarity, we discovered that strains from different host environments (i.e., throat, normally sterile sites) have distinct patterns of diversifying selection at the nucleotide level. In particular, the pattern of polymorphisms in the hyaluronic acid capsule synthesis operon was especially different between the two strain populations. This finding was mirrored by data obtained from full-genome analysis of strains sequentially cultured from nonhuman primates. Our results answer the long-standing question of the genetic relationship between GAS pharyngitis and invasive strains. The data provide previously undescribed information about the evolutionary history of pathogenic microbes that cause disease in different anatomical sites.

  9. Genome-Scale Identification of Legionella pneumophila Effectors Using a Machine Learning Approach

    PubMed Central

    Burstein, David; Zusman, Tal; Degtyar, Elena; Viner, Ram; Segal, Gil; Pupko, Tal

    2009-01-01

    A large number of highly pathogenic bacteria utilize secretion systems to translocate effector proteins into host cells. Using these effectors, the bacteria subvert host cell processes during infection. Legionella pneumophila translocates effectors via the Icm/Dot type-IV secretion system and to date, approximately 100 effectors have been identified by various experimental and computational techniques. Effector identification is a critical first step towards the understanding of the pathogenesis system in L. pneumophila as well as in other bacterial pathogens. Here, we formulate the task of effector identification as a classification problem: each L. pneumophila open reading frame (ORF) was classified as either effector or not. We computationally defined a set of features that best distinguish effectors from non-effectors. These features cover a wide range of characteristics including taxonomical dispersion, regulatory data, genomic organization, similarity to eukaryotic proteomes and more. Machine learning algorithms utilizing these features were then applied to classify all the ORFs within the L. pneumophila genome. Using this approach we were able to predict and experimentally validate 40 new effectors, reaching a success rate of above 90%. Increasing the number of validated effectors to around 140, we were able to gain novel insights into their characteristics. Effectors were found to have low G+C content, supporting the hypothesis that a large number of effectors originate via horizontal gene transfer, probably from their protozoan host. In addition, effectors were found to cluster in specific genomic regions. Finally, we were able to provide a novel description of the C-terminal translocation signal required for effector translocation by the Icm/Dot secretion system. To conclude, we have discovered 40 novel L. pneumophila effectors, predicted over a hundred additional highly probable effectors, and shown the applicability of machine learning algorithms for

  10. Comparative genome-scale metabolic modeling of actinomycetes: the topology of essential core metabolism.

    PubMed

    Alam, Mohammad Tauqeer; Medema, Marnix H; Takano, Eriko; Breitling, Rainer

    2011-07-21

    Actinomycetes are highly important bacteria. On one hand, some of them cause severe human and plant diseases, on the other hand, many species are known for their ability to produce antibiotics. Here we report the results of a comparative analysis of genome-scale metabolic models of 37 species of actinomycetes. Based on in silico knockouts we generated topological and genomic maps for each organism. Combining the collection of genome-wide models, we constructed a global enzyme association network to identify both a conserved "core network" and an "essential core network" of the entire group. As has been reported for low-degree metabolites in several organisms, low-degree enzymes (in linear pathways) turn out to be generally more essential than high-degree enzymes (in metabolic hubs).

  11. Organelle genome complexity scales positively with organism size in volvocine green algae.

    PubMed

    Smith, David Roy; Hamaji, Takashi; Olson, Bradley J S C; Durand, Pierre M; Ferris, Patrick; Michod, Richard E; Featherston, Jonathan; Nozaki, Hisayoshi; Keeling, Patrick J

    2013-04-01

    It has been argued that for certain lineages, noncoding DNA expansion is a consequence of the increased random genetic drift associated with long-term escalations in organism size. But a lack of data has prevented the investigation of this hypothesis in most plastid-bearing protists. Here, using newly sequenced mitochondrial and plastid genomes, we explore the relationship between organelle DNA noncoding content and organism size within volvocine green algae. By looking at unicellular, colonial, and differentiated multicellular algae, we show that organelle DNA complexity scales positively with species size and cell number across the volvocine lineage. Moreover, silent-site genetic diversity data suggest that the volvocine species with the largest cell numbers and most bloated organelle genomes have the smallest effective population sizes. Together, these findings support the view that nonadaptive processes, like random genetic drift, promote the expansion of noncoding regions in organelle genomes.

  12. T346Hunter: a novel web-based tool for the prediction of type III, type IV and type VI secretion systems in bacterial genomes.

    PubMed

    Martínez-García, Pedro Manuel; Ramos, Cayo; Rodríguez-Palenzuela, Pablo

    2015-01-01

    T346Hunter (Type Three, Four and Six secretion system Hunter) is a web-based tool for the identification and localisation of type III, type IV and type VI secretion systems (T3SS, T4SS and T6SS, respectively) clusters in bacterial genomes. Non-flagellar T3SS (NF-T3SS) and T6SS are complex molecular machines that deliver effector proteins from bacterial cells into the environment or into other eukaryotic or prokaryotic cells, with significant implications for pathogenesis of the strains encoding them. Meanwhile, T4SS is a more functionally diverse system, which is involved in not only effector translocation but also conjugation and DNA uptake/release. Development of control strategies against bacterial-mediated diseases requires genomic identification of the virulence arsenal of pathogenic bacteria, with T3SS, T4SS and T6SS being major determinants in this regard. Therefore, computational methods for systematic identification of these specialised machines are of particular interest. With the aim of facilitating this task, T346Hunter provides a user-friendly web-based tool for the prediction of T3SS, T4SS and T6SS clusters in newly sequenced bacterial genomes. After inspection of the available scientific literature, we constructed a database of hidden Markov model (HMM) protein profiles and sequences representing the various components of T3SS, T4SS and T6SS. T346Hunter performs searches of such a database against user-supplied bacterial sequences and localises enriched regions in any of these three types of secretion systems. Moreover, through the T346Hunter server, users can visualise the predicted clusters obtained for approximately 1700 bacterial chromosomes and plasmids. T346Hunter offers great help to researchers in advancing their understanding of the biological mechanisms in which these sophisticated molecular machines are involved. T346Hunter is freely available at http://bacterial-virulence-factors.cbgp.upm.es/T346Hunter.

  13. iAK692: A genome-scale metabolic model of Spirulina platensis C1

    PubMed Central

    2012-01-01

    Background Spirulina (Arthrospira) platensis is a well-known filamentous cyanobacterium used in the production of many industrial products, including high value compounds, healthy food supplements, animal feeds, pharmaceuticals and cosmetics, for example. It has been increasingly studied around the world for scientific purposes, especially for its genome, biology, physiology, and also for the analysis of its small-scale metabolic network. However, the overall description of the metabolic and biotechnological capabilities of S. platensis requires the development of a whole cellular metabolism model. Recently, the S. platensis C1 (Arthrospira sp. PCC9438) genome sequence has become available, allowing systems-level studies of this commercial cyanobacterium. Results In this work, we present the genome-scale metabolic network analysis of S. platensis C1, iAK692, its topological properties, and its metabolic capabilities and functions. The network was reconstructed from the S. platensis C1 annotated genomic sequence using Pathway Tools software to generate a preliminary network. Then, manual curation was performed based on a collective knowledge base and a combination of genomic, biochemical, and physiological information. The genome-scale metabolic model consists of 692 genes, 837 metabolites, and 875 reactions. We validated iAK692 by conducting fermentation experiments and simulating the model under autotrophic, heterotrophic, and mixotrophic growth conditions using COBRA toolbox. The model predictions under these growth conditions were consistent with the experimental results. The iAK692 model was further used to predict the unique active reactions and essential genes for each growth condition. Additionally, the metabolic states of iAK692 during autotrophic and mixotrophic growths were described by phenotypic phase plane (PhPP) analysis. Conclusions This study proposes the first genome-scale model of S. platensis C1, iAK692, which is a predictive metabolic platform

  14. Large-Scale Mutagenesis of the Yeast Genome Using a Tn7-Derived Multipurpose Transposon

    PubMed Central

    Kumar, Anuj; Seringhaus, Michael; Biery, Matthew C.; Sarnovsky, Robert J.; Umansky, Lara; Piccirillo, Stacy; Heidtman, Matthew; Cheung, Kei-Hoi; Dobry, Craig J.; Gerstein, Mark B.; Craig, Nancy L.; Snyder, Michael

    2004-01-01

    We present here an unbiased and extremely versatile insertional library of yeast genomic DNA generated by in vitro mutagenesis with a multipurpose element derived from the bacterial transposon Tn7. This mini-Tn7 element has been engineered such that a single insertion can be used to generate a lacZ fusion, gene disruption, and epitope-tagged gene product. Using this transposon, we generated a plasmid-based library of ∼300,000 mutant alleles; by high-throughput screening in yeast, we identified and sequenced 9032 insertions affecting 2613 genes (45% of the genome). From analysis of 7176 insertions, we found little bias in Tn7 target-site selection in vitro. In contrast, we also sequenced 10,174 Tn3 insertions and found a markedly stronger preference for an AT-rich 5-base pair target sequence. We further screened 1327 insertion alleles in yeast for hypersensitivity to the chemotherapeutic cisplatin. Fifty-one genes were identified, including four functionally uncharacterized genes and 25 genes involved in DNA repair, replication, transcription, and chromatin structure. In total, the collection reported here constitutes the largest plasmid-based set of sequenced yeast mutant alleles to date and, as such, should be singularly useful for gene and genome-wide functional analysis. PMID:15466296

  15. Large-scale mutagenesis of the yeast genome using a Tn7-derived multipurpose transposon.

    PubMed

    Kumar, Anuj; Seringhaus, Michael; Biery, Matthew C; Sarnovsky, Robert J; Umansky, Lara; Piccirillo, Stacy; Heidtman, Matthew; Cheung, Kei-Hoi; Dobry, Craig J; Gerstein, Mark B; Craig, Nancy L; Snyder, Michael

    2004-10-01

    We present here an unbiased and extremely versatile insertional library of yeast genomic DNA generated by in vitro mutagenesis with a multipurpose element derived from the bacterial transposon Tn7. This mini-Tn7 element has been engineered such that a single insertion can be used to generate a lacZ fusion, gene disruption, and epitope-tagged gene product. Using this transposon, we generated a plasmid-based library of approximately 300,000 mutant alleles; by high-throughput screening in yeast, we identified and sequenced 9032 insertions affecting 2613 genes (45% of the genome). From analysis of 7176 insertions, we found little bias in Tn7 target-site selection in vitro. In contrast, we also sequenced 10,174 Tn3 insertions and found a markedly stronger preference for an AT-rich 5-base pair target sequence. We further screened 1327 insertion alleles in yeast for hypersensitivity to the chemotherapeutic cisplatin. Fifty-one genes were identified, including four functionally uncharacterized genes and 25 genes involved in DNA repair, replication, transcription, and chromatin structure. In total, the collection reported here constitutes the largest plasmid-based set of sequenced yeast mutant alleles to date and, as such, should be singularly useful for gene and genome-wide functional analysis.

  16. Informed Consent in Genome-Scale Research: What Do Prospective Participants Think?

    PubMed Central

    Trinidad, Susan Brown; Fullerton, Stephanie M.; Bares, Julie M.; Jarvik, Gail P.; Larson, Eric B.; Burke, Wylie

    2012-01-01

    Background To promote effective genome-scale research, genomic and clinical data for large population samples must be collected, stored, and shared. Methods We conducted focus groups with 45 members of a Seattle-based integrated healthcare delivery system to learn about their views and expectations for informed consent in genome-scale studies. Results Participants viewed information about study purpose, aims, and how and by whom study data could be used to be at least as important as information about risks and possible harms. They generally supported a tiered consent approach for specific issues, including research purpose, data sharing, and access to individual research results. Participants expressed a continuum of opinions with respect to the acceptability of broad consent, ranging from completely acceptable to completely unacceptable. Older participants were more likely to view the consent process in relational – rather than contractual – terms, compared with younger participants. The majority of participants endorsed seeking study subjects’ permission regarding material changes in study purpose and data sharing. Conclusions Although this study sample was limited in terms of racial and socioeconomic diversity, our results suggest a strong positive interest in genomic research on the part of at least some prospective participants and indicate a need for increased public engagement, as well as strategies for ongoing communication with study participants. PMID:23493836

  17. Investigating host-pathogen behavior and their interaction using genome-scale metabolic network models.

    PubMed

    Sadhukhan, Priyanka P; Raghunathan, Anu

    2014-01-01

    Genome Scale Metabolic Modeling methods represent one way to compute whole cell function starting from the genome sequence of an organism and contribute towards understanding and predicting the genotype-phenotype relationship. About 80 models spanning all the kingdoms of life from archaea to eukaryotes have been built till date and used to interrogate cell phenotype under varying conditions. These models have been used to not only understand the flux distribution in evolutionary conserved pathways like glycolysis and the Krebs cycle but also in applications ranging from value added product formation in Escherichia coli to predicting inborn errors of Homo sapiens metabolism. This chapter describes a protocol that delineates the process of genome scale metabolic modeling for analysing host-pathogen behavior and interaction using flux balance analysis (FBA). The steps discussed in the process include (1) reconstruction of a metabolic network from the genome sequence, (2) its representation in a precise mathematical framework, (3) its translation to a model, and (4) the analysis using linear algebra and optimization. The methods for biological interpretations of computed cell phenotypes in the context of individual host and pathogen models and their integration are also discussed.

  18. Genome-Wide Motif Statistics are Shaped by DNA Binding Proteins over Evolutionary Time Scales

    NASA Astrophysics Data System (ADS)

    Qian, Long; Kussell, Edo

    2016-10-01

    The composition of a genome with respect to all possible short DNA motifs impacts the ability of DNA binding proteins to locate and bind their target sites. Since nonfunctional DNA binding can be detrimental to cellular functions and ultimately to organismal fitness, organisms could benefit from reducing the number of nonfunctional DNA binding sites genome wide. Using in vitro measurements of binding affinities for a large collection of DNA binding proteins, in multiple species, we detect a significant global avoidance of weak binding sites in genomes. We demonstrate that the underlying evolutionary process leaves a distinct genomic hallmark in that similar words have correlated frequencies, a signal that we detect in all species across domains of life. We consider the possibility that natural selection against weak binding sites contributes to this process, and using an evolutionary model we show that the strength of selection needed to maintain global word compositions is on the order of point mutation rates. Likewise, we show that evolutionary mechanisms based on interference of protein-DNA binding with replication and mutational repair processes could yield similar results and operate with similar rates. On the basis of these modeling and bioinformatic results, we conclude that genome-wide word compositions have been molded by DNA binding proteins acting through tiny evolutionary steps over time scales spanning millions of generations.

  19. Modeling of scale-dependent bacterial growth by chemical kinetics approach.

    PubMed

    Martínez, Haydee; Sánchez, Joaquín; Cruz, José-Manuel; Ayala, Guadalupe; Rivera, Marco; Buhse, Thomas

    2014-01-01

    We applied the so-called chemical kinetics approach to complex bacterial growth patterns that were dependent on the liquid-surface-area-to-volume ratio (SA/V) of the bacterial cultures. The kinetic modeling was based on current experimental knowledge in terms of autocatalytic bacterial growth, its inhibition by the metabolite CO2, and the relief of inhibition through the physical escape of the inhibitor. The model quantitatively reproduces kinetic data of SA/V-dependent bacterial growth and can discriminate between differences in the growth dynamics of enteropathogenic E. coli, E. coli JM83, and Salmonella typhimurium on one hand and Vibrio cholerae on the other hand. Furthermore, the data fitting procedures allowed predictions about the velocities of the involved key processes and the potential behavior in an open-flow bacterial chemostat, revealing an oscillatory approach to the stationary states.

  20. Data-driven hypothesis weighting increases detection power in genome-scale multiple testing.

    PubMed

    Ignatiadis, Nikolaos; Klaus, Bernd; Zaugg, Judith B; Huber, Wolfgang

    2016-07-01

    Hypothesis weighting improves the power of large-scale multiple testing. We describe independent hypothesis weighting (IHW), a method that assigns weights using covariates independent of the P-values under the null hypothesis but informative of each test's power or prior probability of the null hypothesis (http://www.bioconductor.org/packages/IHW). IHW increases power while controlling the false discovery rate and is a practical approach to discovering associations in genomics, high-throughput biology and other large data sets.

  1. Quantitative methods for genome-scale analysis of in situ hybridization and correlation with microarray data

    PubMed Central

    Lee, Chang-Kyu; Sunkin, Susan M; Kuan, Chihchau; Thompson, Carol L; Pathak, Sayan; Ng, Lydia; Lau, Chris; Fischer, Shanna; Mortrud, Marty; Slaughterbeck, Cliff; Jones, Allan; Lein, Ed; Hawrylycz, Michael

    2008-01-01

    With the emergence of genome-wide colorimetric in situ hybridization (ISH) data sets such as the Allen Brain Atlas, it is important to understand the relationship between this gene expression modality and those derived from more quantitative based technologies. This study introduces a novel method for standardized relative quantification of colorimetric ISH signal that enables a large-scale cross-platform expression level comparison of ISH with two publicly available microarray brain data sources. PMID:18234097

  2. Genome-scale dynamic modeling of the competition between Rhodoferax and Geobacter in anoxic subsurface environments.

    PubMed

    Zhuang, Kai; Izallalen, Mounir; Mouser, Paula; Richter, Hanno; Risso, Carla; Mahadevan, Radhakrishnan; Lovley, Derek R

    2011-02-01

    The advent of rapid complete genome sequencing, and the potential to capture this information in genome-scale metabolic models, provide the possibility of comprehensively modeling microbial community interactions. For example, Rhodoferax and Geobacter species are acetate-oxidizing Fe(III)-reducers that compete in anoxic subsurface environments and this competition may have an influence on the in situ bioremediation of uranium-contaminated groundwater. Therefore, genome-scale models of Geobacter sulfurreducens and Rhodoferax ferrireducens were used to evaluate how Geobacter and Rhodoferax species might compete under diverse conditions found in a uranium-contaminated aquifer in Rifle, CO. The model predicted that at the low rates of acetate flux expected under natural conditions at the site, Rhodoferax will outcompete Geobacter as long as sufficient ammonium is available. The model also predicted that when high concentrations of acetate are added during in situ bioremediation, Geobacter species would predominate, consistent with field-scale observations. This can be attributed to the higher expected growth yields of Rhodoferax and the ability of Geobacter to fix nitrogen. The modeling predicted relative proportions of Geobacter and Rhodoferax in geochemically distinct zones of the Rifle site that were comparable to those that were previously documented with molecular techniques. The model also predicted that under nitrogen fixation, higher carbon and electron fluxes would be diverted toward respiration rather than biomass formation in Geobacter, providing a potential explanation for enhanced in situ U(VI) reduction in low-ammonium zones. These results show that genome-scale modeling can be a useful tool for predicting microbial interactions in subsurface environments and shows promise for designing bioremediation strategies.

  3. Enumeration of smallest intervention strategies in genome-scale metabolic networks.

    PubMed

    von Kamp, Axel; Klamt, Steffen

    2014-01-01

    One ultimate goal of metabolic network modeling is the rational redesign of biochemical networks to optimize the production of certain compounds by cellular systems. Although several constraint-based optimization techniques have been developed for this purpose, methods for systematic enumeration of intervention strategies in genome-scale metabolic networks are still lacking. In principle, Minimal Cut Sets (MCSs; inclusion-minimal combinations of reaction or gene deletions that lead to the fulfilment of a given intervention goal) provide an exhaustive enumeration approach. However, their disadvantage is the combinatorial explosion in larger networks and the requirement to compute first the elementary modes (EMs) which itself is impractical in genome-scale networks. We present MCSEnumerator, a new method for effective enumeration of the smallest MCSs (with fewest interventions) in genome-scale metabolic network models. For this we combine two approaches, namely (i) the mapping of MCSs to EMs in a dual network, and (ii) a modified algorithm by which shortest EMs can be effectively determined in large networks. In this way, we can identify the smallest MCSs by calculating the shortest EMs in the dual network. Realistic application examples demonstrate that our algorithm is able to list thousands of the most efficient intervention strategies in genome-scale networks for various intervention problems. For instance, for the first time we could enumerate all synthetic lethals in E.coli with combinations of up to 5 rea