Science.gov

Sample records for co-evolving genomic groups

  1. The spread of opinion on co-evolving networks

    NASA Astrophysics Data System (ADS)

    Singh, Pramesh; Sreenivasan, S.; Korniss, G.; Szymanski, B. K.

    2011-03-01

    We discuss a model of opinion formation in co-evolving networks. In realistic scenarios, the network constantly changes structure favoring connections between similar individuals (homophily). Here we allow the opinions to co-evolve with the reorganization of links in the network. This dynamical nature of the network impedes the spreading of opinions. We study how this resistance to the spread can be overcome and consensus can be achieved by randomly distributing a few committed agents (-nodes that are not influenceable in their opinions). In this model adjacent nodes influence each other if they are similar on at least Q attributes, where Q is the influence threshold. Nodes will rewire their existing links if they are not similar enough. We demonstrate through simulations that in the absence of committed agents, time to reach consensus in opinions diverges exponentially with system size N. However, as committed agents are added, beyond a small value of committed fraction, the consensus time becomes a slowly varying function of N. (Ref- F. Vazquez et al. - Phys. Rev. E76, 046120 -2007) Supported in part by ARL and ONR.

  2. Genomic Adaptation of the Lactobacillus casei Group

    PubMed Central

    Nakano, Akiyo; Takahata, Muneaki; Murakami, Masaru; Takaki, Takashi; Nishiyama, Hidetoshi; Igimi, Shizunobu; Hattori, Masahira; Morita, Hidetoshi

    2013-01-01

    Lactobacillus casei, L. paracasei, and L. rhamnosus form a closely related taxonomic group (Lactobacillus casei group) within the facultatively heterofermentative lactobacilli. Here, we report the complete genome sequences of L. paracasei JCM 8130 and L. casei ATCC 393, and the draft genome sequence of L. paracasei COM0101, all of which were isolated from daily products. Furthermore, we re-annotated the genome of L. rhamnosus ATCC 53103 (also known as L. rhamnosus GG), which we have previously reported. We confirmed that ATCC 393 is distinct from other strains previously described as L. paracasei. The core genome of 10 completely sequenced strains of the L. casei group comprised 1,682 protein-coding genes. Although extensive genome-wide synteny was found among the L. casei group, the genomes of ATCC 53103, JCM 8130, and ATCC 393 contained genomic islands compared with L. paracasei ATCC 334. Several genomic islands, including carbohydrate utilization gene clusters, were found at the same loci in the chromosomes of the L. casei group. The spaCBA pilus gene cluster, which was first identified in GG, was also found in other strains of the L. casei group, but several L. paracasei strains including COM0101 contained truncated spaC gene. ATCC 53103 encoded a higher number of proteins involved in carbohydrate utilization compared with intestinal lactobacilli, and extracellular adhesion proteins, several of which are absent in other strains of the L. casei group. In addition to previously fully sequenced L. rhamnosus and L. paracasei strains, the complete genome sequences of L. casei will provide valuable insights into the evolution of the L. casei group. PMID:24116025

  3. Comparative Genomics of the Campylobacter lari Group

    PubMed Central

    Miller, William G.; Yee, Emma; Chapman, Mary H.; Smith, Timothy P.L.; Bono, James L.; Huynh, Steven; Parker, Craig T.; Vandamme, Peter; Luong, Khai; Korlach, Jonas

    2014-01-01

    The Campylobacter lari group is a phylogenetic clade within the epsilon subdivision of the Proteobacteria and is part of the thermotolerant Campylobacter spp., a division within the genus that includes the human pathogen Campylobacter jejuni. The C. lari group is currently composed of five species (C. lari, Campylobacter insulaenigrae, Campylobacter volucris, Campylobacter subantarcticus, and Campylobacter peloridis), as well as a group of strains termed the urease-positive thermophilic Campylobacter (UPTC) and other C. lari-like strains. Here we present the complete genome sequences of 11 C. lari group strains, including the five C. lari group species, four UPTC strains, and a lari-like strain isolated in this study. The genome of C. lari subsp. lari strain RM2100 was described previously. Analysis of the C. lari group genomes indicates that this group is highly related at the genome level. Furthermore, these genomes are strongly syntenic with minor rearrangements occurring only in 4 of the 12 genomes studied. The C. lari group can be bifurcated, based on the flagella and flagellar modification genes. Genomic analysis of the UPTC strains indicated that these organisms are variable but highly similar, closely related to but distinct from C. lari. Additionally, the C. lari group contains multiple genes encoding hemagglutination domain proteins, which are either contingency genes or linked to conserved contingency genes. Many of the features identified in strain RM2100, such as major deficiencies in amino acid biosynthesis and energy metabolism, are conserved across all 12 genomes, suggesting that these common features may play a role in the association of the C. lari group with coastal environments and watersheds. PMID:25381664

  4. Assembler: Efficient Discovery of Spatial Co-evolving Patterns in Massive Geo-sensory Data

    PubMed Central

    Zhang, Chao; Zheng, Yu; Ma, Xiuli; Han, Jiawei

    2015-01-01

    Recent years have witnessed the wide proliferation of geo-sensory applications wherein a bundle of sensors are deployed at different locations to cooperatively monitor the target condition. Given massive geo-sensory data, we study the problem of mining spatial co-evolving patterns (SCPs), i.e., groups of sensors that are spatially correlated and co-evolve frequently in their readings. SCP mining is of great importance to various real-world applications, yet it is challenging because (1) the truly interesting evolutions are often flooded by numerous trivial fluctuations in the geo-sensory time series; and (2) the pattern search space is extremely large due to the spatiotemporal combinatorial nature of SCP. In this paper, we propose a two-stage method called Assembler. In the first stage, Assembler filters trivial fluctuations using wavelet transform and detects frequent evolutions for individual sensors via a segment-and-group approach. In the second stage, Assembler generates SCPs by assembling the frequent evolutions of individual sensors. Leveraging the spatial constraint, it conceptually organizes all the SCPs into a novel structure called the SCP search tree, which facilitates the effective pruning of the search space to generate SCPs efficiently. Our experiments on both real and synthetic data sets show that Assembler is effective, efficient, and scalable. PMID:26705506

  5. Genomes, neurotoxins and biology of Clostridium botulinum Group I and Group II

    PubMed Central

    Carter, Andrew T.; Peck, Michael W.

    2015-01-01

    Recent developments in whole genome sequencing have made a substantial contribution to understanding the genomes, neurotoxins and biology of Clostridium botulinum Group I (proteolytic C. botulinum) and C. botulinum Group II (non-proteolytic C. botulinum). Two different approaches are used to study genomics in these bacteria; comparative whole genome microarrays and direct comparison of complete genome DNA sequences. The properties of the different types of neurotoxin formed, and different neurotoxin gene clusters found in C. botulinum Groups I and II are explored. Specific examples of botulinum neurotoxin genes are chosen for an in-depth discussion of neurotoxin gene evolution. The most recent cases of foodborne botulism are summarised. PMID:25445012

  6. OrthoMCL: identification of ortholog groups for eukaryotic genomes.

    PubMed

    Li, Li; Stoeckert, Christian J; Roos, David S

    2003-09-01

    The identification of orthologous groups is useful for genome annotation, studies on gene/protein evolution, comparative genomics, and the identification of taxonomically restricted sequences. Methods successfully exploited for prokaryotic genome analysis have proved difficult to apply to eukaryotes, however, as larger genomes may contain multiple paralogous genes, and sequence information is often incomplete. OrthoMCL provides a scalable method for constructing orthologous groups across multiple eukaryotic taxa, using a Markov Cluster algorithm to group (putative) orthologs and paralogs. This method performs similarly to the INPARANOID algorithm when applied to two genomes, but can be extended to cluster orthologs from multiple species. OrthoMCL clusters are coherent with groups identified by EGO, but improved recognition of "recent" paralogs permits overlapping EGO groups representing the same gene to be merged. Comparison with previously assigned EC annotations suggests a high degree of reliability, implying utility for automated eukaryotic genome annotation. OrthoMCL has been applied to the proteome data set from seven publicly available genomes (human, fly, worm, yeast, Arabidopsis, the malaria parasite Plasmodium falciparum, and Escherichia coli). A Web interface allows queries based on individual genes or user-defined phylogenetic patterns (http://www.cbil.upenn.edu/gene-family). Analysis of clusters incorporating P. falciparum genes identifies numerous enzymes that were incompletely annotated in first-pass annotation of the parasite genome.

  7. Genomic characterization of Italian Clostridium botulinum group I strains.

    PubMed

    Giordani, Francesco; Fillo, Silvia; Anselmo, Anna; Palozzi, Anna Maria; Fortunato, Antonella; Gentile, Bernardina; Azarnia Tehran, Domenico; Ciammaruconi, Andrea; Spagnolo, Ferdinando; Pittiglio, Valentina; Anniballi, Fabrizio; Auricchio, Bruna; De Medici, Dario; Lista, Florigio

    2015-12-01

    Clostridium botulinum is a gram-positive bacterium capable of producing the botulinum neurotoxin, a powerful poison that causes botulism, a severe neuroparalytic disease. Its genome has been sequenced entirely and its gene content has been analyzed. To date, 19 full genomes and 64 draft genomes are available. The geographical origin of these genomes is predominantly from the US. In the present study, 10 Italian genomes of C. botulinum group I were analyzed and compared with previously sequenced group I genomes, in order to genetically characterize the Italian population of C. botulinum group I and to investigate the phylogenetic relationships among different lineages. Using the suites of software ClonalFrame and ClonalOrigin to perform genomic analysis, we demonstrated that Italian C. botulinum group I population is phylogenetically heterogeneous encompassing different and distant lineages including overseas strains, too. Moreover, a high recombination rate was demonstrated in the evolution of C. botulinum group I species. Finally, genome sequencing of the strain 357 led us to identify a novel botulinum neurotoxin subtype, F8. PMID:26341861

  8. Comparative genomics of the Campylobacter lari group

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The Campylobacter lari group is a phylogenetic clade within the epsilon subdivision of the Proteobacteria and is part of the thermotolerant campylobacters, a division within the genus that includes the human pathogen Campylobacter jejuni. The lari group is currently composed of five validly-named sp...

  9. Cooperative behavior and phase transitions in co-evolving stag hunt game

    NASA Astrophysics Data System (ADS)

    Zhang, W.; Li, Y. S.; Xu, C.; Hui, P. M.

    2016-02-01

    Cooperative behavior and different phases in a co-evolving network dynamics based on the stag hunt game is studied. The dynamical processes are parameterized by a payoff r that tends to promote non-cooperative behavior and a probability q for a rewiring attempt that could isolate the non-cooperators. The interplay between the parameters leads to different phases. Detailed simulations and a mean field theory are employed to reveal the properties of different phases. For small r, the cooperators are the majority and form a connected cluster while the non-cooperators increase with q but remain isolated over the whole range of q, and it is a static phase. For sufficiently large r, cooperators disappear in an intermediate range qL ≤ q ≤qU and a dynamical all-non-cooperators phase results. For q >qU, a static phase results again. A mean field theory based on how the link densities change in time by the co-evolving dynamics is constructed. The theory gives a phase diagram in the q- r parameter space that is qualitatively in agreement with simulation results. The sources of discrepancies between theory and simulations are discussed.

  10. Indel Group in Genomes (IGG) Molecular Genetic Markers1[OPEN

    PubMed Central

    Burkart-Waco, Diana; Kuppu, Sundaram; Britt, Anne; Chetelat, Roger

    2016-01-01

    Genetic markers are essential when developing or working with genetically variable populations. Indel Group in Genomes (IGG) markers are primer pairs that amplify single-locus sequences that differ in size for two or more alleles. They are attractive for their ease of use for rapid genotyping and their codominant nature. Here, we describe a heuristic algorithm that uses a k-mer-based approach to search two or more genome sequences to locate polymorphic regions suitable for designing candidate IGG marker primers. As input to the IGG pipeline software, the user provides genome sequences and the desired amplicon sizes and size differences. Primer sequences flanking polymorphic insertions/deletions are produced as output. IGG marker files for three sets of genomes, Solanum lycopersicum/Solanum pennellii, Arabidopsis (Arabidopsis thaliana) Columbia-0/Landsberg erecta-0 accessions, and S. lycopersicum/S. pennellii/Solanum tuberosum (three-way polymorphic) are included. PMID:27436831

  11. Indel Group in Genomes (IGG) Molecular Genetic Markers.

    PubMed

    Toal, Ted W; Burkart-Waco, Diana; Howell, Tyson; Ron, Mily; Kuppu, Sundaram; Britt, Anne; Chetelat, Roger; Brady, Siobhan M

    2016-09-01

    Genetic markers are essential when developing or working with genetically variable populations. Indel Group in Genomes (IGG) markers are primer pairs that amplify single-locus sequences that differ in size for two or more alleles. They are attractive for their ease of use for rapid genotyping and their codominant nature. Here, we describe a heuristic algorithm that uses a k-mer-based approach to search two or more genome sequences to locate polymorphic regions suitable for designing candidate IGG marker primers. As input to the IGG pipeline software, the user provides genome sequences and the desired amplicon sizes and size differences. Primer sequences flanking polymorphic insertions/deletions are produced as output. IGG marker files for three sets of genomes, Solanum lycopersicum/Solanum pennellii, Arabidopsis (Arabidopsis thaliana) Columbia-0/Landsberg erecta-0 accessions, and S. lycopersicum/S. pennellii/Solanum tuberosum (three-way polymorphic) are included. PMID:27436831

  12. 77 FR 75425 - Interagency Working Group on Plant Genomics (IWGPG): The National Plant Genome Initiative-What's...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-12-20

    ... Interagency Working Group on Plant Genomics (IWGPG): The National Plant Genome Initiative--What's Next? AGENCY... standardization. Summary: The National Plant Genome Initiative (NPGI) was established in 1998 as a coordinated national research program by the Interagency Working Group on Plant Genomes (IWGPG) under the Committee...

  13. A SEARCH FOR CO-EVOLVING ION AND NEUTRAL GAS SPECIES IN PRESTELLAR MOLECULAR CLOUD CORES

    SciTech Connect

    Tassis, Konstantinos; Hezareh, Talayeh; Willacy, Karen

    2012-11-20

    A comparison between the widths of ion and neutral molecule spectral lines has been recently used to estimate the strength of the magnetic field in turbulent star-forming regions. However, the ion (HCO{sup +}) and neutral (HCN) species used in such studies may not be necessarily co-evolving at every scale and density, and thus, may not trace the same regions. Here, we use coupled chemical/dynamical models of evolving prestellar molecular cloud cores including non-equilibrium chemistry, with and without magnetic fields, to study the spatial distribution of HCO{sup +} and HCN, which have been used in observations of spectral line width differences to date. In addition, we seek new ion-neutral pairs that are good candidates for such observations, because they have similar evolution and are approximately co-spatial in our models. We identify three such good candidate pairs: HCO{sup +}/NO, HCO{sup +}/CO, and NO{sup +}/NO.

  14. Molecular Epidemiology and Genomics of Group A Streptococcus

    PubMed Central

    Bessen, Debra E.; McShan, W. Michael; Nguyen, Scott V.; Shetty, Amol; Agrawal, Sonia; Tettelin, Hervé

    2014-01-01

    Streptococcus pyogenes (group A streptococcus; GAS) is a strict human pathogen with a very high prevalence worldwide. This review highlights the genetic organization of the species and the important ecological considerations that impact its evolution. Recent advances are presented on the topics of molecular epidemiology, population biology, molecular basis for genetic change, genome structure and genetic flux, phylogenomics and closely related streptococcal species, and the long- and short-term evolution of GAS. The application of whole genome sequence data to addressing key biological questions is discussed. PMID:25460818

  15. Comparative genome analysis of Bacillus cereus group genomes withBacillus subtilis

    SciTech Connect

    Anderson, Iain; Sorokin, Alexei; Kapatral, Vinayak; Reznik, Gary; Bhattacharya, Anamitra; Mikhailova, Natalia; Burd, Henry; Joukov, Victor; Kaznadzey, Denis; Walunas, Theresa; D'Souza, Mark; Larsen, Niels; Pusch,Gordon; Liolios, Konstantinos; Grechkin, Yuri; Lapidus, Alla; Goltsman,Eugene; Chu, Lien; Fonstein, Michael; Ehrlich, S. Dusko; Overbeek, Ross; Kyrpides, Nikos; Ivanova, Natalia

    2005-09-14

    Genome features of the Bacillus cereus group genomes (representative strains of Bacillus cereus, Bacillus anthracis and Bacillus thuringiensis sub spp israelensis) were analyzed and compared with the Bacillus subtilis genome. A core set of 1,381 protein families among the four Bacillus genomes, with an additional set of 933 families common to the B. cereus group, was identified. Differences in signal transduction pathways, membrane transporters, cell surface structures, cell wall, and S-layer proteins suggesting differences in their phenotype were identified. The B. cereus group has signal transduction systems including a tyrosine kinase related to two-component system histidine kinases from B. subtilis. A model for regulation of the stress responsive sigma factor sigmaB in the B. cereus group different from the well studied regulation in B. subtilis has been proposed. Despite a high degree of chromosomal synteny among these genomes, significant differences in cell wall and spore coat proteins that contribute to the survival and adaptation in specific hosts has been identified.

  16. Analytic approach to co-evolving dynamics in complex networks: dissatisfied adaptive snowdrift game

    NASA Astrophysics Data System (ADS)

    Gräser, Oliver; Xu, Chen; Hui, P. M.

    2011-08-01

    We investigate the formulation of mean-field (MF) approaches for co-evolving dynamic model systems, focusing on the accuracy and validity of different schemes in closing MF equations. Within the context of a recently introduced co-evolutionary snowdrift game in which rational adaptive actions are driven by dissatisfaction in the payoff, we introduce a method to test the validity of closure schemes and analyse the shortcomings of previous schemes. A previous scheme suitable for adaptive epidemic models is shown to be invalid for the model studied here. A binomial-style closure scheme that significantly improves upon the previous schemes is introduced. Fixed-point analysis of the MF equations not only explains the numerical observed transition between a connected state with suppressed cooperation and a highly cooperative disconnected state, but also reveals a previously undetected connected state that exhibits the unusual behaviour of decreasing cooperation as the temptation for uncooperative action drops. We proposed a procedure for selecting proper initial conditions to realize the unusual state in numerical simulations. The effects of the mean number of connections that an agent carries are also studied.

  17. Comparative Genomics of the Staphylococcus intermedius Group of Animal Pathogens

    PubMed Central

    Ben Zakour, Nouri L.; Beatson, Scott A.; van den Broek, Adri H. M.; Thoday, Keith L.; Fitzgerald, J. Ross

    2012-01-01

    The Staphylococcus intermedius group consists of three closely related coagulase-positive bacterial species including S. intermedius, Staphylococcus pseudintermedius, and Staphylococcus delphini. S. pseudintermedius is a major skin pathogen of dogs, which occasionally causes severe zoonotic infections of humans. S. delphini has been isolated from an array of different animals including horses, mink, and pigeons, whereas S. intermedius has been isolated only from pigeons to date. Here we provide a detailed analysis of the S. pseudintermedius whole genome sequence in comparison to high quality draft S. intermedius and S. delphini genomes, and to other sequenced staphylococcal species. The core genome of the SIG was highly conserved with average nucleotide identity (ANI) between the three species of 93.61%, which is very close to the threshold of species delineation (95% ANI), highlighting the close-relatedness of the SIG species. However, considerable variation was identified in the content of mobile genetic elements, cell wall-associated proteins, and iron and sugar transporters, reflecting the distinct ecological niches inhabited. Of note, S. pseudintermedius ED99 contained a clustered regularly interspaced short palindromic repeat locus of the Nmeni subtype and S. intermedius contained both Nmeni and Mtube subtypes. In contrast to S. intermedius and S. delphini and most other staphylococci examined to date, S. pseudintermedius contained at least nine predicted reverse transcriptase Group II introns. Furthermore, S. pseudintermedius ED99 encoded several transposons which were largely responsible for its multi-resistant phenotype. Overall, the study highlights extensive differences in accessory genome content between closely related staphylococcal species inhabiting distinct host niches, providing new avenues for research into pathogenesis and bacterial host-adaptation. PMID:22919635

  18. Autonomous Agent-Based Systems and Their Applications in Fluid Dynamics, Particle Separation, and Co-evolving Networks

    NASA Astrophysics Data System (ADS)

    Graeser, Oliver

    This thesis comprises three parts, reporting research results in Fluid Dynamics (Part I), Particle Separation (Part II) and Co-evolving Networks (Part III). Part I deals with the simulation of fluid dynamics using the lattice-Boltzmann method. Microfluidic devices often feature two-dimensional, repetitive arrays. Flows through such devices are pressure-driven and confined by solid walls. We have defined new adaptive generalised periodic boundary conditions to represent the effects of outer solid walls, and are thus able to exploit the periodicity of the array by simulating the flow through one unit cell in lieu of the entire device. The so-calculated fully developed flow describes the flow through the entire array accurately, but with computational requirements that are reduced according to the dimensions of the array. Part II discusses the problem of separating macromolecules like proteins or DNA coils. The reliable separation of such molecules is a crucial task in molecular biology. The use of Brownian ratchets as mechanisms for the separation of such particles has been proposed and discussed during the last decade. Pressure-driven flows have so far been dismissed as possible driving forces for Brownian ratchets, as they do not generate ratchet asymmetry. We propose a microfluidic design that uses pressure-driven flows to create asymmetry and hence allows particle separation. The dependence of the asymmetry on various factors of the microfluidic geometry is discussed. We further exemplify the feasibility of our approach using Brownian dynamics simulations of particles of different sizes in such a device. The results show that ratchet-based particle separation using flows as the driving force is possible. Simulation results and ratchet theory predictions are in excellent agreement. Part III deals with the co-evolution of networks and dynamic models. A group of agents occupies the nodes of a network, which defines the relationship between these agents. The

  19. Analysis of co-evolving genes in campylobacter jejuni and C. coli

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: The population structure of Campylobacter has been frequently studied by MLST, for which fragments of housekeeping genes are compared. We wished to determine if the used MLST genes are representative of the complete genome. Methods: A set of 1029 core gene families (CGF) was identifie...

  20. Complete Genome Sequence of Rotavirus Group C Isolated in South Korea

    PubMed Central

    Kim, Dong-Uk; Park, Kwi-Sung; Kim, Jae-Kyung

    2015-01-01

    Rotavirus group C is the major etiological agent associated with acute gastroenteritis in all human age groups. Here, we report the complete genome sequence of human group C rotavirus (GpC-RV) isolated in South Korea. PMID:26404599

  1. Principles of Genome Evolution in the Drosophila melanogaster Species Group

    PubMed Central

    Ranz, José M; Maurin, Damien; Chan, Yuk S; von Grotthuss, Marcin; Hillier, LaDeana W; Roote, John; Ashburner, Michael; Bergman, Casey M

    2007-01-01

    That closely related species often differ by chromosomal inversions was discovered by Sturtevant and Plunkett in 1926. Our knowledge of how these inversions originate is still very limited, although a prevailing view is that they are facilitated by ectopic recombination events between inverted repetitive sequences. The availability of genome sequences of related species now allows us to study in detail the mechanisms that generate interspecific inversions. We have analyzed the breakpoint regions of the 29 inversions that differentiate the chromosomes of Drosophila melanogaster and two closely related species, D. simulans and D. yakuba, and reconstructed the molecular events that underlie their origin. Experimental and computational analysis revealed that the breakpoint regions of 59% of the inversions (17/29) are associated with inverted duplications of genes or other nonrepetitive sequences. In only two cases do we find evidence for inverted repetitive sequences in inversion breakpoints. We propose that the presence of inverted duplications associated with inversion breakpoint regions is the result of staggered breaks, either isochromatid or chromatid, and that this, rather than ectopic exchange between inverted repetitive sequences, is the prevalent mechanism for the generation of inversions in the melanogaster species group. Outgroup analysis also revealed evidence for widespread breakpoint recycling. Lastly, we have found that expression domains in D. melanogaster may be disrupted in D. yakuba, bringing into question their potential adaptive significance. PMID:17550304

  2. Complete Genomic Sequence for an Avian Group G Rotavirus from South Africa

    PubMed Central

    Stucker, Karla M.; Stockwell, Timothy B.; Nyaga, Martin M.; Halpin, Rebecca A.; Fedorova, Nadia; Akopov, Asmik; Ngoveni, Harry; Peenze, Ina; Seheri, Mapaseka L.; Mphahlele, M. Jeffrey

    2015-01-01

    We report the first complete sequence for an avian group G rotavirus (RVG) genome from Africa, which is the third publically available RVG genome. These RVG genomes are highly diverse, especially in their VP4, VP7, NSP4, and NSP3 segments, indicating that RVG diversity is comparable to that of rotavirus A. PMID:25767240

  3. Complete genomic sequence for an avian group G rotavirus from South Africa.

    PubMed

    Stucker, Karla M; Stockwell, Timothy B; Nyaga, Martin M; Halpin, Rebecca A; Fedorova, Nadia; Akopov, Asmik; Ngoveni, Harry; Peenze, Ina; Seheri, Mapaseka L; Mphahlele, M Jeffrey; Wentworth, David E

    2015-01-01

    We report the first complete sequence for an avian group G rotavirus (RVG) genome from Africa, which is the third publically available RVG genome. These RVG genomes are highly diverse, especially in their VP4, VP7, NSP4, and NSP3 segments, indicating that RVG diversity is comparable to that of rotavirus A. PMID:25767240

  4. Modeling growth and dissemination of lymphoma in a co-evolving lymph node: a diffuse-domain approach

    NASA Astrophysics Data System (ADS)

    Chuang, Yao-Li; Cristini, Vittorio; Chen, Ying; Li, Xiangrong; Frieboes, Hermann; Lowengrub, John

    2013-03-01

    While partial differential equation models of tumor growth have successfully described various spatiotemporal phenomena observed for in-vitro tumor spheroid experiments, one challenge towards taking these models to further study in-vivo tumors is that instead of relatively static tissue culture with regular boundary conditions, in-vivo tumors are often confined in organ tissues that co-evolve with the tumor growth. Here we adopt a recently developed diffuse-domain method to account for the co-evolving domain boundaries, adapting our previous in-vitro tumor model for the development of lymphoma encapsulated in a lymph node, which may swell or shrink due to proliferation and dissemination of lymphoma cells and treatment by chemotherapy. We use the model to study the induced spatial heterogeneity, which may arise as an emerging phenomenon in experimental observations and model analysis. Spatial heterogeneity is believed to lead to tumor infiltration patterns and reduce the efficacy of chemotherapy, leaving residuals that cause cancer relapse after the treatment. Understanding the spatiotemporal evolution of in-vivo tumors can be an essential step towards more effective strategies of curing cancer. Supported by NIH-PSOC grant 1U54CA143907-01.

  5. Complete Genome Sequences of Four Novel Escherichia coli Bacteriophages Belonging to New Phage Groups

    PubMed Central

    Kot, Witold

    2015-01-01

    Here, we describe the sequencing and genome annotations of a set of four Escherichia coli bacteriophages (phages) belonging to newly discovered groups previously consisting of only a single phage and thus expand our knowledge of these phage groups. PMID:26184932

  6. PhyloGene server for identification and visualization of co-evolving proteins using normalized phylogenetic profiles

    PubMed Central

    Sadreyev, Ilyas R.; Ji, Fei; Cohen, Emiliano; Ruvkun, Gary; Tabach, Yuval

    2015-01-01

    Proteins that function in the same pathways, protein complexes or the same environmental conditions can show similar patterns of sequence conservation across phylogenetic clades. In species that no longer require a specific protein complex or pathway, these proteins, as a group, tend to be lost or diverge. Analysis of the similarity in patterns of sequence conservation across a large set of eukaryotes can predict functional associations between different proteins, identify new pathway members and reveal the function of previously uncharacterized proteins. We used normalized phylogenetic profiling to predict protein function and identify new pathway members and disease genes. The phylogenetic profiles of tens of thousands conserved proteins in the human, mouse, Caenorhabditis elegans and Drosophila genomes can be queried on our new web server, PhyloGene. PhyloGene provides intuitive and user-friendly platform to query the patterns of conservation across 86 animal, fungal, plant and protist genomes. A protein query can be submitted either by selecting the name from whole-genome protein sets of the intensively studied species or by entering a protein sequence. The graphic output shows the profile of sequence conservation for the query and the most similar phylogenetic profiles for the proteins in the genome of choice. The user can also download this output in numerical form. PMID:25958392

  7. PhyloGene server for identification and visualization of co-evolving proteins using normalized phylogenetic profiles.

    PubMed

    Sadreyev, Ilyas R; Ji, Fei; Cohen, Emiliano; Ruvkun, Gary; Tabach, Yuval

    2015-07-01

    Proteins that function in the same pathways, protein complexes or the same environmental conditions can show similar patterns of sequence conservation across phylogenetic clades. In species that no longer require a specific protein complex or pathway, these proteins, as a group, tend to be lost or diverge. Analysis of the similarity in patterns of sequence conservation across a large set of eukaryotes can predict functional associations between different proteins, identify new pathway members and reveal the function of previously uncharacterized proteins. We used normalized phylogenetic profiling to predict protein function and identify new pathway members and disease genes. The phylogenetic profiles of tens of thousands conserved proteins in the human, mouse, Caenorhabditis elegans and Drosophila genomes can be queried on our new web server, PhyloGene. PhyloGene provides intuitive and user-friendly platform to query the patterns of conservation across 86 animal, fungal, plant and protist genomes. A protein query can be submitted either by selecting the name from whole-genome protein sets of the intensively studied species or by entering a protein sequence. The graphic output shows the profile of sequence conservation for the query and the most similar phylogenetic profiles for the proteins in the genome of choice. The user can also download this output in numerical form.

  8. Genomic Sequence or Signature Tags (GSTs) from the Genome Group at Brookhaven National Laboratory (BNL)

    DOE Data Explorer

    Dunn, John J.; McCorkle, Sean R.; Praissman, Laura A.; Hind, Geoffrey; Van der Lelie, Daniel; Bahou, Wadie F.; Gnatenko, Dmitri V.; Krause, Maureen K.

    Genomic Signature Tags (GSTs) are the products of a method we have developed for identifying and quantitatively analyzing genomic DNAs. The DNA is initially fragmented with a type II restriction enzyme. An oligonucleotide adaptor containing a recognition site for MmeI, a type IIS restriction enzyme, is then used to release 21-bp tags from fixed positions in the DNA relative to the sites recognized by the fragmenting enzyme. These tags are PCR-amplified, purified, concatenated and then cloned and sequenced. The tag sequences and abundances are used to create a high resolution GST sequence profile of the genomic DNA. [Quoted from Genomic Signature Tags (GSTs): A System for Profiling Genomic DNA, Dunn, John J.; McCorkle, Sean R.; Praissman, Laura A.; Hind, Geoffrey; Van der Lelie, Daniel; Bahou, Wadie F.; Gnatenko, Dmitri V.; Krause, Maureen K., Revised 9/13/2002

  9. Comparative Genomics of the Bacterial Genus Streptococcus Illuminates Evolutionary Implications of Species Groups

    PubMed Central

    Gao, Xiao-Yang; Zhi, Xiao-Yang; Li, Hong-Wei; Klenk, Hans-Peter; Li, Wen-Jun

    2014-01-01

    Members of the genus Streptococcus within the phylum Firmicutes are among the most diverse and significant zoonotic pathogens. This genus has gone through considerable taxonomic revision due to increasing improvements of chemotaxonomic approaches, DNA hybridization and 16S rRNA gene sequencing. It is proposed to place the majority of streptococci into “species groups”. However, the evolutionary implications of species groups are not clear presently. We use comparative genomic approaches to yield a better understanding of the evolution of Streptococcus through genome dynamics, population structure, phylogenies and virulence factor distribution of species groups. Genome dynamics analyses indicate that the pan-genome size increases with the addition of newly sequenced strains, while the core genome size decreases with sequential addition at the genus level and species group level. Population structure analysis reveals two distinct lineages, one including Pyogenic, Bovis, Mutans and Salivarius groups, and the other including Mitis, Anginosus and Unknown groups. Phylogenetic dendrograms show that species within the same species group cluster together, and infer two main clades in accordance with population structure analysis. Distribution of streptococcal virulence factors has no obvious patterns among the species groups; however, the evolution of some common virulence factors is congruous with the evolution of species groups, according to phylogenetic inference. We suggest that the proposed streptococcal species groups are reasonable from the viewpoints of comparative genomics; evolution of the genus is congruent with the individual evolutionary trajectories of different species groups. PMID:24977706

  10. Sulfur-oxidizing symbionts have not co-evolved with their hydrothermal vent tube worm hosts: an RFLP analysis.

    PubMed

    Laue, B E; Nelson, D C

    1997-09-01

    A fine-scale phylogenetic comparison was made among the symbionts of different genera of hydrothermal vent tube worms. These included Riftia pachyptila and Tevnia jerichonona, which inhabit sites along the east Pacific Rise, and Ridgeia piscesae from the Juan de Fuca Ridge. An analysis of restriction fragment length polymorphism (RFLP) was employed using three symbiont-specific gene probes: eubacterial 16S rRNA, RuBPC/O Form II, and ATP sulfurylase (recently cloned from the Riftia symbiont). Results indicated that all of the symbionts from the three different hosts were conspecific and the Riftia and Tevnia symbionts were indistinguishable over and 1800-km range. Significantly, this indicates that the symbionts have not co-evolved with their respective hosts, which are known to belong to separate families. This study strongly supports the conclusion that the symbionts are acquired de novo by each generation of juvenile tube worms from a common source in the surrounding sea water. PMID:9284558

  11. Comparative genomics and functional analysis of the 936 group of lactococcal Siphoviridae phages

    PubMed Central

    Murphy, James; Bottacini, Francesca; Mahony, Jennifer; Kelleher, Philip; Neve, Horst; Zomer, Aldert; Nauta, Arjen; van Sinderen, Douwe

    2016-01-01

    Genome sequencing and comparative analysis of bacteriophage collections has greatly enhanced our understanding regarding their prevalence, phage-host interactions as well as the overall biodiversity of their genomes. This knowledge is very relevant to phages infecting Lactococcus lactis, since they constitute a significant risk factor for dairy fermentations. Of the eighty four lactococcal phage genomes currently available, fifty five belong to the so-called 936 group, the most prevalent of the ten currently recognized lactococcal phage groups. Here, we report the genetic characteristics of a new collection of 936 group phages. By combining these genomes to those sequenced previously we determined the core and variable elements of the 936 genome. Genomic variation occurs across the 936 phage genome, such as genetic elements that (i) lead to a +1 translational frameshift resulting in the formation of additional structures on the phage tail, (ii) specify a double neck passage structure, and (iii) encode packaging module-associated methylases. Hierarchical clustering of the gene complement of the 936 group phages and nucleotide alignments allowed grouping of the ninety 936 group phages into distinct clusters, which in general appear to correspond with their geographical origin. PMID:26892066

  12. Population genomics and research ethics with socially identifable groups.

    PubMed

    McGregor, Joan L

    2007-01-01

    In this paper, the author questions whether the research ethics guidelines and procedures are robust enough to protect groups when conducting genetics research with socially identifiable populations, particularly with Native American groups. The author argues for a change in the federal guidelines in substance and procedures of conducting genetic research with socially identifiable groups.

  13. Application of Whole-Genome Sequencing to an Unusual Outbreak of Invasive Group A Streptococcal Disease

    PubMed Central

    Galloway-Peña, Jessica; Clement, Meredith E.; Sharma Kuinkel, Batu K.; Ruffin, Felicia; Flores, Anthony R.; Levinson, Howard; Shelburne, Samuel A.; Moore, Zack; Fowler, Vance G.

    2016-01-01

    Whole-genome analysis was applied to investigate atypical point-source transmission of 2 invasive group A streptococcal (GAS) infections. Isolates were serotype M4, ST39, and genetically indistinguishable. Comparison with MGAS10750 revealed nonsynonymous polymorphisms in ropB and increased speB transcription. This study demonstrates the usefulness of whole-genome analyses for GAS outbreaks. PMID:27006966

  14. A quantitative measure for organization of complex and co-evolving networks

    NASA Astrophysics Data System (ADS)

    Georgiev, Georgi

    2012-02-01

    To define evolution and self-organization in complex networks a quantitative measure for organization is necessary. Two systems should be numerically distinguishable by their degree of organization and their rate of self-organization. Here we apply as a measure for quantity of organization the inverse of the average sum of physical actions of all elements in a system per unit motion multiplied by the Planck's constant. The meaning of quantity of organization here is the number of quanta of action per one unit motion of an element. For example, a unit motion for electrons on a computer chip is the one necessary for one computation. This definition can be applied to the organization in any complex system. Systems self-organize to decrease the average action per element per unit motion in them. This is the attractor for a dynamical, nonlinear system evolving in time. Constraints increase this average action, so constraint minimization is a basic mechanism for action minimization. Increase of quantity of elements in the network, leads to faster constraint minimization through grouping, decrease of average action per element and motion and therefore faster self-organization and evolution.

  15. Genome Sequence of Acidovorax citrulli Group 1 Strain pslb65 Causing Bacterial Fruit Blotch of Melons.

    PubMed

    Wang, Tielin; Sun, Baixin; Yang, Yuwen; Zhao, Tingchang

    2015-04-23

    Acidovorax citrulli is typed into two groups, mainly based on the host. We determined the draft genome of A. citrulli group 1 strain pslb65. The strain was isolated from melon collected from Xinjiang province, China. The A. citrulli pslb65 genome contains 4,903,443 bp and has a G+C content of 68.8 mol%.

  16. Genome Sequence of Acidovorax citrulli Group 1 Strain pslb65 Causing Bacterial Fruit Blotch of Melons

    PubMed Central

    Wang, Tielin; Sun, Baixin; Yang, Yuwen

    2015-01-01

    Acidovorax citrulli is typed into two groups, mainly based on the host. We determined the draft genome of A. citrulli group 1 strain pslb65. The strain was isolated from melon collected from Xinjiang province, China. The A. citrulli pslb65 genome contains 4,903,443 bp and has a G+C content of 68.8 mol%. PMID:25908136

  17. Alignment-free genome tree inference by learning group-specific distance metrics.

    PubMed

    Patil, Kaustubh R; McHardy, Alice C

    2013-01-01

    Understanding the evolutionary relationships between organisms is vital for their in-depth study. Gene-based methods are often used to infer such relationships, which are not without drawbacks. One can now attempt to use genome-scale information, because of the ever increasing number of genomes available. This opportunity also presents a challenge in terms of computational efficiency. Two fundamentally different methods are often employed for sequence comparisons, namely alignment-based and alignment-free methods. Alignment-free methods rely on the genome signature concept and provide a computationally efficient way that is also applicable to nonhomologous sequences. The genome signature contains evolutionary signal as it is more similar for closely related organisms than for distantly related ones. We used genome-scale sequence information to infer taxonomic distances between organisms without additional information such as gene annotations. We propose a method to improve genome tree inference by learning specific distance metrics over the genome signature for groups of organisms with similar phylogenetic, genomic, or ecological properties. Specifically, our method learns a Mahalanobis metric for a set of genomes and a reference taxonomy to guide the learning process. By applying this method to more than a thousand prokaryotic genomes, we showed that, indeed, better distance metrics could be learned for most of the 18 groups of organisms tested here. Once a group-specific metric is available, it can be used to estimate the taxonomic distances for other sequenced organisms from the group. This study also presents a large scale comparison between 10 methods--9 alignment-free and 1 alignment-based.

  18. Phage Morphology Recapitulates Phylogeny: The Comparative Genomics of a New Group of Myoviruses

    PubMed Central

    Comeau, André M.; Tremblay, Denise; Moineau, Sylvain; Rattei, Thomas; Kushkina, Alla I.; Tovkach, Fedor I.; Krisch, Henry M.; Ackermann, Hans-Wolfgang

    2012-01-01

    Among dsDNA tailed bacteriophages (Caudovirales), members of the Myoviridae family have the most sophisticated virion design that includes a complex contractile tail structure. The Myoviridae generally have larger genomes than the other phage families. Relatively few “dwarf” myoviruses, those with a genome size of less than 50 kb such as those of the Mu group, have been analyzed in extenso. Here we report on the genome sequencing and morphological characterization of a new group of such phages that infect a diverse range of Proteobacteria, namely Aeromonas salmonicida phage 56, Vibrio cholerae phages 138 and CP-T1, Bdellovibrio phage φ1422, and Pectobacterium carotovorum phage ZF40. This group of dwarf myoviruses shares an identical virion morphology, characterized by usually short contractile tails, and have genome sizes of approximately 45 kb. Although their genome sequences are variable in their lysogeny, replication, and host adaption modules, presumably reflecting differing lifestyles and hosts, their structural and morphogenesis modules have been evolutionarily constrained by their virion morphology. Comparative genomic analysis reveals that these phages, along with related prophage genomes, form a new coherent group within the Myoviridae. The results presented in this communication support the hypothesis that the diversity of phages may be more structured than generally believed and that the innumerable phages in the biosphere all belong to discrete lineages or families. PMID:22792219

  19. Nitrogen limitation as a driver of genome size evolution in a group of karst plants.

    PubMed

    Kang, Ming; Wang, Jing; Huang, Hongwen

    2015-06-25

    Genome size is of fundamental biological importance with significance in predicting structural and functional attributes of organisms. Although abundant evidence has shown that the genome size can be largely explained by differential proliferation and removal of non-coding DNA of the genome, the evolutionary and ecological basis of genome size variation remains poorly understood. Nitrogen (N) and phosphorus (P) are essential elements of DNA and protein building blocks, yet often subject to environmental limitation in natural ecosystems. Using phylogenetic comparative methods, we test this hypothesis by determining whether leaf N and P availability affects genome sizes in 99 species of Primulina (Gesneriaceae), a group of soil specialists adapted to limestone karst environment in south China. We find that genome sizes in Primulina are strongly positively correlated with plant N content, but the correlation with plant P content is not significant when phylogeny history was taken into account. This study shows for the first time that N limitation might have been a plausible driver of genome size variation in a group of plants. We propose that competition for nitrogen nutrient between DNA synthesis and cellular functions is a possible mechanism for genome size evolution in Primulina under N-limitation.

  20. Nitrogen limitation as a driver of genome size evolution in a group of karst plants

    NASA Astrophysics Data System (ADS)

    Kang, Ming; Wang, Jing; Huang, Hongwen

    2015-06-01

    Genome size is of fundamental biological importance with significance in predicting structural and functional attributes of organisms. Although abundant evidence has shown that the genome size can be largely explained by differential proliferation and removal of non-coding DNA of the genome, the evolutionary and ecological basis of genome size variation remains poorly understood. Nitrogen (N) and phosphorus (P) are essential elements of DNA and protein building blocks, yet often subject to environmental limitation in natural ecosystems. Using phylogenetic comparative methods, we test this hypothesis by determining whether leaf N and P availability affects genome sizes in 99 species of Primulina (Gesneriaceae), a group of soil specialists adapted to limestone karst environment in south China. We find that genome sizes in Primulina are strongly positively correlated with plant N content, but the correlation with plant P content is not significant when phylogeny history was taken into account. This study shows for the first time that N limitation might have been a plausible driver of genome size variation in a group of plants. We propose that competition for nitrogen nutrient between DNA synthesis and cellular functions is a possible mechanism for genome size evolution in Primulina under N-limitation.

  1. Complete Genome Sequences of Nine Bacillus cereus Group Phages.

    PubMed

    Foltz, Samantha; Johnson, Allison A

    2016-01-01

    We report the sequences of nine novel Bacillus cereus group bacteriophages: DIGNKC, Juglone, Nemo, Nigalana, NotTheCreek, Phrodo, SageFayge, Vinny, and Zuko. These bacteriophages are double-stranded DNA-containing Myoviridae isolated from soil samples using B. thuringiensis subsp. kurstaki as the host bacterium.

  2. Genome Sequence of Bacillus cereus Group Phage SalinJah.

    PubMed

    Erill, Ivan; Caruso, Steven M

    2016-01-01

    The double-stranded DNA (dsDNA) Myoviridae Bacillus cereus group bacteriophage SalinJah was isolated from soil collected in Gyeonggi-do, South Korea. SalinJah, a cluster C phage with a broad host range, suggests the need to create a new subcluster with SalinJah and Helga as founding members.

  3. Complete Genome Sequences of Nine Bacillus cereus Group Phages

    PubMed Central

    Foltz, Samantha

    2016-01-01

    We report the sequences of nine novel Bacillus cereus group bacteriophages: DIGNKC, Juglone, Nemo, Nigalana, NotTheCreek, Phrodo, SageFayge, Vinny, and Zuko. These bacteriophages are double-stranded DNA-containing Myoviridae isolated from soil samples using B. thuringiensis subsp. kurstaki as the host bacterium. PMID:27417827

  4. Genome Sequence of Bacillus cereus Group Phage SalinJah

    PubMed Central

    2016-01-01

    The double-stranded DNA (dsDNA) Myoviridae Bacillus cereus group bacteriophage SalinJah was isolated from soil collected in Gyeonggi-do, South Korea. SalinJah, a cluster C phage with a broad host range, suggests the need to create a new subcluster with SalinJah and Helga as founding members. PMID:27688335

  5. Complete Genome Sequences of Nine Bacillus cereus Group Phages.

    PubMed

    Foltz, Samantha; Johnson, Allison A

    2016-01-01

    We report the sequences of nine novel Bacillus cereus group bacteriophages: DIGNKC, Juglone, Nemo, Nigalana, NotTheCreek, Phrodo, SageFayge, Vinny, and Zuko. These bacteriophages are double-stranded DNA-containing Myoviridae isolated from soil samples using B. thuringiensis subsp. kurstaki as the host bacterium. PMID:27417827

  6. Genome Sequence of Bacillus cereus Group Phage SalinJah.

    PubMed

    Erill, Ivan; Caruso, Steven M

    2016-01-01

    The double-stranded DNA (dsDNA) Myoviridae Bacillus cereus group bacteriophage SalinJah was isolated from soil collected in Gyeonggi-do, South Korea. SalinJah, a cluster C phage with a broad host range, suggests the need to create a new subcluster with SalinJah and Helga as founding members. PMID:27688335

  7. Co-evolving CENP-A and CAL1 Domains Mediate Centromeric CENP-A Deposition across Drosophila Species.

    PubMed

    Rosin, Leah; Mellone, Barbara G

    2016-04-18

    Centromeres mediate the conserved process of chromosome segregation, yet centromeric DNA and the centromeric histone, CENP-A, are rapidly evolving. The rapid evolution of Drosophila CENP-A loop 1 (L1) is thought to modulate the DNA-binding preferences of CENP-A to counteract centromere drive, the preferential transmission of chromosomes with expanded centromeric satellites. Consistent with this model, CENP-A from Drosophila bipectinata (bip) cannot localize to Drosophila melanogaster (mel) centromeres. We show that this result is due to the inability of the mel CENP-A chaperone, CAL1, to deposit bip CENP-A into chromatin. Co-expression of bip CENP-A and bip CAL1 in mel cells restores centromeric localization, and similar findings apply to other Drosophila species. We identify two co-evolving regions, CENP-A L1 and the CAL1 N terminus, as critical for lineage-specific CENP-A incorporation. Collectively, our data show that the rapid evolution of L1 modulates CAL1-mediated CENP-A assembly, suggesting an alternative mechanism for the suppression of centromere drive.

  8. Co-evolving CENP-A and CAL1 Domains Mediate Centromeric CENP-A Deposition across Drosophila Species.

    PubMed

    Rosin, Leah; Mellone, Barbara G

    2016-04-18

    Centromeres mediate the conserved process of chromosome segregation, yet centromeric DNA and the centromeric histone, CENP-A, are rapidly evolving. The rapid evolution of Drosophila CENP-A loop 1 (L1) is thought to modulate the DNA-binding preferences of CENP-A to counteract centromere drive, the preferential transmission of chromosomes with expanded centromeric satellites. Consistent with this model, CENP-A from Drosophila bipectinata (bip) cannot localize to Drosophila melanogaster (mel) centromeres. We show that this result is due to the inability of the mel CENP-A chaperone, CAL1, to deposit bip CENP-A into chromatin. Co-expression of bip CENP-A and bip CAL1 in mel cells restores centromeric localization, and similar findings apply to other Drosophila species. We identify two co-evolving regions, CENP-A L1 and the CAL1 N terminus, as critical for lineage-specific CENP-A incorporation. Collectively, our data show that the rapid evolution of L1 modulates CAL1-mediated CENP-A assembly, suggesting an alternative mechanism for the suppression of centromere drive. PMID:27093083

  9. Characterization and comparative genomic analysis of bacteriophages infecting members of the Bacillus cereus group.

    PubMed

    Lee, Ju-Hoon; Shin, Hakdong; Ryu, Sangryeol

    2014-05-01

    The Bacillus cereus group phages infecting B. cereus, B. anthracis, and B. thuringiensis (Bt) have been studied at the molecular level and, recently, at the genomic level to control the pathogens B. cereus and B. anthracis and to prevent phage contamination of the natural insect pesticide Bt. A comparative phylogenetic analysis has revealed three different major phage groups with different morphologies (Myoviridae for group I, Siphoviridae for group II, and Tectiviridae for group III), genome size (group I > group II > group III), and lifestyle (virulent for group I and temperate for group II and III). A subsequent phage genome comparison using a dot plot analysis showed that phages in each group are highly homologous, substantiating the grouping of B. cereus phages. Endolysin is a host lysis protein that contains two conserved domains: a cell-wall-binding domain (CBD) and an enzymatic activity domain (EAD). In B. cereus sensu lato phage group I, four different endolysin groups have been detected, according to combinations of two types of CBD and four types of EAD. Group I phages have two copies of tail lysins and one copy of endolysin, but the functions of the tail lysins are still unknown. In the B. cereus sensu lato phage group II, the B. anthracis phages have been studied and applied for typing and rapid detection of pathogenic host strains. In the B. cereus sensu lato phage group III, the B. thuringiensis phages Bam35 and GIL01 have been studied to understand phage entry and lytic switch regulation mechanisms. In this review, we suggest that further study of the B. cereus group phages would be useful for various phage applications, such as biocontrol, typing, and rapid detection of the pathogens B. cereus and B. anthracis and for the prevention of phage contamination of the natural insect pesticide Bt.

  10. StreptoBase: An Oral Streptococcus mitis Group Genomic Resource and Analysis Platform.

    PubMed

    Zheng, Wenning; Tan, Tze King; Paterson, Ian C; Mutha, Naresh V R; Siow, Cheuk Chuen; Tan, Shi Yang; Old, Lesley A; Jakubovics, Nicholas S; Choo, Siew Woh

    2016-01-01

    The oral streptococci are spherical Gram-positive bacteria categorized under the phylum Firmicutes which are among the most common causative agents of bacterial infective endocarditis (IE) and are also important agents in septicaemia in neutropenic patients. The Streptococcus mitis group is comprised of 13 species including some of the most common human oral colonizers such as S. mitis, S. oralis, S. sanguinis and S. gordonii as well as species such as S. tigurinus, S. oligofermentans and S. australis that have only recently been classified and are poorly understood at present. We present StreptoBase, which provides a specialized free resource focusing on the genomic analyses of oral species from the mitis group. It currently hosts 104 S. mitis group genomes including 27 novel mitis group strains that we sequenced using the high throughput Illumina HiSeq technology platform, and provides a comprehensive set of genome sequences for analyses, particularly comparative analyses and visualization of both cross-species and cross-strain characteristics of S. mitis group bacteria. StreptoBase incorporates sophisticated in-house designed bioinformatics web tools such as Pairwise Genome Comparison (PGC) tool and Pathogenomic Profiling Tool (PathoProT), which facilitate comparative pathogenomics analysis of Streptococcus strains. Examples are provided to demonstrate how StreptoBase can be employed to compare genome structure of different S. mitis group bacteria and putative virulence genes profile across multiple streptococcal strains. In conclusion, StreptoBase offers access to a range of streptococci genomic resources as well as analysis tools and will be an invaluable platform to accelerate research in streptococci. Database URL: http://streptococcus.um.edu.my.

  11. StreptoBase: An Oral Streptococcus mitis Group Genomic Resource and Analysis Platform.

    PubMed

    Zheng, Wenning; Tan, Tze King; Paterson, Ian C; Mutha, Naresh V R; Siow, Cheuk Chuen; Tan, Shi Yang; Old, Lesley A; Jakubovics, Nicholas S; Choo, Siew Woh

    2016-01-01

    The oral streptococci are spherical Gram-positive bacteria categorized under the phylum Firmicutes which are among the most common causative agents of bacterial infective endocarditis (IE) and are also important agents in septicaemia in neutropenic patients. The Streptococcus mitis group is comprised of 13 species including some of the most common human oral colonizers such as S. mitis, S. oralis, S. sanguinis and S. gordonii as well as species such as S. tigurinus, S. oligofermentans and S. australis that have only recently been classified and are poorly understood at present. We present StreptoBase, which provides a specialized free resource focusing on the genomic analyses of oral species from the mitis group. It currently hosts 104 S. mitis group genomes including 27 novel mitis group strains that we sequenced using the high throughput Illumina HiSeq technology platform, and provides a comprehensive set of genome sequences for analyses, particularly comparative analyses and visualization of both cross-species and cross-strain characteristics of S. mitis group bacteria. StreptoBase incorporates sophisticated in-house designed bioinformatics web tools such as Pairwise Genome Comparison (PGC) tool and Pathogenomic Profiling Tool (PathoProT), which facilitate comparative pathogenomics analysis of Streptococcus strains. Examples are provided to demonstrate how StreptoBase can be employed to compare genome structure of different S. mitis group bacteria and putative virulence genes profile across multiple streptococcal strains. In conclusion, StreptoBase offers access to a range of streptococci genomic resources as well as analysis tools and will be an invaluable platform to accelerate research in streptococci. Database URL: http://streptococcus.um.edu.my. PMID:27138013

  12. StreptoBase: An Oral Streptococcus mitis Group Genomic Resource and Analysis Platform

    PubMed Central

    Zheng, Wenning; Paterson, Ian C.; Mutha, Naresh V. R.; Siow, Cheuk Chuen; Tan, Shi Yang; Old, Lesley A.; Jakubovics, Nicholas S.; Choo, Siew Woh

    2016-01-01

    The oral streptococci are spherical Gram-positive bacteria categorized under the phylum Firmicutes which are among the most common causative agents of bacterial infective endocarditis (IE) and are also important agents in septicaemia in neutropenic patients. The Streptococcus mitis group is comprised of 13 species including some of the most common human oral colonizers such as S. mitis, S. oralis, S. sanguinis and S. gordonii as well as species such as S. tigurinus, S. oligofermentans and S. australis that have only recently been classified and are poorly understood at present. We present StreptoBase, which provides a specialized free resource focusing on the genomic analyses of oral species from the mitis group. It currently hosts 104 S. mitis group genomes including 27 novel mitis group strains that we sequenced using the high throughput Illumina HiSeq technology platform, and provides a comprehensive set of genome sequences for analyses, particularly comparative analyses and visualization of both cross-species and cross-strain characteristics of S. mitis group bacteria. StreptoBase incorporates sophisticated in-house designed bioinformatics web tools such as Pairwise Genome Comparison (PGC) tool and Pathogenomic Profiling Tool (PathoProT), which facilitate comparative pathogenomics analysis of Streptococcus strains. Examples are provided to demonstrate how StreptoBase can be employed to compare genome structure of different S. mitis group bacteria and putative virulence genes profile across multiple streptococcal strains. In conclusion, StreptoBase offers access to a range of streptococci genomic resources as well as analysis tools and will be an invaluable platform to accelerate research in streptococci. Database URL: http://streptococcus.um.edu.my. PMID:27138013

  13. Extending the Bacillus cereus group genomics to putative food-borne pathogens of different toxicity.

    PubMed

    Lapidus, Alla; Goltsman, Eugene; Auger, Sandrine; Galleron, Nathalie; Ségurens, Béatrice; Dossat, Carole; Land, Miriam L; Broussolle, Veronique; Brillard, Julien; Guinebretiere, Marie-Helene; Sanchis, Vincent; Nguen-The, Christophe; Lereclus, Didier; Richardson, Paul; Wincker, Patrick; Weissenbach, Jean; Ehrlich, S Dusko; Sorokin, Alexei

    2008-01-30

    The Bacillus cereus group represents sporulating soil bacteria containing pathogenic strains which may cause diarrheic or emetic food poisoning outbreaks. Multiple locus sequence typing revealed a presence in natural samples of these bacteria of about 30 clonal complexes. Application of genomic methods to this group was however biased due to the major interest for representatives closely related to Bacillus anthracis. Albeit the most important food-borne pathogens were not yet defined, existing data indicate that they are scattered all over the phylogenetic tree. The preliminary analysis of the sequences of three genomes discussed in this paper narrows down the gaps in our knowledge of the B. cereus group. The strain NVH391-98 is a rare but particularly severe food-borne pathogen. Sequencing revealed that the strain should be a representative of a novel bacterial species, for which the name Bacillus cytotoxis or Bacillus cytotoxicus is proposed. This strain has a reduced genome size compared to other B. cereus group strains. Genome analysis revealed absence of sigma B factor and the presence of genes encoding diarrheic Nhe toxin, not detected earlier. The strain B. cereus F837/76 represents a clonal complex close to that of B. anthracis. Including F837/76, three such B. cereus strains had been sequenced. Alignment of genomes suggests that B. anthracis is their common ancestor. Since such strains often emerge from clinical cases, they merit a special attention. The third strain, KBAB4, is a typical facultative psychrophile generally found in soil. Phylogenic studies show that in nature it is the most active group in terms of gene exchange. Genomic sequence revealed high presence of extra-chromosomal genetic material (about 530kb) that may account for this phenomenon. Genes coding Nhe-like toxin were found on a big plasmid in this strain. This may indicate a potential mechanism of toxicity spread from the psychrophile strain community. The results of this genomic

  14. Extending the Bacillus cereus group genomics to putative food-borne pathogens of different toxicity

    SciTech Connect

    Lapidus, Alla L.; Goltsman, Eugene; Auger, Sandrine; Galleron, Nathalie; Segurens, Beatrice; Simon, Jorg; Dossat, Carole; Broussolle, Veronique; Brillard, Julien; Guinebretiere, Marie-Helene; Sanchis, Vincent; Nguen-the, Christophe; Lereclus, Didier; Richardson, P M; Wincker, Patrick; Sorokin, Alexei

    2008-01-01

    The Bacillus cereus group represents sporulating soil bacteria containing pathogenic strains which may cause diarrheic or emetic food poisoning outbreaks. Multiple locus sequence typing revealed a presence in natural samples of these bacteria of about 30 clonal complexes. Application of genomic methods to this group was however biased due to the major interest for representatives closely related to Bacillus anthracis. Albeit the most important food-borne pathogens were not yet defined, existing data indicate that they are scattered all over the phylogenetic tree. The preliminary analysis of the sequences of three genomes discussed in this paper narrows down the gaps in our knowledge of the B. cereus group. The strain NVH391-98 is a rare but particularly severe food-borne pathogen. Sequencing revealed that the strain should be a representative of a novel bacterial species, for which the name Bacillus cytotoxis or Bacillus cytotoxicus is proposed. This strain has a reduced genome size compared to other B. cereus group strains. Genome analysis revealed absence of sigma B factor and the presence of genes encoding diarrheic Nhe toxin, not detected earlier. The strain B. cereus F837/76 represents a clonal complex close to that of B. anthracis. Including F837/76, three such B. cereus strains had been sequenced. Alignment of genomes suggests that B. anthracis is their common ancestor. Since such strains often emerge from clinical cases, they merit a special attention. The third strain, KBAB4, is a typical facultative psychrophile generally found in soil. Phylogenic studies show that in nature it is the most active group in terms of gene exchange. Genomic sequence revealed high presence of extra-chromosomal genetic material (about 530 kb) that may account for this phenomenon. Genes coding Nhe-like toxin were found on a big plasmid in this strain. This may indicate a potential mechanism of toxicity spread from the psychrophile strain community. The results of this genomic

  15. Whole genome sequencing as a tool for phylogenetic analysis of clinical strains of Mitis group streptococci.

    PubMed

    Rasmussen, L H; Dargis, R; Højholt, K; Christensen, J J; Skovgaard, O; Justesen, U S; Rosenvinge, F S; Moser, C; Lukjancenko, O; Rasmussen, S; Nielsen, X C

    2016-10-01

    Identification of Mitis group streptococci (MGS) to the species level is challenging for routine microbiology laboratories. Correct identification is crucial for the diagnosis of infective endocarditis, identification of treatment failure, and/or infection relapse. Eighty MGS from Danish patients with infective endocarditis were whole genome sequenced. We compared the phylogenetic analyses based on single genes (recA, sodA, gdh), multigene (MLSA), SNPs, and core-genome sequences. The six phylogenetic analyses generally showed a similar pattern of six monophyletic clusters, though a few differences were observed in single gene analyses. Species identification based on single gene analysis showed their limitations when more strains were included. In contrast, analyses incorporating more sequence data, like MLSA, SNPs and core-genome analyses, provided more distinct clustering. The core-genome tree showed the most distinct clustering. PMID:27325438

  16. Whole genome sequencing as a tool for phylogenetic analysis of clinical strains of Mitis group streptococci.

    PubMed

    Rasmussen, L H; Dargis, R; Højholt, K; Christensen, J J; Skovgaard, O; Justesen, U S; Rosenvinge, F S; Moser, C; Lukjancenko, O; Rasmussen, S; Nielsen, X C

    2016-10-01

    Identification of Mitis group streptococci (MGS) to the species level is challenging for routine microbiology laboratories. Correct identification is crucial for the diagnosis of infective endocarditis, identification of treatment failure, and/or infection relapse. Eighty MGS from Danish patients with infective endocarditis were whole genome sequenced. We compared the phylogenetic analyses based on single genes (recA, sodA, gdh), multigene (MLSA), SNPs, and core-genome sequences. The six phylogenetic analyses generally showed a similar pattern of six monophyletic clusters, though a few differences were observed in single gene analyses. Species identification based on single gene analysis showed their limitations when more strains were included. In contrast, analyses incorporating more sequence data, like MLSA, SNPs and core-genome analyses, provided more distinct clustering. The core-genome tree showed the most distinct clustering.

  17. Genome Sequence and Analysis of Buzura suppressaria Nucleopolyhedrovirus: A Group II Alphabaculovirus

    PubMed Central

    Zhu, Zheng; Yin, Feifei; Liu, Xiaoping; Hou, Dianhai; Wang, Jun; Zhang, Lei; Arif, Basil; Wang, Hualin; Deng, Fei; Hu, Zhihong

    2014-01-01

    The genome of Buzura suppressaria nucleopolyhedrovirus (BusuNPV) was sequenced by 454 pyrosequencing technology. The size of the genome is 120,420 bp with 36.8% G+C content. It contains 127 hypothetical open reading frames (ORFs) covering 90.7% of the genome and includes the 37 conserved baculovirus core genes, 84 genes found in other baculoviruses, and 6 unique ORFs. No typical baculoviral homologous repeats (hrs) were present but the genome contained a region of repeated sequences. Gene Parity Plots revealed a 28.8 kb region conserved among the alpha- and beta-baculoviruses. Overall comparisons of BusuNPV to other baculoviruses point to a distinct species in group II Alphabaculovirus. PMID:24475121

  18. Complete genome of the uncultured Termite Group 1 bacteria in a single host protist cell.

    PubMed

    Hongoh, Yuichi; Sharma, Vineet K; Prakash, Tulika; Noda, Satoko; Taylor, Todd D; Kudo, Toshiaki; Sakaki, Yoshiyuki; Toyoda, Atsushi; Hattori, Masahira; Ohkuma, Moriya

    2008-04-01

    Termites harbor a symbiotic gut microbial community that is responsible for their ability to thrive on recalcitrant plant matter. The community comprises diverse microorganisms, most of which are as yet uncultivable; the detailed symbiotic mechanism remains unclear. Here, we present the first complete genome sequence of a termite gut symbiont-an uncultured bacterium named Rs-D17 belonging to the candidate phylum Termite Group 1 (TG1). TG1 is a dominant group in termite guts, found as intracellular symbionts of various cellulolytic protists, without any physiological information. To acquire the complete genome sequence, we collected Rs-D17 cells from only a single host protist cell to minimize their genomic variation and performed isothermal whole-genome amplification. This strategy enabled us to reconstruct a circular chromosome (1,125,857 bp) encoding 761 putative protein-coding genes. The genome additionally contains 121 pseudogenes assigned to categories, such as cell wall biosynthesis, regulators, transporters, and defense mechanisms. Despite its apparent reductive evolution, the ability to synthesize 15 amino acids and various cofactors is retained, some of these genes having been duplicated. Considering that diverse termite-gut protists harbor TG1 bacteria, we suggest that this bacterial group plays a key role in the gut symbiotic system by stably supplying essential nitrogenous compounds deficient in lignocelluloses to their host protists and the termites. Our results provide a breakthrough to clarify the functions of and the interactions among the individual members of this multilayered symbiotic complex. PMID:18391199

  19. Complete genome of the uncultured Termite Group 1 bacteria in a single host protist cell.

    PubMed

    Hongoh, Yuichi; Sharma, Vineet K; Prakash, Tulika; Noda, Satoko; Taylor, Todd D; Kudo, Toshiaki; Sakaki, Yoshiyuki; Toyoda, Atsushi; Hattori, Masahira; Ohkuma, Moriya

    2008-04-01

    Termites harbor a symbiotic gut microbial community that is responsible for their ability to thrive on recalcitrant plant matter. The community comprises diverse microorganisms, most of which are as yet uncultivable; the detailed symbiotic mechanism remains unclear. Here, we present the first complete genome sequence of a termite gut symbiont-an uncultured bacterium named Rs-D17 belonging to the candidate phylum Termite Group 1 (TG1). TG1 is a dominant group in termite guts, found as intracellular symbionts of various cellulolytic protists, without any physiological information. To acquire the complete genome sequence, we collected Rs-D17 cells from only a single host protist cell to minimize their genomic variation and performed isothermal whole-genome amplification. This strategy enabled us to reconstruct a circular chromosome (1,125,857 bp) encoding 761 putative protein-coding genes. The genome additionally contains 121 pseudogenes assigned to categories, such as cell wall biosynthesis, regulators, transporters, and defense mechanisms. Despite its apparent reductive evolution, the ability to synthesize 15 amino acids and various cofactors is retained, some of these genes having been duplicated. Considering that diverse termite-gut protists harbor TG1 bacteria, we suggest that this bacterial group plays a key role in the gut symbiotic system by stably supplying essential nitrogenous compounds deficient in lignocelluloses to their host protists and the termites. Our results provide a breakthrough to clarify the functions of and the interactions among the individual members of this multilayered symbiotic complex.

  20. Group II introns break new boundaries: presence in a bilaterian's genome.

    PubMed

    Vallès, Yvonne; Halanych, Kenneth M; Boore, Jeffrey L

    2008-01-01

    Group II introns are ribozymes, removing themselves from their primary transcripts, as well as mobile genetic elements, transposing via an RNA intermediate, and are thought to be the ancestors of spliceosomal introns. Although common in bacteria and most eukaryotic organelles, they have never been reported in any bilaterian animal genome, organellar or nuclear. Here we report the first group II intron found in the mitochondrial genome of a bilaterian worm. This location is especially surprising, since animal mitochondrial genomes are generally distinct from those of plants, fungi, and protists by being small and compact, and so are viewed as being highly streamlined, perhaps as a result of strong selective pressures for fast replication while establishing germ plasm during early development. This intron is found in the mtDNA of an annelid worm, (an undescribed species of Nephtys), where the complete sequence revealed a 1819 bp group II intron inside the cox1 gene. We infer that this intron is the result of a recent horizontal gene transfer event from a viral or bacterial vector into the mitochondrial genome of Nephtys sp. Our findings hold implications for understanding mechanisms, constraints, and selective pressures that account for patterns of animal mitochondrial genome evolution.

  1. Complete genome sequence and phylogenetic position of the Bacillus cereus group phage JBP901.

    PubMed

    Asare, Paul Tetteh; Ryu, Sangryeol; Kim, Kwang-Pyo

    2015-09-01

    Bacteriophage JBP901, isolated from fermented food, is specific for Bacillus cereus group species and exhibits a broad host spectrum among a large number of B. cereus isolates. Genome sequence analysis revealed a linear 159,492-bp genome with overall G+C content of 39.7 mol%, and 201 ORFs. The presence of a putative methylase, the large number of tRNAs, and the large number of nucleotide-metabolism- and replication-related genes in JBP901 reflects its broad lytic capacity. Most of the ORFs showed a high degree of similarity to Bcp1, Bc431v3 and BCP78, and various comparative genomics analyses also consistently clustered JBP901 with orphan (unclassified) Bacillus phages in the subfamily Spounavirinae of the family Myoviridae, supporting the presence of a distinguishable group in the subfamily.

  2. Genomic definition of hypervirulent and multidrug-resistant Klebsiella pneumoniae clonal groups.

    PubMed

    Bialek-Davenet, Suzanne; Criscuolo, Alexis; Ailloud, Florent; Passet, Virginie; Jones, Louis; Delannoy-Vieillard, Anne-Sophie; Garin, Benoit; Le Hello, Simon; Arlet, Guillaume; Nicolas-Chanoine, Marie-Hélène; Decré, Dominique; Brisse, Sylvain

    2014-11-01

    Multidrug-resistant and highly virulent Klebsiella pneumoniae isolates are emerging, but the clonal groups (CGs) corresponding to these high-risk strains have remained imprecisely defined. We aimed to identify K. pneumoniae CGs on the basis of genome-wide sequence variation and to provide a simple bioinformatics tool to extract virulence and resistance gene data from genomic data. We sequenced 48 K. pneumoniae isolates, mostly of serotypes K1 and K2, and compared the genomes with 119 publicly available genomes. A total of 694 highly conserved genes were included in a core-genome multilocus sequence typing scheme, and cluster analysis of the data enabled precise definition of globally distributed hypervirulent and multidrug-resistant CGs. In addition, we created a freely accessible database, BIGSdb-Kp, to enable rapid extraction of medically and epidemiologically relevant information from genomic sequences of K. pneumoniae. Although drug-resistant and virulent K. pneumoniae populations were largely nonoverlapping, isolates with combined virulence and resistance features were detected.

  3. Genomic definition of hypervirulent and multidrug-resistant Klebsiella pneumoniae clonal groups.

    PubMed

    Bialek-Davenet, Suzanne; Criscuolo, Alexis; Ailloud, Florent; Passet, Virginie; Jones, Louis; Delannoy-Vieillard, Anne-Sophie; Garin, Benoit; Le Hello, Simon; Arlet, Guillaume; Nicolas-Chanoine, Marie-Hélène; Decré, Dominique; Brisse, Sylvain

    2014-11-01

    Multidrug-resistant and highly virulent Klebsiella pneumoniae isolates are emerging, but the clonal groups (CGs) corresponding to these high-risk strains have remained imprecisely defined. We aimed to identify K. pneumoniae CGs on the basis of genome-wide sequence variation and to provide a simple bioinformatics tool to extract virulence and resistance gene data from genomic data. We sequenced 48 K. pneumoniae isolates, mostly of serotypes K1 and K2, and compared the genomes with 119 publicly available genomes. A total of 694 highly conserved genes were included in a core-genome multilocus sequence typing scheme, and cluster analysis of the data enabled precise definition of globally distributed hypervirulent and multidrug-resistant CGs. In addition, we created a freely accessible database, BIGSdb-Kp, to enable rapid extraction of medically and epidemiologically relevant information from genomic sequences of K. pneumoniae. Although drug-resistant and virulent K. pneumoniae populations were largely nonoverlapping, isolates with combined virulence and resistance features were detected. PMID:25341126

  4. Generalized bacterial genome editing using mobile group II introns and Cre-lox

    PubMed Central

    Enyeart, Peter J; Chirieleison, Steven M; Dao, Mai N; Perutka, Jiri; Quandt, Erik M; Yao, Jun; Whitt, Jacob T; Keatinge-Clay, Adrian T; Lambowitz, Alan M; Ellington, Andrew D

    2013-01-01

    Efficient bacterial genetic engineering approaches with broad-host applicability are rare. We combine two systems, mobile group II introns (‘targetrons') and Cre/lox, which function efficiently in many different organisms, into a versatile platform we call GETR (Genome Editing via Targetrons and Recombinases). The introns deliver lox sites to specific genomic loci, enabling genomic manipulations. Efficiency is enhanced by adding flexibility to the RNA hairpins formed by the lox sites. We use the system for insertions, deletions, inversions, and one-step cut-and-paste operations. We demonstrate insertion of a 12-kb polyketide synthase operon into the lacZ gene of Escherichia coli, multiple simultaneous and sequential deletions of up to 120 kb in E. coli and Staphylococcus aureus, inversions of up to 1.2 Mb in E. coli and Bacillus subtilis, and one-step cut-and-pastes for translocating 120 kb of genomic sequence to a site 1.5 Mb away. We also demonstrate the simultaneous delivery of lox sites into multiple loci in the Shewanella oneidensis genome. No selectable markers need to be placed in the genome, and the efficiency of Cre-mediated manipulations typically approaches 100%. PMID:24002656

  5. Genomic Definition of Hypervirulent and Multidrug-Resistant Klebsiella pneumoniae Clonal Groups

    PubMed Central

    Bialek-Davenet, Suzanne; Criscuolo, Alexis; Ailloud, Florent; Passet, Virginie; Jones, Louis; Delannoy-Vieillard, Anne-Sophie; Garin, Benoit; Le Hello, Simon; Arlet, Guillaume; Nicolas-Chanoine, Marie-Hélène; Decré, Dominique

    2014-01-01

    Multidrug-resistant and highly virulent Klebsiella pneumoniae isolates are emerging, but the clonal groups (CGs) corresponding to these high-risk strains have remained imprecisely defined. We aimed to identify K. pneumoniae CGs on the basis of genome-wide sequence variation and to provide a simple bioinformatics tool to extract virulence and resistance gene data from genomic data. We sequenced 48 K. pneumoniae isolates, mostly of serotypes K1 and K2, and compared the genomes with 119 publicly available genomes. A total of 694 highly conserved genes were included in a core-genome multilocus sequence typing scheme, and cluster analysis of the data enabled precise definition of globally distributed hypervirulent and multidrug-resistant CGs. In addition, we created a freely accessible database, BIGSdb-Kp, to enable rapid extraction of medically and epidemiologically relevant information from genomic sequences of K. pneumoniae. Although drug-resistant and virulent K. pneumoniae populations were largely nonoverlapping, isolates with combined virulence and resistance features were detected. PMID:25341126

  6. Genome Editing via Mobile Group-II Introns and Cre/lox

    NASA Astrophysics Data System (ADS)

    Enyeart, P. E.; Perutka, J.; Dao, M.; Ellington, A. E.

    2010-04-01

    Mobile group-II introns and the Cre/lox systems are combined to allow large segments of DNA to be removed or transferred within/between bacterial genomes. Planned applications include metabolic optimization and development of novel dissimilatory metal-reducing bacteria.

  7. Complete genome sequence of the bioleaching bacterium Leptospirillum sp. group II strain CF-1.

    PubMed

    Ferrer, Alonso; Bunk, Boyke; Spröer, Cathrin; Biedendieck, Rebekka; Valdés, Natalia; Jahn, Martina; Jahn, Dieter; Orellana, Omar; Levicán, Gloria

    2016-03-20

    We describe the complete genome sequence of Leptospirillum sp. group II strain CF-1, an acidophilic bioleaching bacterium isolated from an acid mine drainage (AMD). This work provides data to gain insights about adaptive response of Leptospirillum spp. to the extreme conditions of bioleaching environments. PMID:26853478

  8. Complete genome sequence of the bioleaching bacterium Leptospirillum sp. group II strain CF-1.

    PubMed

    Ferrer, Alonso; Bunk, Boyke; Spröer, Cathrin; Biedendieck, Rebekka; Valdés, Natalia; Jahn, Martina; Jahn, Dieter; Orellana, Omar; Levicán, Gloria

    2016-03-20

    We describe the complete genome sequence of Leptospirillum sp. group II strain CF-1, an acidophilic bioleaching bacterium isolated from an acid mine drainage (AMD). This work provides data to gain insights about adaptive response of Leptospirillum spp. to the extreme conditions of bioleaching environments.

  9. Genome Sequencing and Analysis of Catopsilia pomona nucleopolyhedrovirus: A Distinct Species in Group I Alphabaculovirus

    PubMed Central

    Wang, Jun; Zhu, Zheng; Zhang, Lei; Hou, Dianhai; Wang, Manli; Arif, Basil; Kou, Zheng; Wang, Hualin; Deng, Fei; Hu, Zhihong

    2016-01-01

    The genome sequence of Catopsilia pomona nucleopolyhedrovirus (CapoNPV) was determined by the Roche 454 sequencing system. The genome consisted of 128,058 bp and had an overall G+C content of 40%. There were 130 hypothetical open reading frames (ORFs) potentially encoding proteins of more than 50 amino acids and covering 92% of the genome. Among all the hypothetical ORFs, 37 baculovirus core genes, 23 lepidopteran baculovirus conserved genes and 10 genes conserved in Group I alphabaculoviruses were identified. In addition, the genome included regions of 8 typical baculoviral homologous repeat sequences (hrs). Phylogenic analysis showed that CapoNPV was in a distinct branch of clade “a” in Group I alphabaculoviruses. Gene parity plot analysis and overall similarity of ORFs indicated that CapoNPV is more closely related to the Group I alphabaculoviruses than to other baculoviruses. Interesting, CapoNPV lacks the genes encoding the fibroblast growth factor (fgf) and ac30, which are conserved in most lepidopteran and Group I baculoviruses, respectively. Sequence analysis of the F-like protein of CapoNPV showed that some amino acids were inserted into the fusion peptide region and the pre-transmembrane region of the protein. All these unique features imply that CapoNPV represents a member of a new baculovirus species. PMID:27166956

  10. Genomic and physiological variability within Group II (non-proteolytic) Clostridium botulinum

    PubMed Central

    2013-01-01

    Background Clostridium botulinum is a group of four physiologically and phylogenetically distinct bacteria that produce botulinum neurotoxin. While studies have characterised variability between strains of Group I (proteolytic) C. botulinum, the genetic and physiological variability and relationships between strains within Group II (non-proteolytic) C. botulinum are not well understood. In this study the genome of Group II strain C. botulinum Eklund 17B (NRP) was sequenced and used to construct a whole genome DNA microarray. This was used in a comparative genomic indexing study to compare the relatedness of 43 strains of Group II C. botulinum (14 type B, 24 type E and 5 type F). These results were compared with characteristics determined from physiological tests. Results Whole genome indexing showed that strains of Group II C. botulinum isolated from a wide variety of environments over more than 75 years clustered together indicating the genetic background of Group II C. botulinum is stable. Further analysis showed that strains forming type B or type F toxin are closely related with only toxin cluster genes targets being unique to either type. Strains producing type E toxin formed a separate subset. Carbohydrate fermentation tests supported the observation that type B and F strains form a separate subset to type E strains. All the type F strains and most of type B strains produced acid from amylopectin, amylose and glycogen whereas type E strains did not. However, these two subsets did not differ strongly in minimum growth temperature or maximum NaCl concentration for growth. No relationship was found between tellurite resistance and toxin type despite all the tested type B and type F strains carrying tehB, while the sequence was absent or diverged in all type E strains. Conclusions Although Group II C. botulinum form a tight genetic group, genomic and physiological analysis indicates there are two distinct subsets within this group. All type B strains and type F

  11. Evolutionary Genomics of Genes Involved in Olfactory Behavior in the Drosophila melanogaster Species Group

    PubMed Central

    Lavagnino, Nicolás; Serra, François; Arbiza, Leonardo; Dopazo, Hernán; Hasson, Esteban

    2012-01-01

    Previous comparative genomic studies of genes involved in olfactory behavior in Drosophila focused only on particular gene families such as odorant receptor and/or odorant binding proteins. However, olfactory behavior has a complex genetic architecture that is orchestrated by many interacting genes. In this paper, we present a comparative genomic study of olfactory behavior in Drosophila including an extended set of genes known to affect olfactory behavior. We took advantage of the recent burst of whole genome sequences and the development of powerful statistical tools to analyze genomic data and test evolutionary and functional hypotheses of olfactory genes in the six species of the Drosophila melanogaster species group for which whole genome sequences are available. Our study reveals widespread purifying selection and limited incidence of positive selection on olfactory genes. We show that the pace of evolution of olfactory genes is mostly independent of the life cycle stage, and of the number of life cycle stages, in which they participate in olfaction. However, we detected a relationship between evolutionary rates and the position that the gene products occupy in the olfactory system, genes occupying central positions tend to be more constrained than peripheral genes. Finally, we demonstrate that specialization to one host does not seem to be associated with bursts of adaptive evolution in olfactory genes in D. sechellia and D. erecta, the two specialists species analyzed, but rather different lineages have idiosyncratic evolutionary histories in which both historical and ecological factors have been involved. PMID:22346339

  12. Intraspecies Genomic Groups in Enterococcus faecium and Their Correlation with Origin and Pathogenicity

    PubMed Central

    Vancanneyt, Marc; Lombardi, Angiolella; Andrighetto, Christian; Knijff, Edo; Torriani, Sandra; Björkroth, K. Johanna; Franz, Charles M. A. P.; Foulquié Moreno, María R.; Revets, Hilde; Vuyst, Luc De; Swings, Jean; Kersters, Karel; Dellaglio, Franco; Holzapfel, Wilhelm H.

    2002-01-01

    Seventy-eight Enterococcus faecium strains from various sources were characterized by random amplified polymorphic DNA (RAPD)-PCR, amplified fragment length polymorphism (AFLP), and pulsed-field gel electrophoresis (PFGE) analysis of SmaI restriction patterns. Two main genomic groups (I and II) were obtained in both RAPD-PCR and AFLP analyses. DNA-DNA hybridization values between representative strains of both groups demonstrated a mean DNA-DNA reassociation level of 71%. PFGE analysis revealed high genetic strain diversity within the two genomic groups. Only group I contained strains originating from human clinical samples or strains that were vancomycin-resistant or beta-hemolytic. No differentiating phenotypic features between groups I and II were found using the rapid ID 32 STREP system. The two groups could be further subdivided into, respectively, four and three subclusters in both RAPD-PCR and AFLP analyses, and a high correlation was seen between the subclusters generated by these two methods. Subclusters of group I were to some extent correlated with origin, pathogenicity, and bacteriocinogeny of the strains. Host specificity of E. faecium strains was not confirmed. PMID:11872491

  13. Genome size in Hieracium subgenus Hieracium (Asteraceae) is strongly correlated with major phylogenetic groups

    PubMed Central

    Chrtek, Jindřich; Zahradníček, Jaroslav; Krak, Karol; Fehrer, Judith

    2009-01-01

    Background and Aims Hieracium subgenus Hieracium is one of the taxonomically most intricate groups of vascular plants, due to polyploidy and a diversity of breeeding systems (sexuality vs. apomixis). The aim of the present study was to analyse nuclear genome size in a phylogenetic framework and to assess relationships between genome size and ploidy, breeding system and selected ecogeographic features. Methods Holoploid and monoploid genome sizes (C- and Cx-values) of 215 cultivated plants from 89 field populations of 42 so-called ‘basic’ Hieracium species were determined using propidium iodide flow cytometry. Chromosome counts were available for all analysed plants, and all plants were tested experimentally for their mode of reproduction (sexuality vs. apomixis). For constructing molecular phylogenetic trees, the external transcribed spacer region of nuclear ribosomal DNA was used. Key Results The mean 2C values differed up to 2·37-fold among different species (from 7·03 pg in diploid to 16·67 in tetraploid accessions). The 1Cx values varied 1·22-fold (between 3·51 and 4·34 pg). Variation in 1Cx values between conspecific (species in a broad sense) accessions ranged from 0·24% to 7·2%. Little variation (not exceeding the approximate measurement inaccurracy threshold of 3·5%) was found in 33 species, whereas variation higher than 3·5% was detected in seven species. Most of the latter may have a polytopic origin. Mean 1Cx values of the three cytotypes (2n, 3n and 4n) differed significantly (average of 3·93 pg in diploids, 3·82 pg in triploids and 3·78 pg in tetraploids) indicating downsizing of genomes in polyploids. The pattern of genome size variation correlated well with two major phylogenetic clades which were composed of species with western or eastern European origin. The monoploid genome size in the ‘western’ species was significantly lower than in the ‘eastern’ ones. Correlation of genome size with latitude, altitude and selected

  14. Genome sequence of Candidatus Nitrososphaera evergladensis from group I.1b enriched from Everglades soil reveals novel genomic features of the ammonia-oxidizing archaea.

    PubMed

    Zhalnina, Kateryna V; Dias, Raquel; Leonard, Michael T; Dorr de Quadros, Patricia; Camargo, Flavio A O; Drew, Jennifer C; Farmerie, William G; Daroub, Samira H; Triplett, Eric W

    2014-01-01

    The activity of ammonia-oxidizing archaea (AOA) leads to the loss of nitrogen from soil, pollution of water sources and elevated emissions of greenhouse gas. To date, eight AOA genomes are available in the public databases, seven are from the group I.1a of the Thaumarchaeota and only one is from the group I.1b, isolated from hot springs. Many soils are dominated by AOA from the group I.1b, but the genomes of soil representatives of this group have not been sequenced and functionally characterized. The lack of knowledge of metabolic pathways of soil AOA presents a critical gap in understanding their role in biogeochemical cycles. Here, we describe the first complete genome of soil archaeon Candidatus Nitrososphaera evergladensis, which has been reconstructed from metagenomic sequencing of a highly enriched culture obtained from an agricultural soil. The AOA enrichment was sequenced with the high throughput next generation sequencing platforms from Pacific Biosciences and Ion Torrent. The de novo assembly of sequences resulted in one 2.95 Mb contig. Annotation of the reconstructed genome revealed many similarities of the basic metabolism with the rest of sequenced AOA. Ca. N. evergladensis belongs to the group I.1b and shares only 40% of whole-genome homology with the closest sequenced relative Ca. N. gargensis. Detailed analysis of the genome revealed coding sequences that were completely absent from the group I.1a. These unique sequences code for proteins involved in control of DNA integrity, transporters, two-component systems and versatile CRISPR defense system. Notably, genomes from the group I.1b have more gene duplications compared to the genomes from the group I.1a. We suggest that the presence of these unique genes and gene duplications may be associated with the environmental versatility of this group.

  15. Genome Sequence of Candidatus Nitrososphaera evergladensis from Group I.1b Enriched from Everglades Soil Reveals Novel Genomic Features of the Ammonia-Oxidizing Archaea

    PubMed Central

    Zhalnina, Kateryna V.; Dias, Raquel; Leonard, Michael T.; Dorr de Quadros, Patricia; Camargo, Flavio A. O.; Drew, Jennifer C.; Farmerie, William G.; Daroub, Samira H.; Triplett, Eric W.

    2014-01-01

    The activity of ammonia-oxidizing archaea (AOA) leads to the loss of nitrogen from soil, pollution of water sources and elevated emissions of greenhouse gas. To date, eight AOA genomes are available in the public databases, seven are from the group I.1a of the Thaumarchaeota and only one is from the group I.1b, isolated from hot springs. Many soils are dominated by AOA from the group I.1b, but the genomes of soil representatives of this group have not been sequenced and functionally characterized. The lack of knowledge of metabolic pathways of soil AOA presents a critical gap in understanding their role in biogeochemical cycles. Here, we describe the first complete genome of soil archaeon Candidatus Nitrososphaera evergladensis, which has been reconstructed from metagenomic sequencing of a highly enriched culture obtained from an agricultural soil. The AOA enrichment was sequenced with the high throughput next generation sequencing platforms from Pacific Biosciences and Ion Torrent. The de novo assembly of sequences resulted in one 2.95 Mb contig. Annotation of the reconstructed genome revealed many similarities of the basic metabolism with the rest of sequenced AOA. Ca. N. evergladensis belongs to the group I.1b and shares only 40% of whole-genome homology with the closest sequenced relative Ca. N. gargensis. Detailed analysis of the genome revealed coding sequences that were completely absent from the group I.1a. These unique sequences code for proteins involved in control of DNA integrity, transporters, two-component systems and versatile CRISPR defense system. Notably, genomes from the group I.1b have more gene duplications compared to the genomes from the group I.1a. We suggest that the presence of these unique genes and gene duplications may be associated with the environmental versatility of this group. PMID:24999826

  16. A genomic insight into diversity among tribal and nontribal population groups of Manipur, India.

    PubMed

    Saraswathy, K N; Kiranmala, Naorem; Murry, Benrithung; Sinha, Ekata; Saksena, Deepti; Kaur, Harpreet; Sachdeva, M P; Kalla, A K

    2009-10-01

    Twenty autosomal markers, including linked markers at two gene markers, are used to understand the genomic similarity and diversity among three tribal (Paite, Thadou, and Kom) and one nontribal communities of Manipur (Northeast India). Two of the markers (CD4 and HB9) are monomorphic in Paite and one (the CD4 marker) in Kom. Data suggest the Meitei (nontribal groups) stand apart from the three tribal groups with respect to higher heterozygosity (0.366) and presence of the highest ancestor haplotypes of DRD2 markers (0.228); this is also supported by principal co-ordinate analysis. These populations are found to be genomically closer to the Chinese population than to other Indian populations.

  17. Analysis of the full genome of human group C rotaviruses reveals lineage diversification and reassortment.

    PubMed

    Medici, Maria Cristina; Tummolo, Fabio; Martella, Vito; Arcangeletti, Maria Cristina; De Conto, Flora; Chezzi, Carlo; Fehér, Enikő; Marton, Szilvia; Calderaro, Adriana; Bányai, Krisztián

    2016-08-01

    Group C rotaviruses (RVC) are enteric pathogens of humans and animals. Whole-genome sequences are available only for few RVCs, leaving gaps in our knowledge about their genetic diversity. We determined the full-length genome sequence of two human RVCs (PR2593/2004 and PR713/2012), detected in Italy from hospital-based surveillance for rotavirus infection in 2004 and 2012. In the 11 RNA genomic segments, the two Italian RVCs segregated within separate intra-genotypic lineages showed variation ranging from 1.9 % (VP6) to 15.9 % (VP3) at the nucleotide level. Comprehensive analysis of human RVC sequences available in the databases allowed us to reveal the existence of at least two major genome configurations, defined as type I and type II. Human RVCs of type I were all associated with the M3 VP3 genotype, including the Italian strain PR2593/2004. Conversely, human RVCs of type II were all associated with the M2 VP3 genotype, including the Italian strain PR713/2012. Reassortant RVC strains between these major genome configurations were identified. Although only a few full-genome sequences of human RVCs, mostly of Asian origin, are available, the analysis of human RVC sequences retrieved from the databases indicates that at least two intra-genotypic RVC lineages circulate in European countries. Gathering more sequence data is necessary to develop a standardized genotype and intra-genotypic lineage classification system useful for epidemiological investigations and avoiding confusion in the literature.

  18. Analysis of virus genomes from glacial environments reveals novel virus groups with unusual host interactions

    PubMed Central

    Bellas, Christopher M.; Anesio, Alexandre M.; Barker, Gary

    2015-01-01

    Microbial communities in glacial ecosystems are diverse, active, and subjected to strong viral pressures and infection rates. In this study we analyse putative virus genomes assembled from three dsDNA viromes from cryoconite hole ecosystems of Svalbard and the Greenland Ice Sheet to assess the potential hosts and functional role viruses play in these habitats. We assembled 208 million reads from the virus-size fraction and developed a procedure to select genuine virus scaffolds from cellular contamination. Our curated virus library contained 546 scaffolds up to 230 Kb in length, 54 of which were circular virus consensus genomes. Analysis of virus marker genes revealed a wide range of viruses had been assembled, including bacteriophages, cyanophages, nucleocytoplasmic large DNA viruses and a virophage, with putative hosts identified as Cyanobacteria, Alphaproteobacteria, Gammaproteobacteria, Actinobacteria, Firmicutes, eukaryotic algae and amoebae. Whole genome comparisons revealed the majority of circular genome scaffolds (CGS) formed 12 novel groups, two of which contained multiple phage members with plasmid-like properties, including a group of phage-plasmids possessing plasmid-like partition genes and toxin-antitoxin addiction modules to ensure their replication and a satellite phage-plasmid group. Surprisingly we also assembled a phage that not only encoded plasmid partition genes, but a clustered regularly interspaced short palindromic repeat (CRISPR)/Cas adaptive bacterial immune system. One of the spacers was an exact match for another phage in our virome, indicating that in a novel use of the system, the lysogen was potentially capable of conferring immunity on its bacterial host against other phage. Together these results suggest that highly novel and diverse groups of viruses are present in glacial environments, some of which utilize very unusual life strategies and genes to control their replication and maintain a long-term relationship with their hosts

  19. Draft Genome Sequences of 17 French Clostridium botulinum Group III Strains.

    PubMed

    Woudstra, Cédric; Le Maréchal, Caroline; Souillard, Rozenn; Bayon-Auboyer, Marie-Hélène; Mermoud, Isabelle; Desoutter, Denise; Fach, Patrick

    2015-01-01

    Animal botulism is mainly associated with Clostridium botulinum group III strains producing neurotoxin types C, C/D, D, and D/C. In this report, we present the draft genome sequences of fourteen strains of Clostridium botulinum producing type C/D and two strains producing type D/C isolated in France, and one strain producing type D/C that originated from New Caledonia. PMID:26430029

  20. Genome drafts of four phytoplasma strains of the ribosomal group 16SrIII.

    PubMed

    Saccardo, Federica; Martini, Marta; Palmano, Sabrina; Ermacora, Paolo; Scortichini, Marco; Loi, Nazia; Firrao, Giuseppe

    2012-11-01

    By applying a coverage-based read selection and filtration through a healthy plant dataset, and a post-assembly contig selection based on homology and linkage, genome sequence drafts were obtained for four phytoplasma strains belonging to the 16SrIII group (X disease clade), namely Vaccinium Witches' Broom phytoplasma (647 754 nt in 272 contigs), Italian Clover Phyllody phytoplasma strain MA (597 245 nt in 197 contigs), Poinsettia branch-inducing phytoplasma strain JR1 (631 440 nt in 185 contigs) and Milkweed Yellows phytoplasma (583 806 nt in 158 contigs). Despite assignment to different 16SrIII subgroups, the genomes of the four strains were similar, comprising a highly conserved core (92-98 % similar in their nucleotide sequence among each other over alignments about 500 kb in length) and a minor strain-specific component. As far as their protein complement was concerned, they did not differ significantly in their basic metabolism potential from the genomes of other wide-host-range phytoplasmas sequenced previously, but were distinct from strains of other species, as well as among each other, in genes encoding functions conceivably related to interactions with the host, such as membrane trafficking components, proteases, DNA methylases, effectors and several hypothetical proteins of unknown function, some of which are likely secreted through the Sec-dependent secretion system. The four genomes displayed a group of genes encoding hypothetical proteins with high similarity to a central domain of IcmE/DotG, a core component of the type IVB secretion system of Gram-negative Legionella spp. Conversely, genes encoding functional GroES/GroEL chaperones were not detected in any of the four drafts. The results also indicated the significant role of horizontal gene transfer among different 'Candidatus Phytoplasma' species in shaping phytoplasma genomes and promoting their diversity. PMID:22936033

  1. Genomic insights into the taxonomic status of the Bacillus cereus group

    PubMed Central

    Liu, Yang; Lai, Qiliang; Göker, Markus; Meier-Kolthoff, Jan P.; Wang, Meng; Sun, Yamin; Wang, Lei; Shao, Zongze

    2015-01-01

    The identification and phylogenetic relationships of bacteria within the Bacillus cereus group are controversial. This study aimed at determining the taxonomic affiliations of these strains using the whole-genome sequence-based Genome BLAST Distance Phylogeny (GBDP) approach. The GBDP analysis clearly separated 224 strains into 30 clusters, representing eleven known, partially merged species and accordingly 19–20 putative novel species. Additionally, 16S rRNA gene analysis, a novel variant of multi-locus sequence analysis (nMLSA) and screening of virulence genes were performed. The 16S rRNA gene sequence was not sufficient to differentiate the bacteria within this group due to its high conservation. The nMLSA results were consistent with GBDP. Moreover, a fast typing method was proposed using the pycA gene, and where necessary, the ccpA gene. The pXO plasmids and cry genes were widely distributed, suggesting little correlation with the phylogenetic positions of the host bacteria. This might explain why classifications based on virulence characteristics proved unsatisfactory in the past. In summary, this is the first large-scale and systematic study of the taxonomic status of the bacteria within the B. cereus group using whole-genome sequences, and is likely to contribute to further insights into their pathogenicity, phylogeny and adaptation to diverse environments. PMID:26373441

  2. Genomic insights into the taxonomic status of the Bacillus cereus group.

    PubMed

    Liu, Yang; Lai, Qiliang; Göker, Markus; Meier-Kolthoff, Jan P; Wang, Meng; Sun, Yamin; Wang, Lei; Shao, Zongze

    2015-01-01

    The identification and phylogenetic relationships of bacteria within the Bacillus cereus group are controversial. This study aimed at determining the taxonomic affiliations of these strains using the whole-genome sequence-based Genome BLAST Distance Phylogeny (GBDP) approach. The GBDP analysis clearly separated 224 strains into 30 clusters, representing eleven known, partially merged species and accordingly 19-20 putative novel species. Additionally, 16S rRNA gene analysis, a novel variant of multi-locus sequence analysis (nMLSA) and screening of virulence genes were performed. The 16S rRNA gene sequence was not sufficient to differentiate the bacteria within this group due to its high conservation. The nMLSA results were consistent with GBDP. Moreover, a fast typing method was proposed using the pycA gene, and where necessary, the ccpA gene. The pXO plasmids and cry genes were widely distributed, suggesting little correlation with the phylogenetic positions of the host bacteria. This might explain why classifications based on virulence characteristics proved unsatisfactory in the past. In summary, this is the first large-scale and systematic study of the taxonomic status of the bacteria within the B. cereus group using whole-genome sequences, and is likely to contribute to further insights into their pathogenicity, phylogeny and adaptation to diverse environments. PMID:26373441

  3. Genomic insights into the taxonomic status of the Bacillus cereus group.

    PubMed

    Liu, Yang; Lai, Qiliang; Göker, Markus; Meier-Kolthoff, Jan P; Wang, Meng; Sun, Yamin; Wang, Lei; Shao, Zongze

    2015-09-16

    The identification and phylogenetic relationships of bacteria within the Bacillus cereus group are controversial. This study aimed at determining the taxonomic affiliations of these strains using the whole-genome sequence-based Genome BLAST Distance Phylogeny (GBDP) approach. The GBDP analysis clearly separated 224 strains into 30 clusters, representing eleven known, partially merged species and accordingly 19-20 putative novel species. Additionally, 16S rRNA gene analysis, a novel variant of multi-locus sequence analysis (nMLSA) and screening of virulence genes were performed. The 16S rRNA gene sequence was not sufficient to differentiate the bacteria within this group due to its high conservation. The nMLSA results were consistent with GBDP. Moreover, a fast typing method was proposed using the pycA gene, and where necessary, the ccpA gene. The pXO plasmids and cry genes were widely distributed, suggesting little correlation with the phylogenetic positions of the host bacteria. This might explain why classifications based on virulence characteristics proved unsatisfactory in the past. In summary, this is the first large-scale and systematic study of the taxonomic status of the bacteria within the B. cereus group using whole-genome sequences, and is likely to contribute to further insights into their pathogenicity, phylogeny and adaptation to diverse environments.

  4. Extending the cereus group genomics to putative food-bornepathogens of different toxicity

    SciTech Connect

    Lapidus, Alla; Goltsman, Eugene; Auger, Sandrine; Galleron,Nathalie; Segurens, Beatrice; Dossat, Carole; Land, Miriam L.; Broussole,Veronique; Brillard, Julien; Guinebretiere, Marie-Helene; Sanchis,Vincent; Nguen-the, Christophe; Lereclus, Didier; Richardson, Paul; Winker, Patrick; Weissenbach, Jean; Ehrlich, S.Dusko; Sorokin, Alexei

    2006-08-24

    The cereus group represents sporulating soil bacteriacontaining pathogenic strains which may cause diarrheic or emetic foodpoisoning outbreaks. Multiple locus sequence typing revealed a presencein natural samples of these bacteria of about thirty clonal complexes.Application of genomic methods to this group was however biased due tothe major interest for representatives closely related to B. anthracis.Albeit the most important food-borne pathogens were not yet defined,existing dataindicate that they are scattered all over the phylogenetictree. The preliminary analysis of the sequences of three genomesdiscussed in this paper narrows down the gaps in our knowledge of thecereus group. The strain NVH391-98 is a rare but particularly severefood-borne pathogen. Sequencing revealed that the strain must be arepresentative of a novel bacterial species, for which the name Bacilluscytotoxis is proposed. This strain has a reduced genome size compared toother cereus group strains. Genome analysis revealed absence of sigma Bfactor and the presence of genes encoding diarrheic Nhe toxin, notdetected earlier. The strain B. cereus F837/76 represents a clonalcomplex close to that of B. anthracis. Including F837/76, three such B.cereus strains had been sequenced. Alignment of genomes suggests that B.anthracis is their common ancestor. Since such strains often emerge fromclinical cases, they merit a special attention. The third strain, KBAB4,is a typical psychrotrophe characteristic to unbiased soil communities.Phylogenic studies show that in nature it is the most active group interms of gene exchange. Genomic sequence revealed high presence ofextra-chromosomal genetic material (about 530 kb) that may account forthis phenomenon. Genes coding Nhe-like toxin were found on a big plasmidin this strain. This may indicate a potential mechanism of toxicityspread from the psychrotrophic strain community. The results of thisgenomic work and ecological compartments of different strains incite

  5. Whole-Genome Comparison Uncovers Genomic Mutations between Group B Streptococci Sampled from Infected Newborns and Their Mothers

    PubMed Central

    Almeida, Alexandre; Villain, Adrien; Joubrel, Caroline; Touak, Gérald; Sauvage, Elisabeth; Rosinski-Chupin, Isabelle

    2015-01-01

    ABSTRACT Streptococcus agalactiae (group B Streptococcus or GBS), a commensal of the human gut and genitourinary tract, is a leading cause of neonatal infections, in which vertical transmission from mother to child remains the most frequent route of contamination. Here, we investigated whether the progression of GBS from carriage to disease is associated with genomic adaptation. Whole-genome comparison of 47 GBS samples from 19 mother-child pairs uncovered 21 single nucleotide polymorphisms (SNPs) and seven insertions/deletions. Of the SNPs detected, 16 appear to have been fixed in the population sampled whereas five mutations were found to be polymorphic. In the infant strains, 14 mutations were detected, including two independently fixed variants affecting the covRS locus, which is known to encode a major regulatory system of virulence. A one-nucleotide insertion was also identified in the promoter region of the highly immunogenic surface protein Rib gene. Gene expression analysis after incubation in human blood showed that these mutations influenced the expression of virulence-associated genes. Additional identification of three mutated strains in the mothers' milk raised the possibility of the newborns also being a source of contamination for their mothers. Overall, our work showed that GBS strains in carriage and disease scenarios might undergo adaptive changes following colonization. The types and locations of the mutations found, together with the experimental results showing their phenotypic impact, suggest that those in a context of infection were positively selected during the transition of GBS from commensal to pathogen, contributing to an increased capacity to cause disease. IMPORTANCE Group B Streptococcus (GBS) is a major pathogen responsible for neonatal infections. Considering that its colonization of healthy adults is mostly asymptomatic, the mechanisms behind its switch from a commensal to an invasive state are largely unknown. In this work, we

  6. Draft Genome Comparison of Representatives of the Three Dominant Genotype Groups of Dairy Bacillus licheniformis Strains

    PubMed Central

    Dhakal, Rajat; Seale, R. Brent; Deeth, Hilton C.; Craven, Heather

    2014-01-01

    The spore-forming bacterium Bacillus licheniformis is a common contaminant of milk and milk products. Strains of this species isolated from dairy products can be differentiated into three major groups, namely, G, F1, and F2, using random amplification of polymorphic DNA (RAPD) analysis; however, little is known about the genomic differences between these groups and the identity of the fragments that make up their RAPD profiles. In this work we obtained high-quality draft genomes of representative strains from each of the three RAPD groups (designated strain G-1, strain F1-1, and strain F2-1) and compared them to each other and to B. licheniformis ATCC 14580 and Bacillus subtilis 168. Whole-genome comparison and multilocus sequence typing revealed that strain G-1 contains significant sequence variability and belongs to a lineage distinct from the group F strains. Strain G-1 was found to contain genes coding for a type I restriction modification system, urease production, and bacitracin synthesis, as well as the 8-kbp plasmid pFL7, and these genes were not present in strains F1-1 and F2-1. In agreement with this, all isolates of group G, but no group F isolates, were found to possess urease activity and antimicrobial activity against Micrococcus. Identification of RAPD band sequences revealed that differences in the RAPD profiles were due to differences in gene lengths, 3′ ends of predicted primer binding sites, or gene presence or absence. This work provides a greater understanding of the phylogenetic and phenotypic differences observed within the B. licheniformis species. PMID:24657871

  7. Group-theoretic models of the inversion process in bacterial genomes.

    PubMed

    Egri-Nagy, Attila; Gebhardt, Volker; Tanaka, Mark M; Francis, Andrew R

    2014-07-01

    The variation in genome arrangements among bacterial taxa is largely due to the process of inversion. Recent studies indicate that not all inversions are equally probable, suggesting, for instance, that shorter inversions are more frequent than longer, and those that move the terminus of replication are less probable than those that do not. Current methods for establishing the inversion distance between two bacterial genomes are unable to incorporate such information. In this paper we suggest a group-theoretic framework that in principle can take these constraints into account. In particular, we show that by lifting the problem from circular permutations to the affine symmetric group, the inversion distance can be found in polynomial time for a model in which inversions are restricted to acting on two regions. This requires the proof of new results in group theory, and suggests a vein of new combinatorial problems concerning permutation groups on which group theorists will be needed to collaborate with biologists. We apply the new method to inferring distances and phylogenies for published Yersinia pestis data.

  8. Genomic data do not support comb jellies as the sister group to all other animals

    PubMed Central

    Pisani, Davide; Pett, Walker; Dohrmann, Martin; Feuda, Roberto; Rota-Stabelli, Omar; Philippe, Hervé; Lartillot, Nicolas; Wörheide, Gert

    2015-01-01

    Understanding how complex traits, such as epithelia, nervous systems, muscles, or guts, originated depends on a well-supported hypothesis about the phylogenetic relationships among major animal lineages. Traditionally, sponges (Porifera) have been interpreted as the sister group to the remaining animals, a hypothesis consistent with the conventional view that the last common animal ancestor was relatively simple and more complex body plans arose later in evolution. However, this premise has recently been challenged by analyses of the genomes of comb jellies (Ctenophora), which, instead, found ctenophores as the sister group to the remaining animals (the “Ctenophora-sister” hypothesis). Because ctenophores are morphologically complex predators with true epithelia, nervous systems, muscles, and guts, this scenario implies these traits were either present in the last common ancestor of all animals and were lost secondarily in sponges and placozoans (Trichoplax) or, alternatively, evolved convergently in comb jellies. Here, we analyze representative datasets from recent studies supporting Ctenophora-sister, including genome-scale alignments of concatenated protein sequences, as well as a genomic gene content dataset. We found no support for Ctenophora-sister and conclude it is an artifact resulting from inadequate methodology, especially the use of simplistic evolutionary models and inappropriate choice of species to root the metazoan tree. Our results reinforce a traditional scenario for the evolution of complexity in animals, and indicate that inferences about the evolution of Metazoa based on the Ctenophora-sister hypothesis are not supported by the currently available data. PMID:26621703

  9. Genomic data do not support comb jellies as the sister group to all other animals.

    PubMed

    Pisani, Davide; Pett, Walker; Dohrmann, Martin; Feuda, Roberto; Rota-Stabelli, Omar; Philippe, Hervé; Lartillot, Nicolas; Wörheide, Gert

    2015-12-15

    Understanding how complex traits, such as epithelia, nervous systems, muscles, or guts, originated depends on a well-supported hypothesis about the phylogenetic relationships among major animal lineages. Traditionally, sponges (Porifera) have been interpreted as the sister group to the remaining animals, a hypothesis consistent with the conventional view that the last common animal ancestor was relatively simple and more complex body plans arose later in evolution. However, this premise has recently been challenged by analyses of the genomes of comb jellies (Ctenophora), which, instead, found ctenophores as the sister group to the remaining animals (the "Ctenophora-sister" hypothesis). Because ctenophores are morphologically complex predators with true epithelia, nervous systems, muscles, and guts, this scenario implies these traits were either present in the last common ancestor of all animals and were lost secondarily in sponges and placozoans (Trichoplax) or, alternatively, evolved convergently in comb jellies. Here, we analyze representative datasets from recent studies supporting Ctenophora-sister, including genome-scale alignments of concatenated protein sequences, as well as a genomic gene content dataset. We found no support for Ctenophora-sister and conclude it is an artifact resulting from inadequate methodology, especially the use of simplistic evolutionary models and inappropriate choice of species to root the metazoan tree. Our results reinforce a traditional scenario for the evolution of complexity in animals, and indicate that inferences about the evolution of Metazoa based on the Ctenophora-sister hypothesis are not supported by the currently available data. PMID:26621703

  10. Genomic data do not support comb jellies as the sister group to all other animals.

    PubMed

    Pisani, Davide; Pett, Walker; Dohrmann, Martin; Feuda, Roberto; Rota-Stabelli, Omar; Philippe, Hervé; Lartillot, Nicolas; Wörheide, Gert

    2015-12-15

    Understanding how complex traits, such as epithelia, nervous systems, muscles, or guts, originated depends on a well-supported hypothesis about the phylogenetic relationships among major animal lineages. Traditionally, sponges (Porifera) have been interpreted as the sister group to the remaining animals, a hypothesis consistent with the conventional view that the last common animal ancestor was relatively simple and more complex body plans arose later in evolution. However, this premise has recently been challenged by analyses of the genomes of comb jellies (Ctenophora), which, instead, found ctenophores as the sister group to the remaining animals (the "Ctenophora-sister" hypothesis). Because ctenophores are morphologically complex predators with true epithelia, nervous systems, muscles, and guts, this scenario implies these traits were either present in the last common ancestor of all animals and were lost secondarily in sponges and placozoans (Trichoplax) or, alternatively, evolved convergently in comb jellies. Here, we analyze representative datasets from recent studies supporting Ctenophora-sister, including genome-scale alignments of concatenated protein sequences, as well as a genomic gene content dataset. We found no support for Ctenophora-sister and conclude it is an artifact resulting from inadequate methodology, especially the use of simplistic evolutionary models and inappropriate choice of species to root the metazoan tree. Our results reinforce a traditional scenario for the evolution of complexity in animals, and indicate that inferences about the evolution of Metazoa based on the Ctenophora-sister hypothesis are not supported by the currently available data.

  11. Genomic analysis reveals hidden biodiversity within colugos, the sister group to primates.

    PubMed

    Mason, Victor C; Li, Gang; Minx, Patrick; Schmitz, Jürgen; Churakov, Gennady; Doronina, Liliya; Melin, Amanda D; Dominy, Nathaniel J; Lim, Norman T-L; Springer, Mark S; Wilson, Richard K; Warren, Wesley C; Helgen, Kristofer M; Murphy, William J

    2016-08-01

    Colugos are among the most poorly studied mammals despite their centrality to resolving supraordinal primate relationships. Two described species of these gliding mammals are the sole living members of the order Dermoptera, distributed throughout Southeast Asia. We generated a draft genome sequence for a Sunda colugo and a Philippine colugo reference alignment, and used these to identify colugo-specific genetic changes that were enriched in sensory and musculoskeletal-related genes that likely underlie their nocturnal and gliding adaptations. Phylogenomic analysis and catalogs of rare genomic changes overwhelmingly support the contested hypothesis that colugos are the sister group to primates (Primatomorpha), to the exclusion of treeshrews. We captured ~140 kb of orthologous sequence data from colugo museum specimens sampled across their range and identified large genetic differences between many geographically isolated populations that may result in a >300% increase in the number of recognized colugo species. Our results identify conservation units to mitigate future losses of this enigmatic mammalian order. PMID:27532052

  12. Genomic analysis reveals hidden biodiversity within colugos, the sister group to primates

    PubMed Central

    Mason, Victor C.; Li, Gang; Minx, Patrick; Schmitz, Jürgen; Churakov, Gennady; Doronina, Liliya; Melin, Amanda D.; Dominy, Nathaniel J.; Lim, Norman T-L.; Springer, Mark S.; Wilson, Richard K.; Warren, Wesley C.; Helgen, Kristofer M.; Murphy, William J.

    2016-01-01

    Colugos are among the most poorly studied mammals despite their centrality to resolving supraordinal primate relationships. Two described species of these gliding mammals are the sole living members of the order Dermoptera, distributed throughout Southeast Asia. We generated a draft genome sequence for a Sunda colugo and a Philippine colugo reference alignment, and used these to identify colugo-specific genetic changes that were enriched in sensory and musculoskeletal-related genes that likely underlie their nocturnal and gliding adaptations. Phylogenomic analysis and catalogs of rare genomic changes overwhelmingly support the contested hypothesis that colugos are the sister group to primates (Primatomorpha), to the exclusion of treeshrews. We captured ~140 kb of orthologous sequence data from colugo museum specimens sampled across their range and identified large genetic differences between many geographically isolated populations that may result in a >300% increase in the number of recognized colugo species. Our results identify conservation units to mitigate future losses of this enigmatic mammalian order. PMID:27532052

  13. Inactivation of group II intron RmInt1 in the Sinorhizobium meliloti genome.

    PubMed

    Molina-Sánchez, María Dolores; Toro, Nicolás

    2015-07-09

    Group II introns are self-splicing catalytic RNAs that probably originated in bacteria and act as mobile retroelements. The dispersal and dynamics of group II intron spread within a bacterial genome are thought to follow a selection-driven extinction model. Likewise, various studies on the evolution of group II introns have suggested that they are evolving toward an inactive form by fragmentation, with the loss of the intron 3'-terminus, but with some intron fragments remaining and continuing to evolve in the genome. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti, but some strains of this species have no RmInt1 introns. We studied the splicing ability and mobility of the three full-length RmInt1 copies harbored by S. meliloti 1021, and obtained evidence suggesting that specific mutations may lead to the impairment of intron splicing and retrohoming. Our data suggest that the RmInt1 copies in this strain are undergoing a process of inactivation.

  14. New Insights into the Genetic Diversity of Clostridium botulinum Group III through Extensive Genome Exploration.

    PubMed

    Woudstra, Cédric; Le Maréchal, Caroline; Souillard, Rozenn; Bayon-Auboyer, Marie-Hélène; Mermoud, Isabelle; Desoutter, Denise; Fach, Patrick

    2016-01-01

    Animal botulism is caused by group III Clostridium botulinum strains producing type C and D toxins, or their chimeric forms C/D and D/C. Animal botulism is considered an emerging disease in Europe, notably in poultry production. Before our study, 14 genomes from different countries were available in the public database, but none were from France. In order to investigate the genetic relationship of French strains with different geographical areas and find new potential typing targets, 17 strains of C. botulinum group III were sequenced (16 from France and one from New Caledonia). Fourteen were type C/D strains isolated from chickens, ducks, guinea fowl and turkeys and three were type D/C strains isolated from cattle. The New Caledonian strain was a type D/C strain. Whole genome sequence analysis showed the French strains to be closely related to European strains from C. botulinum group III lineages Ia and Ib. The investigation of CRISPR sequences as genetic targets for differentiating strains in group III proved to be irrelevant for type C/D due to a deficient CRISPR/Cas mechanism, but not for type D/C. Conversely, the extrachromosomal elements of type C/D strains could be used to generate a genetic ID card. The highest level of discrimination was achieved with SNP core phylogeny, which allowed differentiation up to strain level and provide the most relevant information for genetic epidemiology studies and discrimination.

  15. New Insights into the Genetic Diversity of Clostridium botulinum Group III through Extensive Genome Exploration

    PubMed Central

    Woudstra, Cédric; Le Maréchal, Caroline; Souillard, Rozenn; Bayon-Auboyer, Marie-Hélène; Mermoud, Isabelle; Desoutter, Denise; Fach, Patrick

    2016-01-01

    Animal botulism is caused by group III Clostridium botulinum strains producing type C and D toxins, or their chimeric forms C/D and D/C. Animal botulism is considered an emerging disease in Europe, notably in poultry production. Before our study, 14 genomes from different countries were available in the public database, but none were from France. In order to investigate the genetic relationship of French strains with different geographical areas and find new potential typing targets, 17 strains of C. botulinum group III were sequenced (16 from France and one from New Caledonia). Fourteen were type C/D strains isolated from chickens, ducks, guinea fowl and turkeys and three were type D/C strains isolated from cattle. The New Caledonian strain was a type D/C strain. Whole genome sequence analysis showed the French strains to be closely related to European strains from C. botulinum group III lineages Ia and Ib. The investigation of CRISPR sequences as genetic targets for differentiating strains in group III proved to be irrelevant for type C/D due to a deficient CRISPR/Cas mechanism, but not for type D/C. Conversely, the extrachromosomal elements of type C/D strains could be used to generate a genetic ID card. The highest level of discrimination was achieved with SNP core phylogeny, which allowed differentiation up to strain level and provide the most relevant information for genetic epidemiology studies and discrimination. PMID:27242769

  16. New Insights into the Genetic Diversity of Clostridium botulinum Group III through Extensive Genome Exploration.

    PubMed

    Woudstra, Cédric; Le Maréchal, Caroline; Souillard, Rozenn; Bayon-Auboyer, Marie-Hélène; Mermoud, Isabelle; Desoutter, Denise; Fach, Patrick

    2016-01-01

    Animal botulism is caused by group III Clostridium botulinum strains producing type C and D toxins, or their chimeric forms C/D and D/C. Animal botulism is considered an emerging disease in Europe, notably in poultry production. Before our study, 14 genomes from different countries were available in the public database, but none were from France. In order to investigate the genetic relationship of French strains with different geographical areas and find new potential typing targets, 17 strains of C. botulinum group III were sequenced (16 from France and one from New Caledonia). Fourteen were type C/D strains isolated from chickens, ducks, guinea fowl and turkeys and three were type D/C strains isolated from cattle. The New Caledonian strain was a type D/C strain. Whole genome sequence analysis showed the French strains to be closely related to European strains from C. botulinum group III lineages Ia and Ib. The investigation of CRISPR sequences as genetic targets for differentiating strains in group III proved to be irrelevant for type C/D due to a deficient CRISPR/Cas mechanism, but not for type D/C. Conversely, the extrachromosomal elements of type C/D strains could be used to generate a genetic ID card. The highest level of discrimination was achieved with SNP core phylogeny, which allowed differentiation up to strain level and provide the most relevant information for genetic epidemiology studies and discrimination. PMID:27242769

  17. Complete Genome Sequence of the RmInt1 Group II Intronless Sinorhizobium meliloti Strain RMO17

    PubMed Central

    Martínez-Abarca, Francisco; Nisa-Martínez, Rafael

    2014-01-01

    We report the complete genome sequence of the RmInt1 group II intronless Sinorhizobium meliloti strain RMO17 isolated from Medicago orbicularis nodules from Spanish soil. The genome consists of 6.73 Mb distributed between a single chromosome and two megaplasmids (the chromid pSymB and pSymA). PMID:25301650

  18. Genomic Characterization of Group C Orthobunyavirus Reference Strains and Recent South American Clinical Isolates

    PubMed Central

    Yang, Yu; Solórzano, Víctor Fiestas; Kuschner, Robert A.; Halsey, Eric S.; Jarman, Richard G.; Kochel, Tadeusz J.

    2014-01-01

    Group C orthobunyaviruses (family Bunyaviridae, genus Orthobunyavirus), discovered in the 1950s, are vector-borne human pathogens in the Americas. Currently there is a gap in genomic information for group C viruses. In this study, we obtained complete coding region sequences of reference strains of Caraparu (CARV), Oriboca (ORIV), Marituba (MTBV) and Madrid (MADV) viruses, and five clinical isolates from Peru and Bolivia, using an unbiased de novo approach consisting of random reverse transcription, random anchored PCR amplification, and high throughput pyrosequencing. The small, medium, and large segments encode for a 235 amino acid nucleocapsid protein, an approximately 1430 amino acid surface glycoprotein polyprotein precursor, and a 2248 amino acid RNA-dependent RNA polymerase, respectively. Additionally, the S segment encodes for an 83 amino acid non-structural protein, although this protein is truncated or silenced in some isolates. Phylogenetically, three clinical isolates clustered with CARV, one clustered with MTBV, and one isolate appeared to be a reassortant or a genetic drift resulted from the high variability of the medium segment which was also seen in a few other orthobunyaviruses. These data represent the first complete coding region sequences for this serocomplex of pathogenic orthobunyaviruses. The genome-wide phylogeny of reference strains is consistent with the antigenic properties of the viruses reported in the original serological studies conducted in the 1960s. Comparative analysis of conserved protein regions across group C virus strains and the other orthobunyavirus groups revealed that these group C viruses contain characteristic domains of potential structural and functional significance. Our results provide the basis for the developments of diagnostics, further genetic analyses, and future epidemiologic studies of group C viruses. PMID:24633174

  19. Phylogeny of a Genomically Diverse Group of Elymus (Poaceae) Allopolyploids Reveals Multiple Levels of Reticulation

    PubMed Central

    Mason-Gamer, Roberta J.

    2013-01-01

    The grass tribe Triticeae (=Hordeeae) comprises only about 300 species, but it is well known for the economically important crop plants wheat, barley, and rye. The group is also recognized as a fascinating example of evolutionary complexity, with a history shaped by numerous events of auto- and allopolyploidy and apparent introgression involving diploids and polyploids. The genus Elymus comprises a heterogeneous collection of allopolyploid genome combinations, all of which include at least one set of homoeologs, designated St, derived from Pseudoroegneria. The current analysis includes a geographically and genomically diverse collection of 21 tetraploid Elymus species, and a single hexaploid species. Diploid and polyploid relationships were estimated using four molecular data sets, including one that combines two regions of the chloroplast genome, and three from unlinked nuclear genes: phosphoenolpyruvate carboxylase, β-amylase, and granule-bound starch synthase I. Four gene trees were generated using maximum likelihood, and the phylogenetic placement of the polyploid sequences reveals extensive reticulation beyond allopolyploidy alone. The trees were interpreted with reference to numerous phenomena known to complicate allopolyploid phylogenies, and introgression was identified as a major factor in their history. The work illustrates the interpretation of complicated phylogenetic results through the sequential consideration of numerous possible explanations, and the results highlight the value of careful inspection of multiple independent molecular phylogenetic estimates, with particular focus on the differences among them. PMID:24302986

  20. Developing improved durum wheat germplasm by altering the cytoplasmic genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In eukaryotic organisms, nuclear and cytoplasmic genomes interact to drive cellular functions. These genomes have co-evolved to form specific nuclear-cytoplasmic interactions that are essential to the origin, success, and evolution of diploid and polyploid species. Hundreds of genetic diseases in h...

  1. Unique Mitochondrial Genome Structure in Diplonemids, the Sister Group of Kinetoplastids

    PubMed Central

    Marande, William; Lukeš, Julius; Burger, Gertraud

    2005-01-01

    Kinetoplastid flagellates are characterized by uniquely massed mitochondrial DNAs (mtDNAs), the kinetoplasts. Kinetoplastids of the trypanosomatid group possess two types of mtDNA molecules: maxicircles bearing protein and mitoribosomal genes and minicircles specifying guide RNAs, which mediate uridine insertion/deletion RNA editing. These circles are interlocked with one another to form dense networks. Whether these peculiar mtDNA features are restricted to kinetoplastids or prevail throughout Euglenozoa (euglenids, diplonemids, and kinetoplastids) is unknown. Here, we describe the mitochondrial genome and the mitochondrial ultrastructure of Diplonema papillatum, a member of the diplonemid flagellates, the sister group of kinetoplastids. Fluorescence and electron microscopy show a single mitochondrion per cell with an ultrastructure atypical for Euglenozoa. In addition, DNA is evenly distributed throughout the organelle rather than compacted. Molecular and electron microscopy studies distinguish numerous 6- and 7-kbp-sized mitochondrial chromosomes of monomeric circular topology and relaxed conformation in vivo. Remarkably, the cox1 gene (and probably other mitochondrial genes) is fragmented, with separate gene pieces encoded on different chromosomes. Generation of the contiguous cox1 mRNA requires trans-splicing, the precise mechanism of which remains to be determined. Taken together, the mitochondrial gene/genome structure of Diplonema is not only different from that of kinetoplastids but unique among eukaryotes as a whole. PMID:15947205

  2. The co-evolved Helicobacter pylori and gastric cancer: trinity of bacterial virulence, host susceptibility and lifestyle.

    PubMed

    Akhter, Yusuf; Ahmed, Irshad; Devi, S Manjulata; Ahmed, Niyaz

    2007-01-04

    Helicobacter pylori is an important yet unproven etiological agent of gastric cancer. H. pylori infection is more prevalent in developing Asian countries like India and it is usually acquired at an early age. It has been two decades since Marshall and Warren (1984) first described curved bacilli in the stomach of ulcer and gastritis patients. This discovery has won them the Nobel Prize recently, but the debate whether H. pylori is a pathogen or a commensal organism is still hot. Associations with disease-specific factors remain elusive years after the genome sequences were made available. Cytotoxin-associated antigen A (CagA) and the so-called plasticity region cluster genes are implicated in pathogenesis of the carcinoma of stomach. Another virulence factor VacA whose role is still debatable, has recently been projected in pathology of gastric cancer. Studies of the evolution through genetic variation in H. pylori populations have provided a window into the history of human population migrations and a possible co-evolution of this pathogen with its human host. Possible symbiotic relationships were seriously debated since the discovery of this pathogen. The debate has been further intensified as some studies proposed H. pylori infection to be beneficial in some humans. In this commentary, we attempt to briefly discuss about H. pylori as a human pathogen, and some of the important issues linked to its pathophysiology in different hosts. 'We dance around in a ring and suppose, the secret sits in the middle and knows'--Robert Frost.

  3. Proliferation of group II introns in the chloroplast genome of the green alga Oedocladium carolinianum (Chlorophyceae)

    PubMed Central

    Otis, Christian

    2016-01-01

    Background The chloroplast genome sustained extensive changes in architecture during the evolution of the Chlorophyceae, a morphologically and ecologically diverse class of green algae belonging to the Chlorophyta; however, the forces driving these changes are poorly understood. The five orders recognized in the Chlorophyceae form two major clades: the CS clade consisting of the Chlamydomonadales and Sphaeropleales, and the OCC clade consisting of the Oedogoniales, Chaetophorales, and Chaetopeltidales. In the OCC clade, considerable variations in chloroplast DNA (cpDNA) structure, size, gene order, and intron content have been observed. The large inverted repeat (IR), an ancestral feature characteristic of most green plants, is present in Oedogonium cardiacum (Oedogoniales) but is lacking in the examined members of the Chaetophorales and Chaetopeltidales. Remarkably, the Oedogonium 35.5-kb IR houses genes that were putatively acquired through horizontal DNA transfer. To better understand the dynamics of chloroplast genome evolution in the Oedogoniales, we analyzed the cpDNA of a second representative of this order, Oedocladium carolinianum. Methods The Oedocladium cpDNA was sequenced and annotated. The evolutionary distances separating Oedocladium and Oedogonium cpDNAs and two other pairs of chlorophycean cpDNAs were estimated using a 61-gene data set. Phylogenetic analysis of an alignment of group IIA introns from members of the OCC clade was performed. Secondary structures and insertion sites of oedogonialean group IIA introns were analyzed. Results The 204,438-bp Oedocladium genome is 7.9 kb larger than the Oedogonium genome, but its repertoire of conserved genes is remarkably similar and gene order differs by only one reversal. Although the 23.7-kb IR is missing the putative foreign genes found in Oedogonium, it contains sequences coding for a putative phage or bacterial DNA primase and a hypothetical protein. Intergenic sequences are 1.5-fold longer and

  4. Differentiation of strains from the Bacillus cereus group by RFLP-PFGE genomic fingerprinting.

    PubMed

    Otlewska, Anna; Oltuszak-Walczak, Elzbieta; Walczak, Piotr

    2013-11-01

    Bacillus mycoides, Bacillus pseudomycoides, Bacillus weihenstephanensis, Bacillus anthracis, Bacillus thuringiensis, and Bacillus cereus belong to the B. cereus group. The last three species are characterized by different phenotype features and pathogenicity spectrum, but it has been shown that these species are genetically closely related. The macrorestriction analysis of the genomic DNA with the NotI enzyme was used to generate polymorphism of restriction profiles for 39 food-borne isolates (B. cereus, B. mycoides) and seven reference strains (B. mycoides, B. thuringiensis, B. weihenstephanensis, and B. cereus). The PFGE method was applied to differentiate the examined strains of the B. cereus group. On the basis of the unweighted pair group method with the arithmetic mean method and Dice coefficient, the strains were divided into five clusters (types A-E), and the most numerous group was group A (25 strains). A total of 21 distinct pulsotypes were observed. The RFLP-PFGE analysis was successfully used for the differentiation and characterization of B. cereus and B. mycoides strains isolated from different food products.

  5. The co-evolved Helicobacter pylori and gastric cancer: trinity of bacterial virulence, host susceptibility and lifestyle.

    PubMed

    Akhter, Yusuf; Ahmed, Irshad; Devi, S Manjulata; Ahmed, Niyaz

    2007-01-01

    Helicobacter pylori is an important yet unproven etiological agent of gastric cancer. H. pylori infection is more prevalent in developing Asian countries like India and it is usually acquired at an early age. It has been two decades since Marshall and Warren (1984) first described curved bacilli in the stomach of ulcer and gastritis patients. This discovery has won them the Nobel Prize recently, but the debate whether H. pylori is a pathogen or a commensal organism is still hot. Associations with disease-specific factors remain elusive years after the genome sequences were made available. Cytotoxin-associated antigen A (CagA) and the so-called plasticity region cluster genes are implicated in pathogenesis of the carcinoma of stomach. Another virulence factor VacA whose role is still debatable, has recently been projected in pathology of gastric cancer. Studies of the evolution through genetic variation in H. pylori populations have provided a window into the history of human population migrations and a possible co-evolution of this pathogen with its human host. Possible symbiotic relationships were seriously debated since the discovery of this pathogen. The debate has been further intensified as some studies proposed H. pylori infection to be beneficial in some humans. In this commentary, we attempt to briefly discuss about H. pylori as a human pathogen, and some of the important issues linked to its pathophysiology in different hosts. 'We dance around in a ring and suppose, the secret sits in the middle and knows'--Robert Frost. PMID:17201930

  6. TYPE-I X-RAY BURSTS REVEAL A FAST CO-EVOLVING BEHAVIOR OF THE CORONA IN AN X-RAY BINARY

    SciTech Connect

    Chen, Yu-Peng; Zhang, Shu; Zhang, Shuang-Nan; Li, Jian; Wang, Jian-Min E-mail: szhang@ihep.ac.cn

    2012-06-20

    The coronae in X-ray binaries (XRBs) still remain poorly understood, although they have been believed for a long time to play a key role in modeling the characteristic outbursts of XRBs. Type-I X-ray bursts, the thermonuclear flashes happening on the surface of a neutron star (NS), can be used as a probe to the innermost region of an NS XRB, where the corona is believed to be located very close to the NS. We report the discovery of a tiny life cycle of the corona that is promptly co-evolved with the type-I bursts superimposed on the outburst of the NS XRB IGR J17473-2721. This finding may serve as the first evidence of directly seeing the rapid disappearance and formation of a corona in an XRB with a cooling/heating timescale of less than a second, which can strongly constrain the accretion models in XRBs at work.

  7. Investigating the potential for ethnic group harm in collaborative genomics research in Africa: Is ethnic stigmatisation likely?

    PubMed Central

    de Vries, Jantina; Jallow, Muminatou; Williams, Thomas N.; Kwiatkowski, Dominic; Parker, Michael; Fitzpatrick, Raymond

    2013-01-01

    A common assumption in genomics research is that the use of ethnic categories has the potential to lead to ethnic stigmatisation – particularly when the research is done on minority populations. Yet few empirical studies have sought to investigate the relation between genomics and stigma, and fewer still with a focus on Africa. In this paper, we investigate the potential for genomics research to lead to harms to ethnic groups. We carried out 49 semi-structured, open-ended interviews with stakeholders in a current medical genomics research project in Africa, MalariaGEN. Interviews were conducted with MalariaGEN researchers, fieldworkers, members of three ethics committees who reviewed MalariaGEN project proposals, and with members of the two funding bodies providing support to the MalariaGEN project. Interviews were conducted in Kenya, The Gambia and the UK between June 2008 and October 2009. They covered a range of aspects relating to the use of ethnicity in the genomics project, including views on adverse effects of the inclusion of ethnicity in such research. Drawing on the empirical data, we argue that the risk of harm to ethnic groups is likely to be more acute in specific types of genomics research. We develop a typology of research questions and projects that carry a greater risk of harm to the populations included in genomics research. We conclude that the potential of generating harm to ethnic groups in genomics research is present if research includes populations that are already stigmatised or discriminated against, or where the research investigates questions with particular normative implications. We identify a clear need for genomics researchers to take account of the social context of the work they are proposing to do, including understanding the local realities and relations between ethnic groups, and whether diseases are already stigmatised. PMID:22749442

  8. Genomic analysis reveals the molecular basis for capsule loss in the group B Streptococcus population.

    PubMed

    Rosini, Roberto; Campisi, Edmondo; De Chiara, Matteo; Tettelin, Hervé; Rinaudo, Daniela; Toniolo, Chiara; Metruccio, Matteo; Guidotti, Silvia; Sørensen, Uffe B Skov; Kilian, Mogens; Ramirez, Mario; Janulczyk, Robert; Donati, Claudio; Grandi, Guido; Margarit, Immaculada

    2015-01-01

    The human and bovine bacterial pathogen Streptococcus agalactiae (Group B Streptococcus, GBS) expresses a thick polysaccharide capsule that constitutes a major virulence factor and vaccine target. GBS can be classified into ten distinct serotypes differing in the chemical composition of their capsular polysaccharide. However, non-typeable strains that do not react with anti-capsular sera are frequently isolated from colonized and infected humans and cattle. To gain a comprehensive insight into the molecular basis for the loss of capsule expression in GBS, a collection of well-characterized non-typeable strains was investigated by genome sequencing. Genome based phylogenetic analysis extended to a wide population of sequenced strains confirmed the recently observed high clonality among GBS lineages mainly containing human strains, and revealed a much higher degree of diversity in the bovine population. Remarkably, non-typeable strains were equally distributed in all lineages. A number of distinct mutations in the cps operon were identified that were apparently responsible for inactivation of capsule synthesis. The most frequent genetic alterations were point mutations leading to stop codons in the cps genes, and the main target was found to be cpsE encoding the portal glycosyl transferase of capsule biosynthesis. Complementation of strains carrying missense mutations in cpsE with a wild-type gene restored capsule expression allowing the identification of amino acid residues essential for enzyme activity.

  9. Molecular and genomic characterization of pathogenic traits of group A Streptococcus pyogenes.

    PubMed

    Hamada, Shigeyuki; Kawabata, Shigetada; Nakagawa, Ichiro

    2015-01-01

    Group A streptococcus (GAS) or Streptococcus pyogenes causes various diseases ranging from self-limiting sore throat to deadly invasive diseases. The genome size of GAS is 1.85-1.9 Mb, and genomic rearrangement has been demonstrated. GAS possesses various surface-associated substances such as hyaluronic capsule, M proteins, and fibronectin/laminin/immunoglobulin-binding proteins. These are related to the virulence and play multifaceted and mutually reflected roles in the pathogenesis of GAS infections. Invasion of GAS into epithelial cells and deeper tissues provokes immune and non-immune defense or inflammatory responses including the recruitment of neutrophils, macrophages, and dendritic cells in hosts. GAS frequently evades host defense mechanisms by using its virulence factors. Extracellular products of GAS may perturb cellular and subcellular functions and degrade tissues enzymatically, which leads to the aggravation of local and/or systemic disorders in the host. In this review, we summarize some important cellular and extracellular substances that may affect pathogenic processes during GAS infections, and the host responses to these.

  10. Genomic Analysis Reveals the Molecular Basis for Capsule Loss in the Group B Streptococcus Population

    PubMed Central

    Rosini, Roberto; Campisi, Edmondo; De Chiara, Matteo; Tettelin, Hervé; Rinaudo, Daniela; Toniolo, Chiara; Metruccio, Matteo; Guidotti, Silvia; Sørensen, Uffe B. Skov; Kilian, Mogens; Ramirez, Mario; Janulczyk, Robert; Donati, Claudio; Grandi, Guido; Margarit, Immaculada

    2015-01-01

    The human and bovine bacterial pathogen Streptococcus agalactiae (Group B Streptococcus, GBS) expresses a thick polysaccharide capsule that constitutes a major virulence factor and vaccine target. GBS can be classified into ten distinct serotypes differing in the chemical composition of their capsular polysaccharide. However, non-typeable strains that do not react with anti-capsular sera are frequently isolated from colonized and infected humans and cattle. To gain a comprehensive insight into the molecular basis for the loss of capsule expression in GBS, a collection of well-characterized non-typeable strains was investigated by genome sequencing. Genome based phylogenetic analysis extended to a wide population of sequenced strains confirmed the recently observed high clonality among GBS lineages mainly containing human strains, and revealed a much higher degree of diversity in the bovine population. Remarkably, non-typeable strains were equally distributed in all lineages. A number of distinct mutations in the cps operon were identified that were apparently responsible for inactivation of capsule synthesis. The most frequent genetic alterations were point mutations leading to stop codons in the cps genes, and the main target was found to be cpsE encoding the portal glycosyl trasferase of capsule biosynthesis. Complementation of strains carrying missense mutations in cpsE with a wild-type gene restored capsule expression allowing the identification of amino acid residues essential for enzyme activity. PMID:25946017

  11. Molecular and genomic characterization of pathogenic traits of group A Streptococcus pyogenes

    PubMed Central

    HAMADA, Shigeyuki; KAWABATA, Shigetada; NAKAGAWA, Ichiro

    2015-01-01

    Group A streptococcus (GAS) or Streptococcus pyogenes causes various diseases ranging from self-limiting sore throat to deadly invasive diseases. The genome size of GAS is 1.85–1.9 Mb, and genomic rearrangement has been demonstrated. GAS possesses various surface-associated substances such as hyaluronic capsule, M proteins, and fibronectin/laminin/immunoglobulin-binding proteins. These are related to the virulence and play multifaceted and mutually reflected roles in the pathogenesis of GAS infections. Invasion of GAS into epithelial cells and deeper tissues provokes immune and non-immune defense or inflammatory responses including the recruitment of neutrophils, macrophages, and dendritic cells in hosts. GAS frequently evades host defense mechanisms by using its virulence factors. Extracellular products of GAS may perturb cellular and subcellular functions and degrade tissues enzymatically, which leads to the aggravation of local and/or systemic disorders in the host. In this review, we summarize some important cellular and extracellular substances that may affect pathogenic processes during GAS infections, and the host responses to these. PMID:26666305

  12. Genomic analysis reveals the molecular basis for capsule loss in the group B Streptococcus population.

    PubMed

    Rosini, Roberto; Campisi, Edmondo; De Chiara, Matteo; Tettelin, Hervé; Rinaudo, Daniela; Toniolo, Chiara; Metruccio, Matteo; Guidotti, Silvia; Sørensen, Uffe B Skov; Kilian, Mogens; Ramirez, Mario; Janulczyk, Robert; Donati, Claudio; Grandi, Guido; Margarit, Immaculada

    2015-01-01

    The human and bovine bacterial pathogen Streptococcus agalactiae (Group B Streptococcus, GBS) expresses a thick polysaccharide capsule that constitutes a major virulence factor and vaccine target. GBS can be classified into ten distinct serotypes differing in the chemical composition of their capsular polysaccharide. However, non-typeable strains that do not react with anti-capsular sera are frequently isolated from colonized and infected humans and cattle. To gain a comprehensive insight into the molecular basis for the loss of capsule expression in GBS, a collection of well-characterized non-typeable strains was investigated by genome sequencing. Genome based phylogenetic analysis extended to a wide population of sequenced strains confirmed the recently observed high clonality among GBS lineages mainly containing human strains, and revealed a much higher degree of diversity in the bovine population. Remarkably, non-typeable strains were equally distributed in all lineages. A number of distinct mutations in the cps operon were identified that were apparently responsible for inactivation of capsule synthesis. The most frequent genetic alterations were point mutations leading to stop codons in the cps genes, and the main target was found to be cpsE encoding the portal glycosyl transferase of capsule biosynthesis. Complementation of strains carrying missense mutations in cpsE with a wild-type gene restored capsule expression allowing the identification of amino acid residues essential for enzyme activity. PMID:25946017

  13. Comparative genomic hybridisation divides retinoblastomas into a high and a low level chromosomal instability group

    PubMed Central

    van der Wal, J E; Hermsen, M A J A; Gille, H J P; Schouten-Van Meeteren, N Y N; Moll, A C; Imhof, S M; Meijer, G A; Baak, J P A; van der Valk, P

    2003-01-01

    Background: Retinoblastoma is the most common intraocular malignancy in childhood and is responsible for approximately 1% of all deaths caused by childhood cancer. Aims/methods: Comparative genomic hybridisation was performed on 13 consecutive, histologically confirmed retinoblastomas to analyse patterns of chromosomal changes and correlate these to clinicopathological variables. Six cases were hereditary and seven cases were sporadic. Results: In 11 of the 13 tumours chromosomal abnormalities were detected, most frequently gains. Frequent chromosomal gains concerned 6p (46%), 1q (38%), 2p, 9q (30%), 5p, 7q, 10q, 17q, and 20q (23%). Frequent losses occurred at Xq (46%), 13q14, 16q, and 4q (23%). High level copy number gains were found at 5p15 and 6p11–12. A loss at 13q14 occurred in three cases only. Relatively few events occurred in the hereditary cases (27) compared with the non-hereditary cases (70 events). The number of chromosomal aberrations in these 13 retinoblastomas showed a bimodal distribution. Seven tumours showed less than four chromosomal aberrations, falling into a low level chromosomal instability (CIN) group, and six tumours showed at least eight aberrations, falling into a high level CIN group. In the low level CIN group the mean age was half that seen in the high level CIN group, there were less male patients, and there were more hereditary and bilateral cases. Microsatellite instability was not detected in either of the two groups. Conclusion: Despite the complex pattern of genetic changes in retinoblastomas, certain chromosomal regions appear to be affected preferentially. On the basis of the number of genetic events, retinoblastomas can be divided in low and a high level chromosomal instability groups, which have striking differences in clinical presentation. PMID:12499428

  14. Assembly-Driven Community Genomics of a Hypersaline Microbial Ecosystem

    PubMed Central

    Podell, Sheila; Ugalde, Juan A.; Narasingarao, Priya; Banfield, Jillian F.; Heidelberg, Karla B.; Allen, Eric E.

    2013-01-01

    Microbial populations inhabiting a natural hypersaline lake ecosystem in Lake Tyrrell, Victoria, Australia, have been characterized using deep metagenomic sampling, iterative de novo assembly, and multidimensional phylogenetic binning. Composite genomes representing habitat-specific microbial populations were reconstructed for eleven different archaea and one bacterium, comprising between 0.6 and 14.1% of the planktonic community. Eight of the eleven archaeal genomes were from microbial species without previously cultured representatives. These new genomes provide habitat-specific reference sequences enabling detailed, lineage-specific compartmentalization of predicted functional capabilities and cellular properties associated with both dominant and less abundant community members, including organisms previously known only by their 16S rRNA sequences. Together, these data provide a comprehensive, culture-independent genomic blueprint for ecosystem-wide analysis of protein functions, population structure, and lifestyles of co-existing, co-evolving microbial groups within the same natural habitat. The “assembly-driven” community genomic approach demonstrated in this study advances our ability to push beyond single gene investigations, and promotes genome-scale reconstructions as a tangible goal in the quest to define the metabolic, ecological, and evolutionary dynamics that underpin environmental microbial diversity. PMID:23637883

  15. Clostridium botulinum Group II Isolate Phylogenomic Profiling Using Whole-Genome Sequence Data

    PubMed Central

    Weedmark, K. A.; Mabon, P.; Hayden, K. L.; Lambert, D.; Van Domselaar, G.; Austin, J. W.

    2015-01-01

    Clostridium botulinum group II isolates (n = 163) from different geographic regions, outbreaks, and neurotoxin types and subtypes were characterized in silico using whole-genome sequence data. Two clusters representing a variety of botulinum neurotoxin (BoNT) types and subtypes were identified by multilocus sequence typing (MLST) and core single nucleotide polymorphism (SNP) analysis. While one cluster included BoNT/B4/F6/E9 and nontoxigenic members, the other comprised a wide variety of different BoNT/E subtype isolates and a nontoxigenic strain. In silico MLST and core SNP methods were consistent in terms of clade-level isolate classification; however, core SNP analysis showed higher resolution capability. Furthermore, core SNP analysis correctly distinguished isolates by outbreak and location. This study illustrated the utility of next-generation sequence-based typing approaches for isolate characterization and source attribution and identified discrete SNP loci and MLST alleles for isolate comparison. PMID:26116673

  16. Whole genome sequencing reveals extensive community-level transmission of group A Streptococcus in remote communities.

    PubMed

    Bowen, A C; Harris, T; Holt, D C; Giffard, P M; Carapetis, J R; Campbell, P T; McVERNON, J; Tong, S Y C

    2016-07-01

    Impetigo is common in remote Indigenous children of northern Australia, with the primary driver in this context being Streptococcus pyogenes [or group A Streptococcus (GAS)]. To reduce the high burden of impetigo, the transmission dynamics of GAS must be more clearly elucidated. We performed whole genome sequencing on 31 GAS isolates collected in a single community from children in 11 households with ⩾2 GAS-infected children. We aimed to determine whether transmission was occurring principally within households or across the community. The 31 isolates were represented by nine multilocus sequence types and isolates within each sequence type differed from one another by only 0-3 single nucleotide polymorphisms. There was evidence of extensive transmission both within households and across the community. Our findings suggest that strategies to reduce the burden of impetigo in this setting will need to extend beyond individual households, and incorporate multi-faceted, community-wide approaches. PMID:26833141

  17. Whole genome sequencing reveals extensive community-level transmission of group A Streptococcus in remote communities.

    PubMed

    Bowen, A C; Harris, T; Holt, D C; Giffard, P M; Carapetis, J R; Campbell, P T; McVERNON, J; Tong, S Y C

    2016-07-01

    Impetigo is common in remote Indigenous children of northern Australia, with the primary driver in this context being Streptococcus pyogenes [or group A Streptococcus (GAS)]. To reduce the high burden of impetigo, the transmission dynamics of GAS must be more clearly elucidated. We performed whole genome sequencing on 31 GAS isolates collected in a single community from children in 11 households with ⩾2 GAS-infected children. We aimed to determine whether transmission was occurring principally within households or across the community. The 31 isolates were represented by nine multilocus sequence types and isolates within each sequence type differed from one another by only 0-3 single nucleotide polymorphisms. There was evidence of extensive transmission both within households and across the community. Our findings suggest that strategies to reduce the burden of impetigo in this setting will need to extend beyond individual households, and incorporate multi-faceted, community-wide approaches.

  18. Assignment of simian rotavirus SA11 temperature-sensitive mutant groups B and E to genome segments

    SciTech Connect

    Gombold, J.L.; Estes, M.K.; Ramig, R.F.

    1985-05-01

    Recombinant (reassortant) viruses were selected from crosses between temperature-sensitive (ts) mutants of simian rotavirus SA11 and wild-type human rotavirus Wa. The double-stranded genome RNAs of the reassortants were examined by electrophoresis in Tris-glycine-buffered polyacrylamide gels and by dot hybridization with a cloned DNA probe for genome segment 2. Analysis of replacements of genome segments in the reassortants allowed construction of a map correlating genome segments providing functions interchangeable between SA11 and Wa. The reassortants revealed a functional correspondence in order of increasing electrophoretic mobility of genome segments. Analysis of the parental origin of genome segments in ts+ SA11/Wa reassortants derived from the crosses SA11 tsB(339) X Wa and SA11 tsE(1400) X Wa revealed that the group B lesion of tsB(339) was located on genome segment 3 and the group E lesion of tsE(1400) was on segment 8.

  19. Genome Sequences of Two Bacillus cereus Group Bacteriophages, Eyuki and AvesoBmore

    PubMed Central

    2015-01-01

    The genomes of two double-stranded DNA (dsDNA) bacteriophages isolated on Bacillus thuringiensis show similarity to previously sequenced phages and provide evidence of the mosaicism of phage genomes. PMID:26472840

  20. In Silico Genomic Fingerprints of the Bacillus anthracis Group Obtained by Virtual Hybridization

    PubMed Central

    Jaimes-Díaz, Hueman; Larios-Serrato, Violeta; Lloret-Sánchez, Teresa; Olguín-Ruiz, Gabriela; Sánchez-Vallejo, Carlos; Carreño-Durán, Luis; Maldonado-Rodríguez, Rogelio; Méndez-Tenorio, Alfonso

    2015-01-01

    In this study we evaluate the capacity of Virtual Hybridization to identify between highly related bacterial strains. Eight genomic fingerprints were obtained by virtual hybridization for the Bacillus anthracis genome set, and a set of 15,264 13-nucleotide short probes designed to produce genomic fingerprints unique for each organism. The data obtained from each genomic fingerprint were used to obtain hybridization patterns simulating a DNA microarray. Two virtual hybridization methods were used: the Direct and the Extended method to identify the number of potential hybridization sites and thus determine the minimum sensitivity value to discriminate between genomes with 99.9% similarity. Genomic fingerprints were compared using both methods and phylogenomic trees were constructed to verify that the minimum detection value is 0.000017. Results obtained from the genomic fingerprints suggest that the distribution in the trees is correct, as compared to other taxonomic methods. Specific virtual hybridization sites for each of the genomes studied were also identified.

  1. Complete Genome Sequence of Nitrosomonas ureae Strain Nm10, an Oligotrophic Group 6a Nitrosomonad.

    PubMed

    Kozlowski, Jessica A; Kits, K Dimitri; Stein, Lisa Y

    2016-01-01

    The complete genome of Nitrosomonas ureae strain Nm10, a mesophilic betaproteobacterial ammonia oxidizer isolated from Mediterranean soils in Sardinia, Italy, is reported here. This genome represents a cluster 6a nitrosomonad. PMID:26966201

  2. Big Data: the challenge for small research groups in the era of cancer genomics.

    PubMed

    Noor, Aisyah Mohd; Holmberg, Lars; Gillett, Cheryl; Grigoriadis, Anita

    2015-11-17

    In the past decade, cancer research has seen an increasing trend towards high-throughput techniques and translational approaches. The increasing availability of assays that utilise smaller quantities of source material and produce higher volumes of data output have resulted in the necessity for data storage solutions beyond those previously used. Multifactorial data, both large in sample size and heterogeneous in context, needs to be integrated in a standardised, cost-effective and secure manner. This requires technical solutions and administrative support not normally financially accounted for in small- to moderate-sized research groups. In this review, we highlight the Big Data challenges faced by translational research groups in the precision medicine era; an era in which the genomes of over 75,000 patients will be sequenced by the National Health Service over the next 3 years to advance healthcare. In particular, we have looked at three main themes of data management in relation to cancer research, namely (1) cancer ontology management, (2) IT infrastructures that have been developed to support data management and (3) the unique ethical challenges introduced by utilising Big Data in research.

  3. Big Data: the challenge for small research groups in the era of cancer genomics.

    PubMed

    Noor, Aisyah Mohd; Holmberg, Lars; Gillett, Cheryl; Grigoriadis, Anita

    2015-11-17

    In the past decade, cancer research has seen an increasing trend towards high-throughput techniques and translational approaches. The increasing availability of assays that utilise smaller quantities of source material and produce higher volumes of data output have resulted in the necessity for data storage solutions beyond those previously used. Multifactorial data, both large in sample size and heterogeneous in context, needs to be integrated in a standardised, cost-effective and secure manner. This requires technical solutions and administrative support not normally financially accounted for in small- to moderate-sized research groups. In this review, we highlight the Big Data challenges faced by translational research groups in the precision medicine era; an era in which the genomes of over 75,000 patients will be sequenced by the National Health Service over the next 3 years to advance healthcare. In particular, we have looked at three main themes of data management in relation to cancer research, namely (1) cancer ontology management, (2) IT infrastructures that have been developed to support data management and (3) the unique ethical challenges introduced by utilising Big Data in research. PMID:26492224

  4. Big Data: the challenge for small research groups in the era of cancer genomics

    PubMed Central

    Noor, Aisyah Mohd; Holmberg, Lars; Gillett, Cheryl; Grigoriadis, Anita

    2015-01-01

    In the past decade, cancer research has seen an increasing trend towards high-throughput techniques and translational approaches. The increasing availability of assays that utilise smaller quantities of source material and produce higher volumes of data output have resulted in the necessity for data storage solutions beyond those previously used. Multifactorial data, both large in sample size and heterogeneous in context, needs to be integrated in a standardised, cost-effective and secure manner. This requires technical solutions and administrative support not normally financially accounted for in small- to moderate-sized research groups. In this review, we highlight the Big Data challenges faced by translational research groups in the precision medicine era; an era in which the genomes of over 75 000 patients will be sequenced by the National Health Service over the next 3 years to advance healthcare. In particular, we have looked at three main themes of data management in relation to cancer research, namely (1) cancer ontology management, (2) IT infrastructures that have been developed to support data management and (3) the unique ethical challenges introduced by utilising Big Data in research. PMID:26492224

  5. Genome-Wide Identification of Genes Required for Fitness of Group A Streptococcus in Human Blood

    PubMed Central

    Le Breton, Yoann; Mistry, Pragnesh; Valdes, Kayla M.; Quigley, Jeffrey; Kumar, Nikhil; Tettelin, Hervé

    2013-01-01

    The group A streptococcus (GAS) is a strict human pathogen responsible for a wide spectrum of diseases. Although GAS genome sequences are available, functional genomic analyses have been limited. We developed a mariner-based transposon, osKaR, designed to perform Transposon-Site Hybridization (TraSH) in GAS and successfully tested its use in several invasive serotypes. A complex osKaR mutant library in M1T1 GAS strain 5448 was subjected to negative selection in human blood to identify genes important for GAS fitness in this clinically relevant environment. Mutants underrepresented after growth in blood (output pool) compared to growth in rich media (input pool) were identified using DNA microarray hybridization of transposon-specific tags en masse. Using blood from three different donors, we identified 81 genes that met our criteria for reduced fitness in blood from at least two individuals. Genes known to play a role in survival of GAS in blood were found, including those encoding the virulence regulator Mga (mga), the peroxide response regulator PerR (perR), and the RofA-like regulator Ralp-3 (ralp3). We also identified genes previously reported for their contribution to sepsis in other pathogens, such as de novo nucleotide synthesis (purD, purA, pyrB, carA, carB, guaB), sugar metabolism (scrB, fruA), zinc uptake (adcC), and transcriptional regulation (cpsY). To validate our findings, independent mutants with mutations in 10 different genes identified in our screen were confirmed to be defective for survival in blood bactericidal assays. Overall, this work represents the first use of TraSH in GAS to identify potential virulence genes. PMID:23297387

  6. Functional gene groups are concentrated within chromosomes, among chromosomes and in the nuclear space of the human genome.

    PubMed

    Thévenin, Annelyse; Ein-Dor, Liat; Ozery-Flato, Michal; Shamir, Ron

    2014-09-01

    Genomes undergo changes in organization as a result of gene duplications, chromosomal rearrangements and local mutations, among other mechanisms. In contrast to prokaryotes, in which genes of a common function are often organized in operons and reside contiguously along the genome, most eukaryotes show much weaker clustering of genes by function, except for few concrete functional groups. We set out to check systematically if there is a relation between gene function and gene organization in the human genome. We test this question for three types of functional groups: pairs of interacting proteins, complexes and pathways. We find a significant concentration of functional groups both in terms of their distance within the same chromosome and in terms of their dispersal over several chromosomes. Moreover, using Hi-C contact map of the tendency of chromosomal segments to appear close in the 3D space of the nucleus, we show that members of the same functional group that reside on distinct chromosomes tend to co-localize in space. The result holds for all three types of functional groups that we tested. Hence, the human genome shows substantial concentration of functional groups within chromosomes and across chromosomes in space.

  7. How the indirect reciprocity with co-evolving norm and strategy for 2 × 2 prisoner's dilemma game works for emerging cooperation

    NASA Astrophysics Data System (ADS)

    Tanimoto, Jun; Sagara, Hirokji

    2015-11-01

    We built a new indirect reciprocity model based on binary image scores, where an agent's strategy and norm co-evolve. The norm, meaning what behavior is evaluated as "good" or "bad," stipulates how image scores of two agents playing a game is altered, which has been presumed to be a fixed value in most previous studies. Also, unlike former studies, our model allows an agent to play with an agent who has a different norm. This point of relaxing the freedom of the model pulls down cooperation level vis-à-vis the case where an agent always plays with another one having same norm. However, it is observed that a rather larger dilemma shows robust cooperation establishing compared with a smaller dilemma, since a norm that punishes a so-called second-order free-rider is prompted. To encourage the evolution of norms to be able to punish second-order free-riders, a society needs a small number of defectors. This is elucidated by the fact that cases with action error are more cooperative than those without action error.

  8. RNA-RNA interactions and pre-mRNA mislocalization as drivers of group II intron loss from nuclear genomes.

    PubMed

    Qu, Guosheng; Dong, Xiaolong; Piazza, Carol Lyn; Chalamcharla, Venkata R; Lutz, Sheila; Curcio, M Joan; Belfort, Marlene

    2014-05-01

    Group II introns are commonly believed to be the progenitors of spliceosomal introns, but they are notably absent from nuclear genomes. Barriers to group II intron function in nuclear genomes therefore beg examination. A previous study showed that nuclear expression of a group II intron in yeast results in nonsense-mediated decay and translational repression of mRNA, and that these roadblocks to expression are group II intron-specific. To determine the molecular basis for repression of gene expression, we investigated cellular dynamics of processed group II intron RNAs, from transcription to cellular localization. Our data show pre-mRNA mislocalization to the cytoplasm, where the RNAs are targeted to foci. Furthermore, tenacious mRNA-pre-mRNA interactions, based on intron-exon binding sequences, result in reduced abundance of spliced mRNAs. Nuclear retention of pre-mRNA prevents this interaction and relieves these expression blocks. In addition to providing a mechanistic rationale for group II intron-specific repression, our data support the hypothesis that RNA silencing of the host gene contributed to expulsion of group II introns from nuclear genomes and drove the evolution of spliceosomal introns.

  9. Genome characterisation of enteroviruses 117 and 118: a new group within human enterovirus species C.

    PubMed

    Piralla, Antonio; Daleno, Cristina; Scala, Alessia; Greenberg, David; Usonis, Vytautas; Principi, Nicola; Baldanti, Fausto; Esposito, Susanna

    2013-01-01

    The more than 120 genotypes of human enteroviruses (HEVs) reflect a wide range of evolutionary divergence, and there are 23 currently classified as human enterovirus C species (HEV-C). Two new HEV-C (EV-C117 and EV-C118) were identified in the Community-Acquired Pneumonia Pediatric Research Initiative (CAP-PRI) study, and the present paper describes the characterisation of the complete genome of one EV-C117 strain (LIT22) and two EV-C118 (ISR38 and ISR10) strains. The EV-C117 and EV-C118 5'UTR sequences were related to those of EV-C104, EV-C105 and EV-C109, and were slightly shorter than those of other HEV A-D species. Similarity plot analyses showed that EV-C117 and EV-C118 have a P1 region that is highly divergent from that of the other HEV-C, and phylogenetic analyses highly supported a monophyletic group consisting of EV-C117, EV-C118, EV-C104, EV-C105 and EV-C109 strains. Phylogenetic, Simplot and Bootscan analyses indicated that recombination was not the main mechanism of EV-C117 and EV-C118 evolution, thus strengthening the hypothesis of the monophyletic origin of the coding regions, as in the case of other HEV-C. Phylogenetic analysis also revealed the emergence of a new group within HEV-C that is divided into two subgroups. Nucleotide and amino acid identity in VP1 sequences have been established as useful criteria for assigning new HEV types, but analysis of the complete P1 region improves resolution.

  10. Core Genome Multilocus Sequence Typing for Identification of Globally Distributed Clonal Groups and Differentiation of Outbreak Strains of Listeria monocytogenes

    PubMed Central

    Gonzalez-Escalona, Narjol; Hammack, Thomas S.; Allard, Marc W.; Strain, Errol A.; Brown, Eric W.

    2016-01-01

    ABSTRACT Many listeriosis outbreaks are caused by a few globally distributed clonal groups, designated clonal complexes or epidemic clones, of Listeria monocytogenes, several of which have been defined by classic multilocus sequence typing (MLST) schemes targeting 6 to 8 housekeeping or virulence genes. We have developed and evaluated core genome MLST (cgMLST) schemes and applied them to isolates from multiple clonal groups, including those associated with 39 listeriosis outbreaks. The cgMLST clusters were congruent with MLST-defined clonal groups, which had various degrees of diversity at the whole-genome level. Notably, cgMLST could distinguish among outbreak strains and epidemiologically unrelated strains of the same clonal group, which could not be achieved using classic MLST schemes. The precise selection of cgMLST gene targets may not be critical for the general identification of clonal groups and outbreak strains. cgMLST analyses further identified outbreak strains, including those associated with recent outbreaks linked to contaminated French-style cheese, Hispanic-style cheese, stone fruit, caramel apple, ice cream, and packaged leafy green salad, as belonging to major clonal groups. We further developed lineage-specific cgMLST schemes, which can include accessory genes when core genomes do not possess sufficient diversity, and this provided additional resolution over species-specific cgMLST. Analyses of isolates from different common-source listeriosis outbreaks revealed various degrees of diversity, indicating that the numbers of allelic differences should always be combined with cgMLST clustering and epidemiological evidence to define a listeriosis outbreak. IMPORTANCE Classic multilocus sequence typing (MLST) schemes targeting internal fragments of 6 to 8 genes that define clonal complexes or epidemic clones have been widely employed to study L. monocytogenes biodiversity and its relation to pathogenicity potential and epidemiology. We demonstrated

  11. Draft genome sequence of Streptomyces vitaminophilus ATCC 31673, a producer of pyrrolomycin antibiotics, some of which contain a nitro group

    DOE PAGES

    Mahan, Kristina M.; Klingeman, Dawn Marie; Robert L. Hettich; Parry, Ronald J.; Graham, David E.

    2016-01-21

    Streptomyces vitaminophilus produces pyrrolomycins, which are halogenated polyketide antibiotics. Some of the pyrrolomycins contain a rare nitro group located on the pyrrole ring. In addition, the 6.5-Mbp genome encodes 5,941 predicted protein-coding sequences in 39 contigs with a 71.9% G+C content.

  12. Draft Genome Sequences of Two Clostridium botulinum Group II (Nonproteolytic) Type B Strains (DB-2 and KAPB-3).

    PubMed

    Petronella, Nicholas; Kenwell, Robyn; Pagotto, Franco; Pightling, Arthur W

    2014-01-01

    Clostridium botulinum is important for food safety and studies of neurotoxins associated with human botulism. We present the draft genome sequences of two strains belonging to group II type B: one collected from Pacific Ocean sediments (DB-2) and another obtained during a botulism outbreak (KAPB-3). PMID:25377702

  13. Complete Genome Sequence of Streptococcus mitis Strain SVGS_061 Isolated from a Neutropenic Patient with Viridans Group Streptococcal Shock Syndrome

    PubMed Central

    Petrosyan, Varduhi; Holder, Michael; Ajami, Nadim J.; Petrosino, Joseph F.; Sahasrabhojane, Pranoti; Thompson, Erika J.; Kalia, Awdhesh

    2016-01-01

    Streptococcus mitis frequently causes invasive infections in neutropenic cancer patients, with a subset of patients developing viridans group streptococcal (VGS) shock syndrome. We report here the first complete genome sequence of S. mitis strain SVGS_061, which caused VGS shock syndrome, to help elucidate the pathogenesis of severe VGS infection. PMID:27056234

  14. Effects of racial and ethnic group and health literacy on responses to genomic risk information in a medically underserved population

    PubMed Central

    Kaphingst, Kimberly A.; Stafford, Jewel D.; McGowan, Lucy D’Agostino; Seo, Joann; Lachance, Christina R.; Goodman, Melody S.

    2015-01-01

    Objective Few studies have examined how individuals respond to genomic risk information for common, chronic diseases. This randomized study examined differences in responses by type of genomic information [genetic test/family history] and disease condition [diabetes/heart disease] and by race/ethnicity in a medically underserved population. Methods 1057 English-speaking adults completed a survey containing one of four vignettes (two-by-two randomized design). Differences in dependent variables (i.e., interest in receiving genomic assessment, discussing with doctor or family, changing health habits) by experimental condition and race/ethnicity were examined using chi-squared tests and multivariable regression analysis. Results No significant differences were found in dependent variables by type of genomic information or disease condition. In multivariable models, Hispanics were more interested in receiving a genomic assessment than Whites (OR=1.93; p<0.0001); respondents with marginal (OR=1.54; p=0.005) or limited (OR=1.85; p=0.009) health literacy had greater interest than those with adequate health literacy. Blacks (OR=1.78; p=0.001) and Hispanics (OR=1.85; p=0.001) had greater interest in discussing information with family than Whites. Non-Hispanic Blacks (OR=1.45; p=0.04) had greater interest in discussing genomic information with a doctor than Whites. Blacks (β= −0.41; p<0.001) and Hispanics (β= −0.25; p=0.033) intended to change fewer health habits than Whites; health literacy was negatively associated with number of health habits participants intended to change. Conclusions Findings suggest that race/ethnicity may affect responses to genomic risk information. Additional research could examine how cognitive representations of this information differ across racial/ethnic groups. Health literacy is also critical to consider in developing approaches to communicating genomic information. PMID:25622080

  15. Genomic properties of Marine Group A bacteria indicate a role in the marine sulfur cycle

    PubMed Central

    Wright, Jody J; Mewis, Keith; Hanson, Niels W; Konwar, Kishori M; Maas, Kendra R; Hallam, Steven J

    2014-01-01

    Marine Group A (MGA) is a deeply branching and uncultivated phylum of bacteria. Although their functional roles remain elusive, MGA subgroups are particularly abundant and diverse in oxygen minimum zones and permanent or seasonally stratified anoxic basins, suggesting metabolic adaptation to oxygen-deficiency. Here, we expand a previous survey of MGA diversity in O2-deficient waters of the Northeast subarctic Pacific Ocean (NESAP) to include Saanich Inlet (SI), an anoxic fjord with seasonal O2 gradients and periodic sulfide accumulation. Phylogenetic analysis of small subunit ribosomal RNA (16S rRNA) gene clone libraries recovered five previously described MGA subgroups and defined three novel subgroups (SHBH1141, SHBH391, and SHAN400) in SI. To discern the functional properties of MGA residing along gradients of O2 in the NESAP and SI, we identified and sequenced to completion 14 fosmids harboring MGA-associated 16S RNA genes from a collection of 46 fosmid libraries sourced from NESAP and SI waters. Comparative analysis of these fosmids, in addition to four publicly available MGA-associated large-insert DNA fragments from Hawaii Ocean Time-series and Monterey Bay, revealed widespread genomic differentiation proximal to the ribosomal RNA operon that did not consistently reflect subgroup partitioning patterns observed in 16S rRNA gene clone libraries. Predicted protein-coding genes associated with adaptation to O2-deficiency and sulfur-based energy metabolism were detected on multiple fosmids, including polysulfide reductase (psrABC), implicated in dissimilatory polysulfide reduction to hydrogen sulfide and dissimilatory sulfur oxidation. These results posit a potential role for specific MGA subgroups in the marine sulfur cycle. PMID:24030600

  16. Fundamental molecules of life are pigments which arose and co-evolved as a response to the thermodynamic imperative of dissipating the prevailing solar spectrum

    NASA Astrophysics Data System (ADS)

    Michaelian, K.; Simeonov, A.

    2015-08-01

    The driving force behind the origin and evolution of life has been the thermodynamic imperative of increasing the entropy production of the biosphere through increasing the global solar photon dissipation rate. In the upper atmosphere of today, oxygen and ozone derived from life processes are performing the short-wavelength UV-C and UV-B dissipation. On Earth's surface, water and organic pigments in water facilitate the near-UV and visible photon dissipation. The first organic pigments probably formed, absorbed, and dissipated at those photochemically active wavelengths in the UV-C and UV-B that could have reached Earth's surface during the Archean. Proliferation of these pigments can be understood as an autocatalytic photochemical process obeying non-equilibrium thermodynamic directives related to increasing solar photon dissipation rate. Under these directives, organic pigments would have evolved over time to increase the global photon dissipation rate by (1) increasing the ratio of their effective photon cross sections to their physical size, (2) decreasing their electronic excited state lifetimes, (3) quenching radiative de-excitation channels (e.g., fluorescence), (4) covering ever more completely the prevailing solar spectrum, and (5) proliferating and dispersing to cover an ever greater surface area of Earth. From knowledge of the evolution of the spectrum of G-type stars, and considering the most probable history of the transparency of Earth's atmosphere, we construct the most probable Earth surface solar spectrum as a function of time and compare this with the history of molecular absorption maxima obtained from the available data in the literature. This comparison supports the conjecture that many fundamental molecules of life are pigments which arose, proliferated, and co-evolved as a response to dissipating the solar spectrum, supports the thermodynamic dissipation theory for the origin of life, constrains models for Earth's early atmosphere, and sheds

  17. Community Genomic and Proteomic Analyses of Chemoautotrophic Iron-Oxidizing "Leptospirillum rubarum" (Group II) and "Leptospirillum ferrodiazotrophum" (Group III) Bacteria in Acid Mine Drainage Biofilms

    SciTech Connect

    Goltsman, Daniela; Denef, Vincent; Singer, Steven; Verberkmoes, Nathan C; Lefsrud, Mark G; Mueller, Ryan; Dick, Gregory J.; Sun, Christine; Wheeler, Korin; Zelma, Adam; Baker, Brett J.; Hauser, Loren John; Land, Miriam L; Shah, Manesh B; Thelen, Michael P.; Hettich, Robert {Bob} L; Banfield, Jillian F.

    2009-01-01

    We analyzed near-complete population (composite) genomic sequences for coexisting acidophilic iron-oxidizing Leptospirillum group II and III bacteria (phylum Nitrospirae) and an extrachromosomal plasmid from a Richmond Mine, Iron Mountain, CA, acid mine drainage biofilm. Community proteomic analysis of the genomically characterized sample and two other biofilms identified 64.6% and 44.9% of the predicted proteins of Leptospirillum groups II and III, respectively, and 20% of the predicted plasmid proteins. The bacteria share 92% 16S rRNA gene sequence identity and >60% of their genes, including integrated plasmid-like regions. The extrachromosomal plasmid carries conjugation genes with detectable sequence similarity to genes in the integrated conjugative plasmid, but only those on the extrachromosomal element were identified by proteomics. Both bacterial groups have genes for community-essential functions, including carbon fixation and biosynthesis of vitamins, fatty acids, and biopolymers (including cellulose); proteomic analyses reveal these activities. Both Leptospirillum types have multiple pathways for osmotic protection. Although both are motile, signal transduction and methyl-accepting chemotaxis proteins are more abundant in Leptospirillum group III, consistent with its distribution in gradients within biofilms. Interestingly, Leptospirillum group II uses a methyl-dependent and Leptospirillum group III a methyl-independent response pathway. Although only Leptospirillum group III can fix nitrogen, these proteins were not identified by proteomics. The abundances of core proteins are similar in all communities, but the abundance levels of unique and shared proteins of unknown function vary. Some proteins unique to one organism were highly expressed and may be key to the functional and ecological differentiation of Leptospirillum groups II and III.

  18. Genomic Analysis of Melioribacter roseus, Facultatively Anaerobic Organotrophic Bacterium Representing a Novel Deep Lineage within Bacteriodetes/Chlorobi Group

    PubMed Central

    Kadnikov, Vitaly V.; Mardanov, Andrey V.; Podosokorskaya, Olga A.; Gavrilov, Sergey N.; Kublanov, Ilya V.; Beletsky, Alexey V.; Bonch-Osmolovskaya, Elizaveta A.; Ravin, Nikolai V.

    2013-01-01

    Melioribacter roseus is a moderately thermophilic facultatively anaerobic organotrophic bacterium representing a novel deep branch within Bacteriodetes/Chlorobi group. To better understand the metabolic capabilities and possible ecological functions of M. roseus and get insights into the evolutionary history of this bacterial lineage, we sequenced the genome of the type strain P3M-2T. A total of 2838 open reading frames was predicted from its 3.30 Mb genome. The whole proteome analysis supported phylum-level classification of M. roseus since most of the predicted proteins had closest matches in Bacteriodetes, Proteobacteria, Chlorobi, Firmicutes and deeply-branching bacterium Caldithrix abyssi, rather than in one particular phylum. Consistent with the ability of the bacterium to grow on complex carbohydrates, the genome analysis revealed more than one hundred glycoside hydrolases, glycoside transferases, polysaccharide lyases and carbohydrate esterases. The reconstructed central metabolism revealed pathways enabling the fermentation of complex organic substrates, as well as their complete oxidation through aerobic and anaerobic respiration. Genes encoding the photosynthetic and nitrogen-fixation machinery of green sulfur bacteria, as well as key enzymes of autotrophic carbon fixation pathways, were not identified. The M. roseus genome supports its affiliation to a novel phylum Ignavibateriae, representing the first step on the evolutionary pathway from heterotrophic ancestors of Bacteriodetes/Chlorobi group towards anaerobic photoautotrophic Chlorobi. PMID:23301019

  19. Social and behavioral research in genomic sequencing: approaches from the Clinical Sequencing Exploratory Research Consortium Outcomes and Measures Working Group.

    PubMed

    Gray, Stacy W; Martins, Yolanda; Feuerman, Lindsay Z; Bernhardt, Barbara A; Biesecker, Barbara B; Christensen, Kurt D; Joffe, Steven; Rini, Christine; Veenstra, David; McGuire, Amy L

    2014-10-01

    The routine use of genomic sequencing in clinical medicine has the potential to dramatically alter patient care and medical outcomes. To fully understand the psychosocial and behavioral impact of sequencing integration into clinical practice, it is imperative that we identify the factors that influence sequencing-related decision making and patient outcomes. In an effort to develop a collaborative and conceptually grounded approach to studying sequencing adoption, members of the National Human Genome Research Institute's Clinical Sequencing Exploratory Research Consortium formed the Outcomes and Measures Working Group. Here we highlight the priority areas of investigation and psychosocial and behavioral outcomes identified by the Working Group. We also review some of the anticipated challenges to measurement in social and behavioral research related to genomic sequencing; opportunities for instrument development; and the importance of qualitative, quantitative, and mixed-method approaches. This work represents the early, shared efforts of multiple research teams as we strive to understand individuals' experiences with genomic sequencing. The resulting body of knowledge will guide recommendations for the optimal use of sequencing in clinical practice.

  20. Insights into the evolution of Archaea and eukaryotic protein modifier systems revealed by the genome of a novel archaeal group.

    PubMed

    Nunoura, Takuro; Takaki, Yoshihiro; Kakuta, Jungo; Nishi, Shinro; Sugahara, Junichi; Kazama, Hiromi; Chee, Gab-Joo; Hattori, Masahira; Kanai, Akio; Atomi, Haruyuki; Takai, Ken; Takami, Hideto

    2011-04-01

    The domain Archaea has historically been divided into two phyla, the Crenarchaeota and Euryarchaeota. Although regarded as members of the Crenarchaeota based on small subunit rRNA phylogeny, environmental genomics and efforts for cultivation have recently revealed two novel phyla/divisions in the Archaea; the 'Thaumarchaeota' and 'Korarchaeota'. Here, we show the genome sequence of Candidatus 'Caldiarchaeum subterraneum' that represents an uncultivated crenarchaeotic group. A composite genome was reconstructed from a metagenomic library previously prepared from a microbial mat at a geothermal water stream of a sub-surface gold mine. The genome was found to be clearly distinct from those of the known phyla/divisions, Crenarchaeota (hyperthermophiles), Euryarchaeota, Thaumarchaeota and Korarchaeota. The unique traits suggest that this crenarchaeotic group can be considered as a novel archaeal phylum/division. Moreover, C. subterraneum harbors an ubiquitin-like protein modifier system consisting of Ub, E1, E2 and small Zn RING finger family protein with structural motifs specific to eukaryotic system proteins, a system clearly distinct from the prokaryote-type system recently identified in Haloferax and Mycobacterium. The presence of such a eukaryote-type system is unprecedented in prokaryotes, and indicates that a prototype of the eukaryotic protein modifier system is present in the Archaea. PMID:21169198

  1. Insights into the evolution of Archaea and eukaryotic protein modifier systems revealed by the genome of a novel archaeal group

    PubMed Central

    Nunoura, Takuro; Takaki, Yoshihiro; Kakuta, Jungo; Nishi, Shinro; Sugahara, Junichi; Kazama, Hiromi; Chee, Gab-Joo; Hattori, Masahira; Kanai, Akio; Atomi, Haruyuki; Takai, Ken; Takami, Hideto

    2011-01-01

    The domain Archaea has historically been divided into two phyla, the Crenarchaeota and Euryarchaeota. Although regarded as members of the Crenarchaeota based on small subunit rRNA phylogeny, environmental genomics and efforts for cultivation have recently revealed two novel phyla/divisions in the Archaea; the ‘Thaumarchaeota’ and ‘Korarchaeota’. Here, we show the genome sequence of Candidatus ‘Caldiarchaeum subterraneum’ that represents an uncultivated crenarchaeotic group. A composite genome was reconstructed from a metagenomic library previously prepared from a microbial mat at a geothermal water stream of a sub-surface gold mine. The genome was found to be clearly distinct from those of the known phyla/divisions, Crenarchaeota (hyperthermophiles), Euryarchaeota, Thaumarchaeota and Korarchaeota. The unique traits suggest that this crenarchaeotic group can be considered as a novel archaeal phylum/division. Moreover, C. subterraneum harbors an ubiquitin-like protein modifier system consisting of Ub, E1, E2 and small Zn RING finger family protein with structural motifs specific to eukaryotic system proteins, a system clearly distinct from the prokaryote-type system recently identified in Haloferax and Mycobacterium. The presence of such a eukaryote-type system is unprecedented in prokaryotes, and indicates that a prototype of the eukaryotic protein modifier system is present in the Archaea. PMID:21169198

  2. Complete genome sequence of Lactobacillus plantarum LZ227, a potential probiotic strain producing B-group vitamins.

    PubMed

    Li, Ping; Zhou, Qingqing; Gu, Qing

    2016-09-20

    B-group vitamins play an important role in human metabolism, whose deficiencies are associated with a variety of disorders and diseases. Certain microorganisms such as Lactic acid bacteria (LAB) have been shown to have capacities for B-group vitamin production and thus could potentially replace chemically synthesized vitamins for food fortification. A potential probiotic strain named Lactobacillus plantarum LZ227, which was isolated from raw cow milk in this study, exhibits the ability to produce B-group vitamins. Complete genome sequencing of LZ227 was performed to gain insights into the genetic elements involved in B-group vitamin production. The genome of LZ227 contains a circular 3,131,750-bp chromosome, three circular plasmids and two predicted linear plasmids. LZ227 also contains gene clusters for biosynthesis of both riboflavin and folate. This genome sequence provides a basis for further elucidation of its molecular genetics and probiotic functions, and will facilitate its applications as starter cultures in food industry. PMID:27480344

  3. Complete genome sequence of Lactobacillus plantarum LZ227, a potential probiotic strain producing B-group vitamins.

    PubMed

    Li, Ping; Zhou, Qingqing; Gu, Qing

    2016-09-20

    B-group vitamins play an important role in human metabolism, whose deficiencies are associated with a variety of disorders and diseases. Certain microorganisms such as Lactic acid bacteria (LAB) have been shown to have capacities for B-group vitamin production and thus could potentially replace chemically synthesized vitamins for food fortification. A potential probiotic strain named Lactobacillus plantarum LZ227, which was isolated from raw cow milk in this study, exhibits the ability to produce B-group vitamins. Complete genome sequencing of LZ227 was performed to gain insights into the genetic elements involved in B-group vitamin production. The genome of LZ227 contains a circular 3,131,750-bp chromosome, three circular plasmids and two predicted linear plasmids. LZ227 also contains gene clusters for biosynthesis of both riboflavin and folate. This genome sequence provides a basis for further elucidation of its molecular genetics and probiotic functions, and will facilitate its applications as starter cultures in food industry.

  4. Community genomic and proteomic analysis of chemoautotrophic, iron-oxidizing "Leptospirillum rubarum" (Group II) and Leptospirillum ferrodiazotrophum (Group III) in acid mine drainage biofilms

    SciTech Connect

    Goltsman, Daniela; Denef, Vincent; Singer, Steven; Verberkmoes, Nathan C; Lefsrud, Mark G; Mueller, Ryan; Dick, Gregory J.; Sun, Christine; Wheeler, Korin; Zelma, Adam; Baker, Brett J.; Hauser, Loren John; Land, Miriam L; Shah, Manesh B; Thelen, Michael P.; Hettich, Robert {Bob} L; Banfield, Jillian F.

    2009-01-01

    We analyzed near-complete population (composite) genomic sequences for coexisting acidophilic iron-oxidizing Leptospirillum Groups II and III bacteria (phylum Nitrospirae) and an extrachromosomal plasmid from a Richmond Mine, CA acid mine drainage (AMD) biofilm. Community proteomic analysis of the genomically characterized sample and two other biofilms identified 64.6% and 44.9% of the predicted proteins of Leptospirillum Groups II and III, respectively and 20% of the predicted plasmid proteins. The bacteria share 92% 16S rRNA gene sequence identity and > 60% of their genes, including integrated plasmid-like regions. The extrachromosomal plasmid encodes conjugation genes with detectable sequence similarity to genes in the integrated conjugative plasmid, but only those on the extrachromosomal element were identified by proteomics. Both bacteria have genes for community-essential functions, including carbon fixation, biosynthesis of vitamins, fatty acids and biopolymers (including cellulose); proteomic analyses reveal these activities. Both Leptospirillum types have multiple pathways for osmotic protection. Although both are motile, signal transduction and methyl-accepting chemotaxis proteins are more abundant in Leptospirillum Group III, consistent with its distribution in gradients within biofilms. Interestingly, Leptospirillum Group II uses a methyl-dependent and Leptospirillum Group III a methyl-independent response pathway. Although only Leptospirillum Group III can fix nitrogen, these proteins were not identified by proteomics. Abundances of core proteins are similar in all communities, but abundance levels of unique and shared proteins of unknown function vary. Some proteins unique to one organism were highly expressed and may be key to the functional and ecological differentiation of Leptospirillum Groups II and III.

  5. Genomic resources for multiple species in the Drosophila ananassae species group.

    PubMed

    Signor, Sarah; Seher, Thaddeus; Kopp, Artyom

    2013-01-01

    The development of genomic resources in non-model taxa is essential for understanding the genetic basis of biological diversity. Although the genomes of many Drosophila species have been sequenced, most of the phenotypic diversity in this genus remains to be explored. To facilitate the genetic analysis of interspecific and intraspecific variation, we have generated new genomic resources for seven species and subspecies in the D. ananassae species subgroup. We have generated large amounts of transcriptome sequence data for D. ercepeae, D. merina, D. bipectinata, D. malerkotliana malerkotliana, D. m. pallens, D. pseudoananassae pseudoananassae, and D. p. nigrens. de novo assembly resulted in contigs covering more than half of the predicted transcriptome and matching an average of 59% of annotated genes in the complete genome of D. ananassae. Most contigs, corresponding to an average of 49% of D. ananassae genes, contain sequence polymorphisms that can be used as genetic markers. Subsets of these markers were validated by genotyping the progeny of inter- and intraspecific crosses. The ananassae subgroup is an excellent model system for examining the molecular basis of speciation and phenotypic evolution. The new genomic resources will facilitate the genetic analysis of inter- and intraspecific differences in this lineage. Transcriptome sequencing provides a simple and cost-effective way to identify molecular markers at nearly single-gene density, and is equally applicable to any non-model taxa. PMID:23639891

  6. Genomic resources for multiple species in the Drosophila ananassae species group

    PubMed Central

    Signor, Sarah; Seher, Thaddeus; Kopp, Artyom

    2013-01-01

    The development of genomic resources in non-model taxa is essential for understanding the genetic basis of biological diversity. Although the genomes of many Drosophila species have been sequenced, most of the phenotypic diversity in this genus remains to be explored. To facilitate the genetic analysis of interspecific and intraspecific variation, we have generated new genomic resources for seven species and subspecies in the D. ananassae species subgroup. We have generated large amounts of transcriptome sequence data for D. ercepeae, D. merina, D. bipectinata, D. malerkotliana malerkotliana, D. m. pallens, D. pseudoananassae pseudoananassae, and D. p. nigrens. de novo assembly resulted in contigs covering more than half of the predicted transcriptome and matching an average of 59% of annotated genes in the complete genome of D. ananassae. Most contigs, corresponding to an average of 49% of D. ananassae genes, contain sequence polymorphisms that can be used as genetic markers. Subsets of these markers were validated by genotyping the progeny of inter- and intraspecific crosses. The ananassae subgroup is an excellent model system for examining the molecular basis of speciation and phenotypic evolution. The new genomic resources will facilitate the genetic analysis of inter- and intraspecific differences in this lineage. Transcriptome sequencing provides a simple and cost-effective way to identify molecular markers at nearly single-gene density, and is equally applicable to any non-model taxa. PMID:23639891

  7. Group-based and personalized care in an age of genomic and evidence-based medicine: a reappraisal.

    PubMed

    Maglo, Koffi N

    2012-01-01

    This article addresses the philosophical and moral foundations of group-based and individualized therapy in connection with population care equality. The U.S. Food and Drug Administration (FDA) recently modified its public health policy by seeking to enhance the efficacy and equality of care through the approval of group-specific prescriptions and doses for some drugs. In the age of genomics, when individualization of care increasingly has become a major concern, investigating the relationship between population health, stratified medicine, and personalized therapy can improve our understanding of the ethical and biomedical implications of genomic medicine. I suggest that the need to optimize population health through population substructure-sensitive research and the need to individualize care through genetically targeted therapies are not necessarily incompatible. Accordingly, the article reconceptualizes a unified goal for modern scientific medicine in terms of individualized equal care.

  8. Group-based and personalized care in an age of genomic and evidence-based medicine: a reappraisal.

    PubMed

    Maglo, Koffi N

    2012-01-01

    This article addresses the philosophical and moral foundations of group-based and individualized therapy in connection with population care equality. The U.S. Food and Drug Administration (FDA) recently modified its public health policy by seeking to enhance the efficacy and equality of care through the approval of group-specific prescriptions and doses for some drugs. In the age of genomics, when individualization of care increasingly has become a major concern, investigating the relationship between population health, stratified medicine, and personalized therapy can improve our understanding of the ethical and biomedical implications of genomic medicine. I suggest that the need to optimize population health through population substructure-sensitive research and the need to individualize care through genetically targeted therapies are not necessarily incompatible. Accordingly, the article reconceptualizes a unified goal for modern scientific medicine in terms of individualized equal care. PMID:22643722

  9. Identification, characterization, and comparative genomic distribution of the HERV-K (HML-2) group of human endogenous retroviruses

    PubMed Central

    2011-01-01

    Background Integration of retroviral DNA into a germ cell may lead to a provirus that is transmitted vertically to that host's offspring as an endogenous retrovirus (ERV). In humans, ERVs (HERVs) comprise about 8% of the genome, the vast majority of which are truncated and/or highly mutated and no longer encode functional genes. The most recently active retroviruses that integrated into the human germ line are members of the Betaretrovirus-like HERV-K (HML-2) group, many of which contain intact open reading frames (ORFs) in some or all genes, sometimes encoding functional proteins that are expressed in various tissues. Interestingly, this expression is upregulated in many tumors ranging from breast and ovarian tissues to lymphomas and melanomas, as well as schizophrenia, rheumatoid arthritis, and other disorders. Results No study to date has characterized all HML-2 elements in the genome, an essential step towards determining a possible functional role of HML-2 expression in disease. We present here the most comprehensive and accurate catalog of all full-length and partial HML-2 proviruses, as well as solo LTR elements, within the published human genome to date. Furthermore, we provide evidence for preferential maintenance of proviruses and solo LTR elements on gene-rich chromosomes of the human genome and in proximity to gene regions. Conclusions Our analysis has found and corrected several errors in the annotation of HML-2 elements in the human genome, including mislabeling of a newly identified group called HML-11. HML-elements have been implicated in a wide array of diseases, and characterization of these elements will play a fundamental role to understand the relationship between endogenous retrovirus expression and disease. PMID:22067224

  10. Genomic analysis of NAC transcription factors in banana (Musa acuminata) and definition of NAC orthologous groups for monocots and dicots.

    PubMed

    Cenci, Albero; Guignon, Valentin; Roux, Nicolas; Rouard, Mathieu

    2014-05-01

    Identifying the molecular mechanisms underlying tolerance to abiotic stresses is important in crop breeding. A comprehensive understanding of the gene families associated with drought tolerance is therefore highly relevant. NAC transcription factors form a large plant-specific gene family involved in the regulation of tissue development and responses to biotic and abiotic stresses. The main goal of this study was to set up a framework of orthologous groups determined by an expert sequence comparison of NAC genes from both monocots and dicots. In order to clarify the orthologous relationships among NAC genes of different species, we performed an in-depth comparative study of four divergent taxa, in dicots and monocots, whose genomes have already been completely sequenced: Arabidopsis thaliana, Vitis vinifera, Musa acuminata and Oryza sativa. Due to independent evolution, NAC copy number is highly variable in these plant genomes. Based on an expert NAC sequence comparison, we propose forty orthologous groups of NAC sequences that were probably derived from an ancestor gene present in the most recent common ancestor of dicots and monocots. These orthologous groups provide a curated resource for large-scale protein sequence annotation of NAC transcription factors. The established orthology relationships also provide a useful reference for NAC function studies in newly sequenced genomes such as M. acuminata and other plant species. PMID:24570169

  11. Genomic analysis of NAC transcription factors in banana (Musa acuminata) and definition of NAC orthologous groups for monocots and dicots.

    PubMed

    Cenci, Albero; Guignon, Valentin; Roux, Nicolas; Rouard, Mathieu

    2014-05-01

    Identifying the molecular mechanisms underlying tolerance to abiotic stresses is important in crop breeding. A comprehensive understanding of the gene families associated with drought tolerance is therefore highly relevant. NAC transcription factors form a large plant-specific gene family involved in the regulation of tissue development and responses to biotic and abiotic stresses. The main goal of this study was to set up a framework of orthologous groups determined by an expert sequence comparison of NAC genes from both monocots and dicots. In order to clarify the orthologous relationships among NAC genes of different species, we performed an in-depth comparative study of four divergent taxa, in dicots and monocots, whose genomes have already been completely sequenced: Arabidopsis thaliana, Vitis vinifera, Musa acuminata and Oryza sativa. Due to independent evolution, NAC copy number is highly variable in these plant genomes. Based on an expert NAC sequence comparison, we propose forty orthologous groups of NAC sequences that were probably derived from an ancestor gene present in the most recent common ancestor of dicots and monocots. These orthologous groups provide a curated resource for large-scale protein sequence annotation of NAC transcription factors. The established orthology relationships also provide a useful reference for NAC function studies in newly sequenced genomes such as M. acuminata and other plant species.

  12. The first detection and whole genome characterization of the G6P[15] group A rotavirus strain from roe deer.

    PubMed

    Jamnikar-Ciglenecki, Urska; Kuhar, Urska; Sturm, Sabina; Kirbis, Andrej; Racki, Nejc; Steyer, Andrej

    2016-08-15

    Although rotaviruses have been detected in a variety of host species, there are only limited records of their occurrence in deer, where their role is unknown. In this study, group A rotavirus was identified in roe deer during a study of enteric viruses in game animals. 102 samples of intestinal content were collected from roe deer (56), wild boars (29), chamois (10), red deer (6) and mouflon (1), but only one sample from roe deer was positive. Following whole genome sequence analysis, the rotavirus strain D38/14 was characterized by next generation sequencing. The genotype constellation, comprising 11 genome segments, was G6-P[15]-I2-R2-C2-M2-A3-N2-T6-E2-H3. Phylogenetic analysis of the VP7 genome segment showed that the D38/14 rotavirus strain is closely related to the various G6 zoonotic rotavirus strains of bovine-like origin frequently detected in humans. In the VP4 segment, this strain showed high variation compared to that in the P[15] strain found in sheep and in a goat. This finding suggests that rotaviruses from deer are similar to those in other DS-1 rotavirus groups and could constitute a source of zoonotically transmitted rotaviruses. The epidemiological status of group A rotaviruses in deer should be further investigated. PMID:27374907

  13. A novel virus genome discovered in an extreme environment suggests recombination between unrelated groups of RNA and DNA viruses

    PubMed Central

    2012-01-01

    Background Viruses are known to be the most abundant organisms on earth, yet little is known about their collective origin and evolutionary history. With exceptionally high rates of genetic mutation and mosaicism, it is not currently possible to resolve deep evolutionary histories of the known major virus groups. Metagenomics offers a potential means of establishing a more comprehensive view of viral evolution as vast amounts of new sequence data becomes available for comparative analysis. Results Bioinformatic analysis of viral metagenomic sequences derived from a hot, acidic lake revealed a circular, putatively single-stranded DNA virus encoding a major capsid protein similar to those found only in single-stranded RNA viruses. The presence and circular configuration of the complete virus genome was confirmed by inverse PCR amplification from native DNA extracted from lake sediment. The virus genome appears to be the result of a RNA-DNA recombination event between two ostensibly unrelated virus groups. Environmental sequence databases were examined for homologous genes arranged in similar configurations and three similar putative virus genomes from marine environments were identified. This result indicates the existence of a widespread but previously undetected group of viruses. Conclusions This unique viral genome carries implications for theories of virus emergence and evolution, as no mechanism for interviral RNA-DNA recombination has yet been identified, and only scant evidence exists that genetic exchange occurs between such distinct virus lineages. Reviewers This article was reviewed by EK, MK (nominated by PF) and AM. For the full reviews, please go to the Reviewers' comments section. PMID:22515485

  14. Including different groups of genotyped females for genomic prediction in a Nordic Jersey population.

    PubMed

    Gao, H; Madsen, P; Nielsen, U S; Aamand, G P; Su, G; Byskov, K; Jensen, J

    2015-12-01

    Including genotyped females in a reference population (RP) is an obvious way to increase the RP in genomic selection, especially for dairy breeds of limited population size. However, the incorporation of these females must be conducted cautiously because of the potential preferential treatment of the genotyped cows and lower reliabilities of phenotypes compared with the proven pseudo-phenotypes of bulls. Breeding organizations in Denmark, Finland, and Sweden have implemented a female-genotyping project with the possibility of genotyping entire herds using the low-density (LD) chip. In the present study, 5 scenarios for building an RP were investigated in the Nordic Jersey population: (1) bulls only, (2) bulls with females from the LD project, (3) bulls with females from the LD project plus non-LD project females genotyped before their first calving, (4) bulls with females from the LD project plus non-LD project females genotyped after their first calving, and (5) bulls with all genotyped females. The genomically enhanced breeding value (GEBV) was predicted for 8 traits in the Nordic total merit index through a genomic BLUP model using deregressed proof (DRP) as the response variable in all scenarios. In addition, (daughter) yield deviation and raw phenotypic data were studied as response variables for comparison with the DRP, using stature as a model trait. The validation population was formed using a cut-off birth year of 2005 based on the genotyped Nordic Jersey bulls with DRP. The average increment in reliability of the GEBV across the 8 traits investigated was 1.9 to 4.5 percentage points compared with using only bulls in the RP (scenario 1). The addition of all the genotyped females to the RP resulted in the highest gain in reliability (scenario 5), followed by scenario 3, scenario 2, and scenario 4. All scenarios led to inflated GEBV because the regression coefficients are less than 1. However, scenario 2 and scenario 3 led to less bias of genomic predictions

  15. Social evolution. Genomic signatures of evolutionary transitions from solitary to group living.

    PubMed

    Kapheim, Karen M; Pan, Hailin; Li, Cai; Salzberg, Steven L; Puiu, Daniela; Magoc, Tanja; Robertson, Hugh M; Hudson, Matthew E; Venkat, Aarti; Fischman, Brielle J; Hernandez, Alvaro; Yandell, Mark; Ence, Daniel; Holt, Carson; Yocum, George D; Kemp, William P; Bosch, Jordi; Waterhouse, Robert M; Zdobnov, Evgeny M; Stolle, Eckart; Kraus, F Bernhard; Helbing, Sophie; Moritz, Robin F A; Glastad, Karl M; Hunt, Brendan G; Goodisman, Michael A D; Hauser, Frank; Grimmelikhuijzen, Cornelis J P; Pinheiro, Daniel Guariz; Nunes, Francis Morais Franco; Soares, Michelle Prioli Miranda; Tanaka, Érica Donato; Simões, Zilá Luz Paulino; Hartfelder, Klaus; Evans, Jay D; Barribeau, Seth M; Johnson, Reed M; Massey, Jonathan H; Southey, Bruce R; Hasselmann, Martin; Hamacher, Daniel; Biewer, Matthias; Kent, Clement F; Zayed, Amro; Blatti, Charles; Sinha, Saurabh; Johnston, J Spencer; Hanrahan, Shawn J; Kocher, Sarah D; Wang, Jun; Robinson, Gene E; Zhang, Guojie

    2015-06-01

    The evolution of eusociality is one of the major transitions in evolution, but the underlying genomic changes are unknown. We compared the genomes of 10 bee species that vary in social complexity, representing multiple independent transitions in social evolution, and report three major findings. First, many important genes show evidence of neutral evolution as a consequence of relaxed selection with increasing social complexity. Second, there is no single road map to eusociality; independent evolutionary transitions in sociality have independent genetic underpinnings. Third, though clearly independent in detail, these transitions do have similar general features, including an increase in constrained protein evolution accompanied by increases in the potential for gene regulation and decreases in diversity and abundance of transposable elements. Eusociality may arise through different mechanisms each time, but would likely always involve an increase in the complexity of gene networks.

  16. Complete genome sequence and comparative genomic analysis of Mycobacterium massiliense JCM 15300 in the Mycobacterium abscessus group reveal a conserved genomic island MmGI-1 related to putative lipid metabolism.

    PubMed

    Sekizuka, Tsuyoshi; Kai, Masanori; Nakanaga, Kazue; Nakata, Noboru; Kazumi, Yuko; Maeda, Shinji; Makino, Masahiko; Hoshino, Yoshihiko; Kuroda, Makoto

    2014-01-01

    Mycobacterium abscessus group subsp., such as M. massiliense, M. abscessus sensu stricto and M. bolletii, are an environmental organism found in soil, water and other ecological niches, and have been isolated from respiratory tract infection, skin and soft tissue infection, postoperative infection of cosmetic surgery. To determine the unique genetic feature of M. massiliense, we sequenced the complete genome of M. massiliense type strain JCM 15300 (corresponding to CCUG 48898). Comparative genomic analysis was performed among Mycobacterium spp. and among M. abscessus group subspp., showing that additional ß-oxidation-related genes and, notably, the mammalian cell entry (mce) operon were located on a genomic island, M. massiliense Genomic Island 1 (MmGI-1), in M. massiliense. In addition, putative anaerobic respiration system-related genes and additional mycolic acid cyclopropane synthetase-related genes were found uniquely in M. massiliense. Japanese isolates of M. massiliense also frequently possess the MmGI-1 (14/44, approximately 32%) and three unique conserved regions (26/44; approximately 60%, 34/44; approximately 77% and 40/44; approximately 91%), as well as isolates of other countries (Malaysia, France, United Kingdom and United States). The well-conserved genomic island MmGI-1 may play an important role in high growth potential with additional lipid metabolism, extra factors for survival in the environment or synthesis of complex membrane-associated lipids. ORFs on MmGI-1 showed similarities to ORFs of phylogenetically distant M. avium complex (MAC), suggesting that horizontal gene transfer or genetic recombination events might have occurred within MmGI-1 among M. massiliense and MAC. PMID:25503461

  17. Complete Genome Sequence and Comparative Genomic Analysis of Mycobacterium massiliense JCM 15300 in the Mycobacterium abscessus Group Reveal a Conserved Genomic Island MmGI-1 Related to Putative Lipid Metabolism

    PubMed Central

    Nakanaga, Kazue; Nakata, Noboru; Kazumi, Yuko; Maeda, Shinji; Makino, Masahiko; Hoshino, Yoshihiko; Kuroda, Makoto

    2014-01-01

    Mycobacterium abscessus group subsp., such as M. massiliense, M. abscessus sensu stricto and M. bolletii, are an environmental organism found in soil, water and other ecological niches, and have been isolated from respiratory tract infection, skin and soft tissue infection, postoperative infection of cosmetic surgery. To determine the unique genetic feature of M. massiliense, we sequenced the complete genome of M. massiliense type strain JCM 15300 (corresponding to CCUG 48898). Comparative genomic analysis was performed among Mycobacterium spp. and among M. abscessus group subspp., showing that additional ß-oxidation-related genes and, notably, the mammalian cell entry (mce) operon were located on a genomic island, M. massiliense Genomic Island 1 (MmGI-1), in M. massiliense. In addition, putative anaerobic respiration system-related genes and additional mycolic acid cyclopropane synthetase-related genes were found uniquely in M. massiliense. Japanese isolates of M. massiliense also frequently possess the MmGI-1 (14/44, approximately 32%) and three unique conserved regions (26/44; approximately 60%, 34/44; approximately 77% and 40/44; approximately 91%), as well as isolates of other countries (Malaysia, France, United Kingdom and United States). The well-conserved genomic island MmGI-1 may play an important role in high growth potential with additional lipid metabolism, extra factors for survival in the environment or synthesis of complex membrane-associated lipids. ORFs on MmGI-1 showed similarities to ORFs of phylogenetically distant M. avium complex (MAC), suggesting that horizontal gene transfer or genetic recombination events might have occurred within MmGI-1 among M. massiliense and MAC. PMID:25503461

  18. Insights into the strategies used by related group II introns to adapt successfully for the colonisation of a bacterial genome.

    PubMed

    Martínez-Rodríguez, Laura; García-Rodríguez, Fernando M; Molina-Sánchez, María Dolores; Toro, Nicolás; Martínez-Abarca, Francisco

    2014-01-01

    Group II introns are self-splicing RNAs and site-specific mobile retroelements found in bacterial and organellar genomes. The group II intron RmInt1 is present at high copy number in Sinorhizobium meliloti species, and has a multifunctional intron-encoded protein (IEP) with reverse transcriptase/maturase activities, but lacking the DNA-binding and endonuclease domains. We characterized two RmInt1-related group II introns RmInt2 from S. meliloti strain GR4 and Sr.md.I1 from S. medicae strain WSM419 in terms of splicing and mobility activities. We used both wild-type and engineered intron-donor constructs based on ribozyme ΔORF-coding sequence derivatives, and we determined the DNA target requirements for RmInt2, the element most distantly related to RmInt1. The excision and mobility patterns of intron-donor constructs expressing different combinations of IEP and intron RNA provided experimental evidence for the co-operation of IEPs and intron RNAs from related elements in intron splicing and, in some cases, in intron homing. We were also able to identify the DNA target regions recognized by these IEPs lacking the DNA endonuclease domain. Our results provide new insight into the versatility of related group II introns and the possible co-operation between these elements to facilitate the colonization of bacterial genomes.

  19. Assignment of simian rotavirus SA11 temperature-sensitive mutant groups A, C, F, and G to genome segments.

    PubMed

    Gombold, J L; Ramig, R F

    1987-12-01

    Crosses were performed between prototype temperature-sensitive (ts) mutants of simian rotavirus SA11 representing reassortment groups A, C, F, and G and ts mutants of rhesus rotavirus RRV that belonged to different reassortment groups. Wild-type (ts+) reassortant progeny were identified by plaque formation at nonpermissive temperature (39 degrees), picked, and grown to high titer. The ts+ phenotype of the resulting progeny clones was verified by titration at 39 degrees and 31 degrees. The electropherotypes of the ts+ clones were determined by electrophoresis, and parental origin of each genome segment was assigned by comparison of segment mobility to parental markers. Analysis of the parental origin of genome segments in the ts+ reassortants derived from SA11 ts X RRV ts crosses revealed the following map locations of the SA11 prototype ts mutants: tsA(778), segment 4; tsC(606), segment 1; tsF(2124), segment 2; and tsG(2130), segment 6. The assignment of tsA was made on the basis of genome segment segregation in two independent crosses with each of two independent RRV ts mutants. The assignment of tsC was made on the basis of segregation in only a single cross with an RRV ts mutant; however, a larger number of progeny clones were examined from this cross. The lesion of tsF was mapped with data from three independent crosses using two different RRV ts mutants. The assignment of tsG was made on the basis of segregation in three independent crosses, two with RRV ts mutants and one with Wa. The assignments of tsA, tsC, and tsF were confirmed in crosses between RRV ts mutants representing those reassortment groups, and SA11 ts mutants in other reassortment groups.

  20. The genomes, proteomes, and structures of three novel phages that infect the Bacillus cereus group and carry putative virulence factors.

    PubMed

    Grose, Julianne H; Belnap, David M; Jensen, Jordan D; Mathis, Andrew D; Prince, John T; Merrill, Bryan D; Burnett, Sandra H; Breakwell, Donald P

    2014-10-01

    This article reports the results of studying three novel bacteriophages, JL, Shanette, and Basilisk, which infect the pathogen Bacillus cereus and carry genes that may contribute to its pathogenesis. We analyzed host range and superinfection ability, mapped their genomes, and characterized phage structure by mass spectrometry and transmission electron microscopy (TEM). The JL and Shanette genomes were 96% similar and contained 217 open reading frames (ORFs) and 220 ORFs, respectively, while Basilisk has an unrelated genome containing 138 ORFs. Mass spectrometry revealed 23 phage particle proteins for JL and 15 for Basilisk, while only 11 and 4, respectively, were predicted to be present by sequence analysis. Structural protein homology to well-characterized phages suggested that JL and Shanette were members of the family Myoviridae, which was confirmed by TEM. The third phage, Basilisk, was similar only to uncharacterized phages and is an unrelated siphovirus. Cryogenic electron microscopy of this novel phage revealed a T=9 icosahedral capsid structure with the major capsid protein (MCP) likely having the same fold as bacteriophage HK97 MCP despite the lack of sequence similarity. Several putative virulence factors were encoded by these phage genomes, including TerC and TerD involved in tellurium resistance. Host range analysis of all three phages supports genetic transfer of such factors within the B. cereus group, including B. cereus, B. anthracis, and B. thuringiensis. This study provides a basis for understanding these three phages and other related phages as well as their contributions to the pathogenicity of B. cereus group bacteria. Importance: The Bacillus cereus group of bacteria contains several human and plant pathogens, including B. cereus, B. anthracis, and B. thuringiensis. Phages are intimately linked to the evolution of their bacterial hosts and often provide virulence factors, making the study of B. cereus phages important to understanding the

  1. Genome Sequence of Enterobacter cloacae Strain SENG-6, a Bacterium Producing Histo-Blood Group Antigen-Like Substances That Can Bind with Human Noroviruses

    PubMed Central

    Amarasiri, Mohan; Hashiba, Satoshi; Yang, Peiyi; Okabe, Satoshi

    2016-01-01

    Enterobacter sp. strain SENG-6, isolated from healthy human feces, produces histo-blood group antigen (HBGA)-like substances that can bind with human noroviruses. Based on the genome sequence analysis, strain SENG-6 belongs to the species Enterobacter cloacae. The genome sequence of this strain should help identify genes associated with the production of HBGA-like substances. PMID:27563051

  2. Genome Sequence of Enterobacter cloacae Strain SENG-6, a Bacterium Producing Histo-Blood Group Antigen-Like Substances That Can Bind with Human Noroviruses.

    PubMed

    Ishii, Satoshi; Amarasiri, Mohan; Hashiba, Satoshi; Yang, Peiyi; Okabe, Satoshi; Sano, Daisuke

    2016-01-01

    Enterobacter sp. strain SENG-6, isolated from healthy human feces, produces histo-blood group antigen (HBGA)-like substances that can bind with human noroviruses. Based on the genome sequence analysis, strain SENG-6 belongs to the species Enterobacter cloacae The genome sequence of this strain should help identify genes associated with the production of HBGA-like substances. PMID:27563051

  3. Genome Sequence of Enterobacter cloacae Strain SENG-6, a Bacterium Producing Histo-Blood Group Antigen-Like Substances That Can Bind with Human Noroviruses.

    PubMed

    Ishii, Satoshi; Amarasiri, Mohan; Hashiba, Satoshi; Yang, Peiyi; Okabe, Satoshi; Sano, Daisuke

    2016-01-01

    Enterobacter sp. strain SENG-6, isolated from healthy human feces, produces histo-blood group antigen (HBGA)-like substances that can bind with human noroviruses. Based on the genome sequence analysis, strain SENG-6 belongs to the species Enterobacter cloacae The genome sequence of this strain should help identify genes associated with the production of HBGA-like substances.

  4. Variable Tick Protein in Two Genomic Groups of the Relapsing Fever Spirochete Borrelia hermsii in Western North America

    PubMed Central

    Porcella, Stephen F.; Raffel, Sandra J.; Anderson, Donald E.; Gilk, Stacey D.; Bono, James L.; Schrumpf, Merry E.; Schwan, Tom G.

    2005-01-01

    Borrelia hermsii is the primary cause of tick-borne relapsing fever in North America. When its tick vector, Ornithodoros hermsi, acquires these spirochetes from the blood of an infected mammal, the bacteria switch their outer surface from one of many bloodstream variable major proteins (Vmps) to a unique protein, Vtp (Vsp33). Vtp may be critical for successful tick transmission of B. hermsii; however, the gene encoding this protein has been described previously in only one isolate. Here we identified and sequenced the vtp gene in 31 isolates of B. hermsii collected over 40 years from localities throughout much of its known geographic distribution. Seven major Vtp types were found. Little or no sequence variation existed within types, but between them significant variation was observed, similar to the pattern of diversity described for the outer surface protein C (OspC) gene in Lyme disease spirochetes. The pattern of sequence relatedness among the Vtp types was incongruent in two branches compared to two genomic groups identified among the isolates by multilocus sequence typing of the 16S rRNA, flaB, gyrB, and glpQ genes. Therefore, both horizontal transfer and recombination within and between the two genomic groups were responsible for some of the variation observed in the vtp gene. O. hermsi ticks were capable of transmitting spirochetes in the newly identified genomic group. Therefore, given the longevity of the tick vector and persistent infection of spirochetes in ticks, these arthropods rather than mammals may be the likely host where the exchange of spirochetal DNA occurs. PMID:16177341

  5. Admixture patterns and genetic differentiation in negrito groups from West Malaysia estimated from genome-wide SNP data.

    PubMed

    Jinam, Timothy A; Phipps, Maude E; Saitou, Naruya

    2013-01-01

    Southeast Asia houses various culturally and linguistically diverse ethnic groups. In Malaysia, where the Malay, Chinese, and Indian ethnic groups form the majority, there exist minority groups such as the "negritos" who are believed to be descendants of the earliest settlers of Southeast Asia. Here we report patterns of genetic substructure and admixture in two Malaysian negrito populations (Jehai and Kensiu), using ~50,000 genome-wide single-nucleotide polymorphism (SNP) data. We found traces of recent admixture in both the negrito populations, particularly in the Jehai, with the Malay through principal component analysis and STRUCTURE analysis software, which suggested that the admixture was as recent as one generation ago. We also identified significantly differentiated nonsynonymous SNPs and haplotype blocks related to intracellular transport, metabolic processes, and detection of stimulus. These results highlight the different levels of admixture experienced by the two Malaysian negritos. Delineating admixture and differentiated genomic regions should be of importance in designing and interpretation of molecular anthropology and disease association studies. PMID:24297225

  6. Genomic study of the Ket: a Paleo-Eskimo-related ethnic group with significant ancient North Eurasian ancestry.

    PubMed

    Flegontov, Pavel; Changmai, Piya; Zidkova, Anastassiya; Logacheva, Maria D; Altınışık, N Ezgi; Flegontova, Olga; Gelfand, Mikhail S; Gerasimov, Evgeny S; Khrameeva, Ekaterina E; Konovalova, Olga P; Neretina, Tatiana; Nikolsky, Yuri V; Starostin, George; Stepanova, Vita V; Travinsky, Igor V; Tříska, Martin; Tříska, Petr; Tatarinova, Tatiana V

    2016-01-01

    The Kets, an ethnic group in the Yenisei River basin, Russia, are considered the last nomadic hunter-gatherers of Siberia, and Ket language has no transparent affiliation with any language family. We investigated connections between the Kets and Siberian and North American populations, with emphasis on the Mal'ta and Paleo-Eskimo ancient genomes, using original data from 46 unrelated samples of Kets and 42 samples of their neighboring ethnic groups (Uralic-speaking Nganasans, Enets, and Selkups). We genotyped over 130,000 autosomal SNPs, identified mitochondrial and Y-chromosomal haplogroups, and performed high-coverage genome sequencing of two Ket individuals. We established that Nganasans, Kets, Selkups, and Yukaghirs form a cluster of populations most closely related to Paleo-Eskimos in Siberia (not considering indigenous populations of Chukotka and Kamchatka). Kets are closely related to modern Selkups and to some Bronze and Iron Age populations of the Altai region, with all these groups sharing a high degree of Mal'ta ancestry. Implications of these findings for the linguistic hypothesis uniting Ket and Na-Dene languages into a language macrofamily are discussed. PMID:26865217

  7. Genomic study of the Ket: a Paleo-Eskimo-related ethnic group with significant ancient North Eurasian ancestry

    PubMed Central

    Flegontov, Pavel; Changmai, Piya; Zidkova, Anastassiya; Logacheva, Maria D.; Altınışık, N. Ezgi; Flegontova, Olga; Gelfand, Mikhail S.; Gerasimov, Evgeny S.; Khrameeva, Ekaterina E.; Konovalova, Olga P.; Neretina, Tatiana; Nikolsky, Yuri V.; Starostin, George; Stepanova, Vita V.; Travinsky, Igor V.; Tříska, Martin; Tříska, Petr; Tatarinova, Tatiana V.

    2016-01-01

    The Kets, an ethnic group in the Yenisei River basin, Russia, are considered the last nomadic hunter-gatherers of Siberia, and Ket language has no transparent affiliation with any language family. We investigated connections between the Kets and Siberian and North American populations, with emphasis on the Mal’ta and Paleo-Eskimo ancient genomes, using original data from 46 unrelated samples of Kets and 42 samples of their neighboring ethnic groups (Uralic-speaking Nganasans, Enets, and Selkups). We genotyped over 130,000 autosomal SNPs, identified mitochondrial and Y-chromosomal haplogroups, and performed high-coverage genome sequencing of two Ket individuals. We established that Nganasans, Kets, Selkups, and Yukaghirs form a cluster of populations most closely related to Paleo-Eskimos in Siberia (not considering indigenous populations of Chukotka and Kamchatka). Kets are closely related to modern Selkups and to some Bronze and Iron Age populations of the Altai region, with all these groups sharing a high degree of Mal’ta ancestry. Implications of these findings for the linguistic hypothesis uniting Ket and Na-Dene languages into a language macrofamily are discussed. PMID:26865217

  8. Genomic Analysis of the Emergence and Rapid Global Dissemination of the Clonal Group 258 Klebsiella pneumoniae Pandemic.

    PubMed

    Bowers, Jolene R; Kitchel, Brandon; Driebe, Elizabeth M; MacCannell, Duncan R; Roe, Chandler; Lemmer, Darrin; de Man, Tom; Rasheed, J Kamile; Engelthaler, David M; Keim, Paul; Limbago, Brandi M

    2015-01-01

    Multidrug-resistant Klebsiella pneumoniae producing the KPC carbapenemase have rapidly spread throughout the world, causing severe healthcare-associated infections with limited antimicrobial treatment options. Dissemination of KPC-producing K. pneumoniae is largely attributed to expansion of a single dominant strain, ST258. In this study, we explore phylogenetic relationships and evolution within ST258 and its clonal group, CG258, using whole genome sequence analysis of 167 isolates from 20 countries collected over 17 years. Our results show a common ST258 ancestor emerged from its diverse parental clonal group around 1995 and likely acquired blaKPC prior to dissemination. Over the past two decades, ST258 has remained highly clonal despite diversity in accessory elements and divergence in the capsule polysaccharide synthesis locus. Apart from the large recombination event that gave rise to ST258, few mutations set it apart from its clonal group. However, one mutation occurs in a global transcription regulator. Characterization of outer membrane protein sequences revealed a profile in ST258 that includes a truncated OmpK35 and modified OmpK37. Our work illuminates potential genomic contributors to the pathogenic success of ST258, helps us better understand the global dissemination of this strain, and identifies genetic markers unique to ST258.

  9. Genomic Analysis of the Emergence and Rapid Global Dissemination of the Clonal Group 258 Klebsiella pneumoniae Pandemic

    PubMed Central

    Driebe, Elizabeth M.; MacCannell, Duncan R.; Roe, Chandler; Lemmer, Darrin; de Man, Tom; Rasheed, J. Kamile; Engelthaler, David M.; Keim, Paul; Limbago, Brandi M.

    2015-01-01

    Multidrug-resistant Klebsiella pneumoniae producing the KPC carbapenemase have rapidly spread throughout the world, causing severe healthcare-associated infections with limited antimicrobial treatment options. Dissemination of KPC-producing K. pneumoniae is largely attributed to expansion of a single dominant strain, ST258. In this study, we explore phylogenetic relationships and evolution within ST258 and its clonal group, CG258, using whole genome sequence analysis of 167 isolates from 20 countries collected over 17 years. Our results show a common ST258 ancestor emerged from its diverse parental clonal group around 1995 and likely acquired blaKPC prior to dissemination. Over the past two decades, ST258 has remained highly clonal despite diversity in accessory elements and divergence in the capsule polysaccharide synthesis locus. Apart from the large recombination event that gave rise to ST258, few mutations set it apart from its clonal group. However, one mutation occurs in a global transcription regulator. Characterization of outer membrane protein sequences revealed a profile in ST258 that includes a truncated OmpK35 and modified OmpK37. Our work illuminates potential genomic contributors to the pathogenic success of ST258, helps us better understand the global dissemination of this strain, and identifies genetic markers unique to ST258. PMID:26196384

  10. Sequence variants from whole genome sequencing a large group of Icelanders.

    PubMed

    Gudbjartsson, Daniel F; Sulem, Patrick; Helgason, Hannes; Gylfason, Arnaldur; Gudjonsson, Sigurjon A; Zink, Florian; Oddson, Asmundur; Magnusson, Gisli; Halldorsson, Bjarni V; Hjartarson, Eirikur; Sigurdsson, Gunnar Th; Kong, Augustine; Helgason, Agnar; Masson, Gisli; Magnusson, Olafur Th; Thorsteinsdottir, Unnur; Stefansson, Kari

    2015-01-01

    We have accumulated considerable data on the genetic makeup of the Icelandic population by sequencing the whole genomes of 2,636 Icelanders to depth of at least 10X and by chip genotyping 101,584 more. The sequencing was done with Illumina technology. The median sequencing depth was 20X and 909 individuals were sequenced to a depth of at least 30X. We found 20 million single nucleotide polymorphisms (SNPs) and 1.5 million insertions/deletions (indels) that passed stringent quality control. Almost all the common SNPs (derived allele frequency (DAF) over 2%) that we identified in Iceland have been observed by either dbSNP (build 137) or the Exome Sequencing Project (ESP) while only 60 and 20% of rare (DAF<0.5%) SNPs and indels in coding regions, the most heavily studied parts of the genome, have been observed in the public databases. Features of our variant data, such as the transition/transversion ratio and the length distribution of indels, are similar to published reports. PMID:25977816

  11. Genome sequence of Shimia str. SK013, a representative of the Roseobacter group isolated from marine sediment

    DOE PAGES

    Kanukollu, Saranya; Voget, Sonja; Pohlner, Marion; Vandieken, Verona; Petersen, Jörn; Kyrpides, Nikos C.; Woyke, Tanja; Shapiro, Nicole; Göker, Markus; Klenk, Hans -Peter; et al

    2016-03-12

    Shimia strain SK013 is an aerobic, Gram-negative, rod shaped alphaproteobacterium affiliated with the Roseobacter group within the family Rhodobacteraceae. The strain was isolated from surface sediment (0-1 cm) of the Skagerrak at 114 m below sea level. The 4,049,808 bp genome of Shimia str. SK013 comprises 3,981 protein-coding genes and 47 RNA genes. It contains one chromosome and no extrachromosomal elements. The genome analysis revealed the presence of genes for a dimethylsulfoniopropionate lyase, demethylase and the trimethylamine methyltransferase (mttB) as well as genes for nitrate, nitrite and dimethyl sulfoxide reduction. This indicates that Shimia str. SK013 is able to switchmore » from aerobic to anaerobic metabolism and thus is capable of aerobic and anaerobic sulfur cycling at the seafloor. Among the ability to convert other sulfur compounds it has the genetic capacity to produce climatically active dimethyl sulfide. Growth on glutamate as a sole carbon source results in formation of cell-connecting filaments, a putative phenotypic adaptation of the surface-associated strain to the environmental conditions at the seafloor. Genome analysis revealed the presence of a flagellum (fla1) and a type IV pilus biogenesis, which is speculated to be a prerequisite for biofilm formation. This is also related to genes responsible for signalling such as N-acyl homoserine lactones, as well as quip-genes responsible for quorum quenching and antibiotic biosynthesis. Pairwise similarities of 16S rRNA genes (98.56 % sequence similarity to the next relative S. haliotis) and the in silico DNA-DNA hybridization (21.20 % sequence similarity to S. haliotis) indicated Shimia str. SK013 to be considered as a new species. In conclusion, the genome analysis of Shimia str. SK013 offered first insights into specific physiological and phenotypic adaptation mechanisms of Roseobacter-affiliated bacteria to the benthic environment.« less

  12. Implications of Genome-Based Discrimination between Clostridium botulinum Group I and Clostridium sporogenes Strains for Bacterial Taxonomy

    PubMed Central

    Weigand, Michael R.; Pena-Gonzalez, Angela; Shirey, Timothy B.; Broeker, Robin G.; Ishaq, Maliha K.; Konstantinidis, Konstantinos T.

    2015-01-01

    Taxonomic classification of Clostridium botulinum is based on the production of botulinum neurotoxin (BoNT), while closely related, nontoxic organisms are classified as Clostridium sporogenes. However, this taxonomic organization does not accurately mirror phylogenetic relationships between these species. A phylogenetic reconstruction using 2,016 orthologous genes shared among strains of C. botulinum group I and C. sporogenes clearly separated these two species into discrete clades which showed ∼93% average nucleotide identity (ANI) between them. Clustering of strains based on the presence of variable orthologs revealed 143 C. sporogenes clade-specific genetic signatures, a subset of which were further evaluated for their ability to correctly classify a panel of presumptive C. sporogenes strains by PCR. Genome sequencing of several C. sporogenes strains lacking these signatures confirmed that they clustered with C. botulinum strains in a core genome phylogenetic tree. Our analysis also identified C. botulinum strains that contained C. sporogenes clade-specific signatures and phylogenetically clustered with C. sporogenes strains. The genome sequences of two bont/B2-containing strains belonging to the C. sporogenes clade contained regions with similarity to a bont-bearing plasmid (pCLD), while two different strains belonging to the C. botulinum clade carried bont/B2 on the chromosome. These results indicate that bont/B2 was likely acquired by C. sporogenes strains through horizontal gene transfer. The genome-based classification of these species used to identify candidate genes for the development of rapid assays for molecular identification may be applicable to additional bacterial species that are challenging with respect to their classification. PMID:26048939

  13. Complete Genome Sequence of Bacillus cereus Group Phage TsarBomba.

    PubMed

    Erill, Ivan; Caruso, Steven M

    2015-01-01

    The Bacillus cereus group bacteriophage TsarBomba, a double-stranded DNA Myoviridae, was isolated from soil collected in Saratov, Russia. TsarBomba was found to be similar to Bacillus phages BCP78 and BCU4, and to have a wide host range among Bacillus cereus group species.

  14. Viruses within animal genomes.

    PubMed

    De Brognier, A; Willems, L

    2016-04-01

    Viruses and their hosts can co-evolve to reach a fragile equilibrium that allows the survival of both. An excess of pathogenicity in the absence of a reservoir would be detrimental to virus survival. A significant proportion of all animal genomes has been shaped by the insertion of viruses that subsequently became 'fossilised'. Most endogenous viruses have lost the capacity to replicate via an infectious cycle and now replicate passively. The insertion of endogenous viruses has contributed to the evolution of animal genomes, for example in the reproductive biology of mammals. However, spontaneous viral integration still occasionally occurs in a number of virus-host systems. This constitutes a potential risk to host survival but also provides an opportunity for diversification and evolution.

  15. Comparative genomics of a plant-parasitic nematode endosymbiont suggest a role in nutritional symbiosis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Bacterial mutualists can increase the biochemical capacity of animals. Highly co-evolved nutritional mutualists do this by synthesizing nutrients missing from the host's diet. Genomics tools have recently advanced the study of these partnerships. Here we examined the endosymbiont Xiphinematobacter (...

  16. Genomic identification of WRKY transcription factors in carrot (Daucus carota) and analysis of evolution and homologous groups for plants.

    PubMed

    Li, Meng-Yao; Xu, Zhi-Sheng; Tian, Chang; Huang, Ying; Wang, Feng; Xiong, Ai-Sheng

    2016-01-01

    WRKY transcription factors belong to one of the largest transcription factor families. These factors possess functions in plant growth and development, signal transduction, and stress response. Here, we identified 95 DcWRKY genes in carrot based on the carrot genomic and transcriptomic data, and divided them into three groups. Phylogenetic analysis of WRKY proteins from carrot and Arabidopsis divided these proteins into seven subgroups. To elucidate the evolution and distribution of WRKY transcription factors in different species, we constructed a schematic of the phylogenetic tree and compared the WRKY family factors among 22 species, which including plants, slime mold and protozoan. An in-depth study was performed to clarify the homologous factor groups of nine divergent taxa in lower and higher plants. Based on the orthologous factors between carrot and Arabidopsis, 38 DcWRKY proteins were calculated to interact with other proteins in the carrot genome. Yeast two-hybrid assay showed that DcWRKY20 can interact with DcMAPK1 and DcMAPK4. The expression patterns of the selected DcWRKY genes based on transcriptome data and qRT-PCR suggested that those selected DcWRKY genes are involved in root development, biotic and abiotic stress response. This comprehensive analysis provides a basis for investigating the evolution and function of WRKY genes. PMID:26975939

  17. Genomic identification of WRKY transcription factors in carrot (Daucus carota) and analysis of evolution and homologous groups for plants.

    PubMed

    Li, Meng-Yao; Xu, Zhi-Sheng; Tian, Chang; Huang, Ying; Wang, Feng; Xiong, Ai-Sheng

    2016-03-15

    WRKY transcription factors belong to one of the largest transcription factor families. These factors possess functions in plant growth and development, signal transduction, and stress response. Here, we identified 95 DcWRKY genes in carrot based on the carrot genomic and transcriptomic data, and divided them into three groups. Phylogenetic analysis of WRKY proteins from carrot and Arabidopsis divided these proteins into seven subgroups. To elucidate the evolution and distribution of WRKY transcription factors in different species, we constructed a schematic of the phylogenetic tree and compared the WRKY family factors among 22 species, which including plants, slime mold and protozoan. An in-depth study was performed to clarify the homologous factor groups of nine divergent taxa in lower and higher plants. Based on the orthologous factors between carrot and Arabidopsis, 38 DcWRKY proteins were calculated to interact with other proteins in the carrot genome. Yeast two-hybrid assay showed that DcWRKY20 can interact with DcMAPK1 and DcMAPK4. The expression patterns of the selected DcWRKY genes based on transcriptome data and qRT-PCR suggested that those selected DcWRKY genes are involved in root development, biotic and abiotic stress response. This comprehensive analysis provides a basis for investigating the evolution and function of WRKY genes.

  18. Random forest fishing: a novel approach to identifying organic group of risk factors in genome-wide association studies

    PubMed Central

    Yang, Wei; Charles Gu, C

    2014-01-01

    Genome-wide association studies (GWAS) has brought methodological challenges in handling massive high-dimensional data and also real opportunities for studying the joint effect of many risk factors acting in concert as an organic group. The random forest (RF) methodology is recognized by many for its potential in examining interaction effects in large data sets. However, RF is not designed to directly handle GWAS data, which typically have hundreds of thousands of single-nucleotide polymorphisms as predictor variables. We propose and evaluate a novel extension of RF, called random forest fishing (RFF), for GWAS analysis. RFF repeatedly updates a relatively small set of predictors obtained by RF tests to find globally important groups predictive of the disease phenotype, using a novel search algorithm based on genetic programming and simulated annealing. A key improvement of RFF results from the use of guidance incorporating empirical test results of genome-wide pairwise interactions. Evaluated using simulated and real GWAS data sets, RFF is shown to be effective in identifying important predictors, particularly when both marginal effects and interactions exist, and is applicable to very large GWAS data sets. PMID:23695277

  19. Genomic identification of WRKY transcription factors in carrot (Daucus carota) and analysis of evolution and homologous groups for plants

    PubMed Central

    Li, Meng-Yao; Xu, Zhi-Sheng; Tian, Chang; Huang, Ying; Wang, Feng; Xiong, Ai-Sheng

    2016-01-01

    WRKY transcription factors belong to one of the largest transcription factor families. These factors possess functions in plant growth and development, signal transduction, and stress response. Here, we identified 95 DcWRKY genes in carrot based on the carrot genomic and transcriptomic data, and divided them into three groups. Phylogenetic analysis of WRKY proteins from carrot and Arabidopsis divided these proteins into seven subgroups. To elucidate the evolution and distribution of WRKY transcription factors in different species, we constructed a schematic of the phylogenetic tree and compared the WRKY family factors among 22 species, which including plants, slime mold and protozoan. An in-depth study was performed to clarify the homologous factor groups of nine divergent taxa in lower and higher plants. Based on the orthologous factors between carrot and Arabidopsis, 38 DcWRKY proteins were calculated to interact with other proteins in the carrot genome. Yeast two-hybrid assay showed that DcWRKY20 can interact with DcMAPK1 and DcMAPK4. The expression patterns of the selected DcWRKY genes based on transcriptome data and qRT-PCR suggested that those selected DcWRKY genes are involved in root development, biotic and abiotic stress response. This comprehensive analysis provides a basis for investigating the evolution and function of WRKY genes. PMID:26975939

  20. Genomic identification of group A bZIP transcription factors and their responses to abiotic stress in carrot.

    PubMed

    Que, F; Wang, G L; Huang, Y; Xu, Z S; Wang, F; Xiong, A S

    2015-01-01

    The basic-region/leucine-zipper (bZIP) family is one of the major transcription factor (TF) families associated with responses to abiotic stresses. Many members of group A in this family have been extensively examined and are reported to perform significant functions in ABA signaling as well as in responses to abiotic stresses. In this study, 10 bZIP factors in carrot were classified into group A based on their DNA-binding domains. The cis-acting regulatory elements and folding states of these 10 factors were analyzed. Evolutionary analysis of the group A members suggested their importance during the course of evolution in plants. In addition, cis-acting elements and the folding state of proteins were important for DNA binding and could affect gene expression. Quantitative RT-PCR was conducted to investigate the stress response of 10 genes encoding the group A factors. Six genes showed responses to abiotic stresses, while four genes showed other special phenomenon. The current analysis on group A bZIP family TFs in carrot is the first to investigate the TFs of Apiaceae via genome analysis. These results provide new information for future studies on carrot. PMID:26535641

  1. The genomic organization of the Fanconi anemia group A (FAA) gene

    SciTech Connect

    Ianzano, L.; Centra, M.; Savino, M.

    1997-05-01

    Fanconi anemia (FA) is a genetically heterogeneous disease involving at least five genes on the basis of complementation analysis (FAA to FAE). The FAA gene has been recently isolated by two independent approaches, positional and functional cloning. In the present study we describe the genomic structure of the FAA gene. The gene contains 43 exons spanning approximately 80 kb as determined by the alignment of four cosmids and the fine localization of the first and the last exons in restriction fragments of these clones. Exons range from 34 to 188 bp. All but three of the splice sites were consistent with the ag-gt rule. We also describe three alternative splicing events in cDNA clones that result in the loss of exon 37, a 23-bp deletion at the 5{prime} end of exon 41. Sequence analysis of the 5{prime} region upstream of the putative transcription start site showed no obvious TATA and CAAT boxes, but did show a GC-rich region, typical of housekeeping genes. Knowledge of the structure of the FAA gene will provide an invaluable resource for the discovery of mutations in the gene that accounts for about 60-66% of FA patients. 24 refs., 3 figs., 1 tab.

  2. Insights from the Genome Sequence of Acidovorax citrulli M6, a Group I Strain of the Causal Agent of Bacterial Fruit Blotch of Cucurbits

    PubMed Central

    Eckshtain-Levi, Noam; Shkedy, Dafna; Gershovits, Michael; Da Silva, Gustavo M.; Tamir-Ariel, Dafna; Walcott, Ron; Pupko, Tal; Burdman, Saul

    2016-01-01

    Acidovorax citrulli is a seedborne bacterium that causes bacterial fruit blotch of cucurbit plants including watermelon and melon. A. citrulli strains can be divided into two major groups based on DNA fingerprint analyses and biochemical properties. Group I strains have been generally isolated from non-watermelon cucurbits, while group II strains are closely associated with watermelon. In the present study, we report the genome sequence of M6, a group I model A. citrulli strain, isolated from melon. We used comparative genome analysis to investigate differences between the genome of strain M6 and the genome of the group II model strain AAC00-1. The draft genome sequence of A. citrulli M6 harbors 139 contigs, with an overall approximate size of 4.85 Mb. The genome of M6 is ∼500 Kb shorter than that of strain AAC00-1. Comparative analysis revealed that this size difference is mainly explained by eight fragments, ranging from ∼35–120 Kb and distributed throughout the AAC00-1 genome, which are absent in the M6 genome. In agreement with this finding, while AAC00-1 was found to possess 532 open reading frames (ORFs) that are absent in strain M6, only 123 ORFs in M6 were absent in AAC00-1. Most of these M6 ORFs are hypothetical proteins and most of them were also detected in two group I strains that were recently sequenced, tw6 and pslb65. Further analyses by PCR assays and coverage analyses with other A. citrulli strains support the notion that some of these fragments or significant portions of them are discriminative between groups I and II strains of A. citrulli. Moreover, GC content, effective number of codon values and cluster of orthologs’ analyses indicate that these fragments were introduced into group II strains by horizontal gene transfer events. Our study reports the genome sequence of a model group I strain of A. citrulli, one of the most important pathogens of cucurbits. It also provides the first comprehensive comparison at the genomic level between

  3. Perspective: maternal kin groups and the origins of asymmetric genetic systems-genomic imprinting, haplodiploidy, and parthenogenesis.

    PubMed

    Normark, Benjamin B

    2006-04-01

    The genetic systems of animals and plants are typically eumendelian. That is, an equal complement of autosomes is inherited from each of two parents, and at each locus, each parent's allele is equally likely to be expressed and equally likely to be transmitted. Genetic systems that violate any of these eumendelian symmetries are termed asymmetric and include parent-specific gene expression (PSGE), haplodiploidy, thelytoky, and related systems. Asymmetric genetic systems typically arise in lineages with close associations between kin (gregarious siblings, brooding, or viviparity). To date, different explanatory frameworks have been proposed to account for each of the different asymmetric genetic systems. Haig's kinship theory of genomic imprinting argues that PSGE arises when kinship asymmetries between interacting kin create conflicts between maternally and paternally derived alleles. Greater maternal than paternal relatedness within groups selects for more "abstemious" expression of maternally derived alleles and more "greedy" expression of paternally derived alleles. Here, I argue that this process may also underlie origins of haplodiploidy and many origins of thelytoky. The tendency for paternal alleles to be more "greedy" in maternal kin groups means that maternal-paternal conflict is not a zero-sum game: the maternal optimum will more closely correspond to the optimum for family groups and demes and for associated entities such as symbionts. Often in these circumstances, partial or complete suppression of paternal gene expression will evolve (haplodiploidy, thelytoky), or other features of the life cycle will evolve to minimize the conflict (monogamy, inbreeding). Maternally transmitted cytoplasmic elements and maternally imprinted nuclear alleles have a shared interest in minimizing agonistic interactions between female siblings and may cooperate to exclude the paternal genome. Eusociality is the most dramatic expression of the conflict-reducing effects of

  4. Chemical rescue, multiple ionizable groups, and general acid-base catalysis in the HDV genomic ribozyme.

    PubMed

    Perrotta, Anne T; Wadkins, Timothy S; Been, Michael D

    2006-07-01

    In the ribozyme from the hepatitis delta virus (HDV) genomic strand RNA, a cytosine side chain is proposed to facilitate proton transfer in the transition state of the reaction and, thus, act as a general acid-base catalyst. Mutation of this active-site cytosine (C75) reduced RNA cleavage rates by as much as one million-fold, but addition of exogenous cytosine and certain nucleobase or imidazole analogs can partially rescue activity in these mutants. However, pH-rate profiles for the rescued reactions were bell shaped, and only one leg of the pH-rate curve could be attributed to ionization of the exogenous nucleobase or buffer. When a second potential ionizable nucleobase (C41) was removed, one leg of the bell-shaped curve was eliminated in the chemical-rescue reaction. With this construct, the apparent pK(a) determined from the pH-rate profile correlated with the solution pK(a) of the buffer, and the contribution of the buffer to the rate enhancement could be directly evaluated in a free-energy or Brønsted plot. The free-energy relationship between the acid dissociation constant of the buffer and the rate constant for cleavage (Brønsted value, beta, = approximately 0.5) was consistent with a mechanism in which the buffer acted as a general acid-base catalyst. These data support the hypothesis that cytosine 75, in the intact ribozyme, acts as a general acid-base catalyst.

  5. Phylogenetic study of clonal complex (CC)198 capsule null locus (cnl) genomes: A distinctive group within the species Neisseria meningitidis.

    PubMed

    Xu, Zheng; Du, Pengcheng; Zhu, Bingqing; Xu, Li; Wang, Haiyin; Gao, Yuan; Zhou, Haijian; Zhang, Wen; Chen, Chen; Shao, Zhujun

    2015-08-01

    Capsule null locus (cnl) strains, one type of specific unencapsulated Neisseria spp., only have regions D and E of the capsule gene cluster which encodes the genes for capsule biosynthesis, modification, and transportation. Compared with encapsulated strains, regions A and C of cnl strains have been replaced by 113 or 114 bp conserved non-coding sequences. Cnl strains include multiple clonal complexes (CC). According to previous studies, CC198 is the major clonal lineage in both cnl patients and healthy cnl carriers. We hypothesized that CC198 possesses different genome characteristics compared with other cnl strains. In this study, we obtained the draft genomes of two CC198 strains from healthy carriers. Using 75071 single nucleotide polymorphisms located in 1163 core genes, we constructed the phylogenetic relationships between a batch of representative Neisseria meningitidis genomes. CC198 and CC1136 clustered together, but apart from other N. meningitidis strains including CC53. We also aligned the sequences of genes located in regions D and E of the capsule gene locus from encapsulated and unencapsulated strains. A number of possible recombination events were identified in the galE and tex genes between different serogroups of encapsulated N. meningitidis and CC53 strains, especially in tex. In contrast, there is almost no recombination in N. meningitidis CC198 strains. These results showed that CC198 belongs to a phylogenetically distinct group within the species N. meningitidis, which may be directly derived from the cnl-type ancestor of N. meningitidis. The encapsulated strains may acquire other necessary genes for capsule formation by horizontal transfer.

  6. Use of Longitudinal Data in Genetic Studies in the Genome-wide Association Studies Era: Summary of Group 14

    PubMed Central

    Kerner, Berit; North, Kari E; Fallin, M Daniele

    2010-01-01

    Participants analyzed actual and simulated longitudinal data from the Framingham Heart Study for various metabolic and cardiovascular traits. The genetic information incorporated into these investigations ranged from selected single-nucleotide polymorphisms to genome-wide association arrays. Genotypes were incorporated using a broad range of methodological approaches including conditional logistic regression, linear mixed models, generalized estimating equations, linear growth curve estimation, growth modeling, growth mixture modeling, population attributable risk fraction based on survival functions under the proportional hazards models, and multivariate adaptive splines for the analysis of longitudinal data. The specific scientific questions addressed by these different approaches also varied, ranging from a more precise definition of the phenotype, bias reduction in control selection, estimation of effect sizes and genotype associated risk, to direct incorporation of genetic data into longitudinal modeling approaches and the exploration of population heterogeneity with regard to longitudinal trajectories. The group reached several overall conclusions: 1) The additional information provided by longitudinal data may be useful in genetic analyses. 2) The precision of the phenotype definition as well as control selection in nested designs may be improved, especially if traits demonstrate a trend over time or have strong age-of-onset effects. 3) Analyzing genetic data stratified for high-risk subgroups defined by a unique development over time could be useful for the detection of rare mutations in common multi-factorial diseases. 4) Estimation of the population impact of genomic risk variants could be more precise. The challenges and computational complexity demanded by genome-wide single-nucleotide polymorphism data were also discussed. PMID:19924713

  7. Identification and Genomic Analysis of a Novel Group C Orthobunyavirus Isolated from a Mosquito Captured near Iquitos, Peru.

    PubMed

    Treangen, Todd J; Schoeler, George; Phillippy, Adam M; Bergman, Nicholas H; Turell, Michael J

    2016-04-01

    Group C orthobunyaviruses are single-stranded RNA viruses found in both South and North America. Until very recently, and despite their status as important vector-borne human pathogens, no Group C whole genome sequences containing all three segments were available in public databases. Here we report a Group C orthobunyavirus, named El Huayo virus, isolated from a pool of Culex portesi mosquitoes captured near Iquitos, Peru. Although initial metagenomic analysis yielded only a handful of reads belonging to the genus Orthobunyavirus, single contig assemblies were generated for L, M, and S segments totaling over 200,000 reads (~0.5% of sample). Given the moderately high viremia in hamsters (>107 plaque-forming units/ml) and the propensity for Cx. portesi to feed on rodents, it is possible that El Huayo virus is maintained in nature in a Culex portesi/rodent cycle. El Huayo virus was found to be most similar to Peruvian Caraparu virus isolates and constitutes a novel subclade within Group C.

  8. Identification and Genomic Analysis of a Novel Group C Orthobunyavirus Isolated from a Mosquito Captured near Iquitos, Peru

    PubMed Central

    Treangen, Todd J.; Schoeler, George; Phillippy, Adam M.; Bergman, Nicholas H.; Turell, Michael J.

    2016-01-01

    Group C orthobunyaviruses are single-stranded RNA viruses found in both South and North America. Until very recently, and despite their status as important vector-borne human pathogens, no Group C whole genome sequences containing all three segments were available in public databases. Here we report a Group C orthobunyavirus, named El Huayo virus, isolated from a pool of Culex portesi mosquitoes captured near Iquitos, Peru. Although initial metagenomic analysis yielded only a handful of reads belonging to the genus Orthobunyavirus, single contig assemblies were generated for L, M, and S segments totaling over 200,000 reads (~0.5% of sample). Given the moderately high viremia in hamsters (>107 plaque-forming units/ml) and the propensity for Cx. portesi to feed on rodents, it is possible that El Huayo virus is maintained in nature in a Culex portesi/rodent cycle. El Huayo virus was found to be most similar to Peruvian Caraparu virus isolates and constitutes a novel subclade within Group C. PMID:27074162

  9. Framework for development of physician competencies in genomic medicine: report of the Competencies Working Group of the Inter-Society Coordinating Committee for Physician Education in Genomics.

    PubMed

    Korf, Bruce R; Berry, Anna B; Limson, Melvin; Marian, Ali J; Murray, Michael F; O'Rourke, P Pearl; Passamani, Eugene R; Relling, Mary V; Tooker, John; Tsongalis, Gregory J; Rodriguez, Laura L

    2014-11-01

    Completion of the Human Genome Project, in conjunction with dramatic reductions in the cost of DNA sequencing and advances in translational research, is gradually ushering genomic discoveries and technologies into the practice of medicine. The rapid pace of these advances is opening up a gap between the knowledge available about the clinical relevance of genomic information and the ability of clinicians to include such information in their medical practices. This educational gap threatens to be rate limiting to the clinical adoption of genomics in medicine. Solutions will require not only a better understanding of the clinical implications of genetic discoveries but also training in genomics at all levels of professional development, including for individuals in formal training and others who long ago completed such training. The National Human Genome Research Institute has convened the Inter-Society Coordinating Committee for Physician Education in Genomics (ISCC) to develop and share best practices in the use of genomics in medicine. The ISCC has developed a framework for development of genomics practice competencies that may serve as a starting point for formulation of competencies for physicians in various medical disciplines.

  10. Genome analysis of Elusimicrobium minutum, the first cultivated representative of the Elusimicrobia phylum (formerly Termite Group 1)

    SciTech Connect

    Herlemann, D. P. R.; Geissinger, O.; Ikeda-Ohtsubo, W.; Kunin, V.; Sun, H.; Lapidus, A.; Hugenholtz, P.; Brune, A.

    2009-02-01

    The candidate phylum Termite group 1 (TG1), is regularly 1 encountered in termite hindguts but is present also in many other habitats. Here we report the complete genome sequence (1.64 Mbp) of Elusimicrobium minutum strain Pei191{sup T}, the first cultured representative of the TG1 phylum. We reconstructed the metabolism of this strictly anaerobic bacterium isolated from a beetle larva gut and discuss the findings in light of physiological data. E. minutum has all genes required for uptake and fermentation of sugars via the Embden-Meyerhof pathway, including several hydrogenases, and an unusual peptide degradation pathway comprising transamination reactions and leading to the formation of alanine, which is excreted in substantial amounts. The presence of genes encoding lipopolysaccharide biosynthesis and the presence of a pathway for peptidoglycan formation are consistent with ultrastructural evidence of a Gram-negative cell envelope. Even though electron micrographs showed no cell appendages, the genome encodes many genes putatively involved in pilus assembly. We assigned some to a type II secretion system, but the function of 60 pilE-like genes remains unknown. Numerous genes with hypothetical functions, e.g., polyketide synthesis, non-ribosomal peptide synthesis, antibiotic transport, and oxygen stress protection, indicate the presence of hitherto undiscovered physiological traits. Comparative analysis of 22 concatenated single-copy marker genes corroborated the status of Elusimicrobia (formerly TG1) as a separate phylum in the bacterial domain, which was so far based only on 16S rRNA sequence analysis.

  11. Genomic Recombination Leading to Decreased Virulence of Group B Streptococcus in a Mouse Model of Adult Invasive Disease

    PubMed Central

    Teatero, Sarah; Lemire, Paul; Dewar, Ken; Wasserscheid, Jessica; Calzas, Cynthia; Mallo, Gustavo V.; Li, Aimin; Athey, Taryn B.T.; Segura, Mariela; Fittipaldi, Nahuel

    2016-01-01

    Adult invasive disease caused by Group B Streptococcus (GBS) is increasing worldwide. Whole-genome sequencing (WGS) now permits rapid identification of recombination events, a phenomenon that occurs frequently in GBS. Using WGS, we described that strain NGBS375, a capsular serotype V GBS isolate of sequence type (ST)297, has an ST1 genomic background but has acquired approximately 300 kbp of genetic material likely from an ST17 strain. Here, we examined the virulence of this strain in an in vivo model of GBS adult invasive infection. The mosaic ST297 strain showed intermediate virulence, causing significantly less systemic infection and reduced mortality than a more virulent, serotype V ST1 isolate. Bacteremia induced by the ST297 strain was similar to that induced by a serotype III ST17 strain, which was the least virulent under the conditions tested. Yet, under normalized bacteremia levels, the in vivo intrinsic capacity to induce the production of pro-inflammatory cytokines was similar between the ST297 strain and the virulent ST1 strain. Thus, the diminished virulence of the mosaic strain may be due to reduced capacity to disseminate or multiply in blood during a systemic infection which could be mediated by regulatory factors contained in the recombined region. PMID:27527222

  12. The transcriptome of the reference potato genome Solanum tuberosum Group Phureja clone DM1-3 516R44.

    PubMed

    Massa, Alicia N; Childs, Kevin L; Lin, Haining; Bryan, Glenn J; Giuliano, Giovanni; Buell, C Robin

    2011-01-01

    Advances in molecular breeding in potato have been limited by its complex biological system, which includes vegetative propagation, autotetraploidy, and extreme heterozygosity. The availability of the potato genome and accompanying gene complement with corresponding gene structure, location, and functional annotation are powerful resources for understanding this complex plant and advancing molecular breeding efforts. Here, we report a reference for the potato transcriptome using 32 tissues and growth conditions from the doubled monoploid Solanum tuberosum Group Phureja clone DM1-3 516R44 for which a genome sequence is available. Analysis of greater than 550 million RNA-Seq reads permitted the detection and quantification of expression levels of over 22,000 genes. Hierarchical clustering and principal component analyses captured the biological variability that accounts for gene expression differences among tissues suggesting tissue-specific gene expression, and genes with tissue or condition restricted expression. Using gene co-expression network analysis, we identified 18 gene modules that represent tissue-specific transcriptional networks of major potato organs and developmental stages. This information provides a powerful resource for potato research as well as studies on other members of the Solanaceae family.

  13. Bacterial genome remodeling through bacteriophage recombination.

    PubMed

    Menouni, Rachid; Hutinet, Geoffrey; Petit, Marie-Agnès; Ansaldi, Mireille

    2015-01-01

    Bacteriophages co-exist and co-evolve with their hosts in natural environments. Virulent phages lyse infected cells through lytic cycles, whereas temperate phages often remain dormant and can undergo lysogenic or lytic cycles. In their lysogenic state, prophages are actually part of the host genome and replicate passively in rhythm with host division. However, prophages are far from being passive residents: they can modify or bring new properties to their host. In this review, we focus on two important phage-encoded recombination mechanisms, i.e. site-specific recombination and homologous recombination, and how they remodel bacterial genomes. PMID:25790500

  14. Bacterial genome remodeling through bacteriophage recombination.

    PubMed

    Menouni, Rachid; Hutinet, Geoffrey; Petit, Marie-Agnès; Ansaldi, Mireille

    2015-01-01

    Bacteriophages co-exist and co-evolve with their hosts in natural environments. Virulent phages lyse infected cells through lytic cycles, whereas temperate phages often remain dormant and can undergo lysogenic or lytic cycles. In their lysogenic state, prophages are actually part of the host genome and replicate passively in rhythm with host division. However, prophages are far from being passive residents: they can modify or bring new properties to their host. In this review, we focus on two important phage-encoded recombination mechanisms, i.e. site-specific recombination and homologous recombination, and how they remodel bacterial genomes.

  15. Phylogeny of the Sphaerotilus-Leptothrix group inferred from morphological comparisons, genomic fingerprinting, and 16S ribosomal DNA sequence analyses.

    PubMed

    Siering, P L; Ghiorse, W C

    1996-01-01

    Phase-contrast light microscopy revealed that only one of eight cultivated strains belonging to the Sphaerotilus-Leptothrix group of sheathed bacteria actually produced a sheath in standard growth media. Two Sphaerotilus natans strains produced branched cells, but other morphological characteristics that were used to identify these bacteria were consistent with previously published descriptions. Genomic fingerprints, which were obtained by performing PCR amplification with primers corresponding to enterobacterial repetitive intergenic consensus sequences, were useful for distinguishing between the genera Sphaerotilus and Leptothrix, as well as among individual strains. The complete 16S ribosomal DNA (rDNA) sequences of two strains of "Leptothrix discophora" (strains SP-6 and SS-1) were determined. In addition, partial sequences (approximately 300 nucleotides) of one strain of Leptothrix cholodnii (strain LMG 7171), an unidentified Leptothrix strain (strain NC-1), and four strains of Sphaerotilus natans (strains ATCC 13338T [T = type strain], ATCC 15291, ATCC 29329, and ATCC 29330) were determined. We found that two of the S. natans strains (ATCC 15291 and ATCC 13338T), which differed in morphology and in their genomic fingerprints, had identical sequences in the 300-nucleotide region sequenced. Both parsimony and distance matrix methods were used to infer the evolutionary relationships of the eight strains in a comparison of the 16S rDNA sequences of these organisms with 16S rDNA sequences obtained from ribosomal sequence databases. All of the strains clustered in the Rubrivivax subdivision of the beta subclass of the Proteobacteria, which confirmed previously published conclusions concerning selected individual strains. Additional analyses revealed that all of the S. natans strains clustered in one closely related group, while the Leptothrix strains clustered in two separate lineages that were approximately equidistant from the S. natans cluster. This finding

  16. Phylogeny of the Sphaerotilus-Leptothrix group inferred from morphological comparisons, genomic fingerprinting, and 16S ribosomal DNA sequence analyses.

    PubMed

    Siering, P L; Ghiorse, W C

    1996-01-01

    Phase-contrast light microscopy revealed that only one of eight cultivated strains belonging to the Sphaerotilus-Leptothrix group of sheathed bacteria actually produced a sheath in standard growth media. Two Sphaerotilus natans strains produced branched cells, but other morphological characteristics that were used to identify these bacteria were consistent with previously published descriptions. Genomic fingerprints, which were obtained by performing PCR amplification with primers corresponding to enterobacterial repetitive intergenic consensus sequences, were useful for distinguishing between the genera Sphaerotilus and Leptothrix, as well as among individual strains. The complete 16S ribosomal DNA (rDNA) sequences of two strains of "Leptothrix discophora" (strains SP-6 and SS-1) were determined. In addition, partial sequences (approximately 300 nucleotides) of one strain of Leptothrix cholodnii (strain LMG 7171), an unidentified Leptothrix strain (strain NC-1), and four strains of Sphaerotilus natans (strains ATCC 13338T [T = type strain], ATCC 15291, ATCC 29329, and ATCC 29330) were determined. We found that two of the S. natans strains (ATCC 15291 and ATCC 13338T), which differed in morphology and in their genomic fingerprints, had identical sequences in the 300-nucleotide region sequenced. Both parsimony and distance matrix methods were used to infer the evolutionary relationships of the eight strains in a comparison of the 16S rDNA sequences of these organisms with 16S rDNA sequences obtained from ribosomal sequence databases. All of the strains clustered in the Rubrivivax subdivision of the beta subclass of the Proteobacteria, which confirmed previously published conclusions concerning selected individual strains. Additional analyses revealed that all of the S. natans strains clustered in one closely related group, while the Leptothrix strains clustered in two separate lineages that were approximately equidistant from the S. natans cluster. This finding

  17. The Genomics Education Partnership: Successful Integration of Research into Laboratory Classes at a Diverse Group of Undergraduate Institutions

    ERIC Educational Resources Information Center

    Shaffer, Christopher D.; Alvarez, Consuelo; Bailey, Cheryl; Barnard, Daron; Bhalla, Satish; Chandrasekaran, Chitra; Chandrasekaran, Vidya; Chung, Hui-Min; Dorer, Douglas R.; Du, Chunguang; Eckdahl, Todd T.; Poet, Jeff L.; Frohlich, Donald; Goodman, Anya L.; Gosser, Yuying; Hauser, Charles; Hoopes, Laura L. M.; Johnson, Diana; Jones, Christopher J.; Kaehler, Marian; Kokan, Nighat; Kopp, Olga R.; Kuleck, Gary A.; McNeil, Gerard; Moss, Robert; Myka, Jennifer L.; Nagengast, Alexis; Morris, Robert; Overvoorde, Paul J.; Shoop, Elizabeth; Parrish, Susan; Reed, Kelynne; Regisford, E. Gloria; Revie, Dennis; Rosenwald, Anne G.; Saville, Ken; Schroeder, Stephanie; Shaw, Mary; Skuse, Gary; Smith, Christopher; Smith, Mary; Spana, Eric P.; Spratt, Mary; Stamm, Joyce; Thompson, Jeff S.; Wawersik, Matthew; Wilson, Barbara A.; Youngblom, Jim; Leung, Wilson; Buhler, Jeremy; Mardis, Elaine R.; Lopatto, David; Elgin, Sarah C. R.

    2010-01-01

    Genomics is not only essential for students to understand biology but also provides unprecedented opportunities for undergraduate research. The goal of the Genomics Education Partnership (GEP), a collaboration between a growing number of colleges and universities around the country and the Department of Biology and Genome Center of Washington…

  18. The Genome and Linkage Map of the Northern Pike (Esox lucius): Conserved Synteny Revealed between the Salmonid Sister Group and the Neoteleostei

    PubMed Central

    Rondeau, Eric B.; Minkley, David R.; Leong, Jong S.; Messmer, Amber M.; Jantzen, Johanna R.; von Schalburg, Kristian R.; Lemon, Craig; Bird, Nathan H.; Koop, Ben F.

    2014-01-01

    The northern pike is the most frequently studied member of the Esociformes, the closest order to the diverse and economically important Salmoniformes. The ancestor of all salmonids purportedly experienced a whole-genome duplication (WGD) event, making salmonid species ideal for studying the early impacts of genome duplication while complicating their use in wider analyses of teleost evolution. Studies suggest that the Esociformes diverged from the salmonid lineage prior to the WGD, supporting the use of northern pike as a pre-duplication outgroup. Here we present the first genome assembly, reference transcriptome and linkage map for northern pike, and evaluate the suitability of this species to provide a representative pre-duplication genome for future studies of salmonid and teleost evolution. The northern pike genome sequence is composed of 94,267 contigs (N50 = 16,909 bp) contained in 5,688 scaffolds (N50 = 700,535 bp); the total scaffolded genome size is 878 million bases. Multiple lines of evidence suggest that over 96% of the protein-coding genome is present in the genome assembly. The reference transcriptome was constructed from 13 tissues and contains 38,696 transcripts, which are accompanied by normalized expression data in all tissues. Gene-prediction analysis produced a total of 19,601 northern pike-specific gene models. The first-generation linkage map identifies 25 linkage groups, in agreement with northern pike's diploid karyotype of 2N = 50, and facilitates the placement of 46% of assembled bases onto linkage groups. Analyses reveal a high degree of conserved synteny between northern pike and other model teleost genomes. While conservation of gene order is limited to smaller syntenic blocks, the wider conservation of genome organization implies the northern pike exhibits a suitable approximation of a non-duplicated Protacanthopterygiian genome. This dataset will facilitate future studies of esocid biology and empower ongoing examinations

  19. Attitudes of non-African American focus group participants toward return of results from exome and whole genome sequencing.

    PubMed

    Yu, Joon-Ho; Crouch, Julia; Jamal, Seema M; Bamshad, Michael J; Tabor, Holly K

    2014-09-01

    Exome sequencing and whole genome sequencing (ES/WGS) present individuals with the opportunity to benefit from a broad scope of genetic results of clinical and personal utility. Yet, it is unclear which genetic results people want to receive (i.e., what type of genetic information they want to learn about themselves) or conversely not receive, and how they want to receive or manage results over time. Very little is known about whether and how attitudes toward receiving individual results from ES/WGS vary among racial/ethnic populations. We conducted 13 focus groups with a racially and ethnically diverse parent population (n = 76) to investigate attitudes toward return of individual results from WGS. We report on our findings for non-African American (non-AA) participants. Non-AA participants were primarily interested in genetic results on which they could act or "do something about." They defined "actionability" broadly to include individual medical treatment and disease prevention. The ability to plan for the future was both a motivation for and an expected benefit of receiving results. Their concerns focused on the meaning of results, specifically the potential inaccuracy and uncertainty of results. Non-AA participants expected healthcare providers to be involved in results management by helping them interpret results in the context of their own health and by providing counseling support. We compare and contrast these themes with those we previously reported from our analysis of African American (AA) perspectives to highlight the importance of varying preferences for results, characterize the central role of temporal orientation in framing expectations about the possibility of receiving ES/WGS results, and identify potential avenues by which genomic healthcare disparities may be inadvertently perpetuated.

  20. Evolutionary pathway to increased virulence and epidemic group A Streptococcus disease derived from 3,615 genome sequences.

    PubMed

    Nasser, Waleed; Beres, Stephen B; Olsen, Randall J; Dean, Melissa A; Rice, Kelsey A; Long, S Wesley; Kristinsson, Karl G; Gottfredsson, Magnus; Vuopio, Jaana; Raisanen, Kati; Caugant, Dominique A; Steinbakk, Martin; Low, Donald E; McGeer, Allison; Darenberg, Jessica; Henriques-Normark, Birgitta; Van Beneden, Chris A; Hoffmann, Steen; Musser, James M

    2014-04-29

    We sequenced the genomes of 3,615 strains of serotype Emm protein 1 (M1) group A Streptococcus to unravel the nature and timing of molecular events contributing to the emergence, dissemination, and genetic diversification of an unusually virulent clone that now causes epidemic human infections worldwide. We discovered that the contemporary epidemic clone emerged in stepwise fashion from a precursor cell that first contained the phage encoding an extracellular DNase virulence factor (streptococcal DNase D2, SdaD2) and subsequently acquired the phage encoding the SpeA1 variant of the streptococcal pyrogenic exotoxin A superantigen. The SpeA2 toxin variant evolved from SpeA1 by a single-nucleotide change in the M1 progenitor strain before acquisition by horizontal gene transfer of a large chromosomal region encoding secreted toxins NAD(+)-glycohydrolase and streptolysin O. Acquisition of this 36-kb region in the early 1980s into just one cell containing the phage-encoded sdaD2 and speA2 genes was the final major molecular event preceding the emergence and rapid intercontinental spread of the contemporary epidemic clone. Thus, we resolve a decades-old controversy about the type and sequence of genomic alterations that produced this explosive epidemic. Analysis of comprehensive, population-based contemporary invasive strains from seven countries identified strong patterns of temporal population structure. Compared with a preepidemic reference strain, the contemporary clone is significantly more virulent in nonhuman primate models of pharyngitis and necrotizing fasciitis. A key finding is that the molecular evolutionary events transpiring in just one bacterial cell ultimately have produced millions of human infections worldwide. PMID:24733896

  1. Extensive Capsule Locus Variation and Large-Scale Genomic Recombination within the Klebsiella pneumoniae Clonal Group 258

    PubMed Central

    Wyres, Kelly L.; Gorrie, Claire; Edwards, David J.; Wertheim, Heiman F.L.; Hsu, Li Yang; Van Kinh, Nguyen; Zadoks, Ruth; Baker, Stephen; Holt, Kathryn E.

    2015-01-01

    Klebsiella pneumoniae clonal group (CG) 258, comprising sequence types (STs) 258, 11, and closely related variants, is associated with dissemination of the K. pneumoniae carbapenemase (KPC). Hospital outbreaks of KPC CG258 infections have been observed globally and are very difficult to treat. As a consequence, there is renewed interest in alternative infection control measures such as vaccines and phage or depolymerase treatments targeting the K. pneumoniae polysaccharide capsule. To date, 78 immunologically distinct capsule variants have been described in K. pneumoniae. Previous investigations of ST258 and a small number of closely related strains suggested that capsular variation was limited within this clone; only two distinct ST258 capsule polysaccharide synthesis (cps) loci have been identified, both acquired through large-scale recombination events (>50 kb). In contrast to previous studies, we report a comparative genomic analysis of the broader K. pneumoniae CG258 (n = 39). We identified 11 different cps loci within CG258, indicating that capsular switching is actually common within the complex. We observed several insertion sequences (IS) within the cps loci, and show further intraclone diversification of two cps loci through IS activity. Our data also indicate that several large-scale recombination events have shaped the genomes of CG258, and that definition of the complex should be broadened to include ST395 (also reported to harbor KPC). As only the second report of extensive intraclonal cps variation among Gram-negative bacterial species, our findings alter our understanding of the evolution of these organisms and have key implications for the design of control measures targeting K. pneumoniae capsules. PMID:25861820

  2. Extensive Capsule Locus Variation and Large-Scale Genomic Recombination within the Klebsiella pneumoniae Clonal Group 258.

    PubMed

    Wyres, Kelly L; Gorrie, Claire; Edwards, David J; Wertheim, Heiman F L; Hsu, Li Yang; Van Kinh, Nguyen; Zadoks, Ruth; Baker, Stephen; Holt, Kathryn E

    2015-05-01

    Klebsiella pneumoniae clonal group (CG) 258, comprising sequence types (STs) 258, 11, and closely related variants, is associated with dissemination of the K. pneumoniae carbapenemase (KPC). Hospital outbreaks of KPC CG258 infections have been observed globally and are very difficult to treat. As a consequence, there is renewed interest in alternative infection control measures such as vaccines and phage or depolymerase treatments targeting the K. pneumoniae polysaccharide capsule. To date, 78 immunologically distinct capsule variants have been described in K. pneumoniae. Previous investigations of ST258 and a small number of closely related strains suggested that capsular variation was limited within this clone; only two distinct ST258 capsule polysaccharide synthesis (cps) loci have been identified, both acquired through large-scale recombination events (>50 kb). In contrast to previous studies, we report a comparative genomic analysis of the broader K. pneumoniae CG258 (n = 39). We identified 11 different cps loci within CG258, indicating that capsular switching is actually common within the complex. We observed several insertion sequences (IS) within the cps loci, and show further intraclone diversification of two cps loci through IS activity. Our data also indicate that several large-scale recombination events have shaped the genomes of CG258, and that definition of the complex should be broadened to include ST395 (also reported to harbor KPC). As only the second report of extensive intraclonal cps variation among Gram-negative bacterial species, our findings alter our understanding of the evolution of these organisms and have key implications for the design of control measures targeting K. pneumoniae capsules. PMID:25861820

  3. Evolutionary pathway to increased virulence and epidemic group A Streptococcus disease derived from 3,615 genome sequences

    PubMed Central

    Nasser, Waleed; Beres, Stephen B.; Olsen, Randall J.; Dean, Melissa A.; Rice, Kelsey A.; Long, S. Wesley; Kristinsson, Karl G.; Gottfredsson, Magnus; Vuopio, Jaana; Raisanen, Kati; Caugant, Dominique A.; Steinbakk, Martin; Low, Donald E.; McGeer, Allison; Darenberg, Jessica; Henriques-Normark, Birgitta; Van Beneden, Chris A.; Hoffmann, Steen; Musser, James M.

    2014-01-01

    We sequenced the genomes of 3,615 strains of serotype Emm protein 1 (M1) group A Streptococcus to unravel the nature and timing of molecular events contributing to the emergence, dissemination, and genetic diversification of an unusually virulent clone that now causes epidemic human infections worldwide. We discovered that the contemporary epidemic clone emerged in stepwise fashion from a precursor cell that first contained the phage encoding an extracellular DNase virulence factor (streptococcal DNase D2, SdaD2) and subsequently acquired the phage encoding the SpeA1 variant of the streptococcal pyrogenic exotoxin A superantigen. The SpeA2 toxin variant evolved from SpeA1 by a single-nucleotide change in the M1 progenitor strain before acquisition by horizontal gene transfer of a large chromosomal region encoding secreted toxins NAD+-glycohydrolase and streptolysin O. Acquisition of this 36-kb region in the early 1980s into just one cell containing the phage-encoded sdaD2 and speA2 genes was the final major molecular event preceding the emergence and rapid intercontinental spread of the contemporary epidemic clone. Thus, we resolve a decades-old controversy about the type and sequence of genomic alterations that produced this explosive epidemic. Analysis of comprehensive, population-based contemporary invasive strains from seven countries identified strong patterns of temporal population structure. Compared with a preepidemic reference strain, the contemporary clone is significantly more virulent in nonhuman primate models of pharyngitis and necrotizing fasciitis. A key finding is that the molecular evolutionary events transpiring in just one bacterial cell ultimately have produced millions of human infections worldwide. PMID:24733896

  4. Group II Intron-Mediated Trans-Splicing in the Gene-Rich Mitochondrial Genome of an Enigmatic Eukaryote, Diphylleia rotans.

    PubMed

    Kamikawa, Ryoma; Shiratori, Takashi; Ishida, Ken-Ichiro; Miyashita, Hideaki; Roger, Andrew J

    2016-02-01

    Although mitochondria have evolved from a single endosymbiotic event, present day mitochondria of diverse eukaryotes display a great range of genome structures, content and features. Group I and group II introns are two features that are distributed broadly but patchily in mitochondrial genomes across branches of the tree of eukaryotes. While group I intron-mediated trans-splicing has been reported from some lineages distantly related to each other, findings of group II intron-mediated trans-splicing has been restricted to members of the Chloroplastida. In this study, we found the mitochondrial genome of the unicellular eukaryote Diphylleia rotans possesses currently the second largest gene repertoire. On the basis of a probable phylogenetic position of Diphylleia, which is located within Amorphea, current mosaic gene distribution in Amorphea must invoke parallel gene losses from mitochondrial genomes during evolution. Most notably, although the cytochrome c oxidase subunit (cox) 1 gene was split into four pieces which located at a distance to each other, we confirmed that a single mature mRNA that covered the entire coding region could be generated by group II intron-mediated trans-splicing. This is the first example of group II intron-mediated trans-splicing outside Chloroplastida. Similar trans-splicing mechanisms likely work for bipartitely split cox2 and nad3 genes to generate single mature mRNAs. We finally discuss origin and evolution of this type of trans-splicing in D. rotans as well as in eukaryotes. PMID:26833505

  5. Group II Intron-Mediated Trans-Splicing in the Gene-Rich Mitochondrial Genome of an Enigmatic Eukaryote, Diphylleia rotans

    PubMed Central

    Kamikawa, Ryoma; Shiratori, Takashi; Ishida, Ken-Ichiro; Miyashita, Hideaki; Roger, Andrew J.

    2016-01-01

    Although mitochondria have evolved from a single endosymbiotic event, present day mitochondria of diverse eukaryotes display a great range of genome structures, content and features. Group I and group II introns are two features that are distributed broadly but patchily in mitochondrial genomes across branches of the tree of eukaryotes. While group I intron-mediated trans-splicing has been reported from some lineages distantly related to each other, findings of group II intron-mediated trans-splicing has been restricted to members of the Chloroplastida. In this study, we found the mitochondrial genome of the unicellular eukaryote Diphylleia rotans possesses currently the second largest gene repertoire. On the basis of a probable phylogenetic position of Diphylleia, which is located within Amorphea, current mosaic gene distribution in Amorphea must invoke parallel gene losses from mitochondrial genomes during evolution. Most notably, although the cytochrome c oxidase subunit (cox) 1 gene was split into four pieces which located at a distance to each other, we confirmed that a single mature mRNA that covered the entire coding region could be generated by group II intron-mediated trans-splicing. This is the first example of group II intron-mediated trans-splicing outside Chloroplastida. Similar trans-splicing mechanisms likely work for bipartitely split cox2 and nad3 genes to generate single mature mRNAs. We finally discuss origin and evolution of this type of trans-splicing in D. rotans as well as in eukaryotes. PMID:26833505

  6. Challenges of flow-cytometric estimation of nuclear genome size in orchids, a plant group with both whole-genome and progressively partial endoreplication.

    PubMed

    Trávníček, Pavel; Ponert, Jan; Urfus, Tomáš; Jersáková, Jana; Vrána, Jan; Hřibová, Eva; Doležel, Jaroslav; Suda, Jan

    2015-10-01

    Nuclear genome size is an inherited quantitative trait of eukaryotic organisms with both practical and biological consequences. A detailed analysis of major families is a promising approach to fully understand the biological meaning of the extensive variation in genome size in plants. Although Orchidaceae accounts for ∼10% of the angiosperm diversity, the knowledge of patterns and dynamics of their genome size is limited, in part due to difficulties in flow cytometric analyses. Cells in various somatic tissues of orchids undergo extensive endoreplication, either whole-genome or partial, and the G1-phase nuclei with 2C DNA amounts may be lacking, resulting in overestimated genome size values. Interpretation of DNA content histograms is particularly challenging in species with progressively partial endoreplication, in which the ratios between the positions of two neighboring DNA peaks are lower than two. In order to assess distributions of nuclear DNA amounts and identify tissue suitable for reliable estimation of nuclear DNA content, we analyzed six different tissue types in 48 orchid species belonging to all recognized subfamilies. Although traditionally used leaves may provide incorrect C-values, particularly in species with progressively partial endoreplication, young ovaries and pollinaria consistently yield 2C and 1C peaks of their G1-phase nuclei, respectively, and are, therefore, the most suitable parts for genome size studies in orchids. We also provide new DNA C-values for 22 orchid genera and 42 species. Adhering to the proposed methodology would allow for reliable genome size estimates in this largest plant family. Although our research was limited to orchids, the need to find a suitable tissue with dominant 2C peak of G1-phase nuclei applies to all endopolyploid species.

  7. Challenges of flow-cytometric estimation of nuclear genome size in orchids, a plant group with both whole-genome and progressively partial endoreplication.

    PubMed

    Trávníček, Pavel; Ponert, Jan; Urfus, Tomáš; Jersáková, Jana; Vrána, Jan; Hřibová, Eva; Doležel, Jaroslav; Suda, Jan

    2015-10-01

    Nuclear genome size is an inherited quantitative trait of eukaryotic organisms with both practical and biological consequences. A detailed analysis of major families is a promising approach to fully understand the biological meaning of the extensive variation in genome size in plants. Although Orchidaceae accounts for ∼10% of the angiosperm diversity, the knowledge of patterns and dynamics of their genome size is limited, in part due to difficulties in flow cytometric analyses. Cells in various somatic tissues of orchids undergo extensive endoreplication, either whole-genome or partial, and the G1-phase nuclei with 2C DNA amounts may be lacking, resulting in overestimated genome size values. Interpretation of DNA content histograms is particularly challenging in species with progressively partial endoreplication, in which the ratios between the positions of two neighboring DNA peaks are lower than two. In order to assess distributions of nuclear DNA amounts and identify tissue suitable for reliable estimation of nuclear DNA content, we analyzed six different tissue types in 48 orchid species belonging to all recognized subfamilies. Although traditionally used leaves may provide incorrect C-values, particularly in species with progressively partial endoreplication, young ovaries and pollinaria consistently yield 2C and 1C peaks of their G1-phase nuclei, respectively, and are, therefore, the most suitable parts for genome size studies in orchids. We also provide new DNA C-values for 22 orchid genera and 42 species. Adhering to the proposed methodology would allow for reliable genome size estimates in this largest plant family. Although our research was limited to orchids, the need to find a suitable tissue with dominant 2C peak of G1-phase nuclei applies to all endopolyploid species. PMID:25929591

  8. Genome-wide evidence of Austronesian–Bantu admixture and cultural reversion in a hunter-gatherer group of Madagascar

    PubMed Central

    Pierron, Denis; Razafindrazaka, Harilanto; Pagani, Luca; Ricaut, François-Xavier; Antao, Tiago; Capredon, Mélanie; Sambo, Clément; Radimilahy, Chantal; Rakotoarisoa, Jean-Aimé; Blench, Roger M.; Letellier, Thierry; Kivisild, Toomas

    2014-01-01

    Linguistic and cultural evidence suggest that Madagascar was the final point of two major dispersals of Austronesian- and Bantu-speaking populations. Today, the Mikea are described as the last-known Malagasy population reported to be still practicing a hunter-gatherer lifestyle. It is unclear, however, whether the Mikea descend from a remnant population that existed before the arrival of Austronesian and Bantu agriculturalists or whether it is only their lifestyle that separates them from the other contemporary populations of South Madagascar. To address these questions we have performed a genome-wide analysis of >700,000 SNP markers on 21 Mikea, 24 Vezo, and 24 Temoro individuals, together with 50 individuals from Bajo and Lebbo populations from Indonesia. Our analyses of these data in the context of data available from other Southeast Asian and African populations reveal that all three Malagasy populations are derived from the same admixture event involving Austronesian and Bantu sources. In contrast to the fact that most of the vocabulary of the Malagasy speakers is derived from the Barito group of the Austronesian language family, we observe that only one-third of their genetic ancestry is related to the populations of the Java-Kalimantan-Sulawesi area. Because no additional ancestry components distinctive for the Mikea were found, it is likely that they have adopted their hunter-gatherer way of life through cultural reversion, and selection signals suggest a genetic adaptation to their new lifestyle. PMID:24395773

  9. Complete mtDNA genomes of Filipino ethnolinguistic groups: a melting pot of recent and ancient lineages in the Asia-Pacific region

    PubMed Central

    Delfin, Frederick; Min-Shan Ko, Albert; Li, Mingkun; Gunnarsdóttir, Ellen D; Tabbada, Kristina A; Salvador, Jazelyn M; Calacal, Gayvelline C; Sagum, Minerva S; Datar, Francisco A; Padilla, Sabino G; De Ungria, Maria Corazon A; Stoneking, Mark

    2014-01-01

    The Philippines is a strategic point in the Asia-Pacific region for the study of human diversity, history and origins, as it is a cross-road for human migrations and consequently exhibits enormous ethnolinguistic diversity. Following on a previous in-depth study of Y-chromosome variation, here we provide new insights into the maternal genetic history of Filipino ethnolinguistic groups by surveying complete mitochondrial DNA (mtDNA) genomes from a total of 14 groups (11 groups in this study and 3 groups previously published) including previously published mtDNA hypervariable segment (HVS) data from Filipino regional center groups. Comparison of HVS data indicate genetic differences between ethnolinguistic and regional center groups. The complete mtDNA genomes of 14 ethnolinguistic groups reveal genetic aspects consistent with the Y-chromosome, namely: diversity and heterogeneity of groups, no support for a simple dichotomy between Negrito and non-Negrito groups, and different genetic affinities with Asia-Pacific groups that are both ancient and recent. Although some mtDNA haplogroups can be associated with the Austronesian expansion, there are others that associate with South Asia, Near Oceania and Australia that are consistent with a southern migration route for ethnolinguistic group ancestors into the Asia-Pacific, with a timeline that overlaps with the initial colonization of the Asia-Pacific region, the initial colonization of the Philippines and a possible separate post-colonization migration into the Philippine archipelago. PMID:23756438

  10. Whole-Genome Sequences of Four Strains Closely Related to Members of the Mycobacterium chelonae Group, Isolated from Biofilms in a Drinking Water Distribution System Simulator

    EPA Science Inventory

    We report the draft genome sequences of four Mycobacterium chelonae group strains from biofilms obtained after a ‘chlorine burn’ in a chloraminated drinking water distribution system simulator. These opportunistic pathogens have been detected in drinking and hospital water distr...

  11. Draft Genome Sequence of Dietzia maris DSM 43672, a Gram-Positive Bacterium of the Mycolata Group

    PubMed Central

    Ganguly, Sonalli; Jimenez-Galisteo, Guadalupe; Pletzer, Daniel; Winterhalter, Mathias; Benz, Roland

    2016-01-01

    Here, we report the draft genome sequence of Dietzia maris, known previously as Rhodococcus maris. It is 3,505,372 bp in size with a G+C content of 73%. The draft genome sequence will improve our understanding of Dietzia maris related to other mycolata species and constitutes a basic tool for exploring the cell wall proteins. PMID:27284155

  12. Draft Genome Sequence of Dietzia maris DSM 43672, a Gram-Positive Bacterium of the Mycolata Group.

    PubMed

    Ganguly, Sonalli; Jimenez-Galisteo, Guadalupe; Pletzer, Daniel; Winterhalter, Mathias; Benz, Roland; Viñas, Miguel

    2016-01-01

    Here, we report the draft genome sequence of Dietzia maris, known previously as Rhodococcus maris It is 3,505,372 bp in size with a G+C content of 73%. The draft genome sequence will improve our understanding of Dietzia maris related to other mycolata species and constitutes a basic tool for exploring the cell wall proteins. PMID:27284155

  13. Draft Genome Sequence of Dietzia maris DSM 43672, a Gram-Positive Bacterium of the Mycolata Group.

    PubMed

    Ganguly, Sonalli; Jimenez-Galisteo, Guadalupe; Pletzer, Daniel; Winterhalter, Mathias; Benz, Roland; Viñas, Miguel

    2016-06-09

    Here, we report the draft genome sequence of Dietzia maris, known previously as Rhodococcus maris It is 3,505,372 bp in size with a G+C content of 73%. The draft genome sequence will improve our understanding of Dietzia maris related to other mycolata species and constitutes a basic tool for exploring the cell wall proteins.

  14. The Genomics Education Partnership: Successful Integration of Research into Laboratory Classes at a Diverse Group of Undergraduate Institutions

    PubMed Central

    Shaffer, Christopher D.; Alvarez, Consuelo; Bailey, Cheryl; Barnard, Daron; Bhalla, Satish; Chandrasekaran, Chitra; Chandrasekaran, Vidya; Chung, Hui-Min; Dorer, Douglas R.; Du, Chunguang; Eckdahl, Todd T.; Poet, Jeff L.; Frohlich, Donald; Goodman, Anya L.; Gosser, Yuying; Hauser, Charles; Hoopes, Laura L.M.; Johnson, Diana; Jones, Christopher J.; Kaehler, Marian; Kokan, Nighat; Kopp, Olga R.; Kuleck, Gary A.; McNeil, Gerard; Moss, Robert; Myka, Jennifer L.; Nagengast, Alexis; Morris, Robert; Overvoorde, Paul J.; Shoop, Elizabeth; Parrish, Susan; Reed, Kelynne; Regisford, E. Gloria; Revie, Dennis; Rosenwald, Anne G.; Saville, Ken; Schroeder, Stephanie; Shaw, Mary; Skuse, Gary; Smith, Christopher; Smith, Mary; Spana, Eric P.; Spratt, Mary; Stamm, Joyce; Thompson, Jeff S.; Wawersik, Matthew; Wilson, Barbara A.; Youngblom, Jim; Leung, Wilson; Buhler, Jeremy; Mardis, Elaine R.; Lopatto, David

    2010-01-01

    Genomics is not only essential for students to understand biology but also provides unprecedented opportunities for undergraduate research. The goal of the Genomics Education Partnership (GEP), a collaboration between a growing number of colleges and universities around the country and the Department of Biology and Genome Center of Washington University in St. Louis, is to provide such research opportunities. Using a versatile curriculum that has been adapted to many different class settings, GEP undergraduates undertake projects to bring draft-quality genomic sequence up to high quality and/or participate in the annotation of these sequences. GEP undergraduates have improved more than 2 million bases of draft genomic sequence from several species of Drosophila and have produced hundreds of gene models using evidence-based manual annotation. Students appreciate their ability to make a contribution to ongoing research, and report increased independence and a more active learning approach after participation in GEP projects. They show knowledge gains on pre- and postcourse quizzes about genes and genomes and in bioinformatic analysis. Participating faculty also report professional gains, increased access to genomics-related technology, and an overall positive experience. We have found that using a genomics research project as the core of a laboratory course is rewarding for both faculty and students. PMID:20194808

  15. Complete genome sequence of the bacteriochlorophyll a-containing Roseibacterium elongatum type strain (DSM 19469(T)), a representative of the Roseobacter group isolated from Australian coast sand.

    PubMed

    Riedel, Thomas; Fiebig, Anne; Göker, Markus; Klenk, Hans-Peter

    2014-06-15

    Roseibacterium elongatum Suzuki et al. 2006 is a pink-pigmented and bacteriochlorophyll a-producing representative of the Roseobacter group within the alphaproteobacterial family Rhodobacteraceae. Representatives of the marine 'Roseobacter group' were found to be abundant in the ocean and play an important role in global and biogeochemical processes. In the present study we describe the features of R. elongatum strain OCh 323(T) together with its genome sequence and annotation. The 3,555,102 bp long genome consists of one circular chromosome with no extrachromosomal elements and is one of the smallest known Roseobacter genomes. It contains 3,540 protein-coding genes and 59 RNA genes. Genome analysis revealed the presence of a photosynthetic gene cluster, which putatively enables a photoheterotrophic lifestyle. Gene sequences associated with quorum sensing, motility, surface attachment, and thiosulfate and carbon monoxide oxidation could be detected. The genome was sequenced as part of the activities of the Transregional Collaborative Research Centre 51 (TRR51) funded by the German Research Foundation (DFG). PMID:25197467

  16. Three groups of transposable elements with contrasting copy number dynamics and host responses in the maize (Zea mays ssp. mays) genome.

    PubMed

    Diez, Concepcion M; Meca, Esteban; Tenaillon, Maud I; Gaut, Brandon S

    2014-04-01

    Most angiosperm nuclear DNA is repetitive and derived from silenced transposable elements (TEs). TE silencing requires substantial resources from the plant host, including the production of small interfering RNAs (siRNAs). Thus, the interaction between TEs and siRNAs is a critical aspect of both the function and the evolution of plant genomes. Yet the co-evolutionary dynamics between these two entities remain poorly characterized. Here we studied the organization of TEs within the maize (Zea mays ssp mays) genome, documenting that TEs fall within three groups based on the class and copy numbers. These groups included DNA elements, low copy RNA elements and higher copy RNA elements. The three groups varied statistically in characteristics that included length, location, age, siRNA expression and 24:22 nucleotide (nt) siRNA targeting ratios. In addition, the low copy retroelements encompassed a set of TEs that had previously been shown to decrease expression within a 24 nt siRNA biogenesis mutant (mop1). To investigate the evolutionary dynamics of the three groups, we estimated their abundance in two landraces, one with a genome similar in size to that of the maize reference and the other with a 30% larger genome. For all three accessions, we assessed TE abundance as well as 22 nt and 24 nt siRNA content within leaves. The high copy number retroelements are under targeted similarly by siRNAs among accessions, appear to be born of a rapid bust of activity, and may be currently transpositionally dead or limited. In contrast, the lower copy number group of retrolements are targeted more dynamically and have had a long and ongoing history of transposition in the maize genome. PMID:24743518

  17. Complete genome sequence analysis and identification of putative metallo-beta-lactamase and SpoIIIE homologs in Bacillus cereus group phage BCP8-2, a new member of the proposed Bastille-like group.

    PubMed

    Asare, Paul Tetteh; Bandara, Nadeeka; Jeong, Tae-Yong; Ryu, Sangryeol; Klumpp, Jochen; Kim, Kwang-Pyo

    2015-10-01

    Bacillus cereus group-specific bacteriophage BCP8-2 exhibits a broad lysis spectrum among food and human isolates (330/364) of B. cereus while not infecting B. subtilis (50) or B. licheniformis (12) strains. Its genome is 159,071 bp long with 220 open reading frames, including genes for putative methyltransferases, metallo-beta-lactamase, and a sporulation-related SpoIIIE homolog, as wells as 18 tRNAs. Comparative genome analysis showed that BCP8-2 is related to the recently proposed Bastille-like phages, but not with either SPO1-like or Twort-like phages of the subfamily Spounavirinae.

  18. A diverse group of small circular ssDNA viral genomes in human and non-human primate stools

    PubMed Central

    Ng, Terry Fei Fan; Zhang, Wen; Sachsenröder, Jana; Kondov, Nikola O.; da Costa, Antonio Charlys; Vega, Everardo; Holtz, Lori R.; Wu, Guang; Wang, David; Stine, Colin O.; Antonio, Martin; Mulvaney, Usha S.; Muench, Marcus O.; Deng, Xutao; Ambert-Balay, Katia; Pothier, Pierre; Vinjé, Jan; Delwart, Eric

    2015-01-01

    Viral metagenomics sequencing of fecal samples from outbreaks of acute gastroenteritis from the US revealed the presence of small circular ssDNA viral genomes encoding a replication initiator protein (Rep). Viral genomes were ∼2.5 kb in length, with bi-directionally oriented Rep and capsid (Cap) encoding genes and a stem loop structure downstream of Rep. Several genomes showed evidence of recombination. By digital screening of an in-house virome database (1.04 billion reads) using BLAST, we identified closely related sequences from cases of unexplained diarrhea in France. Deep sequencing and PCR detected such genomes in 7 of 25 US (28 percent) and 14 of 21 French outbreaks (67 percent). One of eighty-five sporadic diarrhea cases in the Gambia was positive by PCR. Twenty-two complete genomes were characterized showing that viruses from patients in the same outbreaks were closely related suggesting common origins. Similar genomes were also characterized from the stools of captive chimpanzees, a gorilla, a black howler monkey, and a lemur that were more diverse than the human stool-associated genomes. The name smacovirus is proposed for this monophyletic viral clade. Possible tropism include mammalian enteric cells or ingested food components such as infected plants. No evidence of viral amplification was found in immunodeficient mice orally inoculated with smacovirus-positive stool supernatants. A role for smacoviruses in diarrhea, if any, remains to be demonstrated. PMID:27774288

  19. Genomic Testing

    MedlinePlus

    ... Working Group Independent Web site Informing the effective integration of genomics into health practice—Lynch syndrome ACCE Model for Evaluating Genetic Tests Recommendations by the EGAPP Working Group Top of ... ...

  20. Insights into the history of a bacterial group II intron remnant from the genomes of the nitrogen-fixing symbionts Sinorhizobium meliloti and Sinorhizobium medicae.

    PubMed

    Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F

    2014-10-01

    Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3' terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species.

  1. Taxonomic Identity Resolution of Highly Phylogenetically Related Strains and Selection of Phylogenetic Markers by Using Genome-Scale Methods: The Bacillus pumilus Group Case

    PubMed Central

    Espariz, Martín; Zuljan, Federico A.; Esteban, Luis; Magni, Christian

    2016-01-01

    Bacillus pumilus group strains have been studied due their agronomic, biotechnological or pharmaceutical potential. Classifying strains of this taxonomic group at species level is a challenging procedure since it is composed of seven species that share among them over 99.5% of 16S rRNA gene identity. In this study, first, a whole-genome in silico approach was used to accurately demarcate B. pumilus group strains, as a case of highly phylogenetically related taxa, at the species level. In order to achieve that and consequently to validate or correct taxonomic identities of genomes in public databases, an average nucleotide identity correlation, a core-based phylogenomic and a gene function repertory analyses were performed. Eventually, more than 50% such genomes were found to be misclassified. Hierarchical clustering of gene functional repertoires was also used to infer ecotypes among B. pumilus group species. Furthermore, for the first time the machine-learning algorithm Random Forest was used to rank genes in order of their importance for species classification. We found that ybbP, a gene involved in the synthesis of cyclic di-AMP, was the most important gene for accurately predicting species identity among B. pumilus group strains. Finally, principal component analysis was used to classify strains based on the distances between their ybbP genes. The methodologies described could be utilized more broadly to identify other highly phylogenetically related species in metagenomic or epidemiological assessments. PMID:27658251

  2. Mitotic index, microvascular proliferation, and necrosis define 3 groups of 1p/19q codeleted anaplastic oligodendrogliomas associated with different genomic alterations

    PubMed Central

    Figarella-Branger, Dominique; Mokhtari, Karima; Dehais, Caroline; Jouvet, Anne; Uro-Coste, Emmanuelle; Colin, Carole; Carpentier, Catherine; Forest, Fabien; Maurage, Claude-Alain; Vignaud, Jean-Michel; Polivka, Marc; Lechapt-Zalcman, Emmanuelle; Eimer, Sandrine; Viennet, Gabriel; Quintin-Roué, Isabelle; Aubriot-Lorton, Marie-Hélène; Diebold, Marie-Danièle; Loussouarn, Delphine; Lacroix, Catherine; Rigau, Valérie; Laquerrière, Annie; Vandenbos, Fanny; Michalak, Sophie; Sevestre, Henri; Peoch, Michel; Labrousse, François; Christov, Christo; Kemeny, Jean-Louis; Chenard, Marie-Pierre; Chiforeanu, Danchristian; Ducray, François; Idbaih, Ahmed; Desenclos, Christine; Menei, Philippe; Al Nader, Edmond; Godard, Joel; Servagi-Vernat, Stéphanie; Carpentier, Antoine; Loiseau, Hugues; Dam-Hieu, Phong; Guillamo, Jean Sebastien; Emery, Evelyne; Verelle, Pierre; Durando, Xavier; Faillot, Thierry; Le Guerinel, Caroline; Ghiringhelli, François; Parker, Fabrice; Adam, Clovis; Dubois, François; Ramirez, Carole; Gueye, Edouard Marcel; Honnorat, Jerome; Chinot, Olivier; Bauchet, Luc; Beauchesne, Patrick; Campone, Mario; Frenel, Jean Sébastien; Fontaine, Denys; Campello, Chantal; Roger, Pascal; Heitzmann, Anne; Fesneau, Mélanie; Delattre, Jean Yves; Elouadhani-Hamdi, Selma; Ricard, Damien; Colin, Philippe; Vauléon, Elodie; Langlois, Olivier; Fotso, Marie Janette Motsuo; Andraud, Marie; Mouton, Servane; Noel, Georges; Desse, Nicolas; Soulard, Raoulin; Cohen-Moyal, Elisabeth; Lubrano, Vincent; Dhermain, Frederic

    2014-01-01

    Background The aim of this study was to correlate histological features and molecular characteristics in anaplastic oligodendrogliomas (AOs). Methods The histological characteristics of 203 AO patients, enrolled in the French national network POLA, were analyzed. The genomic profiles of 191 cases were studied using genomic arrays. IDH mutational status was assessed by immunohistochemistry and direct sequencing. Results 1p/19q codeletion was present in 79% of cases and was associated with alpha-internexin expression (P < 10−4), IDH1/2 mutation (P < 10−4), chromosome 4 loss (P < 10−3), and better overall survival (P < 10−4). Based on mitotic index, microvascular proliferation (MVP), and necrosis, 3 groups of 1p/19q codeleted AOs were identified: (group 1) AO with more than 5 mitoses per 10-HPF, no MVP, and no necrosis; (group 2) AO with MVP and no necrosis; and (group 3) AO with MVP and necrosis. Compared with group 1, groups 2 and 3 AOs had a higher mean Ki-67 proliferation index and a higher rate of 9p and 9q losses. Compared with group 2, group 3 AOs had a higher number of chromosomal alterations including chromosome 4 loss. In the subgroup of 157 1p/19q codeleted AOs, chromosomal instability was associated with shorter progression-free survival (P = .024) and shorter overall survival (P = .023). Conclusions The present study shows that oligodendrogliomas with classic histological features remain a molecularly heterogeneous entity and should be stratified according to 1p/19q status because of its major prognostic relevance. Moreover, 1p/19q codeleted AOs are also heterogeneous. Interestingly, mitotic index, MVP, and necrosis help to classify them into 3 groups associated with distinct genomic alterations. PMID:24723566

  3. Complete genome sequence of the bacteriochlorophyll a-containing Roseibacterium elongatum type strain (DSM 19469T), a representative of the Roseobacter group isolated from Australian coast sand

    PubMed Central

    Riedel, Thomas; Fiebig, Anne; Göker, Markus; Klenk, Hans-Peter

    2014-01-01

    Roseibacterium elongatum Suzuki et al. 2006 is a pink-pigmented and bacteriochlorophyll a-producing representative of the Roseobacter group within the alphaproteobacterial family Rhodobacteraceae. Representatives of the marine ‘Roseobacter group’ were found to be abundant in the ocean and play an important role in global and biogeochemical processes. In the present study we describe the features of R. elongatum strain OCh 323T together with its genome sequence and annotation. The 3,555,102 bp long genome consists of one circular chromosome with no extrachromosomal elements and is one of the smallest known Roseobacter genomes. It contains 3,540 protein-coding genes and 59 RNA genes. Genome analysis revealed the presence of a photosynthetic gene cluster, which putatively enables a photoheterotrophic lifestyle. Gene sequences associated with quorum sensing, motility, surface attachment, and thiosulfate and carbon monoxide oxidation could be detected. The genome was sequenced as part of the activities of the Transregional Collaborative Research Centre 51 (TRR51) funded by the German Research Foundation (DFG). PMID:25197467

  4. Genome sequence of the Wenxinia marina type strain (DSM 24838T), a representative of the Roseobacter group isolated from oilfield sediments

    PubMed Central

    Riedel, Thomas; Fiebig, Anne; Han, James; Huntemann, Marcel; Spring, Stefan; Petersen, Jörn; Ivanova, Natalia N.; Markowitz, Victor; Woyke, Tanja; Göker, Markus; Kyrpides, Nikos C.; Klenk, Hans-Peter

    2014-01-01

    Wenxinia marina Ying et al. 2007 is the type species of the genus Wenxinia, a representative of the Roseobacter group within the alphaproteobacterial family Rhodobacteraceae, isolated from oilfield sediments of the South China Sea. This family was shown to harbor the most abundant bacteria especially from coastal and polar waters, but was also found in microbial mats, sediments and attached to different kind of surfaces. Here we describe the features of W. marina strain HY34T together with the genome sequence and annotation of strain DSM 24838T and novel aspects of its phenotype. The 4,181,754 bp containing genome sequence encodes 4,047 protein-coding genes and 59 RNA genes. The genome of W. marina DSM 24838T was sequenced as part of the activities of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project funded by the DoE and the Transregional Collaborative Research Centre 51 (TRR51) funded by the German Research Foundation (DFG). PMID:25197468

  5. Genome sequence of the Wenxinia marina type strain (DSM 24838(T)), a representative of the Roseobacter group isolated from oilfield sediments.

    PubMed

    Riedel, Thomas; Fiebig, Anne; Han, James; Huntemann, Marcel; Spring, Stefan; Petersen, Jörn; Ivanova, Natalia N; Markowitz, Victor; Woyke, Tanja; Göker, Markus; Kyrpides, Nikos C; Klenk, Hans-Peter

    2014-06-15

    Wenxinia marina Ying et al. 2007 is the type species of the genus Wenxinia, a representative of the Roseobacter group within the alphaproteobacterial family Rhodobacteraceae, isolated from oilfield sediments of the South China Sea. This family was shown to harbor the most abundant bacteria especially from coastal and polar waters, but was also found in microbial mats, sediments and attached to different kind of surfaces. Here we describe the features of W. marina strain HY34(T) together with the genome sequence and annotation of strain DSM 24838(T) and novel aspects of its phenotype. The 4,181,754 bp containing genome sequence encodes 4,047 protein-coding genes and 59 RNA genes. The genome of W. marina DSM 24838(T) was sequenced as part of the activities of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project funded by the DoE and the Transregional Collaborative Research Centre 51 (TRR51) funded by the German Research Foundation (DFG).

  6. Evaluating direct-to-consumer marketing of race-based pharmacogenomics: a focus group study of public understandings of applied genomic medication.

    PubMed

    Bates, Benjamin R; Poirot, Kristan; Harris, Tina M; Condit, Celeste M; Achter, Paul J

    2004-01-01

    Some medical providers have advocated applied genomics, including the use of genetically linked racial phenotypes in medical practice, raising fear that race-based medication will become justified. As with other emerging medical genetic technologies, pharmaceutical companies may advertise these treatments. Researchers fear that consumers will uncritically accept pharmaceutical messages and demand the product. In this exploratory study, we examined public reactions to advertisements for applied genomic medications. A focus group methodology was employed. Participants tended to resist the message and generated warrants for doing so, indicating critical reception of the messages. Message accepters also provided warrants. Warrants for resistance and acceptance differ between self-identified racial groups. Consumers, health care providers, and pharmaceutical corporations will benefit from a better understanding of direct-to-consumer advertisements as medical communication. Our study concludes that both advocates and opponents of direct-to-consumer advertisements should recognize that potential consumers of pharmacogenomics act as critical consumers of health advertising discourse.

  7. Enrichment and Genome Sequence of the Group I.1a Ammonia-Oxidizing Archaeon “Ca. Nitrosotenuis uzonensis” Representing a Clade Globally Distributed in Thermal Habitats

    PubMed Central

    Pelletier, Eric; Schuster, Nathalie; Hauzmayer, Sandra; Bulaev, Aleksandr; Grigor’eva, Nadezhda V.; Galushko, Alexander; Schmid, Markus; Palatinszky, Marton; Le Paslier, Denis; Daims, Holger; Wagner, Michael

    2013-01-01

    The discovery of ammonia-oxidizing archaea (AOA) of the phylum Thaumarchaeota and the high abundance of archaeal ammonia monooxygenase subunit A encoding gene sequences in many environments have extended our perception of nitrifying microbial communities. Moreover, AOA are the only aerobic ammonia oxidizers known to be active in geothermal environments. Molecular data indicate that in many globally distributed terrestrial high-temperature habits a thaumarchaeotal lineage within the Nitrosopumilus cluster (also called “marine” group I.1a) thrives, but these microbes have neither been isolated from these systems nor functionally characterized in situ yet. In this study, we report on the enrichment and genomic characterization of a representative of this lineage from a thermal spring in Kamchatka. This thaumarchaeote, provisionally classified as “Candidatus Nitrosotenuis uzonensis”, is a moderately thermophilic, non-halophilic, chemolithoautotrophic ammonia oxidizer. The nearly complete genome sequence (assembled into a single scaffold) of this AOA confirmed the presence of the typical thaumarchaeotal pathways for ammonia oxidation and carbon fixation, and indicated its ability to produce coenzyme F420 and to chemotactically react to its environment. Interestingly, like members of the genus Nitrosoarchaeum, “Candidatus N. uzonensis” also possesses a putative artubulin-encoding gene. Genome comparisons to related AOA with available genome sequences confirmed that the newly cultured AOA has an average nucleotide identity far below the species threshold and revealed a substantial degree of genomic plasticity with unique genomic regions in “Ca. N. uzonensis”, which potentially include genetic determinants of ecological niche differentiation. PMID:24278328

  8. Evolutionary Dynamics of the Mitochondrial Genome in the Evaniomorpha (Hymenoptera)—A Group with an Intermediate Rate of Gene Rearrangement

    PubMed Central

    Mao, Meng; Gibson, Tracey; Dowton, Mark

    2014-01-01

    We determined the complete mitochondrial (mt) genomes of three evaniomorph species, Ceraphron sp. (Ceraphronoidea), Gasteruption sp. (Evanioidea), and Orthogonalys pulchella (Trigonalyoidea) as well as the nearly complete mt genome from another evaniomorph species, Megalyra sp. (Megalyroidea). Each of them possesses dramatic gene rearrangements, including protein-coding or rRNA genes. Gene inversions were identified in all of these mt genomes; for example, the two rRNA genes have inverted and moved into the nad2-cox1 junction in the Megalyra sp. mt genome. In addition, we found two copies of a 10-bp complementary repeat at the beginning of rrnS and at the end of trnL2 in the Gasteruption sp. mt genome, consistent with recombination as the possible mechanism for gene inversion and long-range movement. Although each of the genomes contains a number of repeats of varying size, there was no consistent association of the size or number of repeats with the extent or type of gene rearrangement. The breakpoint distance analysis showed the Evaniomorpha has an intermediate rate of gene rearrangement. Sequence-based phylogenetic analyses of 13 protein-coding and 2 rRNA genes in 22 hymenopteran taxa recovered a paraphyletic Evaniomorpha with the Aculeata nested within it. Within the Evaniomorpha, our analyses confirmed the Trigonalyoidea + Megalyroidea as the sister group to the Aculeata and recovered a novel clade, Ceraphronoidea + Evanioidea. In contrast to previous hymenopteran phylogenetic studies, the internal relationships of the Evaniomorpha were highly supported and robust to the variation of alignment approach and phylogenetic inference approach. PMID:25115010

  9. Genome-Wide Differentiation of Various Melon Horticultural Groups for Use in GWAS for Fruit Firmness and Construction of a High Resolution Genetic Map

    PubMed Central

    Nimmakayala, Padma; Tomason, Yan R.; Abburi, Venkata L.; Alvarado, Alejandra; Saminathan, Thangasamy; Vajja, Venkata G.; Salazar, Germania; Panicker, Girish K.; Levi, Amnon; Wechter, William P.; McCreight, James D.; Korol, Abraham B.; Ronin, Yefim; Garcia-Mas, Jordi; Reddy, Umesh K.

    2016-01-01

    Melon (Cucumis melo L.) is a phenotypically diverse eudicot diploid (2n = 2x = 24) has climacteric and non-climacteric morphotypes and show wide variation for fruit firmness, an important trait for transportation and shelf life. We generated 13,789 SNP markers using genotyping-by-sequencing (GBS) and anchored them to chromosomes to understand genome-wide fixation indices (Fst) between various melon morphotypes and genomewide linkage disequilibrium (LD) decay. The FST between accessions of cantalupensis and inodorus was 0.23. The FST between cantalupensis and various agrestis accessions was in a range of 0.19–0.53 and between inodorus and agrestis accessions was in a range of 0.21–0.59 indicating sporadic to wide ranging introgression. The EM (Expectation Maximization) algorithm was used for estimation of 1436 haplotypes. Average genome-wide LD decay for the melon genome was noted to be 9.27 Kb. In the current research, we focused on the genome-wide divergence underlying diverse melon horticultural groups. A high-resolution genetic map with 7153 loci was constructed. Genome-wide segregation distortion and recombination rate across various chromosomes were characterized. Melon has climacteric and non-climacteric morphotypes and wide variation for fruit firmness, a very important trait for transportation and shelf life. Various levels of QTLs were identified with high to moderate stringency and linked to fruit firmness using both genome-wide association study (GWAS) and biparental mapping. Gene annotation revealed some of the SNPs are located in β-D-xylosidase, glyoxysomal malate synthase, chloroplastic anthranilate phosphoribosyltransferase, and histidine kinase, the genes that were previously characterized for fruit ripening and softening in other crops. PMID:27713759

  10. The Complete Genome Phylogeny of Geographically Distinct Dengue Virus Serotype 2 Isolates (1944-2013) Supports Further Groupings within the Cosmopolitan Genotype

    PubMed Central

    Ali, Akhtar; Ali, Ijaz

    2015-01-01

    Dengue virus serotype 2 (DENV-2) isolates have been implicated in deadly outbreaks of dengue fever (DF) and dengue hemorrhagic fever (DHF) in several regions of the world. Phylogenetic analysis of DENV-2 isolates collected from particular countries has been performed using partial or individual genes but only a few studies have examined complete whole-genome sequences collected worldwide. Herein, 50 complete genome sequences of DENV-2 isolates, reported over the past 70 years from 19 different countries, were downloaded from GenBank. Phylogenetic analysis was conducted and evolutionary distances of the 50 DENV-2 isolates were determined using maximum likelihood (ML) trees or Bayesian phylogenetic analysis created from complete genome nucleotide (nt) and amino acid (aa) sequences or individual gene sequences. The results showed that all DENV-2 isolates fell into seven main groups containing five previously defined genotypes. A Cosmopolitan genotype showed further division into three groups (C-I, C-II, and C-III) with the C-I group containing two subgroups (C-IA and C-IB). Comparison of the aa sequences showed specific mutations among the various groups of DENV-2 isolates. A maximum number of aa mutations was observed in the NS5 gene, followed by the NS2A, NS3 and NS1 genes, while the smallest number of aa substitutions was recorded in the capsid gene, followed by the PrM/M, NS4A, and NS4B genes. Maximum evolutionary distances were found in the NS2A gene, followed by the NS4A and NS4B genes. Based on these results, we propose that genotyping of DENV-2 isolates in future studies should be performed on entire genome sequences in order to gain a complete understanding of the evolution of various isolates reported from different geographical locations around the world. PMID:26414178

  11. First Genome Sequence of Wild Onion Symptomless Virus, a Novel Member of Potyvirus in the Turnip Mosaic Virus Phylogenetic Group

    PubMed Central

    Korkmaz, Savas; Mitoma, Shinichiro; Nomiyama, Rei; Honda, Yuki

    2016-01-01

    The nearly complete genome sequence of a new species of potyvirus was obtained from the symptomless wild onion (Allium sp.) in Turkey. This virus has less than 67% nucleotide sequence identities over the polyprotein to other known potyviruses. We propose the name wild onion symptomless virus for this novel potyvirus. PMID:27540073

  12. First Genome Sequence of Wild Onion Symptomless Virus, a Novel Member of Potyvirus in the Turnip Mosaic Virus Phylogenetic Group.

    PubMed

    Ohshima, Kazusato; Korkmaz, Savas; Mitoma, Shinichiro; Nomiyama, Rei; Honda, Yuki

    2016-01-01

    The nearly complete genome sequence of a new species of potyvirus was obtained from the symptomless wild onion (Allium sp.) in Turkey. This virus has less than 67% nucleotide sequence identities over the polyprotein to other known potyviruses. We propose the name wild onion symptomless virus for this novel potyvirus. PMID:27540073

  13. Population Structure and Comparative Genome Hybridization of European Flor Yeast Reveal a Unique Group of Saccharomyces cerevisiae Strains with Few Gene Duplications in Their Genome

    PubMed Central

    Legras, Jean-Luc; Erny, Claude; Charpentier, Claudine

    2014-01-01

    Wine biological aging is a wine making process used to produce specific beverages in several countries in Europe, including Spain, Italy, France, and Hungary. This process involves the formation of a velum at the surface of the wine. Here, we present the first large scale comparison of all European flor strains involved in this process. We inferred the population structure of these European flor strains from their microsatellite genotype diversity and analyzed their ploidy. We show that almost all of these flor strains belong to the same cluster and are diploid, except for a few Spanish strains. Comparison of the array hybridization profile of six flor strains originating from these four countries, with that of three wine strains did not reveal any large segmental amplification. Nonetheless, some genes, including YKL221W/MCH2 and YKL222C, were amplified in the genome of four out of six flor strains. Finally, we correlated ICR1 ncRNA and FLO11 polymorphisms with flor yeast population structure, and associate the presence of wild type ICR1 and a long Flo11p with thin velum formation in a cluster of Jura strains. These results provide new insight into the diversity of flor yeast and show that combinations of different adaptive changes can lead to an increase of hydrophobicity and affect velum formation. PMID:25272156

  14. [Genomic fingerprints of organisms from different taxonomic groups: the use of phage M13 DNA as a hybridization probe].

    PubMed

    Ryskov, A P; Dzhincharadze, A G; Prosnik, M I; Ivanov, P L; Limborskaia, S A

    1988-02-01

    Hypervariable polymorphic patterns were detected using wild-type M13 DNA as a probe in genomic DNAs of very different organisms ranging from procaryotes and lower eucaryotes to upper plants and animals, including human beings. Due to somatic stability of highly polymorphic patterns and their discrete inheritance, individual-specific restriction pattern analysis ("DNA fingerprinting") with this test probe was found to be useful in applied human genetics, in particular, for identifying paternity and maternity, and mapping of human genomes. The data obtained also demonstrate some possibilities of the DNA fingerprinting technology in genetics and selection of agricultural plants and animals, such as variety analysis, classification and registration of individual inbred lines and strains, as well as identification of bacterial strains.

  15. Metagenomic identification of novel enteric viruses in urban wild rats and genome characterization of a group A rotavirus.

    PubMed

    Sachsenröder, Jana; Braun, Anne; Machnowska, Patrycja; Ng, Terry Fei Fan; Deng, Xutao; Guenther, Sebastian; Bernstein, Samuel; Ulrich, Rainer G; Delwart, Eric; Johne, Reimar

    2014-12-01

    Rats are known as reservoirs and vectors for several zoonotic pathogens. However, information on the viruses shed by urban wild rats that could pose a zoonotic risk to human health is scare. Here, intestinal contents from 20 wild Norway rats (Rattus norvegicus) collected in the city of Berlin, Germany, were subjected to metagenomic analysis of viral nucleic acids. The determined faecal viromes of rats consisted of a variety of known and unknown viruses, and were highly variable among the individuals. Members of the families Parvoviridae and Picobirnaviridae represented the most abundant species. Novel picornaviruses, bocaviruses, sapoviruses and stool-associated circular ssDNA viruses were identified, which showed only low sequence identity to known representatives of the corresponding taxa. In addition, noroviruses and rotaviruses were detected as potential zoonotic gastroenteritis viruses. However, partial-genome sequence analyses indicated that the norovirus was closely related to the recently identified rat norovirus and the rotavirus B was closely related to the rat rotavirus strain IDIR; both viruses clustered separately from respective human virus strains in phylogenetic trees. In contrast, the rotavirus A sequences showed high identity to human and animal strains. Analysis of the nearly complete genome of this virus revealed the known genotypes G3, P[3] and N2 for three of the genome segments, whereas the remaining eight genome segments represented the novel genotypes I20-R11-C11-M10-A22-T14-E18-H13. Our results indicated a high heterogeneity of enteric viruses present in urban wild rats; their ability to be transmitted to humans remains to be assessed in the future. PMID:25121550

  16. A new group in the Leptospirillum clade: cultivation-independent community genomics, proteomics and transcriptomics of the new species Leptospirillum group IV UBA BS.

    SciTech Connect

    Goltsman, Daniela; Dasari, Mauna; Thomas, BC; Shah, Manesh B; Verberkmoes, Nathan C; Hettich, Robert {Bob} L; Banfield, Jillian F.

    2013-01-01

    Leptospirillum spp. are widespread members of acidophilic microbial communities that catalyze ferrous iron oxidation, thereby increasing sulfide mineral dissolution rates. These bacteria play important roles in environmental acidification and are harnessed for bioleaching-based metal recovery. Known members of the Leptospirillum clade of the Nitrospira phylum are Leptospirillum ferrooxidans (group I), Leptospirillum ferriphilum and Leptospirillum rubarum (group II), and Leptospirillum ferrodiazotrophum (group III). In the Richmond Mine acid mine drainage (AMD) system, biofilm formation is initiated by L. rubarum; L. ferrodiazotrophum appears in later developmental stages. Here we used community metagenomic data from unusual, thick floating biofilms to identify distinguishing metabolic traits in a rare and uncultivated community member, the new species Leptospirillum group IV UBA BS. These biofilms typically also contain a variety of Archaea, Actinobacteria, and a few other Leptospirillum spp. The Leptospirillum group IV UBA BS species shares 98% 16S rRNA sequence identity and 70% average amino acid identity between orthologs with its closest relative, L. ferrodiazotrophum. The presence of nitrogen fixation and reverse tricarboxylic acid (TCA) cycle proteins suggest an autotrophic metabolism similar to that of L. ferrodiazotrophum, while hydrogenase proteins suggest anaerobic metabolism. Community transcriptomic and proteomic analyses demonstrate expression of a multicopper oxidase unique to this species, as well as hydrogenases and core metabolic genes. Results suggest that the Leptospirillum group IV UBA BS species might play important roles in carbon fixation, nitrogen fixation, hydrogen metabolism, and iron oxidation in some acidic environments.

  17. New Group in the Leptospirillum Clade: Cultivation-Independent Community Genomics, Proteomics, and Transcriptomics of the New Species “Leptospirillum Group IV UBA BS”

    PubMed Central

    Dasari, Mauna; Thomas, Brian C.; Shah, Manesh B.; VerBerkmoes, Nathan C.; Hettich, Robert L.; Banfield, Jillian F.

    2013-01-01

    Leptospirillum spp. are widespread members of acidophilic microbial communities that catalyze ferrous iron oxidation, thereby increasing sulfide mineral dissolution rates. These bacteria play important roles in environmental acidification and are harnessed for bioleaching-based metal recovery. Known members of the Leptospirillum clade of the Nitrospira phylum are Leptospirillum ferrooxidans (group I), Leptospirillum ferriphilum and “Leptospirillum rubarum” (group II), and Leptospirillum ferrodiazotrophum (group III). In the Richmond Mine acid mine drainage (AMD) system, biofilm formation is initiated by L. rubarum; L. ferrodiazotrophum appears in later developmental stages. Here we used community metagenomic data from unusual, thick floating biofilms to identify distinguishing metabolic traits in a rare and uncultivated community member, the new species “Leptospirillum group IV UBA BS.” These biofilms typically also contain a variety of Archaea, Actinobacteria, and a few other Leptospirillum spp. The Leptospirillum group IV UBA BS species shares 98% 16S rRNA sequence identity and 70% average amino acid identity between orthologs with its closest relative, L. ferrodiazotrophum. The presence of nitrogen fixation and reverse tricarboxylic acid (TCA) cycle proteins suggest an autotrophic metabolism similar to that of L. ferrodiazotrophum, while hydrogenase proteins suggest anaerobic metabolism. Community transcriptomic and proteomic analyses demonstrate expression of a multicopper oxidase unique to this species, as well as hydrogenases and core metabolic genes. Results suggest that the Leptospirillum group IV UBA BS species might play important roles in carbon fixation, nitrogen fixation, hydrogen metabolism, and iron oxidation in some acidic environments. PMID:23645189

  18. NIH working group report—using genomic information to guide weight management: From universal to precision treatment

    PubMed Central

    Bray, Molly S; Loos, Ruth JF; McCaffery, Jeanne M; Ling, Charlotte; Franks, Paul W; Weinstock, George M; Snyder, Michael P; Vassy, Jason L; Agurs-Collins, Tanya

    2016-01-01

    Objective Precision medicine utilizes genomic and other data to optimize and personalize treatment. Although more than 2,500 genetic tests are currently available, largely for extreme and/or rare phenotypes, the question remains whether this approach can be used for the treatment of common, complex conditions like obesity, inflammation, and insulin resistance, which underlie a host of metabolic diseases. Methods This review, developed from a Trans-NIH Conference titled “Genes, Behaviors, and Response to Weight Loss Interventions,” provides an overview of the state of genetic and genomic research in the area of weight change and identifies key areas for future research. Results Although many loci have been identified that are associated with cross-sectional measures of obesity/body size, relatively little is known regarding the genes/loci that influence dynamic measures of weight change over time. Although successful short-term weight loss has been achieved using many different strategies, sustainable weight loss has proven elusive for many, and there are important gaps in our understanding of energy balance regulation. Conclusions Elucidating the molecular basis of variability in weight change has the potential to improve treatment outcomes and inform innovative approaches that can simultaneously take into account information from genomic and other sources in devising individualized treatment plans. PMID:26692578

  19. The Mitochondrial Genome of the Prasinophyte Prasinoderma coloniale Reveals Two Trans-Spliced Group I Introns in the Large Subunit rRNA Gene

    PubMed Central

    Pombert, Jean-François; Otis, Christian; Turmel, Monique; Lemieux, Claude

    2013-01-01

    Organelle genes are often interrupted by group I and or group II introns. Splicing of these mobile genetic occurs at the RNA level via serial transesterification steps catalyzed by the introns'own tertiary structures and, sometimes, with the help of external factors. These catalytic ribozymes can be found in cis or trans configuration, and although trans-arrayed group II introns have been known for decades, trans-spliced group I introns have been reported only recently. In the course of sequencing the complete mitochondrial genome of the prasinophyte picoplanktonic green alga Prasinoderma coloniale CCMP 1220 (Prasinococcales, clade VI), we uncovered two additional cases of trans-spliced group I introns. Here, we describe these introns and compare the 54,546 bp-long mitochondrial genome of Prasinoderma with those of four other prasinophytes (clades II, III and V). This comparison underscores the highly variable mitochondrial genome architecture in these ancient chlorophyte lineages. Both Prasinoderma trans-spliced introns reside within the large subunit rRNA gene (rnl) at positions where cis-spliced relatives, often containing homing endonuclease genes, have been found in other organelles. In contrast, all previously reported trans-spliced group I introns occur in different mitochondrial genes (rns or coxI). Each Prasinoderma intron is fragmented into two pieces, forming at the RNA level a secondary structure that resembles those of its cis-spliced counterparts. As observed for other trans-spliced group I introns, the breakpoint of the first intron maps to the variable loop L8, whereas that of the second is uniquely located downstream of P9.1. The breakpoint In each Prasinoderma intron corresponds to the same region where the open reading frame (ORF) occurs when present in cis-spliced orthologs. This correlation between the intron breakpoint and the ORF location in cis-spliced orthologs also holds for other trans-spliced introns; we discuss the possible implications

  20. The mitochondrial genome of the prasinophyte Prasinoderma coloniale reveals two trans-spliced group I introns in the large subunit rRNA gene.

    PubMed

    Pombert, Jean-François; Otis, Christian; Turmel, Monique; Lemieux, Claude

    2013-01-01

    Organelle genes are often interrupted by group I and or group II introns. Splicing of these mobile genetic occurs at the RNA level via serial transesterification steps catalyzed by the introns'own tertiary structures and, sometimes, with the help of external factors. These catalytic ribozymes can be found in cis or trans configuration, and although trans-arrayed group II introns have been known for decades, trans-spliced group I introns have been reported only recently. In the course of sequencing the complete mitochondrial genome of the prasinophyte picoplanktonic green alga Prasinoderma coloniale CCMP 1220 (Prasinococcales, clade VI), we uncovered two additional cases of trans-spliced group I introns. Here, we describe these introns and compare the 54,546 bp-long mitochondrial genome of Prasinoderma with those of four other prasinophytes (clades II, III and V). This comparison underscores the highly variable mitochondrial genome architecture in these ancient chlorophyte lineages. Both Prasinoderma trans-spliced introns reside within the large subunit rRNA gene (rnl) at positions where cis-spliced relatives, often containing homing endonuclease genes, have been found in other organelles. In contrast, all previously reported trans-spliced group I introns occur in different mitochondrial genes (rns or coxI). Each Prasinoderma intron is fragmented into two pieces, forming at the RNA level a secondary structure that resembles those of its cis-spliced counterparts. As observed for other trans-spliced group I introns, the breakpoint of the first intron maps to the variable loop L8, whereas that of the second is uniquely located downstream of P9.1. The breakpoint In each Prasinoderma intron corresponds to the same region where the open reading frame (ORF) occurs when present in cis-spliced orthologs. This correlation between the intron breakpoint and the ORF location in cis-spliced orthologs also holds for other trans-spliced introns; we discuss the possible implications

  1. Genome sequence and analysis of a broad-host range lytic bacteriophage that infects the Bacillus cereus group

    PubMed Central

    2013-01-01

    Background Comparatively little information is available on members of the Myoviridae infecting low G+C content, Gram-positive host bacteria of the family Firmicutes. While numerous Bacillus phages have been isolated up till now only very few Bacillus cereus phages have been characterized in detail. Results Here we present data on the large, virulent, broad-host-range B. cereus phage vB_BceM_Bc431v3 (Bc431v3). Bc431v3 features a 158,618 bp dsDNA genome, encompassing 239 putative open reading frames (ORFs) and, 20 tRNA genes encoding 17 different amino acids. Since pulsed-field gel electrophoresis indicated that the genome of this phage has a mass of 155-158 kb Bc431v3 DNA appears not to contain long terminal repeats that are found in the genome of Bacillus phage SPO1. Conclusions Bc431v3 displays significant sequence similarity, at the protein level, to B. cereus phage BCP78, Listeria phage A511 and Enterococcus phage ØEF24C and other morphologically related phages infecting Firmicutes such as Staphylococcus phage K and Lactobacillus phage LP65. Based on these data we suggest that Bc431v3 should be included as a member of the Spounavirinae; however, because of all the diverse taxonomical information has been addressed recently, it is difficult to determine the genus. The Bc431v3 phage contains some highly unusual genes such as gp143 encoding putative tRNAHis guanylyltransferase. In addition, it carries some genes that appear to be related to the host sporulation regulators. These are: gp098, which encodes a putative segregation protein related to FstK/SpoIIIE DNA transporters; gp105, a putative segregation protein; gp108, RNA polymerase sigma factor F/B; and, gp109 encoding RNA polymerase sigma factor G. PMID:23388049

  2. Whole genomic analysis of human G1P[8] rotavirus strains from different age groups in China.

    PubMed

    Shintani, Tsuzumi; Ghosh, Souvik; Wang, Yuan-Hong; Zhou, Xuan; Zhou, Dun-Jin; Kobayashi, Nobumichi

    2012-08-01

    G1P[8] rotaviruses are an important cause of diarrhea in humans in China. To date, there are no reports on the whole genomic analysis of the Chinese G1P[8] rotaviruses. To determine the origin and overall genetic makeup of the recent Chinese G1P[8] strains, the whole genomes of three strains, RVA/Human-wt/CHN/E1911/2009/G1P[8], RVA/Human-tc/CHN/R588/2005/G1P[8] and RVA/Human-tc/CHN/Y128/2004/G1P[8], detected in an infant, a child and an adult, respectively, were analyzed. Strains E1911, R588 and Y128 exhibited a typical Wa-like genotype constellation. Except for the NSP3 gene of E1911, the whole genomes of strains E1911, R588 and Y128 were found to be more closely related to those of the recent Wa-like common human strains from different countries than those of the prototype G1P[8] strain, or other old strains. On the other hand, the NSP3 gene of E1911 was genetically distinct from those of Y128, R588, or other Wa-like common human strains, and appeared to share a common origin with those of the porcine-like human G9 strains, providing evidence for intergenotype reassortment events. Comparisons of the amino acid residues defining the VP7 and VP4 antigenic domains revealed several mismatches between these Chinese G1P[8] strains and the G1 and P[8] strains contained in the currently licensed rotavirus vaccines Rotarix(TM )and RotaTeq(TM). PMID:23012626

  3. Description of Bacillus toyonensis sp. nov., a novel species of the Bacillus cereus group, and pairwise genome comparisons of the species of the group by means of ANI calculations.

    PubMed

    Jiménez, Guillermo; Urdiain, Mercedes; Cifuentes, Ana; López-López, Aránzazu; Blanch, Anicet R; Tamames, Javier; Kämpfer, Peter; Kolstø, Anne-Brit; Ramón, Daniel; Martínez, Juan F; Codoñer, Francisco M; Rosselló-Móra, Ramon

    2013-09-01

    Strain BCT-7112(T) was isolated in 1966 in Japan from a survey designed to obtain naturally occurring microorganisms as pure cultures in the laboratory for use as probiotics in animal nutrition. This strain, which was primarily identified as Bacillus cereus var toyoi, has been in use for more than 30 years as the active ingredient of the preparation TOYOCERIN(®), an additive for use in animal nutrition (e.g. swine, poultry, cattle, rabbits and aquaculture). Despite the fact that the strain was initially classified as B. cereus, it showed significant genomic differences from the type strains of the B. cereus group that were large enough (ANI values below 92%) to allow it to be considered as a different species within the group. The polyphasic taxonomic study presented here provides sufficient discriminative parameters to classify BCT-7112(T) as a new species for which the name Bacillus toyonensis sp. nov. is proposed, with BCT-7112(T) (=CECT 876(T); =NCIMB 14858(T)) being designated as the type strain. In addition, a pairwise comparison between the available genomes of the whole B. cereus group by means of average nucleotide identity (ANI) calculations indicated that besides the eight classified species (including B. toyonensis), additional genomospecies could be detected, and most of them also had ANI values below 94%. ANI values were on the borderline of a species definition only in the cases of representatives of B. cereus versus B. thuringiensis, and B. mycoides and B. weihenstephanensis.

  4. Genome analysis of Chitinivibrio alkaliphilus gen. nov., sp. nov., a novel extremely haloalkaliphilic anaerobic chitinolytic bacterium from the candidate phylum Termite Group 3.

    PubMed

    Sorokin, Dimitry Y; Gumerov, Vadim M; Rakitin, Andrey L; Beletsky, Alexey V; Damsté, J S Sinninghe; Muyzer, Gerard; Mardanov, Andrey V; Ravin, Nikolai V

    2014-06-01

    Anaerobic enrichments from hypersaline soda lakes with chitin as substrate yielded five closely related anaerobic haloalkaliphilic isolates growing on insoluble chitin by fermentation at pH 10 and salinities up to 3.5 M. The chitinolytic activity was exclusively cell associated. To better understand the biology and evolutionary history of this novel bacterial lineage, the genome of the type strain ACht1 was sequenced. Analysis of the 2.6 Mb draft genome revealed enzymes of chitin-degradation pathways, including secreted cell-bound chitinases. The reconstructed central metabolism revealed pathways enabling the fermentation of polysaccharides, while it lacks the genes needed for aerobic or anaerobic respiration. The Rnf-type complex, oxaloacetate decarboxylase and sodium-transporting V-type adenosine triphosphatase were identified among putative membrane-bound ion pumps. According to 16S ribosomal RNA analysis, the isolates belong to the candidate phylum Termite Group 3, representing its first culturable members. Phylogenetic analysis using ribosomal proteins and taxonomic distribution analysis of the whole proteome supported a class-level classification of ACht1 most probably affiliated to the phylum Fibribacteres. Based on phylogenetic, phenotypic and genomic analyses, the novel bacteria are proposed to be classified as Chitinivibrio alkaliphilus gen. nov., sp. nov., within a novel class Chitinivibrione.

  5. Complete genome analyses of the first porcine rotavirus group H identified from a South African pig does not provide evidence for recent interspecies transmission events.

    PubMed

    Nyaga, Martin M; Peenze, Ina; Potgieter, Christiaan A; Seheri, L Mapaseka; Page, Nicola A; Yinda, Claude K; Steele, A Duncan; Matthijnssens, Jelle; Mphahlele, M Jeffrey

    2016-03-01

    Rotaviruses (RVs) are classified into eight species/groups (RVA-RVH) according to the migration patterns of their 11 genome segments, as well as by serological and molecular properties of Viral Protein 6 (VP6). In 1997 a new unclassified RV was reported infecting adults in Bangladesh and China. This virus was initially named novel adult diarrhoea rotavirus (ADRV-N), but later renamed as RVH. Since then, RVH has been detected in humans only very sporadically. However, RVH is increasingly being detected in pig populations in the USA, Brazil and Japan, but not yet in Africa. Unfortunately, whole genome sequence data of porcine RVH strains in GenBank is currently restricted to a single strain (SKA-1) from Japan. Porcine diarrhoeic samples were collected in South Africa and analysed for rotavirus using an RVA ELISA and electropherotyping by PAGE. One sample displayed a 4:2:1:1:1:1:1 migration pattern, typical for RVH. In order to further investigate this strain, sequence-independent amplification followed by random sequencing using the 454/Roche GS FLX Sequencer was performed, resulting in the second complete porcine RVH strain (MRC-DPRU1575) available in databases. Phylogenetically, all segments of MRC-DPRU1575 clustered closely with the SKA-1 strain and in some segments with known porcine RVH strains from Brazil and the USA. In contrast, the porcine RVH strains were only distantly related to human RVH strains from Asia and a partial RVH-like strain recently detected in bats from Cameroon. Overall, strain MRC-DPRU1575 is the first complete genome of a porcine RVH from Africa and allows for the development of improved RVH screening methods. Our analyses indicate that RVH strains cluster according to their host species, not suggesting any evidence of recent interspecies transmission events. However, more RVH genomes from a wider host range are needed to better understand their evolutionary pathways and zoonotic potential. PMID:26658066

  6. Genomic Copy Number Signatures Uncovered a Genetically Distinct Group from Adenocarcinoma and Squamous Cell Carcinoma in Non-Small Cell Lung Cancer.

    PubMed

    Lee, Eunjung; Moon, Ji Wook; Wang, Xianfu; Kim, Chungyeul; Li, Shibo; Shin, Bong Kyung; Jung, Wonkyung; Kim, Hyun Koo; Kim, Han Kyeom; Lee, Ji-Yun

    2015-08-01

    Adenocarcinoma (AC) and squamous cell carcinoma (SCC) of non-small cell lung carcinoma (NSCLC) have different clinical presentations, morphologies, treatments, and prognoses. Recent studies suggested that fundamental genetic alterations related to carcinogenesis of each tumor type may be different. In this study, we investigated the genomic alterations of 47 primary NSCLC samples (22 ACs and 25 SCCs) as well as the corresponding normal tissue using array comparative genomic hybridization. Frequent copy number alterations (CNAs), which were identified in more than 68% of all of the cases, were evaluated in each subtype (SCC and AC), and a CNA signature was established. Among these CNAs, 37 genes from the SCCs and 15 genes from the ACs were located in a region of gain, and 4 genes from the SCCs and 13 genes from the ACs were located in a region of loss. The most frequent gain was located on 3q26-29 including the gene TP63 in SCCs and 7q11.23 and 7q36.3 in ACs. Moreover, we identified 3 genetically distinct groups (group I [16 SCC] with CNA signature of SCC; group II [7 SCC + 8 AC], which has a genetically distinctive CNA signature from SCC and AC; and group III [2 SCC + 14 AC] with CNA signature of AC) by gene clustering extracted from CNAs, which are associated with a prognosis. The present study contributed to the molecular characterization of AC and SCC of NSCLC and showed a subtype of tumor that has a unique genetic CNA signature. However, further study about the significance of these 3 distinct groups and their usefulness as a diagnostic marker of identified CNAs is necessary.

  7. A collection of ordered tetranucleotide-repeat markers from the human genome. The Utah Marker Development Group.

    PubMed Central

    1995-01-01

    A collection of 1,069 human PCR-based genetic markers has been developed, and their distribution over the 22 autosomes and the X chromosome has been determined. Each marker was developed around a short-tandem-repeat DNA sequence. The majority (85%) of the markers described here were selected to contain tetranucleotide repeats, because these repeats show better stability during PCR than do dinucleotide repeats. Linkage maps constructed from genotypes collected with these markers in four CEPH pedigrees (1331, 1332, 1362, and 884) covered 3,417 cM of the human genome. More than 600 of the loci revealed heterozygosities > .70. Overall, 444 loci were ordered, with odds > 100:1 against inversion of adjacent loci. The average distance between markers was 7.4 cM on the autosomes and 24.8 cM on the X chromosome. Likely locations (100:1 odds intervals) were assigned for the remaining 621 short-tandem-repeat polymorphisms, as well as for 160 other markers that are present on the framework maps published by the Cooperative Human Linkage Center. Four markers specific to the Y chromosome are also reported here. From our maps, 347 markers were chosen to define "index" maps for each of the 22 autosomes. The index markers detect loci with an average heterozygosity of .85 and cover 3,169 cM of the autosomes, with an average distance between markers of 9.2 cM. These polymorphic short tandem repeats will be highly useful as reagents for the ongoing genetic and physical mapping of the human genome and for characterization of genetic changes in cancer. PMID:7668290

  8. Nucleosome organization in the Drosophila genome.

    PubMed

    Mavrich, Travis N; Jiang, Cizhong; Ioshikhes, Ilya P; Li, Xiaoyong; Venters, Bryan J; Zanton, Sara J; Tomsho, Lynn P; Qi, Ji; Glaser, Robert L; Schuster, Stephan C; Gilmour, David S; Albert, Istvan; Pugh, B Franklin

    2008-05-15

    Comparative genomics of nucleosome positions provides a powerful means for understanding how the organization of chromatin and the transcription machinery co-evolve. Here we produce a high-resolution reference map of H2A.Z and bulk nucleosome locations across the genome of the fly Drosophila melanogaster and compare it to that from the yeast Saccharomyces cerevisiae. Like Saccharomyces, Drosophila nucleosomes are organized around active transcription start sites in a canonical -1, nucleosome-free region, +1 arrangement. However, Drosophila does not incorporate H2A.Z into the -1 nucleosome and does not bury its transcriptional start site in the +1 nucleosome. At thousands of genes, RNA polymerase II engages the +1 nucleosome and pauses. How the transcription initiation machinery contends with the +1 nucleosome seems to be fundamentally different across major eukaryotic lines.

  9. Comparative genomic analysis of catfish linkage group 8 reveals two homologous chromosomes in zebrafish and other teleosts with extensive inter-chromosomal rearrangements

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background Comparative genomics is a powerful tool to transfer genomic information from model species to related non-model species. Channel catfish (Ictalurus punctatus) is the primary aquaculture species in the United States. Its existing genome resources such as genomic sequences generated from n...

  10. Genome-Wide SNP Analysis of Southern African Populations Provides New Insights into the Dispersal of Bantu-Speaking Groups

    PubMed Central

    González-Santos, Miguel; Montinaro, Francesco; Oosthuizen, Ockie; Oosthuizen, Erica; Busby, George B.J.; Anagnostou, Paolo; Destro-Bisol, Giovanni; Pascali, Vincenzo; Capelli, Cristian

    2015-01-01

    The expansion of Bantu-speaking agropastoralist populations had a great impact on the genetic, linguistic, and cultural variation of sub-Saharan Africa. It is generally accepted that Bantu languages originated in an area around the present border between Cameroon and Nigeria approximately 5,000 years ago, from where they spread South and East becoming the largest African linguistic branch. The demic consequences of this event are reflected in the relatively high genetic homogeneity observed across most of sub-Saharan Africa populations. In this work, we explored genome-wide single nucleotide polymorphism data from 28 populations to characterize the genetic components present in sub-Saharan African populations. Combining novel data from four Southern African populations with previously published results, we reject the hypothesis that the “non-Bantu” genetic component reported in South-Eastern Africa (Mozambique) reflects extensive gene flow between incoming agriculturalist and resident hunter-gatherer communities. We alternatively suggest that this novel component is the result of demographic dynamics associated with the Bantu dispersal. PMID:26363465

  11. Genomic Analysis Reveals Multi-Drug Resistance Clusters in Group B Streptococcus CC17 Hypervirulent Isolates Causing Neonatal Invasive Disease in Southern Mainland China

    PubMed Central

    Campisi, Edmondo; Rosini, Roberto; Ji, Wenjing; Guidotti, Silvia; Rojas-López, Maricarmen; Geng, Guozhu; Deng, Qiulian; Zhong, Huamin; Wang, Weidong; Liu, Haiying; Nan, Cassandra; Margarit, Immaculada; Rinaudo, C. D.

    2016-01-01

    Neonatal invasive disease caused by group B Streptococcus (GBS) represents a significant public health care concern globally. However, data related to disease burden, serotype distribution, and molecular epidemiology in China and other Asian countries are very few and specifically relative to confined regions. The aim of this study was to investigate the genetic characteristics of GBS isolates recovered from neonates with invasive disease during 2013–2014 at Guangzhou and Changsha hospitals in southern mainland China. We assessed the capsular polysaccharide type, pilus islands (PIs) distribution and hvgA gene presence in a panel of 26 neonatal clinical isolates, of which 8 were recovered from Early Onset Disease and 18 from Late Onset Disease (LOD). Among 26 isolates examined, five serotypes were identified. Type III was the most represented (15 cases), particularly among LOD strains (n = 11), followed by types Ib (n = 5), V (n = 3), Ia (n = 2) and II (n = 1). We performed whole-genome sequencing analysis and antimicrobial susceptibility testing on the 14 serotype III isolates belonging to the hypervirulent Clonal Complex 17 (serotype III-CC17). The presence of PI-2b alone was associated with 13 out of 14 serotype III-CC17 strains. Genome analysis led us to identify two multi-drug resistance gene clusters harbored in two new versions of integrative and conjugative elements (ICEs), carrying five or eight antibiotic resistance genes, respectively. These ICEs replaced the 16 kb-locus that normally contains the PI-1 operon. All isolates harboring the identified ICEs showed multiple resistances to aminoglycoside, macrolide, and tetracycline antibiotic classes. In conclusion, we report the first whole-genome sequence analysis of 14 GBS serotype III-CC17 strains isolated in China, representing the most prevalent lineage causing neonatal invasive disease. The acquisition of newly identified ICEs conferring multiple antibiotic resistance could in part explain the spread

  12. Genomic Analysis Reveals Multi-Drug Resistance Clusters in Group B Streptococcus CC17 Hypervirulent Isolates Causing Neonatal Invasive Disease in Southern Mainland China.

    PubMed

    Campisi, Edmondo; Rosini, Roberto; Ji, Wenjing; Guidotti, Silvia; Rojas-López, Maricarmen; Geng, Guozhu; Deng, Qiulian; Zhong, Huamin; Wang, Weidong; Liu, Haiying; Nan, Cassandra; Margarit, Immaculada; Rinaudo, C D

    2016-01-01

    Neonatal invasive disease caused by group B Streptococcus (GBS) represents a significant public health care concern globally. However, data related to disease burden, serotype distribution, and molecular epidemiology in China and other Asian countries are very few and specifically relative to confined regions. The aim of this study was to investigate the genetic characteristics of GBS isolates recovered from neonates with invasive disease during 2013-2014 at Guangzhou and Changsha hospitals in southern mainland China. We assessed the capsular polysaccharide type, pilus islands (PIs) distribution and hvgA gene presence in a panel of 26 neonatal clinical isolates, of which 8 were recovered from Early Onset Disease and 18 from Late Onset Disease (LOD). Among 26 isolates examined, five serotypes were identified. Type III was the most represented (15 cases), particularly among LOD strains (n = 11), followed by types Ib (n = 5), V (n = 3), Ia (n = 2) and II (n = 1). We performed whole-genome sequencing analysis and antimicrobial susceptibility testing on the 14 serotype III isolates belonging to the hypervirulent Clonal Complex 17 (serotype III-CC17). The presence of PI-2b alone was associated with 13 out of 14 serotype III-CC17 strains. Genome analysis led us to identify two multi-drug resistance gene clusters harbored in two new versions of integrative and conjugative elements (ICEs), carrying five or eight antibiotic resistance genes, respectively. These ICEs replaced the 16 kb-locus that normally contains the PI-1 operon. All isolates harboring the identified ICEs showed multiple resistances to aminoglycoside, macrolide, and tetracycline antibiotic classes. In conclusion, we report the first whole-genome sequence analysis of 14 GBS serotype III-CC17 strains isolated in China, representing the most prevalent lineage causing neonatal invasive disease. The acquisition of newly identified ICEs conferring multiple antibiotic resistance could in part explain the spread of

  13. Sinorhizobium meliloti Phage ΦM9 Defines a New Group of T4 Superfamily Phages with Unusual Genomic Features but a Common T=16 Capsid

    PubMed Central

    Johnson, Matthew C.; Tatum, Kelsey B.; Lynn, Jason S.; Brewer, Tess E.; Lu, Stephen; Washburn, Brian K.

    2015-01-01

    ABSTRACT Relatively little is known about the phages that infect agriculturally important nitrogen-fixing rhizobial bacteria. Here we report the genome and cryo-electron microscopy structure of the Sinorhizobium meliloti-infecting T4 superfamily phage ΦM9. This phage and its close relative Rhizobium phage vB_RleM_P10VF define a new group of T4 superfamily phages. These phages are distinctly different from the recently characterized cyanophage-like S. meliloti phages of the ΦM12 group. Structurally, ΦM9 has a T=16 capsid formed from repeating units of an extended gp23-like subunit that assemble through interactions between one subunit and the adjacent E-loop insertion domain. Though genetically very distant from the cyanophages, the ΦM9 capsid closely resembles that of the T4 superfamily cyanophage Syn9. ΦM9 also has the same T=16 capsid architecture as the very distant phage SPO1 and the herpesviruses. Despite their overall lack of similarity at the genomic and structural levels, ΦM9 and S. meliloti phage ΦM12 have a small number of open reading frames in common that appear to encode structural proteins involved in interaction with the host and which may have been acquired by horizontal transfer. These proteins are predicted to encode tail baseplate proteins, tail fibers, tail fiber assembly proteins, and glycanases that cleave host exopolysaccharide. IMPORTANCE Despite recent advances in the phylogenetic and structural characterization of bacteriophages, only a small number of phages of plant-symbiotic nitrogen-fixing soil bacteria have been studied at the molecular level. The effects of phage predation upon beneficial bacteria that promote plant growth remain poorly characterized. First steps in understanding these soil bacterium-phage dynamics are genetic, molecular, and structural characterizations of these groups of phages. The T4 superfamily phages are among the most complex phages; they have large genomes packaged within an icosahedral head and a long

  14. The mitochondrial genome of Paraspadella gotoi is highly reduced and reveals that chaetognaths are a sister-group to protostomes

    SciTech Connect

    Helfenbein, Kevin G.; Fourcade, H. Matthew; Vanjani, Rohit G.; Boore, Jeffrey L.

    2004-05-01

    We report the first complete mitochondrial (mt) DNA sequence from a member of the phylum Chaetognatha (arrow worms). The Paraspadella gotoi mtDNA is highly unusual, missing 23 of the genes commonly found in animal mtDNAs, including atp6, which has otherwise been found universally to be present. Its 14 genes are unusually arranged into two groups, one on each strand. One group is punctuated by numerous non-coding intergenic nucleotides, while the other group is tightly packed, having no non-coding nucleotides, leading to speculation that there are two transcription units with differing modes of expression. The phylogenetic position of the Chaetognatha within the Metazoa has long been uncertain, with conflicting or equivocal results from various morphological analyses and rRNA sequence comparisons. Comparisons here of amino acid sequences from mitochondrially encoded proteins gives a single most parsimonious tree that supports a position of Chaetognatha as sister to the protostomes studied here. From this, one can more clearly interpret the patterns of evolution of various developmental features, especially regarding the embryological fate of the blastopore.

  15. CELF4 Variant and Anthracycline-Related Cardiomyopathy: A Children’s Oncology Group Genome-Wide Association Study

    PubMed Central

    Wang, Xuexia; Sun, Can-Lan; Quiñones-Lombraña, Adolfo; Singh, Purnima; Landier, Wendy; Hageman, Lindsey; Mather, Molly; Rotter, Jerome I.; Taylor, Kent D.; Chen, Yii-Der Ida; Armenian, Saro H.; Winick, Naomi; Ginsberg, Jill P.; Neglia, Joseph P.; Oeffinger, Kevin C.; Castellino, Sharon M.; Dreyer, Zoann E.; Hudson, Melissa M.; Robison, Leslie L.; Blanco, Javier G.

    2016-01-01

    Purpose Interindividual variability in the dose-dependent association between anthracyclines and cardiomyopathy suggests that genetic susceptibility could play a role. The current study uses an agnostic approach to identify genetic variants that could modify cardiomyopathy risk. Methods A genome-wide association study was conducted in childhood cancer survivors with and without cardiomyopathy (cases and controls, respectively). Single-nucleotide polymorphisms (SNPs) that surpassed a prespecified threshold for statistical significance were independently replicated. The possible mechanistic significance of validated SNP(s) was sought by using healthy heart samples. Results No SNP was marginally associated with cardiomyopathy. However, SNP rs1786814 on the CELF4 gene passed the significance cutoff for gene-environment interaction (Pge = 1.14 × 10−5). Multivariable analyses adjusted for age at cancer diagnosis, sex, anthracycline dose, and chest radiation revealed that, among patients with the A allele, cardiomyopathy was infrequent and not dose related. However, among those exposed to greater than 300 mg/m2 of anthracyclines, the rs1786814 GG genotype conferred a 10.2-fold (95% CI, 3.8- to 27.3-fold; P < .001) increased risk of cardiomyopathy compared with those who had GA/AA genotypes and anthracycline exposure of 300 mg/m2 or less. This gene-environment interaction was successfully replicated in an independent set of anthracycline-related cardiomyopathy. CUG-BP and ETR-3-like factor proteins control developmentally regulated splicing of TNNT2, the gene that encodes for cardiac troponin T (cTnT), a biomarker of myocardial injury. Coexistence of more than one cTnT variant results in a temporally split myofilament response to calcium, which causes decreased contractility. Analysis of TNNT2 splicing variants in healthy human hearts suggested an association between the rs1786814 GG genotype and coexistence of more than one TNNT2 splicing variant (90.5% GG v 41.7% GA

  16. Genomic characterization, phylogenetic comparison and differential expression of the cyclic nucleotide-gated channels gene family in pear (Pyrus bretchneideri Rehd.).

    PubMed

    Chen, Jianqing; Yin, Hao; Gu, Jinping; Li, Leiting; Liu, Zhe; Jiang, Xueting; Zhou, Hongsheng; Wei, Shuwei; Zhang, Shaoling; Wu, Juyou

    2015-01-01

    The cyclic nucleotide-gated channel (CNGC) family is involved in the uptake of various cations, such as Ca(2+), to regulate plant growth and respond to biotic and abiotic stresses. However, there is far less information about this family in woody plants such as pear. Here, we provided a genome-wide identification and analysis of the CNGC gene family in pear. Phylogenetic analysis showed that the 21 pear CNGC genes could be divided into five groups (I, II, III, IVA and IVB). The majority of gene duplications in pear appeared to have been caused by segmental duplication and occurred 32.94-39.14 million years ago. Evolutionary analysis showed that positive selection had driven the evolution of pear CNGCs. Motif analyses showed that Group I CNGCs generally contained 26 motifs, which was the greatest number of motifs in all CNGC groups. Among these, eight motifs were shared by each group, suggesting that these domains play a conservative role in CNGC activity. Tissue-specific expression analysis indicated that functional diversification of the duplicated CNGC genes was a major feature of long-term evolution. Our results also suggested that the P-S6 and PBC & hinge domains had co-evolved during the evolution. These results provide valuable information to increase our understanding of the function, evolution and expression analyses of the CNGC gene family in higher plants.

  17. Complete Genome Sequence of Pelosinus fermentans JBW45, a Member of a Remarkably Competitive Group of Negativicutes in the Firmicutes Phylum

    DOE PAGES

    De León, Kara B.; Utturkar, Sagar M.; Camilleri, Laura B.; Elias, Dwayne A.; Arkin, Adam P.; Fields, Matthew W.; Brown, Steven D.; Wall, Judy D.

    2015-09-24

    The genome of Pelosinus fermentans JBW45, isolated from a chromium-contaminated site in Hanford, Washington, USA, has been completed with PacBio sequencing. Finally, nine copies of the rRNA gene operon and multiple transposase genes with identical sequences resulted in breaks in the original draft genome and may suggest genomic instability of JBW45.

  18. Complete Genome Sequence of Pelosinus fermentans JBW45, a Member of a Remarkably Competitive Group of Negativicutes in the Firmicutes Phylum

    PubMed Central

    De León, Kara B.; Utturkar, Sagar M.; Camilleri, Laura B.; Elias, Dwayne A.; Arkin, Adam P.; Fields, Matthew W.; Brown, Steven D.

    2015-01-01

    The genome of Pelosinus fermentans JBW45, isolated from a chromium-contaminated site in Hanford, Washington, USA, has been completed with PacBio sequencing. Nine copies of the rRNA gene operon and multiple transposase genes with identical sequences resulted in breaks in the original draft genome and may suggest genomic instability of JBW45. PMID:26404608

  19. Genome-wide protective response used by group A Streptococcus to evade destruction by human polymorphonuclear leukocytes.

    PubMed

    Voyich, Jovanka M; Sturdevant, Daniel E; Braughton, Kevin R; Kobayashi, Scott D; Lei, Benfang; Virtaneva, Kimmo; Dorward, David W; Musser, James M; DeLeo, Frank R

    2003-02-18

    Group A Streptococcus (GAS) evades polymorphonuclear leukocyte (PMN) phagocytosis and killing to cause human disease, including pharyngitis and necrotizing fasciitis (flesh-eating syndrome). We show that GAS genes differentially regulated during phagocytic interaction with human PMNs comprise a global pathogen-protective response to innate immunity. GAS prophage genes and genes involved in virulence, oxidative stress, cell wall biosynthesis, and gene regulation were up-regulated during PMN phagocytosis. Genes encoding novel secreted proteins were up-regulated, and the proteins were produced during human GAS infections. We discovered an essential role for the Ihk-Irr two-component regulatory system in evading PMN-mediated killing and promoting host-cell lysis, processes that would facilitate GAS pathogenesis. Importantly, the irr gene was highly expressed during human GAS pharyngitis. We conclude that a complex pathogen genetic program circumvents human innate immunity to promote disease. The gene regulatory program revealed by our studies identifies previously undescribed potential vaccine antigens and targets for therapeutic interventions designed to control GAS infections.

  20. Genomics:GTL Contractor-Grantee Workshop IV and Metabolic Engineering Working Group Inter-Agency Conference on Metabolic Engineering 2006

    SciTech Connect

    Mansfield, Betty Kay; Martin, Sheryl A

    2006-02-01

    Welcome to the 2006 joint meeting of the fourth Genomics:GTL Contractor-Grantee Workshop and the six Metabolic Engineering Working Group Inter-Agency Conference. The vision and scope of the Genomics:GTL program continue to expand and encompass research and technology issues from diverse scientific disciplines, attracting broad interest and support from researchers at universities, DOE national laboratories, and industry. Metabolic engineering's vision is the targeted and purposeful alteration of metabolic pathways to improve the understanding and use of cellular pathways for chemical transformation, energy transduction, and supramolecular assembly. These two programs have much complementarity in both vision and technological approaches, as reflected in this joint workshop. GLT's challenge to the scientific community remains the further development and use of a broad array of innovative technologies and computational tools to systematically leverage the knowledge and capabilities brought to us by DNA sequencing projects. The goal is to seek a broad and predictive understanding of the functioning and control of complex systems--individual microbes, microbial communities, and plants. GTL's prominent position at the interface of the physical, computational, and biological sciences is both a strength and challenge. Microbes remain GTL's principal biological focus. In the complex 'simplicity' of microbes, they find capabilities needed by DOE and the nation for clean and secure energy, cleanup of environmental contamination, and sequestration of atmospheric carbon dioxide that contributes to global warming. An ongoing challenge for the entire GTL community is to demonstrate that the fundamental science conducted in each of your research projects brings us a step closer to biology-based solutions for these important national energy and environmental needs.

  1. Improved immunogenicity of a H44/76 group B outer membrane vesicle vaccine with over-expressed genome-derived Neisserial antigen 1870.

    PubMed

    Koeberling, Oliver; Welsch, Jo Anne; Granoff, Dan M

    2007-02-26

    A broadly protective vaccine against meningococcal group B disease is not available. We previously reported that an outer membrane vesicle (OMV) vaccine containing over-expressed genome-derived antigen (GNA) 1870 elicited broader protective antibody responses than recombinant GNA1870 or conventional OMV vaccines prepared from a strain that naturally expresses low amounts of GNA1870. Certain wildtype strains such as H44/76 naturally express larger amounts of GNA1870 and, potentially, could be used to prepare an improved OMV vaccine without genetic over-expression of the antigen. We transformed H44/76 with a shuttle vector to over-express variant 1 (v.1) GNA1870 and compared the immunogenicity in mice of OMV vaccines prepared from wildtype H44/76 (v.1), the mutant, and a recombinant v.1 GNA1870 vaccine. Mice immunized with OMV with over-expressed GNA1870 developed broader serum bactericidal and/or greater C3 deposition activity on the surface of encapsulated strains of N. meningitidis than control mice immunized with the OMV vaccine prepared from the wildtype strain, or the rGNA1870 vaccine. When a panel of group B strains from patients in California was tested, sera from mice immunized with the OMV vaccine containing over-expressed GNA1870 were bactericidal against 100% of the v.1 strains. In contrast, only 20% of isolates that expressed subvariants of the v.1 GNA1870 protein were susceptible to bactericidal activity of antibodies elicited by the rGNA1870 or conventional OMV vaccines. Thus, even a modest increase in GNA1870 expression in a strain that naturally is a high producer of GNA1870 results in an OMV vaccine that elicits broader protection against meningococcal disease.

  2. Genomic Analysis Reveals That Immune Function Genes Are Strongly Linked to Clinical Outcome in the North Central Cancer Treatment Group N9831 Adjuvant Trastuzumab Trial

    PubMed Central

    Perez, Edith A.; Thompson, E. Aubrey; Ballman, Karla V.; Anderson, S. Keith; Asmann, Yan W.; Kalari, Krishna R.; Eckel-Passow, Jeanette E.; Dueck, Amylou C.; Tenner, Kathleen S.; Jen, Jin; Fan, Jian-Bing; Geiger, Xochiquetzal J.; McCullough, Ann E.; Chen, Beiyun; Jenkins, Robert B.; Sledge, George W.; Winer, Eric P.; Gralow, Julie R.; Reinholz, Monica M.

    2015-01-01

    Purpose To develop a genomic signature that predicts benefit from trastuzumab in human epidermal growth factor receptor 2–positive breast cancer. Patients and Methods DASL technology was used to quantify mRNA in samples from 1,282 patients enrolled onto the Combination Chemotherapy With or Without Trastuzumab in Treating Women With Breast Cancer (North Central Cancer Treatment Group N9831 [NCCTG-N9831]) adjuvant trastuzumab trial. Cox proportional hazard ratios (HRs), adjusted for significant clinicopathologic risk factors, were used to determine the association of each gene with relapse-free survival (RFS) for 433 patients who received chemotherapy alone (arm A) and 849 patients who received chemotherapy plus trastuzumab (arms B and C). Network and pathway analyses were used to identify key biologic processes linked to RFS. The signature was built by using a voting scheme. Results Network and functional ontology analyses suggested that increased RFS was linked to a subset of immune function genes. A voting scheme model was used to define immune gene enrichment based on the expression of any nine or more of 14 immune function genes at or above the 0.40 quantile for the population. This model was used to identify immune gene–enriched tumors in arm A and arms B and C. Immune gene enrichment was linked to increased RFS in arms B and C (HR, 0.35; 95% CI, 0.22 to 0.55; P < .001), whereas arm B and C patients who did not exhibit immune gene enrichment did not benefit from trastuzumab (HR, 0.89; 95% CI, 0.62 to 1.28; P = .53). Enriched immune function gene expression as defined by our predictive signature was not associated with increased RFS in arm A (HR, 0.90; 95% CI, 0.60 to 1.37; P = .64). Conclusion Increased expression of a subset of immune function genes may provide a means of predicting benefit from adjuvant trastuzumab. PMID:25605861

  3. Full-Length Genome Sequence of Type M/emm83 Group A Streptococcus pyogenes Strain STAB1101, Isolated from Clustered Cases in Brittany.

    PubMed

    Soriano, Nicolas; Vincent, Pascal; Auger, Gabriel; Cariou, Marie-Estelle; Moullec, Séverine; Lagente, Vincent; Ygout, Jean-François; Kayal, Samer; Faili, Ahmad

    2015-01-22

    Here, we announce the complete annotated genome sequence of a Streptococcus pyogenes M/emm83 strain, STAB1101, isolated from clustered cases in homeless persons in Brittany (France). The genome is composed of 1,709,790 bp, with a G+C content of 38.4% and 1,550 identified coding sequences (CDS), and it harbors a Tn916-like transposon.

  4. Analysis of ATP6 sequence diversity in the Triticum-Aegilops species group reveals the crucial role of rearrangement in mitochondrial genome evolution.

    PubMed

    Soltani, Ali; Ghavami, Farhad; Mergoum, Mohamed; Hegstad, Justin; Noyszewski, Andrzej; Meinhardt, Steven; Kianian, Shahryar F

    2014-05-01

    Mutation and chromosomal rearrangements are the two main forces of increasing genetic diversity for natural selection to act upon, and ultimately drive the evolutionary process. Although genome evolution is a function of both forces, simultaneously, the ratio of each can be varied among different genomes and genomic regions. It is believed that in plant mitochondrial genome, rearrangements play a more important role than point mutations, but relatively few studies have directly addressed this phenomenon. To address this issue, we isolated and sequenced the ATP6-1 and ATP6-2 genes from 46 different euplasmic and alloplasmic wheat lines. Four different ATP6-1 orthologs were detected, two of them reported for the first time. Expression analysis revealed that all four orthologs are transcriptionally active. Results also indicated that both point mutation and genomic rearrangement are involved in the evolution of ATP6. However, rearrangement is the predominant force that triggers drastic variation. Data also indicated that speciation of domesticated wheat cultivars were simultaneous with the duplication of this gene. These results directly support the notion that rearrangement plays a significant role in driving plant mitochondrial genome evolution.

  5. Genome sequence of the pink to light reddish-pigmented Rubellimicrobium mesophilum type strain (DSM 19309T), a representative of the Roseobacter group isolated from soil, and emended description of the species

    PubMed Central

    Riedel, Thomas; Spring, Stefan; Fiebig, Anne; Petersen, Jörn; Göker, Markus; Klenk, Hans-Peter

    2014-01-01

    Rubellimicrobium mesophilum Dastager et al. 2008 is a mesophilic and light reddish-pigmented representative of the Roseobacter group within the alphaproteobacterial family Rhodobacteraceae. Representatives of the Roseobacter group play an important role in the marine biogeochemical cycles and were found in a broad variety of marine environments associated with algal blooms, different kinds of sediments, and surfaces of invertebrates and vertebrates. Roseobacters were shown to be widely distributed, especially within the total bacterial community found in coastal waters, as well as in mixed water layers of the open ocean. Here we describe the features of R. mesophilum strain MSL-20T together with its genome sequence and annotation generated from a culture of DSM 19309T. The 4,927,676 bp genome sequence consists of one chromosome and probably one extrachromosomal element. It contains 5,082 protein-coding genes and 56 RNA genes. As previously reported, the G+C content is significantly different from the actual genome sequence-based G+C content and as the type strain tests positively for oxidase, the species description is emended accordingly. The genome was sequenced as part of the activities of the Transregional Collaborative Research Centre 51 (TRR51) funded by the German Research Foundation (DFG). PMID:25197472

  6. Analysis of ATP6 sequence diversity in the Triticum-Aegilops group of species reveals the crucial role of rearrangement in mitochondrial genome evolution

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Mutation and chromosomal rearrangements are the two main forces of increasing genetic diversity for natural selection to act upon, and ultimately drive the evolutionary process. Although genome evolution is a function of both forces, simultaneously, the ratio of each can be varied among different ge...

  7. Draft Genome Sequences of Two Isolates of the Roseobacter Group, Sulfitobacter sp. Strains 3SOLIMAR09 and 1FIGIMAR09, from Harbors of Mallorca Island (Mediterranean Sea)

    PubMed Central

    Mas-Lladó, Maria; Piña-Villalonga, Joana Maria; Brunet-Galmés, Isabel; Nogales, Balbina

    2014-01-01

    We present the draft genome sequences of two isolates of the Roseobacter lineage, 3SOLIMAR09 and 1FIGIMAR09, which were obtained from harbors of Mallorca Island, Spain, and are affiliated with the Sulfitobacter genus. Both isolates harbor the complete gene set for protocatechuate catabolism and incomplete pathways for several additional monoaromatic compounds. PMID:24855294

  8. Whole-Genome Sequences of Four Strains Closely Related to Members of the Mycobacterium chelonae Group, Isolated from Biofilms in a Drinking Water Distribution System Simulator

    PubMed Central

    Revetta, Randy P.

    2016-01-01

    We report here the draft genome sequences of four Mycobacterium chelonae strains from biofilms subjected to a “chlorine burn” in a chloraminated drinking water distribution system simulator. These opportunistic pathogens have been detected in hospital and municipal water distribution systems, in which biofilms have been recognized as an important factor for their persistence. PMID:26798093

  9. The African Turquoise Killifish Genome Provides Insights into Evolution and Genetic Architecture of Lifespan.

    PubMed

    Valenzano, Dario Riccardo; Benayoun, Bérénice A; Singh, Param Priya; Zhang, Elisa; Etter, Paul D; Hu, Chi-Kuo; Clément-Ziza, Mathieu; Willemsen, David; Cui, Rongfeng; Harel, Itamar; Machado, Ben E; Yee, Muh-Ching; Sharp, Sabrina C; Bustamante, Carlos D; Beyer, Andreas; Johnson, Eric A; Brunet, Anne

    2015-12-01

    Lifespan is a remarkably diverse trait ranging from a few days to several hundred years in nature, but the mechanisms underlying the evolution of lifespan differences remain elusive. Here we de novo assemble a reference genome for the naturally short-lived African turquoise killifish, providing a unique resource for comparative and experimental genomics. The identification of genes under positive selection in this fish reveals potential candidates to explain its compressed lifespan. Several aging genes are under positive selection in this short-lived fish and long-lived species, raising the intriguing possibility that the same gene could underlie evolution of both compressed and extended lifespans. Comparative genomics and linkage analysis identify candidate genes associated with lifespan differences between various turquoise killifish strains. Remarkably, these genes are clustered on the sex chromosome, suggesting that short lifespan might have co-evolved with sex determination. Our study provides insights into the evolutionary forces that shape lifespan in nature.

  10. The African Turquoise Killifish Genome Provides Insights into Evolution and Genetic Architecture of Lifespan.

    PubMed

    Valenzano, Dario Riccardo; Benayoun, Bérénice A; Singh, Param Priya; Zhang, Elisa; Etter, Paul D; Hu, Chi-Kuo; Clément-Ziza, Mathieu; Willemsen, David; Cui, Rongfeng; Harel, Itamar; Machado, Ben E; Yee, Muh-Ching; Sharp, Sabrina C; Bustamante, Carlos D; Beyer, Andreas; Johnson, Eric A; Brunet, Anne

    2015-12-01

    Lifespan is a remarkably diverse trait ranging from a few days to several hundred years in nature, but the mechanisms underlying the evolution of lifespan differences remain elusive. Here we de novo assemble a reference genome for the naturally short-lived African turquoise killifish, providing a unique resource for comparative and experimental genomics. The identification of genes under positive selection in this fish reveals potential candidates to explain its compressed lifespan. Several aging genes are under positive selection in this short-lived fish and long-lived species, raising the intriguing possibility that the same gene could underlie evolution of both compressed and extended lifespans. Comparative genomics and linkage analysis identify candidate genes associated with lifespan differences between various turquoise killifish strains. Remarkably, these genes are clustered on the sex chromosome, suggesting that short lifespan might have co-evolved with sex determination. Our study provides insights into the evolutionary forces that shape lifespan in nature. PMID:26638078

  11. [Advances in insect obligate endosymbionts and their genomes--a review].

    PubMed

    Rao, Qiong; Wu, Huiming

    2014-07-01

    In nature, many insects, especially sap-feeding insects, harbor nutritional bacterial symbionts, which are called obligate endosymbionts. These bacteria co-evolved with their hosts for millions of years. Obligate endosymbionts are commonly found in specialized organs, named bacteriomes or mycetomes that consist of a number of insect's cells (bacteriocytes or mycetocytes). Obligate endosymbionts strictly maternally inherited, providing essential amino acids to the hosts, and relating to survival, reproduction and evolution of the insects. Because of enriched nutritional environment, compared to those free-living bacteria, the genomes of obligate endosymbionts have different characteristics, such as genome size, GC content, and gene deletion. Although the genomes of many insect endosymbionts have been carefully analysis, the gene functions of endosymbionts and the interactions between endosymbionts/hosts and endosymbionts remain unknown. Thus, to provide an insight into the co-evolution of endosymbionts and their hosts, further studies of endosymbionts at genetic level are required.

  12. Genome sequence of the pink–pigmented marine bacterium Loktanella hongkongensis type strain (UST950701–009PT), a representative of the Roseobacter group

    DOE PAGES

    Lau, Stanley CK; Riedel, Thomas; Fiebig, Anne; Han, James; Huntemann, Marcel; Petersen, Jörn; Ivanova, Natalia N.; Markowitz, Victor; Woyke, Tanja; Göker, Markus; et al

    2015-08-11

    Loktanella hongkongensis UST950701-009PT is a Gram-negative, non-motile and rod-shaped bacterium isolated from a marine biofilm in the subtropical seawater of Hong Kong. When growing as a monospecies biofilm on polystyrene surfaces, this bacterium is able to induce larval settlement and metamorphosis of a ubiquitous polychaete tubeworm Hydroides elegans. The inductive cues are low-molecular weight compounds bound to the exopolymeric matrix of the bacterial cells. In the present study we describe the features of L. hongkongensis strain DSM 17492T together with its genome sequence and annotation and novel aspects of its phenotype. The 3,198,444 bp long genome sequence encodes 3104 protein-codingmore » genes and 57 RNA genes. Lastly, the two unambiguously identified extrachromosomal replicons contain replication modules of the RepB and the Rhodobacteraceae-specific DnaA-like type, respectively.« less

  13. Genome sequence of the pink–pigmented marine bacterium Loktanella hongkongensis type strain (UST950701–009PT), a representative of the Roseobacter group

    SciTech Connect

    Lau, Stanley CK; Riedel, Thomas; Fiebig, Anne; Han, James; Huntemann, Marcel; Petersen, Jörn; Ivanova, Natalia N.; Markowitz, Victor; Woyke, Tanja; Göker, Markus; Kyrpides, Nikos C.; Klenk, Hans-Peter; Qian, Pei-Yuan

    2015-08-11

    Loktanella hongkongensis UST950701-009PT is a Gram-negative, non-motile and rod-shaped bacterium isolated from a marine biofilm in the subtropical seawater of Hong Kong. When growing as a monospecies biofilm on polystyrene surfaces, this bacterium is able to induce larval settlement and metamorphosis of a ubiquitous polychaete tubeworm Hydroides elegans. The inductive cues are low-molecular weight compounds bound to the exopolymeric matrix of the bacterial cells. In the present study we describe the features of L. hongkongensis strain DSM 17492T together with its genome sequence and annotation and novel aspects of its phenotype. The 3,198,444 bp long genome sequence encodes 3104 protein-coding genes and 57 RNA genes. Lastly, the two unambiguously identified extrachromosomal replicons contain replication modules of the RepB and the Rhodobacteraceae-specific DnaA-like type, respectively.

  14. BLAST screening of chlamydial genomes to identify signature proteins that are unique for the Chlamydiales, Chlamydiaceae, Chlamydophila and Chlamydia groups of species

    PubMed Central

    Griffiths, Emma; Ventresca, Michael S; Gupta, Radhey S

    2006-01-01

    Background Chlamydiae species are of much importance from a clinical viewpoint. Their diversity both in terms of their numbers as well as clinical involvement are presently believed to be significantly underestimated. The obligate intracellular nature of chlamydiae has also limited their genetic and biochemical studies. Thus, it is of importance to develop additional means for their identification and characterization. Results We have carried out analyses of available chlamydiae genomes to identify sets of unique proteins that are either specific for all Chlamydiales genomes, or different Chlamydiaceae family members, or members of the Chlamydia and Chlamydophila genera, or those unique to Protochlamydia amoebophila, but which are not found in any other bacteria. In total, 59 Chlamydiales-specific proteins, 79 Chlamydiaceae-specific proteins, 20 proteins each that are specific for both Chlamydia and Chlamydophila and 445 ORFs that are Protochlamydia-specific were identified. Additionally, 33 cases of possible gene loss or lateral gene transfer were also detected. Conclusion The identified chlamydiae-lineage specific proteins, many of which are highly conserved, provide novel biomarkers that should prove of much value in the diagnosis of these bacteria and in exploration of their prevalence and diversity. These conserved protein sequences (CPSs) also provide novel therapeutic targets for drugs that are specific for these bacteria. Lastly, functional studies on these chlamydiae or chlamydiae subgroup-specific proteins should lead to important insights into lineage-specific adaptations with regards to development, infectivity and pathogenicity. PMID:16436211

  15. Experimental RNomics and genomic comparative analysis reveal a large group of species-specific small non-message RNAs in the silkworm Bombyx mori

    PubMed Central

    Li, Dandan; Wang, Yanhong; Zhang, Kun; Jiao, Zhujin; Zhu, Xiaopeng; Skogerboe, Geir; Guo, Xiangqian; Chinnusamy, Viswanathan; Bi, Lijun; Huang, Yongping; Dong, Shuanglin; Chen, Runsheng; Kan, Yunchao

    2011-01-01

    Accumulating evidences show that small non-protein coding RNAs (ncRNAs) play important roles in development, stress response and other cellular processes. The silkworm is an important model for studies on insect genetics and control of lepidopterous pests. Here, we have performed the first systematic identification and analysis of intermediate size ncRNAs (50–500 nt) in the silkworm. We identified 189 novel ncRNAs, including 141 snoRNAs, six snRNAs, three tRNAs, one SRP and 38 unclassified ncRNAs. Forty ncRNAs showed significantly altered expression during silkworm development or across specific stage transitions. Genomic comparisons revealed that 123 of these ncRNAs are potentially silkworm-specific. Analysis of the genomic organization of the ncRNA loci showed that 32.62% of the novel snoRNA loci are intergenic, and that all the intronic snoRNAs follow the pattern of one-snoRNA-per-intron. Target site analysis predicted a total of 95 2′-O-methylation and pseudouridylation modification sites of rRNAs, snRNAs and tRNAs. Together, these findings provide new clues for future functional study of ncRNA during insect development and evolution. PMID:21227919

  16. Genomic analyses of cherry rusty mottle group and cherry twisted leaf-associated viruses reveal a possible new genus within the family betaflexiviridae.

    PubMed

    Villamor, D E V; Susaimuthu, J; Eastwell, K C

    2015-03-01

    It is demonstrated that closely related viruses within the family Betaflexiviridae are associated with a number of diseases that affect sweet cherry (Prunus avium) and other Prunus spp. Cherry rusty mottle-associated virus (CRMaV) is correlated with the appearance of cherry rusty mottle disease (CRMD), and Cherry twisted leaf-associated virus (CTLaV) is linked to cherry twisted leaf disease (CTLD) and apricot ringpox disease (ARPD). Comprehensive analysis of previously reported full genomic sequences plus those determined in this study representing isolates of CTLaV, CRMaV, Cherry green ring mottle virus, and Cherry necrotic rusty mottle virus revealed segregation of sequences into four clades corresponding to distinct virus species. High-throughput sequencing of RNA from representative source trees for CRMD, CTLD, and ARPD did not reveal additional unique virus sequences that might be associated with these diseases, thereby further substantiating the association of CRMaV and CTLaV with CRMD and CTLD or ARPD, respectively. Based on comparison of the nucleotide and amino acid sequence identity values, phylogenetic relationships with other triple-gene block-coding viruses within the family Betaflexiviridae, genome organization, and natural host range, a new genus (Robigovirus) is suggested. PMID:25496302

  17. Quantifying dynamical spillover in co-evolving multiplex networks

    NASA Astrophysics Data System (ADS)

    Vijayaraghavan, Vikram S.; Noël, Pierre-André; Maoz, Zeev; D'Souza, Raissa M.

    2015-10-01

    Multiplex networks (a system of multiple networks that have different types of links but share a common set of nodes) arise naturally in a wide spectrum of fields. Theoretical studies show that in such multiplex networks, correlated edge dynamics between the layers can have a profound effect on dynamical processes. However, how to extract the correlations from real-world systems is an outstanding challenge. Here we introduce the Multiplex Markov chain to quantify correlations in edge dynamics found in longitudinal data of multiplex networks. By comparing the results obtained from the multiplex perspective to a null model which assumes layers in a network are independent, we can identify real correlations as distinct from simultaneous changes that occur due to random chance. We use this approach on two different data sets: the network of trade and alliances between nation states, and the email and co-commit networks between developers of open source software. We establish the existence of “dynamical spillover” showing the correlated formation (or deletion) of edges of different types as the system evolves. The details of the dynamics over time provide insight into potential causal pathways.

  18. Quantifying dynamical spillover in co-evolving multiplex networks

    PubMed Central

    Vijayaraghavan, Vikram S.; Noël, Pierre-André; Maoz, Zeev; D’Souza, Raissa M.

    2015-01-01

    Multiplex networks (a system of multiple networks that have different types of links but share a common set of nodes) arise naturally in a wide spectrum of fields. Theoretical studies show that in such multiplex networks, correlated edge dynamics between the layers can have a profound effect on dynamical processes. However, how to extract the correlations from real-world systems is an outstanding challenge. Here we introduce the Multiplex Markov chain to quantify correlations in edge dynamics found in longitudinal data of multiplex networks. By comparing the results obtained from the multiplex perspective to a null model which assumes layers in a network are independent, we can identify real correlations as distinct from simultaneous changes that occur due to random chance. We use this approach on two different data sets: the network of trade and alliances between nation states, and the email and co-commit networks between developers of open source software. We establish the existence of “dynamical spillover” showing the correlated formation (or deletion) of edges of different types as the system evolves. The details of the dynamics over time provide insight into potential causal pathways. PMID:26459949

  19. The complete nucleotide sequence of the cassava (Manihot esculenta) chloroplast genome and the evolution of atpF in Malpighiales: RNA editing and multiple losses of a group II intron

    PubMed Central

    Wurdack, Kenneth J.; Kanagaraj, Anderson; Lee, Seung-Bum; Saski, Christopher; Jansen, Robert K.

    2008-01-01

    The complete sequence of the chloroplast genome of cassava (Manihot esculenta, Euphorbiaceae) has been determined. The genome is 161,453 bp in length and includes a pair of inverted repeats (IR) of 26,954 bp. The genome includes 128 genes; 96 are single copy and 16 are duplicated in the IR. There are four rRNA genes and 30 distinct tRNAs, seven of which are duplicated in the IR. The infA gene is absent; expansion of IRb has duplicated 62 amino acids at the 3′ end of rps19 and a number of coding regions have large insertions or deletions, including insertions within the 23S rRNA gene. There are 17 intron-containing genes in cassava, 15 of which have a single intron while two (clpP, ycf3) have two introns. The usually conserved atpF group II intron is absent and this is the first report of its loss from land plant chloroplast genomes. The phylogenetic distribution of the atpF intron loss was determined by a PCR survey of 251 taxa representing 34 families of Malpighiales and 16 taxa from closely related rosids. The atpF intron is not only missing in cassava but also from closely related Euphorbiaceae and other Malpighiales, suggesting that there have been at least seven independent losses. In cassava and all other sequenced Malphigiales, atpF gene sequences showed a strong association between C-to-T substitutions at nucleotide position 92 and the loss of the intron, suggesting that recombination between an edited mRNA and the atpF gene may be a possible mechanism for the intron loss. PMID:18214421

  20. Genome Data Mining and Soil Survey for the Novel Group 5 [NiFe]-Hydrogenase To Explore the Diversity and Ecological Importance of Presumptive High-Affinity H2-Oxidizing Bacteria ▿†

    PubMed Central

    Constant, Philippe; Chowdhury, Soumitra Paul; Hesse, Laura; Pratscher, Jennifer; Conrad, Ralf

    2011-01-01

    Streptomyces soil isolates exhibiting the unique ability to oxidize atmospheric H2 possess genes specifying a putative high-affinity [NiFe]-hydrogenase. This study was undertaken to explore the taxonomic diversity and the ecological importance of this novel functional group. We propose to designate the genes encoding the small and large subunits of the putative high-affinity hydrogenase hhyS and hhyL, respectively. Genome data mining revealed that the hhyL gene is unevenly distributed in the phyla Actinobacteria, Proteobacteria, Chloroflexi, and Acidobacteria. The hhyL gene sequences comprised a phylogenetically distinct group, namely, the group 5 [NiFe]-hydrogenase genes. The presumptive high-affinity H2-oxidizing bacteria constituting group 5 were shown to possess a hydrogenase gene cluster, including the genes encoding auxiliary and structural components of the enzyme and four additional open reading frames (ORFs) of unknown function. A soil survey confirmed that both high-affinity H2 oxidation activity and the hhyL gene are ubiquitous. A quantitative PCR assay revealed that soil contained 106 to 108 hhyL gene copies g (dry weight)−1. Assuming one hhyL gene copy per genome, the abundance of presumptive high-affinity H2-oxidizing bacteria was higher than the maximal population size for which maintenance energy requirements would be fully supplied through the H2 oxidation activity measured in soil. Our data indicate that the abundance of the hhyL gene should not be taken as a reliable proxy for the uptake of atmospheric H2 by soil, because high-affinity H2 oxidation is a facultatively mixotrophic metabolism, and microorganisms harboring a nonfunctional group 5 [NiFe]-hydrogenase may occur. PMID:21742924

  1. Life-history traits maintain the genomic integrity of sympatric species of the spruce budworm (Choristoneura fumiferana) group on an isolated forest island

    PubMed Central

    Lumley, Lisa M; Sperling, Felix AH

    2011-01-01

    Identification of widespread species collected from islands can be challenging due to the potential for local ecological and phenotypic divergence in isolated populations. We sought to determine how many species of the spruce budworm (Choristoneura fumiferana) complex reside in Cypress Hills, an isolated remnant coniferous forest in western Canada. We integrated data on behavior, ecology, morphology, mitochondrial DNA, and simple sequence repeats, comparing Cypress Hills populations to those from other regions of North America to determine which species they resembled most. We identified C. fumiferana, C. occidentalis, C. lambertiana, and hybrid forms in Cypress Hills. Adult flight phenology and pheromone attraction were identified as key life-history traits involved in maintaining the genomic integrity of species. Our study highlights the importance of extensive sampling of both specimens and a variety of characters for understanding species boundaries in biodiversity research. PMID:22393489

  2. Genomic medicine implementation: learning by example.

    PubMed

    Williams, Marc S

    2014-03-01

    Genomic Medicine is beginning to emerge into clinical practice. The National Human Genome Research Institute's Genomic Medicine Working Group consists of organizations that have begun to implement some aspect of genomic medicine (e.g., family history, systematic implementation of Mendelian disease program, pharmacogenomics, whole exome/genome sequencing). This article concisely reviews the working group and provides a broader context for the articles in the special issue including an assessment of anticipated provider needs and ethical, legal, and social issues relevant to the implementation of genomic medicine. The challenges of implementation of innovation in clinical practice and the potential value of genomic medicine are discussed.

  3. The UCSC Genome Browser

    PubMed Central

    Karolchik, Donna; Hinrichs, Angie S.; Kent, W. James

    2011-01-01

    The University of California Santa Cruz (UCSC) Genome Browser is a popular Web-based tool for quickly displaying a requested portion of a genome at any scale, accompanied by a series of aligned annotation “tracks.” The annotations generated by the UCSC Genome Bioinformatics Group and external collaborators include gene predictions, mRNA and expressed sequence tag alignments, simple nucleotide polymorphisms, expression and regulatory data, phenotype and variation data, and pairwise and multiple-species comparative genomics data. All information relevant to a region is presented in one window, facilitating biological analysis and interpretation. The database tables underlying the Genome Browser tracks can be viewed, downloaded, and manipulated using another Web-based application, the UCSC Table Browser. Users can upload personal datasets in a wide variety of formats as custom annotation tracks in both browsers for research or educational purposes. PMID:21975940

  4. A genome-wide identification and characterization of mircoRNAs and their targets in 'Suli' pear (Pyrus pyrifolia white pear group).

    PubMed

    Niu, Qingfeng; Qian, Minjie; Liu, Guoqin; Yang, Fengxia; Teng, Yuanwen

    2013-12-01

    MicroRNAs (miRNAs) are a class of small non-coding RNA molecules that are endogenous regulators of gene expression. miRNAs play a crucial role in cells via degradation of target mRNAs or by inhibition of target protein translation. In the present study, 186 new potentially conserved pear miRNAs belonging to 37 families were identified. The length of mature miRNAs ranged from 19 to 24 nt, and most of the miRNAs (154 out of 186) were 21 nt in length. The length of pre-miRNAs in pear was also found to vary from 62 to 282 nt with an average of 105 ± 43 nt. The potential miRNAs belonged to 29 clusters involving 20 different miRNA families. Using these potential miRNAs, we further scoured of the pear genome and found 326 potential target genes, which included transcription factors, stress responsive genes, and the genes involved in transmembrane transport and signal transduction. Gene ontology analysis of these potential targets suggested that 47 biological processes were potentially regulated by miRNAs, including oxidation-reduction, stress response, transport, etc. KEGG pathway analysis showed that the identified miRNAs were found in 15 metabolism networks which were related to starch and sucrose metabolism, and ascorbate and aldarate metabolism, among others. Our study will help in the further understanding of the essential role of miRNAs in growth and development and stress response of pear.

  5. Genomic diversity and affinities in population groups of North West India: an analysis of Alu insertion and a single nucleotide polymorphism.

    PubMed

    Saini, J S; Kumar, A; Matharoo, K; Sokhi, J; Badaruddoza; Bhanwer, A J S

    2012-12-15

    The North West region of India is extremely important to understand the peopling of India, as it acted as a corridor to the foreign invaders from Eurasia and Central Asia. A series of these invasions along with multiple migrations led to intermixture of variable populations, strongly contributing to genetic variations. The present investigation was designed to explore the genetic diversities and affinities among the five major ethnic groups from North West India; Brahmin, Jat Sikh, Bania, Rajput and Gujjar. A total of 327 individuals of the abovementioned ethnic groups were analyzed for 4 Alu insertion marker loci (ACE, PV92, APO and D1) and a Single Nucleotide Polymorphism (SNP) rs2234693 in the intronic region of the ESR1 gene. Statistical analysis was performed to interpret the genetic structure and diversity of the population groups. Genotypes for ACE, APO, ESR1 and PV92 loci were found to be in Hardy-Weinberg equilibrium in all the ethnic groups, while significant departures were observed at the D1 locus in every investigated population after Bonferroni's correction. The average heterozygosity for all the loci in these ethnic groups was fairly substantial ranging from 0.3927 ± 0.1877 to 0.4333 ± 0.1416. Inbreeding coefficient indicated an overall 10% decrease in heterozygosity in these North West Indian populations. The gene differentiation among the populations was observed to be of the order of 0.013. Genetic distance estimates revealed that Gujjars were close to Banias and Jat Sikhs were close to Rajputs. Overall the study favored the recent division of the populations of North West India into largely endogamous groups. It was observed that the populations of North West India represent a more or less homogenous genetic entity, owing to their common ancestral history as well as geographical proximity.

  6. Comparison of the prevalence and genomic characteristics of Clostridium difficile isolated from various production groups in a vertically integrated swine operation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The objective of this study was to compare the prevalence of Clostridium difficile among different age and production groups of swine in a vertically integrated swine operation in Texas in 2006. Isolation of C. difficile was performed utilizing an enrichment technique and restrictive media. Prelim...

  7. Single-Cell Genome and Group-Specific dsrAB Sequencing Implicate Marine Members of the Class Dehalococcoidia (Phylum Chloroflexi) in Sulfur Cycling

    PubMed Central

    Cooper, Myriel; Schreiber, Lars; Lloyd, Karen G.; Baker, Brett J.; Petersen, Dorthe G.; Jørgensen, Bo Barker; Stepanauskas, Ramunas; Reinhardt, Richard; Schramm, Andreas; Loy, Alexander; Adrian, Lorenz

    2016-01-01

    ABSTRACT The marine subsurface sediment biosphere is widely inhabited by bacteria affiliated with the class Dehalococcoidia (DEH), phylum Chloroflexi, and yet little is known regarding their metabolisms. In this report, genomic content from a single DEH cell (DEH-C11) with a 16S rRNA gene that was affiliated with a diverse cluster of 16S rRNA gene sequences prevalent in marine sediments was obtained from sediments of Aarhus Bay, Denmark. The distinctive gene content of this cell suggests metabolic characteristics that differ from those of known DEH and Chloroflexi. The presence of genes encoding dissimilatory sulfite reductase (Dsr) suggests that DEH could respire oxidized sulfur compounds, although Chloroflexi have never been implicated in this mode of sulfur cycling. Using long-range PCR assays targeting DEH dsr loci, dsrAB genes were amplified and sequenced from various marine sediments. Many of the amplified dsrAB sequences were affiliated with the DEH Dsr clade, which we propose equates to a family-level clade. This provides supporting evidence for the potential for sulfite reduction by diverse DEH species. DEH-C11 also harbored genes encoding reductases for arsenate, dimethyl sulfoxide, and halogenated organics. The reductive dehalogenase homolog (RdhA) forms a monophyletic clade along with RdhA sequences from various DEH-derived contigs retrieved from available metagenomes. Multiple facts indicate that this RdhA may not be a terminal reductase. The presence of other genes indicated that nutrients and energy may be derived from the oxidation of substituted homocyclic and heterocyclic aromatic compounds. Together, these results suggest that marine DEH play a previously unrecognized role in sulfur cycling and reveal the potential for expanded catabolic and respiratory functions among subsurface DEH. PMID:27143384

  8. The UCSC Genome Browser

    PubMed Central

    Karolchik, Donna; Hinrichs, Angie S.; Kent, W. James

    2009-01-01

    The University of California Santa Cruz (UCSC) Genome Browser (genome.ucsc.edu) is a popular Web-based tool for quickly displaying a requested portion of a genome at any scale, accompanied by a series of aligned annotation “tracks”. The annotations—generated by the UCSC Genome Bioinformatics Group and external collaborators—display gene predictions, mRNA and expressed sequence tag alignments, simple nucleotide polymorphisms, expression and regulatory data, phenotype and variation data, and pairwise and multiple-species comparative genomics data. All information relevant to a region is presented in one window, facilitating biological analysis and interpretation. The database tables underlying the Genome Browser tracks can be viewed, downloaded, and manipulated using another Web-based application, the UCSC Table Browser. Users can upload data as custom annotation tracks in both browsers for research or educational use. This unit describes how to use the Genome Browser and Table Browser for genome analysis, download the underlying database tables, and create and display custom annotation tracks. PMID:19957273

  9. Comparative genomics of Brassicaceae crops.

    PubMed

    Sharma, Ashutosh; Li, Xiaonan; Lim, Yong Pyo

    2014-05-01

    The family Brassicaceae is one of the major groups of the plant kingdom and comprises diverse species of great economic, agronomic and scientific importance, including the model plant Arabidopsis. The sequencing of the Arabidopsis genome has revolutionized our knowledge in the field of plant biology and provides a foundation in genomics and comparative biology. Genomic resources have been utilized in Brassica for diversity analyses, construction of genetic maps and identification of agronomic traits. In Brassicaceae, comparative sequence analysis across the species has been utilized to understand genome structure, evolution and the detection of conserved genomic segments. In this review, we focus on the progress made in genetic resource development, genome sequencing and comparative mapping in Brassica and related species. The utilization of genomic resources and next-generation sequencing approaches in improvement of Brassica crops is also discussed. PMID:24987286

  10. 76 FR 28056 - National Human Genome Research Institute; Notice of Closed Meetings

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-05-13

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group, Genome Research Review... Scientific Review, National Human Genome Research Institute, National Institutes of Health, Bethesda,...

  11. 76 FR 58023 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-09-19

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... Review, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892,...

  12. 77 FR 28888 - National Human Genome Research Institute Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-05-16

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... applications. Place: National Human Genome Research Institute, 3635 Fishers Lane, Suite 4076, ] Rockville,...

  13. 78 FR 64222 - National Human Genome Research Institute; Notice of Closed Meetings

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-10-28

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review..., Ph.D., Scientific Review Officer, Office of Scientific Review, National Human Genome...

  14. The UCSC Genome Browser.

    PubMed

    Karolchik, Donna; Hinrichs, Angie S; Kent, W James

    2012-12-01

    The University of California Santa Cruz (UCSC) Genome Browser is a popular Web-based tool for quickly displaying a requested portion of a genome at any scale, accompanied by a series of aligned annotation "tracks." The annotations generated by the UCSC Genome Bioinformatics Group and external collaborators include gene predictions, mRNA and expressed sequence tag alignments, simple nucleotide polymorphisms, expression and regulatory data, phenotype and variation data, and pairwise and multiple-species comparative genomics data. All information relevant to a region is presented in one window, facilitating biological analysis and interpretation. The database tables underlying the Genome Browser tracks can be viewed, downloaded, and manipulated using another Web-based application, the UCSC Table Browser. Users can upload personal datasets in a wide variety of formats as custom annotation tracks in both browsers for research or educational purposes. This unit describes how to use the Genome Browser and Table Browser for genome analysis, download the underlying database tables, and create and display custom annotation tracks.

  15. Complete genomic sequence analyses of the first group A giraffe rotavirus reveals close evolutionary relationship with rotaviruses infecting other members of the Artiodactyla.

    PubMed

    O'Shea, Helen; Mulherin, Emily; Matthijnssens, Jelle; McCusker, Matthew P; Collins, P J; Cashman, Olivia; Gunn, Lynda; Beltman, Marijke E; Fanning, Séamus

    2014-05-14

    Group A Rotaviruses (RVA) have been established as significant contributory agents of acute gastroenteritis in young children and many animal species. In 2008, we described the first RVA strain detected in a giraffe calf (RVA/Giraffe-wt/IRL/GirRV/2008/G10P[11]), presenting with acute diarrhoea. Molecular characterisation of the VP7 and VP4 genes revealed the bovine-like genotypes G10 and P[11], respectively. To further investigate the origin of this giraffe RVA strain, the 9 remaining gene segments were sequenced and analysed, revealing the following genotype constellation: G10-P[11]-I2-R2-C2-M2-A3-N2-T6-E2-H3. This genotype constellation is very similar to RVA strains isolated from cattle or other members of the artiodactyls. Phylogenetic analyses confirmed the close relationship between GirRV and RVA strains with a bovine-like genotype constellation detected from several host species, including humans. These results suggest that RVA strain GirRV was the result of an interspecies transmission from a bovine host to the giraffe calf. However, we cannot rule out completely that this bovine-like RVA genotype constellation may be enzootic in giraffes. Future RVA surveillance in giraffes may answer this intriguing question.

  16. Complete genomic sequence analyses of the first group A giraffe rotavirus reveals close evolutionary relationship with rotaviruses infecting other members of the Artiodactyla.

    PubMed

    O'Shea, Helen; Mulherin, Emily; Matthijnssens, Jelle; McCusker, Matthew P; Collins, P J; Cashman, Olivia; Gunn, Lynda; Beltman, Marijke E; Fanning, Séamus

    2014-05-14

    Group A Rotaviruses (RVA) have been established as significant contributory agents of acute gastroenteritis in young children and many animal species. In 2008, we described the first RVA strain detected in a giraffe calf (RVA/Giraffe-wt/IRL/GirRV/2008/G10P[11]), presenting with acute diarrhoea. Molecular characterisation of the VP7 and VP4 genes revealed the bovine-like genotypes G10 and P[11], respectively. To further investigate the origin of this giraffe RVA strain, the 9 remaining gene segments were sequenced and analysed, revealing the following genotype constellation: G10-P[11]-I2-R2-C2-M2-A3-N2-T6-E2-H3. This genotype constellation is very similar to RVA strains isolated from cattle or other members of the artiodactyls. Phylogenetic analyses confirmed the close relationship between GirRV and RVA strains with a bovine-like genotype constellation detected from several host species, including humans. These results suggest that RVA strain GirRV was the result of an interspecies transmission from a bovine host to the giraffe calf. However, we cannot rule out completely that this bovine-like RVA genotype constellation may be enzootic in giraffes. Future RVA surveillance in giraffes may answer this intriguing question. PMID:24582453

  17. Comparative mitochondrial genomics in zygomycetes: bacteria-like RNase P RNAs, mobile elements and a close source of the group I intron invasion in angiosperms

    PubMed Central

    Seif, Elias; Leigh, Jessica; Liu, Yu; Roewer, Ingeborg; Forget, Lise; Lang, B. Franz

    2005-01-01

    To generate data for comparative analyses of zygomycete mitochondrial gene expression, we sequenced mtDNAs of three distantly related zygomycetes, Rhizopus oryzae, Mortierella verticillata and Smittium culisetae. They all contain the standard fungal mitochondrial gene set, plus rnpB, the gene encoding the RNA subunit of the mitochondrial RNase P (mtP-RNA) and rps3, encoding ribosomal protein S3 (the latter lacking in R.oryzae). The mtP-RNAs of R.oryzae and of additional zygomycete relatives have the most eubacteria-like RNA structures among fungi. Precise mapping of the 5′ and 3′ termini of the R.oryzae and M.verticillata mtP-RNAs confirms their expression and processing at the exact sites predicted by secondary structure modeling. The 3′ RNA processing of zygomycete mitochondrial mRNAs, SSU-rRNA and mtP-RNA occurs at the C-rich sequence motifs similar to those identified in fission yeast and basidiomycete mtDNAs. The C-rich motifs are included in the mature transcripts, and are likely generated by exonucleolytic trimming of RNA 3′ termini. Zygomycete mtDNAs feature a variety of insertion elements: (i) mtDNAs of R.oryzae and M.verticillata were subject to invasions by double hairpin elements; (ii) genes of all three species contain numerous mobile group I introns, including one that is closest to an intron that invaded angiosperm mtDNAs; and (iii) at least one additional case of a mobile element, characterized by a homing endonuclease insertion between partially duplicated genes [Paquin,B., Laforest,M.J., Forget,L., Roewer,I., Wang,Z., Longcore,J. and Lang,B.F. (1997) Curr. Genet., 31, 380–395]. The combined mtDNA-encoded proteins contain insufficient phylogenetic signal to demonstrate monophyly of zygomycetes. PMID:15689432

  18. Widespread Recurrent Evolution of Genomic Features

    PubMed Central

    Maeso, Ignacio; Roy, Scott William; Irimia, Manuel

    2012-01-01

    The recent explosion of genome sequences from all major phylogenetic groups has unveiled an unexpected wealth of cases of recurrent evolution of strikingly similar genomic features in different lineages. Here, we review the diverse known types of recurrent evolution in eukaryotic genomes, with a special focus on metazoans, ranging from reductive genome evolution to origins of splice-leader trans-splicing, from tandem exon duplications to gene family expansions. We first propose a general classification scheme for evolutionary recurrence at the genomic level, based on the type of driving force—mutation or selection—and the environmental and genomic circumstances underlying these forces. We then discuss various cases of recurrent genomic evolution under this scheme. Finally, we provide a broader context for repeated genomic evolution, including the unique relationship of genomic recurrence with the genotype–phenotype map, and the ways in which the study of recurrent genomic evolution can be used to understand fundamental evolutionary processes. PMID:22417916

  19. Long-term study of an infection with ranaviruses in a group of edible frogs (Pelophylax kl. esculentus) and partial characterization of two viruses based on four genomic regions.

    PubMed

    Stöhr, Anke C; Hoffmann, Alexandra; Papp, Tibor; Robert, Nadia; Pruvost, Nicolas B M; Reyer, Heinz-Ulrich; Marschang, Rachel E

    2013-08-01

    Several edible frogs (Pelophylax kl. esculentus) collected into a single group from various ponds in Europe died suddenly with reddening of the skin (legs, abdomen) and haemorrhages in the gastrointestinal tract. Ranavirus was detected in some of the dead frogs using PCR, and virus was also isolated in cell culture. Over the following 3 years, another two outbreaks occurred with low to high mortality in between asymptomatic periods. In the first 2 years, the same ranavirus was detected repeatedly, but a new ranavirus was isolated in association with the second mass-mortality event. The two different ranaviruses were characterized based on nucleotide sequences from four genomic regions, namely, major capsid protein, DNA polymerase, ribonucleoside diphosphate reductase alpha and beta subunit genes. The sequences showed slight variations to each other or GenBank entries and both clustered to the Rana esculenta virus (REV-like) clade in the phylogenetic analysis. Furthermore, a quiescent infection was demonstrated in two individuals. By comparing samples taken before and after transport and caging in groups it was possible to identify the pond of origin and a ranavirus was detected for the first time in wild amphibians in Germany.

  20. Genomics of Volvocine Algae

    PubMed Central

    Umen, James G.; Olson, Bradley J.S.C.

    2015-01-01

    Volvocine algae are a group of chlorophytes that together comprise a unique model for evolutionary and developmental biology. The species Chlamydomonas reinhardtii and Volvox carteri represent extremes in morphological diversity within the Volvocine clade. Chlamydomonas is unicellular and reflects the ancestral state of the group, while Volvox is multicellular and has evolved numerous innovations including germ-soma differentiation, sexual dimorphism, and complex morphogenetic patterning. The Chlamydomonas genome sequence has shed light on several areas of eukaryotic cell biology, metabolism and evolution, while the Volvox genome sequence has enabled a comparison with Chlamydomonas that reveals some of the underlying changes that enabled its transition to multicellularity, but also underscores the subtlety of this transition. Many of the tools and resources are in place to further develop Volvocine algae as a model for evolutionary genomics. PMID:25883411

  1. Genome-to-Watershed Predictive Understanding of Terrestrial Environments

    NASA Astrophysics Data System (ADS)

    Hubbard, S. S.; Agarwal, D.; Banfield, J. F.; Beller, H. R.; Brodie, E.; Long, P.; Nico, P. S.; Steefel, C. I.; Tokunaga, T. K.; Williams, K. H.

    2014-12-01

    Although terrestrial environments play a critical role in cycling water, greenhouse gasses, and other life-critical elements, the complexity of interactions among component microbes, plants, minerals, migrating fluids and dissolved constituents hinders predictive understanding of system behavior. The 'Sustainable Systems 2.0' project is developing genome-to-watershed scale predictive capabilities to quantify how the microbiome affects biogeochemical watershed functioning, how watershed-scale hydro-biogeochemical processes affect microbial functioning, and how these interactions co-evolve with climate and land-use changes. Development of such predictive capabilities is critical for guiding the optimal management of water resources, contaminant remediation, carbon stabilization, and agricultural sustainability - now and with global change. Initial investigations are focused on floodplains in the Colorado River Basin, and include iterative model development, experiments and observations with an early emphasis on subsurface aspects. Field experiments include local-scale experiments at Rifle CO to quantify spatiotemporal metabolic and geochemical responses to O2and nitrate amendments as well as floodplain-scale monitoring to quantify genomic and biogeochemical response to natural hydrological perturbations. Information obtained from such experiments are represented within GEWaSC, a Genome-Enabled Watershed Simulation Capability, which is being developed to allow mechanistic interrogation of how genomic information stored in a subsurface microbiome affects biogeochemical cycling. This presentation will describe the genome-to-watershed scale approach as well as early highlights associated with the project. Highlights include: first insights into the diversity of the subsurface microbiome and metabolic roles of organisms involved in subsurface nitrogen, sulfur and hydrogen and carbon cycling; the extreme variability of subsurface DOC and hydrological controls on carbon and

  2. Aquaculture Genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The genomics chapter covers the basics of genome mapping and sequencing and the current status of several relevant species. The chapter briefly describes the development and use of (cDNA, BAC, etc.) libraries for mapping and obtaining specific sequence information. Other topics include comparative ...

  3. Human Genome Annotation

    NASA Astrophysics Data System (ADS)

    Gerstein, Mark

    A central problem for 21st century science is annotating the human genome and making this annotation useful for the interpretation of personal genomes. My talk will focus on annotating the 99% of the genome that does not code for canonical genes, concentrating on intergenic features such as structural variants (SVs), pseudogenes (protein fossils), binding sites, and novel transcribed RNAs (ncRNAs). In particular, I will describe how we identify regulatory sites and variable blocks (SVs) based on processing next-generation sequencing experiments. I will further explain how we cluster together groups of sites to create larger annotations. Next, I will discuss a comprehensive pseudogene identification pipeline, which has enabled us to identify >10K pseudogenes in the genome and analyze their distribution with respect to age, protein family, and chromosomal location. Throughout, I will try to introduce some of the computational algorithms and approaches that are required for genome annotation. Much of this work has been carried out in the framework of the ENCODE, modENCODE, and 1000 genomes projects.

  4. Human Social Genomics

    PubMed Central

    Cole, Steven W.

    2014-01-01

    A growing literature in human social genomics has begun to analyze how everyday life circumstances influence human gene expression. Social-environmental conditions such as urbanity, low socioeconomic status, social isolation, social threat, and low or unstable social status have been found to associate with differential expression of hundreds of gene transcripts in leukocytes and diseased tissues such as metastatic cancers. In leukocytes, diverse types of social adversity evoke a common conserved transcriptional response to adversity (CTRA) characterized by increased expression of proinflammatory genes and decreased expression of genes involved in innate antiviral responses and antibody synthesis. Mechanistic analyses have mapped the neural “social signal transduction” pathways that stimulate CTRA gene expression in response to social threat and may contribute to social gradients in health. Research has also begun to analyze the functional genomics of optimal health and thriving. Two emerging opportunities now stand to revolutionize our understanding of the everyday life of the human genome: network genomics analyses examining how systems-level capabilities emerge from groups of individual socially sensitive genomes and near-real-time transcriptional biofeedback to empirically optimize individual well-being in the context of the unique genetic, geographic, historical, developmental, and social contexts that jointly shape the transcriptional realization of our innate human genomic potential for thriving. PMID:25166010

  5. Ebolavirus comparative genomics

    DOE PAGES

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; et al

    2015-07-14

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. We examine the dynamics of this genome, comparing more than one hundred currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus, and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of themore » same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP), and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. In conclusion, this information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.« less

  6. THE PHYLOGENY AND GENOME OF TRICHINELLA SPECIES

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In 2004, funding was received by Washington University’s Genome Sequencing Center through NHGRI, to completely sequence several nematode genomes as part of a holistic effort to advance our understanding of the human genome. Trichinella spiralis was among this group because of its strategic ...

  7. Evolution of recombination and genome structure in eusocial insects.

    PubMed

    Kent, Clement F; Zayed, Amro

    2013-03-01

    Eusocial Hymenoptera, such as the European honey bee, Apis mellifera, have the highest recombination rates of multicellular animals.(1) Recently, we showed(2) that a side-effect of recombination in the honey bee, GC biased gene conversion (bGC), helps maintain the unusual bimodal GC-content distribution of the bee genome by increasing GC-content in high recombination areas while low recombination areas are losing GC-content because of biased AT mutations and low rates of bGC. Although the very high recombination rate of A. mellifera makes GC-content evolution easier to study, the pattern is consistent with results found in many other species including mammals and yeast.(3) Also consistent across phyla is the association of higher genetic diversity and divergence with high GC and high recombination areas.(4) (,) (5) Finally, we showed that genes overexpressed in the brains of workers cluster in GC-rich genomic areas with the highest rates of recombination and molecular evolution.(2) In this Addendum we present a conceptual model of how eusociality and high recombination rates may co-evolve.

  8. International Summer School, ‘ From Genome to Life’

    PubMed Central

    2002-01-01

    This report from the International Summer School ‘From Genome to Life’, held at the Institute d'Etudes Scientifiques de Cargèse in Corsica in July 2002, covers the talks of the invited speakers. The topics of the talks can be broadly grouped into the areas of genome annotation, comparative and evolutionary genomics, functional genomics, proteomics, structural genomics, pharmacogenomics, and organelle genomes, epigenetics and RNA. PMID:18629253

  9. Antarctic Genomics

    PubMed Central

    Clarke, Andrew; Cockell, Charles S.; Convey, Peter; Detrich III, H. William; Fraser, Keiron P. P.; Johnston, Ian A.; Methe, Barbara A.; Murray, Alison E.; Peck, Lloyd S.; Römisch, Karin; Rogers, Alex D.

    2004-01-01

    With the development of genomic science and its battery of technologies, polar biology stands on the threshold of a revolution, one that will enable the investigation of important questions of unprecedented scope and with extraordinary depth and precision. The exotic organisms of polar ecosystems are ideal candidates for genomic analysis. Through such analyses, it will be possible to learn not only the novel features that enable polar organisms to survive, and indeed thrive, in their extreme environments, but also fundamental biological principles that are common to most, if not all, organisms. This article aims to review recent developments in Antarctic genomics and to demonstrate the global context of such studies. PMID:18629155

  10. Correlation between the genomic o454-nlpD region polymorphisms, virulence gene equipment and phylogenetic group of extraintestinal Escherichia coli (ExPEC) enables pathotyping irrespective of host, disease and source of isolation

    PubMed Central

    2014-01-01

    Background The mutS-rpoS intergenic region in E. coli displays a mosaic structure which revealed pathotype specific patterns. To assess the importance of this region as a surrogate marker for the identification of highly virulent extraintestinal pathogenic E. coli (ExPEC) strains we aimed to: (i) characterize the genetic diversity of the mutS gene and the o454-nlpD genomic region among 510 E. coli strains from animals and humans; (ii) delineate associations between the polymorphism of this region and features such as phylogenetic background of E. coli, pathotype, host species, clinical condition, serogroup and virulence associated genes (VAG)s; and (iii) identify the most important VAGs for classification of the o454-nlpD region. Methods Size variation in the o454-nlpD region was investigated by PCR amplification and sequencing. Phylogenetic relationships were assessed by Ecor- and Multilocus sequence- typing (MLST), and a comparative analysis between mutS gene phylogenetic tree obtained with RAxML and the MLST grouping method was performed. Correlation between o454-nlpD patterns and the features described above were analysed. In addition, the importance of 47 PCR-amplified ExPEC-related VAGs for classification of o454-nlpD patterns was investigated by means of Random Forest algorithm. Results Four main structures (patterns I-IV) of the o454-nlpD region among ExPEC and commensal E. coli strains were identified. Statistical analysis showed a positive and exclusive association between pattern III and the ExPEC strains. A strong association between pattern III and either the Ecor group B2 or the sequence type complexes known to represent the phylogenetic background of highly virulent ExPEC strains (such as STC95, STC73 and STC131) was found as well. RF analyses determined five genes (csgA, malX, chuA, sit, and vat) to be suitable to predict pattern III strains. Conclusion The significant association between pattern III and group B2 strains suggested the o454-nlp

  11. Theory of prokaryotic genome evolution

    PubMed Central

    Sela, Itamar; Wolf, Yuri I.; Koonin, Eugene V.

    2016-01-01

    Bacteria and archaea typically possess small genomes that are tightly packed with protein-coding genes. The compactness of prokaryotic genomes is commonly perceived as evidence of adaptive genome streamlining caused by strong purifying selection in large microbial populations. In such populations, even the small cost incurred by nonfunctional DNA because of extra energy and time expenditure is thought to be sufficient for this extra genetic material to be eliminated by selection. However, contrary to the predictions of this model, there exists a consistent, positive correlation between the strength of selection at the protein sequence level, measured as the ratio of nonsynonymous to synonymous substitution rates, and microbial genome size. Here, by fitting the genome size distributions in multiple groups of prokaryotes to predictions of mathematical models of population evolution, we show that only models in which acquisition of additional genes is, on average, slightly beneficial yield a good fit to genomic data. These results suggest that the number of genes in prokaryotic genomes reflects the equilibrium between the benefit of additional genes that diminishes as the genome grows and deletion bias (i.e., the rate of deletion of genetic material being slightly greater than the rate of acquisition). Thus, new genes acquired by microbial genomes, on average, appear to be adaptive. The tight spacing of protein-coding genes likely results from a combination of the deletion bias and purifying selection that efficiently eliminates nonfunctional, noncoding sequences. PMID:27702904

  12. u-Genome: a database on genome design in unicellular genomes.

    PubMed

    Sakharkar, Kishore Ramaji; Chaturvedi, Iti; Chow, Vincent T K; Kwoh, Chee Keong; Kangueane, Pandjassarame; Sakharkar, Meena Kishore

    2005-01-01

    Unicellular eukaryotes were among the first ones to be selected for complete genome sequencing because of the small size of their genomes and their interactions with humans and a broad range of animals and plants. Currently, ten completely sequenced unicellular genome sequences have been publicly released and as the number of available unicellular genomes increases, comparative genomics analysis within this group of organisms becomes more and more instructive. However, such an analysis is difficult to carry out without a suitable platform gathering not only the original annotations but also relevant information available in public databases or obtained by applying common bioinformatics methods. With the aim of solving these difficulties, we have developed a web-accessible database named u-Genome, the unicellular genome design database. The database is unique in featuring three datasets namely (1) orthologous proteins (2) paralogous proteins and (3) statistical distributions on exons, introns, intergenic DNA and correlations between them. A tool, Uniview, designed to visualize the gene structures for individual genes in the genome is also integrated. This database is of importance in understanding unicellular genome design and architecture and evolution related studies. The database is available through a web interface at http://sege.ntu.edu.sg/wester/ugenome.

  13. Imaging genomics

    PubMed Central

    Thompson, Paul M.; Martin, Nicholas G.; Wright, Margaret J.

    2010-01-01

    Purpose of review Imaging genomics is an emerging field that is rapidly identifying genes that influence the brain, cognition, and risk for disease. Worldwide, thousands of individuals are being scanned with high-throughput genotyping (genome-wide scans), and new imaging techniques [high angular resolution diffusion imaging and resting state functional magnetic resonance imaging (MRI)] that provide fine-grained measures of the brain’s structural and functional connectivity. Along with clinical diagnosis and cognitive testing, brain imaging offers highly reproducible measures that can be subjected to genetic analysis. Recent findings Recent studies of twin, pedigree, and population-based datasets have discovered several candidate genes that consistently show small to moderate effects on brain measures. Many studies measure single phenotypes from the images, such as hippocampal volume, but voxel-wise genomic methods can plot the profile of genetic association at each 3D point in the brain. This exploits the full arsenal of imaging statistics to discover and replicate gene effects. Summary Imaging genomics efforts worldwide are now working together to discover and replicate many promising leads. By studying brain phenotypes closer to causative gene action, larger gene effects are detectable with realistic sample sizes obtainable from meta-analysis of smaller studies. Imaging genomics has broad applications to dementia, mental illness, and public health. PMID:20581684

  14. Genome databases

    SciTech Connect

    Courteau, J.

    1991-10-11

    Since the Genome Project began several years ago, a plethora of databases have been developed or are in the works. They range from the massive Genome Data Base at Johns Hopkins University, the central repository of all gene mapping information, to small databases focusing on single chromosomes or organisms. Some are publicly available, others are essentially private electronic lab notebooks. Still others limit access to a consortium of researchers working on, say, a single human chromosome. An increasing number incorporate sophisticated search and analytical software, while others operate as little more than data lists. In consultation with numerous experts in the field, a list has been compiled of some key genome-related databases. The list was not limited to map and sequence databases but also included the tools investigators use to interpret and elucidate genetic data, such as protein sequence and protein structure databases. Because a major goal of the Genome Project is to map and sequence the genomes of several experimental animals, including E. coli, yeast, fruit fly, nematode, and mouse, the available databases for those organisms are listed as well. The author also includes several databases that are still under development - including some ambitious efforts that go beyond data compilation to create what are being called electronic research communities, enabling many users, rather than just one or a few curators, to add or edit the data and tag it as raw or confirmed.

  15. Listeria Genomics

    NASA Astrophysics Data System (ADS)

    Cabanes, Didier; Sousa, Sandra; Cossart, Pascale

    The opportunistic intracellular foodborne pathogen Listeria monocytogenes has become a paradigm for the study of host-pathogen interactions and bacterial adaptation to mammalian hosts. Analysis of L. monocytogenes infection has provided considerable insight into how bacteria invade cells, move intracellularly, and disseminate in tissues, as well as tools to address fundamental processes in cell biology. Moreover, the vast amount of knowledge that has been gathered through in-depth comparative genomic analyses and in vivo studies makes L. monocytogenes one of the most well-studied bacterial pathogens. This chapter provides an overview of progress in the exploration of genomic, transcriptomic, and proteomic data in Listeria spp. to understand genome evolution and diversity, as well as physiological aspects of metabolism used by bacteria when growing in diverse environments, in particular in infected hosts.

  16. Genome Informatics

    PubMed Central

    Winslow, Raimond L.; Boguski, Mark S.

    2005-01-01

    This article reviews recent advances in genomics and informatics relevant to cardiovascular research. In particular, we review the status of (1) whole genome sequencing efforts in human, mouse, rat, zebrafish, and dog; (2) the development of data mining and analysis tools; (3) the launching of the National Heart, Lung, and Blood Institute Programs for Genomics Applications and Proteomics Initiative; (4) efforts to characterize the cardiac transcriptome and proteome; and (5) the current status of computational modeling of the cardiac myocyte. In each instance, we provide links to relevant sources of information on the World Wide Web and critical appraisals of the promises and the challenges of an expanding and diverse information landscape. PMID:12750305

  17. Teaching Residents Genomic Pathology: A Novel Approach for New Technology

    PubMed Central

    Haspel, Richard L.

    2013-01-01

    Genomics-based diagnostics have become part of patient care. As pathologists have the expertise in clinical laboratory testing as well as access to patient samples, all genomic medicine is genomic pathology. This article will review the evidence that there is a critical need for pathology resident training in genomics. Several individual program curricula are described as well as the progress of the Training Residents in Genomic (TRIG) Working Group. This group has made significant advances towards developing, implementing and evaluating a national curriculum in genomics for pathology residents. The novel approach of the TRIG Working Group can be used as a model for training pathology professionals in any new technology. PMID:23399798

  18. Inter- and intra-genomic homology of the Brassica genomes: implications for their origin and evolution.

    PubMed

    Truco, M J; Hu, J; Sadowski, J; Quiros, C F

    1996-12-01

    In order to determine the homologous regions shared by the cultivated Brassica genomes, linkage maps of the diploid cultivated B. rapa (A genome, n = 10), B. nigra (B genome, n = 8) and B. oleracea (C genome, n = 9), were compared. We found intergenomic conserved regions but with extensitve reordering among the genomes. Eighteen linkage groups from all three species could be associated on the basis of homologous segments based on at least three common markers. Intragenomic homologous conservation was also observed for some of the chromosomes of the A, B and C genomes. A possible chromosome phylogenetic pathway based on an ancestral genome of at least five, and no more than seven chromosomes, was drawn from the chromosomal inter-relationships observed. These results demonstrate that extensive duplication and rearrangement have been involved in the formation of the Brassica genomes from a smaller ancestral genome.

  19. Ebolavirus comparative genomics

    PubMed Central

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S.; Pedersen, Thomas D.; Wassenaar, Trudy M.; Ussery, David W.

    2015-01-01

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP) and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. This information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies. This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). PMID:26175035

  20. Jumbled genomes: missing Apicomplexan synteny.

    PubMed

    DeBarry, Jeremy D; Kissinger, Jessica C

    2011-10-01

    Whole-genome comparisons provide insight into genome evolution by informing on gene repertoires, gene gains/losses, and genome organization. Most of our knowledge about eukaryotic genome evolution is derived from studies of multicellular model organisms. The eukaryotic phylum Apicomplexa contains obligate intracellular protist parasites responsible for a wide range of human and veterinary diseases (e.g., malaria, toxoplasmosis, and theileriosis). We have developed an in silico protein-encoding gene based pipeline to investigate synteny across 12 apicomplexan species from six genera. Genome rearrangement between lineages is extensive. Syntenic regions (conserved gene content and order) are rare between lineages and appear to be totally absent across the phylum, with no group of three genes found on the same chromosome and in the same order within 25 kb up- and downstream of any orthologous genes. Conserved synteny between major lineages is limited to small regions in Plasmodium and Theileria/Babesia species, and within these conserved regions, there are a number of proteins putatively targeted to organelles. The observed overall lack of synteny is surprising considering the divergence times and the apparent absence of transposable elements (TEs) within any of the species examined. TEs are ubiquitous in all other groups of eukaryotes studied to date and have been shown to be involved in genomic rearrangements. It appears that there are different criteria governing genome evolution within the Apicomplexa relative to other well-studied unicellular and multicellular eukaryotes. PMID:21504890

  1. Biobanks for Genomics and Genomics for Biobanks

    PubMed Central

    Ducournau, Pascal; Gourraud, Pierre-Antoine; Pontille, David

    2003-01-01

    Biobanks include biological samples and attached databases. Human biobanks occur in research, technological development and medical activities. Population genomics is highly dependent on the availability of large biobanks. Ethical issues must be considered: protecting the rights of those people whose samples or data are in biobanks (information, autonomy, confidentiality, protection of private life), assuring the non-commercial use of human body elements and the optimal use of samples and data. They balance other issues, such as protecting the rights of researchers and companies, allowing long-term use of biobanks while detailed information on future uses is not available. At the level of populations, the traditional form of informed consent is challenged. Other dimensions relate to the rights of a group as such, in addition to individual rights. Conditions of return of results and/or benefit to a population need to be defined. With ‘large-scale biobanking’ a marked trend in genomics, new societal dimensions appear, regarding communication, debate, regulation, societal control and valorization of such large biobanks. Exploring how genomics can help health sector biobanks to become more rationally constituted and exploited is an interesting perspective. For example, evaluating how genomic approaches can help in optimizing haematopoietic stem cell donor registries using new markers and high-throughput techniques to increase immunogenetic variability in such registries is a challenge currently being addressed. Ethical issues in such contexts are important, as not only individual decisions or projects are concerned, but also national policies in the international arena and organization of democratic debate about science, medicine and society. PMID:18629026

  2. Comparative genomics - a perspective.

    PubMed

    Sivashankari, Selvarajan; Shanmughavel, Piramanayagam

    2007-03-27

    The rapidly emerging field of comparative genomics has yielded dramatic results. Comparative genome analysis has become feasible with the availability of a number of completely sequenced genomes. Comparison of complete genomes between organisms allow for global views on genome evolution and the availability of many completely sequenced genomes increases the predictive power in deciphering the hidden information in genome design, function and evolution. Thus, comparison of human genes with genes from other genomes in a genomic landscape could help assign novel functions for un-annotated genes. Here, we discuss the recently used techniques for comparative genomics and their derived inferences in genome biology.

  3. Comparative genomics - A perspective

    PubMed Central

    Sivashankari, Selvarajan; Shanmughavel, Piramanayagam

    2007-01-01

    The rapidly emerging field of comparative genomics has yielded dramatic results. Comparative genome analysis has become feasible with the availability of a number of completely sequenced genomes. Comparison of complete genomes between organisms allow for global views on genome evolution and the availability of many completely sequenced genomes increases the predictive power in deciphering the hidden information in genome design, function and evolution. Thus, comparison of human genes with genes from other genomes in a genomic landscape could help assign novel functions for un-annotated genes. Here, we discuss the recently used techniques for comparative genomics and their derived inferences in genome biology. PMID:17597925

  4. Genome cartography: charting the apicomplexan genome.

    PubMed

    Kissinger, Jessica C; DeBarry, Jeremy

    2011-08-01

    Genes reside in particular genomic contexts that can be mapped at many levels. Historically, 'genetic maps' were used primarily to locate genes. Recent technological advances in the determination of genome sequences have made the analysis and comparison of whole genomes possible and increasingly tractable. What do we see if we shift our focus from gene content (the 'inventory' of genes contained within a genome) to the composition and organization of a genome? This review examines what has been learned about the evolution of the apicomplexan genome as well as the significance and impact of genomic location on our understanding of the eukaryotic genome and parasite biology.

  5. Genomics of Bacillus Species

    NASA Astrophysics Data System (ADS)

    Økstad, Ole Andreas; Kolstø, Anne-Brit

    Members of the genus Bacillus are rod-shaped spore-forming bacteria belonging to the Firmicutes, the low G+C gram-positive bacteria. The Bacillus genus was first described and classified by Ferdinand Cohn in Cohn (1872), and Bacillus subtilis was defined as the type species (Soule, 1932). Several Bacilli may be linked to opportunistic infections. However, pathogenicity among Bacillus spp. is mainly a feature of bacteria belonging to the Bacillus cereus group, including B. cereus, Bacillus anthracis, and Bacillus thuringiensis. Here we review the genomics of B. cereus group bacteria in relation to their roles as etiological agents of two food poisoning syndromes (emetic and diarrhoeal).

  6. The Nostoc punctiforme Genome

    SciTech Connect

    John C. Meeks

    2001-12-31

    Nostoc punctiforme is a filamentous cyanobacterium with extensive phenotypic characteristics and a relatively large genome, approaching 10 Mb. The phenotypic characteristics include a photoautotrophic, diazotrophic mode of growth, but N. punctiforme is also facultatively heterotrophic; its vegetative cells have multiple development alternatives, including terminal differentiation into nitrogen-fixing heterocysts and transient differentiation into spore-like akinetes or motile filaments called hormogonia; and N. punctiforme has broad symbiotic competence with fungi and terrestrial plants, including bryophytes, gymnosperms and an angiosperm. The shotgun-sequencing phase of the N. punctiforme strain ATCC 29133 genome has been completed by the Joint Genome Institute. Annotation of an 8.9 Mb database yielded 7432 open reading frames, 45% of which encode proteins with known or probable known function and 29% of which are unique to N. punctiforme. Comparative analysis of the sequence indicates a genome that is highly plastic and in a state of flux, with numerous insertion sequences and multilocus repeats, as well as genes encoding transposases and DNA modification enzymes. The sequence also reveals the presence of genes encoding putative proteins that collectively define almost all characteristics of cyanobacteria as a group. N. punctiforme has an extensive potential to sense and respond to environmental signals as reflected by the presence of more than 400 genes encoding sensor protein kinases, response regulators and other transcriptional factors. The signal transduction systems and any of the large number of unique genes may play essential roles in the cell differentiation and symbiotic interaction properties of N. punctiforme.

  7. Towards a Library of Standard Operating Procedures (SOPs) for (meta)genomic annotation

    SciTech Connect

    Kyrpides, Nikos; Angiuoli, Samuel V.; Cochrane, Guy; Field, Dawn; Garrity, George; Gussman, Aaron; Kodira, Chinnappa D.; Klimke, William; Kyrpides, Nikos; Madupu, Ramana; Markowitz, Victor; Tatusova, Tatiana; Thomson, Nick; White, Owen

    2008-04-01

    Genome annotations describe the features of genomes and accompany sequences in genome databases. The methodologies used to generate genome annotation are diverse and typically vary amongst groups. Descriptions of the annotation procedure are helpful in interpreting genome annotation data. Standard Operating Procedures (SOPs) for genome annotation describe the processes that generate genome annotations. Some groups are currently documenting procedures but standards are lacking for structure and content of annotation SOPs. In addition, there is no central repository to store and disseminate procedures and protocols for genome annotation. We highlight the importance of SOPs for genome annotation and endorse a central online repository of SOPs.

  8. Non-genomic and Immune Evolution of Melanoma Acquiring MAPKi Resistance.

    PubMed

    Hugo, Willy; Shi, Hubing; Sun, Lu; Piva, Marco; Song, Chunying; Kong, Xiangju; Moriceau, Gatien; Hong, Aayoung; Dahlman, Kimberly B; Johnson, Douglas B; Sosman, Jeffrey A; Ribas, Antoni; Lo, Roger S

    2015-09-10

    Clinically acquired resistance to MAPK inhibitor (MAPKi) therapies for melanoma cannot be fully explained by genomic mechanisms and may be accompanied by co-evolution of intra-tumoral immunity. We sought to discover non-genomic mechanisms of acquired resistance and dynamic immune compositions by a comparative, transcriptomic-methylomic analysis of patient-matched melanoma tumors biopsied before therapy and during disease progression. Transcriptomic alterations across resistant tumors were highly recurrent, in contrast to mutations, and were frequently correlated with differential methylation of tumor cell-intrinsic CpG sites. We identified in the tumor cell compartment supra-physiologic c-MET up-expression, infra-physiologic LEF1 down-expression and YAP1 signature enrichment as drivers of acquired resistance. Importantly, high intra-tumoral cytolytic T cell inflammation prior to MAPKi therapy preceded CD8 T cell deficiency/exhaustion and loss of antigen presentation in half of disease-progressive melanomas, suggesting cross-resistance to salvage anti-PD-1/PD-L1 immunotherapy. Thus, melanoma acquires MAPKi resistance with highly dynamic and recurrent non-genomic alterations and co-evolving intra-tumoral immunity.

  9. Non-genomic and Immune Evolution of Melanoma Acquiring MAPKi Resistance

    PubMed Central

    Hugo, Willy; Shi, Hubing; Sun, Lu; Piva, Marco; Song, ChunYing; Kong, Xiangju; Moriceau, Gatien; Hong, Aayoung; Dahlman, Kimberly B.; Johnson, Douglas B.; Sosman, Jeffrey A.; Ribas, Antoni; Lo, Roger S.

    2015-01-01

    SUMMARY Clinically acquired resistance to MAPK inhibitor (MAPKi) therapies for melanoma cannot be fully explained by genomic mechanisms and may be accompanied by co-evolution of intra-tumoral immunity. We sought to discover non-genomic mechanisms of acquired resistance and dynamic immune compositions by a comparative, transcriptomic-methylomic analysis of patient-matched melanoma tumors biopsied before therapy and during disease progression. Transcriptomic alterations across resistant tumors were highly recurrent, in contrast to mutations, and were frequently correlated with differential methylation of tumor cell-intrinsic CpG sites. We identified in the tumor cell compartment supra-physiologic c-MET up-expression, infra-physiologic LEF1 down-expression, and YAP1 signature enrichment as drivers of acquired resistance. Importantly, high intra-tumoral cytolytic T-cell inflammation prior to MAPKi therapy preceded CD8 T-cell deficiency/exhaustion and loss of antigen-presentation in half of disease-progressive melanomas, suggesting cross-resistance to salvage anti-PD-1/PD-L1 immunotherapy. Thus, melanoma acquires MAPKi-resistance with highly dynamic and recurrent non-genomic alterations and co-evolving intra-tumoral immunity. PMID:26359985

  10. Citrus Genomics

    PubMed Central

    Talon, Manuel; Gmitter Jr., Fred G.

    2008-01-01

    Citrus is one of the most widespread fruit crops globally, with great economic and health value. It is among the most difficult plants to improve through traditional breeding approaches. Currently, there is risk of devastation by diseases threatening to limit production and future availability to the human population. As technologies rapidly advance in genomic science, they are quickly adapted to address the biological challenges of the citrus plant system and the world's industries. The historical developments of linkage mapping, markers and breeding, EST projects, physical mapping, an international citrus genome sequencing project, and critical functional analysis are described. Despite the challenges of working with citrus, there has been substantial progress. Citrus researchers engaged in international collaborations provide optimism about future productivity and contributions to the benefit of citrus industries worldwide and to the human population who can rely on future widespread availability of this health-promoting and aesthetically pleasing fruit crop. PMID:18509486

  11. Ancient genomics

    PubMed Central

    Der Sarkissian, Clio; Allentoft, Morten E.; Ávila-Arcos, María C.; Barnett, Ross; Campos, Paula F.; Cappellini, Enrico; Ermini, Luca; Fernández, Ruth; da Fonseca, Rute; Ginolhac, Aurélien; Hansen, Anders J.; Jónsson, Hákon; Korneliussen, Thorfinn; Margaryan, Ashot; Martin, Michael D.; Moreno-Mayar, J. Víctor; Raghavan, Maanasa; Rasmussen, Morten; Velasco, Marcela Sandoval; Schroeder, Hannes; Schubert, Mikkel; Seguin-Orlando, Andaine; Wales, Nathan; Gilbert, M. Thomas P.; Willerslev, Eske; Orlando, Ludovic

    2015-01-01

    The past decade has witnessed a revolution in ancient DNA (aDNA) research. Although the field's focus was previously limited to mitochondrial DNA and a few nuclear markers, whole genome sequences from the deep past can now be retrieved. This breakthrough is tightly connected to the massive sequence throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when testing specific hypotheses related to the past. PMID:25487338

  12. Avian papillomaviruses: the parrot Psittacus erithacus papillomavirus (PePV) genome has a unique organization of the early protein region and is phylogenetically related to the chaffinch papillomavirus

    PubMed Central

    Tachezy, Ruth; Rector, Annabel; Havelkova, Marta; Wollants, Elke; Fiten, Pierre; Opdenakker, Ghislain; Jenson, A Bennett; Sundberg, John P; Van Ranst, Marc

    2002-01-01

    Background An avian papillomavirus genome has been cloned from a cutaneous exophytic papilloma from an African grey parrot (Psittacus erithacus). The nucleotide sequence, genome organization, and phylogenetic position of the Psittacus erithacus papillomavirus (PePV) were determined. This PePV sequence represents the first complete avian papillomavirus genome defined. Results The PePV genome (7304 basepairs) differs from other papillomaviruses, in that it has a unique organization of the early protein region lacking classical E6 and E7 open reading frames. Phylogenetic comparison of the PePV sequence with partial E1 and L1 sequences of the chaffinch (Fringilla coelebs) papillomavirus (FPV) reveals that these two avian papillomaviruses form a monophyletic cluster with a common branch that originates near the unresolved center of the papillomavirus evolutionary tree. Conclusions The PePV genome has a unique layout of the early protein region which represents a novel prototypic genomic organization for avian papillomaviruses. The close relationship between PePV and FPV, and between their Psittaciformes and Passeriformes hosts, supports the hypothesis that papillomaviruses have co-evolved and speciated together with their host species throughout evolution. PMID:12110158

  13. Genomes on the Edge: Programmed Genome Instability in Ciliates

    PubMed Central

    Bracht, John R.; Fang, Wenwen; Goldman, Aaron David; Dolzhenko, Egor; Stein, Elizabeth M.; Landweber, Laura F.

    2013-01-01

    Ciliates are an ancient and diverse group of microbial eukaryotes that have emerged as powerful models for RNA-mediated epigenetic inheritance. They possess extensive sets of both tiny and long noncoding RNAs that, together with a suite of proteins that includes transposases, orchestrate a broad cascade of genome rearrangements during somatic nuclear development. This Review emphasizes three important themes: the remarkable role of RNA in shaping genome structure, recent discoveries that unify many deeply diverged ciliate genetic systems, and a surprising evolutionary “sign change” in the role of small RNAs between major species groups. PMID:23374338

  14. Development of genome viewer (Web Omics Viewer) for managing databases of cucumber genome

    NASA Astrophysics Data System (ADS)

    Wojcieszek, M.; RóŻ, P.; Pawełkowicz, M.; Nowak, R.; Przybecki, Z.

    Cucumber is an important plant in horticulture and science world. Sequencing projects of C. sativus genome enable new methodological aproaches in further investigation of this species. Accessibility is crucial to fully exploit obtained information about detail structure of genes, markers and other characteristic features such contigs, scaffolds and chromosomes. Genome viewer is one of tools providing plain and easy way for presenting genome data for users and for databases administration. Gbrowse - the main viewer has several very useful features but lacks in managing simplicity. Our group developed new genome browser Web Omics Viewer (WOV), keeping functionality but improving utilization and accessibility to cucumber genome data.

  15. Lateral genomics.

    PubMed

    Doolittle, W F

    1999-12-01

    More than 20 complete prokaryotic genome sequences are now publicly available, each by itself an unparalleled resource for understanding organismal biology. Collectively, these data are even more powerful: they could force a dramatic reworking of the framework in which we understand biological evolution. It is possible that a single universal phylogenetic tree is not the best way to depict relationships between all living and extinct species. Instead a web- or net-like pattern, reflecting the importance of horizontal or lateral gene transfer between lineages of organisms, might provide a more appropriate visual metaphor. Here, I ask whether this way of thinking is really justified, and explore its implications.

  16. Update on RefSeq microbial genomes resources.

    PubMed

    Tatusova, Tatiana; Ciufo, Stacy; Federhen, Scott; Fedorov, Boris; McVeigh, Richard; O'Neill, Kathleen; Tolstoy, Igor; Zaslavsky, Leonid

    2015-01-01

    NCBI RefSeq genome collection http://www.ncbi.nlm.nih.gov/genome represents all three major domains of life: Eukarya, Bacteria and Archaea as well as Viruses. Prokaryotic genome sequences are the most rapidly growing part of the collection. During the year of 2014 more than 10,000 microbial genome assemblies have been publicly released bringing the total number of prokaryotic genomes close to 30,000. We continue to improve the quality and usability of the microbial genome resources by providing easy access to the data and the results of the pre-computed analysis, and improving analysis and visualization tools. A number of improvements have been incorporated into the Prokaryotic Genome Annotation Pipeline. Several new features have been added to RefSeq prokaryotic genomes data processing pipeline including the calculation of genome groups (clades) and the optimization of protein clusters generation using pan-genome approach.

  17. Patterns of genome size variation in snapping shrimp.

    PubMed

    Jeffery, Nicholas W; Hultgren, Kristin; Chak, Solomon Tin Chi; Gregory, T Ryan; Rubenstein, Dustin R

    2016-06-01

    Although crustaceans vary extensively in genome size, little is known about how genome size may affect the ecology and evolution of species in this diverse group, in part due to the lack of large genome size datasets. Here we investigate interspecific, intraspecific, and intracolony variation in genome size in 39 species of Synalpheus shrimps, representing one of the largest genome size datasets for a single genus within crustaceans. We find that genome size ranges approximately 4-fold across Synalpheus with little phylogenetic signal, and is not related to body size. In a subset of these species, genome size is related to chromosome size, but not to chromosome number, suggesting that despite large genomes, these species are not polyploid. Interestingly, there appears to be 35% intraspecific genome size variation in Synalpheus idios among geographic regions, and up to 30% variation in Synalpheus duffyi genome size within the same colony.

  18. Training Groups, Encounter Groups, Sensitivity Groups and Group Psychotherapy

    PubMed Central

    Gottschalk, Louis A.; Pattison, E. Mansell; Schafer, Donald W.

    1971-01-01

    Descriptions and comparison of group therapies and the new group procedures (training groups and sensitivity groups—an outgrowth of the so-called Laboratory Movement methods of the mid-1930's) have been provided for the better understanding of non-psychiatric physicians. A group leader must have proper training and must help his group in its search for its avowed goals, whether he is a group therapist, a sensitivity trainer, or anyone else interested in utilizing group processes. Those goals are either the therapeutic benefit of the individual, as defined in group psychotherapy, or a better understanding of how one functions in groups, as in T-groups or the other group processes in the area of sensitive living. All group situations contain powerful tools which must be handled with proper respect. When so handled by experienced leaders, the individuals involved can achieve their goals in these group experiences. PMID:18730582

  19. Group Counseling

    ERIC Educational Resources Information Center

    Mahler, Clarence A.

    1971-01-01

    This article reviews the major concerns of group counseling and differentiates among group guidance, group counseling, and group therapy. It also evaluates the research status of group counseling and presents implications for the future of this approach. Comment by Carl E. Thoresen follows. (Author)

  20. Genomes on ice.

    PubMed

    Parkhill, Julian

    2016-03-01

    This month's Genome Watch discusses the analysis of a Helicobacter pylori genome from the preserved Copper-Age mummy known as the Iceman and how ancient genomes shed light on the history of bacterial pathogens. PMID:26853114

  1. Genomes on ice.

    PubMed

    Parkhill, Julian

    2016-03-01

    This month's Genome Watch discusses the analysis of a Helicobacter pylori genome from the preserved Copper-Age mummy known as the Iceman and how ancient genomes shed light on the history of bacterial pathogens.

  2. Whole Genome Sequencing

    MedlinePlus

    ... you want to learn. Search form Search Whole Genome Sequencing You are here Home Testing & Services Testing ... the full story, click here . What is whole genome sequencing? Whole genome sequencing is the mapping out ...

  3. Comparative genomic analyses in Asparagus.

    PubMed

    Kuhl, Joseph C; Havey, Michael J; Martin, William J; Cheung, Foo; Yuan, Qiaoping; Landherr, Lena; Hu, Yi; Leebens-Mack, James; Town, Christopher D; Sink, Kenneth C

    2005-12-01

    Garden asparagus (Asparagus officinalis L.) belongs to the monocot family Asparagaceae in the order Asparagales. Onion (Allium cepa L.) and Asparagus officinalis are 2 of the most economically important plants of the core Asparagales, a well supported monophyletic group within the Asparagales. Coding regions in onion have lower GC contents than the grasses. We compared the GC content of 3374 unique expressed sequence tags (ESTs) from A. officinalis with Lycoris longituba and onion (both members of the core Asparagales), Acorus americanus (sister to all other monocots), the grasses, and Arabidopsis. Although ESTs in A. officinalis and Acorus had a higher average GC content than Arabidopsis, Lycoris, and onion, all were clearly lower than the grasses. The Asparagaceae have the smallest nuclear genomes among all plants in the core Asparagales, which typically have huge genomes. Within the Asparagaceae, European Asparagus species have approximately twice the nuclear DNA of that of southern African Asparagus species. We cloned and sequenced 20 genomic amplicons from European A. officinalis and the southern African species Asparagus plumosus and observed no clear evidence for a recent genome doubling in A. officinalis relative to A. plumosus. These results indicate that members of the genus Asparagus with smaller genomes may be useful genomic models for plants in the core Asparagales. PMID:16391674

  4. Genome Sequence of Spizellomyces punctatus

    PubMed Central

    Russ, Carsten; Lang, B. Franz; Chen, Zehua; Gujja, Sharvari; Shea, Terrance; Zeng, Qiandong; Young, Sarah; Nusbaum, Chad

    2016-01-01

    Spizellomyces punctatus is a basally branching chytrid fungus that is found in the Chytridiomycota phylum. Spizellomyces species are common in soil and of importance in terrestrial ecosystems. Here, we report the genome sequence of S. punctatus, which will facilitate the study of this group of early diverging fungi. PMID:27540072

  5. Genome Sequence of Spizellomyces punctatus.

    PubMed

    Russ, Carsten; Lang, B Franz; Chen, Zehua; Gujja, Sharvari; Shea, Terrance; Zeng, Qiandong; Young, Sarah; Cuomo, Christina A; Nusbaum, Chad

    2016-01-01

    Spizellomyces punctatus is a basally branching chytrid fungus that is found in the Chytridiomycota phylum. Spizellomyces species are common in soil and of importance in terrestrial ecosystems. Here, we report the genome sequence of S. punctatus, which will facilitate the study of this group of early diverging fungi. PMID:27540072

  6. Clinical Genomic Database

    PubMed Central

    Solomon, Benjamin D.; Nguyen, Anh-Dao; Bear, Kelly A.; Wolfsberg, Tyra G.

    2013-01-01

    Technological advances have greatly increased the availability of human genomic sequencing. However, the capacity to analyze genomic data in a clinically meaningful way lags behind the ability to generate such data. To help address this obstacle, we reviewed all conditions with genetic causes and constructed the Clinical Genomic Database (CGD) (http://research.nhgri.nih.gov/CGD/), a searchable, freely Web-accessible database of conditions based on the clinical utility of genetic diagnosis and the availability of specific medical interventions. The CGD currently includes a total of 2,616 genes organized clinically by affected organ systems and interventions (including preventive measures, disease surveillance, and medical or surgical interventions) that could be reasonably warranted by the identification of pathogenic mutations. To aid independent analysis and optimize new data incorporation, the CGD also includes all genetic conditions for which genetic knowledge may affect the selection of supportive care, informed medical decision-making, prognostic considerations, reproductive decisions, and allow avoidance of unnecessary testing, but for which specific interventions are not otherwise currently available. For each entry, the CGD includes the gene symbol, conditions, allelic conditions, clinical categorization (for both manifestations and interventions), mode of inheritance, affected age group, description of interventions/rationale, links to other complementary databases, including databases of variants and presumed pathogenic mutations, and links to PubMed references (>20,000). The CGD will be regularly maintained and updated to keep pace with scientific discovery. Further content-based expert opinions are actively solicited. Eventually, the CGD may assist the rapid curation of individual genomes as part of active medical care. PMID:23696674

  7. The 'inner circle' of the cereal genomes.

    PubMed

    Bolot, Stéphanie; Abrouk, Michael; Masood-Quraishi, Umar; Stein, Nils; Messing, Joachim; Feuillet, Catherine; Salse, Jérôme

    2009-04-01

    Early marker-based macrocolinearity studies between the grass genomes led to arranging their chromosomes into concentric 'crop circles' of synteny blocks that initially consisted of 30 rice-independent linkage groups representing the ancestral cereal genome structure. Recently, increased marker density and genome sequencing of several cereal genomes allowed the characterization of intragenomic duplications and their integration with intergenomic colinearity data to identify paleo-duplications and propose a model for the evolution of the grass genomes from a common ancestor. On the basis of these data an 'inner circle' comprising five ancestral chromosomes was defined providing a new reference for the grass chromosomes and new insights into their ancestral relationships and origin, as well as an efficient tool to design cross-genome markers for genetic studies.

  8. Genome sequence of Coxiella burnetii strain Namibia

    PubMed Central

    2014-01-01

    We present the whole genome sequence and annotation of the Coxiella burnetii strain Namibia. This strain was isolated from an aborting goat in 1991 in Windhoek, Namibia. The plasmid type QpRS was confirmed in our work. Further genomic typing placed the strain into a unique genomic group. The genome sequence is 2,101,438 bp long and contains 1,979 protein-coding and 51 RNA genes, including one rRNA operon. To overcome the poor yield from cell culture systems, an additional DNA enrichment with whole genome amplification (WGA) methods was applied. We describe a bioinformatics pipeline for improved genome assembly including several filters with a special focus on WGA characteristics. PMID:25593636

  9. Complete Genome Sequence and Comparative Genomics of a Novel Myxobacterium Myxococcus hansupus

    PubMed Central

    Sharma, Gaurav; Narwani, Tarun; Subramanian, Srikrishna

    2016-01-01

    Myxobacteria, a group of Gram-negative aerobes, belong to the class δ-proteobacteria and order Myxococcales. Unlike anaerobic δ-proteobacteria, they exhibit several unusual physiogenomic properties like gliding motility, desiccation-resistant myxospores and large genomes with high coding density. Here we report a 9.5 Mbp complete genome of Myxococcus hansupus that encodes 7,753 proteins. Phylogenomic and genome-genome distance based analysis suggest that Myxococcus hansupus is a novel member of the genus Myxococcus. Comparative genome analysis with other members of the genus Myxococcus was performed to explore their genome diversity. The variation in number of unique proteins observed across different species is suggestive of diversity at the genus level while the overrepresentation of several Pfam families indicates the extent and mode of genome expansion as compared to non-Myxococcales δ-proteobacteria. PMID:26900859

  10. Ensembl genomes 2016: more genomes, more complexity

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent...

  11. Ensembl Genomes 2016: more genomes, more complexity

    PubMed Central

    Kersey, Paul Julian; Allen, James E.; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J.; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J.; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K.; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D.; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello–Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M.; Howe, Kevin L.; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M.

    2016-01-01

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces. PMID:26578574

  12. Life insurance: genomic stratification and risk classification

    PubMed Central

    Joly, Yann; Burton, Hilary; Knoppers, Bartha Maria; Feze, Ida Ngueng; Dent, Tom; Pashayan, Nora; Chowdhury, Susmita; Foulkes, William; Hall, Alison; Hamet, Pavel; Kirwan, Nick; Macdonald, Angus; Simard, Jacques; Van Hoyweghen, Ine

    2014-01-01

    With the development and increasing accessibility of new genomic tools such as next-generation sequencing, genome-wide association studies, and genomic stratification models, the debate on genetic discrimination in the context of life insurance became even more complex, requiring a review of current practices and the exploration of new scenarios. In this perspective, a multidisciplinary group of international experts representing different interests revisited the genetics and life insurance debate during a 2-day symposium ‘Life insurance: breast cancer research and genetic risk prediction seminar' held in Quebec City, Canada on 24 and 25 September 2012. Having reviewed the current legal, social, and ethical issues on the use of genomic information in the context of life insurance, the Expert Group identified four main questions: (1) Have recent developments in genomics and related sciences changed the contours of the genetics and life insurance debate? (2) Are genomic results obtained in a research context relevant for life insurance underwriting? (3) Should predictive risk assessment and risk stratification models based on genomic data also be used for life insurance underwriting? (4) What positive actions could stakeholders in the debate take to alleviate concerns over the use of genomic information by life insurance underwriters? This paper presents a summary of the discussions and the specific action items recommended by the Expert Group. PMID:24129434

  13. Life insurance: genomic stratification and risk classification.

    PubMed

    Joly, Yann; Burton, Hilary; Knoppers, Bartha Maria; Feze, Ida Ngueng; Dent, Tom; Pashayan, Nora; Chowdhury, Susmita; Foulkes, William; Hall, Alison; Hamet, Pavel; Kirwan, Nick; Macdonald, Angus; Simard, Jacques; Van Hoyweghen, Ine

    2014-05-01

    With the development and increasing accessibility of new genomic tools such as next-generation sequencing, genome-wide association studies, and genomic stratification models, the debate on genetic discrimination in the context of life insurance became even more complex, requiring a review of current practices and the exploration of new scenarios. In this perspective, a multidisciplinary group of international experts representing different interests revisited the genetics and life insurance debate during a 2-day symposium 'Life insurance: breast cancer research and genetic risk prediction seminar' held in Quebec City, Canada on 24 and 25 September 2012. Having reviewed the current legal, social, and ethical issues on the use of genomic information in the context of life insurance, the Expert Group identified four main questions: (1) Have recent developments in genomics and related sciences changed the contours of the genetics and life insurance debate? (2) Are genomic results obtained in a research context relevant for life insurance underwriting? (3) Should predictive risk assessment and risk stratification models based on genomic data also be used for life insurance underwriting? (4) What positive actions could stakeholders in the debate take to alleviate concerns over the use of genomic information by life insurance underwriters? This paper presents a summary of the discussions and the specific action items recommended by the Expert Group.

  14. Phydbac (phylogenomic display of bacterial genes): An interactive resource for the annotation of bacterial genomes.

    PubMed

    Enault, François; Suhre, Karsten; Poirot, Olivier; Abergel, Chantal; Claverie, Jean-Michel

    2003-07-01

    Phydbac is a web interactive resource based on phylogenomic profiling, designed to help microbiologists to annotate bacterial proteins. Phylogenomic annotation is based on the assumption that functionally linked protein-coding genes must evolve in a coordinated manner. The detection of subsets of co-evolving genes within a given genome involves the computation of protein sequence conservation profiles across a spectrum of microbial species, followed by the identification of significant pairwise correlations between them. Many ongoing studies are devoted to the problem of computing the most biologically significant phylogenomic profiles and how best identifying clusters of 'functionally interacting' genes. Here we introduce a web tool, Phydbac, allowing the dynamic construction of phylogenomic profiles of protein sequences of interest and their interactive display. In addition, Phydbac can identify Escherichia coli proteins exhibiting the evolution pattern most similar to arbitrary query protein sequences, hence providing functional hints for open reading frames (ORFs) of hypothetical or unknown function. The phylogenomic profiles of all E.coli K-12 protein-coding genes are pre-computed, allowing queries about E.coli genes to be answered instantaneously. The profiles and phylogenomic neighborhoods are computed using an original method shown to perform better than previous ones. An extension of Phydbac, including precomputed profiles for all available bacterial genomes (including major pathogens) will soon be available. Phydbac can be accessed at: http://igs-server.cnrs-mrs.fr/phydbac/.

  15. Insect mitochondrial genomics: implications for evolution and phylogeny.

    PubMed

    Cameron, Stephen L

    2014-01-01

    The mitochondrial (mt) genome is, to date, the most extensively studied genomic system in insects, outnumbering nuclear genomes tenfold and representing all orders versus very few. Phylogenomic analysis methods have been tested extensively, identifying compositional bias and rate variation, both within and between lineages, as the principal issues confronting accurate analyses. Major studies at both inter- and intraordinal levels have contributed to our understanding of phylogenetic relationships within many groups. Genome rearrangements are an additional data type for defining relationships, with rearrangement synapomorphies identified across multiple orders and at many different taxonomic levels. Hymenoptera and Psocodea have greatly elevated rates of rearrangement offering both opportunities and pitfalls for identifying rearrangement synapomorphies in each group. Finally, insects are model systems for studying aberrant mt genomes, including truncated tRNAs and multichromosomal genomes. Greater integration of nuclear and mt genomic studies is necessary to further our understanding of insect genomic evolution.

  16. Group X

    SciTech Connect

    Fields, Susannah

    2007-08-16

    This project is currently under contract for research through the Department of Homeland Security until 2011. The group I was responsible for studying has to remain confidential so as not to affect the current project. All dates, reference links and authors, and other distinguishing characteristics of the original group have been removed from this report. All references to the name of this group or the individual splinter groups has been changed to 'Group X'. I have been collecting texts from a variety of sources intended for the use of recruiting and radicalizing members for Group X splinter groups for the purpose of researching the motivation and intent of leaders of those groups and their influence over the likelihood of group radicalization. This work included visiting many Group X websites to find information on splinter group leaders and finding their statements to new and old members. This proved difficult because the splinter groups of Group X are united in beliefs, but differ in public opinion. They are eager to tear each other down, prove their superiority, and yet remain anonymous. After a few weeks of intense searching, a list of eight recruiting texts and eight radicalizing texts from a variety of Group X leaders were compiled.

  17. Comparative Genomic and Phylogenomic Analyses Reveal a Conserved Core Genome Shared by Estuarine and Oceanic Cyanopodoviruses.

    PubMed

    Huang, Sijun; Zhang, Si; Jiao, Nianzhi; Chen, Feng

    2015-01-01

    Podoviruses are among the major viral groups that infect marine picocyanobacteria Prochlorococcus and Synechococcus. Here, we reported the genome sequences of five Synechococcus podoviruses isolated from the estuarine environment, and performed comparative genomic and phylogenomic analyses based on a total of 20 cyanopodovirus genomes. The genomes of all the known marine cyanopodoviruses are highly syntenic. A pan-genome of 349 clustered orthologous groups was determined, among which 15 were core genes. These core genes make up nearly half of each genome in length, reflecting the high level of genome conservation among this cyanophage type. The whole genome phylogenies based on concatenated core genes and gene content were highly consistent and confirmed the separation of two discrete marine cyanopodovirus clusters MPP-A and MPP-B. The genomes within cluster MPP-B grouped into subclusters mainly corresponding to Prochlorococcus or Synechococcus host types. Auxiliary metabolic genes tend to occur in a specific phylogenetic group of these cyanopodoviruses. All the MPP-B phages analyzed here encode the photosynthesis gene psbA, which are absent in all the MPP-A genomes thus far. Interestingly, all the MPP-B and two MPP-A Synechococcus podoviruses encode the thymidylate synthase gene thyX, while at the same genome locus all the MPP-B Prochlorococcus podoviruses encode the transaldolase gene talC. Both genes are hypothesized to have the potential to facilitate the biosynthesis of deoxynucleotide for phage replication. Inheritance of specific functional genes could be important to the evolution and ecological fitness of certain cyanophage genotypes. Our analyses demonstrate that cyanopodoviruses of estuarine and oceanic origins share a conserved core genome and suggest that accessory genes may be related to environmental adaptation.

  18. Comparative Genomic and Phylogenomic Analyses Reveal a Conserved Core Genome Shared by Estuarine and Oceanic Cyanopodoviruses.

    PubMed

    Huang, Sijun; Zhang, Si; Jiao, Nianzhi; Chen, Feng

    2015-01-01

    Podoviruses are among the major viral groups that infect marine picocyanobacteria Prochlorococcus and Synechococcus. Here, we reported the genome sequences of five Synechococcus podoviruses isolated from the estuarine environment, and performed comparative genomic and phylogenomic analyses based on a total of 20 cyanopodovirus genomes. The genomes of all the known marine cyanopodoviruses are highly syntenic. A pan-genome of 349 clustered orthologous groups was determined, among which 15 were core genes. These core genes make up nearly half of each genome in length, reflecting the high level of genome conservation among this cyanophage type. The whole genome phylogenies based on concatenated core genes and gene content were highly consistent and confirmed the separation of two discrete marine cyanopodovirus clusters MPP-A and MPP-B. The genomes within cluster MPP-B grouped into subclusters mainly corresponding to Prochlorococcus or Synechococcus host types. Auxiliary metabolic genes tend to occur in a specific phylogenetic group of these cyanopodoviruses. All the MPP-B phages analyzed here encode the photosynthesis gene psbA, which are absent in all the MPP-A genomes thus far. Interestingly, all the MPP-B and two MPP-A Synechococcus podoviruses encode the thymidylate synthase gene thyX, while at the same genome locus all the MPP-B Prochlorococcus podoviruses encode the transaldolase gene talC. Both genes are hypothesized to have the potential to facilitate the biosynthesis of deoxynucleotide for phage replication. Inheritance of specific functional genes could be important to the evolution and ecological fitness of certain cyanophage genotypes. Our analyses demonstrate that cyanopodoviruses of estuarine and oceanic origins share a conserved core genome and suggest that accessory genes may be related to environmental adaptation. PMID:26569403

  19. Group Flow and Group Genius

    ERIC Educational Resources Information Center

    Sawyer, Keith

    2015-01-01

    Keith Sawyer views the spontaneous collaboration of group creativity and improvisation actions as "group flow," which organizations can use to function at optimum levels. Sawyer establishes ideal conditions for group flow: group goals, close listening, complete concentration, being in control, blending egos, equal participation, knowing…

  20. Funding Opportunity: Genomic Data Centers

    Cancer.gov

    Funding Opportunity CCG, Funding Opportunity Center for Cancer Genomics, CCG, Center for Cancer Genomics, CCG RFA, Center for cancer genomics rfa, genomic data analysis network, genomic data analysis network centers,

  1. Whole-genome sequence of the Tibetan frog Nanorana parkeri and the comparative evolution of tetrapod genomes.

    PubMed

    Sun, Yan-Bo; Xiong, Zi-Jun; Xiang, Xue-Yan; Liu, Shi-Ping; Zhou, Wei-Wei; Tu, Xiao-Long; Zhong, Li; Wang, Lu; Wu, Dong-Dong; Zhang, Bao-Lin; Zhu, Chun-Ling; Yang, Min-Min; Chen, Hong-Man; Li, Fang; Zhou, Long; Feng, Shao-Hong; Huang, Chao; Zhang, Guo-Jie; Irwin, David; Hillis, David M; Murphy, Robert W; Yang, Huan-Ming; Che, Jing; Wang, Jun; Zhang, Ya-Ping

    2015-03-17

    The development of efficient sequencing techniques has resulted in large numbers of genomes being available for evolutionary studies. However, only one genome is available for all amphibians, that of Xenopus tropicalis, which is distantly related from the majority of frogs. More than 96% of frogs belong to the Neobatrachia, and no genome exists for this group. This dearth of amphibian genomes greatly restricts genomic studies of amphibians and, more generally, our understanding of tetrapod genome evolution. To fill this gap, we provide the de novo genome of a Tibetan Plateau frog, Nanorana parkeri, and compare it to that of X. tropicalis and other vertebrates. This genome encodes more than 20,000 protein-coding genes, a number similar to that of Xenopus. Although the genome size of Nanorana is considerably larger than that of Xenopus (2.3 vs. 1.5 Gb), most of the difference is due to the respective number of transposable elements in the two genomes. The two frogs exhibit considerable conserved whole-genome synteny despite having diverged approximately 266 Ma, indicating a slow rate of DNA structural evolution in anurans. Multigenome synteny blocks further show that amphibians have fewer interchromosomal rearrangements than mammals but have a comparable rate of intrachromosomal rearrangements. Our analysis also identifies 11 Mb of anuran-specific highly conserved elements that will be useful for comparative genomic analyses of frogs. The Nanorana genome offers an improved understanding of evolution of tetrapod genomes and also provides a genomic reference for other evolutionary studies.

  2. Whole-genome sequence of the Tibetan frog Nanorana parkeri and the comparative evolution of tetrapod genomes

    PubMed Central

    Sun, Yan-Bo; Xiong, Zi-Jun; Xiang, Xue-Yan; Liu, Shi-Ping; Zhou, Wei-Wei; Tu, Xiao-Long; Zhong, Li; Wang, Lu; Wu, Dong-Dong; Zhang, Bao-Lin; Zhu, Chun-Ling; Yang, Min-Min; Chen, Hong-Man; Li, Fang; Zhou, Long; Feng, Shao-Hong; Huang, Chao; Zhang, Guo-Jie; Irwin, David; Hillis, David M.; Murphy, Robert W.; Yang, Huan-Ming; Che, Jing; Wang, Jun; Zhang, Ya-Ping

    2015-01-01

    The development of efficient sequencing techniques has resulted in large numbers of genomes being available for evolutionary studies. However, only one genome is available for all amphibians, that of Xenopus tropicalis, which is distantly related from the majority of frogs. More than 96% of frogs belong to the Neobatrachia, and no genome exists for this group. This dearth of amphibian genomes greatly restricts genomic studies of amphibians and, more generally, our understanding of tetrapod genome evolution. To fill this gap, we provide the de novo genome of a Tibetan Plateau frog, Nanorana parkeri, and compare it to that of X. tropicalis and other vertebrates. This genome encodes more than 20,000 protein-coding genes, a number similar to that of Xenopus. Although the genome size of Nanorana is considerably larger than that of Xenopus (2.3 vs. 1.5 Gb), most of the difference is due to the respective number of transposable elements in the two genomes. The two frogs exhibit considerable conserved whole-genome synteny despite having diverged approximately 266 Ma, indicating a slow rate of DNA structural evolution in anurans. Multigenome synteny blocks further show that amphibians have fewer interchromosomal rearrangements than mammals but have a comparable rate of intrachromosomal rearrangements. Our analysis also identifies 11 Mb of anuran-specific highly conserved elements that will be useful for comparative genomic analyses of frogs. The Nanorana genome offers an improved understanding of evolution of tetrapod genomes and also provides a genomic reference for other evolutionary studies. PMID:25733869

  3. Whole-genome sequence of the Tibetan frog Nanorana parkeri and the comparative evolution of tetrapod genomes.

    PubMed

    Sun, Yan-Bo; Xiong, Zi-Jun; Xiang, Xue-Yan; Liu, Shi-Ping; Zhou, Wei-Wei; Tu, Xiao-Long; Zhong, Li; Wang, Lu; Wu, Dong-Dong; Zhang, Bao-Lin; Zhu, Chun-Ling; Yang, Min-Min; Chen, Hong-Man; Li, Fang; Zhou, Long; Feng, Shao-Hong; Huang, Chao; Zhang, Guo-Jie; Irwin, David; Hillis, David M; Murphy, Robert W; Yang, Huan-Ming; Che, Jing; Wang, Jun; Zhang, Ya-Ping

    2015-03-17

    The development of efficient sequencing techniques has resulted in large numbers of genomes being available for evolutionary studies. However, only one genome is available for all amphibians, that of Xenopus tropicalis, which is distantly related from the majority of frogs. More than 96% of frogs belong to the Neobatrachia, and no genome exists for this group. This dearth of amphibian genomes greatly restricts genomic studies of amphibians and, more generally, our understanding of tetrapod genome evolution. To fill this gap, we provide the de novo genome of a Tibetan Plateau frog, Nanorana parkeri, and compare it to that of X. tropicalis and other vertebrates. This genome encodes more than 20,000 protein-coding genes, a number similar to that of Xenopus. Although the genome size of Nanorana is considerably larger than that of Xenopus (2.3 vs. 1.5 Gb), most of the difference is due to the respective number of transposable elements in the two genomes. The two frogs exhibit considerable conserved whole-genome synteny despite having diverged approximately 266 Ma, indicating a slow rate of DNA structural evolution in anurans. Multigenome synteny blocks further show that amphibians have fewer interchromosomal rearrangements than mammals but have a comparable rate of intrachromosomal rearrangements. Our analysis also identifies 11 Mb of anuran-specific highly conserved elements that will be useful for comparative genomic analyses of frogs. The Nanorana genome offers an improved understanding of evolution of tetrapod genomes and also provides a genomic reference for other evolutionary studies. PMID:25733869

  4. Genome Mapping in Plant Comparative Genomics.

    PubMed

    Chaney, Lindsay; Sharp, Aaron R; Evans, Carrie R; Udall, Joshua A

    2016-09-01

    Genome mapping produces fingerprints of DNA sequences to construct a physical map of the whole genome. It provides contiguous, long-range information that complements and, in some cases, replaces sequencing data. Recent advances in genome-mapping technology will better allow researchers to detect large (>1kbp) structural variations between plant genomes. Some molecular and informatics complications need to be overcome for this novel technology to achieve its full utility. This technology will be useful for understanding phenotype responses due to DNA rearrangements and will yield insights into genome evolution, particularly in polyploids. In this review, we outline recent advances in genome-mapping technology, including the processes required for data collection and analysis, and applications in plant comparative genomics.

  5. Enabling functional genomics with genome engineering.

    PubMed

    Hilton, Isaac B; Gersbach, Charles A

    2015-10-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances.

  6. Enabling functional genomics with genome engineering.

    PubMed

    Hilton, Isaac B; Gersbach, Charles A

    2015-10-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances. PMID:26430154

  7. Genome of horsepox virus.

    PubMed

    Tulman, E R; Delhon, G; Afonso, C L; Lu, Z; Zsak, L; Sandybaev, N T; Kerembekova, U Z; Zaitsev, V L; Kutish, G F; Rock, D L

    2006-09-01

    Here we present the genomic sequence of horsepox virus (HSPV) isolate MNR-76, an orthopoxvirus (OPV) isolated in 1976 from diseased Mongolian horses. The 212-kbp genome contained 7.5-kbp inverted terminal repeats and lacked extensive terminal tandem repetition. HSPV contained 236 open reading frames (ORFs) with similarity to those in other OPVs, with those in the central 100-kbp region most conserved relative to other OPVs. Phylogenetic analysis of the conserved region indicated that HSPV is closely related to sequenced isolates of vaccinia virus (VACV) and rabbitpox virus, clearly grouping together these VACV-like viruses. Fifty-four HSPV ORFs likely represented fragments of 25 orthologous OPV genes, including in the central region the only known fragmented form of an OPV ribonucleotide reductase large subunit gene. In terminal genomic regions, HSPV lacked full-length homologues of genes variably fragmented in other VACV-like viruses but was unique in fragmentation of the homologue of VACV strain Copenhagen B6R, a gene intact in other known VACV-like viruses. Notably, HSPV contained in terminal genomic regions 17 kbp of OPV-like sequence absent in known VACV-like viruses, including fragments of genes intact in other OPVs and approximately 1.4 kb of sequence present only in cowpox virus (CPXV). HSPV also contained seven full-length genes fragmented or missing in other VACV-like viruses, including intact homologues of the CPXV strain GRI-90 D2L/I4R CrmB and D13L CD30-like tumor necrosis factor receptors, D3L/I3R and C1L ankyrin repeat proteins, B19R kelch-like protein, D7L BTB/POZ domain protein, and B22R variola virus B22R-like protein. These results indicated that HSPV contains unique genomic features likely contributing to a unique virulence/host range phenotype. They also indicated that while closely related to known VACV-like viruses, HSPV contains additional, potentially ancestral sequences absent in other VACV-like viruses.

  8. Two New Loci for Body-Weight Regulation Identified in a Joint Analysis of Genome-Wide Association Studies for Early-Onset Extreme Obesity in French and German Study Groups

    PubMed Central

    Scherag, André; Dina, Christian; Hinney, Anke; Vatin, Vincent; Scherag, Susann; Vogel, Carla I. G.; Müller, Timo D.; Grallert, Harald; Wichmann, H.-Erich; Balkau, Beverley; Heude, Barbara; Jarvelin, Marjo-Riitta; Hartikainen, Anna-Liisa; Levy-Marchal, Claire; Weill, Jacques; Delplanque, Jérôme; Körner, Antje; Kiess, Wieland; Kovacs, Peter; Rayner, Nigel W.; Prokopenko, Inga; McCarthy, Mark I.; Schäfer, Helmut; Jarick, Ivonne; Boeing, Heiner; Fisher, Eva; Reinehr, Thomas; Heinrich, Joachim; Rzehak, Peter; Berdel, Dietrich; Borte, Michael; Biebermann, Heike; Krude, Heiko; Rosskopf, Dieter; Rimmbach, Christian; Rief, Winfried; Fromme, Tobias; Klingenspor, Martin; Schürmann, Annette; Schulz, Nadja; Nöthen, Markus M.; Mühleisen, Thomas W.; Erbel, Raimund; Jöckel, Karl-Heinz; Moebus, Susanne; Boes, Tanja; Illig, Thomas; Froguel, Philippe; Hebebrand, Johannes; Meyre, David

    2010-01-01

    Meta-analyses of population-based genome-wide association studies (GWAS) in adults have recently led to the detection of new genetic loci for obesity. Here we aimed to discover additional obesity loci in extremely obese children and adolescents. We also investigated if these results generalize by estimating the effects of these obesity loci in adults and in population-based samples including both children and adults. We jointly analysed two GWAS of 2,258 individuals and followed-up the best, according to lowest p-values, 44 single nucleotide polymorphisms (SNP) from 21 genomic regions in 3,141 individuals. After this DISCOVERY step, we explored if the findings derived from the extremely obese children and adolescents (10 SNPs from 5 genomic regions) generalized to (i) the population level and (ii) to adults by genotyping another 31,182 individuals (GENERALIZATION step). Apart from previously identified FTO, MC4R, and TMEM18, we detected two new loci for obesity: one in SDCCAG8 (serologically defined colon cancer antigen 8 gene; p = 1.85×10−8 in the DISCOVERY step) and one between TNKS (tankyrase, TRF1-interacting ankyrin-related ADP-ribose polymerase gene) and MSRA (methionine sulfoxide reductase A gene; p = 4.84×10−7), the latter finding being limited to children and adolescents as demonstrated in the GENERALIZATION step. The odds ratios for early-onset obesity were estimated at ∼1.10 per risk allele for both loci. Interestingly, the TNKS/MSRA locus has recently been found to be associated with adult waist circumference. In summary, we have completed a meta-analysis of two GWAS which both focus on extremely obese children and adolescents and replicated our findings in a large followed-up data set. We observed that genetic variants in or near FTO, MC4R, TMEM18, SDCCAG8, and TNKS/MSRA were robustly associated with early-onset obesity. We conclude that the currently known major common variants related to obesity overlap to a substantial degree between

  9. Two new Loci for body-weight regulation identified in a joint analysis of genome-wide association studies for early-onset extreme obesity in French and german study groups.

    PubMed

    Scherag, André; Dina, Christian; Hinney, Anke; Vatin, Vincent; Scherag, Susann; Vogel, Carla I G; Müller, Timo D; Grallert, Harald; Wichmann, H-Erich; Balkau, Beverley; Heude, Barbara; Jarvelin, Marjo-Riitta; Hartikainen, Anna-Liisa; Levy-Marchal, Claire; Weill, Jacques; Delplanque, Jérôme; Körner, Antje; Kiess, Wieland; Kovacs, Peter; Rayner, Nigel W; Prokopenko, Inga; McCarthy, Mark I; Schäfer, Helmut; Jarick, Ivonne; Boeing, Heiner; Fisher, Eva; Reinehr, Thomas; Heinrich, Joachim; Rzehak, Peter; Berdel, Dietrich; Borte, Michael; Biebermann, Heike; Krude, Heiko; Rosskopf, Dieter; Rimmbach, Christian; Rief, Winfried; Fromme, Tobias; Klingenspor, Martin; Schürmann, Annette; Schulz, Nadja; Nöthen, Markus M; Mühleisen, Thomas W; Erbel, Raimund; Jöckel, Karl-Heinz; Moebus, Susanne; Boes, Tanja; Illig, Thomas; Froguel, Philippe; Hebebrand, Johannes; Meyre, David

    2010-04-22

    Meta-analyses of population-based genome-wide association studies (GWAS) in adults have recently led to the detection of new genetic loci for obesity. Here we aimed to discover additional obesity loci in extremely obese children and adolescents. We also investigated if these results generalize by estimating the effects of these obesity loci in adults and in population-based samples including both children and adults. We jointly analysed two GWAS of 2,258 individuals and followed-up the best, according to lowest p-values, 44 single nucleotide polymorphisms (SNP) from 21 genomic regions in 3,141 individuals. After this DISCOVERY step, we explored if the findings derived from the extremely obese children and adolescents (10 SNPs from 5 genomic regions) generalized to (i) the population level and (ii) to adults by genotyping another 31,182 individuals (GENERALIZATION step). Apart from previously identified FTO, MC4R, and TMEM18, we detected two new loci for obesity: one in SDCCAG8 (serologically defined colon cancer antigen 8 gene; p = 1.85x10(-8) in the DISCOVERY step) and one between TNKS (tankyrase, TRF1-interacting ankyrin-related ADP-ribose polymerase gene) and MSRA (methionine sulfoxide reductase A gene; p = 4.84x10(-7)), the latter finding being limited to children and adolescents as demonstrated in the GENERALIZATION step. The odds ratios for early-onset obesity were estimated at approximately 1.10 per risk allele for both loci. Interestingly, the TNKS/MSRA locus has recently been found to be associated with adult waist circumference. In summary, we have completed a meta-analysis of two GWAS which both focus on extremely obese children and adolescents and replicated our findings in a large followed-up data set. We observed that genetic variants in or near FTO, MC4R, TMEM18, SDCCAG8, and TNKS/MSRA were robustly associated with early-onset obesity. We conclude that the currently known major common variants related to obesity overlap to a substantial degree

  10. Genome size and genome evolution in diploid Triticeae species.

    PubMed

    Eilam, T; Anikster, Y; Millet, E; Manisterski, J; Sagi-Assif, O; Feldman, M

    2007-11-01

    One of the intriguing issues concerning the dynamics of plant genomes is the occurrence of intraspecific variation in nuclear DNA amount. The aim of this work was to assess the ranges of intraspecific, interspecific, and intergeneric variation in nuclear DNA content of diploid species of the tribe Triticeae (Poaceae) and to examine the relation between life form or habitat and genome size. Altogether, 438 plants representing 272 lines that belong to 22 species were analyzed. Nuclear DNA content was estimated by flow cytometry. Very small intraspecific variation in DNA amount was found between lines of Triticeae diploid species collected from different habitats or between different morphs. In contrast to the constancy in nuclear DNA amount at the intraspecific level, there are significant differences in genome size between the various diploid species. Within the genus Aegilops, the 1C DNA amount ranged from 4.84 pg in A. caudata to 7.52 pg in A. sharonensis; among genera, the 1C DNA amount ranged from 4.18 pg in Heteranthelium piliferum to 9.45 pg in Secale montanum. No evidence was found for a smaller genome size in annual, self-pollinating species relative to perennial, cross-pollinating ones. Diploids that grow in the southern part of the group's distribution have larger genomes than those growing in other parts of the distribution. The contrast between the low variation at the intraspecific level and the high variation at the interspecific one suggests that changes in genome size originated in close temporal proximity to the speciation event, i.e., before, during, or immediately after it. The possible effects of sudden changes in genome size on speciation processes are discussed.

  11. Isopermutation group

    SciTech Connect

    Muktibodh, A. S.

    2015-03-10

    The concept of ‘Isotopy’ as formulated by Ruggero Maria Santilli [1, 2, 3] plays a vital role in the development of Iso mathematics. Santilli defined iso-fields of characteristic zero. In this paper we extend this definition to define Iso-Galois fields [4] which are essentially of non-zero characteristic. Isotopically isomorphic realizations of a group define isopermutation group which gives a clear cut distinction between automorphic groups and isotopic groups.

  12. Navigating yeast genome maintenance with functional genomics.

    PubMed

    Measday, Vivien; Stirling, Peter C

    2016-03-01

    Maintenance of genome integrity is a fundamental requirement of all organisms. To address this, organisms have evolved extremely faithful modes of replication, DNA repair and chromosome segregation to combat the deleterious effects of an unstable genome. Nonetheless, a small amount of genome instability is the driver of evolutionary change and adaptation, and thus a low level of instability is permitted in populations. While defects in genome maintenance almost invariably reduce fitness in the short term, they can create an environment where beneficial mutations are more likely to occur. The importance of this fact is clearest in the development of human cancer, where genome instability is a well-established enabling characteristic of carcinogenesis. This raises the crucial question: what are the cellular pathways that promote genome maintenance and what are their mechanisms? Work in model organisms, in particular the yeast Saccharomyces cerevisiae, has provided the global foundations of genome maintenance mechanisms in eukaryotes. The development of pioneering genomic tools inS. cerevisiae, such as the systematic creation of mutants in all nonessential and essential genes, has enabled whole-genome approaches to identifying genes with roles in genome maintenance. Here, we review the extensive whole-genome approaches taken in yeast, with an emphasis on functional genomic screens, to understand the genetic basis of genome instability, highlighting a range of genetic and cytological screening modalities. By revealing the biological pathways and processes regulating genome integrity, these analyses contribute to the systems-level map of the yeast cell and inform studies of human disease, especially cancer.

  13. Navigating yeast genome maintenance with functional genomics.

    PubMed

    Measday, Vivien; Stirling, Peter C

    2016-03-01

    Maintenance of genome integrity is a fundamental requirement of all organisms. To address this, organisms have evolved extremely faithful modes of replication, DNA repair and chromosome segregation to combat the deleterious effects of an unstable genome. Nonetheless, a small amount of genome instability is the driver of evolutionary change and adaptation, and thus a low level of instability is permitted in populations. While defects in genome maintenance almost invariably reduce fitness in the short term, they can create an environment where beneficial mutations are more likely to occur. The importance of this fact is clearest in the development of human cancer, where genome instability is a well-established enabling characteristic of carcinogenesis. This raises the crucial question: what are the cellular pathways that promote genome maintenance and what are their mechanisms? Work in model organisms, in particular the yeast Saccharomyces cerevisiae, has provided the global foundations of genome maintenance mechanisms in eukaryotes. The development of pioneering genomic tools inS. cerevisiae, such as the systematic creation of mutants in all nonessential and essential genes, has enabled whole-genome approaches to identifying genes with roles in genome maintenance. Here, we review the extensive whole-genome approaches taken in yeast, with an emphasis on functional genomic screens, to understand the genetic basis of genome instability, highlighting a range of genetic and cytological screening modalities. By revealing the biological pathways and processes regulating genome integrity, these analyses contribute to the systems-level map of the yeast cell and inform studies of human disease, especially cancer. PMID:26323482

  14. Exploring Other Genomes: Bacteria.

    ERIC Educational Resources Information Center

    Flannery, Maura C.

    2001-01-01

    Points out the importance of genomes other than the human genome project and provides information on the identified bacterial genomes Pseudomonas aeuroginosa, Leprosy, Cholera, Meningitis, Tuberculosis, Bubonic Plague, and plant pathogens. Considers the computer's use in genome studies. (Contains 14 references.) (YDS)

  15. Hot Groups.

    ERIC Educational Resources Information Center

    Vail, Kathleen

    1996-01-01

    Collaborators sparked by creative ideas and obsessed by a common task may not realize they're part of a "hot group"--a term coined by business professors Harold J. Leavitt and Jean Lipman-Blumen. Spawned by group decision making and employee empowerment, hot groups can flourish in education settings. They're typically small, short lived, and goal…

  16. Galaxy groups

    SciTech Connect

    Brent Tully, R.

    2015-02-01

    Galaxy groups can be characterized by the radius of decoupling from cosmic expansion, the radius of the caustic of second turnaround, and the velocity dispersion of galaxies within this latter radius. These parameters can be a challenge to measure, especially for small groups with few members. In this study, results are gathered pertaining to particularly well-studied groups over four decades in group mass. Scaling relations anticipated from theory are demonstrated and coefficients of the relationships are specified. There is an update of the relationship between light and mass for groups, confirming that groups with mass of a few times 10{sup 12}M{sub ⊙} are the most lit up while groups with more and less mass are darker. It is demonstrated that there is an interesting one-to-one correlation between the number of dwarf satellites in a group and the group mass. There is the suggestion that small variations in the slope of the luminosity function in groups are caused by the degree of depletion of intermediate luminosity systems rather than variations in the number per unit mass of dwarfs. Finally, returning to the characteristic radii of groups, the ratio of first to second turnaround depends on the dark matter and dark energy content of the universe and a crude estimate can be made from the current observations of Ω{sub matter}∼0.15 in a flat topology, with a 68% probability of being less than 0.44.

  17. Genome Maps, a new generation genome browser.

    PubMed

    Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín

    2013-07-01

    Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org.

  18. Genome Maps, a new generation genome browser

    PubMed Central

    Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín

    2013-01-01

    Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org. PMID:23748955

  19. Genome Maps, a new generation genome browser.

    PubMed

    Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín

    2013-07-01

    Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org. PMID:23748955

  20. Molecular phylogeny of RPB2 gene reveals multiple origin, geographic differentiation of H genome, and the relationship of the Y genome to other genomes in Elymus species.

    PubMed

    Sun, Genlou; Ni, Yan; Daley, Tracy

    2008-03-01

    It has been hypothesized from isozymic and cytological studies of Elymus species that the Old and New World taxa may be of separate origin of the H genome in the StH genome species. To test this hypothesis, and estimate the phylogenetic relationships of polyploid Elymus species within the Triticeae, the second largest subunit of RNA polymerase II (RPB2) sequence of 36 Elymus accessions containing StH or StY genomes was analyzed with those of Pseudoroegneria (St), Hordeum (H), Agropyron (P), Australopyrum (W), Lophopyrum(Ee), Thinopyrum(Eb) and Dasypyrum (V). Our data indicated that the H genome in Elymus species is differentiated in accordance with geographical origin, and that the Eurasian and American StH genome species have independent alloploid origins with different H-genome donors. Phylogenetic analysis of Y genome sequences with other genome donors (St, H, P, W) of Elymus revealed that W and P genomes are sister to Y genome with a 87% bootstrap support, and that StY and StH species group might have acquired their RPB2 St sequences from distinct Pseudoroegneria gene pools. Our data did not support the suggestion that the St and Y genomes have the same origin as put forward in a previous study using ITS data. Our result provides some insight on the origin of Y genome and its relationship to other genomes in Elymus. PMID:18262439

  1. Bacterial pathogen genomics and vaccines.

    PubMed

    Moxon, Richard; Rappuoli, Rino

    2002-01-01

    Infectious diseases remain a major cause of deaths and disabilities in the world, the majority of which are caused by bacteria. Although immunisation is the most cost effective and efficient means to control microbial diseases, vaccines are not yet available to prevent many major bacterial infections. Examples include dysentery (shigellosis), gonorrhoea, trachoma, gastric ulcers and cancer (Helicobacter pylori). Improved vaccines are needed to combat some diseases for which current vaccines are inadequate. Tuberculosis, for example, remains rampant throughout most countries in the world and represents a global emergency heightened by the pandemic of HIV. The availability of complete genome sequences has dramatically changed the opportunities for developing novel and improved vaccines and facilitated the efficiency and rapidity of their development. Complete genomic databases provide an inclusive catalogue of all potential candidate vaccines for any bacterial pathogen. In conjunction with adjunct technologies, including bioinformatics, random mutagenesis, microarrays, and proteomics, a systematic and comprehensive approach to identifying vaccine discovery can be undertaken. Genomics must be used in conjunction with population biology to ensure that the vaccine can target all pathogenic strains of a species. A proof in principle of the utility of genomics is provided by the recent exploitation of the complete genome sequence of Neisseria meningitidis group B.

  2. GROUP INEQUALITY

    PubMed Central

    Bowles, Samuel; Loury, Glenn C.; Sethi, Rajiv

    2014-01-01

    We explore the combined effect of segregation in social networks, peer effects, and the relative size of a historically disadvantaged group on the incentives to invest in market-rewarded skills and the dynamics of inequality between social groups. We identify conditions under which group inequality will persist in the absence of differences in ability, credit constraints, or labor market discrimination. Under these conditions, group inequality may be amplified even if initial group differences are negligible. Increases in social integration may destabilize an unequal state and make group equality possible, but the distributional and human capital effects of this depend on the demographic composition of the population. When the size of the initially disadvantaged group is sufficiently small, integration can lower the long-run costs of human capital investment in both groups and result in an increase the aggregate skill share. In contrast, when the initially disadvantaged group is large, integration can induce a fall in the aggregate skill share as the costs of human capital investment rise in both groups. We consider applications to concrete cases and policy implications. PMID:25554727

  3. Genomic sovereignty and the African promise: mining the African genome for the benefit of Africa.

    PubMed

    de Vries, Jantina; Pepper, Michael

    2012-08-01

    Scientific interest in genomics in Africa is on the rise with a number of funding initiatives aimed specifically at supporting research in this area. Genomics research on material of African origin raises a number of important ethical issues. A prominent concern relates to sample export, which is increasingly seen by researchers and ethics committees across the continent as being problematic. The concept of genomic sovereignty proposes that unique patterns of genomic variation can be found in human populations, and that these are commercially, scientifically or symbolically valuable and in need of protection against exploitation. Although it is appealing as a response to increasing concerns regarding sample export, there are a number of important conceptual problems relating to the term. It is not clear, for instance, whether it is appropriate that ownership over human genomic samples should rest with national governments. Furthermore, ethnic groups in Africa are frequently spread across multiple nation states, and protection offered in one state may not prevent researchers from accessing the same group elsewhere. Lastly, scientific evidence suggests that the assumption that genomic data is unique for population groups is false. Although the frequency with which particular variants are found can differ between groups, such genes or variants per se are not unique to any population group. In this paper, the authors describe these concerns in detail and argue that the concept of genomic sovereignty alone may not be adequate to protect the genetic resources of people of African descent. PMID:22493187

  4. 75 FR 52537 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-08-26

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS)...

  5. 75 FR 2148 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-01-14

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group, Genome Research Review... Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS)...

  6. 76 FR 3643 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-01-20

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: January...

  7. 75 FR 26762 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-05-12

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: May 3,...

  8. 78 FR 24223 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-04-24

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... applications. Place: National Human Genome Research Institute, 3rd floor Conf. Room 3146, 5635 Fishers...

  9. JGI Fungal Genomics Program

    SciTech Connect

    Grigoriev, Igor V.

    2011-03-14

    Genomes of energy and environment fungi are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 50 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such 'parts' suggested by comparative genomics and functional analysis in these areas are presented here

  10. Genomic Encyclopedia of Fungi

    SciTech Connect

    Grigoriev, Igor

    2012-08-10

    Genomes of fungi relevant to energy and environment are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 150 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such parts suggested by comparative genomics and functional analysis in these areas are presented here.

  11. Analysis of the biologic differences between porcine circovirus type 2 Group-1 and Group-2 viruses

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Phylogenetically, porcine circovirus type 2 (PCV2) isolates have been divided into two subgroups: Group-1 and Group-2. Although the genomic nucleotide sequences of Group-1 and Group-2 viruses differ by less than 5 percent, distinct amino acid residue motifs have been noted in the coding sequences o...

  12. Genomic definition of species. Revision 2

    SciTech Connect

    Crkvenjakov, R.; Drmanac, R.

    1993-03-01

    A genome is the sum total of the DNA sequences in the cells of an individual organism. The common usage that species possess genomes comes naturally to biochemists, who have shown that all protein and nucleic acid molecules are at the same time species- and individual-specific, with minor individual variations being superimposed on a consensus sequence that is constant for a species. By extension, this property is attributed to the common features of DNA in the chromosomes of members of a given species and is called species genome. Our proposal for the definition of a biological species is as follows: A species comprises a group of actual and potential biological organisms built according to a unique genome program that is recorded, and at least in part expressed, in the structures of their genomic nucleic acid molecule(s), having intragroup sequence differences which can be fully interconverted in the process of organismal reproduction.

  13. [Ethical issues in personal genome research].

    PubMed

    Kato, Kazuto; Minari, Jusaku

    2013-03-01

    The rapid expansion of techniques for studying human genomics has remarkably changed research and practice. It is expected that more progress will be made in the field of medical and biological research owing to the technological advances. Genomics researchers collect human genetic material, including DNA and cells, from a large number of individuals and carry out "personal genome analysis"; as a result, new types of ethical, legal, and social issues (ELSI) have arisen, including issues such as informed consent procedures, data sharing, protection of genetic information, and return of research results. To address these issues, many large research projects have established specialist groups that are devoted to manage ELSI of their research. The guidelines for genomics research set by the government are also expected to be revised accordingly. In this paper, we present an overview of ELSI of personal genome research and discuss necessary measures to tackle these issues.

  14. Genome sizes for all genera of Cycadales.

    PubMed

    Zonneveld, B J M

    2012-01-01

    Nuclear DNA content (2C) is reported for all genera of the Cycadales, using flow cytometry with propidium iodide. Nuclear DNA content ranges from 24 to 64 pg in cycads. This implies that the largest genome contains roughly 40 × 10(9) more base pairs than the smallest genome. The narrow range in nuclear DNA content within a genus is remarkable for such an old group. Furthermore, 42 of the 58 plants measured, covering five genera, have 18 chromosomes. They vary from 36.1 to 64.7 pg, covering the whole range of genome sizes (excluding the genome of Cycas). Hence, their does not seem to be a correlation between genome size and the number of chromosomes. PMID:22117644

  15. The Caenorhabditis elegans genome: a multifractal analysis.

    PubMed

    Vélez, P E; Garreta, L E; Martínez, E; Díaz, N; Amador, S; Tischer, I; Gutiérrez, J M; Moreno, P A

    2010-05-25

    The Caenorhabditis elegans genome has several regular and irregular characteristics in its nucleotide composition; these are observed within and between chromosomes. To study these particularities, we carried out a multifractal analysis, which requires a large number of exponents to characterize scaling properties. We looked for a relationship between the genetic information content of the chromosomes and multifractal parameters and found less multifractality compared to the human genome. Differences in multifractality among chromosomes and in regions of chromosomes, and two group averages of chromosome regions were observed. All these differences were mainly dependent on differences in the contents of repetitive DNA. Based on these properties, we propose a nonlinear model for the structure of the C. elegans genome, with some biological implications. These results suggest that examining differences in multifractality is a viable approach for measuring local variations of genomic information contents along chromosomes. This approach could be extended to other genomes in order to characterize structural and functional regions of chromosomes.

  16. Nuclear DNA content in Sinningia (Gesneriaceae); intraspecific genome size variation and genome characterization in S. speciosa.

    PubMed

    Zaitlin, David; Pierce, Andrew J

    2010-12-01

    The Gesneriaceae (Lamiales) is a family of flowering plants comprising >3000 species of mainly tropical origin, the most familiar of which is the cultivated African violet (Saintpaulia spp.). Species of Gesneriaceae are poorly represented in the lists of taxa sampled for genome size estimation; measurements are available for three species of Ramonda and one each of Haberlea, Saintpaulia, and Streptocarpus, all species of Old World origin. We report here nuclear genome size estimates for 10 species of Sinningia, a neotropical genus largely restricted to Brazil. Flow cytometry of leaf cell nuclei showed that holoploid genome size in Sinningia is very small (approximately two times the size of the Arabidopsis genome), and is small compared to the other six species of Gesneriaceae with genome size estimates. We also documented intraspecific genome size variation of 21%-26% within a group of wild Sinningia speciosa (Lodd.) Hiern collections. In addition, we analyzed 1210 genome survey sequences from S. speciosa to characterize basic features of the nuclear genome such as guanine-cytosine content, types of repetitive elements, numbers of protein-coding sequences, and sequences unique to S. speciosa. We included several other angiosperm species as genome size standards, one of which was the snapdragon (Antirrhinum majus L.; Veronicaceae, Lamiales). Multiple measurements on three accessions indicated that the genome size of A. majus is ~633 × 10⁶ base pairs, which is approximately 40% of the previously published estimate. PMID:21164539

  17. IMGD: an integrated platform supporting comparative genomics and phylogenetics of insect mitochondrial genomes

    PubMed Central

    Lee, Wonhoon; Park, Jongsun; Choi, Jaeyoung; Jung, Kyongyong; Park, Bongsoo; Kim, Donghan; Lee, Jaeyoung; Ahn, Kyohun; Song, Wonho; Kang, Seogchan; Lee, Yong-Hwan; Lee, Seunghwan

    2009-01-01

    Background Sequences and organization of the mitochondrial genome have been used as markers to investigate evolutionary history and relationships in many taxonomic groups. The rapidly increasing mitochondrial genome sequences from diverse insects provide ample opportunities to explore various global evolutionary questions in the superclass Hexapoda. To adequately support such questions, it is imperative to establish an informatics platform that facilitates the retrieval and utilization of available mitochondrial genome sequence data. Results The Insect Mitochondrial Genome Database (IMGD) is a new integrated platform that archives the mitochondrial genome sequences from 25,747 hexapod species, including 112 completely sequenced and 20 nearly completed genomes and 113,985 partially sequenced mitochondrial genomes. The Species-driven User Interface (SUI) of IMGD supports data retrieval and diverse analyses at multi-taxon levels. The Phyloviewer implemented in IMGD provides three methods for drawing phylogenetic trees and displays the resulting trees on the web. The SNP database incorporated to IMGD presents the distribution of SNPs and INDELs in the mitochondrial genomes of multiple isolates within eight species. A newly developed comparative SNU Genome Browser supports the graphical presentation and interactive interface for the identified SNPs/INDELs. Conclusion The IMGD provides a solid foundation for the comparative mitochondrial genomics and phylogenetics of insects. All data and functions described here are available at the web site . PMID:19351385

  18. Nuclear DNA content in Sinningia (Gesneriaceae); intraspecific genome size variation and genome characterization in S. speciosa.

    PubMed

    Zaitlin, David; Pierce, Andrew J

    2010-12-01

    The Gesneriaceae (Lamiales) is a family of flowering plants comprising >3000 species of mainly tropical origin, the most familiar of which is the cultivated African violet (Saintpaulia spp.). Species of Gesneriaceae are poorly represented in the lists of taxa sampled for genome size estimation; measurements are available for three species of Ramonda and one each of Haberlea, Saintpaulia, and Streptocarpus, all species of Old World origin. We report here nuclear genome size estimates for 10 species of Sinningia, a neotropical genus largely restricted to Brazil. Flow cytometry of leaf cell nuclei showed that holoploid genome size in Sinningia is very small (approximately two times the size of the Arabidopsis genome), and is small compared to the other six species of Gesneriaceae with genome size estimates. We also documented intraspecific genome size variation of 21%-26% within a group of wild Sinningia speciosa (Lodd.) Hiern collections. In addition, we analyzed 1210 genome survey sequences from S. speciosa to characterize basic features of the nuclear genome such as guanine-cytosine content, types of repetitive elements, numbers of protein-coding sequences, and sequences unique to S. speciosa. We included several other angiosperm species as genome size standards, one of which was the snapdragon (Antirrhinum majus L.; Veronicaceae, Lamiales). Multiple measurements on three accessions indicated that the genome size of A. majus is ~633 × 10⁶ base pairs, which is approximately 40% of the previously published estimate.

  19. Genomics and Health Impact Update

    MedlinePlus

    ... Genomics in Practice Newborn Screening Pharmacogenomics Reproductive Health Tools and Databases About the Genomics & Health Impact Update The Office of Public Health Genomics provides updated and credible ...

  20. Plant genomics: an overview.

    PubMed

    Campos-de Quiroz, Hugo

    2002-01-01

    Recent technological advancements have substantially expanded our ability to analyze and understand plant genomes and to reduce the gap existing between genotype and phenotype. The fast evolving field of genomics allows scientists to analyze thousand of genes in parallel, to understand the genetic architecture of plant genomes and also to isolate the genes responsible for mutations. Furthermore, whole genomes can now be sequenced. This review addresses these issues and also discusses ways to extract biological meaning from DNA data. Although genomic issuesare addressed from a plant perspective, this review provides insights into the genomic analyses of other organisms. PMID:12462991

  1. Teaching strategies to incorporate genomics education into academic nursing curricula.

    PubMed

    Quevedo Garcia, Sylvia P; Greco, Karen E; Loescher, Lois J

    2011-11-01

    The translation of genomic science into health care has expanded our ability to understand the effects of genomics on human health and disease. As genomic advances continue, nurses are expected to have the knowledge and skills to translate genomic information into improved patient care. This integrative review describes strategies used to teach genomics in academic nursing programs and their facilitators and barriers to inclusion in nursing curricula. The Learning Engagement Model and the Diffusion of Innovations Theory guided the interpretation of findings. CINAHL, Medline, and Web of Science were resources for articles published during the past decade that included strategies for teaching genomics in academic nursing programs. Of 135 articles, 13 met criteria for review. Examples of effective genomics teaching strategies included clinical application through case studies, storytelling, online genomics resources, student self-assessment, guest lecturers, and a genetics focus group. Most strategies were not evaluated for effectiveness.

  2. AGAPE (Automated Genome Analysis PipelinE) for Pan-Genome Analysis of Saccharomyces cerevisiae

    PubMed Central

    Song, Giltae; Dickins, Benjamin J. A.; Demeter, Janos; Engel, Stacia; Dunn, Barbara; Cherry, J. Michael

    2015-01-01

    The characterization and public release of genome sequences from thousands of organisms is expanding the scope for genetic variation studies. However, understanding the phenotypic consequences of genetic variation remains a challenge in eukaryotes due to the complexity of the genotype-phenotype map. One approach to this is the intensive study of model systems for which diverse sources of information can be accumulated and integrated. Saccharomyces cerevisiae is an extensively studied model organism, with well-known protein functions and thoroughly curated phenotype data. To develop and expand the available resources linking genomic variation with function in yeast, we aim to model the pan-genome of S. cerevisiae. To initiate the yeast pan-genome, we newly sequenced or re-sequenced the genomes of 25 strains that are commonly used in the yeast research community using advanced sequencing technology at high quality. We also developed a pipeline for automated pan-genome analysis, which integrates the steps of assembly, annotation, and variation calling. To assign strain-specific functional annotations, we identified genes that were not present in the reference genome. We classified these according to their presence or absence across strains and characterized each group of genes with known functional and phenotypic features. The functional roles of novel genes not found in the reference genome and associated with strains or groups of strains appear to be consistent with anticipated adaptations in specific lineages. As more S. cerevisiae strain genomes are released, our analysis can be used to collate genome data and relate it to lineage-specific patterns of genome evolution. Our new tool set will enhance our understanding of genomic and functional evolution in S. cerevisiae, and will be available to the yeast genetics and molecular biology community. PMID:25781462

  3. AGAPE (Automated Genome Analysis PipelinE) for pan-genome analysis of Saccharomyces cerevisiae.

    PubMed

    Song, Giltae; Dickins, Benjamin J A; Demeter, Janos; Engel, Stacia; Gallagher, Jennifer; Choe, Kisurb; Dunn, Barbara; Snyder, Michael; Cherry, J Michael

    2015-01-01

    The characterization and public release of genome sequences from thousands of organisms is expanding the scope for genetic variation studies. However, understanding the phenotypic consequences of genetic variation remains a challenge in eukaryotes due to the complexity of the genotype-phenotype map. One approach to this is the intensive study of model systems for which diverse sources of information can be accumulated and integrated. Saccharomyces cerevisiae is an extensively studied model organism, with well-known protein functions and thoroughly curated phenotype data. To develop and expand the available resources linking genomic variation with function in yeast, we aim to model the pan-genome of S. cerevisiae. To initiate the yeast pan-genome, we newly sequenced or re-sequenced the genomes of 25 strains that are commonly used in the yeast research community using advanced sequencing technology at high quality. We also developed a pipeline for automated pan-genome analysis, which integrates the steps of assembly, annotation, and variation calling. To assign strain-specific functional annotations, we identified genes that were not present in the reference genome. We classified these according to their presence or absence across strains and characterized each group of genes with known functional and phenotypic features. The functional roles of novel genes not found in the reference genome and associated with strains or groups of strains appear to be consistent with anticipated adaptations in specific lineages. As more S. cerevisiae strain genomes are released, our analysis can be used to collate genome data and relate it to lineage-specific patterns of genome evolution. Our new tool set will enhance our understanding of genomic and functional evolution in S. cerevisiae, and will be available to the yeast genetics and molecular biology community.

  4. Phylogenomics and the Dynamic Genome Evolution of the Genus Streptococcus

    PubMed Central

    Richards, Vincent P.; Palmer, Sara R.; Pavinski Bitar, Paulina D.; Qin, Xiang; Weinstock, George M.; Highlander, Sarah K.; Town, Christopher D.; Burne, Robert A.; Stanhope, Michael J.

    2014-01-01

    The genus Streptococcus comprises important pathogens that have a severe impact on human health and are responsible for substantial economic losses to agriculture. Here, we utilize 46 Streptococcus genome sequences (44 species), including eight species sequenced here, to provide the first genomic level insight into the evolutionary history and genetic basis underlying the functional diversity of all major groups of this genus. Gene gain/loss analysis revealed a dynamic pattern of genome evolution characterized by an initial period of gene gain followed by a period of loss, as the major groups within the genus diversified. This was followed by a period of genome expansion associated with the origins of the present extant species. The pattern is concordant with an emerging view that genomes evolve through a dynamic process of expansion and streamlining. A large proportion of the pan-genome has experienced lateral gene transfer (LGT) with causative factors, such as relatedness and shared environment, operating over different evolutionary scales. Multiple gene ontology terms were significantly enriched for each group, and mapping terms onto the phylogeny showed that those corresponding to genes born on branches leading to the major groups represented approximately one-fifth of those enriched. Furthermore, despite the extensive LGT, several biochemical characteristics have been retained since group formation, suggesting genomic cohesiveness through time, and that these characteristics may be fundamental to each group. For example, proteolysis: mitis group; urea metabolism: salivarius group; carbohydrate metabolism: pyogenic group; and transcription regulation: bovis group. PMID:24625962

  5. Paleogenomic data suggest mammal-like genome size in the ancestral amniote and derived large genome size in amphibians.

    PubMed

    Organ, C L; Canoville, A; Reisz, R R; Laurin, M

    2011-02-01

    An unsolved question in evolutionary genomics is whether amniote genomes have been expanding or contracting since the common ancestor of this diverse group. Here, we report on the polarity of amniote genome size evolution using genome size estimates for 14 extinct tetrapod genera from the Paleozoic and early Mesozoic Eras using osteocyte lacunae size as a correlate. We find substantial support for a phylogenetically controlled regression model relating genome size to osteocyte lacunae size (P of slopes <0.01, r²=0.65, phylogenetic signal λ=0.83). Genome size appears to have been homogeneous across Paleozoic crown-tetrapod lineages (average haploid genome size 2.9-3.7 pg) with values similar to those of extant mammals. The differentiation in genome size and underlying architecture among extant tetrapod lineages likely evolved in the Mesozoic and Cenozoic Eras, with expansion in amphibians, contractions along the diapsid lineage, and no directional change within the synapsid lineage leading to mammals.

  6. Genome size diversity in orchids: consequences and evolution

    PubMed Central

    Leitch, I. J.; Kahandawala, I.; Suda, J.; Hanson, L.; Ingrouille, M. J.; Chase, M. W.; Fay, M. F.

    2009-01-01

    Background The amount of DNA comprising the genome of an organism (its genome size) varies a remarkable 40 000-fold across eukaryotes, yet most groups are characterized by much narrower ranges (e.g. 14-fold in gymnosperms, 3- to 4-fold in mammals). Angiosperms stand out as one of the most variable groups with genome sizes varying nearly 2000-fold. Nevertheless within angiosperms the majority of families are characterized by genomes which are small and vary little. Species with large genomes are mostly restricted to a few monocots families including Orchidaceae. Scope A survey of the literature revealed that genome size data for Orchidaceae are comparatively rare representing just 327 species. Nevertheless they reveal that Orchidaceae are currently the most variable angiosperm family with genome sizes ranging 168-fold (1C = 0·33–55·4 pg). Analysing the data provided insights into the distribution, evolution and possible consequences to the plant of this genome size diversity. Conclusions Superimposing the data onto the increasingly robust phylogenetic tree of Orchidaceae revealed how different subfamilies were characterized by distinct genome size profiles. Epidendroideae possessed the greatest range of genome sizes, although the majority of species had small genomes. In contrast, the largest genomes were found in subfamilies Cypripedioideae and Vanilloideae. Genome size evolution within this subfamily was analysed as this is the only one with reasonable representation of data. This approach highlighted striking differences in genome size and karyotype evolution between the closely related Cypripedium, Paphiopedilum and Phragmipedium. As to the consequences of genome size diversity, various studies revealed that this has both practical (e.g. application of genetic fingerprinting techniques) and biological consequences (e.g. affecting where and when an orchid may grow) and emphasizes the importance of obtaining further genome size data given the considerable

  7. Ethics, genomics, and information retrieval.

    PubMed

    Goodman, K W

    1996-05-01

    The union of genomics and computational information retrieval raises a number of ethical issues, including data sharing, database accuracy, group and subgroup stigma, and privacy and confidentiality. These issues are introduced and assigned a preliminary analysis which, it is hoped, may be of use in more sustained efforts to identify issues, solutions and potential guidelines, to stimulate education, and to strike the most appropriate balance between the rights of individuals and the needs of researchers and society.

  8. Genomic Data Commons | Office of Cancer Genomics

    Cancer.gov

    The NCI’s Center for Cancer Genomics launches the Genomic Data Commons (GDC), a unified data sharing platform for the cancer research community. The mission of the GDC is to enable data sharing across the entire cancer research community, to ultimately support precision medicine in oncology.

  9. Group Grammar

    ERIC Educational Resources Information Center

    Adams, Karen

    2015-01-01

    In this article Karen Adams demonstrates how to incorporate group grammar techniques into a classroom activity. In the activity, students practice using the target grammar to do something they naturally enjoy: learning about each other.

  10. KEGG: kyoto encyclopedia of genes and genomes.

    PubMed

    Kanehisa, M; Goto, S

    2000-01-01

    KEGG (Kyoto Encyclopedia of Genes and Genomes) is a knowledge base for systematic analysis of gene functions, linking genomic information with higher order functional information. The genomic information is stored in the GENES database, which is a collection of gene catalogs for all the completely sequenced genomes and some partial genomes with up-to-date annotation of gene functions. The higher order functional information is stored in the PATHWAY database, which contains graphical representations of cellular processes, such as metabolism, membrane transport, signal transduction and cell cycle. The PATHWAY database is supplemented by a set of ortholog group tables for the information about conserved subpathways (pathway motifs), which are often encoded by positionally coupled genes on the chromosome and which are especially useful in predicting gene functions. A third database in KEGG is LIGAND for the information about chemical compounds, enzyme molecules and enzymatic reactions. KEGG provides Java graphics tools for browsing genome maps, comparing two genome maps and manipulating expression maps, as well as computational tools for sequence comparison, graph comparison and path computation. The KEGG databases are daily updated and made freely available (http://www. genome.ad.jp/kegg/).

  11. Harvesting rice's dispensable genome.

    PubMed

    Wing, Rod A

    2015-01-01

    A rapid and cost-effective approach has been developed to harvest and map the dispensable genome, that is, population-level natural sequence variation within a species that is not present in static genome assemblies. PMID:26429765

  12. Genomic Data Commons launches

    Cancer.gov

    The Genomic Data Commons (GDC), a unified data system that promotes sharing of genomic and clinical data between researchers, launched today with a visit from Vice President Joe Biden to the operations center at the University of Chicago.

  13. GENOMICS AND ENVIRONMENTAL RESEARCH

    EPA Science Inventory

    The impact of recently developed and emerging genomics technologies on environmental sciences has significant implications for human and ecological risk assessment issues. The linkage of data generated from genomics, transcriptomics, proteomics, metabalomics, and ecology can be ...

  14. MaizeGDB's New Genome Browser Project

    Technology Transfer Automated Retrieval System (TEKTRAN)

    MaizeGDB (http://www.maizegdb.org) is the community database for maize genetics and genomics. Based upon the 2006 MaizeGDB Working Group Report (available at http://www.maizegdb.org/working_group.php) and the Allerton Report (http://www.maizegdb.org/AllertonReport.doc), it has become evident that th...

  15. Evolution of gastropod mitochondrial genome arrangements

    PubMed Central

    2008-01-01

    Background Gastropod mitochondrial genomes exhibit an unusually great variety of gene orders compared to other metazoan mitochondrial genome such as e.g those of vertebrates. Hence, gastropod mitochondrial genomes constitute a good model system to study patterns, rates, and mechanisms of mitochondrial genome rearrangement. However, this kind of evolutionary comparative analysis requires a robust phylogenetic framework of the group under study, which has been elusive so far for gastropods in spite of the efforts carried out during the last two decades. Here, we report the complete nucleotide sequence of five mitochondrial genomes of gastropods (Pyramidella dolabrata, Ascobulla fragilis, Siphonaria pectinata, Onchidella celtica, and Myosotella myosotis), and we analyze them together with another ten complete mitochondrial genomes of gastropods currently available in molecular databases in order to reconstruct the phylogenetic relationships among the main lineages of gastropods. Results Comparative analyses with other mollusk mitochondrial genomes allowed us to describe molecular features and general trends in the evolution of mitochondrial genome organization in gastropods. Phylogenetic reconstruction with commonly used methods of phylogenetic inference (ME, MP, ML, BI) arrived at a single topology, which was used to reconstruct the evolution of mitochondrial gene rearrangements in the group. Conclusion Four main lineages were identified within gastropods: Caenogastropoda, Vetigastropoda, Patellogastropoda, and Heterobranchia. Caenogastropoda and Vetigastropoda are sister taxa, as well as, Patellogastropoda and Heterobranchia. This result rejects the validity of the derived clade Apogastropoda (Caenogastropoda + Heterobranchia). The position of Patellogastropoda remains unclear likely due to long-branch attraction biases. Within Heterobranchia, the most heterogeneous group of gastropods, neither Euthyneura (because of the inclusion of P. dolabrata) nor Pulmonata

  16. Exploiting the genome

    SciTech Connect

    Block, S.; Cornwall, J.; Dyson, F.; Koonin, S.; Lewis, N.; Schwitters, R.

    1998-09-11

    In 1997, JASON conducted a DOE-sponsored study of the human genome project with special emphasis on the areas of technology, quality assurance and quality control, and informatics. The present study has two aims: first, to update the 1997 Report in light of recent developments in genome sequencing technology, and second, to consider possible roles for the DOE in the ''post-genomic" era, following acquisition of the complete human genome sequence.

  17. Observing copepods through a genomic lens

    PubMed Central

    2011-01-01

    Background Copepods outnumber every other multicellular animal group. They are critical components of the world's freshwater and marine ecosystems, sensitive indicators of local and global climate change, key ecosystem service providers, parasites and predators of economically important aquatic animals and potential vectors of waterborne disease. Copepods sustain the world fisheries that nourish and support human populations. Although genomic tools have transformed many areas of biological and biomedical research, their power to elucidate aspects of the biology, behavior and ecology of copepods has only recently begun to be exploited. Discussion The extraordinary biological and ecological diversity of the subclass Copepoda provides both unique advantages for addressing key problems in aquatic systems and formidable challenges for developing a focused genomics strategy. This article provides an overview of genomic studies of copepods and discusses strategies for using genomics tools to address key questions at levels extending from individuals to ecosystems. Genomics can, for instance, help to decipher patterns of genome evolution such as those that occur during transitions from free living to symbiotic and parasitic lifestyles and can assist in the identification of genetic mechanisms and accompanying physiological changes associated with adaptation to new or physiologically challenging environments. The adaptive significance of the diversity in genome size and unique mechanisms of genome reorganization during development could similarly be explored. Genome-wide and EST studies of parasitic copepods of salmon and large EST studies of selected free-living copepods have demonstrated the potential utility of modern genomics approaches for the study of copepods and have generated resources such as EST libraries, shotgun genome sequences, BAC libraries, genome maps and inbred lines that will be invaluable in assisting further efforts to provide genomics tools for

  18. Are there ergodic limits to evolution? Ergodic exploration of genome space and convergence.

    PubMed

    McLeish, Tom C B

    2015-12-01

    We examine the analogy between evolutionary dynamics and statistical mechanics to include the fundamental question of ergodicity-the representative exploration of the space of possible states (in the case of evolution this is genome space). Several properties of evolutionary dynamics are identified that allow a generalization of the ergodic dynamics, familiar in dynamical systems theory, to evolution. Two classes of evolved biological structure then arise, differentiated by the qualitative duration of their evolutionary time scales. The first class has an ergodicity time scale (the time required for representative genome exploration) longer than available evolutionary time, and has incompletely explored the genotypic and phenotypic space of its possibilities. This case generates no expectation of convergence to an optimal phenotype or possibility of its prediction. The second, more interesting, class exhibits an evolutionary form of ergodicity-essentially all of the structural space within the constraints of slower evolutionary variables have been sampled; the ergodicity time scale for the system evolution is less than the evolutionary time. In this case, some convergence towards similar optima may be expected for equivalent systems in different species where both possess ergodic evolutionary dynamics. When the fitness maximum is set by physical, rather than co-evolved, constraints, it is additionally possible to make predictions of some properties of the evolved structures and systems. We propose four structures that emerge from evolution within genotypes whose fitness is induced from their phenotypes. Together, these result in an exponential speeding up of evolution, when compared with complete exploration of genomic space. We illustrate a possible case of application and a prediction of convergence together with attaining a physical fitness optimum in the case of invertebrate compound eye resolution.

  19. Group dynamics.

    PubMed

    Scandiffio, A L

    1990-12-01

    Group dynamics play a significant role within any organization, culture, or unit. The important thing to remember with any of these structures is that they are made up of people--people with different ideas, motivations, background, and sometimes different agendas. Most groups, formal or informal, look for a leader in an effort to maintain cohesiveness of the unit. At times, that cultural bond must be developed; once developed, it must be nurtured. There are also times that one of the group no longer finds the culture comfortable and begins to act out behaviorally. It is these times that become trying for the leader as she or he attempts to remain objective when that which was once in the building phase of group cohesiveness starts to fall apart. At all times, the manager must continue to view the employee creating the disturbance as an integral part of the group. It is at this time that it is beneficial to perceive the employee exhibiting problem behaviors as a special employee, as one who needs the benefit of your experience and skills, as one who is still part of the group. It is also during this time that the manager should focus upon her or his own views in the area of power, communication, and the corporate culture of the unit that one has established before attempting to understand another's point of view. Once we understand our own motivation and accept ourselves, it is then that we may move on to offer assistance to another. Once we understand our insecurities recognizing staff dysfunction as a symptom of system dysfunction will not be so threatening to the concept of the manager that we perceive ourselves to be. It takes a secure person to admit that she or he favors staff before deciding to do something to change things. The important thing to know is that it can be done. The favored staff can find a new way of relating to others, the special employee can find new modes of behavior (and even find self-esteem in the process), the group can find new ways

  20. Population genomics of the symbiotic plasmids of sympatric nitrogen-fixing Rhizobium species associated with Phaseolus vulgaris.

    PubMed

    Pérez Carrascal, Olga M; VanInsberghe, David; Juárez, Soledad; Polz, Martin F; Vinuesa, Pablo; González, Víctor

    2016-09-01

    Cultivated common beans are the primary protein source for millions of people around the world who subsist on low-input agriculture, enabled by the symbiotic N2 -fixation these legumes perform in association with rhizobia. Within a single agricultural plot, multiple Rhizobium species can nodulate bean roots, but it is unclear how genetically isolated these species remain in sympatry. To better understand this issue, we sequenced and compared the genomes of 33 strains isolated from the rhizosphere and root nodules of a particular bean variety grown in the same agricultural plot. We found that the Rhizobium species we observed coexist with low genetic recombination across their core genomes. Accessory plasmids thought to be necessary for the saprophytic lifestyle in soil show similar levels of genetic isolation, but with higher rates of recombination than the chromosomes. However, the symbiotic plasmids are extremely similar, with high rates of recombination and do not appear to have co-evolved with the chromosome or accessory plasmids. Therefore, while Rhizobium species are genetically isolated units within the microbial community, a common symbiotic plasmid allows all Rhizobium species to engage in symbiosis with the same host in a single agricultural plot. PMID:27312778

  1. Mitochondrial respiration and genomic analysis provide insight into the influence of the symbiotic bacterium on host trypanosomatid oxygen consumption.

    PubMed

    Azevedo-Martins, A C; Machado, A C L; Klein, C C; Ciapina, L; Gonzaga, L; Vasconcelos, A T R; Sagot, M F; DE Souza, W; Einicker-Lamas, M; Galina, A; Motta, M C M

    2015-02-01

    Certain trypanosomatids co-evolve with an endosymbiotic bacterium in a mutualistic relationship that is characterized by intense metabolic exchanges. Symbionts were able to respire for up to 4 h after isolation from Angomonas deanei. FCCP (carbonyl cyanide-4-(trifluoromethoxy)phenylhydrazone) similarly increased respiration in wild-type and aposymbiotic protozoa, though a higher maximal O2 consumption capacity was observed in the symbiont-containing cells. Rotenone, a complex I inhibitor, did not affect A. deanei respiration, whereas TTFA (thenoyltrifluoroacetone), a complex II activity inhibitor, completely blocked respiration in both strains. Antimycin A and cyanide, inhibitors of complexes III and IV, respectively, abolished O2 consumption, but the aposymbiotic protozoa were more sensitive to both compounds. Oligomycin did not affect cell respiration, whereas carboxyatractyloside (CAT), an inhibitor of the ADP-ATP translocator, slightly reduced O2 consumption. In the A. deanei genome, sequences encoding most proteins of the respiratory chain are present. The symbiont genome lost part of the electron transport system (ETS), but complex I, a cytochrome d oxidase, and FoF1-ATP synthase remain. In conclusion, this work suggests that the symbiont influences the mitochondrial respiration of the host protozoan. PMID:25160925

  2. Mitochondrial respiration and genomic analysis provide insight into the influence of the symbiotic bacterium on host trypanosomatid oxygen consumption.

    PubMed

    Azevedo-Martins, A C; Machado, A C L; Klein, C C; Ciapina, L; Gonzaga, L; Vasconcelos, A T R; Sagot, M F; DE Souza, W; Einicker-Lamas, M; Galina, A; Motta, M C M

    2015-02-01

    Certain trypanosomatids co-evolve with an endosymbiotic bacterium in a mutualistic relationship that is characterized by intense metabolic exchanges. Symbionts were able to respire for up to 4 h after isolation from Angomonas deanei. FCCP (carbonyl cyanide-4-(trifluoromethoxy)phenylhydrazone) similarly increased respiration in wild-type and aposymbiotic protozoa, though a higher maximal O2 consumption capacity was observed in the symbiont-containing cells. Rotenone, a complex I inhibitor, did not affect A. deanei respiration, whereas TTFA (thenoyltrifluoroacetone), a complex II activity inhibitor, completely blocked respiration in both strains. Antimycin A and cyanide, inhibitors of complexes III and IV, respectively, abolished O2 consumption, but the aposymbiotic protozoa were more sensitive to both compounds. Oligomycin did not affect cell respiration, whereas carboxyatractyloside (CAT), an inhibitor of the ADP-ATP translocator, slightly reduced O2 consumption. In the A. deanei genome, sequences encoding most proteins of the respiratory chain are present. The symbiont genome lost part of the electron transport system (ETS), but complex I, a cytochrome d oxidase, and FoF1-ATP synthase remain. In conclusion, this work suggests that the symbiont influences the mitochondrial respiration of the host protozoan.

  3. Genomic and transcriptomic alterations following hybridisation and genome doubling in trigenomic allohexaploid Brassica carinata × Brassica rapa.

    PubMed

    Xu, Y; Zhao, Q; Mei, S; Wang, J

    2012-09-01

    Allopolyploidisation is a prominent evolutionary force that involves two major events: interspecific hybridisation and genome doubling. Both events have important functional consequences in shaping the genomic architecture of the neo-allopolyploids. The respective effects of hybridisation and genome doubling upon genomic and transcriptomic changes in Brassica allopolyploids are unresolved. In this study, amplified fragment length polymorphism (AFLP), methylation-sensitive amplification polymorphism (MSAP) and cDNA-AFLP approaches were used to track genetic, epigenetic and transcriptional changes in both allohexaploid Brassica (ArArBcBcCcCc genome) and triploid hybrids (ArBcCc genome). Results from these groups were compared with each other and also to their parents Brassica carinata (BBCC genome) and Brassica rapa (AA genome). Rapid and dramatic genetic, DNA methylation and gene expression changes were detected in the triploid hybrids. During the shift from triploidy to allohexaploidy, some of the hybridisation-induced alterations underwent reversion. Additionally, novel genetic, epigenetic and transcriptional alterations were also detected. The proportions of A-genome-specific DNA methylation and gene expression alterations were significantly greater than those of BC-genome-specific alterations in the triploid hybrids. However, the two parental genomes were equally affected during the ploidy shift. Hemi-CCG methylation changes induced by hybridisation were recovered after genome doubling. Full-CG methylation changes were a more general process initiated in the hybrid and continued after genome doubling. These results indicate that genome doubling could ameliorate genomic and transcriptomic alterations induced by hybridisation and instigate additional alterations in trigenomic Brassica allohexaploids. Moreover, genome doubling also modified hybridisation-induced progenitor genome-biased alterations and epigenetic alteration characteristics.

  4. Genome-scale co-evolutionary inference identifies functions and clients of bacterial Hsp90.

    PubMed

    Press, Maximilian O; Li, Hui; Creanza, Nicole; Kramer, Günter; Queitsch, Christine; Sourjik, Victor; Borenstein, Elhanan

    2013-01-01

    The molecular chaperone Hsp90 is essential in eukaryotes, in which it facilitates the folding of developmental regulators and signal transduction proteins known as Hsp90 clients. In contrast, Hsp90 is not essential in bacteria, and a broad characterization of its molecular and organismal function is lacking. To enable such characterization, we used a genome-scale phylogenetic analysis to identify genes that co-evolve with bacterial Hsp90. We find that genes whose gain and loss were coordinated with Hsp90 throughout bacterial evolution tended to function in flagellar assembly, chemotaxis, and bacterial secretion, suggesting that Hsp90 may aid assembly of protein complexes. To add to the limited set of known bacterial Hsp90 clients, we further developed a statistical method to predict putative clients. We validated our predictions by demonstrating that the flagellar protein FliN and the chemotaxis kinase CheA behaved as Hsp90 clients in Escherichia coli, confirming the predicted role of Hsp90 in chemotaxis and flagellar assembly. Furthermore, normal Hsp90 function is important for wild-type motility and/or chemotaxis in E. coli. This novel function of bacterial Hsp90 agreed with our subsequent finding that Hsp90 is associated with a preference for multiple habitats and may therefore face a complex selection regime. Taken together, our results reveal previously unknown functions of bacterial Hsp90 and open avenues for future experimental exploration by implicating Hsp90 in the assembly of membrane protein complexes and adaptation to novel environments. PMID:23874229

  5. The Genomic Medicine Game.

    PubMed

    Tran, Elvis; de Andrés-Galiana, Enrique J; Benitez, Sonia; Martin-Sanchez, Fernando; Lopez-Campos, Guillermo H

    2016-01-01

    With advancements in genomics technology, health care has been improving and new paradigms of medicine such as genomic medicine have evolved. The education of clinicians, researchers and students to face the challenges posed by these new approaches, however, has been often lagging behind. From this the Genomic Medicine Game, an educational tool, was created for the purpose of conceptualizing the key components of Genomic Medicine. A number of phenotype-genotype associations were found through a literature review, which was used to be a base for the concepts the Genomic Medicine Game would focus on. Built in Java, the game was successfully tested with promising results. PMID:27577486

  6. Group Learning.

    ERIC Educational Resources Information Center

    Black, Susan

    1992-01-01

    Research suggests that cooperative learning works best when students are first taught group-processing skills, such as leadership, decision making, communication, trust building, and conflict management. Inadequate teacher training and boring assignments can torpedo cooperative learning efforts. Administrators should reassure teachers with…

  7. Genome structure analysis of molluscs revealed whole genome duplication and lineage specific repeat variation.

    PubMed

    Yoshida, Masa-aki; Ishikura, Yukiko; Moritaki, Takeya; Shoguchi, Eiichi; Shimizu, Kentaro K; Sese, Jun; Ogura, Atsushi

    2011-09-01

    Comparative genome structure analysis allows us to identify novel genes, repetitive sequences and gene duplications. To explore lineage-specific genomic changes of the molluscs that is good model for development of nervous system in invertebrate, we conducted comparative genome structure analyses of three molluscs, pygmy squid, nautilus and scallops using partial genome shotgun sequencing. Most effective elements on the genome structural changes are repetitive elements (REs) causing expansion of genome size and whole genome duplication producing large amount of novel functional genes. Therefore, we investigated variation and proportion of REs and whole genome duplication. We, first, identified variations of REs in the three molluscan genomes by homology-based and de novo RE detection. Proportion of REs were 9.2%, 4.0%, and 3.8% in the pygmy squid, nautilus and scallop, respectively. We, then, estimated genome size of the species as 2.1, 4.2 and 1.8 Gb, respectively, with 2× coverage frequency and DNA sequencing theory. We also performed a gene duplication assay based on coding genes, and found that large-scale duplication events occurred after divergence from the limpet Lottia, an out-group of the three molluscan species. Comparison of all the results suggested that RE expansion did not relate to the increase in genome size of nautilus. Despite close relationships to nautilus, the squid has the largest portion of REs and smaller genome size than nautilus. We also identified lineage-specific RE and gene-family expansions, possibly relate to acquisition of the most complicated eye and brain systems in the three species.

  8. Genomic and comparative genomic analyses of Rickettsia heilongjiangensis provide insight into its evolution and pathogenesis.

    PubMed

    Duan, Changsong; Xiong, Xiaolu; Qi, Yong; Gong, Wenping; Jiao, Jun; Wen, Bohai

    2014-08-01

    Rickettsia heilongjiangensis, the causative agent of far eastern spotted fever, is an obligate intracellular gram-negative bacterium that belongs to the spotted fever group rickettsiae. To understand the evolution and pathogenesis of R. heilongjiangensis, we analyzed its genome and compared it with other rickettsial genomes available in GenBank. The R. heilongjiangensis chromosome contains 1333 genes, including 1297 protein coding genes and 36 RNA coding genes. The genome also contains 121 pseudogenes, 54 insertion sequences, and 39 tandem repeats. Sixteen genes encoding the major components of the type IV secretion systems were identified in the R. heilongjiangensis genome. In total, 37 β-barrel outer membrane proteins were predicted in the genome, eight of which have been previously confirmed to be outer membrane proteins. In addition, 266 potential virulence factor genes, seven partially deleted antibiotic resistance genes, and a genomic island were identified in the genome. The codon usage in the genome is compatible with its low GC content, and the amino acid usage shows apparent bias. A comparative genomic analysis showed that R. heilongjiangensis and R. japonica share one unique fragment that may be a target sequence for a diagnostic assay. The orthologs of 37 genes of R. heilongjiangensis were found in pathogenic R. rickettsii str. Sheila Smith but not in non-pathogenic R. rickettsii str. Iowa, which may explain why R. heilongjiangensis is pathogenic. Pan-genome analysis showed that R. heilongjiangensis and 42 other rickettsiae strains share 693 core genes with a pan-genome size of 4837 genes. The pan-genome-based phylogeny showed that R. heilongjiangensis was closely related to R. japonica.

  9. Bacterial Genome Instability

    PubMed Central

    Darmon, Elise

    2014-01-01

    SUMMARY Bacterial genomes are remarkably stable from one generation to the next but are plastic on an evolutionary time scale, substantially shaped by horizontal gene transfer, genome rearrangement, and the activities of mobile DNA elements. This implies the existence of a delicate balance between the maintenance of genome stability and the tolerance of genome instability. In this review, we describe the specialized genetic elements and the endogenous processes that contribute to genome instability. We then discuss the consequences of genome instability at the physiological level, where cells have harnessed instability to mediate phase and antigenic variation, and at the evolutionary level, where horizontal gene transfer has played an important role. Indeed, this ability to share DNA sequences has played a major part in the evolution of life on Earth. The evolutionary plasticity of bacterial genomes, coupled with the vast numbers of bacteria on the planet, substantially limits our ability to control disease. PMID:24600039

  10. Advancing Eucalyptus Genomics: Cytogenomics Reveals Conservation of Eucalyptus Genomes

    PubMed Central

    Ribeiro, Teresa; Barrela, Ricardo M.; Bergès, Hélène; Marques, Cristina; Loureiro, João; Morais-Cecílio, Leonor; Paiva, Jorge A. P.

    2016-01-01

    The genus Eucalyptus encloses several species with high ecological and economic value, being the subgenus Symphyomyrtus one of the most important. Species such as E. grandis and E. globulus are well characterized at the molecular level but knowledge regarding genome and chromosome organization is very scarce. Here we characterized and compared the karyotypes of three economically important species, E. grandis, E. globulus, and E. calmadulensis, and three with ecological relevance, E. pulverulenta, E. cornuta, and E. occidentalis, through an integrative approach including genome size estimation, fluorochrome banding, rDNA FISH, and BAC landing comprising genes involved in lignin biosynthesis. All karyotypes show a high degree of conservation with pericentromeric 35S and 5S rDNA loci in the first and third pairs, respectively. GC-rich heterochromatin was restricted to the 35S rDNA locus while the AT-rich heterochromatin pattern was species-specific. The slight differences in karyotype formulas and distribution of AT-rich heterochromatin, along with genome sizes estimations, support the idea of Eucalyptus genome evolution by local expansions of heterochromatin clusters. The unusual co-localization of both rDNA with AT-rich heterochromatin was attributed mainly to the presence of silent transposable elements in those loci. The cinnamoyl CoA reductase gene (CCR1) previously assessed to linkage group 10 (LG10) was clearly localized distally at the long arm of chromosome 9 establishing an unexpected correlation between the cytogenetic chromosome 9 and the LG10. Our work is novel and contributes to the understanding of Eucalyptus genome organization which is essential to develop successful advanced breeding strategies for this genus. PMID:27148332

  11. Sequencing three crocodilian genomes to illuminate the evolution of archosaurs and amniotes

    PubMed Central

    2012-01-01

    The International Crocodilian Genomes Working Group (ICGWG) will sequence and assemble the American alligator (Alligator mississippiensis), saltwater crocodile (Crocodylus porosus) and Indian gharial (Gavialis gangeticus) genomes. The status of these projects and our planned analyses are described. PMID:22293439

  12. Characterization of fHbp, nhba (gna2132), nadA, porA, sequence type (ST), and genomic presence of IS1301 in group B meningococcal ST269 clonal complex isolates from England and Wales.

    PubMed

    Lucidarme, Jay; Comanducci, Maurizio; Findlow, Jamie; Gray, Stephen J; Kaczmarski, Edward B; Guiver, Malcolm; Kugelberg, Elisabeth; Vallely, Pamela J; Oster, Philipp; Pizza, Mariagrazia; Bambini, Stefania; Muzzi, Alessandro; Tang, Christoph M; Borrow, Ray

    2009-11-01

    Highly effective glycoconjugate vaccines exist against four of the five major pathogenic groups of meningococci: A, C, W-135, and Y. An equivalent vaccine against group B meningococci (menB) has remained elusive due to the poorly immunogenic capsular polysaccharide. A promising alternative, the investigational recombinant menB (rMenB)- outer membrane vesicle (OMV) vaccine, contains fHBP, NHBA (previously GNA2132), NadA, and outer membrane vesicles (OMVs) from the New Zealand MeNZB vaccine. MenB currently accounts for 90% of meningococcal disease in England and Wales, where the multilocus sequence type (ST) 269 (ST269) clonal complex (cc269) has recently expanded to account for a third of menB cases. To assess the potential cc269 coverage of the rMenB-OMV vaccine, English and Welsh cc269 isolates from the past decade were genetically characterized with respect to fHBP, NHBA, and NadA. All of the isolates harbored fHbp and nhba alleles, while 98% of the cc269 isolates were devoid of nadA. Subvariant profiling of fHbp, nhba, and porA against STs revealed the presence of two broadly distinct and well-defined clusters of isolates, centered around ST269 and ST275, respectively. An additional molecular marker, insertion sequence IS1301, was found to be present in 100% and <2% of isolates of the respective clusters. On the basis of the genetic data, the potential rMenB-OMV coverage of cc269 in England and Wales is high (up to 100%) within both clusters. Expression studies and serum bactericidal antibody assays will serve to enhance predictions of coverage and will augment ongoing studies regarding the significance of IS1301 within the ST269 cluster.

  13. Enabling responsible public genomics.

    PubMed

    Conley, John M; Doerr, Adam K; Vorhaus, Daniel B

    2010-01-01

    As scientific understandings of genetics advance, researchers require increasingly rich datasets that combine genomic data from large numbers of individuals with medical and other personal information. Linking individuals' genetic data and personal information precludes anonymity and produces medically significant information--a result not contemplated by the established legal and ethical conventions governing human genomic research. To pursue the next generation of human genomic research and commerce in a responsible fashion, scientists, lawyers, and regulators must address substantial new issues, including researchers' duties with respect to clinically significant data, the challenges to privacy presented by genomic data, the boundary between genomic research and commerce, and the practice of medicine. This Article presents a new model for understanding and addressing these new challenges--a "public genomics" premised on the idea that ethically, legally, and socially responsible genomics research requires openness, not privacy, as its organizing principle. Responsible public genomics combines the data contributed by informed and fully consenting information altruists and the research potential of rich datasets in a genomic commons that is freely and globally available. This Article examines the risks and benefits of this public genomics model in the context of an ambitious genetic research project currently under way--the Personal Genome Project. This Article also (i) demonstrates that large-scale genomic projects are desirable, (ii) evaluates the risks and challenges presented by public genomics research, and (iii) determines that the current legal and regulatory regimes restrict beneficial and responsible scientific inquiry while failing to adequately protect participants. The Article concludes by proposing a modified normative and legal framework that embraces and enables a future of responsible public genomics.

  14. Neisseria Base: a comparative genomics database for Neisseria meningitidis.

    PubMed

    Katz, Lee S; Humphrey, Jay C; Conley, Andrew B; Nelakuditi, Viswateja; Kislyuk, Andrey O; Agrawal, Sonia; Jayaraman, Pushkala; Harcourt, Brian H; Olsen-Rasmussen, Melissa A; Frace, Michael; Sharma, Nitya V; Mayer, Leonard W; Jordan, I King

    2011-01-01

    Neisseria meningitidis is an important pathogen, causing life-threatening diseases including meningitis, septicemia and in some cases pneumonia. Genomic studies hold great promise for N. meningitidis research, but substantial database resources are needed to deal with the wealth of information that comes with completely sequenced and annotated genomes. To address this need, we developed Neisseria Base (NBase), a comparative genomics database and genome browser that houses and displays publicly available N. meningitidis genomes. In addition to existing N. meningitidis genome sequences, we sequenced and annotated 19 new genomes using 454 pyrosequencing and the CG-Pipeline genome analysis tool. In total, NBase hosts 27 complete N. meningitidis genome sequences along with their associated annotations. The NBase platform is designed to be scalable, via the underlying database schema and modular code architecture, such that it can readily incorporate new genomes and their associated annotations. The front page of NBase provides user access to these genomes through searching, browsing and downloading. NBase search utility includes BLAST-based sequence similarity searches along with a variety of semantic search options. All genomes can be browsed using a modified version of the GBrowse platform, and a plethora of information on each gene can be viewed using a customized details page. NBase also has a whole-genome comparison tool that yields single-nucleotide polymorphism differences between two user-defined groups of genomes. Using the virulent ST-11 lineage as an example, we demonstrate how this comparative genomics utility can be used to identify novel genomic markers for molecular profiling of N. meningitidis. PMID:21930505

  15. Local Group

    NASA Astrophysics Data System (ADS)

    Mateo, M.; Murdin, P.

    2000-11-01

    Not long after EDWIN HUBBLE established that galaxies are `island universes' similar to our home galaxy, the MILKY WAY, he realized that a few of these external galaxies are considerably closer to us than any others. In 1936 he first coined the term `Local Group' in his famous book The Realm of the Nebulae to identify our nearest galactic neighbors. More than 60 yr later, the galaxies of the Loca...

  16. Underrepresented groups

    NASA Technical Reports Server (NTRS)

    Peters, David A.

    1990-01-01

    The problem with the shortage of under represented groups in science and engineering is absolutely crucial, especially considering that U.S. will experience a shortage of 560,000 science and engineering personnel by the year 2010. Most studies by the National Science Foundation also concluded that projected shortages cannot be alleviated without significant increases in the involvement of Blacks, Hispanics, Native Americans, handicapped persons, and women.

  17. Co-evolving non-uniform cellular automata to perform computations

    NASA Astrophysics Data System (ADS)

    Sipper, Moshe

    A major impediment of cellular automata (CA) stems from the difficulty of utilizing their complex behavior to perform useful computations. Recent studies by Packard and Mitchell et al. have shown that CAs can be evolved to perform a computational task. In this paper non-uniform CAs are studied, where each cell may contain a different rule, in contrast to the original, uniform model. We describe experiments in which non-uniform CAs are evolved to perform the computational task using a local, co-evolutionary algorithm. For radius r = 3 we attain peak performance values of 0.92 comparable to those obtained for uniform CAs (0.93-0.95). This is notable considering the huge search spaces involved, much larger than the uniform case. Smaller radius CAs (previously unstudied in this context) attain performance values of 0.93-0.94. For r = 1 this is considerably higher than the maximal possible uniform CA performance of 0.83, suggesting that non-uniformity reduces connectivity requirements. We thus demonstrate that: (1) non-uniform CAs can attain high computational performance, and (2) such systems can be evolved rather than designed.

  18. Linking environmental risk assessment and communication: An experiment in co-evolving scientific and social knowledge

    USGS Publications Warehouse

    Graffy, E.A.; Booth, N.L.

    2008-01-01

    Dissemination of information to decision-makers and enhanced methods of public participation are often put forward as antidotes to a perceived disconnect between risk assessment and risk communication in the public domain. However, mechanisms that support both the provision of routine, timely and relevant technical knowledge to the public and meaningful opportunities for public participation in the evaluation and management of risk are few. We argue for the need to re-conceptualise the institutional context in which risk research and communication occur as one in which scientific knowledge and public understanding are co-evolutionary instead of independent or sequential. Here, we report on an experiment to promote coevolution of environmental risk assessment and risk communication through the instrumental use of a web-based platform that dynamically links expert and public discourses through common information sources, linked scenario evaluations, and opportunities for iterative dialogue. On the basis of technical feasibility, research value and public communication capacity, we conclude that there is potential for further refinement of the methodologies presented here. Copyright ?? 2008 Inderscience Enterprises Ltd.

  19. Beyond Dyadic Interdependence: Actor-Oriented Models for Co-Evolving Social Networks and Individual Behaviors

    ERIC Educational Resources Information Center

    Burk, William J.; Steglich, Christian E. G.; Snijders, Tom A. B.

    2007-01-01

    Actor-oriented models are described as a longitudinal strategy for examining the co-evolution of social networks and individual behaviors. We argue that these models provide advantages over conventional approaches due to their ability to account for inherent dependencies between individuals embedded in a social network (i.e., reciprocity,…

  20. Whole-exome/genome sequencing and genomics.

    PubMed

    Grody, Wayne W; Thompson, Barry H; Hudgins, Louanne

    2013-12-01

    As medical genetics has progressed from a descriptive entity to one focused on the functional relationship between genes and clinical disorders, emphasis has been placed on genomics. Genomics, a subelement of genetics, is the study of the genome, the sum total of all the genes of an organism. The human genome, which is contained in the 23 pairs of nuclear chromosomes and in the mitochondrial DNA of each cell, comprises >6 billion nucleotides of genetic code. There are some 23,000 protein-coding genes, a surprisingly small fraction of the total genetic material, with the remainder composed of noncoding DNA, regulatory sequences, and introns. The Human Genome Project, launched in 1990, produced a draft of the genome in 2001 and then a finished sequence in 2003, on the 50th anniversary of the initial publication of Watson and Crick's paper on the double-helical structure of DNA. Since then, this mass of genetic information has been translated at an ever-increasing pace into useable knowledge applicable to clinical medicine. The recent advent of massively parallel DNA sequencing (also known as shotgun, high-throughput, and next-generation sequencing) has brought whole-genome analysis into the clinic for the first time, and most of the current applications are directed at children with congenital conditions that are undiagnosable by using standard genetic tests for single-gene disorders. Thus, pediatricians must become familiar with this technology, what it can and cannot offer, and its technical and ethical challenges. Here, we address the concepts of human genomic analysis and its clinical applicability for primary care providers.

  1. Legume genomes: more than peas in a pod.

    PubMed

    Young, Nevin Dale; Mudge, Joann; Ellis, T H Noel

    2003-04-01

    A growing array of sequence-based tools is helping to reveal the organization, evolution and syntenic relationships of legume genomes. The results indicate that legumes form a coherent taxonomic group with frequent and widespread macro- and microsynteny. This is good news for two model legume systems, Medicago truncatula and Lotus japonicus. Indeed, both models have recently been used to clone and characterize genes for nodulation-related receptors that were originally described in legumes with more complex genomes. Studies of legume genomes have also provided insight into genome size, gene clustering, genome duplications and repetitive elements. To understand legume genomes better, it will be necessary to develop tools for studying under-represented taxa beyond the relatively small group of economically important species that have been examined so far.

  2. HeteroGenome: database of genome periodicity

    PubMed Central

    Chaley, Maria; Kutyrkin, Vladimir; Tulbasheva, Gayane; Teplukhina, Elena; Nazipova, Nafisa

    2014-01-01

    We present the first release of the HeteroGenome database collecting latent periodicity regions in genomes. Tandem repeats and highly divergent tandem repeats along with the regions of a new type of periodicity, known as profile periodicity, have been collected for the genomes of Saccharomyces cerevisiae, Arabidopsis thaliana, Caenorhabditis elegans and Drosophila melanogaster. We obtained data with the aid of a spectral-statistical approach to search for reliable latent periodicity regions (with periods up to 2000 bp) in DNA sequences. The original two-level mode of data presentation (a broad view of the region of latent periodicity and a second level indicating conservative fragments of its structure) was further developed to enable us to obtain the estimate, without redundancy, that latent periodicity regions make up ∼10% of the analyzed genomes. Analysis of the quantitative and qualitative content of located periodicity regions on all chromosomes of the analyzed organisms revealed dominant characteristic types of periodicity in the genomes. The pattern of density distribution of latent periodicity regions on chromosome unambiguously characterizes each chromosome in genome. Database URL: http://www.jcbi.ru/lp_baze/ PMID:24857969

  3. The mitochondrial genome of Grateloupia taiwanensis (Halymeniaceae, Rhodophyta) and comparative mitochondrial genomics of red algae.

    PubMed

    DePriest, Michael S; Bhattacharya, Debashish; López-Bautista, Juan M

    2014-10-01

    Although red algae are economically highly valuable for their gelatinous cell wall compounds as well as being integral parts of marine benthic habitats, very little genome data are currently available. We present mitochondrial genome sequence data from the red alga Grateloupia taiwanensis S.-M. Lin & H.-Y. Liang. Comprising 28,906 nucleotide positions, the mitochondrial genome contig contains 25 protein-coding genes and 24 transfer RNA genes. It is highly similar to other red algal genomes in gene content as well as overall structure. An intron in the cox1 gene was found to be shared by G. taiwanensis and Grateloupia angusta (Okamura) S. Kawaguchi & H. W. Wang. We also used whole-genome alignments to compare G. taiwanensis to different groups of red algae, and these results are consistent with the currently accepted phylogeny of Rhodophyta.

  4. The genomes of root-knot nematodes.

    PubMed

    Bird, David McK; Williamson, Valerie M; Abad, Pierre; McCarter, James; Danchin, Etienne G J; Castagnone-Sereno, Philippe; Opperman, Charles H

    2009-01-01

    Plant-parasitic nematodes are the most destructive group of plant pathogens worldwide and are extremely challenging to control. The recent completion of two root-knot nematode genomes opens the way for a comparative genomics approach to elucidate the success of these parasites. Sequencing revealed that Meloidogyne hapla, a diploid that reproduces by facultative, meiotic parthenogenesis, encodes approximately 14,200 genes in a compact, 54 Mpb genome. Indeed, this is the smallest metazoan genome completed to date. By contrast, the 86 Mbp Meloidogyne incognita genome encodes approximately 19,200 genes. This species reproduces by obligate mitotic parthenogenesis and exhibits a complex pattern of aneuploidy. The genome includes triplicated regions and contains allelic pairs with exceptionally high degrees of sequence divergence, presumably reflecting adaptations to the strictly asexual reproductive mode. Both root-knot nematode genomes have compacted gene families compared with the free-living nematode Caenorhabditis elegans, and both encode large suites of enzymes that uniquely target the host plant. Acquisition of these genes, apparently via horizontal gene transfer, and their subsequent expansion and diversification point to the evolutionary history of these parasites. It also suggests new routes to their control. PMID:19400640

  5. Comparative Genome Analysis of Enterobacter cloacae

    PubMed Central

    Liu, Wing-Yee; Wong, Chi-Fat; Chung, Karl Ming-Kar; Jiang, Jing-Wei; Leung, Frederick Chi-Ching

    2013-01-01

    The Enterobacter cloacae species includes an extremely diverse group of bacteria that are associated with plants, soil and humans. Publication of the complete genome sequence of the plant growth-promoting endophytic E. cloacae subsp. cloacae ENHKU01 provided an opportunity to perform the first comparative genome analysis between strains of this dynamic species. Examination of the pan-genome of E. cloacae showed that the conserved core genome retains the general physiological and survival genes of the species, while genomic factors in plasmids and variable regions determine the virulence of the human pathogenic E. cloacae strain; additionally, the diversity of fimbriae contributes to variation in colonization and host determination of different E. cloacae strains. Comparative genome analysis further illustrated that E. cloacae strains possess multiple mechanisms for antagonistic action against other microorganisms, which involve the production of siderophores and various antimicrobial compounds, such as bacteriocins, chitinases and antibiotic resistance proteins. The presence of Type VI secretion systems is expected to provide further fitness advantages for E. cloacae in microbial competition, thus allowing it to survive in different environments. Competition assays were performed to support our observations in genomic analysis, where E. cloacae subsp. cloacae ENHKU01 demonstrated antagonistic activities against a wide range of plant pathogenic fungal and bacterial species. PMID:24069314

  6. A genomic perspective on hybridization and speciation

    PubMed Central

    Payseur, Bret A.; Rieseberg, Loren H.

    2016-01-01

    Hybridization among diverging lineages is common in nature. Genomic data provide a special opportunity to characterize the history of hybridization and the genetic basis of speciation. We review existing methods and empirical studies to identify recent advances in the genomics of hybridization, as well as issues that need to be addressed. Notable progress has been made in the development of methods for detecting hybridization and inferring individual ancestries. However, few approaches reconstruct the magnitude and timing of gene flow, estimate the fitness of hybrids or incorporate knowledge of recombination rate. Empirical studies indicate that the genomic consequences of hybridization are complex, including a highly heterogeneous landscape of differentiation. Inferred characteristics of hybridization differ substantially among species groups. Loci showing unusual patterns – which may contribute to reproductive barriers – are usually scattered throughout the genome, with potential enrichment in sex chromosomes and regions of reduced recombination. We caution against the growing trend of interpreting genomic variation in summary statistics across genomes as evidence of differential gene flow. We argue that converting genomic patterns into useful inferences about hybridization will ultimately require models and methods that directly incorporate key ingredients of speciation, including the dynamic nature of gene flow, selection acting in hybrid populations and recombination rate variation. PMID:26836441

  7. The UCSC Cancer Genomics Browser: update 2013.

    PubMed

    Goldman, Mary; Craft, Brian; Swatloski, Teresa; Ellrott, Kyle; Cline, Melissa; Diekhans, Mark; Ma, Singer; Wilks, Chris; Stuart, Josh; Haussler, David; Zhu, Jingchun

    2013-01-01

    The UCSC Cancer Genomics Browser (https://genome-cancer.ucsc.edu/) is a set of web-based tools to display, investigate and analyse cancer genomics data and its associated clinical information. The browser provides whole-genome to base-pair level views of several different types of genomics data, including some next-generation sequencing platforms. The ability to view multiple datasets together allows users to make comparisons across different data and cancer types. Biological pathways, collections of genes, genomic or clinical information can be used to sort, aggregate and zoom into a group of samples. We currently display an expanding set of data from various sources, including 201 datasets from 22 TCGA (The Cancer Genome Atlas) cancers as well as data from Cancer Cell Line Encyclopedia and Stand Up To Cancer. New features include a completely redesigned user interface with an interactive tutorial and updated documentation. We have also added data downloads, additional clinical heatmap features, and an updated Tumor Image Browser based on Google Maps. New security features allow authenticated users access to private datasets hosted by several different consortia through the public website.

  8. The UCSC Cancer Genomics Browser: update 2013.

    PubMed

    Goldman, Mary; Craft, Brian; Swatloski, Teresa; Ellrott, Kyle; Cline, Melissa; Diekhans, Mark; Ma, Singer; Wilks, Chris; Stuart, Josh; Haussler, David; Zhu, Jingchun

    2013-01-01

    The UCSC Cancer Genomics Browser (https://genome-cancer.ucsc.edu/) is a set of web-based tools to display, investigate and analyse cancer genomics data and its associated clinical information. The browser provides whole-genome to base-pair level views of several different types of genomics data, including some next-generation sequencing platforms. The ability to view multiple datasets together allows users to make comparisons across different data and cancer types. Biological pathways, collections of genes, genomic or clinical information can be used to sort, aggregate and zoom into a group of samples. We currently display an expanding set of data from various sources, including 201 datasets from 22 TCGA (The Cancer Genome Atlas) cancers as well as data from Cancer Cell Line Encyclopedia and Stand Up To Cancer. New features include a completely redesigned user interface with an interactive tutorial and updated documentation. We have also added data downloads, additional clinical heatmap features, and an updated Tumor Image Browser based on Google Maps. New security features allow authenticated users access to private datasets hosted by several different consortia through the public website. PMID:23109555

  9. A Distance Measure for Genome Phylogenetic Analysis

    NASA Astrophysics Data System (ADS)

    Cao, Minh Duc; Allison, Lloyd; Dix, Trevor

    Phylogenetic analyses of species based on single genes or parts of the genomes are often inconsistent because of factors such as variable rates of evolution and horizontal gene transfer. The availability of more and more sequenced genomes allows phylogeny construction from complete genomes that is less sensitive to such inconsistency. For such long sequences, construction methods like maximum parsimony and maximum likelihood are often not possible due to their intensive computational requirement. Another class of tree construction methods, namely distance-based methods, require a measure of distances between any two genomes. Some measures such as evolutionary edit distance of gene order and gene content are computational expensive or do not perform well when the gene content of the organisms are similar. This study presents an information theoretic measure of genetic distances between genomes based on the biological compression algorithm expert model. We demonstrate that our distance measure can be applied to reconstruct the consensus phylogenetic tree of a number of Plasmodium parasites from their genomes, the statistical bias of which would mislead conventional analysis methods. Our approach is also used to successfully construct a plausible evolutionary tree for the γ-Proteobacteria group whose genomes are known to contain many horizontally transferred genes.

  10. KEGG: Kyoto Encyclopedia of Genes and Genomes.

    PubMed

    Ogata, H; Goto, S; Sato, K; Fujibuchi, W; Bono, H; Kanehisa, M

    1999-01-01

    Kyoto Encyclopedia of Genes and Genomes (KEGG) is a knowledge base for systematic analysis of gene functions in terms of the networks of genes and molecules. The major component of KEGG is the PATHWAY database that consists of graphical diagrams of biochemical pathways including most of the known metabolic pathways and some of the known regulatory pathways. The pathway information is also represented by the ortholog group tables summarizing orthologous and paralogous gene groups among different organisms. KEGG maintains the GENES database for the gene catalogs of all organisms with complete genomes and selected organisms with partial genomes, which are continuously re-annotated, as well as the LIGAND database for chemical compounds and enzymes. Each gene catalog is associated with the graphical genome map for chromosomal locations that is represented by Java applet. In addition to the data collection efforts, KEGG develops and provides various computational tools, such as for reconstructing biochemical pathways from the complete genome sequence and for predicting gene regulatory networks from the gene expression profiles. The KEGG databases are daily updated and made freely available (http://www.genome.ad.jp/kegg/).

  11. The Footprint of Genome Architecture in the Largest Genome Expansion in RNA Viruses

    PubMed Central

    Lauber, Chris; Goeman, Jelle J.; Parquet, Maria del Carmen; Thi Nga, Phan; Snijder, Eric J.; Morita, Kouichi; Gorbalenya, Alexander E.

    2013-01-01

    The small size of RNA virus genomes (2-to-32 kb) has been attributed to high mutation rates during replication, which is thought to lack proof-reading. This paradigm is being revisited owing to the discovery of a 3′-to-5′ exoribonuclease (ExoN) in nidoviruses, a monophyletic group of positive-stranded RNA viruses with a conserved genome architecture. ExoN, a homolog of canonical DNA proof-reading enzymes, is exclusively encoded by nidoviruses with genomes larger than 20 kb. All other known non-segmented RNA viruses have smaller genomes. Here we use evolutionary analyses to show that the two- to three-fold expansion of the nidovirus genome was accompanied by a large number of replacements in conserved proteins at a scale comparable to that in the Tree of Life. To unravel common evolutionary patterns in such genetically diverse viruses, we established the relation between genomic regions in nidoviruses in a sequence alignment-free manner. We exploited the conservation of the genome architecture to partition each genome into five non-overlapping regions: 5′ untranslated region (UTR), open reading frame (ORF) 1a, ORF1b, 3′ORFs (encompassing the 3′-proximal ORFs), and 3′ UTR. Each region was analyzed for its contribution to genome size change under different models. The non-linear model statistically outperformed the linear one and captured >92% of data variation. Accordingly, nidovirus genomes were concluded to have reached different points on an expansion trajectory dominated by consecutive increases of ORF1b, ORF1a, and 3′ORFs. Our findings indicate a unidirectional hierarchical relation between these genome regions, which are distinguished by their expression mechanism. In contrast, these regions cooperate bi-directionally on a functional level in the virus life cycle, in which they predominantly control genome replication, genome expression, and virus dissemination, respectively. Collectively, our findings suggest that genome architecture and the

  12. Joint assembly and genetic mapping of the Atlantic horseshoe crab genome reveals ancient whole genome duplication

    PubMed Central

    2014-01-01

    Background Horseshoe crabs are marine arthropods with a fossil record extending back approximately 450 million years. They exhibit remarkable morphological stability over their long evolutionary history, retaining a number of ancestral arthropod traits, and are often cited as examples of “living fossils.” As arthropods, they belong to the Ecdysozoa, an ancient super-phylum whose sequenced genomes (including insects and nematodes) have thus far shown more divergence from the ancestral pattern of eumetazoan genome organization than cnidarians, deuterostomes and lophotrochozoans. However, much of ecdysozoan diversity remains unrepresented in comparative genomic analyses. Results Here we apply a new strategy of combined de novo assembly and genetic mapping to examine the chromosome-scale genome organization of the Atlantic horseshoe crab, Limulus polyphemus. We constructed a genetic linkage map of this 2.7 Gbp genome by sequencing the nuclear DNA of 34 wild-collected, full-sibling embryos and their parents at a mean redundancy of 1.1x per sample. The map includes 84,307 sequence markers grouped into 1,876 distinct genetic intervals and 5,775 candidate conserved protein coding genes. Conclusions Comparison with other metazoan genomes shows that the L. polyphemus genome preserves ancestral bilaterian linkage groups, and that a common ancestor of modern horseshoe crabs underwent one or more ancient whole genome duplications 300 million years ago, followed by extensive chromosome fusion. These results provide a counter-example to the often noted correlation between whole genome duplication and evolutionary radiations. The new, low-cost genetic mapping method for obtaining a chromosome-scale view of non-model organism genomes that we demonstrate here does not require laboratory culture, and is potentially applicable to a broad range of other species. PMID:24987520

  13. Molecular Pathology: Prognostic and Diagnostic Genomic Markers for Myeloid Neoplasms.

    PubMed

    Kuo, Frank C

    2016-09-01

    Application of next-generation sequencing (NGS) on myeloid neoplasms has expanded our knowledge of genomic alterations in this group of diseases. Genomic alterations in myeloid neoplasms are complex, heterogeneous, and not specific to a disease entity. NGS-based panel testing of myeloid neoplasms can complement existing diagnostic modalities and is gaining acceptance in the clinics and diagnostic laboratories. Prospective, randomized trials to evaluate the prognostic significance of genomic markers in myeloid neoplasms are under way in academic medical centers. PMID:27523973

  14. Evolutionary genomics of dog domestication.

    PubMed

    Wayne, Robert K; vonHoldt, Bridgett M

    2012-02-01

    We review the underlying principles and tools used in genomic studies of domestic dogs aimed at understanding the genetic changes that have occurred during domestication. We show that there are two principle modes of evolution within dogs. One primary mode that accounts for much of the remarkable diversity of dog breeds is the fixation of discrete mutations of large effect in individual lineages that are then crossed to various breed groupings. This transfer of mutations across the dog evolutionary tree leads to the appearance of high phenotypic diversity that in actuality reflects a small number of major genes. A second mechanism causing diversification involves the selective breeding of dogs within distinct phenotypic or functional groups, which enhances specific group attributes such as heading or tracking. Such progressive selection leads to a distinct genetic structure in evolutionary trees such that functional and phenotypic groups cluster genetically. We trace the origin of the nuclear genome in dogs based on haplotype-sharing analyses between dogs and gray wolves and show that contrary to previous mtDNA analyses, the nuclear genome of dogs derives primarily from Middle Eastern or European wolves, a result more consistent with the archeological record. Sequencing analysis of the IGF1 gene, which has been the target of size selection in small breeds, further supports this conclusion. Finally, we discuss how a black coat color mutation that evolved in dogs has transformed North American gray wolf populations, providing a first example of a mutation that appeared under domestication and selectively swept through a wild relative. PMID:22270221

  15. Genomic Evolution of 11 Type Strains within Family Planctomycetaceae

    PubMed Central

    Zhou, Yizhuang; Yang, Linfeng; Liu, Tianxiang; Yang, Jinlong; Chen, Yanling; Su, Longxiang; Xu, Jin; Chen, Jing; Liu, Feng; Chen, Jiapeng; Dai, Wenkui; Ni, Peixiang; Fang, Chengxiang; Yang, Ruifu

    2014-01-01

    The species in family Planctomycetaceae are ideal groups for investigating the origin of eukaryotes. Their cells are divided by a lipidic intracytoplasmic membrane and they share a number of eukaryote-like molecular characteristics. However, their genomic structures, potential abilities, and evolutionary status are still unknown. In this study, we searched for common protein families and a core genome/pan genome based on 11 sequenced species in family Planctomycetaceae. Then, we constructed phylogenetic tree based on their 832 common protein families. We also annotated the 11 genomes using the Clusters of Orthologous Groups database. Moreover, we predicted and reconstructed their core/pan metabolic pathways using the KEGG (Kyoto Encyclopedia of Genes and Genomes) orthology system. Subsequently, we identified genomic islands (GIs) and structural variations (SVs) among the five complete genomes and we specifically investigated the integration of two Planctomycetaceae plasmids in all 11 genomes. The results indicate that Planctomycetaceae species share diverse genomic variations and unique genomic characteristics, as well as have huge potential for human applications. PMID:24489782

  16. The tiniest tiny genomes.

    PubMed

    Moran, Nancy A; Bennett, Gordon M

    2014-01-01

    Starting in 2006, surprisingly tiny genomes have been discovered from numerous bacterial symbionts of insect hosts. Despite their size, each retains some genes that enable provisioning of limiting nutrients or other capabilities required by hosts. Genome sequence analyses show that genome reduction is an ongoing process, resulting in a continuum of sizes, with the smallest genome currently known at 112 kilobases. Genome reduction is typical in host-restricted symbionts and pathogens, but the tiniest genomes are restricted to symbionts required by hosts and restricted to specialized host cells, resulting from long coevolution with hosts. Genes are lost in all functional categories, but core genes for central informational processes, including genes encoding ribosomal proteins, are mostly retained, whereas genes underlying production of cell envelope components are especially depleted. Thus, these entities retain cell-like properties but are heavily dependent on coadaptation of hosts, which continuously evolve to support the symbionts upon which they depend.

  17. State of cat genomics.

    PubMed

    O'Brien, Stephen J; Johnson, Warren; Driscoll, Carlos; Pontius, Joan; Pecon-Slattery, Jill; Menotti-Raymond, Marilyn

    2008-06-01

    Our knowledge of cat family biology was recently expanded to include a genomics perspective with the completion of a draft whole genome sequence of an Abyssinian cat. The utility of the new genome information has been demonstrated by applications ranging from disease gene discovery and comparative genomics to species conservation. Patterns of genomic organization among cats and inbred domestic cat breeds have illuminated our view of domestication, revealing linkage disequilibrium tracks consequent of breed formation, defining chromosome exchanges that punctuated major lineages of mammals and suggesting ancestral continental migration events that led to 37 modern species of Felidae. We review these recent advances here. As the genome resources develop, the cat is poised to make a major contribution to many areas in genetics and biology.

  18. Genomic and Genetic Diversity within the Pseudomonas fluorescens Complex

    PubMed Central

    Garrido-Sanz, Daniel; Meier-Kolthoff, Jan P.; Göker, Markus; Martín, Marta; Rivilla, Rafael; Redondo-Nieto, Miguel

    2016-01-01

    The Pseudomonas fluorescens complex includes Pseudomonas strains that have been taxonomically assigned to more than fifty different species, many of which have been described as plant growth-promoting rhizobacteria (PGPR) with potential applications in biocontrol and biofertilization. So far the phylogeny of this complex has been analyzed according to phenotypic traits, 16S rDNA, MLSA and inferred by whole-genome analysis. However, since most of the type strains have not been fully sequenced and new species are frequently described, correlation between taxonomy and phylogenomic analysis is missing. In recent years, the genomes of a large number of strains have been sequenced, showing important genomic heterogeneity and providing information suitable for genomic studies that are important to understand the genomic and genetic diversity shown by strains of this complex. Based on MLSA and several whole-genome sequence-based analyses of 93 sequenced strains, we have divided the P. fluorescens complex into eight phylogenomic groups that agree with previous works based on type strains. Digital DDH (dDDH) identified 69 species and 75 subspecies within the 93 genomes. The eight groups corresponded to clustering with a threshold of 31.8% dDDH, in full agreement with our MLSA. The Average Nucleotide Identity (ANI) approach showed inconsistencies regarding the assignment to species and to the eight groups. The small core genome of 1,334 CDSs and the large pan-genome of 30,848 CDSs, show the large diversity and genetic heterogeneity of the P. fluorescens complex. However, a low number of strains were enough to explain most of the CDSs diversity at core and strain-specific genomic fractions. Finally, the identification and analysis of group-specific genome and the screening for distinctive characters revealed a phylogenomic distribution of traits among the groups that provided insights into biocontrol and bioremediation applications as well as their role as PGPR. PMID:26915094

  19. Genomic and Genetic Diversity within the Pseudomonas fluorescens Complex.

    PubMed

    Garrido-Sanz, Daniel; Meier-Kolthoff, Jan P; Göker, Markus; Martín, Marta; Rivilla, Rafael; Redondo-Nieto, Miguel

    2016-01-01

    The Pseudomonas fluorescens complex includes Pseudomonas strains that have been taxonomically assigned to more than fifty different species, many of which have been described as plant growth-promoting rhizobacteria (PGPR) with potential applications in biocontrol and biofertilization. So far the phylogeny of this complex has been analyzed according to phenotypic traits, 16S rDNA, MLSA and inferred by whole-genome analysis. However, since most of the type strains have not been fully sequenced and new species are frequently described, correlation between taxonomy and phylogenomic analysis is missing. In recent years, the genomes of a large number of strains have been sequenced, showing important genomic heterogeneity and providing information suitable for genomic studies that are important to understand the genomic and genetic diversity shown by strains of this complex. Based on MLSA and several whole-genome sequence-based analyses of 93 sequenced strains, we have divided the P. fluorescens complex into eight phylogenomic groups that agree with previous works based on type strains. Digital DDH (dDDH) identified 69 species and 75 subspecies within the 93 genomes. The eight groups corresponded to clustering with a threshold of 31.8% dDDH, in full agreement with our MLSA. The Average Nucleotide Identity (ANI) approach showed inconsistencies regarding the assignment to species and to the eight groups. The small core genome of 1,334 CDSs and the large pan-genome of 30,848 CDSs, show the large diversity and genetic heterogeneity of the P. fluorescens complex. However, a low number of strains were enough to explain most of the CDSs diversity at core and strain-specific genomic fractions. Finally, the identification and analysis of group-specific genome and the screening for distinctive characters revealed a phylogenomic distribution of traits among the groups that provided insights into biocontrol and bioremediation applications as well as their role as PGPR. PMID:26915094

  20. Genomic and Genetic Diversity within the Pseudomonas fluorescens Complex.

    PubMed

    Garrido-Sanz, Daniel; Meier-Kolthoff, Jan P; Göker, Markus; Martín, Marta; Rivilla, Rafael; Redondo-Nieto, Miguel

    2016-01-01

    The Pseudomonas fluorescens complex includes Pseudomonas strains that have been taxonomically assigned to more than fifty different species, many of which have been described as plant growth-promoting rhizobacteria (PGPR) with potential applications in biocontrol and biofertilization. So far the phylogeny of this complex has been analyzed according to phenotypic traits, 16S rDNA, MLSA and inferred by whole-genome analysis. However, since most of the type strains have not been fully sequenced and new species are frequently described, correlation between taxonomy and phylogenomic analysis is missing. In recent years, the genomes of a large number of strains have been sequenced, showing important genomic heterogeneity and providing information suitable for genomic studies that are important to understand the genomic and genetic diversity shown by strains of this complex. Based on MLSA and several whole-genome sequence-based analyses of 93 sequenced strains, we have divided the P. fluorescens complex into eight phylogenomic groups that agree with previous works based on type strains. Digital DDH (dDDH) identified 69 species and 75 subspecies within the 93 genomes. The eight groups corresponded to clustering with a threshold of 31.8% dDDH, in full agreement with our MLSA. The Average Nucleotide Identity (ANI) approach showed inconsistencies regarding the assignment to species and to the eight groups. The small core genome of 1,334 CDSs and the large pan-genome of 30,848 CDSs, show the large diversity and genetic heterogeneity of the P. fluorescens complex. However, a low number of strains were enough to explain most of the CDSs diversity at core and strain-specific genomic fractions. Finally, the identification and analysis of group-specific genome and the screening for distinctive characters revealed a phylogenomic distribution of traits among the groups that provided insights into biocontrol and bioremediation applications as well as their role as PGPR.

  1. Genome Aliquoting Revisited

    NASA Astrophysics Data System (ADS)

    Warren, Robert; Sankoff, David

    We prove that the genome aliquoting problem, the problem of finding a recent polyploid ancestor of a genome, with breakpoint distance can be solved in polynomial time. We propose an aliquoting algorithm that is a 2-approximation for the genome aliquoting problem with double cut and join distance, improving upon the previous best solution to this problem, Feijão and Meidanis' 4-approximation algorithm.

  2. Querying genomic databases

    SciTech Connect

    Baehr, A.; Hagstrom, R.; Joerg, D.; Overbeek, R.

    1991-09-01

    A natural-language interface has been developed that retrieves genomic information by using a simple subset of English. The interface spares the biologist from the task of learning database-specific query languages and computer programming. Currently, the interface deals with the E. coli genome. It can, however, be readily extended and shows promise as a means of easy access to other sequenced genomic databases as well.

  3. Quality scores for 32,000 genomes

    DOE PAGES

    Land, Miriam L.; Hyatt, Doug; Jun, Se-Ran; Kora, Guruprasad H.; Hauser, Loren J.; Lukjancenko, Oksana; Ussery, David W.

    2014-12-08

    More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). In this study, we have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences. Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes hadmore » quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes. The score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Finally and unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here.« less

  4. Quality scores for 32,000 genomes

    SciTech Connect

    Land, Miriam L.; Hyatt, Doug; Jun, Se-Ran; Kora, Guruprasad H.; Hauser, Loren J.; Lukjancenko, Oksana; Ussery, David W.

    2014-12-08

    More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). In this study, we have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences. Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes had quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes. The score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Finally and unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here.

  5. Quality scores for 32,000 genomes

    PubMed Central

    2014-01-01

    Background More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). We have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences. Results Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes had quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes. Conclusions The score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here. PMID:25780509

  6. Figure 2 from Integrative Genomics Viewer: Visualizing Big Data | Office of Cancer Genomics

    Cancer.gov

    Grouping and sorting genomic data in IGV. The IGV user interface displaying 202 glioblastoma samples from TCGA. Samples are grouped by tumor subtype (second annotation column) and data type (first annotation column) and sorted by copy number of the EGFR locus (middle column). Adapted from Figure 1; Robinson et al. 2011

  7. Comparative analysis of trichomonad genome sizes and karyotypes.

    PubMed

    Zubácová, Zuzana; Cimbůrek, Zdenek; Tachezy, Jan

    2008-09-01

    In parasitic protists, the genome sizes range from 2.9Mb in Encephalitozoon cuniculi to about 160Mb in Trichomonas vaginalis. The suprisingly large genome size of the former human parasite resulted from the expansion of various repetitive elements, specific gene families, and possibly from large-scale genome duplication. The reason for this phenomenon, as well as whether other trichomonad species have undergone a similar genome expansion, is not known. In this work we studied the genomes of nine selected species of the Trichomonadea group. We found that each species has a characteristic karyotype with a stable and haploid number of chromosomes. Relatively large genome sizes were found in all the tested species, although over a rather broad range (86-177Mb). The largest genomes were typically observed in the Trichomonas and Tritrichomonas genera (133-177Mb), while Tetratrichomonas gallinarum contains the smallest genome (86Mb). The genome size correlated with the cell volume, however, no relationship between genome size and the site of infection or trichomonad phagocytic ability was observed. The data presented here provide primary information towards selecting a trichomonad species for future large-scale sequencing to elucidate the evolution of unusual parabasalid genomes. PMID:18606195

  8. Comparative analysis of trichomonad genome sizes and karyotypes.

    PubMed

    Zubácová, Zuzana; Cimbůrek, Zdenek; Tachezy, Jan

    2008-09-01

    In parasitic protists, the genome sizes range from 2.9Mb in Encephalitozoon cuniculi to about 160Mb in Trichomonas vaginalis. The suprisingly large genome size of the former human parasite resulted from the expansion of various repetitive elements, specific gene families, and possibly from large-scale genome duplication. The reason for this phenomenon, as well as whether other trichomonad species have undergone a similar genome expansion, is not known. In this work we studied the genomes of nine selected species of the Trichomonadea group. We found that each species has a characteristic karyotype with a stable and haploid number of chromosomes. Relatively large genome sizes were found in all the tested species, although over a rather broad range (86-177Mb). The largest genomes were typically observed in the Trichomonas and Tritrichomonas genera (133-177Mb), while Tetratrichomonas gallinarum contains the smallest genome (86Mb). The genome size correlated with the cell volume, however, no relationship between genome size and the site of infection or trichomonad phagocytic ability was observed. The data presented here provide primary information towards selecting a trichomonad species for future large-scale sequencing to elucidate the evolution of unusual parabasalid genomes.

  9. Genome packaging in viruses.

    PubMed

    Sun, Siyang; Rao, Venigalla B; Rossmann, Michael G

    2010-02-01

    Genome packaging is a fundamental process in a viral life cycle. Many viruses assemble preformed capsids into which the genomic material is subsequently packaged. These viruses use a packaging motor protein that is driven by the hydrolysis of ATP to condense the nucleic acids into a confined space. How these motor proteins package viral genomes had been poorly understood until recently, when a few X-ray crystal structures and cryo-electron microscopy (cryo-EM) structures became available. Here we discuss various aspects of genome packaging and compare the mechanisms proposed for packaging motors on the basis of structural information. PMID:20060706

  10. Filarial and Wolbachia genomics.

    PubMed

    Scott, A L; Ghedin, E; Nutman, T B; McReynolds, L A; Poole, C B; Slatko, B E; Foster, J M

    2012-01-01

    Filarial nematode parasites, the causative agents for a spectrum of acute and chronic diseases including lymphatic filariasis and river blindness, threaten the well-being and livelihood of hundreds of millions of people in the developing regions of the world. The 2007 publication on a draft assembly of the 95-Mb genome of the human filarial parasite Brugia malayi- representing the first helminth parasite genome to be sequenced - has been followed in rapid succession by projects that have resulted in the genome sequencing of six additional filarial species, seven nonfilarial nematode parasites of animals and nearly 30 plant parasitic and free-living species. Parallel to the genomic sequencing, transcriptomic and proteomic projects have facilitated genome annotation, expanded our understanding of stage-associated gene expression and provided a first look at the role of epigenetic regulation of filarial genomes through microRNAs. The expansion in filarial genomics will also provide a significant enrichment in our knowledge of the diversity and variability in the genomes of the endosymbiotic bacterium Wolbachia leading to a better understanding of the genetic principles that govern filarial-Wolbachia mutualism. The goal here is to provide an overview of the trends and advances in filarial and Wolbachia genomics. PMID:22098559

  11. Disentangling associated genomes.

    PubMed

    Sloan, Daniel B; Bennett, Gordon M; Engel, Philipp; Williams, David; Ochman, Howard

    2013-01-01

    The recovery and assembly of genome sequences from samples containing communities of organisms pose several challenges. Because it is rarely possible to disassociate the resident organisms prior to sequencing, a major obstacle is the assignment of sequences to a single genome that can be fully assembled. This chapter delineates many of the decisions, methodologies, and approaches that can lead to the generation of complete or nearly complete microbial genome sequences from heterogeneous samples-that is, the procedures that allow us to turn metagenomes into genomes.

  12. Between two fern genomes.

    PubMed

    Sessa, Emily B; Banks, Jo Ann; Barker, Michael S; Der, Joshua P; Duffy, Aaron M; Graham, Sean W; Hasebe, Mitsuyasu; Langdale, Jane; Li, Fay-Wei; Marchant, D Blaine; Pryer, Kathleen M; Rothfels, Carl J; Roux, Stanley J; Salmi, Mari L; Sigel, Erin M; Soltis, Douglas E; Soltis, Pamela S; Stevenson, Dennis W; Wolf, Paul G

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves.

  13. Fungal Genomics Program

    SciTech Connect

    Grigoriev, Igor

    2012-03-12

    The JGI Fungal Genomics Program aims to scale up sequencing and analysis of fungal genomes to explore the diversity of fungi important for energy and the environment, and to promote functional studies on a system level. Combining new sequencing technologies and comparative genomics tools, JGI is now leading the world in fungal genome sequencing and analysis. Over 120 sequenced fungal genomes with analytical tools are available via MycoCosm (www.jgi.doe.gov/fungi), a web-portal for fungal biologists. Our model of interacting with user communities, unique among other sequencing centers, helps organize these communities, improves genome annotation and analysis work, and facilitates new larger-scale genomic projects. This resulted in 20 high-profile papers published in 2011 alone and contributing to the Genomics Encyclopedia of Fungi, which targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts). Our next grand challenges include larger scale exploration of fungal diversity (1000 fungal genomes), developing molecular tools for DOE-relevant model organisms, and analysis of complex systems and metagenomes.

  14. [Landscape and ecological genomics].

    PubMed

    2013-10-01

    Landscape genomics is the modern version of landscape genetics, a discipline that arose approximately 10 years ago as a combination of population genetics, landscape ecology, and spatial statistics. It studies the effects of environmental variables on gene flow and other microevolutionary processes that determine genetic connectivity and variations in populations. In contrast to population genetics, it operates at the level of individual specimens rather than at the level of population samples. Another important difference between landscape genetics and genomics and population genetics is that, in the former, the analysis of gene flow and local adaptations takes quantitative account of landforms and features of the matrix, i.e., hostile spaces that separate species habitats. Landscape genomics is a part of population ecogenomics, which, along with community genomics, is a major part of ecological genomics. One of the principal purposes of landscape genomics is the identification and differentiation of various genome-wide and locus-specific effects. The approaches and computation tools developed for combined analysis of genomic and landscape variables make it possible to detect adaptation-related genome fragments, which facilitates the planning of conservation efforts and the prediction of species' fate in response to expected changes in the environment. PMID:25508669

  15. [Landscape and ecological genomics].

    PubMed

    Tetushkin, E Ia

    2013-10-01

    Landscape genomics is the modern version of landscape genetics, a discipline that arose approximately 10 years ago as a combination of population genetics, landscape ecology, and spatial statistics. It studies the effects of environmental variables on gene flow and other microevolutionary processes that determine genetic connectivity and variations in populations. In contrast to population genetics, it operates at the level of individual specimens rather than at the level of population samples. Another important difference between landscape genetics and genomics and population genetics is that, in the former, the analysis of gene flow and local adaptations takes quantitative account of landforms and features of the matrix, i.e., hostile spaces that separate species habitats. Landscape genomics is a part of population ecogenomics, which, along with community genomics, is a major part of ecological genomics. One of the principal purposes of landscape genomics is the identification and differentiation of various genome-wide and locus-specific effects. The approaches and computation tools developed for combined analysis of genomic and landscape variables make it possible to detect adaptation-related genome fragments, which facilitates the planning of conservation efforts and the prediction of species' fate in response to expected changes in the environment. PMID:25474890

  16. Between Two Fern Genomes

    PubMed Central

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves. PMID:25324969

  17. A Comparison of 14 Erythrobacter Genomes Provides Insights into the Genomic Divergence and Scattered Distribution of Phototrophs

    PubMed Central

    Zheng, Qiang; Lin, Wenxin; Liu, Yanting; Chen, Chang; Jiao, Nianzhi

    2016-01-01

    Aerobic anoxygenic phototrophic bacteria (AAPB) are bacteriochlorophyll a (Bchl a)-containing microbial functional population. Erythrobacter is the first genus that was identified to contain AAPB species. Here, we compared 14 Erythrobacter genomes: seven phototrophic strains and seven non- phototrophic strains. Interestingly, AAPB strains are scattered in this genus based on their phylogenetic relationships. All 14 strains could be clustered into three groups based on phylo-genomic analysis, average genomic nucleotide identity and the phylogeny of signature genes (16S rRNA and virB4 genes). The AAPB strains were distributed in three groups, and gain and loss of phototrophic genes co-occurred in the evolutionary history of the genus Erythrobacter. The organization and structure of photosynthesis gene clusters (PGCs) in seven AAPB genomes displayed high synteny of major regions except for few insertions. The 14 Erythrobacter genomes had a large range of genome sizes, from 2.72 to 3.60 M, and the sizes of the core and pan- genomes were 1231 and 8170 orthologous clusters, respectively. Integrative and conjugative elements (ICEs) were frequently identified in genomes we studied, which might play significant roles in shaping or contributing to the pan-genome of Erythrobacter. Our findings suggest the ongoing evolutionary divergence of Erythrobacter genomes and the scattered distribution characteristic of PGC. PMID:27446024

  18. A Comparison of 14 Erythrobacter Genomes Provides Insights into the Genomic Divergence and Scattered Distribution of Phototrophs.

    PubMed

    Zheng, Qiang; Lin, Wenxin; Liu, Yanting; Chen, Chang; Jiao, Nianzhi

    2016-01-01

    Aerobic anoxygenic phototrophic bacteria (AAPB) are bacteriochlorophyll a (Bchl a)-containing microbial functional population. Erythrobacter is the first genus that was identified to contain AAPB species. Here, we compared 14 Erythrobacter genomes: seven phototrophic strains and seven non- phototrophic strains. Interestingly, AAPB strains are scattered in this genus based on their phylogenetic relationships. All 14 strains could be clustered into three groups based on phylo-genomic analysis, average genomic nucleotide identity and the phylogeny of signature genes (16S rRNA and virB4 genes). The AAPB strains were distributed in three groups, and gain and loss of phototrophic genes co-occurred in the evolutionary history of the genus Erythrobacter. The organization and structure of photosynthesis gene clusters (PGCs) in seven AAPB genomes displayed high synteny of major regions except for few insertions. The 14 Erythrobacter genomes had a large range of genome sizes, from 2.72 to 3.60 M, and the sizes of the core and pan- genomes were 1231 and 8170 orthologous clusters, respectively. Integrative and conjugative elements (ICEs) were frequently identified in genomes we studied, which might play significant roles in shaping or contributing to the pan-genome of Erythrobacter. Our findings suggest the ongoing evolutionary divergence of Erythrobacter genomes and the scattered distribution characteristic of PGC. PMID:27446024

  19. A Comparison of 14 Erythrobacter Genomes Provides Insights into the Genomic Divergence and Scattered Distribution of Phototrophs.

    PubMed

    Zheng, Qiang; Lin, Wenxin; Liu, Yanting; Chen, Chang; Jiao, Nianzhi

    2016-01-01

    Aerobic anoxygenic phototrophic bacteria (AAPB) are bacteriochlorophyll a (Bchl a)-containing microbial functional population. Erythrobacter is the first genus that was identified to contain AAPB species. Here, we compared 14 Erythrobacter genomes: seven phototrophic strains and seven non- phototrophic strains. Interestingly, AAPB strains are scattered in this genus based on their phylogenetic relationships. All 14 strains could be clustered into three groups based on phylo-genomic analysis, average genomic nucleotide identity and the phylogeny of signature genes (16S rRNA and virB4 genes). The AAPB strains were distributed in three groups, and gain and loss of phototrophic genes co-occurred in the evolutionary history of the genus Erythrobacter. The organization and structure of photosynthesis gene clusters (PGCs) in seven AAPB genomes displayed high synteny of major regions except for few insertions. The 14 Erythrobacter genomes had a large range of genome sizes, from 2.72 to 3.60 M, and the sizes of the core and pan- genomes were 1231 and 8170 orthologous clusters, respectively. Integrative and conjugative elements (ICEs) were frequently identified in genomes we studied, which might play significant roles in shaping or contributing to the pan-genome of Erythrobacter. Our findings suggest the ongoing evolutionary divergence of Erythrobacter genomes and the scattered distribution characteristic of PGC.

  20. Cardiovascular group

    NASA Technical Reports Server (NTRS)

    Blomqvist, Gunnar

    1989-01-01

    As a starting point, the group defined a primary goal of maintaining in flight a level of systemic oxygen transport capacity comparable to each individual's preflight upright baseline. The goal of maintaining capacity at preflight levels would seem to be a reasonable objective for several different reasons, including the maintenance of good health in general and the preservation of sufficient cardiovascular reserve capacity to meet operational demands. It is also important not to introduce confounding variables in whatever other physiological studies are being performed. A change in the level of fitness is likely to be a significant confounding variable in the study of many organ systems. The principal component of the in-flight cardiovascular exercise program should be large-muscle activity such as treadmill exercise. It is desirable that at least one session per week be monitored to assure maintenance of proper functional levels and to provide guidance for any adjustments of the exercise prescription. Appropriate measurements include evaluation of the heart-rate/workload or the heart-rate/oxygen-uptake relationship. Respiratory gas analysis is helpful by providing better opportunities to document relative workload levels from analysis of the interrelationships among VO2, VCO2, and ventilation. The committee felt that there is no clear evidence that any particular in-flight exercise regimen is protective against orthostatic hypotension during the early readaptation phase. Some group members suggested that maintenance of the lower body muscle mass and muscle tone may be helpful. There is also evidence that late in-flight interventions to reexpand blood volume to preflight levels are helpful in preventing or minimizing postflight orthostatic hypotension.

  1. Integrated consensus map of cultivated peanut and wild relatives reveals structures of the A and B genomes of Arachis and divergence of the legume genomes.

    PubMed

    Shirasawa, Kenta; Bertioli, David J; Varshney, Rajeev K; Moretzsohn, Marcio C; Leal-Bertioli, Soraya C M; Thudi, Mahendar; Pandey, Manish K; Rami, Jean-Francois; Foncéka, Daniel; Gowda, Makanahally V C; Qin, Hongde; Guo, Baozhu; Hong, Yanbin; Liang, Xuanqiang; Hirakawa, Hideki; Tabata, Satoshi; Isobe, Sachiko

    2013-04-01

    The complex, tetraploid genome structure of peanut (Arachis hypogaea) has obstructed advances in genetics and genomics in the species. The aim of this study is to understand the genome structure of Arachis by developing a high-density integrated consensus map. Three recombinant inbred line populations derived from crosses between the A genome diploid species, Arachis duranensis and Arachis stenosperma; the B genome diploid species, Arachis ipaënsis and Arachis magna; and between the AB genome tetraploids, A. hypogaea and an artificial amphidiploid (A. ipaënsis × A. duranensis)(4×), were used to construct genetic linkage maps: 10 linkage groups (LGs) of 544 cM with 597 loci for the A genome; 10 LGs of 461 cM with 798 loci for the B genome; and 20 LGs of 1442 cM with 1469 loci for the AB genome. The resultant maps plus 13 published maps were integrated into a consensus map covering 2651 cM with 3693 marker loci which was anchored to 20 consensus LGs corresponding to the A and B genomes. The comparative genomics with genome sequences of Cajanus cajan, Glycine max, Lotus japonicus, and Medicago truncatula revealed that the Arachis genome has segmented synteny relationship to the other legumes. The comparative maps in legumes, integrated tetraploid consensus maps, and genome-specific diploid maps will increase the genetic and genomic understanding of Arachis and should facilitate molecular breeding. PMID:23315685

  2. Home - The Cancer Genome Atlas - Cancer Genome - TCGA

    Cancer.gov

    The Cancer Genome Atlas (TCGA) is a comprehensive and coordinated effort to accelerate our understanding of the molecular basis of cancer through the application of genome analysis technologies, including large-scale genome sequencing.

  3. Genome Sequences of Five B1 Subcluster Mycobacteriophages

    PubMed Central

    Barrus, E. Zane; Benedict, Alex B.; Brighton, Alicia K.; Fisher, Joshua N. B.; Gardner, Adam V.; Kartchner, Brittany J.; Ladle, Kara C.; Lunt, Bryce L.; Merrill, Bryan D.; Morrell, John D.; Burnett, Sandra H.

    2013-01-01

    Mycobacteriophages infect members of the Mycobacterium genus in the phylum Actinobacteria and exhibit remarkable diversity. Genome analysis groups the thousands of known mycobacteriophages into clusters, of which the B1 subcluster is currently the third most populous. We report the complete genome sequences of five additional members of the B1 subcluster. PMID:24285667

  4. Genome sequences of five b1 subcluster mycobacteriophages.

    PubMed

    Breakwell, Donald P; Barrus, E Zane; Benedict, Alex B; Brighton, Alicia K; Fisher, Joshua N B; Gardner, Adam V; Kartchner, Brittany J; Ladle, Kara C; Lunt, Bryce L; Merrill, Bryan D; Morrell, John D; Burnett, Sandra H; Grose, Julianne H

    2013-11-27

    Mycobacteriophages infect members of the Mycobacterium genus in the phylum Actinobacteria and exhibit remarkable diversity. Genome analysis groups the thousands of known mycobacteriophages into clusters, of which the B1 subcluster is currently the third most populous. We report the complete genome sequences of five additional members of the B1 subcluster.

  5. EGN: a wizard for construction of gene and genome similarity networks

    PubMed Central

    2013-01-01

    Background Increasingly, similarity networks are being used for evolutionary analyses of molecular datasets. These networks are very useful, in particular for the analysis of gene sharing, lateral gene transfer and for the detection of distant homologs. Currently, such analyses require some computer programming skills due to the limited availability of user-friendly freely distributed software. Consequently, although appealing, the construction and analyses of these networks remain less familiar to biologists than do phylogenetic approaches. Results In order to ease the use of similarity networks in the community of evolutionary biologists, we introduce a software program, EGN, that runs under Linux or MacOSX. EGN automates the reconstruction of gene and genome networks from nucleic and proteic sequences. EGN also implements statistics describing genetic diversity in these samples, for various user-defined thresholds of similarities. In the interest of studying the complexity of evolutionary processes affecting microbial evolution, we applied EGN to a dataset of 571,044 proteic sequences from the three domains of life and from mobile elements. We observed that, in Borrelia, plasmids play a different role than in most other eubacteria. Rather than being genetic couriers involved in lateral gene transfer, Borrelia’s plasmids and their genes act as private genetic goods, that contribute to the creation of genetic diversity within their parasitic hosts. Conclusion EGN can be used for constructing, analyzing, and mining molecular datasets in evolutionary studies. The program can help increase our knowledge of the processes through which genes from distinct sources and/or from multiple genomes co-evolve in lineages of cellular organisms. PMID:23841456

  6. Genomics: implications for toxicology.

    PubMed

    Olden, K; Guthrie, J

    2001-01-25

    The primary goal of the Environmental Genome Project (EGP) is the identification of human polymorphisms indicative of susceptibility to specific environmental agents. Despite evidence for a substantial genetic contribution to disease variation in the population, progress towards identifying specific genes has been slow. To date, most of the advances in our understanding of human diseases has come from genetic analyses of monogenic diseases that affect a relatively small portion of the population. The principal strategy of the EGP involves resequencing DNA samples from populations representative of the US racial and ethnic groups to develop a database of variations. Polymorphisms in specific genes may also be detected by gene-expression profiling. The identification of polymorphisms by resequencing is straightforward, and can be accomplished with minimal difficulty. Gene-expression profiling is still problematic; however, determining the functional significance of the allelic variations will be a monumental challenge involving sophisticated proteomics and population-based and animal model studies. These studies will change radically the practice of public health and clinical medicine, and the approach to the development of pharmaceuticals.

  7. Group evaporation

    NASA Technical Reports Server (NTRS)

    Shen, Hayley H.

    1991-01-01

    Liquid fuel combustion process is greatly affected by the rate of droplet evaporation. The heat and mass exchanges between gas and liquid couple the dynamics of both phases in all aspects: mass, momentum, and energy. Correct prediction of the evaporation rate is therefore a key issue in engineering design of liquid combustion devices. Current analytical tools for characterizing the behavior of these devices are based on results from a single isolated droplet. Numerous experimental studies have challenged the applicability of these results in a dense spray. To account for the droplets' interaction in a dense spray, a number of theories have been developed in the past decade. Herein, two tasks are examined. One was to study how to implement the existing theoretical results, and the other was to explore the possibility of experimental verifications. The current theoretical results of group evaporation are given for a monodispersed cluster subject to adiabatic conditions. The time evolution of the fluid mechanic and thermodynamic behavior in this cluster is derived. The results given are not in the form of a subscale model for CFD codes.

  8. Genomes with distinct function composition.

    PubMed

    Tamames, J; Ouzounis, C; Sander, C; Valencia, A

    1996-06-24

    The functional composition of organisms can be analysed for the first time with the appearance of complete or sizeable parts of various genomes. We have reduced the problem of protein function classification to a simple scheme with three classes of protein function: energy-, information- and communication-associated proteins. Finer classification schemes can be easily mapped to the above three classes. To deal with the vast amount of information, a system for automatic function classification using database annotations has been developed. The system is able to classify correctly about 80% of the query sequences with annotations. Using this system, we can analyse samples from the genomes of the most represented species in sequence databases and compare their genomic composition. The similarities and differences for different taxonomic groups are strikingly intuitive. Viruses have the highest proportion of proteins involved in the control and expression of genetic information. Bacteria have the highest proportion of their genes dedicated to the production of proteins associated with small molecule transformations and transport. Animals have a very large proportion of proteins associated with intra- and intercellular communication and other regulatory processes. In general, the proportion of communication-related proteins increases during evolution, indicating trends that led to the emergence of the eukaryotic cell and later the transition from unicellular to multicellular organisms. PMID:8682215

  9. A Plant-Associated Microbe Genome Initiative

    SciTech Connect

    Jan E. Leach; Scott Gold; Sue Tolin; Kellye Eversole

    2003-03-06

    effort for genome analysis of plant-associated microbes. The goals are to (i) obtain genome sequence information for several representative groups of microbes; (ii) identify and determine function for the genes/proteins and other genomic elements involved in plant-microbe interactions; (iii) develop and implement standardized bioinformatic tools and a database system that is applicable across all microbes; and (iv) educate and train scientists with skills and knowledge of biological and computational sciences who will apply the information to the protection of our food sources and environment.

  10. Timing and Scope of Genomic Expansion within Annelida: Evidence from Homeoboxes in the Genome of the Earthworm Eisenia fetida.

    PubMed

    Zwarycz, Allison S; Nossa, Carlos W; Putnam, Nicholas H; Ryan, Joseph F

    2015-12-10

    Annelida represents a large and morphologically diverse group of bilaterian organisms. The recently published polychaete and leech genome sequences revealed an equally dynamic range of diversity at the genomic level. The availability of more annelid genomes will allow for the identification of evolutionary genomic events that helped shape the annelid lineage and better understand the diversity within the group. We sequenced and assembled the genome of the common earthworm, Eisenia fetida. As a first pass at understanding the diversity within the group, we classified 363 earthworm homeoboxes and compared them with those of the leech Helobdella robusta and the polychaete Capitella teleta. We inferred many gene expansions occurring in the lineage connecting the most recent common ancestor (MRCA) of Capitella and Eisenia to the Eisenia/Helobdella MRCA. Likewise, the lineage leading from the Eisenia/Helobdella MRCA to the leech H. robusta has experienced substantial gains and losses. However, the lineage leading from Eisenia/Helobdella MRCA to E. fetida is characterized by extraordinary levels of homeobox gain. The evolutionary dynamics observed in the homeoboxes of these lineages are very likely to be generalizable to all genes. These genome expansions and losses have likely contributed to the remarkable biology exhibited in this group. These results provide a new perspective from which to understand the diversity within these lineages, show the utility of sub-draft genome assemblies for understanding genomic evolution, and provide a critical resource from which the biology of these animals can be studied.

  11. Timing and Scope of Genomic Expansion within Annelida: Evidence from Homeoboxes in the Genome of the Earthworm Eisenia fetida

    PubMed Central

    Zwarycz, Allison S.; Nossa, Carlos W.; Putnam, Nicholas H.; Ryan, Joseph F.

    2016-01-01

    Annelida represents a large and morphologically diverse group of bilaterian organisms. The recently published polychaete and leech genome sequences revealed an equally dynamic range of diversity at the genomic level. The availability of more annelid genomes will allow for the identification of evolutionary genomic events that helped shape the annelid lineage and better understand the diversity within the group. We sequenced and assembled the genome of the common earthworm, Eisenia fetida. As a first pass at understanding the diversity within the group, we classified 363 earthworm homeoboxes and compared them with those of the leech Helobdella robusta and the polychaete Capitella teleta. We inferred many gene expansions occurring in the lineage connecting the most recent common ancestor (MRCA) of Capitella and Eisenia to the Eisenia/Helobdella MRCA. Likewise, the lineage leading from the Eisenia/Helobdella MRCA to the leech H. robusta has experienced substantial gains and losses. However, the lineage leading from Eisenia/Helobdella MRCA to E. fetida is characterized by extraordinary levels of homeobox gain. The evolutionary dynamics observed in the homeoboxes of these lineages are very likely to be generalizable to all genes. These genome expansions and losses have likely contributed to the remarkable biology exhibited in this group. These results provide a new perspective from which to understand the diversity within these lineages, show the utility of sub-draft genome assemblies for understanding genomic evolution, and provide a critical resource from which the biology of these animals can be studied. PMID:26659921

  12. Recombinant transfer in the basic genome of Escherichia coli.

    PubMed

    Dixit, Purushottam D; Pang, Tin Yau; Studier, F William; Maslov, Sergei

    2015-07-21

    An approximation to the ∼4-Mbp basic genome shared by 32 strains of Escherichia coli representing six evolutionary groups has been derived and analyzed computationally. A multiple alignment of the 32 complete genome sequences was filtered to remove mobile elements and identify the most reliable ∼90% of the aligned length of each of the resulting 496 basic-genome pairs. Patterns of single base-pair mutations (SNPs) in aligned pairs distinguish clonally inherited regions from regions where either genome has acquired DNA fragments from diverged genomes by homologous recombination since their last common ancestor. Such recombinant transfer is pervasive across the basic genome, mostly between genomes in the same evolutionary group, and generates many unique mosaic patterns. The six least-diverged genome pairs have one or two recombinant transfers of length ∼40-115 kbp (and few if any other transfers), each containing one or more gene clusters known to confer strong selective advantage in some environments. Moderately diverged genome pairs (0.4-1% SNPs) show mosaic patterns of interspersed clonal and recombinant regions of varying lengths throughout the basic genome, whereas more highly diverged pairs within an evolutionary group or pairs between evolutionary groups having >1.3% SNPs have few clonal matches longer than a few kilobase pairs. Many recombinant transfers appear to incorporate fragments of the entering DNA produced by restriction systems of the recipient cell. A simple computational model can closely fit the data. Most recombinant transfers seem likely to be due to generalized transduction by coevolving populations of phages, which could efficiently distribute variability throughout bacterial genomes.