Science.gov

Sample records for co-evolving genomic groups

  1. Group normalization for genomic data.

    PubMed

    Ghandi, Mahmoud; Beer, Michael A

    2012-01-01

    Data normalization is a crucial preliminary step in analyzing genomic datasets. The goal of normalization is to remove global variation to make readings across different experiments comparable. In addition, most genomic loci have non-uniform sensitivity to any given assay because of variation in local sequence properties. In microarray experiments, this non-uniform sensitivity is due to different DNA hybridization and cross-hybridization efficiencies, known as the probe effect. In this paper we introduce a new scheme, called Group Normalization (GN), to remove both global and local biases in one integrated step, whereby we determine the normalized probe signal by finding a set of reference probes with similar responses. Compared to conventional normalization methods such as Quantile normalization and physically motivated probe effect models, our proposed method is general in the sense that it does not require the assumption that the underlying signal distribution be identical for the treatment and control, and is flexible enough to correct for nonlinear and higher order probe effects. The Group Normalization algorithm is computationally efficient and easy to implement. We also describe a variant of the Group Normalization algorithm, called Cross Normalization, which efficiently amplifies biologically relevant differences between any two genomic datasets. PMID:22912661

  2. Genomic adaptation of the Lactobacillus casei group.

    PubMed

    Toh, Hidehiro; Oshima, Kenshiro; Nakano, Akiyo; Takahata, Muneaki; Murakami, Masaru; Takaki, Takashi; Nishiyama, Hidetoshi; Igimi, Shizunobu; Hattori, Masahira; Morita, Hidetoshi

    2013-01-01

    Lactobacillus casei, L. paracasei, and L. rhamnosus form a closely related taxonomic group (Lactobacillus casei group) within the facultatively heterofermentative lactobacilli. Here, we report the complete genome sequences of L. paracasei JCM 8130 and L. casei ATCC 393, and the draft genome sequence of L. paracasei COM0101, all of which were isolated from daily products. Furthermore, we re-annotated the genome of L. rhamnosus ATCC 53103 (also known as L. rhamnosus GG), which we have previously reported. We confirmed that ATCC 393 is distinct from other strains previously described as L. paracasei. The core genome of 10 completely sequenced strains of the L. casei group comprised 1,682 protein-coding genes. Although extensive genome-wide synteny was found among the L. casei group, the genomes of ATCC 53103, JCM 8130, and ATCC 393 contained genomic islands compared with L. paracasei ATCC 334. Several genomic islands, including carbohydrate utilization gene clusters, were found at the same loci in the chromosomes of the L. casei group. The spaCBA pilus gene cluster, which was first identified in GG, was also found in other strains of the L. casei group, but several L. paracasei strains including COM0101 contained truncated spaC gene. ATCC 53103 encoded a higher number of proteins involved in carbohydrate utilization compared with intestinal lactobacilli, and extracellular adhesion proteins, several of which are absent in other strains of the L. casei group. In addition to previously fully sequenced L. rhamnosus and L. paracasei strains, the complete genome sequences of L. casei will provide valuable insights into the evolution of the L. casei group. PMID:24116025

  3. Genomic Adaptation of the Lactobacillus casei Group

    PubMed Central

    Nakano, Akiyo; Takahata, Muneaki; Murakami, Masaru; Takaki, Takashi; Nishiyama, Hidetoshi; Igimi, Shizunobu; Hattori, Masahira; Morita, Hidetoshi

    2013-01-01

    Lactobacillus casei, L. paracasei, and L. rhamnosus form a closely related taxonomic group (Lactobacillus casei group) within the facultatively heterofermentative lactobacilli. Here, we report the complete genome sequences of L. paracasei JCM 8130 and L. casei ATCC 393, and the draft genome sequence of L. paracasei COM0101, all of which were isolated from daily products. Furthermore, we re-annotated the genome of L. rhamnosus ATCC 53103 (also known as L. rhamnosus GG), which we have previously reported. We confirmed that ATCC 393 is distinct from other strains previously described as L. paracasei. The core genome of 10 completely sequenced strains of the L. casei group comprised 1,682 protein-coding genes. Although extensive genome-wide synteny was found among the L. casei group, the genomes of ATCC 53103, JCM 8130, and ATCC 393 contained genomic islands compared with L. paracasei ATCC 334. Several genomic islands, including carbohydrate utilization gene clusters, were found at the same loci in the chromosomes of the L. casei group. The spaCBA pilus gene cluster, which was first identified in GG, was also found in other strains of the L. casei group, but several L. paracasei strains including COM0101 contained truncated spaC gene. ATCC 53103 encoded a higher number of proteins involved in carbohydrate utilization compared with intestinal lactobacilli, and extracellular adhesion proteins, several of which are absent in other strains of the L. casei group. In addition to previously fully sequenced L. rhamnosus and L. paracasei strains, the complete genome sequences of L. casei will provide valuable insights into the evolution of the L. casei group. PMID:24116025

  4. Assembler: Efficient Discovery of Spatial Co-evolving Patterns in Massive Geo-sensory Data

    PubMed Central

    Zhang, Chao; Zheng, Yu; Ma, Xiuli; Han, Jiawei

    2015-01-01

    Recent years have witnessed the wide proliferation of geo-sensory applications wherein a bundle of sensors are deployed at different locations to cooperatively monitor the target condition. Given massive geo-sensory data, we study the problem of mining spatial co-evolving patterns (SCPs), i.e., groups of sensors that are spatially correlated and co-evolve frequently in their readings. SCP mining is of great importance to various real-world applications, yet it is challenging because (1) the truly interesting evolutions are often flooded by numerous trivial fluctuations in the geo-sensory time series; and (2) the pattern search space is extremely large due to the spatiotemporal combinatorial nature of SCP. In this paper, we propose a two-stage method called Assembler. In the first stage, Assembler filters trivial fluctuations using wavelet transform and detects frequent evolutions for individual sensors via a segment-and-group approach. In the second stage, Assembler generates SCPs by assembling the frequent evolutions of individual sensors. Leveraging the spatial constraint, it conceptually organizes all the SCPs into a novel structure called the SCP search tree, which facilitates the effective pruning of the search space to generate SCPs efficiently. Our experiments on both real and synthetic data sets show that Assembler is effective, efficient, and scalable. PMID:26705506

  5. Measurement of organization in complex and co-evolving networks

    NASA Astrophysics Data System (ADS)

    Georgiev, Georgi

    2012-02-01

    We apply a new method for measurement of organization of complex and co-evolving networks using the quantity of physical action. We consider simple arrangements of elements in a network and constraints to their motion along paths and calculate the amount of organization in each system using the following measure: organization is the inverse of the average sum of physical actions of all elements in a system per unit motion multiplied by the Planck's constant. The meaning of quantity of organization here is the number of quanta of action per one unit motion along a path of an element. A unit motion along a path for a network, such as internet, is the transmission of one bit of information. The calculation can be expanded to systems consisting of many elements and constraints and also can be followed as a function of time with improvement of the organization of a system or connected systems and networks. Thus, the principle of least action becomes the driving force, and the least action state of the system, the attractor for all of the paths of its elements and states of its constraints. We consider also the rate of constraint minimization, or decrease of action per element and motion, as a function of the number of elements i.e. quality as a function of quantity. Increase of quantity, within specified limits, leads to increase of level of organization and vice versa.

  6. Effect of co-evolving amino acid residues on topology of phylogenetic trees.

    PubMed

    Sherbakov, D Yu; Triboy, T I

    2007-12-01

    The presence in proteins of amino acid residues that change in concert during evolution is associated with keeping constant the protein spatial structure and functions. As in the case with morphological features, correlated substitutions may become the cause of homoplasies--the independent evolution of identical non-homological adaptations. Our data obtained on model phylogenetic trees and corresponding sets of sequences have shown that the presence of correlated substitutions distorts the results of phylogenetic reconstructions. A method for accounting for co-evolving amino acid residues in phylogenetic analysis is proposed. According to this method, only a single site from the group of correlated amino acid positions should remain, whereas other positions should not be used in further phylogenetic analysis. Simulations performed have shown that replacement on the average of 8% of variable positions in a pair of model sequences by coordinately evolving amino acid residues is able to change the tree topology. The removal of such amino acid residues from sequences before phylogenetic analysis restores the correct topology. PMID:18205620

  7. Genomes, neurotoxins and biology of Clostridium botulinum Group I and Group II

    PubMed Central

    Carter, Andrew T.; Peck, Michael W.

    2015-01-01

    Recent developments in whole genome sequencing have made a substantial contribution to understanding the genomes, neurotoxins and biology of Clostridium botulinum Group I (proteolytic C. botulinum) and C. botulinum Group II (non-proteolytic C. botulinum). Two different approaches are used to study genomics in these bacteria; comparative whole genome microarrays and direct comparison of complete genome DNA sequences. The properties of the different types of neurotoxin formed, and different neurotoxin gene clusters found in C. botulinum Groups I and II are explored. Specific examples of botulinum neurotoxin genes are chosen for an in-depth discussion of neurotoxin gene evolution. The most recent cases of foodborne botulism are summarised. PMID:25445012

  8. Genomic characterization of Italian Clostridium botulinum group I strains.

    PubMed

    Giordani, Francesco; Fillo, Silvia; Anselmo, Anna; Palozzi, Anna Maria; Fortunato, Antonella; Gentile, Bernardina; Azarnia Tehran, Domenico; Ciammaruconi, Andrea; Spagnolo, Ferdinando; Pittiglio, Valentina; Anniballi, Fabrizio; Auricchio, Bruna; De Medici, Dario; Lista, Florigio

    2015-12-01

    Clostridium botulinum is a gram-positive bacterium capable of producing the botulinum neurotoxin, a powerful poison that causes botulism, a severe neuroparalytic disease. Its genome has been sequenced entirely and its gene content has been analyzed. To date, 19 full genomes and 64 draft genomes are available. The geographical origin of these genomes is predominantly from the US. In the present study, 10 Italian genomes of C. botulinum group I were analyzed and compared with previously sequenced group I genomes, in order to genetically characterize the Italian population of C. botulinum group I and to investigate the phylogenetic relationships among different lineages. Using the suites of software ClonalFrame and ClonalOrigin to perform genomic analysis, we demonstrated that Italian C. botulinum group I population is phylogenetically heterogeneous encompassing different and distant lineages including overseas strains, too. Moreover, a high recombination rate was demonstrated in the evolution of C. botulinum group I species. Finally, genome sequencing of the strain 357 led us to identify a novel botulinum neurotoxin subtype, F8. PMID:26341861

  9. Comparative genomics of the Campylobacter lari group

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The Campylobacter lari group is a phylogenetic clade within the epsilon subdivision of the Proteobacteria and is part of the thermotolerant campylobacters, a division within the genus that includes the human pathogen Campylobacter jejuni. The lari group is currently composed of five validly-named sp...

  10. Indel Group in Genomes (IGG) Molecular Genetic Markers.

    PubMed

    Toal, Ted W; Burkart-Waco, Diana; Howell, Tyson; Ron, Mily; Kuppu, Sundaram; Britt, Anne; Chetelat, Roger; Brady, Siobhan M

    2016-09-01

    Genetic markers are essential when developing or working with genetically variable populations. Indel Group in Genomes (IGG) markers are primer pairs that amplify single-locus sequences that differ in size for two or more alleles. They are attractive for their ease of use for rapid genotyping and their codominant nature. Here, we describe a heuristic algorithm that uses a k-mer-based approach to search two or more genome sequences to locate polymorphic regions suitable for designing candidate IGG marker primers. As input to the IGG pipeline software, the user provides genome sequences and the desired amplicon sizes and size differences. Primer sequences flanking polymorphic insertions/deletions are produced as output. IGG marker files for three sets of genomes, Solanum lycopersicum/Solanum pennellii, Arabidopsis (Arabidopsis thaliana) Columbia-0/Landsberg erecta-0 accessions, and S. lycopersicum/S. pennellii/Solanum tuberosum (three-way polymorphic) are included. PMID:27436831

  11. Consistency of genome-wide associations across major ancestral groups.

    PubMed

    Ntzani, Evangelia E; Liberopoulos, George; Manolio, Teri A; Ioannidis, John P A

    2012-07-01

    It is not well known whether genetic markers identified through genome-wide association studies (GWAS) confer similar or different risks across people of different ancestry. We screened a regularly updated catalog of all published GWAS curated at the NHGRI website for GWAS-identified associations that had reached genome-wide significance (p ≤ 5 × 10(-8)) in at least one major ancestry group (European, Asian, African) and for which replication data were available for comparison in at least two different major ancestry groups. These groups were compared for the correlation between and differences in risk allele frequencies and genetic effects' estimates. Data on 108 eligible GWAS-identified associations with a total of 900 datasets (European, n = 624; Asian, n = 217; African, n = 60) were analyzed. Risk-allele frequencies were modestly correlated between ancestry groups, with >10% absolute differences in 75-89% of the three pairwise comparisons of ancestry groups. Genetic effect (odds ratio) point estimates between ancestry groups correlated modestly (pairwise comparisons' correlation coefficients: 0.20-0.33) and point estimates of risks were opposite in direction or differed more than twofold in 57%, 79%, and 89% of the European versus Asian, European versus African, and Asian versus African comparisons, respectively. The modest correlations, differing risk estimates, and considerable between-association heterogeneity suggest that differential ancestral effects can be anticipated and genomic risk markers may need separate further evaluation in different ancestry groups. PMID:22183176

  12. Cooperative behavior and phase transitions in co-evolving stag hunt game

    NASA Astrophysics Data System (ADS)

    Zhang, W.; Li, Y. S.; Xu, C.; Hui, P. M.

    2016-02-01

    Cooperative behavior and different phases in a co-evolving network dynamics based on the stag hunt game is studied. The dynamical processes are parameterized by a payoff r that tends to promote non-cooperative behavior and a probability q for a rewiring attempt that could isolate the non-cooperators. The interplay between the parameters leads to different phases. Detailed simulations and a mean field theory are employed to reveal the properties of different phases. For small r, the cooperators are the majority and form a connected cluster while the non-cooperators increase with q but remain isolated over the whole range of q, and it is a static phase. For sufficiently large r, cooperators disappear in an intermediate range qL ≤ q ≤qU and a dynamical all-non-cooperators phase results. For q >qU, a static phase results again. A mean field theory based on how the link densities change in time by the co-evolving dynamics is constructed. The theory gives a phase diagram in the q- r parameter space that is qualitatively in agreement with simulation results. The sources of discrepancies between theory and simulations are discussed.

  13. Comparative genome analysis of Bacillus cereus group genomes withBacillus subtilis

    SciTech Connect

    Anderson, Iain; Sorokin, Alexei; Kapatral, Vinayak; Reznik, Gary; Bhattacharya, Anamitra; Mikhailova, Natalia; Burd, Henry; Joukov, Victor; Kaznadzey, Denis; Walunas, Theresa; D'Souza, Mark; Larsen, Niels; Pusch,Gordon; Liolios, Konstantinos; Grechkin, Yuri; Lapidus, Alla; Goltsman,Eugene; Chu, Lien; Fonstein, Michael; Ehrlich, S. Dusko; Overbeek, Ross; Kyrpides, Nikos; Ivanova, Natalia

    2005-09-14

    Genome features of the Bacillus cereus group genomes (representative strains of Bacillus cereus, Bacillus anthracis and Bacillus thuringiensis sub spp israelensis) were analyzed and compared with the Bacillus subtilis genome. A core set of 1,381 protein families among the four Bacillus genomes, with an additional set of 933 families common to the B. cereus group, was identified. Differences in signal transduction pathways, membrane transporters, cell surface structures, cell wall, and S-layer proteins suggesting differences in their phenotype were identified. The B. cereus group has signal transduction systems including a tyrosine kinase related to two-component system histidine kinases from B. subtilis. A model for regulation of the stress responsive sigma factor sigmaB in the B. cereus group different from the well studied regulation in B. subtilis has been proposed. Despite a high degree of chromosomal synteny among these genomes, significant differences in cell wall and spore coat proteins that contribute to the survival and adaptation in specific hosts has been identified.

  14. A SEARCH FOR CO-EVOLVING ION AND NEUTRAL GAS SPECIES IN PRESTELLAR MOLECULAR CLOUD CORES

    SciTech Connect

    Tassis, Konstantinos; Hezareh, Talayeh; Willacy, Karen

    2012-11-20

    A comparison between the widths of ion and neutral molecule spectral lines has been recently used to estimate the strength of the magnetic field in turbulent star-forming regions. However, the ion (HCO{sup +}) and neutral (HCN) species used in such studies may not be necessarily co-evolving at every scale and density, and thus, may not trace the same regions. Here, we use coupled chemical/dynamical models of evolving prestellar molecular cloud cores including non-equilibrium chemistry, with and without magnetic fields, to study the spatial distribution of HCO{sup +} and HCN, which have been used in observations of spectral line width differences to date. In addition, we seek new ion-neutral pairs that are good candidates for such observations, because they have similar evolution and are approximately co-spatial in our models. We identify three such good candidate pairs: HCO{sup +}/NO, HCO{sup +}/CO, and NO{sup +}/NO.

  15. 77 FR 75425 - Interagency Working Group on Plant Genomics (IWGPG): The National Plant Genome Initiative-What's...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-12-20

    ... Interagency Working Group on Plant Genomics (IWGPG): The National Plant Genome Initiative--What's Next? AGENCY: Office of Science, Office of Biological and Environmental Research, Department of Energy (DOE). ACTION... Group on Plant Genomics (IWGPG). DATES: Saturday, January 12, 2013, 1:30 p.m. to 3:40 p.m....

  16. Comparative Genomics of the Staphylococcus intermedius Group of Animal Pathogens

    PubMed Central

    Ben Zakour, Nouri L.; Beatson, Scott A.; van den Broek, Adri H. M.; Thoday, Keith L.; Fitzgerald, J. Ross

    2012-01-01

    The Staphylococcus intermedius group consists of three closely related coagulase-positive bacterial species including S. intermedius, Staphylococcus pseudintermedius, and Staphylococcus delphini. S. pseudintermedius is a major skin pathogen of dogs, which occasionally causes severe zoonotic infections of humans. S. delphini has been isolated from an array of different animals including horses, mink, and pigeons, whereas S. intermedius has been isolated only from pigeons to date. Here we provide a detailed analysis of the S. pseudintermedius whole genome sequence in comparison to high quality draft S. intermedius and S. delphini genomes, and to other sequenced staphylococcal species. The core genome of the SIG was highly conserved with average nucleotide identity (ANI) between the three species of 93.61%, which is very close to the threshold of species delineation (95% ANI), highlighting the close-relatedness of the SIG species. However, considerable variation was identified in the content of mobile genetic elements, cell wall-associated proteins, and iron and sugar transporters, reflecting the distinct ecological niches inhabited. Of note, S. pseudintermedius ED99 contained a clustered regularly interspaced short palindromic repeat locus of the Nmeni subtype and S. intermedius contained both Nmeni and Mtube subtypes. In contrast to S. intermedius and S. delphini and most other staphylococci examined to date, S. pseudintermedius contained at least nine predicted reverse transcriptase Group II introns. Furthermore, S. pseudintermedius ED99 encoded several transposons which were largely responsible for its multi-resistant phenotype. Overall, the study highlights extensive differences in accessory genome content between closely related staphylococcal species inhabiting distinct host niches, providing new avenues for research into pathogenesis and bacterial host-adaptation. PMID:22919635

  17. Analysis of co-evolving soil depths, vegetation patterns, and connectivity on drylands.

    NASA Astrophysics Data System (ADS)

    Saco, Patricia; Willgoose, Garry

    2014-05-01

    Arid and semiarid landscapes cover more than 30% of the Earth's surface. Vegetation in these areas is usually patchy due limited resource availability. This self-organized patchiness results from the nonlinear feedbacks between water redistribution, soils, landforms, and biota. These complex interactions make the understanding and prediction of landscape responses to climate and land use change highly challenging. Though several models have been recently developed and used to understand these feedbacks and the emergence of vegetation patterns in drylands, these models do not explicitly incorporate feedbacks with coevolving soil depths. Here we analyse feedback effects resulting from co-evolving soil depths, which play a key role in the redistribution of surface runoff and therefore on the patterns of vegetation and landscape connectivity. We present modelling results using a coupled landform evolution-dynamic vegetation model, which includes a soil depth evolution module accounts and for soil production and sediment erosion and deposition processes. We analyse the co-evolution of soil depths and vegetation patterns for varying soil erodibilities, slopes and plant functional types. We find that for deeper soils, facilitation effects of vegetation gives rise to the formation of regular patterns, and slope and soil erodibility are the key factors for recovery after disturbance. Disturbances in areas with high slope and/or soil erodibility lead to an increase in connectivity and a degraded state. In contrast, we find that for shallow soils, the facilitation effect of vegetation becomes less important and vegetation patterns are more irregular. In this case, soil depth becomes the key factor prescribing surface connectivity and for the recovery of the system after disturbance. These results have critical implications for effective management and restoration efforts, and for understanding the effects of changes in climate and land use.

  18. Simultaneous prediction of transcription factor binding sites in a group of prokaryotic genomes

    PubMed Central

    2010-01-01

    Background Our current understanding of transcription factor binding sites (TFBSs) in sequenced prokaryotic genomes is very limited due to the lack of an accurate and efficient computational method for the prediction of TFBSs at a genome scale. In an attempt to change this situation, we have recently developed a comparative genomics based algorithm called GLECLUBS for de novo genome-wide prediction of TFBSs in a target genome. Although GLECLUBS has achieved rather high prediction accuracy of TFBSs in a target genome, it is still not efficient enough to be applied to all the sequenced prokaryotic genomes. Results Here, we designed a new algorithm based on GLECLUBS called extended GLECLUBS (eGLECLUBS) for simultaneous prediction of TFBSs in a group of related prokaryotic genomes. When tested on a group of γ-proteobacterial genomes including E. coli K12, a group of firmicutes genomes including B. subtilis and a group of cyanobacterial genomes using the same parameter settings, eGLECLUBS predicts more than 82% of known TFBSs in extracted inter-operonic sequences in both E. coli K12 and B. subtilis. Because each genome in a group is equally treated, it is highly likely that similar prediction accuracy has been achieved for each genome in the group. Conclusions We have developed a new algorithm for genome-wide de novo prediction of TFBSs in a group of related prokaryotic genomes. The algorithm has achieved the same level of accuracy and robustness as its predecessor GLECLUBS, but can work on dozens of genomes at the same time. PMID:20653963

  19. Partially satisfied to fully satisfied transitions in co-evolving inverse voter model and possible scaling behavior

    NASA Astrophysics Data System (ADS)

    Choi, C. W.; Xu, C.; Hui, P. M.

    2015-12-01

    Understanding co-evolving networks characterized by the mutual influence of agents' actions and network structure remains a challenge. We study a co-evolving inverse voter model in which agents adapt to achieve a preferred environment with more opposite-opinion neighbors by rewiring their connections and switching opinion. Numerical studies reveal a transition from a dynamic partially satisfied phase to a frozen fully satisfied phase as the rewiring probability is varied. A simple mean field theory is shown to capture the behavior only qualitatively. An improved mean field theory carrying a longer spatial correlation gives better results. Motivated by numerical results in networks of different degrees and mean field results, we propose a scaling variable that combines the rewiring probability and mean degree in a special form. The scaling variable is shown to work well in analyzing data corresponding to different networks and different rewiring probabilities. An application is to predict the results for networks of different degrees based solely on results obtained from networks of one degree. Studying scaling behavior provides an alternative path for understanding co-evolving agent-based dynamical systems, especially in light of the trade-off between complexity of a theory and its accuracy.

  20. Identification of co-evolving sites in the ligand binding domain of G protein-coupled receptors using mutual information

    NASA Astrophysics Data System (ADS)

    Fatakia, Sarosh N.; Costanzi, Stefano; Chow, Carson C.

    2008-03-01

    G protein-coupled receptors (GPCRs) are the largest superfamily of membrane proteins in humans. They are involved in signal transduction in numerous cellular processes and are the most common target for pharmacological intervention via activation or inhibition. Identification of functionally important sites is relevant for better understanding the ligand-receptor interaction and therefore for drug delivery. In a superfamily of proteins, functionally important but co-evolving sites are not easily identified in a multiple sequence alignment (MSA). Using a MSA of trans-membrane (TM) domains of GPCR superfamily, we identify sites which co-evolve, and may therefore be functionally important. Assigning the TM site as a node and the MI of site pairs as an inverse inter-node distance, a MI graph is established. Co-evolving sites are then identified via this graph. Nodes characterized by high connectivity are located within the commonly accepted ligand binding site of GPCRs, suggesting that concerted co-evolution of a number of neighboring residues gave rise to a multitude of subfamilies each recognizing a specific set of ligands. MI and graph analysis may serve as a tool for the identification of topologically conserved binding pockets in the families of evolutionarily related proteins.

  1. Complete Genomic Sequence for an Avian Group G Rotavirus from South Africa

    PubMed Central

    Stucker, Karla M.; Stockwell, Timothy B.; Nyaga, Martin M.; Halpin, Rebecca A.; Fedorova, Nadia; Akopov, Asmik; Ngoveni, Harry; Peenze, Ina; Seheri, Mapaseka L.; Mphahlele, M. Jeffrey

    2015-01-01

    We report the first complete sequence for an avian group G rotavirus (RVG) genome from Africa, which is the third publically available RVG genome. These RVG genomes are highly diverse, especially in their VP4, VP7, NSP4, and NSP3 segments, indicating that RVG diversity is comparable to that of rotavirus A. PMID:25767240

  2. Complete genomic sequence for an avian group G rotavirus from South Africa.

    PubMed

    Stucker, Karla M; Stockwell, Timothy B; Nyaga, Martin M; Halpin, Rebecca A; Fedorova, Nadia; Akopov, Asmik; Ngoveni, Harry; Peenze, Ina; Seheri, Mapaseka L; Mphahlele, M Jeffrey; Wentworth, David E

    2015-01-01

    We report the first complete sequence for an avian group G rotavirus (RVG) genome from Africa, which is the third publically available RVG genome. These RVG genomes are highly diverse, especially in their VP4, VP7, NSP4, and NSP3 segments, indicating that RVG diversity is comparable to that of rotavirus A. PMID:25767240

  3. Principles of Genome Evolution in the Drosophila melanogaster Species Group

    PubMed Central

    Ranz, José M; Maurin, Damien; Chan, Yuk S; von Grotthuss, Marcin; Hillier, LaDeana W; Roote, John; Ashburner, Michael; Bergman, Casey M

    2007-01-01

    That closely related species often differ by chromosomal inversions was discovered by Sturtevant and Plunkett in 1926. Our knowledge of how these inversions originate is still very limited, although a prevailing view is that they are facilitated by ectopic recombination events between inverted repetitive sequences. The availability of genome sequences of related species now allows us to study in detail the mechanisms that generate interspecific inversions. We have analyzed the breakpoint regions of the 29 inversions that differentiate the chromosomes of Drosophila melanogaster and two closely related species, D. simulans and D. yakuba, and reconstructed the molecular events that underlie their origin. Experimental and computational analysis revealed that the breakpoint regions of 59% of the inversions (17/29) are associated with inverted duplications of genes or other nonrepetitive sequences. In only two cases do we find evidence for inverted repetitive sequences in inversion breakpoints. We propose that the presence of inverted duplications associated with inversion breakpoint regions is the result of staggered breaks, either isochromatid or chromatid, and that this, rather than ectopic exchange between inverted repetitive sequences, is the prevalent mechanism for the generation of inversions in the melanogaster species group. Outgroup analysis also revealed evidence for widespread breakpoint recycling. Lastly, we have found that expression domains in D. melanogaster may be disrupted in D. yakuba, bringing into question their potential adaptive significance. PMID:17550304

  4. Analysis of co-evolving genes in campylobacter jejuni and C. coli

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: The population structure of Campylobacter has been frequently studied by MLST, for which fragments of housekeeping genes are compared. We wished to determine if the used MLST genes are representative of the complete genome. Methods: A set of 1029 core gene families (CGF) was identifie...

  5. Genome sequence of a proteolytic (Group I) Clostridium botulinum strain Hall A and comparative analysis of the clostridial genomes

    PubMed Central

    Sebaihia, Mohammed; Peck, Michael W.; Minton, Nigel P.; Thomson, Nicholas R.; Holden, Matthew T.G.; Mitchell, Wilfrid J.; Carter, Andrew T.; Bentley, Stephen D.; Mason, David R.; Crossman, Lisa; Paul, Catherine J.; Ivens, Alasdair; Wells-Bennik, Marjon H.J.; Davis, Ian J.; Cerdeño-Tárraga, Ana M.; Churcher, Carol; Quail, Michael A.; Chillingworth, Tracey; Feltwell, Theresa; Fraser, Audrey; Goodhead, Ian; Hance, Zahra; Jagels, Kay; Larke, Natasha; Maddison, Mark; Moule, Sharon; Mungall, Karen; Norbertczak, Halina; Rabbinowitsch, Ester; Sanders, Mandy; Simmonds, Mark; White, Brian; Whithead, Sally; Parkhill, Julian

    2007-01-01

    Clostridium botulinum is a heterogeneous Gram-positive species that comprises four genetically and physiologically distinct groups of bacteria that share the ability to produce botulinum neurotoxin, the most poisonous toxin known to man, and the causative agent of botulism, a severe disease of humans and animals. We report here the complete genome sequence of a representative of Group I (proteolytic) C. botulinum (strain Hall A, ATCC 3502). The genome consists of a chromosome (3,886,916 bp) and a plasmid (16,344 bp), which carry 3650 and 19 predicted genes, respectively. Consistent with the proteolytic phenotype of this strain, the genome harbors a large number of genes encoding secreted proteases and enzymes involved in uptake and metabolism of amino acids. The genome also reveals a hitherto unknown ability of C. botulinum to degrade chitin. There is a significant lack of recently acquired DNA, indicating a stable genomic content, in strong contrast to the fluid genome of Clostridium difficile, which can form longer-term relationships with its host. Overall, the genome indicates that C. botulinum is adapted to a saprophytic lifestyle both in soil and aquatic environments. This pathogen relies on its toxin to rapidly kill a wide range of prey species, and to gain access to nutrient sources, it releases a large number of extracellular enzymes to soften and destroy rotting or decayed tissues. PMID:17519437

  6. Modeling growth and dissemination of lymphoma in a co-evolving lymph node: a diffuse-domain approach

    NASA Astrophysics Data System (ADS)

    Chuang, Yao-Li; Cristini, Vittorio; Chen, Ying; Li, Xiangrong; Frieboes, Hermann; Lowengrub, John

    2013-03-01

    While partial differential equation models of tumor growth have successfully described various spatiotemporal phenomena observed for in-vitro tumor spheroid experiments, one challenge towards taking these models to further study in-vivo tumors is that instead of relatively static tissue culture with regular boundary conditions, in-vivo tumors are often confined in organ tissues that co-evolve with the tumor growth. Here we adopt a recently developed diffuse-domain method to account for the co-evolving domain boundaries, adapting our previous in-vitro tumor model for the development of lymphoma encapsulated in a lymph node, which may swell or shrink due to proliferation and dissemination of lymphoma cells and treatment by chemotherapy. We use the model to study the induced spatial heterogeneity, which may arise as an emerging phenomenon in experimental observations and model analysis. Spatial heterogeneity is believed to lead to tumor infiltration patterns and reduce the efficacy of chemotherapy, leaving residuals that cause cancer relapse after the treatment. Understanding the spatiotemporal evolution of in-vivo tumors can be an essential step towards more effective strategies of curing cancer. Supported by NIH-PSOC grant 1U54CA143907-01.

  7. Comparative genomics of the bacterial genus Streptococcus illuminates evolutionary implications of species groups.

    PubMed

    Gao, Xiao-Yang; Zhi, Xiao-Yang; Li, Hong-Wei; Klenk, Hans-Peter; Li, Wen-Jun

    2014-01-01

    Members of the genus Streptococcus within the phylum Firmicutes are among the most diverse and significant zoonotic pathogens. This genus has gone through considerable taxonomic revision due to increasing improvements of chemotaxonomic approaches, DNA hybridization and 16S rRNA gene sequencing. It is proposed to place the majority of streptococci into "species groups". However, the evolutionary implications of species groups are not clear presently. We use comparative genomic approaches to yield a better understanding of the evolution of Streptococcus through genome dynamics, population structure, phylogenies and virulence factor distribution of species groups. Genome dynamics analyses indicate that the pan-genome size increases with the addition of newly sequenced strains, while the core genome size decreases with sequential addition at the genus level and species group level. Population structure analysis reveals two distinct lineages, one including Pyogenic, Bovis, Mutans and Salivarius groups, and the other including Mitis, Anginosus and Unknown groups. Phylogenetic dendrograms show that species within the same species group cluster together, and infer two main clades in accordance with population structure analysis. Distribution of streptococcal virulence factors has no obvious patterns among the species groups; however, the evolution of some common virulence factors is congruous with the evolution of species groups, according to phylogenetic inference. We suggest that the proposed streptococcal species groups are reasonable from the viewpoints of comparative genomics; evolution of the genus is congruent with the individual evolutionary trajectories of different species groups. PMID:24977706

  8. Comparative Genomics of the Bacterial Genus Streptococcus Illuminates Evolutionary Implications of Species Groups

    PubMed Central

    Gao, Xiao-Yang; Zhi, Xiao-Yang; Li, Hong-Wei; Klenk, Hans-Peter; Li, Wen-Jun

    2014-01-01

    Members of the genus Streptococcus within the phylum Firmicutes are among the most diverse and significant zoonotic pathogens. This genus has gone through considerable taxonomic revision due to increasing improvements of chemotaxonomic approaches, DNA hybridization and 16S rRNA gene sequencing. It is proposed to place the majority of streptococci into “species groups”. However, the evolutionary implications of species groups are not clear presently. We use comparative genomic approaches to yield a better understanding of the evolution of Streptococcus through genome dynamics, population structure, phylogenies and virulence factor distribution of species groups. Genome dynamics analyses indicate that the pan-genome size increases with the addition of newly sequenced strains, while the core genome size decreases with sequential addition at the genus level and species group level. Population structure analysis reveals two distinct lineages, one including Pyogenic, Bovis, Mutans and Salivarius groups, and the other including Mitis, Anginosus and Unknown groups. Phylogenetic dendrograms show that species within the same species group cluster together, and infer two main clades in accordance with population structure analysis. Distribution of streptococcal virulence factors has no obvious patterns among the species groups; however, the evolution of some common virulence factors is congruous with the evolution of species groups, according to phylogenetic inference. We suggest that the proposed streptococcal species groups are reasonable from the viewpoints of comparative genomics; evolution of the genus is congruent with the individual evolutionary trajectories of different species groups. PMID:24977706

  9. PhyloGene server for identification and visualization of co-evolving proteins using normalized phylogenetic profiles

    PubMed Central

    Sadreyev, Ilyas R.; Ji, Fei; Cohen, Emiliano; Ruvkun, Gary; Tabach, Yuval

    2015-01-01

    Proteins that function in the same pathways, protein complexes or the same environmental conditions can show similar patterns of sequence conservation across phylogenetic clades. In species that no longer require a specific protein complex or pathway, these proteins, as a group, tend to be lost or diverge. Analysis of the similarity in patterns of sequence conservation across a large set of eukaryotes can predict functional associations between different proteins, identify new pathway members and reveal the function of previously uncharacterized proteins. We used normalized phylogenetic profiling to predict protein function and identify new pathway members and disease genes. The phylogenetic profiles of tens of thousands conserved proteins in the human, mouse, Caenorhabditis elegans and Drosophila genomes can be queried on our new web server, PhyloGene. PhyloGene provides intuitive and user-friendly platform to query the patterns of conservation across 86 animal, fungal, plant and protist genomes. A protein query can be submitted either by selecting the name from whole-genome protein sets of the intensively studied species or by entering a protein sequence. The graphic output shows the profile of sequence conservation for the query and the most similar phylogenetic profiles for the proteins in the genome of choice. The user can also download this output in numerical form. PMID:25958392

  10. Genomic Sequence or Signature Tags (GSTs) from the Genome Group at Brookhaven National Laboratory (BNL)

    DOE Data Explorer

    Dunn, John J.; McCorkle, Sean R.; Praissman, Laura A.; Hind, Geoffrey; Van der Lelie, Daniel; Bahou, Wadie F.; Gnatenko, Dmitri V.; Krause, Maureen K.

    Genomic Signature Tags (GSTs) are the products of a method we have developed for identifying and quantitatively analyzing genomic DNAs. The DNA is initially fragmented with a type II restriction enzyme. An oligonucleotide adaptor containing a recognition site for MmeI, a type IIS restriction enzyme, is then used to release 21-bp tags from fixed positions in the DNA relative to the sites recognized by the fragmenting enzyme. These tags are PCR-amplified, purified, concatenated and then cloned and sequenced. The tag sequences and abundances are used to create a high resolution GST sequence profile of the genomic DNA. [Quoted from Genomic Signature Tags (GSTs): A System for Profiling Genomic DNA, Dunn, John J.; McCorkle, Sean R.; Praissman, Laura A.; Hind, Geoffrey; Van der Lelie, Daniel; Bahou, Wadie F.; Gnatenko, Dmitri V.; Krause, Maureen K., Revised 9/13/2002

  11. Comparative genomics and functional analysis of the 936 group of lactococcal Siphoviridae phages

    PubMed Central

    Murphy, James; Bottacini, Francesca; Mahony, Jennifer; Kelleher, Philip; Neve, Horst; Zomer, Aldert; Nauta, Arjen; van Sinderen, Douwe

    2016-01-01

    Genome sequencing and comparative analysis of bacteriophage collections has greatly enhanced our understanding regarding their prevalence, phage-host interactions as well as the overall biodiversity of their genomes. This knowledge is very relevant to phages infecting Lactococcus lactis, since they constitute a significant risk factor for dairy fermentations. Of the eighty four lactococcal phage genomes currently available, fifty five belong to the so-called 936 group, the most prevalent of the ten currently recognized lactococcal phage groups. Here, we report the genetic characteristics of a new collection of 936 group phages. By combining these genomes to those sequenced previously we determined the core and variable elements of the 936 genome. Genomic variation occurs across the 936 phage genome, such as genetic elements that (i) lead to a +1 translational frameshift resulting in the formation of additional structures on the phage tail, (ii) specify a double neck passage structure, and (iii) encode packaging module-associated methylases. Hierarchical clustering of the gene complement of the 936 group phages and nucleotide alignments allowed grouping of the ninety 936 group phages into distinct clusters, which in general appear to correspond with their geographical origin. PMID:26892066

  12. Application of Whole-Genome Sequencing to an Unusual Outbreak of Invasive Group A Streptococcal Disease

    PubMed Central

    Galloway-Peña, Jessica; Clement, Meredith E.; Sharma Kuinkel, Batu K.; Ruffin, Felicia; Flores, Anthony R.; Levinson, Howard; Shelburne, Samuel A.; Moore, Zack; Fowler, Vance G.

    2016-01-01

    Whole-genome analysis was applied to investigate atypical point-source transmission of 2 invasive group A streptococcal (GAS) infections. Isolates were serotype M4, ST39, and genetically indistinguishable. Comparison with MGAS10750 revealed nonsynonymous polymorphisms in ropB and increased speB transcription. This study demonstrates the usefulness of whole-genome analyses for GAS outbreaks. PMID:27006966

  13. Sulfur-oxidizing symbionts have not co-evolved with their hydrothermal vent tube worm hosts: an RFLP analysis.

    PubMed

    Laue, B E; Nelson, D C

    1997-09-01

    A fine-scale phylogenetic comparison was made among the symbionts of different genera of hydrothermal vent tube worms. These included Riftia pachyptila and Tevnia jerichonona, which inhabit sites along the east Pacific Rise, and Ridgeia piscesae from the Juan de Fuca Ridge. An analysis of restriction fragment length polymorphism (RFLP) was employed using three symbiont-specific gene probes: eubacterial 16S rRNA, RuBPC/O Form II, and ATP sulfurylase (recently cloned from the Riftia symbiont). Results indicated that all of the symbionts from the three different hosts were conspecific and the Riftia and Tevnia symbionts were indistinguishable over and 1800-km range. Significantly, this indicates that the symbionts have not co-evolved with their respective hosts, which are known to belong to separate families. This study strongly supports the conclusion that the symbionts are acquired de novo by each generation of juvenile tube worms from a common source in the surrounding sea water. PMID:9284558

  14. Genome Sequence of Acidovorax citrulli Group 1 Strain pslb65 Causing Bacterial Fruit Blotch of Melons

    PubMed Central

    Wang, Tielin; Sun, Baixin; Yang, Yuwen

    2015-01-01

    Acidovorax citrulli is typed into two groups, mainly based on the host. We determined the draft genome of A. citrulli group 1 strain pslb65. The strain was isolated from melon collected from Xinjiang province, China. The A. citrulli pslb65 genome contains 4,903,443 bp and has a G+C content of 68.8 mol%. PMID:25908136

  15. Phage Morphology Recapitulates Phylogeny: The Comparative Genomics of a New Group of Myoviruses

    PubMed Central

    Comeau, André M.; Tremblay, Denise; Moineau, Sylvain; Rattei, Thomas; Kushkina, Alla I.; Tovkach, Fedor I.; Krisch, Henry M.; Ackermann, Hans-Wolfgang

    2012-01-01

    Among dsDNA tailed bacteriophages (Caudovirales), members of the Myoviridae family have the most sophisticated virion design that includes a complex contractile tail structure. The Myoviridae generally have larger genomes than the other phage families. Relatively few “dwarf” myoviruses, those with a genome size of less than 50 kb such as those of the Mu group, have been analyzed in extenso. Here we report on the genome sequencing and morphological characterization of a new group of such phages that infect a diverse range of Proteobacteria, namely Aeromonas salmonicida phage 56, Vibrio cholerae phages 138 and CP-T1, Bdellovibrio phage φ1422, and Pectobacterium carotovorum phage ZF40. This group of dwarf myoviruses shares an identical virion morphology, characterized by usually short contractile tails, and have genome sizes of approximately 45 kb. Although their genome sequences are variable in their lysogeny, replication, and host adaption modules, presumably reflecting differing lifestyles and hosts, their structural and morphogenesis modules have been evolutionarily constrained by their virion morphology. Comparative genomic analysis reveals that these phages, along with related prophage genomes, form a new coherent group within the Myoviridae. The results presented in this communication support the hypothesis that the diversity of phages may be more structured than generally believed and that the innumerable phages in the biosphere all belong to discrete lineages or families. PMID:22792219

  16. A quantitative measure for organization of complex and co-evolving networks

    NASA Astrophysics Data System (ADS)

    Georgiev, Georgi

    2012-02-01

    To define evolution and self-organization in complex networks a quantitative measure for organization is necessary. Two systems should be numerically distinguishable by their degree of organization and their rate of self-organization. Here we apply as a measure for quantity of organization the inverse of the average sum of physical actions of all elements in a system per unit motion multiplied by the Planck's constant. The meaning of quantity of organization here is the number of quanta of action per one unit motion of an element. For example, a unit motion for electrons on a computer chip is the one necessary for one computation. This definition can be applied to the organization in any complex system. Systems self-organize to decrease the average action per element per unit motion in them. This is the attractor for a dynamical, nonlinear system evolving in time. Constraints increase this average action, so constraint minimization is a basic mechanism for action minimization. Increase of quantity of elements in the network, leads to faster constraint minimization through grouping, decrease of average action per element and motion and therefore faster self-organization and evolution.

  17. Nitrogen limitation as a driver of genome size evolution in a group of karst plants.

    PubMed

    Kang, Ming; Wang, Jing; Huang, Hongwen

    2015-01-01

    Genome size is of fundamental biological importance with significance in predicting structural and functional attributes of organisms. Although abundant evidence has shown that the genome size can be largely explained by differential proliferation and removal of non-coding DNA of the genome, the evolutionary and ecological basis of genome size variation remains poorly understood. Nitrogen (N) and phosphorus (P) are essential elements of DNA and protein building blocks, yet often subject to environmental limitation in natural ecosystems. Using phylogenetic comparative methods, we test this hypothesis by determining whether leaf N and P availability affects genome sizes in 99 species of Primulina (Gesneriaceae), a group of soil specialists adapted to limestone karst environment in south China. We find that genome sizes in Primulina are strongly positively correlated with plant N content, but the correlation with plant P content is not significant when phylogeny history was taken into account. This study shows for the first time that N limitation might have been a plausible driver of genome size variation in a group of plants. We propose that competition for nitrogen nutrient between DNA synthesis and cellular functions is a possible mechanism for genome size evolution in Primulina under N-limitation. PMID:26109237

  18. Nitrogen limitation as a driver of genome size evolution in a group of karst plants

    PubMed Central

    Kang, Ming; Wang, Jing; Huang, Hongwen

    2015-01-01

    Genome size is of fundamental biological importance with significance in predicting structural and functional attributes of organisms. Although abundant evidence has shown that the genome size can be largely explained by differential proliferation and removal of non-coding DNA of the genome, the evolutionary and ecological basis of genome size variation remains poorly understood. Nitrogen (N) and phosphorus (P) are essential elements of DNA and protein building blocks, yet often subject to environmental limitation in natural ecosystems. Using phylogenetic comparative methods, we test this hypothesis by determining whether leaf N and P availability affects genome sizes in 99 species of Primulina (Gesneriaceae), a group of soil specialists adapted to limestone karst environment in south China. We find that genome sizes in Primulina are strongly positively correlated with plant N content, but the correlation with plant P content is not significant when phylogeny history was taken into account. This study shows for the first time that N limitation might have been a plausible driver of genome size variation in a group of plants. We propose that competition for nitrogen nutrient between DNA synthesis and cellular functions is a possible mechanism for genome size evolution in Primulina under N-limitation. PMID:26109237

  19. Nitrogen limitation as a driver of genome size evolution in a group of karst plants

    NASA Astrophysics Data System (ADS)

    Kang, Ming; Wang, Jing; Huang, Hongwen

    2015-06-01

    Genome size is of fundamental biological importance with significance in predicting structural and functional attributes of organisms. Although abundant evidence has shown that the genome size can be largely explained by differential proliferation and removal of non-coding DNA of the genome, the evolutionary and ecological basis of genome size variation remains poorly understood. Nitrogen (N) and phosphorus (P) are essential elements of DNA and protein building blocks, yet often subject to environmental limitation in natural ecosystems. Using phylogenetic comparative methods, we test this hypothesis by determining whether leaf N and P availability affects genome sizes in 99 species of Primulina (Gesneriaceae), a group of soil specialists adapted to limestone karst environment in south China. We find that genome sizes in Primulina are strongly positively correlated with plant N content, but the correlation with plant P content is not significant when phylogeny history was taken into account. This study shows for the first time that N limitation might have been a plausible driver of genome size variation in a group of plants. We propose that competition for nitrogen nutrient between DNA synthesis and cellular functions is a possible mechanism for genome size evolution in Primulina under N-limitation.

  20. Multiple Groups of Novel Retroviral Genomes in Pigs and Related Species

    PubMed Central

    Patience, Clive; Switzer, William M.; Takeuchi, Yasuhiro; Griffiths, David J.; Goward, Melanie E.; Heneine, Walid; Stoye, Jonathan P.; Weiss, Robin A.

    2001-01-01

    In view of the concern over potential infection hazards in the use of porcine tissues and organs for xenotransplantation to humans, we investigated the diversity of porcine endogenous retrovirus (PERV) genomes in the DNA of domestic pigs and related species. In addition to the three known envelope subgroups of infectious gamma retroviruses (PERV-A, -B, and -C), classed together here as PERV group γ1, four novel groups of gamma retrovirus (γ2 to γ5) and four novel groups of beta retrovirus (β1 to β4) genomes were detected in pig DNA using generic and specific PCR primers. PCR quantification indicated that the retroviral genome copy number in the Landrace × Duroc F1 hybrid pig ranged from 2 (β2 and γ5) to approximately 50 (γ1). The γ1, γ2, and β4 genomes were transcribed into RNA in adult kidney tissue. Apart from γ1, the retroviral genomes are not known to be infectious, and sequencing of a small number of amplified genome fragments revealed stop codons in putative open reading frames in several cases. Analysis of DNA from wild boar and other species of Old World pigs (Suidae) and New World peccaries (Tayassuidae) showed that one retrovirus group, β2, was common to all species tested, while the others were present among all Old World species but absent from New World species. The PERV-C subgroup of γ1 genomes segregated among domestic pigs and were absent from two African species (red river hog and warthog). Thus domestic swine and their phylogenetic relatives harbor multiple groups of hitherto undescribed PERV genomes. PMID:11222700

  1. Complete Genome Sequences of Nine Bacillus cereus Group Phages

    PubMed Central

    Foltz, Samantha

    2016-01-01

    We report the sequences of nine novel Bacillus cereus group bacteriophages: DIGNKC, Juglone, Nemo, Nigalana, NotTheCreek, Phrodo, SageFayge, Vinny, and Zuko. These bacteriophages are double-stranded DNA-containing Myoviridae isolated from soil samples using B. thuringiensis subsp. kurstaki as the host bacterium. PMID:27417827

  2. Complete Genome Sequences of Nine Bacillus cereus Group Phages.

    PubMed

    Foltz, Samantha; Johnson, Allison A

    2016-01-01

    We report the sequences of nine novel Bacillus cereus group bacteriophages: DIGNKC, Juglone, Nemo, Nigalana, NotTheCreek, Phrodo, SageFayge, Vinny, and Zuko. These bacteriophages are double-stranded DNA-containing Myoviridae isolated from soil samples using B. thuringiensis subsp. kurstaki as the host bacterium. PMID:27417827

  3. Complete Genome Assembly of Streptococcus pyogenes ATCC 19615, a Group A β-Hemolytic Reference Strain.

    PubMed

    Minogue, T D; Daligault, H A; Davenport, K W; Bishop-Lilly, K A; Broomall, S M; Bruce, D C; Chain, P S; Chertkov, O; Coyne, S R; Freitas, T; Frey, K G; Gibbons, H S; Jaissle, J; Redden, C L; Rosenzweig, C N; Xu, Y; Johnson, S L

    2014-01-01

    We present the complete genome assembly of Streptococcus pyogenes ATCC 19615 (Rosenbach) as submitted to GenBank under accession number CP008926. This group A nonmotile β-hemolytic clinical isolate is used for quality control in a variety of commercially available tests. The assembled genome is 1.84 Mb (38.5% G+C content) and contains 1,788 coding regions. PMID:25258274

  4. Complete Genome Assembly of Streptococcus pyogenes ATCC 19615, a Group A β-Hemolytic Reference Strain

    PubMed Central

    Minogue, T. D.; Daligault, H. A.; Davenport, K. W.; Bishop-Lilly, K. A.; Broomall, S. M.; Bruce, D. C.; Chain, P. S.; Chertkov, O.; Coyne, S. R.; Freitas, T.; Frey, K. G.; Gibbons, H. S.; Jaissle, J.; Redden, C. L.; Rosenzweig, C. N.; Xu, Y.

    2014-01-01

    We present the complete genome assembly of Streptococcus pyogenes ATCC 19615 (Rosenbach) as submitted to GenBank under accession number CP008926. This group A nonmotile β-hemolytic clinical isolate is used for quality control in a variety of commercially available tests. The assembled genome is 1.84 Mb (38.5% G+C content) and contains 1,788 coding regions. PMID:25258274

  5. Co-evolving CENP-A and CAL1 Domains Mediate Centromeric CENP-A Deposition across Drosophila Species.

    PubMed

    Rosin, Leah; Mellone, Barbara G

    2016-04-18

    Centromeres mediate the conserved process of chromosome segregation, yet centromeric DNA and the centromeric histone, CENP-A, are rapidly evolving. The rapid evolution of Drosophila CENP-A loop 1 (L1) is thought to modulate the DNA-binding preferences of CENP-A to counteract centromere drive, the preferential transmission of chromosomes with expanded centromeric satellites. Consistent with this model, CENP-A from Drosophila bipectinata (bip) cannot localize to Drosophila melanogaster (mel) centromeres. We show that this result is due to the inability of the mel CENP-A chaperone, CAL1, to deposit bip CENP-A into chromatin. Co-expression of bip CENP-A and bip CAL1 in mel cells restores centromeric localization, and similar findings apply to other Drosophila species. We identify two co-evolving regions, CENP-A L1 and the CAL1 N terminus, as critical for lineage-specific CENP-A incorporation. Collectively, our data show that the rapid evolution of L1 modulates CAL1-mediated CENP-A assembly, suggesting an alternative mechanism for the suppression of centromere drive. PMID:27093083

  6. Characterization and comparative genomic analysis of bacteriophages infecting members of the Bacillus cereus group.

    PubMed

    Lee, Ju-Hoon; Shin, Hakdong; Ryu, Sangryeol

    2014-05-01

    The Bacillus cereus group phages infecting B. cereus, B. anthracis, and B. thuringiensis (Bt) have been studied at the molecular level and, recently, at the genomic level to control the pathogens B. cereus and B. anthracis and to prevent phage contamination of the natural insect pesticide Bt. A comparative phylogenetic analysis has revealed three different major phage groups with different morphologies (Myoviridae for group I, Siphoviridae for group II, and Tectiviridae for group III), genome size (group I > group II > group III), and lifestyle (virulent for group I and temperate for group II and III). A subsequent phage genome comparison using a dot plot analysis showed that phages in each group are highly homologous, substantiating the grouping of B. cereus phages. Endolysin is a host lysis protein that contains two conserved domains: a cell-wall-binding domain (CBD) and an enzymatic activity domain (EAD). In B. cereus sensu lato phage group I, four different endolysin groups have been detected, according to combinations of two types of CBD and four types of EAD. Group I phages have two copies of tail lysins and one copy of endolysin, but the functions of the tail lysins are still unknown. In the B. cereus sensu lato phage group II, the B. anthracis phages have been studied and applied for typing and rapid detection of pathogenic host strains. In the B. cereus sensu lato phage group III, the B. thuringiensis phages Bam35 and GIL01 have been studied to understand phage entry and lytic switch regulation mechanisms. In this review, we suggest that further study of the B. cereus group phages would be useful for various phage applications, such as biocontrol, typing, and rapid detection of the pathogens B. cereus and B. anthracis and for the prevention of phage contamination of the natural insect pesticide Bt. PMID:24264384

  7. StreptoBase: An Oral Streptococcus mitis Group Genomic Resource and Analysis Platform.

    PubMed

    Zheng, Wenning; Tan, Tze King; Paterson, Ian C; Mutha, Naresh V R; Siow, Cheuk Chuen; Tan, Shi Yang; Old, Lesley A; Jakubovics, Nicholas S; Choo, Siew Woh

    2016-01-01

    The oral streptococci are spherical Gram-positive bacteria categorized under the phylum Firmicutes which are among the most common causative agents of bacterial infective endocarditis (IE) and are also important agents in septicaemia in neutropenic patients. The Streptococcus mitis group is comprised of 13 species including some of the most common human oral colonizers such as S. mitis, S. oralis, S. sanguinis and S. gordonii as well as species such as S. tigurinus, S. oligofermentans and S. australis that have only recently been classified and are poorly understood at present. We present StreptoBase, which provides a specialized free resource focusing on the genomic analyses of oral species from the mitis group. It currently hosts 104 S. mitis group genomes including 27 novel mitis group strains that we sequenced using the high throughput Illumina HiSeq technology platform, and provides a comprehensive set of genome sequences for analyses, particularly comparative analyses and visualization of both cross-species and cross-strain characteristics of S. mitis group bacteria. StreptoBase incorporates sophisticated in-house designed bioinformatics web tools such as Pairwise Genome Comparison (PGC) tool and Pathogenomic Profiling Tool (PathoProT), which facilitate comparative pathogenomics analysis of Streptococcus strains. Examples are provided to demonstrate how StreptoBase can be employed to compare genome structure of different S. mitis group bacteria and putative virulence genes profile across multiple streptococcal strains. In conclusion, StreptoBase offers access to a range of streptococci genomic resources as well as analysis tools and will be an invaluable platform to accelerate research in streptococci. Database URL: http://streptococcus.um.edu.my. PMID:27138013

  8. StreptoBase: An Oral Streptococcus mitis Group Genomic Resource and Analysis Platform

    PubMed Central

    Zheng, Wenning; Paterson, Ian C.; Mutha, Naresh V. R.; Siow, Cheuk Chuen; Tan, Shi Yang; Old, Lesley A.; Jakubovics, Nicholas S.; Choo, Siew Woh

    2016-01-01

    The oral streptococci are spherical Gram-positive bacteria categorized under the phylum Firmicutes which are among the most common causative agents of bacterial infective endocarditis (IE) and are also important agents in septicaemia in neutropenic patients. The Streptococcus mitis group is comprised of 13 species including some of the most common human oral colonizers such as S. mitis, S. oralis, S. sanguinis and S. gordonii as well as species such as S. tigurinus, S. oligofermentans and S. australis that have only recently been classified and are poorly understood at present. We present StreptoBase, which provides a specialized free resource focusing on the genomic analyses of oral species from the mitis group. It currently hosts 104 S. mitis group genomes including 27 novel mitis group strains that we sequenced using the high throughput Illumina HiSeq technology platform, and provides a comprehensive set of genome sequences for analyses, particularly comparative analyses and visualization of both cross-species and cross-strain characteristics of S. mitis group bacteria. StreptoBase incorporates sophisticated in-house designed bioinformatics web tools such as Pairwise Genome Comparison (PGC) tool and Pathogenomic Profiling Tool (PathoProT), which facilitate comparative pathogenomics analysis of Streptococcus strains. Examples are provided to demonstrate how StreptoBase can be employed to compare genome structure of different S. mitis group bacteria and putative virulence genes profile across multiple streptococcal strains. In conclusion, StreptoBase offers access to a range of streptococci genomic resources as well as analysis tools and will be an invaluable platform to accelerate research in streptococci. Database URL: http://streptococcus.um.edu.my. PMID:27138013

  9. Complete genome of the uncultured Termite Group 1 bacteria in a single host protist cell.

    PubMed

    Hongoh, Yuichi; Sharma, Vineet K; Prakash, Tulika; Noda, Satoko; Taylor, Todd D; Kudo, Toshiaki; Sakaki, Yoshiyuki; Toyoda, Atsushi; Hattori, Masahira; Ohkuma, Moriya

    2008-04-01

    Termites harbor a symbiotic gut microbial community that is responsible for their ability to thrive on recalcitrant plant matter. The community comprises diverse microorganisms, most of which are as yet uncultivable; the detailed symbiotic mechanism remains unclear. Here, we present the first complete genome sequence of a termite gut symbiont-an uncultured bacterium named Rs-D17 belonging to the candidate phylum Termite Group 1 (TG1). TG1 is a dominant group in termite guts, found as intracellular symbionts of various cellulolytic protists, without any physiological information. To acquire the complete genome sequence, we collected Rs-D17 cells from only a single host protist cell to minimize their genomic variation and performed isothermal whole-genome amplification. This strategy enabled us to reconstruct a circular chromosome (1,125,857 bp) encoding 761 putative protein-coding genes. The genome additionally contains 121 pseudogenes assigned to categories, such as cell wall biosynthesis, regulators, transporters, and defense mechanisms. Despite its apparent reductive evolution, the ability to synthesize 15 amino acids and various cofactors is retained, some of these genes having been duplicated. Considering that diverse termite-gut protists harbor TG1 bacteria, we suggest that this bacterial group plays a key role in the gut symbiotic system by stably supplying essential nitrogenous compounds deficient in lignocelluloses to their host protists and the termites. Our results provide a breakthrough to clarify the functions of and the interactions among the individual members of this multilayered symbiotic complex. PMID:18391199

  10. Complete genome sequence of the bioleaching bacterium Leptospirillum sp. group II strain CF-1.

    PubMed

    Ferrer, Alonso; Bunk, Boyke; Spröer, Cathrin; Biedendieck, Rebekka; Valdés, Natalia; Jahn, Martina; Jahn, Dieter; Orellana, Omar; Levicán, Gloria

    2016-03-20

    We describe the complete genome sequence of Leptospirillum sp. group II strain CF-1, an acidophilic bioleaching bacterium isolated from an acid mine drainage (AMD). This work provides data to gain insights about adaptive response of Leptospirillum spp. to the extreme conditions of bioleaching environments. PMID:26853478

  11. Genome Editing via Mobile Group-II Introns and Cre/lox

    NASA Astrophysics Data System (ADS)

    Enyeart, P. E.; Perutka, J.; Dao, M.; Ellington, A. E.

    2010-04-01

    Mobile group-II introns and the Cre/lox systems are combined to allow large segments of DNA to be removed or transferred within/between bacterial genomes. Planned applications include metabolic optimization and development of novel dissimilatory metal-reducing bacteria.

  12. Genome Sequence of Borrelia chilensis VA1, a South American Member of the Lyme Borreliosis Group

    PubMed Central

    Huang, Weihua; Ojaimi, Caroline; Fallon, John T.; Travisany, Dante; Maass, Alejandro; Ivanova, Larisa; Tomova, Alexandra; González-Acuña, Daniel; Cabello, Felipe C.

    2015-01-01

    Borrelia chilensis strain VA1 is a recently described South American member of the Borrelia burgdorferi sensu lato complex from Chile. Whole-genome sequencing analysis determined its linear chromosome and plasmids lp54 and cp26, confirmed its membership in the Lyme borreliosis group, and will open new research avenues regarding its pathogenic potential. PMID:25676758

  13. "Is It Worth Knowing?" Focus Group Participants' Perceived Utility of Genomic Preconception Carrier Screening.

    PubMed

    Schneider, Jennifer L; Goddard, Katrina A B; Davis, James; Wilfond, Benjamin; Kauffman, Tia L; Reiss, Jacob A; Gilmore, Marian; Himes, Patricia; Lynch, Frances L; Leo, Michael C; McMullen, Carmit

    2016-02-01

    As genome sequencing technology advances, research is needed to guide decision-making about what results can or should be offered to patients in different clinical settings. We conducted three focus groups with individuals who had prior preconception genetic testing experience to explore perceived advantages and disadvantages of genome sequencing for preconception carrier screening, compared to usual care. Using a discussion guide, a trained qualitative moderator facilitated the audio-recorded focus groups. Sixteen individuals participated. Thematic analysis of transcripts started with a grounded approach and subsequently focused on participants' perceptions of the value of genetic information. Analysis uncovered two orientations toward genomic preconception carrier screening: "certain" individuals desiring all possible screening information; and "hesitant" individuals who were more cautious about its value. Participants revealed valuable information about barriers to screening: fear/anxiety about results; concerns about the method of returning results; concerns about screening necessity; and concerns about partner participation. All participants recommended offering choice to patients to enhance the value of screening and reduce barriers. Overall, two groups of likely users of genome sequencing for preconception carrier screening demonstrated different perceptions of the advantages or disadvantages of screening, suggesting tailored approaches to education, consent, and counseling may be warranted with each group. PMID:26093606

  14. Genome Sequencing and Analysis of Catopsilia pomona nucleopolyhedrovirus: A Distinct Species in Group I Alphabaculovirus

    PubMed Central

    Wang, Jun; Zhu, Zheng; Zhang, Lei; Hou, Dianhai; Wang, Manli; Arif, Basil; Kou, Zheng; Wang, Hualin; Deng, Fei; Hu, Zhihong

    2016-01-01

    The genome sequence of Catopsilia pomona nucleopolyhedrovirus (CapoNPV) was determined by the Roche 454 sequencing system. The genome consisted of 128,058 bp and had an overall G+C content of 40%. There were 130 hypothetical open reading frames (ORFs) potentially encoding proteins of more than 50 amino acids and covering 92% of the genome. Among all the hypothetical ORFs, 37 baculovirus core genes, 23 lepidopteran baculovirus conserved genes and 10 genes conserved in Group I alphabaculoviruses were identified. In addition, the genome included regions of 8 typical baculoviral homologous repeat sequences (hrs). Phylogenic analysis showed that CapoNPV was in a distinct branch of clade “a” in Group I alphabaculoviruses. Gene parity plot analysis and overall similarity of ORFs indicated that CapoNPV is more closely related to the Group I alphabaculoviruses than to other baculoviruses. Interesting, CapoNPV lacks the genes encoding the fibroblast growth factor (fgf) and ac30, which are conserved in most lepidopteran and Group I baculoviruses, respectively. Sequence analysis of the F-like protein of CapoNPV showed that some amino acids were inserted into the fusion peptide region and the pre-transmembrane region of the protein. All these unique features imply that CapoNPV represents a member of a new baculovirus species. PMID:27166956

  15. Genomic definition of hypervirulent and multidrug-resistant Klebsiella pneumoniae clonal groups.

    PubMed

    Bialek-Davenet, Suzanne; Criscuolo, Alexis; Ailloud, Florent; Passet, Virginie; Jones, Louis; Delannoy-Vieillard, Anne-Sophie; Garin, Benoit; Le Hello, Simon; Arlet, Guillaume; Nicolas-Chanoine, Marie-Hélène; Decré, Dominique; Brisse, Sylvain

    2014-11-01

    Multidrug-resistant and highly virulent Klebsiella pneumoniae isolates are emerging, but the clonal groups (CGs) corresponding to these high-risk strains have remained imprecisely defined. We aimed to identify K. pneumoniae CGs on the basis of genome-wide sequence variation and to provide a simple bioinformatics tool to extract virulence and resistance gene data from genomic data. We sequenced 48 K. pneumoniae isolates, mostly of serotypes K1 and K2, and compared the genomes with 119 publicly available genomes. A total of 694 highly conserved genes were included in a core-genome multilocus sequence typing scheme, and cluster analysis of the data enabled precise definition of globally distributed hypervirulent and multidrug-resistant CGs. In addition, we created a freely accessible database, BIGSdb-Kp, to enable rapid extraction of medically and epidemiologically relevant information from genomic sequences of K. pneumoniae. Although drug-resistant and virulent K. pneumoniae populations were largely nonoverlapping, isolates with combined virulence and resistance features were detected. PMID:25341126

  16. Generalized bacterial genome editing using mobile group II introns and Cre-lox

    PubMed Central

    Enyeart, Peter J; Chirieleison, Steven M; Dao, Mai N; Perutka, Jiri; Quandt, Erik M; Yao, Jun; Whitt, Jacob T; Keatinge-Clay, Adrian T; Lambowitz, Alan M; Ellington, Andrew D

    2013-01-01

    Efficient bacterial genetic engineering approaches with broad-host applicability are rare. We combine two systems, mobile group II introns (‘targetrons') and Cre/lox, which function efficiently in many different organisms, into a versatile platform we call GETR (Genome Editing via Targetrons and Recombinases). The introns deliver lox sites to specific genomic loci, enabling genomic manipulations. Efficiency is enhanced by adding flexibility to the RNA hairpins formed by the lox sites. We use the system for insertions, deletions, inversions, and one-step cut-and-paste operations. We demonstrate insertion of a 12-kb polyketide synthase operon into the lacZ gene of Escherichia coli, multiple simultaneous and sequential deletions of up to 120 kb in E. coli and Staphylococcus aureus, inversions of up to 1.2 Mb in E. coli and Bacillus subtilis, and one-step cut-and-pastes for translocating 120 kb of genomic sequence to a site 1.5 Mb away. We also demonstrate the simultaneous delivery of lox sites into multiple loci in the Shewanella oneidensis genome. No selectable markers need to be placed in the genome, and the efficiency of Cre-mediated manipulations typically approaches 100%. PMID:24002656

  17. Genomic and Metabolic Diversity of Marine Group I Thaumarchaeota in the Mesopelagic of Two Subtropical Gyres

    PubMed Central

    Swan, Brandon K.; Chaffin, Mark D.; Martinez-Garcia, Manuel; Morrison, Hilary G.; Field, Erin K.; Poulton, Nicole J.; Masland, E. Dashiell P.; Harris, Christopher C.; Sczyrba, Alexander; Chain, Patrick S. G.; Koren, Sergey; Woyke, Tanja; Stepanauskas, Ramunas

    2014-01-01

    Marine Group I (MGI) Thaumarchaeota are one of the most abundant and cosmopolitan chemoautotrophs within the global dark ocean. To date, no representatives of this archaeal group retrieved from the dark ocean have been successfully cultured. We used single cell genomics to investigate the genomic and metabolic diversity of thaumarchaea within the mesopelagic of the subtropical North Pacific and South Atlantic Ocean. Phylogenetic and metagenomic recruitment analysis revealed that MGI single amplified genomes (SAGs) are genetically and biogeographically distinct from existing thaumarchaea cultures obtained from surface waters. Confirming prior studies, we found genes encoding proteins for aerobic ammonia oxidation and the hydrolysis of urea, which may be used for energy production, as well as genes involved in 3-hydroxypropionate/4-hydroxybutyrate and oxidative tricarboxylic acid pathways. A large proportion of protein sequences identified in MGI SAGs were absent in the marine cultures Cenarchaeum symbiosum and Nitrosopumilus maritimus, thus expanding the predicted protein space for this archaeal group. Identifiable genes located on genomic islands with low metagenome recruitment capacity were enriched in cellular defense functions, likely in response to viral infections or grazing. We show that MGI Thaumarchaeota in the dark ocean may have more flexibility in potential energy sources and adaptations to biotic interactions than the existing, surface-ocean cultures. PMID:24743558

  18. Evolutionary Genomics of Genes Involved in Olfactory Behavior in the Drosophila melanogaster Species Group

    PubMed Central

    Lavagnino, Nicolás; Serra, François; Arbiza, Leonardo; Dopazo, Hernán; Hasson, Esteban

    2012-01-01

    Previous comparative genomic studies of genes involved in olfactory behavior in Drosophila focused only on particular gene families such as odorant receptor and/or odorant binding proteins. However, olfactory behavior has a complex genetic architecture that is orchestrated by many interacting genes. In this paper, we present a comparative genomic study of olfactory behavior in Drosophila including an extended set of genes known to affect olfactory behavior. We took advantage of the recent burst of whole genome sequences and the development of powerful statistical tools to analyze genomic data and test evolutionary and functional hypotheses of olfactory genes in the six species of the Drosophila melanogaster species group for which whole genome sequences are available. Our study reveals widespread purifying selection and limited incidence of positive selection on olfactory genes. We show that the pace of evolution of olfactory genes is mostly independent of the life cycle stage, and of the number of life cycle stages, in which they participate in olfaction. However, we detected a relationship between evolutionary rates and the position that the gene products occupy in the olfactory system, genes occupying central positions tend to be more constrained than peripheral genes. Finally, we demonstrate that specialization to one host does not seem to be associated with bursts of adaptive evolution in olfactory genes in D. sechellia and D. erecta, the two specialists species analyzed, but rather different lineages have idiosyncratic evolutionary histories in which both historical and ecological factors have been involved. PMID:22346339

  19. Genome Sequence of Candidatus Nitrososphaera evergladensis from Group I.1b Enriched from Everglades Soil Reveals Novel Genomic Features of the Ammonia-Oxidizing Archaea

    PubMed Central

    Zhalnina, Kateryna V.; Dias, Raquel; Leonard, Michael T.; Dorr de Quadros, Patricia; Camargo, Flavio A. O.; Drew, Jennifer C.; Farmerie, William G.; Daroub, Samira H.; Triplett, Eric W.

    2014-01-01

    The activity of ammonia-oxidizing archaea (AOA) leads to the loss of nitrogen from soil, pollution of water sources and elevated emissions of greenhouse gas. To date, eight AOA genomes are available in the public databases, seven are from the group I.1a of the Thaumarchaeota and only one is from the group I.1b, isolated from hot springs. Many soils are dominated by AOA from the group I.1b, but the genomes of soil representatives of this group have not been sequenced and functionally characterized. The lack of knowledge of metabolic pathways of soil AOA presents a critical gap in understanding their role in biogeochemical cycles. Here, we describe the first complete genome of soil archaeon Candidatus Nitrososphaera evergladensis, which has been reconstructed from metagenomic sequencing of a highly enriched culture obtained from an agricultural soil. The AOA enrichment was sequenced with the high throughput next generation sequencing platforms from Pacific Biosciences and Ion Torrent. The de novo assembly of sequences resulted in one 2.95 Mb contig. Annotation of the reconstructed genome revealed many similarities of the basic metabolism with the rest of sequenced AOA. Ca. N. evergladensis belongs to the group I.1b and shares only 40% of whole-genome homology with the closest sequenced relative Ca. N. gargensis. Detailed analysis of the genome revealed coding sequences that were completely absent from the group I.1a. These unique sequences code for proteins involved in control of DNA integrity, transporters, two-component systems and versatile CRISPR defense system. Notably, genomes from the group I.1b have more gene duplications compared to the genomes from the group I.1a. We suggest that the presence of these unique genes and gene duplications may be associated with the environmental versatility of this group. PMID:24999826

  20. Genome sequence of Candidatus Nitrososphaera evergladensis from group I.1b enriched from Everglades soil reveals novel genomic features of the ammonia-oxidizing archaea.

    PubMed

    Zhalnina, Kateryna V; Dias, Raquel; Leonard, Michael T; Dorr de Quadros, Patricia; Camargo, Flavio A O; Drew, Jennifer C; Farmerie, William G; Daroub, Samira H; Triplett, Eric W

    2014-01-01

    The activity of ammonia-oxidizing archaea (AOA) leads to the loss of nitrogen from soil, pollution of water sources and elevated emissions of greenhouse gas. To date, eight AOA genomes are available in the public databases, seven are from the group I.1a of the Thaumarchaeota and only one is from the group I.1b, isolated from hot springs. Many soils are dominated by AOA from the group I.1b, but the genomes of soil representatives of this group have not been sequenced and functionally characterized. The lack of knowledge of metabolic pathways of soil AOA presents a critical gap in understanding their role in biogeochemical cycles. Here, we describe the first complete genome of soil archaeon Candidatus Nitrososphaera evergladensis, which has been reconstructed from metagenomic sequencing of a highly enriched culture obtained from an agricultural soil. The AOA enrichment was sequenced with the high throughput next generation sequencing platforms from Pacific Biosciences and Ion Torrent. The de novo assembly of sequences resulted in one 2.95 Mb contig. Annotation of the reconstructed genome revealed many similarities of the basic metabolism with the rest of sequenced AOA. Ca. N. evergladensis belongs to the group I.1b and shares only 40% of whole-genome homology with the closest sequenced relative Ca. N. gargensis. Detailed analysis of the genome revealed coding sequences that were completely absent from the group I.1a. These unique sequences code for proteins involved in control of DNA integrity, transporters, two-component systems and versatile CRISPR defense system. Notably, genomes from the group I.1b have more gene duplications compared to the genomes from the group I.1a. We suggest that the presence of these unique genes and gene duplications may be associated with the environmental versatility of this group. PMID:24999826

  1. Genome size in Hieracium subgenus Hieracium (Asteraceae) is strongly correlated with major phylogenetic groups

    PubMed Central

    Chrtek, Jindřich; Zahradníček, Jaroslav; Krak, Karol; Fehrer, Judith

    2009-01-01

    Background and Aims Hieracium subgenus Hieracium is one of the taxonomically most intricate groups of vascular plants, due to polyploidy and a diversity of breeeding systems (sexuality vs. apomixis). The aim of the present study was to analyse nuclear genome size in a phylogenetic framework and to assess relationships between genome size and ploidy, breeding system and selected ecogeographic features. Methods Holoploid and monoploid genome sizes (C- and Cx-values) of 215 cultivated plants from 89 field populations of 42 so-called ‘basic’ Hieracium species were determined using propidium iodide flow cytometry. Chromosome counts were available for all analysed plants, and all plants were tested experimentally for their mode of reproduction (sexuality vs. apomixis). For constructing molecular phylogenetic trees, the external transcribed spacer region of nuclear ribosomal DNA was used. Key Results The mean 2C values differed up to 2·37-fold among different species (from 7·03 pg in diploid to 16·67 in tetraploid accessions). The 1Cx values varied 1·22-fold (between 3·51 and 4·34 pg). Variation in 1Cx values between conspecific (species in a broad sense) accessions ranged from 0·24% to 7·2%. Little variation (not exceeding the approximate measurement inaccurracy threshold of 3·5%) was found in 33 species, whereas variation higher than 3·5% was detected in seven species. Most of the latter may have a polytopic origin. Mean 1Cx values of the three cytotypes (2n, 3n and 4n) differed significantly (average of 3·93 pg in diploids, 3·82 pg in triploids and 3·78 pg in tetraploids) indicating downsizing of genomes in polyploids. The pattern of genome size variation correlated well with two major phylogenetic clades which were composed of species with western or eastern European origin. The monoploid genome size in the ‘western’ species was significantly lower than in the ‘eastern’ ones. Correlation of genome size with latitude, altitude and selected

  2. Draft Genome Sequences of 17 French Clostridium botulinum Group III Strains

    PubMed Central

    Le Maréchal, Caroline; Souillard, Rozenn; Bayon-Auboyer, Marie-Hélène; Mermoud, Isabelle; Desoutter, Denise; Fach, Patrick

    2015-01-01

    Animal botulism is mainly associated with Clostridium botulinum group III strains producing neurotoxin types C, C/D, D, and D/C. In this report, we present the draft genome sequences of fourteen strains of Clostridium botulinum producing type C/D and two strains producing type D/C isolated in France, and one strain producing type D/C that originated from New Caledonia. PMID:26430029

  3. Draft Genome Sequences of 17 French Clostridium botulinum Group III Strains.

    PubMed

    Woudstra, Cédric; Le Maréchal, Caroline; Souillard, Rozenn; Bayon-Auboyer, Marie-Hélène; Mermoud, Isabelle; Desoutter, Denise; Fach, Patrick

    2015-01-01

    Animal botulism is mainly associated with Clostridium botulinum group III strains producing neurotoxin types C, C/D, D, and D/C. In this report, we present the draft genome sequences of fourteen strains of Clostridium botulinum producing type C/D and two strains producing type D/C isolated in France, and one strain producing type D/C that originated from New Caledonia. PMID:26430029

  4. Analysis of virus genomes from glacial environments reveals novel virus groups with unusual host interactions

    PubMed Central

    Bellas, Christopher M.; Anesio, Alexandre M.; Barker, Gary

    2015-01-01

    Microbial communities in glacial ecosystems are diverse, active, and subjected to strong viral pressures and infection rates. In this study we analyse putative virus genomes assembled from three dsDNA viromes from cryoconite hole ecosystems of Svalbard and the Greenland Ice Sheet to assess the potential hosts and functional role viruses play in these habitats. We assembled 208 million reads from the virus-size fraction and developed a procedure to select genuine virus scaffolds from cellular contamination. Our curated virus library contained 546 scaffolds up to 230 Kb in length, 54 of which were circular virus consensus genomes. Analysis of virus marker genes revealed a wide range of viruses had been assembled, including bacteriophages, cyanophages, nucleocytoplasmic large DNA viruses and a virophage, with putative hosts identified as Cyanobacteria, Alphaproteobacteria, Gammaproteobacteria, Actinobacteria, Firmicutes, eukaryotic algae and amoebae. Whole genome comparisons revealed the majority of circular genome scaffolds (CGS) formed 12 novel groups, two of which contained multiple phage members with plasmid-like properties, including a group of phage-plasmids possessing plasmid-like partition genes and toxin-antitoxin addiction modules to ensure their replication and a satellite phage-plasmid group. Surprisingly we also assembled a phage that not only encoded plasmid partition genes, but a clustered regularly interspaced short palindromic repeat (CRISPR)/Cas adaptive bacterial immune system. One of the spacers was an exact match for another phage in our virome, indicating that in a novel use of the system, the lysogen was potentially capable of conferring immunity on its bacterial host against other phage. Together these results suggest that highly novel and diverse groups of viruses are present in glacial environments, some of which utilize very unusual life strategies and genes to control their replication and maintain a long-term relationship with their hosts

  5. Genome drafts of four phytoplasma strains of the ribosomal group 16SrIII.

    PubMed

    Saccardo, Federica; Martini, Marta; Palmano, Sabrina; Ermacora, Paolo; Scortichini, Marco; Loi, Nazia; Firrao, Giuseppe

    2012-11-01

    By applying a coverage-based read selection and filtration through a healthy plant dataset, and a post-assembly contig selection based on homology and linkage, genome sequence drafts were obtained for four phytoplasma strains belonging to the 16SrIII group (X disease clade), namely Vaccinium Witches' Broom phytoplasma (647 754 nt in 272 contigs), Italian Clover Phyllody phytoplasma strain MA (597 245 nt in 197 contigs), Poinsettia branch-inducing phytoplasma strain JR1 (631 440 nt in 185 contigs) and Milkweed Yellows phytoplasma (583 806 nt in 158 contigs). Despite assignment to different 16SrIII subgroups, the genomes of the four strains were similar, comprising a highly conserved core (92-98 % similar in their nucleotide sequence among each other over alignments about 500 kb in length) and a minor strain-specific component. As far as their protein complement was concerned, they did not differ significantly in their basic metabolism potential from the genomes of other wide-host-range phytoplasmas sequenced previously, but were distinct from strains of other species, as well as among each other, in genes encoding functions conceivably related to interactions with the host, such as membrane trafficking components, proteases, DNA methylases, effectors and several hypothetical proteins of unknown function, some of which are likely secreted through the Sec-dependent secretion system. The four genomes displayed a group of genes encoding hypothetical proteins with high similarity to a central domain of IcmE/DotG, a core component of the type IVB secretion system of Gram-negative Legionella spp. Conversely, genes encoding functional GroES/GroEL chaperones were not detected in any of the four drafts. The results also indicated the significant role of horizontal gene transfer among different 'Candidatus Phytoplasma' species in shaping phytoplasma genomes and promoting their diversity. PMID:22936033

  6. Genomic insights into the taxonomic status of the Bacillus cereus group

    PubMed Central

    Liu, Yang; Lai, Qiliang; Göker, Markus; Meier-Kolthoff, Jan P.; Wang, Meng; Sun, Yamin; Wang, Lei; Shao, Zongze

    2015-01-01

    The identification and phylogenetic relationships of bacteria within the Bacillus cereus group are controversial. This study aimed at determining the taxonomic affiliations of these strains using the whole-genome sequence-based Genome BLAST Distance Phylogeny (GBDP) approach. The GBDP analysis clearly separated 224 strains into 30 clusters, representing eleven known, partially merged species and accordingly 19–20 putative novel species. Additionally, 16S rRNA gene analysis, a novel variant of multi-locus sequence analysis (nMLSA) and screening of virulence genes were performed. The 16S rRNA gene sequence was not sufficient to differentiate the bacteria within this group due to its high conservation. The nMLSA results were consistent with GBDP. Moreover, a fast typing method was proposed using the pycA gene, and where necessary, the ccpA gene. The pXO plasmids and cry genes were widely distributed, suggesting little correlation with the phylogenetic positions of the host bacteria. This might explain why classifications based on virulence characteristics proved unsatisfactory in the past. In summary, this is the first large-scale and systematic study of the taxonomic status of the bacteria within the B. cereus group using whole-genome sequences, and is likely to contribute to further insights into their pathogenicity, phylogeny and adaptation to diverse environments. PMID:26373441

  7. Extending the cereus group genomics to putative food-bornepathogens of different toxicity

    SciTech Connect

    Lapidus, Alla; Goltsman, Eugene; Auger, Sandrine; Galleron,Nathalie; Segurens, Beatrice; Dossat, Carole; Land, Miriam L.; Broussole,Veronique; Brillard, Julien; Guinebretiere, Marie-Helene; Sanchis,Vincent; Nguen-the, Christophe; Lereclus, Didier; Richardson, Paul; Winker, Patrick; Weissenbach, Jean; Ehrlich, S.Dusko; Sorokin, Alexei

    2006-08-24

    The cereus group represents sporulating soil bacteriacontaining pathogenic strains which may cause diarrheic or emetic foodpoisoning outbreaks. Multiple locus sequence typing revealed a presencein natural samples of these bacteria of about thirty clonal complexes.Application of genomic methods to this group was however biased due tothe major interest for representatives closely related to B. anthracis.Albeit the most important food-borne pathogens were not yet defined,existing dataindicate that they are scattered all over the phylogenetictree. The preliminary analysis of the sequences of three genomesdiscussed in this paper narrows down the gaps in our knowledge of thecereus group. The strain NVH391-98 is a rare but particularly severefood-borne pathogen. Sequencing revealed that the strain must be arepresentative of a novel bacterial species, for which the name Bacilluscytotoxis is proposed. This strain has a reduced genome size compared toother cereus group strains. Genome analysis revealed absence of sigma Bfactor and the presence of genes encoding diarrheic Nhe toxin, notdetected earlier. The strain B. cereus F837/76 represents a clonalcomplex close to that of B. anthracis. Including F837/76, three such B.cereus strains had been sequenced. Alignment of genomes suggests that B.anthracis is their common ancestor. Since such strains often emerge fromclinical cases, they merit a special attention. The third strain, KBAB4,is a typical psychrotrophe characteristic to unbiased soil communities.Phylogenic studies show that in nature it is the most active group interms of gene exchange. Genomic sequence revealed high presence ofextra-chromosomal genetic material (about 530 kb) that may account forthis phenomenon. Genes coding Nhe-like toxin were found on a big plasmidin this strain. This may indicate a potential mechanism of toxicityspread from the psychrotrophic strain community. The results of thisgenomic work and ecological compartments of different strains incite

  8. Whole-Genome Comparison Uncovers Genomic Mutations between Group B Streptococci Sampled from Infected Newborns and Their Mothers

    PubMed Central

    Almeida, Alexandre; Villain, Adrien; Joubrel, Caroline; Touak, Gérald; Sauvage, Elisabeth; Rosinski-Chupin, Isabelle

    2015-01-01

    ABSTRACT Streptococcus agalactiae (group B Streptococcus or GBS), a commensal of the human gut and genitourinary tract, is a leading cause of neonatal infections, in which vertical transmission from mother to child remains the most frequent route of contamination. Here, we investigated whether the progression of GBS from carriage to disease is associated with genomic adaptation. Whole-genome comparison of 47 GBS samples from 19 mother-child pairs uncovered 21 single nucleotide polymorphisms (SNPs) and seven insertions/deletions. Of the SNPs detected, 16 appear to have been fixed in the population sampled whereas five mutations were found to be polymorphic. In the infant strains, 14 mutations were detected, including two independently fixed variants affecting the covRS locus, which is known to encode a major regulatory system of virulence. A one-nucleotide insertion was also identified in the promoter region of the highly immunogenic surface protein Rib gene. Gene expression analysis after incubation in human blood showed that these mutations influenced the expression of virulence-associated genes. Additional identification of three mutated strains in the mothers' milk raised the possibility of the newborns also being a source of contamination for their mothers. Overall, our work showed that GBS strains in carriage and disease scenarios might undergo adaptive changes following colonization. The types and locations of the mutations found, together with the experimental results showing their phenotypic impact, suggest that those in a context of infection were positively selected during the transition of GBS from commensal to pathogen, contributing to an increased capacity to cause disease. IMPORTANCE Group B Streptococcus (GBS) is a major pathogen responsible for neonatal infections. Considering that its colonization of healthy adults is mostly asymptomatic, the mechanisms behind its switch from a commensal to an invasive state are largely unknown. In this work, we

  9. Group-theoretic models of the inversion process in bacterial genomes.

    PubMed

    Egri-Nagy, Attila; Gebhardt, Volker; Tanaka, Mark M; Francis, Andrew R

    2014-07-01

    The variation in genome arrangements among bacterial taxa is largely due to the process of inversion. Recent studies indicate that not all inversions are equally probable, suggesting, for instance, that shorter inversions are more frequent than longer, and those that move the terminus of replication are less probable than those that do not. Current methods for establishing the inversion distance between two bacterial genomes are unable to incorporate such information. In this paper we suggest a group-theoretic framework that in principle can take these constraints into account. In particular, we show that by lifting the problem from circular permutations to the affine symmetric group, the inversion distance can be found in polynomial time for a model in which inversions are restricted to acting on two regions. This requires the proof of new results in group theory, and suggests a vein of new combinatorial problems concerning permutation groups on which group theorists will be needed to collaborate with biologists. We apply the new method to inferring distances and phylogenies for published Yersinia pestis data. PMID:23793228

  10. Genomic data do not support comb jellies as the sister group to all other animals

    PubMed Central

    Pisani, Davide; Pett, Walker; Dohrmann, Martin; Feuda, Roberto; Rota-Stabelli, Omar; Philippe, Hervé; Lartillot, Nicolas; Wörheide, Gert

    2015-01-01

    Understanding how complex traits, such as epithelia, nervous systems, muscles, or guts, originated depends on a well-supported hypothesis about the phylogenetic relationships among major animal lineages. Traditionally, sponges (Porifera) have been interpreted as the sister group to the remaining animals, a hypothesis consistent with the conventional view that the last common animal ancestor was relatively simple and more complex body plans arose later in evolution. However, this premise has recently been challenged by analyses of the genomes of comb jellies (Ctenophora), which, instead, found ctenophores as the sister group to the remaining animals (the “Ctenophora-sister” hypothesis). Because ctenophores are morphologically complex predators with true epithelia, nervous systems, muscles, and guts, this scenario implies these traits were either present in the last common ancestor of all animals and were lost secondarily in sponges and placozoans (Trichoplax) or, alternatively, evolved convergently in comb jellies. Here, we analyze representative datasets from recent studies supporting Ctenophora-sister, including genome-scale alignments of concatenated protein sequences, as well as a genomic gene content dataset. We found no support for Ctenophora-sister and conclude it is an artifact resulting from inadequate methodology, especially the use of simplistic evolutionary models and inappropriate choice of species to root the metazoan tree. Our results reinforce a traditional scenario for the evolution of complexity in animals, and indicate that inferences about the evolution of Metazoa based on the Ctenophora-sister hypothesis are not supported by the currently available data. PMID:26621703

  11. Genomic data do not support comb jellies as the sister group to all other animals.

    PubMed

    Pisani, Davide; Pett, Walker; Dohrmann, Martin; Feuda, Roberto; Rota-Stabelli, Omar; Philippe, Hervé; Lartillot, Nicolas; Wörheide, Gert

    2015-12-15

    Understanding how complex traits, such as epithelia, nervous systems, muscles, or guts, originated depends on a well-supported hypothesis about the phylogenetic relationships among major animal lineages. Traditionally, sponges (Porifera) have been interpreted as the sister group to the remaining animals, a hypothesis consistent with the conventional view that the last common animal ancestor was relatively simple and more complex body plans arose later in evolution. However, this premise has recently been challenged by analyses of the genomes of comb jellies (Ctenophora), which, instead, found ctenophores as the sister group to the remaining animals (the "Ctenophora-sister" hypothesis). Because ctenophores are morphologically complex predators with true epithelia, nervous systems, muscles, and guts, this scenario implies these traits were either present in the last common ancestor of all animals and were lost secondarily in sponges and placozoans (Trichoplax) or, alternatively, evolved convergently in comb jellies. Here, we analyze representative datasets from recent studies supporting Ctenophora-sister, including genome-scale alignments of concatenated protein sequences, as well as a genomic gene content dataset. We found no support for Ctenophora-sister and conclude it is an artifact resulting from inadequate methodology, especially the use of simplistic evolutionary models and inappropriate choice of species to root the metazoan tree. Our results reinforce a traditional scenario for the evolution of complexity in animals, and indicate that inferences about the evolution of Metazoa based on the Ctenophora-sister hypothesis are not supported by the currently available data. PMID:26621703

  12. Genomic analysis reveals hidden biodiversity within colugos, the sister group to primates

    PubMed Central

    Mason, Victor C.; Li, Gang; Minx, Patrick; Schmitz, Jürgen; Churakov, Gennady; Doronina, Liliya; Melin, Amanda D.; Dominy, Nathaniel J.; Lim, Norman T-L.; Springer, Mark S.; Wilson, Richard K.; Warren, Wesley C.; Helgen, Kristofer M.; Murphy, William J.

    2016-01-01

    Colugos are among the most poorly studied mammals despite their centrality to resolving supraordinal primate relationships. Two described species of these gliding mammals are the sole living members of the order Dermoptera, distributed throughout Southeast Asia. We generated a draft genome sequence for a Sunda colugo and a Philippine colugo reference alignment, and used these to identify colugo-specific genetic changes that were enriched in sensory and musculoskeletal-related genes that likely underlie their nocturnal and gliding adaptations. Phylogenomic analysis and catalogs of rare genomic changes overwhelmingly support the contested hypothesis that colugos are the sister group to primates (Primatomorpha), to the exclusion of treeshrews. We captured ~140 kb of orthologous sequence data from colugo museum specimens sampled across their range and identified large genetic differences between many geographically isolated populations that may result in a >300% increase in the number of recognized colugo species. Our results identify conservation units to mitigate future losses of this enigmatic mammalian order. PMID:27532052

  13. Genomic analysis reveals hidden biodiversity within colugos, the sister group to primates.

    PubMed

    Mason, Victor C; Li, Gang; Minx, Patrick; Schmitz, Jürgen; Churakov, Gennady; Doronina, Liliya; Melin, Amanda D; Dominy, Nathaniel J; Lim, Norman T-L; Springer, Mark S; Wilson, Richard K; Warren, Wesley C; Helgen, Kristofer M; Murphy, William J

    2016-08-01

    Colugos are among the most poorly studied mammals despite their centrality to resolving supraordinal primate relationships. Two described species of these gliding mammals are the sole living members of the order Dermoptera, distributed throughout Southeast Asia. We generated a draft genome sequence for a Sunda colugo and a Philippine colugo reference alignment, and used these to identify colugo-specific genetic changes that were enriched in sensory and musculoskeletal-related genes that likely underlie their nocturnal and gliding adaptations. Phylogenomic analysis and catalogs of rare genomic changes overwhelmingly support the contested hypothesis that colugos are the sister group to primates (Primatomorpha), to the exclusion of treeshrews. We captured ~140 kb of orthologous sequence data from colugo museum specimens sampled across their range and identified large genetic differences between many geographically isolated populations that may result in a >300% increase in the number of recognized colugo species. Our results identify conservation units to mitigate future losses of this enigmatic mammalian order. PMID:27532052

  14. New Insights into the Genetic Diversity of Clostridium botulinum Group III through Extensive Genome Exploration.

    PubMed

    Woudstra, Cédric; Le Maréchal, Caroline; Souillard, Rozenn; Bayon-Auboyer, Marie-Hélène; Mermoud, Isabelle; Desoutter, Denise; Fach, Patrick

    2016-01-01

    Animal botulism is caused by group III Clostridium botulinum strains producing type C and D toxins, or their chimeric forms C/D and D/C. Animal botulism is considered an emerging disease in Europe, notably in poultry production. Before our study, 14 genomes from different countries were available in the public database, but none were from France. In order to investigate the genetic relationship of French strains with different geographical areas and find new potential typing targets, 17 strains of C. botulinum group III were sequenced (16 from France and one from New Caledonia). Fourteen were type C/D strains isolated from chickens, ducks, guinea fowl and turkeys and three were type D/C strains isolated from cattle. The New Caledonian strain was a type D/C strain. Whole genome sequence analysis showed the French strains to be closely related to European strains from C. botulinum group III lineages Ia and Ib. The investigation of CRISPR sequences as genetic targets for differentiating strains in group III proved to be irrelevant for type C/D due to a deficient CRISPR/Cas mechanism, but not for type D/C. Conversely, the extrachromosomal elements of type C/D strains could be used to generate a genetic ID card. The highest level of discrimination was achieved with SNP core phylogeny, which allowed differentiation up to strain level and provide the most relevant information for genetic epidemiology studies and discrimination. PMID:27242769

  15. New Insights into the Genetic Diversity of Clostridium botulinum Group III through Extensive Genome Exploration

    PubMed Central

    Woudstra, Cédric; Le Maréchal, Caroline; Souillard, Rozenn; Bayon-Auboyer, Marie-Hélène; Mermoud, Isabelle; Desoutter, Denise; Fach, Patrick

    2016-01-01

    Animal botulism is caused by group III Clostridium botulinum strains producing type C and D toxins, or their chimeric forms C/D and D/C. Animal botulism is considered an emerging disease in Europe, notably in poultry production. Before our study, 14 genomes from different countries were available in the public database, but none were from France. In order to investigate the genetic relationship of French strains with different geographical areas and find new potential typing targets, 17 strains of C. botulinum group III were sequenced (16 from France and one from New Caledonia). Fourteen were type C/D strains isolated from chickens, ducks, guinea fowl and turkeys and three were type D/C strains isolated from cattle. The New Caledonian strain was a type D/C strain. Whole genome sequence analysis showed the French strains to be closely related to European strains from C. botulinum group III lineages Ia and Ib. The investigation of CRISPR sequences as genetic targets for differentiating strains in group III proved to be irrelevant for type C/D due to a deficient CRISPR/Cas mechanism, but not for type D/C. Conversely, the extrachromosomal elements of type C/D strains could be used to generate a genetic ID card. The highest level of discrimination was achieved with SNP core phylogeny, which allowed differentiation up to strain level and provide the most relevant information for genetic epidemiology studies and discrimination. PMID:27242769

  16. Complete Genome Sequence of the RmInt1 Group II Intronless Sinorhizobium meliloti Strain RMO17

    PubMed Central

    Martínez-Abarca, Francisco; Nisa-Martínez, Rafael

    2014-01-01

    We report the complete genome sequence of the RmInt1 group II intronless Sinorhizobium meliloti strain RMO17 isolated from Medicago orbicularis nodules from Spanish soil. The genome consists of 6.73 Mb distributed between a single chromosome and two megaplasmids (the chromid pSymB and pSymA). PMID:25301650

  17. Complete Genome Sequence of the RmInt1 Group II Intronless Sinorhizobium meliloti Strain RMO17.

    PubMed

    Toro, Nicolás; Martínez-Abarca, Francisco; Nisa-Martínez, Rafael

    2014-01-01

    We report the complete genome sequence of the RmInt1 group II intronless Sinorhizobium meliloti strain RMO17 isolated from Medicago orbicularis nodules from Spanish soil. The genome consists of 6.73 Mb distributed between a single chromosome and two megaplasmids (the chromid pSymB and pSymA). PMID:25301650

  18. Genomic Characterization of Group C Orthobunyavirus Reference Strains and Recent South American Clinical Isolates

    PubMed Central

    Yang, Yu; Solórzano, Víctor Fiestas; Kuschner, Robert A.; Halsey, Eric S.; Jarman, Richard G.; Kochel, Tadeusz J.

    2014-01-01

    Group C orthobunyaviruses (family Bunyaviridae, genus Orthobunyavirus), discovered in the 1950s, are vector-borne human pathogens in the Americas. Currently there is a gap in genomic information for group C viruses. In this study, we obtained complete coding region sequences of reference strains of Caraparu (CARV), Oriboca (ORIV), Marituba (MTBV) and Madrid (MADV) viruses, and five clinical isolates from Peru and Bolivia, using an unbiased de novo approach consisting of random reverse transcription, random anchored PCR amplification, and high throughput pyrosequencing. The small, medium, and large segments encode for a 235 amino acid nucleocapsid protein, an approximately 1430 amino acid surface glycoprotein polyprotein precursor, and a 2248 amino acid RNA-dependent RNA polymerase, respectively. Additionally, the S segment encodes for an 83 amino acid non-structural protein, although this protein is truncated or silenced in some isolates. Phylogenetically, three clinical isolates clustered with CARV, one clustered with MTBV, and one isolate appeared to be a reassortant or a genetic drift resulted from the high variability of the medium segment which was also seen in a few other orthobunyaviruses. These data represent the first complete coding region sequences for this serocomplex of pathogenic orthobunyaviruses. The genome-wide phylogeny of reference strains is consistent with the antigenic properties of the viruses reported in the original serological studies conducted in the 1960s. Comparative analysis of conserved protein regions across group C virus strains and the other orthobunyavirus groups revealed that these group C viruses contain characteristic domains of potential structural and functional significance. Our results provide the basis for the developments of diagnostics, further genetic analyses, and future epidemiologic studies of group C viruses. PMID:24633174

  19. Genome Evolution of Wolbachia Strain wPip from the Culex pipiens Group

    PubMed Central

    Klasson, Lisa; Walker, Thomas; Sebaihia, Mohammed; Sanders, Mandy J.; Quail, Michael A.; Lord, Angela; Sanders, Susanne; Earl, Julie; O'Neill, Scott L.; Thomson, Nicholas; Sinkins, Steven P.; Parkhill, Julian

    2008-01-01

    The obligate intracellular bacterium Wolbachia pipientis strain wPip induces cytoplasmic incompatibility (CI), patterns of crossing sterility, in the Culex pipiens group of mosquitoes. The complete sequence is presented of the 1.48-Mbp genome of wPip which encodes 1386 coding sequences (CDSs), representing the first genome sequence of a B-supergroup Wolbachia. Comparisons were made with the smaller genomes of Wolbachia strains wMel of Drosophila melanogaster, an A-supergroup Wolbachia that is also a CI inducer, and wBm, a mutualist of Brugia malayi nematodes that belongs to the D-supergroup of Wolbachia. Despite extensive gene order rearrangement, a core set of Wolbachia genes shared between the 3 genomes can be identified and contrasts with a flexible gene pool where rapid evolution has taken place. There are much more extensive prophage and ankyrin repeat encoding (ANK) gene components of the wPip genome compared with wMel and wBm, and both are likely to be of considerable importance in wPip biology. Five WO-B–like prophage regions are present and contain some genes that are identical or highly similar in multiple prophage copies, whereas other genes are unique, and it is likely that extensive recombination, duplication, and insertion have occurred between copies. A much larger number of genes encode ankyrin repeat (ANK) proteins in wPip, with 60 present compared with 23 in wMel, many of which are within or close to the prophage regions. It is likely that this pattern is partly a result of expansions in the wPip lineage, due for example to gene duplication, but their presence is in some cases more ancient. The wPip genome underlines the considerable evolutionary flexibility of Wolbachia, providing clear evidence for the rapid evolution of ANK-encoding genes and of prophage regions. This host–Wolbachia system, with its complex patterns of sterility induced between populations, now provides an excellent model for unraveling the molecular systems underlying host

  20. Developing improved durum wheat germplasm by altering the cytoplasmic genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In eukaryotic organisms, nuclear and cytoplasmic genomes interact to drive cellular functions. These genomes have co-evolved to form specific nuclear-cytoplasmic interactions that are essential to the origin, success, and evolution of diploid and polyploid species. Hundreds of genetic diseases in h...

  1. Unique Mitochondrial Genome Structure in Diplonemids, the Sister Group of Kinetoplastids

    PubMed Central

    Marande, William; Lukeš, Julius; Burger, Gertraud

    2005-01-01

    Kinetoplastid flagellates are characterized by uniquely massed mitochondrial DNAs (mtDNAs), the kinetoplasts. Kinetoplastids of the trypanosomatid group possess two types of mtDNA molecules: maxicircles bearing protein and mitoribosomal genes and minicircles specifying guide RNAs, which mediate uridine insertion/deletion RNA editing. These circles are interlocked with one another to form dense networks. Whether these peculiar mtDNA features are restricted to kinetoplastids or prevail throughout Euglenozoa (euglenids, diplonemids, and kinetoplastids) is unknown. Here, we describe the mitochondrial genome and the mitochondrial ultrastructure of Diplonema papillatum, a member of the diplonemid flagellates, the sister group of kinetoplastids. Fluorescence and electron microscopy show a single mitochondrion per cell with an ultrastructure atypical for Euglenozoa. In addition, DNA is evenly distributed throughout the organelle rather than compacted. Molecular and electron microscopy studies distinguish numerous 6- and 7-kbp-sized mitochondrial chromosomes of monomeric circular topology and relaxed conformation in vivo. Remarkably, the cox1 gene (and probably other mitochondrial genes) is fragmented, with separate gene pieces encoded on different chromosomes. Generation of the contiguous cox1 mRNA requires trans-splicing, the precise mechanism of which remains to be determined. Taken together, the mitochondrial gene/genome structure of Diplonema is not only different from that of kinetoplastids but unique among eukaryotes as a whole. PMID:15947205

  2. Genome-wide analysis of the SET DOMAIN GROUP family in grapevine.

    PubMed

    Aquea, Felipe; Vega, Andrea; Timmermann, Tania; Poupin, María Josefina; Arce-Johnson, Patricio

    2011-06-01

    The SET DOMAIN GROUP (SDG) proteins represent an evolutionarily-conserved family of epigenetic regulators present in eukaryotes and are putative candidates for the catalysis of lysine methylation in histones. Plant genomes analyses of this family have been performed in arabidopsis, maize, and rice and functional studies have shown that SDG genes are involved in the control of plant development. In this work, we describe the identification and structural characterization of SDG genes in the Vitis vinifera genome. This analysis revealed the presence of 33 putative SDG genes that can be grouped into different classes, as it has been previously described for plants. In addition to the SET domain, the proteins identified possessed other domains in the different classes. As part of our study regarding the growth and development of grapevine, we selected eight genes and their expression levels were analyzed in representative vegetative and reproductive organs of this species. The selected genes showed different patterns of expression during inflorescence and fruit development, suggesting that they participate in these processes. Furthermore, we showed that the expression of selected SDGs changes during viral infection, using as a model Grapevine Leafroll Associated Virus 3-infected symptomatic grapevine leaves and fruits. Our results suggest that developmental changes caused by this virus could be the result of alterations in SDG expression. PMID:21293861

  3. Differentiation of strains from the Bacillus cereus group by RFLP-PFGE genomic fingerprinting.

    PubMed

    Otlewska, Anna; Oltuszak-Walczak, Elzbieta; Walczak, Piotr

    2013-11-01

    Bacillus mycoides, Bacillus pseudomycoides, Bacillus weihenstephanensis, Bacillus anthracis, Bacillus thuringiensis, and Bacillus cereus belong to the B. cereus group. The last three species are characterized by different phenotype features and pathogenicity spectrum, but it has been shown that these species are genetically closely related. The macrorestriction analysis of the genomic DNA with the NotI enzyme was used to generate polymorphism of restriction profiles for 39 food-borne isolates (B. cereus, B. mycoides) and seven reference strains (B. mycoides, B. thuringiensis, B. weihenstephanensis, and B. cereus). The PFGE method was applied to differentiate the examined strains of the B. cereus group. On the basis of the unweighted pair group method with the arithmetic mean method and Dice coefficient, the strains were divided into five clusters (types A-E), and the most numerous group was group A (25 strains). A total of 21 distinct pulsotypes were observed. The RFLP-PFGE analysis was successfully used for the differentiation and characterization of B. cereus and B. mycoides strains isolated from different food products. PMID:23893780

  4. Human promoter genomic composition demonstrates non-random groupings that reflect general cellular function

    PubMed Central

    McNutt, Markey C; Tongbai, Ron; Cui, Wenwu; Collins, Irene; Freebern, Wendy J; Montano, Idalia; Haggerty, Cynthia M; Chandramouli, GVR; Gardner, Kevin

    2005-01-01

    Background The purpose of this study is to determine whether or not there exists nonrandom grouping of cis-regulatory elements within gene promoters that can be perceived independent of gene expression data and whether or not there is any correlation between this grouping and the biological function of the gene. Results Using ProSpector, a web-based promoter search and annotation tool, we have applied an unbiased approach to analyze the transcription factor binding site frequencies of 1400 base pair genomic segments positioned at 1200 base pairs upstream and 200 base pairs downstream of the transcriptional start site of 7298 commonly studied human genes. Partitional clustering of the transcription factor binding site composition within these promoter segments reveals a small number of gene groups that are selectively enriched for gene ontology terms consistent with distinct aspects of cellular function. Significance ranking of the class-determining transcription factor binding sites within these clusters show substantial overlap between the gene ontology terms of the transcriptions factors associated with the binding sites and the gene ontology terms of the regulated genes within each group. Conclusion Thus, gene sorting by promoter composition alone produces partitions in which the "regulated" and the "regulators" cosegregate into similar functional classes. These findings demonstrate that the transcription factor binding site composition is non-randomly distributed between gene promoters in a manner that reflects and partially defines general gene class function. PMID:16232321

  5. Assembly-Driven Community Genomics of a Hypersaline Microbial Ecosystem

    PubMed Central

    Podell, Sheila; Ugalde, Juan A.; Narasingarao, Priya; Banfield, Jillian F.; Heidelberg, Karla B.; Allen, Eric E.

    2013-01-01

    Microbial populations inhabiting a natural hypersaline lake ecosystem in Lake Tyrrell, Victoria, Australia, have been characterized using deep metagenomic sampling, iterative de novo assembly, and multidimensional phylogenetic binning. Composite genomes representing habitat-specific microbial populations were reconstructed for eleven different archaea and one bacterium, comprising between 0.6 and 14.1% of the planktonic community. Eight of the eleven archaeal genomes were from microbial species without previously cultured representatives. These new genomes provide habitat-specific reference sequences enabling detailed, lineage-specific compartmentalization of predicted functional capabilities and cellular properties associated with both dominant and less abundant community members, including organisms previously known only by their 16S rRNA sequences. Together, these data provide a comprehensive, culture-independent genomic blueprint for ecosystem-wide analysis of protein functions, population structure, and lifestyles of co-existing, co-evolving microbial groups within the same natural habitat. The “assembly-driven” community genomic approach demonstrated in this study advances our ability to push beyond single gene investigations, and promotes genome-scale reconstructions as a tangible goal in the quest to define the metabolic, ecological, and evolutionary dynamics that underpin environmental microbial diversity. PMID:23637883

  6. Molecular and genomic characterization of pathogenic traits of group A Streptococcus pyogenes

    PubMed Central

    HAMADA, Shigeyuki; KAWABATA, Shigetada; NAKAGAWA, Ichiro

    2015-01-01

    Group A streptococcus (GAS) or Streptococcus pyogenes causes various diseases ranging from self-limiting sore throat to deadly invasive diseases. The genome size of GAS is 1.85–1.9 Mb, and genomic rearrangement has been demonstrated. GAS possesses various surface-associated substances such as hyaluronic capsule, M proteins, and fibronectin/laminin/immunoglobulin-binding proteins. These are related to the virulence and play multifaceted and mutually reflected roles in the pathogenesis of GAS infections. Invasion of GAS into epithelial cells and deeper tissues provokes immune and non-immune defense or inflammatory responses including the recruitment of neutrophils, macrophages, and dendritic cells in hosts. GAS frequently evades host defense mechanisms by using its virulence factors. Extracellular products of GAS may perturb cellular and subcellular functions and degrade tissues enzymatically, which leads to the aggravation of local and/or systemic disorders in the host. In this review, we summarize some important cellular and extracellular substances that may affect pathogenic processes during GAS infections, and the host responses to these. PMID:26666305

  7. Genomic Analysis Reveals the Molecular Basis for Capsule Loss in the Group B Streptococcus Population

    PubMed Central

    Rosini, Roberto; Campisi, Edmondo; De Chiara, Matteo; Tettelin, Hervé; Rinaudo, Daniela; Toniolo, Chiara; Metruccio, Matteo; Guidotti, Silvia; Sørensen, Uffe B. Skov; Kilian, Mogens; Ramirez, Mario; Janulczyk, Robert; Donati, Claudio; Grandi, Guido; Margarit, Immaculada

    2015-01-01

    The human and bovine bacterial pathogen Streptococcus agalactiae (Group B Streptococcus, GBS) expresses a thick polysaccharide capsule that constitutes a major virulence factor and vaccine target. GBS can be classified into ten distinct serotypes differing in the chemical composition of their capsular polysaccharide. However, non-typeable strains that do not react with anti-capsular sera are frequently isolated from colonized and infected humans and cattle. To gain a comprehensive insight into the molecular basis for the loss of capsule expression in GBS, a collection of well-characterized non-typeable strains was investigated by genome sequencing. Genome based phylogenetic analysis extended to a wide population of sequenced strains confirmed the recently observed high clonality among GBS lineages mainly containing human strains, and revealed a much higher degree of diversity in the bovine population. Remarkably, non-typeable strains were equally distributed in all lineages. A number of distinct mutations in the cps operon were identified that were apparently responsible for inactivation of capsule synthesis. The most frequent genetic alterations were point mutations leading to stop codons in the cps genes, and the main target was found to be cpsE encoding the portal glycosyl trasferase of capsule biosynthesis. Complementation of strains carrying missense mutations in cpsE with a wild-type gene restored capsule expression allowing the identification of amino acid residues essential for enzyme activity. PMID:25946017

  8. Genomic analysis reveals the molecular basis for capsule loss in the group B Streptococcus population.

    PubMed

    Rosini, Roberto; Campisi, Edmondo; De Chiara, Matteo; Tettelin, Hervé; Rinaudo, Daniela; Toniolo, Chiara; Metruccio, Matteo; Guidotti, Silvia; Sørensen, Uffe B Skov; Kilian, Mogens; Ramirez, Mario; Janulczyk, Robert; Donati, Claudio; Grandi, Guido; Margarit, Immaculada

    2015-01-01

    The human and bovine bacterial pathogen Streptococcus agalactiae (Group B Streptococcus, GBS) expresses a thick polysaccharide capsule that constitutes a major virulence factor and vaccine target. GBS can be classified into ten distinct serotypes differing in the chemical composition of their capsular polysaccharide. However, non-typeable strains that do not react with anti-capsular sera are frequently isolated from colonized and infected humans and cattle. To gain a comprehensive insight into the molecular basis for the loss of capsule expression in GBS, a collection of well-characterized non-typeable strains was investigated by genome sequencing. Genome based phylogenetic analysis extended to a wide population of sequenced strains confirmed the recently observed high clonality among GBS lineages mainly containing human strains, and revealed a much higher degree of diversity in the bovine population. Remarkably, non-typeable strains were equally distributed in all lineages. A number of distinct mutations in the cps operon were identified that were apparently responsible for inactivation of capsule synthesis. The most frequent genetic alterations were point mutations leading to stop codons in the cps genes, and the main target was found to be cpsE encoding the portal glycosyl transferase of capsule biosynthesis. Complementation of strains carrying missense mutations in cpsE with a wild-type gene restored capsule expression allowing the identification of amino acid residues essential for enzyme activity. PMID:25946017

  9. Comparative genomic hybridisation divides retinoblastomas into a high and a low level chromosomal instability group

    PubMed Central

    van der Wal, J E; Hermsen, M A J A; Gille, H J P; Schouten-Van Meeteren, N Y N; Moll, A C; Imhof, S M; Meijer, G A; Baak, J P A; van der Valk, P

    2003-01-01

    Background: Retinoblastoma is the most common intraocular malignancy in childhood and is responsible for approximately 1% of all deaths caused by childhood cancer. Aims/methods: Comparative genomic hybridisation was performed on 13 consecutive, histologically confirmed retinoblastomas to analyse patterns of chromosomal changes and correlate these to clinicopathological variables. Six cases were hereditary and seven cases were sporadic. Results: In 11 of the 13 tumours chromosomal abnormalities were detected, most frequently gains. Frequent chromosomal gains concerned 6p (46%), 1q (38%), 2p, 9q (30%), 5p, 7q, 10q, 17q, and 20q (23%). Frequent losses occurred at Xq (46%), 13q14, 16q, and 4q (23%). High level copy number gains were found at 5p15 and 6p11–12. A loss at 13q14 occurred in three cases only. Relatively few events occurred in the hereditary cases (27) compared with the non-hereditary cases (70 events). The number of chromosomal aberrations in these 13 retinoblastomas showed a bimodal distribution. Seven tumours showed less than four chromosomal aberrations, falling into a low level chromosomal instability (CIN) group, and six tumours showed at least eight aberrations, falling into a high level CIN group. In the low level CIN group the mean age was half that seen in the high level CIN group, there were less male patients, and there were more hereditary and bilateral cases. Microsatellite instability was not detected in either of the two groups. Conclusion: Despite the complex pattern of genetic changes in retinoblastomas, certain chromosomal regions appear to be affected preferentially. On the basis of the number of genetic events, retinoblastomas can be divided in low and a high level chromosomal instability groups, which have striking differences in clinical presentation. PMID:12499428

  10. Assignment of simian rotavirus SA11 temperature-sensitive mutant groups B and E to genome segments

    SciTech Connect

    Gombold, J.L.; Estes, M.K.; Ramig, R.F.

    1985-05-01

    Recombinant (reassortant) viruses were selected from crosses between temperature-sensitive (ts) mutants of simian rotavirus SA11 and wild-type human rotavirus Wa. The double-stranded genome RNAs of the reassortants were examined by electrophoresis in Tris-glycine-buffered polyacrylamide gels and by dot hybridization with a cloned DNA probe for genome segment 2. Analysis of replacements of genome segments in the reassortants allowed construction of a map correlating genome segments providing functions interchangeable between SA11 and Wa. The reassortants revealed a functional correspondence in order of increasing electrophoretic mobility of genome segments. Analysis of the parental origin of genome segments in ts+ SA11/Wa reassortants derived from the crosses SA11 tsB(339) X Wa and SA11 tsE(1400) X Wa revealed that the group B lesion of tsB(339) was located on genome segment 3 and the group E lesion of tsE(1400) was on segment 8.

  11. Clostridium botulinum Group II Isolate Phylogenomic Profiling Using Whole-Genome Sequence Data.

    PubMed

    Weedmark, K A; Mabon, P; Hayden, K L; Lambert, D; Van Domselaar, G; Austin, J W; Corbett, C R

    2015-09-01

    Clostridium botulinum group II isolates (n = 163) from different geographic regions, outbreaks, and neurotoxin types and subtypes were characterized in silico using whole-genome sequence data. Two clusters representing a variety of botulinum neurotoxin (BoNT) types and subtypes were identified by multilocus sequence typing (MLST) and core single nucleotide polymorphism (SNP) analysis. While one cluster included BoNT/B4/F6/E9 and nontoxigenic members, the other comprised a wide variety of different BoNT/E subtype isolates and a nontoxigenic strain. In silico MLST and core SNP methods were consistent in terms of clade-level isolate classification; however, core SNP analysis showed higher resolution capability. Furthermore, core SNP analysis correctly distinguished isolates by outbreak and location. This study illustrated the utility of next-generation sequence-based typing approaches for isolate characterization and source attribution and identified discrete SNP loci and MLST alleles for isolate comparison. PMID:26116673

  12. Whole genome sequencing reveals extensive community-level transmission of group A Streptococcus in remote communities.

    PubMed

    Bowen, A C; Harris, T; Holt, D C; Giffard, P M; Carapetis, J R; Campbell, P T; McVERNON, J; Tong, S Y C

    2016-07-01

    Impetigo is common in remote Indigenous children of northern Australia, with the primary driver in this context being Streptococcus pyogenes [or group A Streptococcus (GAS)]. To reduce the high burden of impetigo, the transmission dynamics of GAS must be more clearly elucidated. We performed whole genome sequencing on 31 GAS isolates collected in a single community from children in 11 households with ⩾2 GAS-infected children. We aimed to determine whether transmission was occurring principally within households or across the community. The 31 isolates were represented by nine multilocus sequence types and isolates within each sequence type differed from one another by only 0-3 single nucleotide polymorphisms. There was evidence of extensive transmission both within households and across the community. Our findings suggest that strategies to reduce the burden of impetigo in this setting will need to extend beyond individual households, and incorporate multi-faceted, community-wide approaches. PMID:26833141

  13. Clostridium botulinum Group II Isolate Phylogenomic Profiling Using Whole-Genome Sequence Data

    PubMed Central

    Weedmark, K. A.; Mabon, P.; Hayden, K. L.; Lambert, D.; Van Domselaar, G.; Austin, J. W.

    2015-01-01

    Clostridium botulinum group II isolates (n = 163) from different geographic regions, outbreaks, and neurotoxin types and subtypes were characterized in silico using whole-genome sequence data. Two clusters representing a variety of botulinum neurotoxin (BoNT) types and subtypes were identified by multilocus sequence typing (MLST) and core single nucleotide polymorphism (SNP) analysis. While one cluster included BoNT/B4/F6/E9 and nontoxigenic members, the other comprised a wide variety of different BoNT/E subtype isolates and a nontoxigenic strain. In silico MLST and core SNP methods were consistent in terms of clade-level isolate classification; however, core SNP analysis showed higher resolution capability. Furthermore, core SNP analysis correctly distinguished isolates by outbreak and location. This study illustrated the utility of next-generation sequence-based typing approaches for isolate characterization and source attribution and identified discrete SNP loci and MLST alleles for isolate comparison. PMID:26116673

  14. Tetrahymena macronuclear genome mapping: colinearity Of macronuclear coassortment groups and the micronuclear map on chromosome 1l.

    PubMed Central

    Wickert, S; Nangle, L; Shevel, S; Orias, E

    2000-01-01

    The genetics of the ciliate Tetrahymena thermophila are richer than for most other eukaryotic cells, because Tetrahymena possesses two genomes: a germline (micronuclear) genome that follows a Mendelian model of genetic transmission and a somatic (macronuclear) genome, derived from the micronuclear genome by fragmentation, which follows a different genetic transmission model called phenotypic assortment. While genetic markers in the micronucleus fall into classical linkage groups under meiotic recombination and segregation, the same markers in the macronucleus fall into coassortment groups (CAGs) under phenotypic assortment by the random distribution of MAC chromosome pieces. We set out to determine whether genomic mapping in the macronucleus by genetic means is feasible. To investigate the relationship between the micronuclear map and coassortment groups, we systematically placed into CAGs all of the markers lying on chromosome 1L that are also found in the macronucleus. Sixteen CAGs were identified, 7 of which contain at least two loci. We have concluded that CAGs represent a fundamental genetic feature of the MAC. The MIC and MAC maps on 1L are colinear; that is, CAGs consist exclusively of markers that map to a continuous segment in a given region of the micronuclear map, with no intervening markers from other CAGs. These findings provide a solid foundation for exploiting the MAC chromosome pieces to build a physical map of the Tetrahymena genome. PMID:10757760

  15. In Silico Genomic Fingerprints of the Bacillus anthracis Group Obtained by Virtual Hybridization

    PubMed Central

    Jaimes-Díaz, Hueman; Larios-Serrato, Violeta; Lloret-Sánchez, Teresa; Olguín-Ruiz, Gabriela; Sánchez-Vallejo, Carlos; Carreño-Durán, Luis; Maldonado-Rodríguez, Rogelio; Méndez-Tenorio, Alfonso

    2015-01-01

    In this study we evaluate the capacity of Virtual Hybridization to identify between highly related bacterial strains. Eight genomic fingerprints were obtained by virtual hybridization for the Bacillus anthracis genome set, and a set of 15,264 13-nucleotide short probes designed to produce genomic fingerprints unique for each organism. The data obtained from each genomic fingerprint were used to obtain hybridization patterns simulating a DNA microarray. Two virtual hybridization methods were used: the Direct and the Extended method to identify the number of potential hybridization sites and thus determine the minimum sensitivity value to discriminate between genomes with 99.9% similarity. Genomic fingerprints were compared using both methods and phylogenomic trees were constructed to verify that the minimum detection value is 0.000017. Results obtained from the genomic fingerprints suggest that the distribution in the trees is correct, as compared to other taxonomic methods. Specific virtual hybridization sites for each of the genomes studied were also identified.

  16. Complete Genome Sequence of Nitrosomonas ureae Strain Nm10, an Oligotrophic Group 6a Nitrosomonad

    PubMed Central

    Kozlowski, Jessica A.; Kits, K. Dimitri

    2016-01-01

    The complete genome of Nitrosomonas ureae strain Nm10, a mesophilic betaproteobacterial ammonia oxidizer isolated from Mediterranean soils in Sardinia, Italy, is reported here. This genome represents a cluster 6a nitrosomonad. PMID:26966201

  17. Genome Sequences of Two Bacillus cereus Group Bacteriophages, Eyuki and AvesoBmore

    PubMed Central

    Erill, Ivan

    2015-01-01

    The genomes of two double-stranded DNA (dsDNA) bacteriophages isolated on Bacillus thuringiensis show similarity to previously sequenced phages and provide evidence of the mosaicism of phage genomes. PMID:26472840

  18. Complete Genome Sequence of Streptococcus dysgalactiae subsp. equisimilis 167 Carrying Lancefield Group C Antigen and Comparative Genomics of S. dysgalactiae subsp. equisimilis Strains

    PubMed Central

    Watanabe, Shinya; Kirikae, Teruo; Miyoshi-Akiyama, Tohru

    2013-01-01

    Streptococcus dysgalactiae subsp. equisimilis (SDSE) is an emerging human pathogen that causes life-threatening invasive infections such as streptococcal toxic shock syndrome. Recent epidemiological studies reveal that invasive SDSE infections have been increasing in Asia, Europe, and the United States. Almost all SDSE carry Lancefield group G or C antigen. We have determined the complete genome sequence of a human group C SDSE 167 strain. A comparison of its sequence with that of four SDSE strains, three in Lancefield group G and one in Lancefield group A, showed approximately 90% coverage. Most regions showing little or no homology were located in the prophages. There was no evidence of massive rearrangement in the genome of SDSE 167. Bayesian phylogeny using entire genome sequences showed that the most recent common ancestor of the five SDSE strains appeared 446 years ago. Interestingly, we found that SDSE 167 harbors sugar metabolizing enzymes in a unique region and streptodornase in the phage region, which presumably contribute to the degradation of host tissues and the prompted covRS mutation, respectively. A comparison of these five SDSE strains, which differ in Lancefield group antigens, revealed a gene cluster presumably responsible for the synthesis of the antigenic determinant. These results may provide the basis for molecular epidemiological research of SDSE. PMID:23918808

  19. Big Data: the challenge for small research groups in the era of cancer genomics

    PubMed Central

    Noor, Aisyah Mohd; Holmberg, Lars; Gillett, Cheryl; Grigoriadis, Anita

    2015-01-01

    In the past decade, cancer research has seen an increasing trend towards high-throughput techniques and translational approaches. The increasing availability of assays that utilise smaller quantities of source material and produce higher volumes of data output have resulted in the necessity for data storage solutions beyond those previously used. Multifactorial data, both large in sample size and heterogeneous in context, needs to be integrated in a standardised, cost-effective and secure manner. This requires technical solutions and administrative support not normally financially accounted for in small- to moderate-sized research groups. In this review, we highlight the Big Data challenges faced by translational research groups in the precision medicine era; an era in which the genomes of over 75 000 patients will be sequenced by the National Health Service over the next 3 years to advance healthcare. In particular, we have looked at three main themes of data management in relation to cancer research, namely (1) cancer ontology management, (2) IT infrastructures that have been developed to support data management and (3) the unique ethical challenges introduced by utilising Big Data in research. PMID:26492224

  20. Big Data: the challenge for small research groups in the era of cancer genomics.

    PubMed

    Noor, Aisyah Mohd; Holmberg, Lars; Gillett, Cheryl; Grigoriadis, Anita

    2015-11-17

    In the past decade, cancer research has seen an increasing trend towards high-throughput techniques and translational approaches. The increasing availability of assays that utilise smaller quantities of source material and produce higher volumes of data output have resulted in the necessity for data storage solutions beyond those previously used. Multifactorial data, both large in sample size and heterogeneous in context, needs to be integrated in a standardised, cost-effective and secure manner. This requires technical solutions and administrative support not normally financially accounted for in small- to moderate-sized research groups. In this review, we highlight the Big Data challenges faced by translational research groups in the precision medicine era; an era in which the genomes of over 75,000 patients will be sequenced by the National Health Service over the next 3 years to advance healthcare. In particular, we have looked at three main themes of data management in relation to cancer research, namely (1) cancer ontology management, (2) IT infrastructures that have been developed to support data management and (3) the unique ethical challenges introduced by utilising Big Data in research. PMID:26492224

  1. Genome-wide identification of genes required for fitness of group A Streptococcus in human blood.

    PubMed

    Le Breton, Yoann; Mistry, Pragnesh; Valdes, Kayla M; Quigley, Jeffrey; Kumar, Nikhil; Tettelin, Hervé; McIver, Kevin S

    2013-03-01

    The group A streptococcus (GAS) is a strict human pathogen responsible for a wide spectrum of diseases. Although GAS genome sequences are available, functional genomic analyses have been limited. We developed a mariner-based transposon, osKaR, designed to perform Transposon-Site Hybridization (TraSH) in GAS and successfully tested its use in several invasive serotypes. A complex osKaR mutant library in M1T1 GAS strain 5448 was subjected to negative selection in human blood to identify genes important for GAS fitness in this clinically relevant environment. Mutants underrepresented after growth in blood (output pool) compared to growth in rich media (input pool) were identified using DNA microarray hybridization of transposon-specific tags en masse. Using blood from three different donors, we identified 81 genes that met our criteria for reduced fitness in blood from at least two individuals. Genes known to play a role in survival of GAS in blood were found, including those encoding the virulence regulator Mga (mga), the peroxide response regulator PerR (perR), and the RofA-like regulator Ralp-3 (ralp3). We also identified genes previously reported for their contribution to sepsis in other pathogens, such as de novo nucleotide synthesis (purD, purA, pyrB, carA, carB, guaB), sugar metabolism (scrB, fruA), zinc uptake (adcC), and transcriptional regulation (cpsY). To validate our findings, independent mutants with mutations in 10 different genes identified in our screen were confirmed to be defective for survival in blood bactericidal assays. Overall, this work represents the first use of TraSH in GAS to identify potential virulence genes. PMID:23297387

  2. Genome-Wide Identification of Genes Required for Fitness of Group A Streptococcus in Human Blood

    PubMed Central

    Le Breton, Yoann; Mistry, Pragnesh; Valdes, Kayla M.; Quigley, Jeffrey; Kumar, Nikhil; Tettelin, Hervé

    2013-01-01

    The group A streptococcus (GAS) is a strict human pathogen responsible for a wide spectrum of diseases. Although GAS genome sequences are available, functional genomic analyses have been limited. We developed a mariner-based transposon, osKaR, designed to perform Transposon-Site Hybridization (TraSH) in GAS and successfully tested its use in several invasive serotypes. A complex osKaR mutant library in M1T1 GAS strain 5448 was subjected to negative selection in human blood to identify genes important for GAS fitness in this clinically relevant environment. Mutants underrepresented after growth in blood (output pool) compared to growth in rich media (input pool) were identified using DNA microarray hybridization of transposon-specific tags en masse. Using blood from three different donors, we identified 81 genes that met our criteria for reduced fitness in blood from at least two individuals. Genes known to play a role in survival of GAS in blood were found, including those encoding the virulence regulator Mga (mga), the peroxide response regulator PerR (perR), and the RofA-like regulator Ralp-3 (ralp3). We also identified genes previously reported for their contribution to sepsis in other pathogens, such as de novo nucleotide synthesis (purD, purA, pyrB, carA, carB, guaB), sugar metabolism (scrB, fruA), zinc uptake (adcC), and transcriptional regulation (cpsY). To validate our findings, independent mutants with mutations in 10 different genes identified in our screen were confirmed to be defective for survival in blood bactericidal assays. Overall, this work represents the first use of TraSH in GAS to identify potential virulence genes. PMID:23297387

  3. Xeroderma Pigmentosum Group A Suppresses Mutagenesis Caused by Clustered Oxidative DNA Adducts in the Human Genome

    PubMed Central

    Sassa, Akira; Kamoshita, Nagisa; Kanemaru, Yuki; Honma, Masamitsu; Yasui, Manabu

    2015-01-01

    Clustered DNA damage is defined as multiple sites of DNA damage within one or two helical turns of the duplex DNA. This complex damage is often formed by exposure of the genome to ionizing radiation and is difficult to repair. The mutagenic potential and repair mechanisms of clustered DNA damage in human cells remain to be elucidated. In this study, we investigated the involvement of nucleotide excision repair (NER) in clustered oxidative DNA adducts. To identify the in vivo protective roles of NER, we established a human cell line lacking the NER gene xeroderma pigmentosum group A (XPA). XPA knockout (KO) cells were generated from TSCER122 cells derived from the human lymphoblastoid TK6 cell line. To analyze the mutagenic events in DNA adducts in vivo, we previously employed a system of tracing DNA adducts in the targeted mutagenesis (TATAM), in which DNA adducts were site-specifically introduced into intron 4 of thymidine kinase genes. Using the TATAM system, one or two tandem 7,8-dihydro-8-oxoguanine (8-oxoG) adducts were introduced into the genomes of TSCER122 or XPA KO cells. In XPA KO cells, the proportion of mutants induced by a single 8-oxoG (7.6%) was comparable with that in TSCER122 cells (8.1%). In contrast, the lack of XPA significantly enhanced the mutant proportion of tandem 8-oxoG in the transcribed strand (12%) compared with that in TSCER122 cells (7.4%) but not in the non-transcribed strand (12% and 11% in XPA KO and TSCER122 cells, respectively). By sequencing the tandem 8-oxoG-integrated loci in the transcribed strand, we found that the proportion of tandem mutations was markedly increased in XPA KO cells. These results indicate that NER is involved in repairing clustered DNA adducts in the transcribed strand in vivo. PMID:26559182

  4. Reconciliation of rotavirus temperature-sensitive mutant collections and assignment of reassortment groups D, J, and K to genome segments.

    PubMed

    Criglar, Jeanette; Greenberg, Harry B; Estes, Mary K; Ramig, Robert F

    2011-05-01

    Four rotavirus SA11 temperature-sensitive (ts) mutants and seven rotavirus RRV ts mutants, isolated at the National Institutes of Health (NIH) and not genetically characterized, were assigned to reassortment groups by pairwise crosses with the SA11 mutant group prototypes isolated and characterized at Baylor College of Medicine (BCM). Among the NIH mutants, three of the RRV mutants and all four SA11 mutants contained mutations in single reassortment groups, and four RRV mutants contained mutations in multiple groups. One NIH mutant [RRVtsK(2)] identified the previously undefined 11th reassortment group (K) expected for rotavirus. Three NIH single mutant RRV viruses, RRVtsD(7), RRVtsJ(5), and RRVtsK(2), were in reassortment groups not previously mapped to genome segments. These mutants were mapped using classical genetic methods, including backcrosses to demonstrate reversion or suppression in reassortants with incongruent genotype and temperature phenotype. Once located to specific genome segments by genetic means, the mutations responsible for the ts phenotype were identified by sequencing. The reassortment group K mutant RRVtsK(2) maps to genome segment 9 and has a Thr280Ileu mutation in the capsid surface glycoprotein VP7. The group D mutant RRVtsD(7) maps to segment 5 and has a Leu140Val mutation in the nonstructural interferon (IFN) antagonist protein NSP1. The group J mutant RRVtsJ(5) maps to segment 11 and has an Ala182Gly mutation affecting only the NSP5 open reading frame. Rotavirus ts mutation groups are now mapped to 9 of the 11 rotavirus genome segments. Possible segment locations of the two remaining unmapped ts mutant groups are discussed. PMID:21367894

  5. Complete Genome Sequence of Streptococcus mitis Strain SVGS_061 Isolated from a Neutropenic Patient with Viridans Group Streptococcal Shock Syndrome.

    PubMed

    Petrosyan, Varduhi; Holder, Michael; Ajami, Nadim J; Petrosino, Joseph F; Sahasrabhojane, Pranoti; Thompson, Erika J; Kalia, Awdhesh; Shelburne, Samuel A

    2016-01-01

    Streptococcus mitisfrequently causes invasive infections in neutropenic cancer patients, with a subset of patients developing viridans group streptococcal (VGS) shock syndrome. We report here the first complete genome sequence ofS. mitisstrain SVGS_061, which caused VGS shock syndrome, to help elucidate the pathogenesis of severe VGS infection. PMID:27056234

  6. Draft Genome Sequences of Two Clostridium botulinum Group II (Nonproteolytic) Type B Strains (DB-2 and KAPB-3).

    PubMed

    Petronella, Nicholas; Kenwell, Robyn; Pagotto, Franco; Pightling, Arthur W

    2014-01-01

    Clostridium botulinum is important for food safety and studies of neurotoxins associated with human botulism. We present the draft genome sequences of two strains belonging to group II type B: one collected from Pacific Ocean sediments (DB-2) and another obtained during a botulism outbreak (KAPB-3). PMID:25377702

  7. Draft genome sequence of Streptomyces vitaminophilus ATCC 31673, a producer of pyrrolomycin antibiotics, some of which contain a nitro group

    DOE PAGESBeta

    Mahan, Kristina M.; Klingeman, Dawn Marie; Robert L. Hettich; Parry, Ronald J.; Graham, David E.

    2016-01-21

    Streptomyces vitaminophilus produces pyrrolomycins, which are halogenated polyketide antibiotics. Some of the pyrrolomycins contain a rare nitro group located on the pyrrole ring. In addition, the 6.5-Mbp genome encodes 5,941 predicted protein-coding sequences in 39 contigs with a 71.9% G+C content.

  8. Draft Genome Sequence of Streptomyces vitaminophilus ATCC 31673, a Producer of Pyrrolomycin Antibiotics, Some of Which Contain a Nitro Group

    PubMed Central

    Klingeman, Dawn M.; Hettich, Robert L.; Parry, Ronald J.

    2016-01-01

    Streptomyces vitaminophilus produces pyrrolomycins, which are halogenated polyketide antibiotics. Some of the pyrrolomycins contain a rare nitro group located on the pyrrole ring. The 6.5-Mbp genome encodes 5,941 predicted protein-coding sequences in 39 contigs with a 71.9% G+C content. PMID:26798098

  9. Complete Genome Sequence of Streptococcus mitis Strain SVGS_061 Isolated from a Neutropenic Patient with Viridans Group Streptococcal Shock Syndrome

    PubMed Central

    Petrosyan, Varduhi; Holder, Michael; Ajami, Nadim J.; Petrosino, Joseph F.; Sahasrabhojane, Pranoti; Thompson, Erika J.; Kalia, Awdhesh

    2016-01-01

    Streptococcus mitis frequently causes invasive infections in neutropenic cancer patients, with a subset of patients developing viridans group streptococcal (VGS) shock syndrome. We report here the first complete genome sequence of S. mitis strain SVGS_061, which caused VGS shock syndrome, to help elucidate the pathogenesis of severe VGS infection. PMID:27056234

  10. Effects of racial and ethnic group and health literacy on responses to genomic risk information in a medically underserved population

    PubMed Central

    Kaphingst, Kimberly A.; Stafford, Jewel D.; McGowan, Lucy D’Agostino; Seo, Joann; Lachance, Christina R.; Goodman, Melody S.

    2015-01-01

    Objective Few studies have examined how individuals respond to genomic risk information for common, chronic diseases. This randomized study examined differences in responses by type of genomic information [genetic test/family history] and disease condition [diabetes/heart disease] and by race/ethnicity in a medically underserved population. Methods 1057 English-speaking adults completed a survey containing one of four vignettes (two-by-two randomized design). Differences in dependent variables (i.e., interest in receiving genomic assessment, discussing with doctor or family, changing health habits) by experimental condition and race/ethnicity were examined using chi-squared tests and multivariable regression analysis. Results No significant differences were found in dependent variables by type of genomic information or disease condition. In multivariable models, Hispanics were more interested in receiving a genomic assessment than Whites (OR=1.93; p<0.0001); respondents with marginal (OR=1.54; p=0.005) or limited (OR=1.85; p=0.009) health literacy had greater interest than those with adequate health literacy. Blacks (OR=1.78; p=0.001) and Hispanics (OR=1.85; p=0.001) had greater interest in discussing information with family than Whites. Non-Hispanic Blacks (OR=1.45; p=0.04) had greater interest in discussing genomic information with a doctor than Whites. Blacks (β= −0.41; p<0.001) and Hispanics (β= −0.25; p=0.033) intended to change fewer health habits than Whites; health literacy was negatively associated with number of health habits participants intended to change. Conclusions Findings suggest that race/ethnicity may affect responses to genomic risk information. Additional research could examine how cognitive representations of this information differ across racial/ethnic groups. Health literacy is also critical to consider in developing approaches to communicating genomic information. PMID:25622080

  11. How the indirect reciprocity with co-evolving norm and strategy for 2 × 2 prisoner's dilemma game works for emerging cooperation

    NASA Astrophysics Data System (ADS)

    Tanimoto, Jun; Sagara, Hirokji

    2015-11-01

    We built a new indirect reciprocity model based on binary image scores, where an agent's strategy and norm co-evolve. The norm, meaning what behavior is evaluated as "good" or "bad," stipulates how image scores of two agents playing a game is altered, which has been presumed to be a fixed value in most previous studies. Also, unlike former studies, our model allows an agent to play with an agent who has a different norm. This point of relaxing the freedom of the model pulls down cooperation level vis-à-vis the case where an agent always plays with another one having same norm. However, it is observed that a rather larger dilemma shows robust cooperation establishing compared with a smaller dilemma, since a norm that punishes a so-called second-order free-rider is prompted. To encourage the evolution of norms to be able to punish second-order free-riders, a society needs a small number of defectors. This is elucidated by the fact that cases with action error are more cooperative than those without action error.

  12. Complete genome sequence of Lactobacillus plantarum LZ95, a potential probiotic strain producing bacteriocins and B-group vitamin riboflavin.

    PubMed

    Li, Ping; Gu, Qing

    2016-07-10

    Lactobacillus plantarum LZ95 is a potential probiotic isolated from newborn infant fecal and it is identified to produce riboflavin with great antimicrobial activity. The complete genome sequence of this strain was reported in the present study. The genome contains a 3,261,418-bp chromosome and two plasmids. Genes, related to the biosynthesis of bacteriocins and riboflavin, were identified. This work will facilitate to reveal the biosynthetic mechanism of bacteriocins and B-group vitamins in lactic acid bacteria and provide evidence for its potential application in food industry. PMID:27140869

  13. Draft Genome Sequences of Xanthomonas sacchari and Two Banana-Associated Xanthomonads Reveal Insights into the Xanthomonas Group 1 Clade

    PubMed Central

    Studholme, David J.; Wasukira, Arthur; Paszkiewicz, Konrad; Aritua, Valente; Thwaites, Richard; Smith, Julian; Grant, Murray

    2011-01-01

    We present draft genome sequences for three strains of Xanthomonas species, each of which was associated with banana plants (Musa species) but is not closely related to the previously sequenced banana-pathogen Xanthomonas campestris pathovar musacearum. Strain NCPPB4393 had been deposited as Xanthomonas campestris pathovar musacearum but in fact falls within the species Xanthomonas sacchari. Strain NCPPB1132 is more distantly related to Xanthomonas sacchari whilst strain NCPPB 1131 grouped in a distinct species-level clade related to X. sacchari, along with strains from ginger, rice, cotton and sugarcane. These three newly sequenced strains share many genomic features with the previously sequenced Xanthomonas albilineans, for example possessing an unsual metE allele and lacking the Hrp type III secretion system. However, they are distinct from Xanthomonas albilineans in many respects, for example showing little evidence of genome reduction. They also lack the SPI-1 type III secretion system found in Xanthomonas albilineans. Unlike X. albilineans, all three strains possess a gum gene cluster. The data reported here provide the first genome-wide survey of non-Hrp Xanthomonas species other than Xanthomonas albilineans, which is an atypical member of this group. We hope that the availability of complete sequence data for this group of organisms is the first step towards understanding their interactions with plants and identifying potential virulence factors. PMID:24710305

  14. Genomic properties of Marine Group A bacteria indicate a role in the marine sulfur cycle.

    PubMed

    Wright, Jody J; Mewis, Keith; Hanson, Niels W; Konwar, Kishori M; Maas, Kendra R; Hallam, Steven J

    2014-02-01

    Marine Group A (MGA) is a deeply branching and uncultivated phylum of bacteria. Although their functional roles remain elusive, MGA subgroups are particularly abundant and diverse in oxygen minimum zones and permanent or seasonally stratified anoxic basins, suggesting metabolic adaptation to oxygen-deficiency. Here, we expand a previous survey of MGA diversity in O2-deficient waters of the Northeast subarctic Pacific Ocean (NESAP) to include Saanich Inlet (SI), an anoxic fjord with seasonal O2 gradients and periodic sulfide accumulation. Phylogenetic analysis of small subunit ribosomal RNA (16S rRNA) gene clone libraries recovered five previously described MGA subgroups and defined three novel subgroups (SHBH1141, SHBH391, and SHAN400) in SI. To discern the functional properties of MGA residing along gradients of O2 in the NESAP and SI, we identified and sequenced to completion 14 fosmids harboring MGA-associated 16S RNA genes from a collection of 46 fosmid libraries sourced from NESAP and SI waters. Comparative analysis of these fosmids, in addition to four publicly available MGA-associated large-insert DNA fragments from Hawaii Ocean Time-series and Monterey Bay, revealed widespread genomic differentiation proximal to the ribosomal RNA operon that did not consistently reflect subgroup partitioning patterns observed in 16S rRNA gene clone libraries. Predicted protein-coding genes associated with adaptation to O2-deficiency and sulfur-based energy metabolism were detected on multiple fosmids, including polysulfide reductase (psrABC), implicated in dissimilatory polysulfide reduction to hydrogen sulfide and dissimilatory sulfur oxidation. These results posit a potential role for specific MGA subgroups in the marine sulfur cycle. PMID:24030600

  15. Community Genomic and Proteomic Analyses of Chemoautotrophic Iron-Oxidizing "Leptospirillum rubarum" (Group II) and "Leptospirillum ferrodiazotrophum" (Group III) Bacteria in Acid Mine Drainage Biofilms

    SciTech Connect

    Goltsman, Daniela; Denef, Vincent; Singer, Steven; Verberkmoes, Nathan C; Lefsrud, Mark G; Mueller, Ryan; Dick, Gregory J.; Sun, Christine; Wheeler, Korin; Zelma, Adam; Baker, Brett J.; Hauser, Loren John; Land, Miriam L; Shah, Manesh B; Thelen, Michael P.; Hettich, Robert {Bob} L; Banfield, Jillian F.

    2009-01-01

    We analyzed near-complete population (composite) genomic sequences for coexisting acidophilic iron-oxidizing Leptospirillum group II and III bacteria (phylum Nitrospirae) and an extrachromosomal plasmid from a Richmond Mine, Iron Mountain, CA, acid mine drainage biofilm. Community proteomic analysis of the genomically characterized sample and two other biofilms identified 64.6% and 44.9% of the predicted proteins of Leptospirillum groups II and III, respectively, and 20% of the predicted plasmid proteins. The bacteria share 92% 16S rRNA gene sequence identity and >60% of their genes, including integrated plasmid-like regions. The extrachromosomal plasmid carries conjugation genes with detectable sequence similarity to genes in the integrated conjugative plasmid, but only those on the extrachromosomal element were identified by proteomics. Both bacterial groups have genes for community-essential functions, including carbon fixation and biosynthesis of vitamins, fatty acids, and biopolymers (including cellulose); proteomic analyses reveal these activities. Both Leptospirillum types have multiple pathways for osmotic protection. Although both are motile, signal transduction and methyl-accepting chemotaxis proteins are more abundant in Leptospirillum group III, consistent with its distribution in gradients within biofilms. Interestingly, Leptospirillum group II uses a methyl-dependent and Leptospirillum group III a methyl-independent response pathway. Although only Leptospirillum group III can fix nitrogen, these proteins were not identified by proteomics. The abundances of core proteins are similar in all communities, but the abundance levels of unique and shared proteins of unknown function vary. Some proteins unique to one organism were highly expressed and may be key to the functional and ecological differentiation of Leptospirillum groups II and III.

  16. Dispersion of the RmInt1 group II intron in the Sinorhizobium meliloti genome upon acquisition by conjugative transfer

    PubMed Central

    Nisa-Martínez, Rafael; Jiménez-Zurdo, José I.; Martínez-Abarca, Francisco; Muñoz-Adelantado, Estefanía; Toro, Nicolás

    2007-01-01

    RmInt1 is a self-splicing and mobile group II intron initially identified in the bacterium Sinorhizobium meliloti, which encodes a reverse transcriptase–maturase (Intron Encoded Protein, IEP) lacking the C-terminal DNA binding (D) and DNA endonuclease domains (En). RmInt1 invades cognate intronless homing sites (ISRm2011-2) by a mechanism known as retrohoming. This work describes how the RmInt1 intron spreads in the S.meliloti genome upon acquisition by conjugation. This process was revealed by using the wild-type intron RmInt1 and engineered intron-donor constructs based on ribozyme coding sequence (ΔORF)-derivatives with higher homing efficiency than the wild-type intron. The data demonstrate that RmInt1 propagates into the S.meliloti genome primarily by retrohoming with a strand bias related to replication of the chromosome and symbiotic megaplasmids. Moreover, we show that when expressed in trans from a separate plasmid, the IEP is able to mobilize genomic ΔORF ribozymes that afterward displayed wild-type levels of retrohoming. Our results contribute to get further understanding of how group II introns spread into bacterial genomes in nature. PMID:17158161

  17. Dispersion of the RmInt1 group II intron in the Sinorhizobium meliloti genome upon acquisition by conjugative transfer.

    PubMed

    Nisa-Martínez, Rafael; Jiménez-Zurdo, José I; Martínez-Abarca, Francisco; Muñoz-Adelantado, Estefanía; Toro, Nicolás

    2007-01-01

    RmInt1 is a self-splicing and mobile group II intron initially identified in the bacterium Sinorhizobium meliloti, which encodes a reverse transcriptase-maturase (Intron Encoded Protein, IEP) lacking the C-terminal DNA binding (D) and DNA endonuclease domains (En). RmInt1 invades cognate intronless homing sites (ISRm2011-2) by a mechanism known as retrohoming. This work describes how the RmInt1 intron spreads in the S.meliloti genome upon acquisition by conjugation. This process was revealed by using the wild-type intron RmInt1 and engineered intron-donor constructs based on ribozyme coding sequence (DeltaORF)-derivatives with higher homing efficiency than the wild-type intron. The data demonstrate that RmInt1 propagates into the S.meliloti genome primarily by retrohoming with a strand bias related to replication of the chromosome and symbiotic megaplasmids. Moreover, we show that when expressed in trans from a separate plasmid, the IEP is able to mobilize genomic DeltaORF ribozymes that afterward displayed wild-type levels of retrohoming. Our results contribute to get further understanding of how group II introns spread into bacterial genomes in nature. PMID:17158161

  18. Genomic Analysis of Melioribacter roseus, Facultatively Anaerobic Organotrophic Bacterium Representing a Novel Deep Lineage within Bacteriodetes/Chlorobi Group

    PubMed Central

    Kadnikov, Vitaly V.; Mardanov, Andrey V.; Podosokorskaya, Olga A.; Gavrilov, Sergey N.; Kublanov, Ilya V.; Beletsky, Alexey V.; Bonch-Osmolovskaya, Elizaveta A.; Ravin, Nikolai V.

    2013-01-01

    Melioribacter roseus is a moderately thermophilic facultatively anaerobic organotrophic bacterium representing a novel deep branch within Bacteriodetes/Chlorobi group. To better understand the metabolic capabilities and possible ecological functions of M. roseus and get insights into the evolutionary history of this bacterial lineage, we sequenced the genome of the type strain P3M-2T. A total of 2838 open reading frames was predicted from its 3.30 Mb genome. The whole proteome analysis supported phylum-level classification of M. roseus since most of the predicted proteins had closest matches in Bacteriodetes, Proteobacteria, Chlorobi, Firmicutes and deeply-branching bacterium Caldithrix abyssi, rather than in one particular phylum. Consistent with the ability of the bacterium to grow on complex carbohydrates, the genome analysis revealed more than one hundred glycoside hydrolases, glycoside transferases, polysaccharide lyases and carbohydrate esterases. The reconstructed central metabolism revealed pathways enabling the fermentation of complex organic substrates, as well as their complete oxidation through aerobic and anaerobic respiration. Genes encoding the photosynthetic and nitrogen-fixation machinery of green sulfur bacteria, as well as key enzymes of autotrophic carbon fixation pathways, were not identified. The M. roseus genome supports its affiliation to a novel phylum Ignavibateriae, representing the first step on the evolutionary pathway from heterotrophic ancestors of Bacteriodetes/Chlorobi group towards anaerobic photoautotrophic Chlorobi. PMID:23301019

  19. Insights into the evolution of Archaea and eukaryotic protein modifier systems revealed by the genome of a novel archaeal group.

    PubMed

    Nunoura, Takuro; Takaki, Yoshihiro; Kakuta, Jungo; Nishi, Shinro; Sugahara, Junichi; Kazama, Hiromi; Chee, Gab-Joo; Hattori, Masahira; Kanai, Akio; Atomi, Haruyuki; Takai, Ken; Takami, Hideto

    2011-04-01

    The domain Archaea has historically been divided into two phyla, the Crenarchaeota and Euryarchaeota. Although regarded as members of the Crenarchaeota based on small subunit rRNA phylogeny, environmental genomics and efforts for cultivation have recently revealed two novel phyla/divisions in the Archaea; the 'Thaumarchaeota' and 'Korarchaeota'. Here, we show the genome sequence of Candidatus 'Caldiarchaeum subterraneum' that represents an uncultivated crenarchaeotic group. A composite genome was reconstructed from a metagenomic library previously prepared from a microbial mat at a geothermal water stream of a sub-surface gold mine. The genome was found to be clearly distinct from those of the known phyla/divisions, Crenarchaeota (hyperthermophiles), Euryarchaeota, Thaumarchaeota and Korarchaeota. The unique traits suggest that this crenarchaeotic group can be considered as a novel archaeal phylum/division. Moreover, C. subterraneum harbors an ubiquitin-like protein modifier system consisting of Ub, E1, E2 and small Zn RING finger family protein with structural motifs specific to eukaryotic system proteins, a system clearly distinct from the prokaryote-type system recently identified in Haloferax and Mycobacterium. The presence of such a eukaryote-type system is unprecedented in prokaryotes, and indicates that a prototype of the eukaryotic protein modifier system is present in the Archaea. PMID:21169198

  20. Complete genome sequence of Lactobacillus plantarum LZ227, a potential probiotic strain producing B-group vitamins.

    PubMed

    Li, Ping; Zhou, Qingqing; Gu, Qing

    2016-09-20

    B-group vitamins play an important role in human metabolism, whose deficiencies are associated with a variety of disorders and diseases. Certain microorganisms such as Lactic acid bacteria (LAB) have been shown to have capacities for B-group vitamin production and thus could potentially replace chemically synthesized vitamins for food fortification. A potential probiotic strain named Lactobacillus plantarum LZ227, which was isolated from raw cow milk in this study, exhibits the ability to produce B-group vitamins. Complete genome sequencing of LZ227 was performed to gain insights into the genetic elements involved in B-group vitamin production. The genome of LZ227 contains a circular 3,131,750-bp chromosome, three circular plasmids and two predicted linear plasmids. LZ227 also contains gene clusters for biosynthesis of both riboflavin and folate. This genome sequence provides a basis for further elucidation of its molecular genetics and probiotic functions, and will facilitate its applications as starter cultures in food industry. PMID:27480344

  1. Allopolyploidy-induced rapid genome evolution in the wheat (Aegilops-Triticum) group.

    PubMed

    Ozkan, H; Levy, A A; Feldman, M

    2001-08-01

    To better understand genetic events that accompany allopolyploid formation, we studied the rate and time of elimination of eight DNA sequences in F1 hybrids and newly formed allopolyploids of Aegilops and TRITICUM: In total, 35 interspecific and intergeneric F1 hybrids and 22 derived allopolyploids were analyzed and compared with their direct parental plants. The studied sequences exist in all the diploid species of the Triticeae but occur in only one genome, either in one homologous pair (chromosome-specific sequences [CSSs]) or in several pairs of the same genome (genome-specific sequences [GSSs]), in the polyploid wheats. It was found that rapid elimination of CSSs and GSSs is a general phenomenon in newly synthesized allopolyploids. Elimination of GSSs was already initiated in F1 plants and was completed in the second or third allopolyploid generation, whereas elimination of CSSs started in the first allopolyploid generation and was completed in the second or third generation. Sequence elimination started earlier in allopolyploids whose genome constitution was analogous to natural polyploids compared with allopolyploids that do not occur in nature. Elimination is a nonrandom and reproducible event whose direction was determined by the genomic combination of the hybrid or the allopolyploid. It was not affected by the genotype of the parental plants, by their cytoplasm, or by the ploidy level, and it did not result from intergenomic recombination. Allopolyploidy-induced sequence elimination occurred in a sizable fraction of the genome and in sequences that were apparently noncoding. This finding suggests a role in augmenting the differentiation of homoeologous chromosomes at the polyploid level, thereby providing the physical basis for the diploid-like meiotic behavior of newly formed allopolyploids. In our view, this rapid genome adjustment may have contributed to the successful establishment of newly formed allopolyploids as new species. PMID:11487689

  2. Community genomic and proteomic analysis of chemoautotrophic, iron-oxidizing "Leptospirillum rubarum" (Group II) and Leptospirillum ferrodiazotrophum (Group III) in acid mine drainage biofilms

    SciTech Connect

    Goltsman, Daniela; Denef, Vincent; Singer, Steven; Verberkmoes, Nathan C; Lefsrud, Mark G; Mueller, Ryan; Dick, Gregory J.; Sun, Christine; Wheeler, Korin; Zelma, Adam; Baker, Brett J.; Hauser, Loren John; Land, Miriam L; Shah, Manesh B; Thelen, Michael P.; Hettich, Robert {Bob} L; Banfield, Jillian F.

    2009-01-01

    We analyzed near-complete population (composite) genomic sequences for coexisting acidophilic iron-oxidizing Leptospirillum Groups II and III bacteria (phylum Nitrospirae) and an extrachromosomal plasmid from a Richmond Mine, CA acid mine drainage (AMD) biofilm. Community proteomic analysis of the genomically characterized sample and two other biofilms identified 64.6% and 44.9% of the predicted proteins of Leptospirillum Groups II and III, respectively and 20% of the predicted plasmid proteins. The bacteria share 92% 16S rRNA gene sequence identity and > 60% of their genes, including integrated plasmid-like regions. The extrachromosomal plasmid encodes conjugation genes with detectable sequence similarity to genes in the integrated conjugative plasmid, but only those on the extrachromosomal element were identified by proteomics. Both bacteria have genes for community-essential functions, including carbon fixation, biosynthesis of vitamins, fatty acids and biopolymers (including cellulose); proteomic analyses reveal these activities. Both Leptospirillum types have multiple pathways for osmotic protection. Although both are motile, signal transduction and methyl-accepting chemotaxis proteins are more abundant in Leptospirillum Group III, consistent with its distribution in gradients within biofilms. Interestingly, Leptospirillum Group II uses a methyl-dependent and Leptospirillum Group III a methyl-independent response pathway. Although only Leptospirillum Group III can fix nitrogen, these proteins were not identified by proteomics. Abundances of core proteins are similar in all communities, but abundance levels of unique and shared proteins of unknown function vary. Some proteins unique to one organism were highly expressed and may be key to the functional and ecological differentiation of Leptospirillum Groups II and III.

  3. Fundamental molecules of life are pigments which arose and co-evolved as a response to the thermodynamic imperative of dissipating the prevailing solar spectrum

    NASA Astrophysics Data System (ADS)

    Michaelian, K.; Simeonov, A.

    2015-08-01

    The driving force behind the origin and evolution of life has been the thermodynamic imperative of increasing the entropy production of the biosphere through increasing the global solar photon dissipation rate. In the upper atmosphere of today, oxygen and ozone derived from life processes are performing the short-wavelength UV-C and UV-B dissipation. On Earth's surface, water and organic pigments in water facilitate the near-UV and visible photon dissipation. The first organic pigments probably formed, absorbed, and dissipated at those photochemically active wavelengths in the UV-C and UV-B that could have reached Earth's surface during the Archean. Proliferation of these pigments can be understood as an autocatalytic photochemical process obeying non-equilibrium thermodynamic directives related to increasing solar photon dissipation rate. Under these directives, organic pigments would have evolved over time to increase the global photon dissipation rate by (1) increasing the ratio of their effective photon cross sections to their physical size, (2) decreasing their electronic excited state lifetimes, (3) quenching radiative de-excitation channels (e.g., fluorescence), (4) covering ever more completely the prevailing solar spectrum, and (5) proliferating and dispersing to cover an ever greater surface area of Earth. From knowledge of the evolution of the spectrum of G-type stars, and considering the most probable history of the transparency of Earth's atmosphere, we construct the most probable Earth surface solar spectrum as a function of time and compare this with the history of molecular absorption maxima obtained from the available data in the literature. This comparison supports the conjecture that many fundamental molecules of life are pigments which arose, proliferated, and co-evolved as a response to dissipating the solar spectrum, supports the thermodynamic dissipation theory for the origin of life, constrains models for Earth's early atmosphere, and sheds

  4. Novel Resistance-Nodulation-Cell Division Efflux System AdeDE in Acinetobacter Genomic DNA Group 3

    PubMed Central

    Chau, Sze-Lok; Chu, Yiu-Wai; Houang, Elizabeth T. S.

    2004-01-01

    Resistance-nodulation-cell division type efflux pump AdeDE was identified in acinetobacters belonging to genomic DNA group 3. Inactivation of adeE showed that it may be responsible for reduced susceptibility to amikacin, ceftazidime, chloramphenicol, ciprofloxacin, erythromycin, ethidium bromide, meropenem, rifampin, and tetracycline. However, unlike what was found for other similar efflux systems, the open reading frame for the outer membrane component was not found downstream of the adeDE gene cluster. PMID:15388479

  5. Survey of chimeric IStron elements in bacterial genomes: multiple molecular symbioses between group I intron ribozymes and DNA transposons

    PubMed Central

    Tourasse, Nicolas J.; Stabell, Fredrik B.; Kolstø, Anne-Brit

    2014-01-01

    IStrons are chimeric genetic elements composed of a group I intron associated with an insertion sequence (IS). The group I intron is a catalytic RNA providing the IStron with self-splicing ability, which renders IStron insertions harmless to the host genome. The IS element is a DNA transposon conferring mobility, and thus allowing the IStron to spread in genomes. IStrons are therefore a striking example of a molecular symbiosis between unrelated genetic elements endowed with different functions. In this study, we have conducted the first comprehensive survey of IStrons in sequenced genomes that provides insights into the distribution, diversity, origin and evolution of IStrons. We show that IStrons have a restricted phylogenetic distribution limited to two bacterial phyla, the Firmicutes and the Fusobacteria. Nevertheless, diverse IStrons representing two major groups targeting different insertion site motifs were identified. This taken with the finding that while the intron components of all IStrons belong to the same structural class, they are fused to different IS families, indicates that multiple intron–IS symbioses have occurred during evolution. In addition, introns and IS elements related to those that were at the origin of IStrons were also identified. PMID:25324310

  6. The first detection and whole genome characterization of the G6P[15] group A rotavirus strain from roe deer.

    PubMed

    Jamnikar-Ciglenecki, Urska; Kuhar, Urska; Sturm, Sabina; Kirbis, Andrej; Racki, Nejc; Steyer, Andrej

    2016-08-15

    Although rotaviruses have been detected in a variety of host species, there are only limited records of their occurrence in deer, where their role is unknown. In this study, group A rotavirus was identified in roe deer during a study of enteric viruses in game animals. 102 samples of intestinal content were collected from roe deer (56), wild boars (29), chamois (10), red deer (6) and mouflon (1), but only one sample from roe deer was positive. Following whole genome sequence analysis, the rotavirus strain D38/14 was characterized by next generation sequencing. The genotype constellation, comprising 11 genome segments, was G6-P[15]-I2-R2-C2-M2-A3-N2-T6-E2-H3. Phylogenetic analysis of the VP7 genome segment showed that the D38/14 rotavirus strain is closely related to the various G6 zoonotic rotavirus strains of bovine-like origin frequently detected in humans. In the VP4 segment, this strain showed high variation compared to that in the P[15] strain found in sheep and in a goat. This finding suggests that rotaviruses from deer are similar to those in other DS-1 rotavirus groups and could constitute a source of zoonotically transmitted rotaviruses. The epidemiological status of group A rotaviruses in deer should be further investigated. PMID:27374907

  7. A novel virus genome discovered in an extreme environment suggests recombination between unrelated groups of RNA and DNA viruses

    PubMed Central

    2012-01-01

    Background Viruses are known to be the most abundant organisms on earth, yet little is known about their collective origin and evolutionary history. With exceptionally high rates of genetic mutation and mosaicism, it is not currently possible to resolve deep evolutionary histories of the known major virus groups. Metagenomics offers a potential means of establishing a more comprehensive view of viral evolution as vast amounts of new sequence data becomes available for comparative analysis. Results Bioinformatic analysis of viral metagenomic sequences derived from a hot, acidic lake revealed a circular, putatively single-stranded DNA virus encoding a major capsid protein similar to those found only in single-stranded RNA viruses. The presence and circular configuration of the complete virus genome was confirmed by inverse PCR amplification from native DNA extracted from lake sediment. The virus genome appears to be the result of a RNA-DNA recombination event between two ostensibly unrelated virus groups. Environmental sequence databases were examined for homologous genes arranged in similar configurations and three similar putative virus genomes from marine environments were identified. This result indicates the existence of a widespread but previously undetected group of viruses. Conclusions This unique viral genome carries implications for theories of virus emergence and evolution, as no mechanism for interviral RNA-DNA recombination has yet been identified, and only scant evidence exists that genetic exchange occurs between such distinct virus lineages. Reviewers This article was reviewed by EK, MK (nominated by PF) and AM. For the full reviews, please go to the Reviewers' comments section. PMID:22515485

  8. Evolution and dynamics of megaplasmids with genome sizes larger than 100 kb in the Bacillus cereus group

    PubMed Central

    2013-01-01

    Background Plasmids play a crucial role in the evolution of bacterial genomes by mediating horizontal gene transfer. However, the origin and evolution of most plasmids remains unclear, especially for megaplasmids. Strains of the Bacillus cereus group contain up to 13 plasmids with genome sizes ranging from 2 kb to 600 kb, and thus can be used to study plasmid dynamics and evolution. Results This work studied the origin and evolution of 31 B. cereus group megaplasmids (>100 kb) focusing on the most conserved regions on plasmids, minireplicons. Sixty-five putative minireplicons were identified and classified to six types on the basis of proteins that are essential for replication. Twenty-nine of the 31 megaplasmids contained two or more minireplicons. Phylogenetic analysis of the protein sequences showed that different minireplicons on the same megaplasmid have different evolutionary histories. Therefore, we speculated that these megaplasmids are the results of fusion of smaller plasmids. All plasmids of a bacterial strain must be compatible. In megaplasmids of the B. cereus group, individual minireplicons of different megaplasmids in the same strain belong to different types or subtypes. Thus, the subtypes of each minireplicon they contain may determine the incompatibilities of megaplasmids. A broader analysis of all 1285 bacterial plasmids with putative known minireplicons whose complete genome sequences were available from GenBank revealed that 34% (443 plasmids) of the plasmids have two or more minireplicons. This indicates that plasmid fusion events are general among bacterial plasmids. Conclusions Megaplasmids of B. cereus group are fusion of smaller plasmids, and the fusion of plasmids likely occurs frequently in the B. cereus group and in other bacterial taxa. Plasmid fusion may be one of the major mechanisms for formation of novel megaplasmids in the evolution of bacteria. PMID:24295128

  9. Including different groups of genotyped females for genomic prediction in a Nordic Jersey population.

    PubMed

    Gao, H; Madsen, P; Nielsen, U S; Aamand, G P; Su, G; Byskov, K; Jensen, J

    2015-12-01

    Including genotyped females in a reference population (RP) is an obvious way to increase the RP in genomic selection, especially for dairy breeds of limited population size. However, the incorporation of these females must be conducted cautiously because of the potential preferential treatment of the genotyped cows and lower reliabilities of phenotypes compared with the proven pseudo-phenotypes of bulls. Breeding organizations in Denmark, Finland, and Sweden have implemented a female-genotyping project with the possibility of genotyping entire herds using the low-density (LD) chip. In the present study, 5 scenarios for building an RP were investigated in the Nordic Jersey population: (1) bulls only, (2) bulls with females from the LD project, (3) bulls with females from the LD project plus non-LD project females genotyped before their first calving, (4) bulls with females from the LD project plus non-LD project females genotyped after their first calving, and (5) bulls with all genotyped females. The genomically enhanced breeding value (GEBV) was predicted for 8 traits in the Nordic total merit index through a genomic BLUP model using deregressed proof (DRP) as the response variable in all scenarios. In addition, (daughter) yield deviation and raw phenotypic data were studied as response variables for comparison with the DRP, using stature as a model trait. The validation population was formed using a cut-off birth year of 2005 based on the genotyped Nordic Jersey bulls with DRP. The average increment in reliability of the GEBV across the 8 traits investigated was 1.9 to 4.5 percentage points compared with using only bulls in the RP (scenario 1). The addition of all the genotyped females to the RP resulted in the highest gain in reliability (scenario 5), followed by scenario 3, scenario 2, and scenario 4. All scenarios led to inflated GEBV because the regression coefficients are less than 1. However, scenario 2 and scenario 3 led to less bias of genomic predictions

  10. Genome sequence and comparative microarray analysis of serotype M18 group A Streptococcus strains associated with acute rheumatic fever outbreaks

    PubMed Central

    Smoot, James C.; Barbian, Kent D.; Van Gompel, Jamie J.; Smoot, Laura M.; Chaussee, Michael S.; Sylva, Gail L.; Sturdevant, Daniel E.; Ricklefs, Stacy M.; Porcella, Stephen F.; Parkins, Larye D.; Beres, Stephen B.; Campbell, David S.; Smith, Todd M.; Zhang, Qing; Kapur, Vivek; Daly, Judy A.; Veasy, L. George; Musser, James M.

    2002-01-01

    Acute rheumatic fever (ARF), a sequelae of group A Streptococcus (GAS) infection, is the most common cause of preventable childhood heart disease worldwide. The molecular basis of ARF and the subsequent rheumatic heart disease are poorly understood. Serotype M18 GAS strains have been associated for decades with ARF outbreaks in the U.S. As a first step toward gaining new insight into ARF pathogenesis, we sequenced the genome of strain MGAS8232, a serotype M18 organism isolated from a patient with ARF. The genome is a circular chromosome of 1,895,017 bp, and it shares 1.7 Mb of closely related genetic material with strain SF370 (a sequenced serotype M1 strain). Strain MGAS8232 has 178 ORFs absent in SF370. Phages, phage-like elements, and insertion sequences are the major sources of variation between the genomes. The genomes of strain MGAS8232 and SF370 encode many of the same proven or putative virulence factors. Importantly, strain MGAS8232 has genes encoding many additional secreted proteins involved in human–GAS interactions, including streptococcal pyrogenic exotoxin A (scarlet fever toxin) and two uncharacterized pyrogenic exotoxin homologues, all phage-associated. DNA microarray analysis of 36 serotype M18 strains from diverse localities showed that most regions of variation were phages or phage-like elements. Two epidemics of ARF occurring 12 years apart in Salt Lake City, UT, were caused by serotype M18 strains that were genetically identical, or nearly so. Our analysis provides a critical foundation for accelerated research into ARF pathogenesis and a molecular framework to study the plasticity of GAS genomes. PMID:11917108

  11. Effect of cow reference group on validation reliability of genomic evaluation.

    PubMed

    Koivula, M; Strandén, I; Aamand, G P; Mäntysaari, E A

    2016-06-01

    We studied the effect of including genomic data for cows in the reference population of single-step evaluations. Deregressed individual cow genetic evaluations (DRP) from milk production evaluations of Nordic Red Dairy cattle were used to estimate the single-step breeding values. Validation reliability and bias of the evaluations were calculated with four data sets including different amount of DRP record information from genotyped cows in the reference population. The gain in reliability was from 2% to 4% units for the production traits, depending on the used DRP data and the amount of genomic data. Moreover, inclusion of genotyped bull dams and their genotyped daughters seemed to create some bias in the single-step evaluation. Still, genotyping cows and their inclusion in the reference population is advantageous and should be encouraged. PMID:27075712

  12. Social evolution. Genomic signatures of evolutionary transitions from solitary to group living.

    PubMed

    Kapheim, Karen M; Pan, Hailin; Li, Cai; Salzberg, Steven L; Puiu, Daniela; Magoc, Tanja; Robertson, Hugh M; Hudson, Matthew E; Venkat, Aarti; Fischman, Brielle J; Hernandez, Alvaro; Yandell, Mark; Ence, Daniel; Holt, Carson; Yocum, George D; Kemp, William P; Bosch, Jordi; Waterhouse, Robert M; Zdobnov, Evgeny M; Stolle, Eckart; Kraus, F Bernhard; Helbing, Sophie; Moritz, Robin F A; Glastad, Karl M; Hunt, Brendan G; Goodisman, Michael A D; Hauser, Frank; Grimmelikhuijzen, Cornelis J P; Pinheiro, Daniel Guariz; Nunes, Francis Morais Franco; Soares, Michelle Prioli Miranda; Tanaka, Érica Donato; Simões, Zilá Luz Paulino; Hartfelder, Klaus; Evans, Jay D; Barribeau, Seth M; Johnson, Reed M; Massey, Jonathan H; Southey, Bruce R; Hasselmann, Martin; Hamacher, Daniel; Biewer, Matthias; Kent, Clement F; Zayed, Amro; Blatti, Charles; Sinha, Saurabh; Johnston, J Spencer; Hanrahan, Shawn J; Kocher, Sarah D; Wang, Jun; Robinson, Gene E; Zhang, Guojie

    2015-06-01

    The evolution of eusociality is one of the major transitions in evolution, but the underlying genomic changes are unknown. We compared the genomes of 10 bee species that vary in social complexity, representing multiple independent transitions in social evolution, and report three major findings. First, many important genes show evidence of neutral evolution as a consequence of relaxed selection with increasing social complexity. Second, there is no single road map to eusociality; independent evolutionary transitions in sociality have independent genetic underpinnings. Third, though clearly independent in detail, these transitions do have similar general features, including an increase in constrained protein evolution accompanied by increases in the potential for gene regulation and decreases in diversity and abundance of transposable elements. Eusociality may arise through different mechanisms each time, but would likely always involve an increase in the complexity of gene networks. PMID:25977371

  13. Complete Genome Sequence and Comparative Genomic Analysis of Mycobacterium massiliense JCM 15300 in the Mycobacterium abscessus Group Reveal a Conserved Genomic Island MmGI-1 Related to Putative Lipid Metabolism

    PubMed Central

    Nakanaga, Kazue; Nakata, Noboru; Kazumi, Yuko; Maeda, Shinji; Makino, Masahiko; Hoshino, Yoshihiko; Kuroda, Makoto

    2014-01-01

    Mycobacterium abscessus group subsp., such as M. massiliense, M. abscessus sensu stricto and M. bolletii, are an environmental organism found in soil, water and other ecological niches, and have been isolated from respiratory tract infection, skin and soft tissue infection, postoperative infection of cosmetic surgery. To determine the unique genetic feature of M. massiliense, we sequenced the complete genome of M. massiliense type strain JCM 15300 (corresponding to CCUG 48898). Comparative genomic analysis was performed among Mycobacterium spp. and among M. abscessus group subspp., showing that additional ß-oxidation-related genes and, notably, the mammalian cell entry (mce) operon were located on a genomic island, M. massiliense Genomic Island 1 (MmGI-1), in M. massiliense. In addition, putative anaerobic respiration system-related genes and additional mycolic acid cyclopropane synthetase-related genes were found uniquely in M. massiliense. Japanese isolates of M. massiliense also frequently possess the MmGI-1 (14/44, approximately 32%) and three unique conserved regions (26/44; approximately 60%, 34/44; approximately 77% and 40/44; approximately 91%), as well as isolates of other countries (Malaysia, France, United Kingdom and United States). The well-conserved genomic island MmGI-1 may play an important role in high growth potential with additional lipid metabolism, extra factors for survival in the environment or synthesis of complex membrane-associated lipids. ORFs on MmGI-1 showed similarities to ORFs of phylogenetically distant M. avium complex (MAC), suggesting that horizontal gene transfer or genetic recombination events might have occurred within MmGI-1 among M. massiliense and MAC. PMID:25503461

  14. Genome Sequence of Enterobacter cloacae Strain SENG-6, a Bacterium Producing Histo-Blood Group Antigen-Like Substances That Can Bind with Human Noroviruses

    PubMed Central

    Amarasiri, Mohan; Hashiba, Satoshi; Yang, Peiyi; Okabe, Satoshi

    2016-01-01

    Enterobacter sp. strain SENG-6, isolated from healthy human feces, produces histo-blood group antigen (HBGA)-like substances that can bind with human noroviruses. Based on the genome sequence analysis, strain SENG-6 belongs to the species Enterobacter cloacae. The genome sequence of this strain should help identify genes associated with the production of HBGA-like substances. PMID:27563051

  15. The Genomes, Proteomes, and Structures of Three Novel Phages That Infect the Bacillus cereus Group and Carry Putative Virulence Factors

    PubMed Central

    Belnap, David M.; Jensen, Jordan D.; Mathis, Andrew D.; Prince, John T.; Merrill, Bryan D.; Burnett, Sandra H.; Breakwell, Donald P.

    2014-01-01

    ABSTRACT This article reports the results of studying three novel bacteriophages, JL, Shanette, and Basilisk, which infect the pathogen Bacillus cereus and carry genes that may contribute to its pathogenesis. We analyzed host range and superinfection ability, mapped their genomes, and characterized phage structure by mass spectrometry and transmission electron microscopy (TEM). The JL and Shanette genomes were 96% similar and contained 217 open reading frames (ORFs) and 220 ORFs, respectively, while Basilisk has an unrelated genome containing 138 ORFs. Mass spectrometry revealed 23 phage particle proteins for JL and 15 for Basilisk, while only 11 and 4, respectively, were predicted to be present by sequence analysis. Structural protein homology to well-characterized phages suggested that JL and Shanette were members of the family Myoviridae, which was confirmed by TEM. The third phage, Basilisk, was similar only to uncharacterized phages and is an unrelated siphovirus. Cryogenic electron microscopy of this novel phage revealed a T=9 icosahedral capsid structure with the major capsid protein (MCP) likely having the same fold as bacteriophage HK97 MCP despite the lack of sequence similarity. Several putative virulence factors were encoded by these phage genomes, including TerC and TerD involved in tellurium resistance. Host range analysis of all three phages supports genetic transfer of such factors within the B. cereus group, including B. cereus, B. anthracis, and B. thuringiensis. This study provides a basis for understanding these three phages and other related phages as well as their contributions to the pathogenicity of B. cereus group bacteria. IMPORTANCE The Bacillus cereus group of bacteria contains several human and plant pathogens, including B. cereus, B. anthracis, and B. thuringiensis. Phages are intimately linked to the evolution of their bacterial hosts and often provide virulence factors, making the study of B. cereus phages important to understanding

  16. Random forest estimation of genomic breeding values for disease susceptibility over different disease incidences and genomic architectures in simulated cow calibration groups.

    PubMed

    Naderi, S; Yin, T; König, S

    2016-09-01

    A simulation study was conducted to investigate the performance of random forest (RF) and genomic BLUP (GBLUP) for genomic predictions of binary disease traits based on cow calibration groups. Training and testing sets were modified in different scenarios according to disease incidence, the quantitative-genetic background of the trait (h(2)=0.30 and h(2)=0.10), and the genomic architecture [725 quantitative trait loci (QTL) and 290 QTL, populations with high and low levels of linkage disequilibrium (LD)]. For all scenarios, 10,005 SNP (depicting a low-density 10K SNP chip) and 50,025 SNP (depicting a 50K SNP chip) were evenly spaced along 29 chromosomes. Training and testing sets included 20,000 cows (4,000 sick, 16,000 healthy, disease incidence 20%) from the last 2 generations. Initially, 4,000 sick cows were assigned to the testing set, and the remaining 16,000 healthy cows represented the training set. In the ongoing allocation schemes, the number of sick cows in the training set increased stepwise by moving 10% of the sick animals from the testing set to the training set, and vice versa. The size of the training and testing sets was kept constant. Evaluation criteria for both GBLUP and RF were the correlations between genomic breeding values and true breeding values (prediction accuracy), and the area under the receiving operating characteristic curve (AUROC). Prediction accuracy and AUROC increased for both methods and all scenarios as increasing percentages of sick cows were allocated to the training set. Highest prediction accuracies were observed for disease incidences in training sets that reflected the population disease incidence of 0.20. For this allocation scheme, the largest prediction accuracies of 0.53 for RF and of 0.51 for GBLUP, and the largest AUROC of 0.66 for RF and of 0.64 for GBLUP, were achieved using 50,025 SNP, a heritability of 0.30, and 725 QTL. Heritability decreases from 0.30 to 0.10 and QTL reduction from 725 to 290 were associated

  17. Variable Tick Protein in Two Genomic Groups of the Relapsing Fever Spirochete Borrelia hermsii in Western North America

    PubMed Central

    Porcella, Stephen F.; Raffel, Sandra J.; Anderson, Donald E.; Gilk, Stacey D.; Bono, James L.; Schrumpf, Merry E.; Schwan, Tom G.

    2005-01-01

    Borrelia hermsii is the primary cause of tick-borne relapsing fever in North America. When its tick vector, Ornithodoros hermsi, acquires these spirochetes from the blood of an infected mammal, the bacteria switch their outer surface from one of many bloodstream variable major proteins (Vmps) to a unique protein, Vtp (Vsp33). Vtp may be critical for successful tick transmission of B. hermsii; however, the gene encoding this protein has been described previously in only one isolate. Here we identified and sequenced the vtp gene in 31 isolates of B. hermsii collected over 40 years from localities throughout much of its known geographic distribution. Seven major Vtp types were found. Little or no sequence variation existed within types, but between them significant variation was observed, similar to the pattern of diversity described for the outer surface protein C (OspC) gene in Lyme disease spirochetes. The pattern of sequence relatedness among the Vtp types was incongruent in two branches compared to two genomic groups identified among the isolates by multilocus sequence typing of the 16S rRNA, flaB, gyrB, and glpQ genes. Therefore, both horizontal transfer and recombination within and between the two genomic groups were responsible for some of the variation observed in the vtp gene. O. hermsi ticks were capable of transmitting spirochetes in the newly identified genomic group. Therefore, given the longevity of the tick vector and persistent infection of spirochetes in ticks, these arthropods rather than mammals may be the likely host where the exchange of spirochetal DNA occurs. PMID:16177341

  18. Admixture patterns and genetic differentiation in negrito groups from West Malaysia estimated from genome-wide SNP data.

    PubMed

    Jinam, Timothy A; Phipps, Maude E; Saitou, Naruya

    2013-01-01

    Southeast Asia houses various culturally and linguistically diverse ethnic groups. In Malaysia, where the Malay, Chinese, and Indian ethnic groups form the majority, there exist minority groups such as the "negritos" who are believed to be descendants of the earliest settlers of Southeast Asia. Here we report patterns of genetic substructure and admixture in two Malaysian negrito populations (Jehai and Kensiu), using ~50,000 genome-wide single-nucleotide polymorphism (SNP) data. We found traces of recent admixture in both the negrito populations, particularly in the Jehai, with the Malay through principal component analysis and STRUCTURE analysis software, which suggested that the admixture was as recent as one generation ago. We also identified significantly differentiated nonsynonymous SNPs and haplotype blocks related to intracellular transport, metabolic processes, and detection of stimulus. These results highlight the different levels of admixture experienced by the two Malaysian negritos. Delineating admixture and differentiated genomic regions should be of importance in designing and interpretation of molecular anthropology and disease association studies. PMID:24297225

  19. Genomic Analysis of the Emergence and Rapid Global Dissemination of the Clonal Group 258 Klebsiella pneumoniae Pandemic

    PubMed Central

    Driebe, Elizabeth M.; MacCannell, Duncan R.; Roe, Chandler; Lemmer, Darrin; de Man, Tom; Rasheed, J. Kamile; Engelthaler, David M.; Keim, Paul; Limbago, Brandi M.

    2015-01-01

    Multidrug-resistant Klebsiella pneumoniae producing the KPC carbapenemase have rapidly spread throughout the world, causing severe healthcare-associated infections with limited antimicrobial treatment options. Dissemination of KPC-producing K. pneumoniae is largely attributed to expansion of a single dominant strain, ST258. In this study, we explore phylogenetic relationships and evolution within ST258 and its clonal group, CG258, using whole genome sequence analysis of 167 isolates from 20 countries collected over 17 years. Our results show a common ST258 ancestor emerged from its diverse parental clonal group around 1995 and likely acquired blaKPC prior to dissemination. Over the past two decades, ST258 has remained highly clonal despite diversity in accessory elements and divergence in the capsule polysaccharide synthesis locus. Apart from the large recombination event that gave rise to ST258, few mutations set it apart from its clonal group. However, one mutation occurs in a global transcription regulator. Characterization of outer membrane protein sequences revealed a profile in ST258 that includes a truncated OmpK35 and modified OmpK37. Our work illuminates potential genomic contributors to the pathogenic success of ST258, helps us better understand the global dissemination of this strain, and identifies genetic markers unique to ST258. PMID:26196384

  20. Genomic study of the Ket: a Paleo-Eskimo-related ethnic group with significant ancient North Eurasian ancestry

    PubMed Central

    Flegontov, Pavel; Changmai, Piya; Zidkova, Anastassiya; Logacheva, Maria D.; Altınışık, N. Ezgi; Flegontova, Olga; Gelfand, Mikhail S.; Gerasimov, Evgeny S.; Khrameeva, Ekaterina E.; Konovalova, Olga P.; Neretina, Tatiana; Nikolsky, Yuri V.; Starostin, George; Stepanova, Vita V.; Travinsky, Igor V.; Tříska, Martin; Tříska, Petr; Tatarinova, Tatiana V.

    2016-01-01

    The Kets, an ethnic group in the Yenisei River basin, Russia, are considered the last nomadic hunter-gatherers of Siberia, and Ket language has no transparent affiliation with any language family. We investigated connections between the Kets and Siberian and North American populations, with emphasis on the Mal’ta and Paleo-Eskimo ancient genomes, using original data from 46 unrelated samples of Kets and 42 samples of their neighboring ethnic groups (Uralic-speaking Nganasans, Enets, and Selkups). We genotyped over 130,000 autosomal SNPs, identified mitochondrial and Y-chromosomal haplogroups, and performed high-coverage genome sequencing of two Ket individuals. We established that Nganasans, Kets, Selkups, and Yukaghirs form a cluster of populations most closely related to Paleo-Eskimos in Siberia (not considering indigenous populations of Chukotka and Kamchatka). Kets are closely related to modern Selkups and to some Bronze and Iron Age populations of the Altai region, with all these groups sharing a high degree of Mal’ta ancestry. Implications of these findings for the linguistic hypothesis uniting Ket and Na-Dene languages into a language macrofamily are discussed. PMID:26865217

  1. Genomic study of the Ket: a Paleo-Eskimo-related ethnic group with significant ancient North Eurasian ancestry.

    PubMed

    Flegontov, Pavel; Changmai, Piya; Zidkova, Anastassiya; Logacheva, Maria D; Altınışık, N Ezgi; Flegontova, Olga; Gelfand, Mikhail S; Gerasimov, Evgeny S; Khrameeva, Ekaterina E; Konovalova, Olga P; Neretina, Tatiana; Nikolsky, Yuri V; Starostin, George; Stepanova, Vita V; Travinsky, Igor V; Tříska, Martin; Tříska, Petr; Tatarinova, Tatiana V

    2016-01-01

    The Kets, an ethnic group in the Yenisei River basin, Russia, are considered the last nomadic hunter-gatherers of Siberia, and Ket language has no transparent affiliation with any language family. We investigated connections between the Kets and Siberian and North American populations, with emphasis on the Mal'ta and Paleo-Eskimo ancient genomes, using original data from 46 unrelated samples of Kets and 42 samples of their neighboring ethnic groups (Uralic-speaking Nganasans, Enets, and Selkups). We genotyped over 130,000 autosomal SNPs, identified mitochondrial and Y-chromosomal haplogroups, and performed high-coverage genome sequencing of two Ket individuals. We established that Nganasans, Kets, Selkups, and Yukaghirs form a cluster of populations most closely related to Paleo-Eskimos in Siberia (not considering indigenous populations of Chukotka and Kamchatka). Kets are closely related to modern Selkups and to some Bronze and Iron Age populations of the Altai region, with all these groups sharing a high degree of Mal'ta ancestry. Implications of these findings for the linguistic hypothesis uniting Ket and Na-Dene languages into a language macrofamily are discussed. PMID:26865217

  2. The mitochondrial genome of the arbuscular mycorrhizal fungus Gigaspora margarita reveals two unsuspected trans-splicing events of group I introns.

    PubMed

    Pelin, Adrian; Pombert, Jean-François; Salvioli, Alessandra; Bonen, Linda; Bonfante, Paola; Corradi, Nicolas

    2012-05-01

    • Arbuscular mycorrhizal fungi (AMF) are ubiquitous organisms that benefit ecosystems through the establishment of an association with the roots of most plants: the mycorrhizal symbiosis. Despite their ecological importance, however, these fungi have been poorly studied at the genome level. • In this study, total DNA from the AMF Gigaspora margarita was subjected to a combination of 454 and Illumina sequencing, and the resulting reads were used to assemble its mitochondrial genome de novo. This genome was annotated and compared with those of other relatives to better comprehend the evolution of the AMF lineage. • The mitochondrial genome of G. margarita is unique in many ways, exhibiting a large size (97 kbp) and elevated GC content (45%). This genome also harbors molecular events that were previously unknown to occur in fungal mitochondrial genomes, including trans-splicing of group I introns from two different genes coding for the first subunit of the cytochrome oxidase and for the small subunit of the rRNA. • This study reports the second published genome from an AMF organelle, resulting in relevant DNA sequence information from this poorly studied fungal group, and providing new insights into the frequency, origin and evolution of trans-spliced group I introns found across the mitochondrial genomes of distantly related organisms. PMID:22320438

  3. Genomic profiling of lower-grade gliomas uncovers cohesive disease groups: implications for diagnosis and treatment.

    PubMed

    Zhang, Chang-Ming; Brat, Daniel J

    2016-01-01

    Lower-grade gliomas (including low- and intermediate-grade gliomas, World Health Organization grades II and III) are diffusely infiltrative neoplasms that arise most often in the cerebral hemispheres of adults and have traditionally been classified based on their presumed histogenesis as astrocytomas, oligodendrogliomas, or oligoastrocytomas. Although the histopathologic classification of lower-grade glioma has been the accepted standard for nearly a century, it suffers from high intra- and inter-observer variability and does not adequately predict clinical outcomes. Based on integrated analysis of multiplatform genomic data from The Cancer Genome Atlas, lower-grade gliomas have been found to segregate into three cohesive, clinically relevant molecular classes. Molecular classes were closely aligned with the status of isocitrate dehydrogenase (IDH) mutations, tumor protein 53 mutations and the co-deletion of chromosome arms 1p and 19q, but were not closely aligned with histologic classes. These findings emphasize the potential for improved definition of clinically relevant disease subsets using integrated molecular approaches and highlight the importance of biomarkers for brain tumor classification. PMID:26758195

  4. Sequence variants from whole genome sequencing a large group of Icelanders.

    PubMed

    Gudbjartsson, Daniel F; Sulem, Patrick; Helgason, Hannes; Gylfason, Arnaldur; Gudjonsson, Sigurjon A; Zink, Florian; Oddson, Asmundur; Magnusson, Gisli; Halldorsson, Bjarni V; Hjartarson, Eirikur; Sigurdsson, Gunnar Th; Kong, Augustine; Helgason, Agnar; Masson, Gisli; Magnusson, Olafur Th; Thorsteinsdottir, Unnur; Stefansson, Kari

    2015-01-01

    We have accumulated considerable data on the genetic makeup of the Icelandic population by sequencing the whole genomes of 2,636 Icelanders to depth of at least 10X and by chip genotyping 101,584 more. The sequencing was done with Illumina technology. The median sequencing depth was 20X and 909 individuals were sequenced to a depth of at least 30X. We found 20 million single nucleotide polymorphisms (SNPs) and 1.5 million insertions/deletions (indels) that passed stringent quality control. Almost all the common SNPs (derived allele frequency (DAF) over 2%) that we identified in Iceland have been observed by either dbSNP (build 137) or the Exome Sequencing Project (ESP) while only 60 and 20% of rare (DAF<0.5%) SNPs and indels in coding regions, the most heavily studied parts of the genome, have been observed in the public databases. Features of our variant data, such as the transition/transversion ratio and the length distribution of indels, are similar to published reports. PMID:25977816

  5. Implications of Genome-Based Discrimination between Clostridium botulinum Group I and Clostridium sporogenes Strains for Bacterial Taxonomy

    PubMed Central

    Weigand, Michael R.; Pena-Gonzalez, Angela; Shirey, Timothy B.; Broeker, Robin G.; Ishaq, Maliha K.; Konstantinidis, Konstantinos T.

    2015-01-01

    Taxonomic classification of Clostridium botulinum is based on the production of botulinum neurotoxin (BoNT), while closely related, nontoxic organisms are classified as Clostridium sporogenes. However, this taxonomic organization does not accurately mirror phylogenetic relationships between these species. A phylogenetic reconstruction using 2,016 orthologous genes shared among strains of C. botulinum group I and C. sporogenes clearly separated these two species into discrete clades which showed ∼93% average nucleotide identity (ANI) between them. Clustering of strains based on the presence of variable orthologs revealed 143 C. sporogenes clade-specific genetic signatures, a subset of which were further evaluated for their ability to correctly classify a panel of presumptive C. sporogenes strains by PCR. Genome sequencing of several C. sporogenes strains lacking these signatures confirmed that they clustered with C. botulinum strains in a core genome phylogenetic tree. Our analysis also identified C. botulinum strains that contained C. sporogenes clade-specific signatures and phylogenetically clustered with C. sporogenes strains. The genome sequences of two bont/B2-containing strains belonging to the C. sporogenes clade contained regions with similarity to a bont-bearing plasmid (pCLD), while two different strains belonging to the C. botulinum clade carried bont/B2 on the chromosome. These results indicate that bont/B2 was likely acquired by C. sporogenes strains through horizontal gene transfer. The genome-based classification of these species used to identify candidate genes for the development of rapid assays for molecular identification may be applicable to additional bacterial species that are challenging with respect to their classification. PMID:26048939

  6. Genome sequence of Shimia str. SK013, a representative of the Roseobacter group isolated from marine sediment

    DOE PAGESBeta

    Kanukollu, Saranya; Voget, Sonja; Pohlner, Marion; Vandieken, Verona; Petersen, Jörn; Kyrpides, Nikos C.; Woyke, Tanja; Shapiro, Nicole; Göker, Markus; Klenk, Hans -Peter; et al

    2016-03-12

    Shimia strain SK013 is an aerobic, Gram-negative, rod shaped alphaproteobacterium affiliated with the Roseobacter group within the family Rhodobacteraceae. The strain was isolated from surface sediment (0-1 cm) of the Skagerrak at 114 m below sea level. The 4,049,808 bp genome of Shimia str. SK013 comprises 3,981 protein-coding genes and 47 RNA genes. It contains one chromosome and no extrachromosomal elements. The genome analysis revealed the presence of genes for a dimethylsulfoniopropionate lyase, demethylase and the trimethylamine methyltransferase (mttB) as well as genes for nitrate, nitrite and dimethyl sulfoxide reduction. This indicates that Shimia str. SK013 is able to switchmore » from aerobic to anaerobic metabolism and thus is capable of aerobic and anaerobic sulfur cycling at the seafloor. Among the ability to convert other sulfur compounds it has the genetic capacity to produce climatically active dimethyl sulfide. Growth on glutamate as a sole carbon source results in formation of cell-connecting filaments, a putative phenotypic adaptation of the surface-associated strain to the environmental conditions at the seafloor. Genome analysis revealed the presence of a flagellum (fla1) and a type IV pilus biogenesis, which is speculated to be a prerequisite for biofilm formation. This is also related to genes responsible for signalling such as N-acyl homoserine lactones, as well as quip-genes responsible for quorum quenching and antibiotic biosynthesis. Pairwise similarities of 16S rRNA genes (98.56 % sequence similarity to the next relative S. haliotis) and the in silico DNA-DNA hybridization (21.20 % sequence similarity to S. haliotis) indicated Shimia str. SK013 to be considered as a new species. In conclusion, the genome analysis of Shimia str. SK013 offered first insights into specific physiological and phenotypic adaptation mechanisms of Roseobacter-affiliated bacteria to the benthic environment.« less

  7. Genomic Characterisation of Three Mapputta Group Viruses, a Serogroup of Australian and Papua New Guinean Bunyaviruses Associated with Human Disease

    PubMed Central

    Gauci, Penelope J.; McAllister, Jane; Mitchell, Ian R.; Boyle, David B.; Bulach, Dieter M.; Weir, Richard P.; Melville, Lorna F.; Gubala, Aneta J.

    2015-01-01

    The Mapputta serogroup tentatively contains the mosquito-associated viruses Mapputta, Maprik, Trubanaman and Gan Gan. Interestingly, this serogroup has previously been associated with an acute epidemic polyarthritis-like illness in humans; however, there has been no ensuing genetic characterisation. Here we report the complete genome sequences of Mapputta and Maprik viruses, and a new Mapputta group candidate, Buffalo Creek virus, previously isolated from mosquitoes and detected by serology in a hospitalised patient. Phylogenetic analyses indicate that the group is one of the earliest diverged groups within the genus Orthobunyavirus of the family Bunyaviridae. Analyses show that these three viruses are related to the recently sequenced Australian bunyaviruses from mosquitoes, Salt Ash and Murrumbidgee. A notable feature of the Mapputta group viruses is the absence of the NSs (non-structural) ORF commonly found on the S segment of other orthobunyaviruses. Viruses of the Mapputta group have been isolated from geographically diverse regions ranging from tropical Papua New Guinea to the semi-arid climate of south-eastern Australia. The relevance of this group to human health in the region merits further investigation. PMID:25588016

  8. Complete Genome Sequence of Bacillus cereus Group Phage TsarBomba

    PubMed Central

    Erill, Ivan

    2015-01-01

    The Bacillus cereus group bacteriophage TsarBomba, a double-stranded DNA Myoviridae, was isolated from soil collected in Saratov, Russia. TsarBomba was found to be similar to Bacillus phages BCP78 and BCU4, and to have a wide host range among Bacillus cereus group species. PMID:26472830

  9. Genomic identification of WRKY transcription factors in carrot (Daucus carota) and analysis of evolution and homologous groups for plants.

    PubMed

    Li, Meng-Yao; Xu, Zhi-Sheng; Tian, Chang; Huang, Ying; Wang, Feng; Xiong, Ai-Sheng

    2016-01-01

    WRKY transcription factors belong to one of the largest transcription factor families. These factors possess functions in plant growth and development, signal transduction, and stress response. Here, we identified 95 DcWRKY genes in carrot based on the carrot genomic and transcriptomic data, and divided them into three groups. Phylogenetic analysis of WRKY proteins from carrot and Arabidopsis divided these proteins into seven subgroups. To elucidate the evolution and distribution of WRKY transcription factors in different species, we constructed a schematic of the phylogenetic tree and compared the WRKY family factors among 22 species, which including plants, slime mold and protozoan. An in-depth study was performed to clarify the homologous factor groups of nine divergent taxa in lower and higher plants. Based on the orthologous factors between carrot and Arabidopsis, 38 DcWRKY proteins were calculated to interact with other proteins in the carrot genome. Yeast two-hybrid assay showed that DcWRKY20 can interact with DcMAPK1 and DcMAPK4. The expression patterns of the selected DcWRKY genes based on transcriptome data and qRT-PCR suggested that those selected DcWRKY genes are involved in root development, biotic and abiotic stress response. This comprehensive analysis provides a basis for investigating the evolution and function of WRKY genes. PMID:26975939

  10. The complete mitochondrial genome of the Keeled box turtle Pyxidea mouhotii and phylogenetic analysis of major turtle groups.

    PubMed

    Zhang, Li; Nie, Liuwang; Cao, Chenghe; Zhan, Ying

    2008-01-01

    The complete mitochondrial genome (16,837 bp) from the Keeled box turtle (Pyxidea mouhotii) was determined. The genome content, gene order, and base composition conformed to the consensus vertebrate type mtDNA. However, a remarkable feature was found in this molecule: a large number of (ATTATATC) (n) direct tandem repeats followed by (TA) (n) microsatellite at the 3' end of the control region (D-loop), which might be useful as molecular markers for studying population genetics and helpful for species identification and conservation. Besides, to review phylogenetic relationships among major turtle lineages, maximum-likelihood (ML) and Bayesian (BI) analyses were conducted based on concatenated sequences of 13 protein-coding genes from 16 taxa. The resultant ML and BI analyses showed homological topologies, which only differed on the exact placement of Platysternon. Nevertheless, the results strongly supported that 1) Pyxidea mouhotii and Cuora aurocapitata formed a monophyletic clade, whereas Cyclemys atripons was not closer to the Pyxidea-Cuora than to Chinemys reevesii, suggesting that Cyclemys and the Cuora group (containing Pyxidea) may have originated from two ancestors; 2) the Geoemydidae with Testudinidae was a sister group rather than with the Emydidae. PMID:18222407

  11. Genomic identification of WRKY transcription factors in carrot (Daucus carota) and analysis of evolution and homologous groups for plants

    PubMed Central

    Li, Meng-Yao; Xu, Zhi-Sheng; Tian, Chang; Huang, Ying; Wang, Feng; Xiong, Ai-Sheng

    2016-01-01

    WRKY transcription factors belong to one of the largest transcription factor families. These factors possess functions in plant growth and development, signal transduction, and stress response. Here, we identified 95 DcWRKY genes in carrot based on the carrot genomic and transcriptomic data, and divided them into three groups. Phylogenetic analysis of WRKY proteins from carrot and Arabidopsis divided these proteins into seven subgroups. To elucidate the evolution and distribution of WRKY transcription factors in different species, we constructed a schematic of the phylogenetic tree and compared the WRKY family factors among 22 species, which including plants, slime mold and protozoan. An in-depth study was performed to clarify the homologous factor groups of nine divergent taxa in lower and higher plants. Based on the orthologous factors between carrot and Arabidopsis, 38 DcWRKY proteins were calculated to interact with other proteins in the carrot genome. Yeast two-hybrid assay showed that DcWRKY20 can interact with DcMAPK1 and DcMAPK4. The expression patterns of the selected DcWRKY genes based on transcriptome data and qRT-PCR suggested that those selected DcWRKY genes are involved in root development, biotic and abiotic stress response. This comprehensive analysis provides a basis for investigating the evolution and function of WRKY genes. PMID:26975939

  12. Comparative genomics of a plant-parasitic nematode endosymbiont suggest a role in nutritional symbiosis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Bacterial mutualists can increase the biochemical capacity of animals. Highly co-evolved nutritional mutualists do this by synthesizing nutrients missing from the host's diet. Genomics tools have recently advanced the study of these partnerships. Here we examined the endosymbiont Xiphinematobacter (...

  13. Genomic identification of group A bZIP transcription factors and their responses to abiotic stress in carrot.

    PubMed

    Que, F; Wang, G L; Huang, Y; Xu, Z S; Wang, F; Xiong, A S

    2015-01-01

    The basic-region/leucine-zipper (bZIP) family is one of the major transcription factor (TF) families associated with responses to abiotic stresses. Many members of group A in this family have been extensively examined and are reported to perform significant functions in ABA signaling as well as in responses to abiotic stresses. In this study, 10 bZIP factors in carrot were classified into group A based on their DNA-binding domains. The cis-acting regulatory elements and folding states of these 10 factors were analyzed. Evolutionary analysis of the group A members suggested their importance during the course of evolution in plants. In addition, cis-acting elements and the folding state of proteins were important for DNA binding and could affect gene expression. Quantitative RT-PCR was conducted to investigate the stress response of 10 genes encoding the group A factors. Six genes showed responses to abiotic stresses, while four genes showed other special phenomenon. The current analysis on group A bZIP family TFs in carrot is the first to investigate the TFs of Apiaceae via genome analysis. These results provide new information for future studies on carrot. PMID:26535641

  14. Insights from the Genome Sequence of Acidovorax citrulli M6, a Group I Strain of the Causal Agent of Bacterial Fruit Blotch of Cucurbits

    PubMed Central

    Eckshtain-Levi, Noam; Shkedy, Dafna; Gershovits, Michael; Da Silva, Gustavo M.; Tamir-Ariel, Dafna; Walcott, Ron; Pupko, Tal; Burdman, Saul

    2016-01-01

    Acidovorax citrulli is a seedborne bacterium that causes bacterial fruit blotch of cucurbit plants including watermelon and melon. A. citrulli strains can be divided into two major groups based on DNA fingerprint analyses and biochemical properties. Group I strains have been generally isolated from non-watermelon cucurbits, while group II strains are closely associated with watermelon. In the present study, we report the genome sequence of M6, a group I model A. citrulli strain, isolated from melon. We used comparative genome analysis to investigate differences between the genome of strain M6 and the genome of the group II model strain AAC00-1. The draft genome sequence of A. citrulli M6 harbors 139 contigs, with an overall approximate size of 4.85 Mb. The genome of M6 is ∼500 Kb shorter than that of strain AAC00-1. Comparative analysis revealed that this size difference is mainly explained by eight fragments, ranging from ∼35–120 Kb and distributed throughout the AAC00-1 genome, which are absent in the M6 genome. In agreement with this finding, while AAC00-1 was found to possess 532 open reading frames (ORFs) that are absent in strain M6, only 123 ORFs in M6 were absent in AAC00-1. Most of these M6 ORFs are hypothetical proteins and most of them were also detected in two group I strains that were recently sequenced, tw6 and pslb65. Further analyses by PCR assays and coverage analyses with other A. citrulli strains support the notion that some of these fragments or significant portions of them are discriminative between groups I and II strains of A. citrulli. Moreover, GC content, effective number of codon values and cluster of orthologs’ analyses indicate that these fragments were introduced into group II strains by horizontal gene transfer events. Our study reports the genome sequence of a model group I strain of A. citrulli, one of the most important pathogens of cucurbits. It also provides the first comprehensive comparison at the genomic level between

  15. Insights from the Genome Sequence of Acidovorax citrulli M6, a Group I Strain of the Causal Agent of Bacterial Fruit Blotch of Cucurbits.

    PubMed

    Eckshtain-Levi, Noam; Shkedy, Dafna; Gershovits, Michael; Da Silva, Gustavo M; Tamir-Ariel, Dafna; Walcott, Ron; Pupko, Tal; Burdman, Saul

    2016-01-01

    Acidovorax citrulli is a seedborne bacterium that causes bacterial fruit blotch of cucurbit plants including watermelon and melon. A. citrulli strains can be divided into two major groups based on DNA fingerprint analyses and biochemical properties. Group I strains have been generally isolated from non-watermelon cucurbits, while group II strains are closely associated with watermelon. In the present study, we report the genome sequence of M6, a group I model A. citrulli strain, isolated from melon. We used comparative genome analysis to investigate differences between the genome of strain M6 and the genome of the group II model strain AAC00-1. The draft genome sequence of A. citrulli M6 harbors 139 contigs, with an overall approximate size of 4.85 Mb. The genome of M6 is ∼500 Kb shorter than that of strain AAC00-1. Comparative analysis revealed that this size difference is mainly explained by eight fragments, ranging from ∼35-120 Kb and distributed throughout the AAC00-1 genome, which are absent in the M6 genome. In agreement with this finding, while AAC00-1 was found to possess 532 open reading frames (ORFs) that are absent in strain M6, only 123 ORFs in M6 were absent in AAC00-1. Most of these M6 ORFs are hypothetical proteins and most of them were also detected in two group I strains that were recently sequenced, tw6 and pslb65. Further analyses by PCR assays and coverage analyses with other A. citrulli strains support the notion that some of these fragments or significant portions of them are discriminative between groups I and II strains of A. citrulli. Moreover, GC content, effective number of codon values and cluster of orthologs' analyses indicate that these fragments were introduced into group II strains by horizontal gene transfer events. Our study reports the genome sequence of a model group I strain of A. citrulli, one of the most important pathogens of cucurbits. It also provides the first comprehensive comparison at the genomic level between the

  16. The genomic organization of the Fanconi anemia group A (FAA) gene

    SciTech Connect

    Ianzano, L.; Centra, M.; Savino, M.

    1997-05-01

    Fanconi anemia (FA) is a genetically heterogeneous disease involving at least five genes on the basis of complementation analysis (FAA to FAE). The FAA gene has been recently isolated by two independent approaches, positional and functional cloning. In the present study we describe the genomic structure of the FAA gene. The gene contains 43 exons spanning approximately 80 kb as determined by the alignment of four cosmids and the fine localization of the first and the last exons in restriction fragments of these clones. Exons range from 34 to 188 bp. All but three of the splice sites were consistent with the ag-gt rule. We also describe three alternative splicing events in cDNA clones that result in the loss of exon 37, a 23-bp deletion at the 5{prime} end of exon 41. Sequence analysis of the 5{prime} region upstream of the putative transcription start site showed no obvious TATA and CAAT boxes, but did show a GC-rich region, typical of housekeeping genes. Knowledge of the structure of the FAA gene will provide an invaluable resource for the discovery of mutations in the gene that accounts for about 60-66% of FA patients. 24 refs., 3 figs., 1 tab.

  17. Perspective: maternal kin groups and the origins of asymmetric genetic systems-genomic imprinting, haplodiploidy, and parthenogenesis.

    PubMed

    Normark, Benjamin B

    2006-04-01

    The genetic systems of animals and plants are typically eumendelian. That is, an equal complement of autosomes is inherited from each of two parents, and at each locus, each parent's allele is equally likely to be expressed and equally likely to be transmitted. Genetic systems that violate any of these eumendelian symmetries are termed asymmetric and include parent-specific gene expression (PSGE), haplodiploidy, thelytoky, and related systems. Asymmetric genetic systems typically arise in lineages with close associations between kin (gregarious siblings, brooding, or viviparity). To date, different explanatory frameworks have been proposed to account for each of the different asymmetric genetic systems. Haig's kinship theory of genomic imprinting argues that PSGE arises when kinship asymmetries between interacting kin create conflicts between maternally and paternally derived alleles. Greater maternal than paternal relatedness within groups selects for more "abstemious" expression of maternally derived alleles and more "greedy" expression of paternally derived alleles. Here, I argue that this process may also underlie origins of haplodiploidy and many origins of thelytoky. The tendency for paternal alleles to be more "greedy" in maternal kin groups means that maternal-paternal conflict is not a zero-sum game: the maternal optimum will more closely correspond to the optimum for family groups and demes and for associated entities such as symbionts. Often in these circumstances, partial or complete suppression of paternal gene expression will evolve (haplodiploidy, thelytoky), or other features of the life cycle will evolve to minimize the conflict (monogamy, inbreeding). Maternally transmitted cytoplasmic elements and maternally imprinted nuclear alleles have a shared interest in minimizing agonistic interactions between female siblings and may cooperate to exclude the paternal genome. Eusociality is the most dramatic expression of the conflict-reducing effects of

  18. Use of Longitudinal Data in Genetic Studies in the Genome-wide Association Studies Era: Summary of Group 14

    PubMed Central

    Kerner, Berit; North, Kari E; Fallin, M Daniele

    2010-01-01

    Participants analyzed actual and simulated longitudinal data from the Framingham Heart Study for various metabolic and cardiovascular traits. The genetic information incorporated into these investigations ranged from selected single-nucleotide polymorphisms to genome-wide association arrays. Genotypes were incorporated using a broad range of methodological approaches including conditional logistic regression, linear mixed models, generalized estimating equations, linear growth curve estimation, growth modeling, growth mixture modeling, population attributable risk fraction based on survival functions under the proportional hazards models, and multivariate adaptive splines for the analysis of longitudinal data. The specific scientific questions addressed by these different approaches also varied, ranging from a more precise definition of the phenotype, bias reduction in control selection, estimation of effect sizes and genotype associated risk, to direct incorporation of genetic data into longitudinal modeling approaches and the exploration of population heterogeneity with regard to longitudinal trajectories. The group reached several overall conclusions: 1) The additional information provided by longitudinal data may be useful in genetic analyses. 2) The precision of the phenotype definition as well as control selection in nested designs may be improved, especially if traits demonstrate a trend over time or have strong age-of-onset effects. 3) Analyzing genetic data stratified for high-risk subgroups defined by a unique development over time could be useful for the detection of rare mutations in common multi-factorial diseases. 4) Estimation of the population impact of genomic risk variants could be more precise. The challenges and computational complexity demanded by genome-wide single-nucleotide polymorphism data were also discussed. PMID:19924713

  19. Use of longitudinal data in genetic studies in the genome-wide association studies era: summary of Group 14.

    PubMed

    Kerner, Berit; North, Kari E; Fallin, M Daniele

    2009-01-01

    Participants analyzed actual and simulated longitudinal data from the Framingham Heart Study for various metabolic and cardiovascular traits. The genetic information incorporated into these investigations ranged from selected single-nucleotide polymorphisms to genome-wide association arrays. Genotypes were incorporated using a broad range of methodological approaches including conditional logistic regression, linear mixed models, generalized estimating equations, linear growth curve estimation, growth modeling, growth mixture modeling, population attributable risk fraction based on survival functions under the proportional hazards models, and multivariate adaptive splines for the analysis of longitudinal data. The specific scientific questions addressed by these different approaches also varied, ranging from a more precise definition of the phenotype, bias reduction in control selection, estimation of effect sizes and genotype associated risk, to direct incorporation of genetic data into longitudinal modeling approaches and the exploration of population heterogeneity with regard to longitudinal trajectories. The group reached several overall conclusions: (1) The additional information provided by longitudinal data may be useful in genetic analyses. (2) The precision of the phenotype definition as well as control selection in nested designs may be improved, especially if traits demonstrate a trend over time or have strong age-of-onset effects. (3) Analyzing genetic data stratified for high-risk subgroups defined by a unique development over time could be useful for the detection of rare mutations in common multifactorial diseases. (4) Estimation of the population impact of genomic risk variants could be more precise. The challenges and computational complexity demanded by genome-wide single-nucleotide polymorphism data were also discussed. PMID:19924713

  20. Identification and Genomic Analysis of a Novel Group C Orthobunyavirus Isolated from a Mosquito Captured near Iquitos, Peru

    PubMed Central

    Treangen, Todd J.; Schoeler, George; Phillippy, Adam M.; Bergman, Nicholas H.; Turell, Michael J.

    2016-01-01

    Group C orthobunyaviruses are single-stranded RNA viruses found in both South and North America. Until very recently, and despite their status as important vector-borne human pathogens, no Group C whole genome sequences containing all three segments were available in public databases. Here we report a Group C orthobunyavirus, named El Huayo virus, isolated from a pool of Culex portesi mosquitoes captured near Iquitos, Peru. Although initial metagenomic analysis yielded only a handful of reads belonging to the genus Orthobunyavirus, single contig assemblies were generated for L, M, and S segments totaling over 200,000 reads (~0.5% of sample). Given the moderately high viremia in hamsters (>107 plaque-forming units/ml) and the propensity for Cx. portesi to feed on rodents, it is possible that El Huayo virus is maintained in nature in a Culex portesi/rodent cycle. El Huayo virus was found to be most similar to Peruvian Caraparu virus isolates and constitutes a novel subclade within Group C. PMID:27074162

  1. Identification and Genomic Analysis of a Novel Group C Orthobunyavirus Isolated from a Mosquito Captured near Iquitos, Peru.

    PubMed

    Treangen, Todd J; Schoeler, George; Phillippy, Adam M; Bergman, Nicholas H; Turell, Michael J

    2016-04-01

    Group C orthobunyaviruses are single-stranded RNA viruses found in both South and North America. Until very recently, and despite their status as important vector-borne human pathogens, no Group C whole genome sequences containing all three segments were available in public databases. Here we report a Group C orthobunyavirus, named El Huayo virus, isolated from a pool of Culex portesi mosquitoes captured near Iquitos, Peru. Although initial metagenomic analysis yielded only a handful of reads belonging to the genus Orthobunyavirus, single contig assemblies were generated for L, M, and S segments totaling over 200,000 reads (~0.5% of sample). Given the moderately high viremia in hamsters (>107 plaque-forming units/ml) and the propensity for Cx. portesi to feed on rodents, it is possible that El Huayo virus is maintained in nature in a Culex portesi/rodent cycle. El Huayo virus was found to be most similar to Peruvian Caraparu virus isolates and constitutes a novel subclade within Group C. PMID:27074162

  2. Expanded MLST genotyping and comparative genomic hybridization evidence for host preferred groups in Campylobacter coli

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The majority of previous work on campylobacteriosis has centered on the species Campylobacter jejuni, however, Campylobacter coli, the sister group to C. jejuni, is also a significant problem, but remains a much less studied organism. The purpose of this work was to develop and apply an expanded 16 ...

  3. Group-specific structural features of the 5'-proximal sequences of coronavirus genomic RNAs.

    PubMed

    Chen, Shih-Cheng; Olsthoorn, René C L

    2010-05-25

    Global predictions of the secondary structure of coronavirus (CoV) 5' untranslated regions and adjacent coding sequences revealed the presence of conserved structural elements. Stem loops (SL) 1, 2, 4, and 5 were predicted in all CoVs, while the core leader transcription-regulating sequence (L-TRS) forms SL3 in only some CoVs. SL5 in group I and II CoVs, with the exception of group IIa CoVs, is characterized by the presence of a large sequence insertion capable of forming hairpins with the conserved 5'-UUYCGU-3' loop sequence. Structure probing confirmed the existence of these hairpins in the group I Human coronavirus-229E and the group II Severe acute respiratory syndrome coronavirus (SARS-CoV). In general, the pattern of the 5' cis-acting elements is highly related to the lineage of CoVs, including features of the conserved hairpins in SL5. The function of these conserved hairpins as a putative packaging signal is discussed. PMID:20202661

  4. Framework for development of physician competencies in genomic medicine: report of the Competencies Working Group of the Inter-Society Coordinating Committee for Physician Education in Genomics.

    PubMed

    Korf, Bruce R; Berry, Anna B; Limson, Melvin; Marian, Ali J; Murray, Michael F; O'Rourke, P Pearl; Passamani, Eugene R; Relling, Mary V; Tooker, John; Tsongalis, Gregory J; Rodriguez, Laura L

    2014-11-01

    Completion of the Human Genome Project, in conjunction with dramatic reductions in the cost of DNA sequencing and advances in translational research, is gradually ushering genomic discoveries and technologies into the practice of medicine. The rapid pace of these advances is opening up a gap between the knowledge available about the clinical relevance of genomic information and the ability of clinicians to include such information in their medical practices. This educational gap threatens to be rate limiting to the clinical adoption of genomics in medicine. Solutions will require not only a better understanding of the clinical implications of genetic discoveries but also training in genomics at all levels of professional development, including for individuals in formal training and others who long ago completed such training. The National Human Genome Research Institute has convened the Inter-Society Coordinating Committee for Physician Education in Genomics (ISCC) to develop and share best practices in the use of genomics in medicine. The ISCC has developed a framework for development of genomics practice competencies that may serve as a starting point for formulation of competencies for physicians in various medical disciplines. PMID:24763287

  5. Chemical rescue, multiple ionizable groups, and general acid-base catalysis in the HDV genomic ribozyme.

    PubMed

    Perrotta, Anne T; Wadkins, Timothy S; Been, Michael D

    2006-07-01

    In the ribozyme from the hepatitis delta virus (HDV) genomic strand RNA, a cytosine side chain is proposed to facilitate proton transfer in the transition state of the reaction and, thus, act as a general acid-base catalyst. Mutation of this active-site cytosine (C75) reduced RNA cleavage rates by as much as one million-fold, but addition of exogenous cytosine and certain nucleobase or imidazole analogs can partially rescue activity in these mutants. However, pH-rate profiles for the rescued reactions were bell shaped, and only one leg of the pH-rate curve could be attributed to ionization of the exogenous nucleobase or buffer. When a second potential ionizable nucleobase (C41) was removed, one leg of the bell-shaped curve was eliminated in the chemical-rescue reaction. With this construct, the apparent pK(a) determined from the pH-rate profile correlated with the solution pK(a) of the buffer, and the contribution of the buffer to the rate enhancement could be directly evaluated in a free-energy or Brønsted plot. The free-energy relationship between the acid dissociation constant of the buffer and the rate constant for cleavage (Brønsted value, beta, = approximately 0.5) was consistent with a mechanism in which the buffer acted as a general acid-base catalyst. These data support the hypothesis that cytosine 75, in the intact ribozyme, acts as a general acid-base catalyst. PMID:16690998

  6. The Transcriptome of the Reference Potato Genome Solanum tuberosum Group Phureja Clone DM1-3 516R44

    PubMed Central

    Massa, Alicia N.; Childs, Kevin L.; Lin, Haining; Bryan, Glenn J.; Giuliano, Giovanni; Buell, C. Robin

    2011-01-01

    Advances in molecular breeding in potato have been limited by its complex biological system, which includes vegetative propagation, autotetraploidy, and extreme heterozygosity. The availability of the potato genome and accompanying gene complement with corresponding gene structure, location, and functional annotation are powerful resources for understanding this complex plant and advancing molecular breeding efforts. Here, we report a reference for the potato transcriptome using 32 tissues and growth conditions from the doubled monoploid Solanum tuberosum Group Phureja clone DM1-3 516R44 for which a genome sequence is available. Analysis of greater than 550 million RNA-Seq reads permitted the detection and quantification of expression levels of over 22,000 genes. Hierarchical clustering and principal component analyses captured the biological variability that accounts for gene expression differences among tissues suggesting tissue-specific gene expression, and genes with tissue or condition restricted expression. Using gene co-expression network analysis, we identified 18 gene modules that represent tissue-specific transcriptional networks of major potato organs and developmental stages. This information provides a powerful resource for potato research as well as studies on other members of the Solanaceae family. PMID:22046362

  7. Genomic Recombination Leading to Decreased Virulence of Group B Streptococcus in a Mouse Model of Adult Invasive Disease.

    PubMed

    Teatero, Sarah; Lemire, Paul; Dewar, Ken; Wasserscheid, Jessica; Calzas, Cynthia; Mallo, Gustavo V; Li, Aimin; Athey, Taryn B T; Segura, Mariela; Fittipaldi, Nahuel

    2016-01-01

    Adult invasive disease caused by Group B Streptococcus (GBS) is increasing worldwide. Whole-genome sequencing (WGS) now permits rapid identification of recombination events, a phenomenon that occurs frequently in GBS. Using WGS, we described that strain NGBS375, a capsular serotype V GBS isolate of sequence type (ST)297, has an ST1 genomic background but has acquired approximately 300 kbp of genetic material likely from an ST17 strain. Here, we examined the virulence of this strain in an in vivo model of GBS adult invasive infection. The mosaic ST297 strain showed intermediate virulence, causing significantly less systemic infection and reduced mortality than a more virulent, serotype V ST1 isolate. Bacteremia induced by the ST297 strain was similar to that induced by a serotype III ST17 strain, which was the least virulent under the conditions tested. Yet, under normalized bacteremia levels, the in vivo intrinsic capacity to induce the production of pro-inflammatory cytokines was similar between the ST297 strain and the virulent ST1 strain. Thus, the diminished virulence of the mosaic strain may be due to reduced capacity to disseminate or multiply in blood during a systemic infection which could be mediated by regulatory factors contained in the recombined region. PMID:27527222

  8. Genome analysis of Elusimicrobium minutum, the first cultivated representative of the Elusimicrobia phylum (formerly Termite Group 1)

    SciTech Connect

    Herlemann, D. P. R.; Geissinger, O.; Ikeda-Ohtsubo, W.; Kunin, V.; Sun, H.; Lapidus, A.; Hugenholtz, P.; Brune, A.

    2009-02-01

    The candidate phylum Termite group 1 (TG1), is regularly 1 encountered in termite hindguts but is present also in many other habitats. Here we report the complete genome sequence (1.64 Mbp) of Elusimicrobium minutum strain Pei191{sup T}, the first cultured representative of the TG1 phylum. We reconstructed the metabolism of this strictly anaerobic bacterium isolated from a beetle larva gut and discuss the findings in light of physiological data. E. minutum has all genes required for uptake and fermentation of sugars via the Embden-Meyerhof pathway, including several hydrogenases, and an unusual peptide degradation pathway comprising transamination reactions and leading to the formation of alanine, which is excreted in substantial amounts. The presence of genes encoding lipopolysaccharide biosynthesis and the presence of a pathway for peptidoglycan formation are consistent with ultrastructural evidence of a Gram-negative cell envelope. Even though electron micrographs showed no cell appendages, the genome encodes many genes putatively involved in pilus assembly. We assigned some to a type II secretion system, but the function of 60 pilE-like genes remains unknown. Numerous genes with hypothetical functions, e.g., polyketide synthesis, non-ribosomal peptide synthesis, antibiotic transport, and oxygen stress protection, indicate the presence of hitherto undiscovered physiological traits. Comparative analysis of 22 concatenated single-copy marker genes corroborated the status of Elusimicrobia (formerly TG1) as a separate phylum in the bacterial domain, which was so far based only on 16S rRNA sequence analysis.

  9. Increased UV resistance in xeroderma pigmentosum group A cells after transformation with a human genomic DNA clone.

    PubMed Central

    Rinaldy, A; Bellew, T; Egli, E; Lloyd, R S

    1990-01-01

    Xeroderma pigmentosum (XP) is an autosomal recessive disease in which the major clinical manifestation is a 2,000-fold enhanced probability of developing sunlight-induced skin tumors, and the molecular basis for the disease is a defective DNA excision repair system. To clone the gene defective in XP complementation group A (XP-A), cDNA clones were isolated by a competition hybridization strategy in which the corresponding mRNAs were more abundant in cells of the obligately heterozygous parents relative to cells of the homozygous proband affected with the disease. In this report, a human genomic DNA clone that contains this cDNA was transformed into two independent homozygous XP-A cell lines, and these transformants displayed partial restoration of resistance to the killing effects of UV irradiation. The abundance of mRNA corresponding to this cDNA appears to correlate well with the observed UV cell survival. The results of unscheduled DNA synthesis after UV exposure indicate that the transformed cells are repair proficient relative to that of the control XP-A cells. However, using this same genomic DNA, transformation of an XP-F cell line did not confer any enhancement of UV survival or promote unscheduled DNA synthesis after UV exposure. Images PMID:2168562

  10. Genome-Wide Analysis of Group A Streptococci Reveals a Mutation That Modulates Global Phenotype and Disease Specificity

    PubMed Central

    2006-01-01

    Many human pathogens produce phenotypic variants as a means to circumvent the host immune system and enhance survival and, as a potential consequence, exhibit increased virulence. For example, it has been known for almost 90 y that clinical isolates of the human bacterial pathogen group A streptococci (GAS) have extensive phenotypic heterogeneity linked to variation in virulence. However, the complete underlying molecular mechanism(s) have not been defined. Expression microarray analysis of nine clinical isolates identified two fundamentally different transcriptomes, designated pharyngeal transcriptome profile (PTP) and invasive transcriptome profile (ITP). PTP and ITP GAS differed in approximately 10% of the transcriptome, including at least 23 proven or putative virulence factor genes. ITP organisms were recovered from skin lesions of mice infected subcutaneously with PTP GAS and were significantly more able to survive phagocytosis and killing by human polymorphonuclear leukocytes. Complete genome resequencing of a mouse-derived ITP GAS revealed that the organism differed from its precursor by only a 7-bp frameshift mutation in the gene (covS) encoding the sensor kinase component of a two-component signal transduction system implicated in virulence. Genetic complementation, and sequence analysis of covR/S in 42 GAS isolates confirmed the central role of covR/S in transcriptome, exoproteome, and virulence modulation. Genome-wide analysis provides a heretofore unattained understanding of phenotypic variation and disease specificity in microbial pathogens, resulting in new avenues for vaccine and therapeutics research. PMID:16446783

  11. Phylogenetic relationships among sandfly fever group viruses (Phlebovirus: Bunyaviridae) based on the small genome segment.

    PubMed

    Xu, Fangling; Chen, Hongli; Travassos da Rosa, Amelia P A; Tesh, Robert B; Xiao, Shu-Yuan

    2007-08-01

    The phleboviruses are more diverse in terms of arthropod vectors and antigenic relationships than most other genera of arthropod-borne viruses. In this study, 30 sandfly fever group viruses from the Naples, Sicilian, Punta Toro, Icoaraci and Frijoles serocomplexes were sequenced. Phylogenetic analyses were performed based on the sequence of the open reading frame for the nucleoprotein (N) and non-structural (NSs) protein genes of the small (S) segment. The five resultant genotypic lineages correlated with the serological grouping and were similar to analysis of M segment sequences. The sequence identity for N and NSs genes within the Sicilian, Naples, Punta Toro, Icoaraci and Frijoles serocomplexes was determined. The results indicated that genetic divergence for the S segment is lower than that for the M segment, suggesting that the S segment is more stable during evolution. PMID:17622637

  12. Bacterial genome remodeling through bacteriophage recombination.

    PubMed

    Menouni, Rachid; Hutinet, Geoffrey; Petit, Marie-Agnès; Ansaldi, Mireille

    2015-01-01

    Bacteriophages co-exist and co-evolve with their hosts in natural environments. Virulent phages lyse infected cells through lytic cycles, whereas temperate phages often remain dormant and can undergo lysogenic or lytic cycles. In their lysogenic state, prophages are actually part of the host genome and replicate passively in rhythm with host division. However, prophages are far from being passive residents: they can modify or bring new properties to their host. In this review, we focus on two important phage-encoded recombination mechanisms, i.e. site-specific recombination and homologous recombination, and how they remodel bacterial genomes. PMID:25790500

  13. Genomic Analysis Identifies a Transcription Factor Binding Motif Regulating Expression of the Alpha C Protein in Group B Streptococcus

    PubMed Central

    Klinzing, David C.; Madoff, Lawrence C.; Puopolo, Karen M.

    2009-01-01

    The virulence-associated alpha C protein (ACP) of Group B Streptococcus (GBS) facilitates the bacterial interaction with host epithelial cells. We previously demonstrated that phase-variable expression of ACP is controlled by variation in short-sequence repeat sequences present upstream of the promoter of bca, the gene encoding ACP. To determine if trans-acting transcriptional control also influences ACP expression, we developed an in silico prediction algorithm that identified a potential transcription-factor binding motif (TTT-N6-ATAT) in the bca upstream region. In vitro reporter gene expression studies confirmed that this motif is required for full ACP expression, and DNA-binding assays with a GBS protein extract demonstrated that the predicted site is bound by a protein. This approach demonstrates the utility of in silico genomic predictive methods in the study of GBS regulatory mechanisms. PMID:19328843

  14. Phylogeny of the Sphaerotilus-Leptothrix group inferred from morphological comparisons, genomic fingerprinting, and 16S ribosomal DNA sequence analyses.

    PubMed

    Siering, P L; Ghiorse, W C

    1996-01-01

    Phase-contrast light microscopy revealed that only one of eight cultivated strains belonging to the Sphaerotilus-Leptothrix group of sheathed bacteria actually produced a sheath in standard growth media. Two Sphaerotilus natans strains produced branched cells, but other morphological characteristics that were used to identify these bacteria were consistent with previously published descriptions. Genomic fingerprints, which were obtained by performing PCR amplification with primers corresponding to enterobacterial repetitive intergenic consensus sequences, were useful for distinguishing between the genera Sphaerotilus and Leptothrix, as well as among individual strains. The complete 16S ribosomal DNA (rDNA) sequences of two strains of "Leptothrix discophora" (strains SP-6 and SS-1) were determined. In addition, partial sequences (approximately 300 nucleotides) of one strain of Leptothrix cholodnii (strain LMG 7171), an unidentified Leptothrix strain (strain NC-1), and four strains of Sphaerotilus natans (strains ATCC 13338T [T = type strain], ATCC 15291, ATCC 29329, and ATCC 29330) were determined. We found that two of the S. natans strains (ATCC 15291 and ATCC 13338T), which differed in morphology and in their genomic fingerprints, had identical sequences in the 300-nucleotide region sequenced. Both parsimony and distance matrix methods were used to infer the evolutionary relationships of the eight strains in a comparison of the 16S rDNA sequences of these organisms with 16S rDNA sequences obtained from ribosomal sequence databases. All of the strains clustered in the Rubrivivax subdivision of the beta subclass of the Proteobacteria, which confirmed previously published conclusions concerning selected individual strains. Additional analyses revealed that all of the S. natans strains clustered in one closely related group, while the Leptothrix strains clustered in two separate lineages that were approximately equidistant from the S. natans cluster. This finding

  15. The Genome and Linkage Map of the Northern Pike (Esox lucius): Conserved Synteny Revealed between the Salmonid Sister Group and the Neoteleostei

    PubMed Central

    Rondeau, Eric B.; Minkley, David R.; Leong, Jong S.; Messmer, Amber M.; Jantzen, Johanna R.; von Schalburg, Kristian R.; Lemon, Craig; Bird, Nathan H.; Koop, Ben F.

    2014-01-01

    The northern pike is the most frequently studied member of the Esociformes, the closest order to the diverse and economically important Salmoniformes. The ancestor of all salmonids purportedly experienced a whole-genome duplication (WGD) event, making salmonid species ideal for studying the early impacts of genome duplication while complicating their use in wider analyses of teleost evolution. Studies suggest that the Esociformes diverged from the salmonid lineage prior to the WGD, supporting the use of northern pike as a pre-duplication outgroup. Here we present the first genome assembly, reference transcriptome and linkage map for northern pike, and evaluate the suitability of this species to provide a representative pre-duplication genome for future studies of salmonid and teleost evolution. The northern pike genome sequence is composed of 94,267 contigs (N50 = 16,909 bp) contained in 5,688 scaffolds (N50 = 700,535 bp); the total scaffolded genome size is 878 million bases. Multiple lines of evidence suggest that over 96% of the protein-coding genome is present in the genome assembly. The reference transcriptome was constructed from 13 tissues and contains 38,696 transcripts, which are accompanied by normalized expression data in all tissues. Gene-prediction analysis produced a total of 19,601 northern pike-specific gene models. The first-generation linkage map identifies 25 linkage groups, in agreement with northern pike's diploid karyotype of 2N = 50, and facilitates the placement of 46% of assembled bases onto linkage groups. Analyses reveal a high degree of conserved synteny between northern pike and other model teleost genomes. While conservation of gene order is limited to smaller syntenic blocks, the wider conservation of genome organization implies the northern pike exhibits a suitable approximation of a non-duplicated Protacanthopterygiian genome. This dataset will facilitate future studies of esocid biology and empower ongoing examinations

  16. Population-ethnic group specific genome variation allele frequency data: a querying and visualization journey.

    PubMed

    Viennas, Emmanouil; Gkantouna, Vassiliki; Ioannou, Marina; Georgitsi, Marianthi; Rigou, Maria; Poulas, Konstantinos; Patrinos, George P; Tzimas, Giannis

    2012-08-01

    National/ethnic mutation databases aim to document the genetic heterogeneity in various populations and ethnic groups worldwide. We have previously reported the development and upgrade of FINDbase (www.findbase.org), a database recording causative mutations and pharmacogenomic marker allele frequencies in various populations around the globe. Although this database has recently been upgraded, we continuously try to enhance its functionality by providing more advanced visualization tools that would further assist effective data querying and comparisons. We are currently experimenting in various visualization techniques on the existing FINDbase causative mutation data collection aiming to provide a dynamic research tool for the worldwide scientific community. We have developed an interactive web-based application for population-based mutation data retrieval. It supports sophisticated data exploration allowing users to apply advanced filtering criteria upon a set of multiple views of the underlying data collection and enables browsing the relationships between individual datasets in a novel and meaningful way. PMID:22659238

  17. The Genomics Education Partnership: Successful Integration of Research into Laboratory Classes at a Diverse Group of Undergraduate Institutions

    ERIC Educational Resources Information Center

    Shaffer, Christopher D.; Alvarez, Consuelo; Bailey, Cheryl; Barnard, Daron; Bhalla, Satish; Chandrasekaran, Chitra; Chandrasekaran, Vidya; Chung, Hui-Min; Dorer, Douglas R.; Du, Chunguang; Eckdahl, Todd T.; Poet, Jeff L.; Frohlich, Donald; Goodman, Anya L.; Gosser, Yuying; Hauser, Charles; Hoopes, Laura L. M.; Johnson, Diana; Jones, Christopher J.; Kaehler, Marian; Kokan, Nighat; Kopp, Olga R.; Kuleck, Gary A.; McNeil, Gerard; Moss, Robert; Myka, Jennifer L.; Nagengast, Alexis; Morris, Robert; Overvoorde, Paul J.; Shoop, Elizabeth; Parrish, Susan; Reed, Kelynne; Regisford, E. Gloria; Revie, Dennis; Rosenwald, Anne G.; Saville, Ken; Schroeder, Stephanie; Shaw, Mary; Skuse, Gary; Smith, Christopher; Smith, Mary; Spana, Eric P.; Spratt, Mary; Stamm, Joyce; Thompson, Jeff S.; Wawersik, Matthew; Wilson, Barbara A.; Youngblom, Jim; Leung, Wilson; Buhler, Jeremy; Mardis, Elaine R.; Lopatto, David; Elgin, Sarah C. R.

    2010-01-01

    Genomics is not only essential for students to understand biology but also provides unprecedented opportunities for undergraduate research. The goal of the Genomics Education Partnership (GEP), a collaboration between a growing number of colleges and universities around the country and the Department of Biology and Genome Center of Washington…

  18. Toward a marker-dense meiotic map of the potato genome: lessons from linkage group I.

    PubMed Central

    Isidore, Edwige; van Os, Hans; Andrzejewski, Sandra; Bakker, Jaap; Barrena, Imanol; Bryan, Glenn J; Caromel, Bernard; van Eck, Herman; Ghareeb, Bilal; de Jong, Walter; van Koert, Paul; Lefebvre, Véronique; Milbourne, Dan; Ritter, Enrique; van der Voort, Jeroen Rouppe; Rousselle-Bourgeois, Françoise; van Vliet, Joke; Waugh, Robbie

    2003-01-01

    Segregation data were obtained for 1260 potato linkage group I-specific AFLP loci from a heterozygous diploid potato population. Analytical tools that identified potential typing errors and/or inconsistencies in the data and that assembled cosegregating markers into bins were applied. Bins contain multiple-marker data sets with an identical segregation pattern, which is defined as the bin signature. The bin signatures were used to construct a skeleton bin map that was based solely on observed recombination events. Markers that did not match any of the bin signatures exactly (and that were excluded from the calculation of the skeleton bin map) were placed on the map by maximum likelihood. The resulting maternal and paternal maps consisted of 95 and 101 bins, respectively. Markers derived from EcoRI/MseI, PstI/MseI, and SacI/MseI primer combinations showed different genetic distributions. Approximately three-fourths of the markers placed into a bin were considered to fit well on the basis of an estimated residual "error rate" of 0-3%. However, twice as many PstI-based markers fit badly, suggesting that parental PstI-site methylation patterns had changed in the population. Recombination frequencies were highly variable across the map. Inert, presumably centromeric, regions caused extensive marker clustering while recombination hotspots (or regions identical by descent) resulted in empty bins, despite the level of marker saturation. PMID:14704190

  19. Group II Intron-Mediated Trans-Splicing in the Gene-Rich Mitochondrial Genome of an Enigmatic Eukaryote, Diphylleia rotans

    PubMed Central

    Kamikawa, Ryoma; Shiratori, Takashi; Ishida, Ken-Ichiro; Miyashita, Hideaki; Roger, Andrew J.

    2016-01-01

    Although mitochondria have evolved from a single endosymbiotic event, present day mitochondria of diverse eukaryotes display a great range of genome structures, content and features. Group I and group II introns are two features that are distributed broadly but patchily in mitochondrial genomes across branches of the tree of eukaryotes. While group I intron-mediated trans-splicing has been reported from some lineages distantly related to each other, findings of group II intron-mediated trans-splicing has been restricted to members of the Chloroplastida. In this study, we found the mitochondrial genome of the unicellular eukaryote Diphylleia rotans possesses currently the second largest gene repertoire. On the basis of a probable phylogenetic position of Diphylleia, which is located within Amorphea, current mosaic gene distribution in Amorphea must invoke parallel gene losses from mitochondrial genomes during evolution. Most notably, although the cytochrome c oxidase subunit (cox) 1 gene was split into four pieces which located at a distance to each other, we confirmed that a single mature mRNA that covered the entire coding region could be generated by group II intron-mediated trans-splicing. This is the first example of group II intron-mediated trans-splicing outside Chloroplastida. Similar trans-splicing mechanisms likely work for bipartitely split cox2 and nad3 genes to generate single mature mRNAs. We finally discuss origin and evolution of this type of trans-splicing in D. rotans as well as in eukaryotes. PMID:26833505

  20. Group II Intron-Mediated Trans-Splicing in the Gene-Rich Mitochondrial Genome of an Enigmatic Eukaryote, Diphylleia rotans.

    PubMed

    Kamikawa, Ryoma; Shiratori, Takashi; Ishida, Ken-Ichiro; Miyashita, Hideaki; Roger, Andrew J

    2016-02-01

    Although mitochondria have evolved from a single endosymbiotic event, present day mitochondria of diverse eukaryotes display a great range of genome structures, content and features. Group I and group II introns are two features that are distributed broadly but patchily in mitochondrial genomes across branches of the tree of eukaryotes. While group I intron-mediated trans-splicing has been reported from some lineages distantly related to each other, findings of group II intron-mediated trans-splicing has been restricted to members of the Chloroplastida. In this study, we found the mitochondrial genome of the unicellular eukaryote Diphylleia rotans possesses currently the second largest gene repertoire. On the basis of a probable phylogenetic position of Diphylleia, which is located within Amorphea, current mosaic gene distribution in Amorphea must invoke parallel gene losses from mitochondrial genomes during evolution. Most notably, although the cytochrome c oxidase subunit (cox) 1 gene was split into four pieces which located at a distance to each other, we confirmed that a single mature mRNA that covered the entire coding region could be generated by group II intron-mediated trans-splicing. This is the first example of group II intron-mediated trans-splicing outside Chloroplastida. Similar trans-splicing mechanisms likely work for bipartitely split cox2 and nad3 genes to generate single mature mRNAs. We finally discuss origin and evolution of this type of trans-splicing in D. rotans as well as in eukaryotes. PMID:26833505

  1. Evolutionary pathway to increased virulence and epidemic group A Streptococcus disease derived from 3,615 genome sequences.

    PubMed

    Nasser, Waleed; Beres, Stephen B; Olsen, Randall J; Dean, Melissa A; Rice, Kelsey A; Long, S Wesley; Kristinsson, Karl G; Gottfredsson, Magnus; Vuopio, Jaana; Raisanen, Kati; Caugant, Dominique A; Steinbakk, Martin; Low, Donald E; McGeer, Allison; Darenberg, Jessica; Henriques-Normark, Birgitta; Van Beneden, Chris A; Hoffmann, Steen; Musser, James M

    2014-04-29

    We sequenced the genomes of 3,615 strains of serotype Emm protein 1 (M1) group A Streptococcus to unravel the nature and timing of molecular events contributing to the emergence, dissemination, and genetic diversification of an unusually virulent clone that now causes epidemic human infections worldwide. We discovered that the contemporary epidemic clone emerged in stepwise fashion from a precursor cell that first contained the phage encoding an extracellular DNase virulence factor (streptococcal DNase D2, SdaD2) and subsequently acquired the phage encoding the SpeA1 variant of the streptococcal pyrogenic exotoxin A superantigen. The SpeA2 toxin variant evolved from SpeA1 by a single-nucleotide change in the M1 progenitor strain before acquisition by horizontal gene transfer of a large chromosomal region encoding secreted toxins NAD(+)-glycohydrolase and streptolysin O. Acquisition of this 36-kb region in the early 1980s into just one cell containing the phage-encoded sdaD2 and speA2 genes was the final major molecular event preceding the emergence and rapid intercontinental spread of the contemporary epidemic clone. Thus, we resolve a decades-old controversy about the type and sequence of genomic alterations that produced this explosive epidemic. Analysis of comprehensive, population-based contemporary invasive strains from seven countries identified strong patterns of temporal population structure. Compared with a preepidemic reference strain, the contemporary clone is significantly more virulent in nonhuman primate models of pharyngitis and necrotizing fasciitis. A key finding is that the molecular evolutionary events transpiring in just one bacterial cell ultimately have produced millions of human infections worldwide. PMID:24733896

  2. Evolutionary pathway to increased virulence and epidemic group A Streptococcus disease derived from 3,615 genome sequences

    PubMed Central

    Nasser, Waleed; Beres, Stephen B.; Olsen, Randall J.; Dean, Melissa A.; Rice, Kelsey A.; Long, S. Wesley; Kristinsson, Karl G.; Gottfredsson, Magnus; Vuopio, Jaana; Raisanen, Kati; Caugant, Dominique A.; Steinbakk, Martin; Low, Donald E.; McGeer, Allison; Darenberg, Jessica; Henriques-Normark, Birgitta; Van Beneden, Chris A.; Hoffmann, Steen; Musser, James M.

    2014-01-01

    We sequenced the genomes of 3,615 strains of serotype Emm protein 1 (M1) group A Streptococcus to unravel the nature and timing of molecular events contributing to the emergence, dissemination, and genetic diversification of an unusually virulent clone that now causes epidemic human infections worldwide. We discovered that the contemporary epidemic clone emerged in stepwise fashion from a precursor cell that first contained the phage encoding an extracellular DNase virulence factor (streptococcal DNase D2, SdaD2) and subsequently acquired the phage encoding the SpeA1 variant of the streptococcal pyrogenic exotoxin A superantigen. The SpeA2 toxin variant evolved from SpeA1 by a single-nucleotide change in the M1 progenitor strain before acquisition by horizontal gene transfer of a large chromosomal region encoding secreted toxins NAD+-glycohydrolase and streptolysin O. Acquisition of this 36-kb region in the early 1980s into just one cell containing the phage-encoded sdaD2 and speA2 genes was the final major molecular event preceding the emergence and rapid intercontinental spread of the contemporary epidemic clone. Thus, we resolve a decades-old controversy about the type and sequence of genomic alterations that produced this explosive epidemic. Analysis of comprehensive, population-based contemporary invasive strains from seven countries identified strong patterns of temporal population structure. Compared with a preepidemic reference strain, the contemporary clone is significantly more virulent in nonhuman primate models of pharyngitis and necrotizing fasciitis. A key finding is that the molecular evolutionary events transpiring in just one bacterial cell ultimately have produced millions of human infections worldwide. PMID:24733896

  3. Challenges of flow-cytometric estimation of nuclear genome size in orchids, a plant group with both whole-genome and progressively partial endoreplication.

    PubMed

    Trávníček, Pavel; Ponert, Jan; Urfus, Tomáš; Jersáková, Jana; Vrána, Jan; Hřibová, Eva; Doležel, Jaroslav; Suda, Jan

    2015-10-01

    Nuclear genome size is an inherited quantitative trait of eukaryotic organisms with both practical and biological consequences. A detailed analysis of major families is a promising approach to fully understand the biological meaning of the extensive variation in genome size in plants. Although Orchidaceae accounts for ∼10% of the angiosperm diversity, the knowledge of patterns and dynamics of their genome size is limited, in part due to difficulties in flow cytometric analyses. Cells in various somatic tissues of orchids undergo extensive endoreplication, either whole-genome or partial, and the G1-phase nuclei with 2C DNA amounts may be lacking, resulting in overestimated genome size values. Interpretation of DNA content histograms is particularly challenging in species with progressively partial endoreplication, in which the ratios between the positions of two neighboring DNA peaks are lower than two. In order to assess distributions of nuclear DNA amounts and identify tissue suitable for reliable estimation of nuclear DNA content, we analyzed six different tissue types in 48 orchid species belonging to all recognized subfamilies. Although traditionally used leaves may provide incorrect C-values, particularly in species with progressively partial endoreplication, young ovaries and pollinaria consistently yield 2C and 1C peaks of their G1-phase nuclei, respectively, and are, therefore, the most suitable parts for genome size studies in orchids. We also provide new DNA C-values for 22 orchid genera and 42 species. Adhering to the proposed methodology would allow for reliable genome size estimates in this largest plant family. Although our research was limited to orchids, the need to find a suitable tissue with dominant 2C peak of G1-phase nuclei applies to all endopolyploid species. PMID:25929591

  4. Phylogenetic relationship and virulence inference of Streptococcus Anginosus Group: curated annotation and whole-genome comparative analysis support distinct species designation

    PubMed Central

    2013-01-01

    Background The Streptococcus Anginosus Group (SAG) represents three closely related species of the viridans group streptococci recognized as commensal bacteria of the oral, gastrointestinal and urogenital tracts. The SAG also cause severe invasive infections, and are pathogens during cystic fibrosis (CF) pulmonary exacerbation. Little genomic information or description of virulence mechanisms is currently available for SAG. We conducted intra and inter species whole-genome comparative analyses with 59 publically available Streptococcus genomes and seven in-house closed high quality finished SAG genomes; S. constellatus (3), S. intermedius (2), and S. anginosus (2). For each SAG species, we sequenced at least one numerically dominant strain from CF airways recovered during acute exacerbation and an invasive, non-lung isolate. We also evaluated microevolution that occurred within two isolates that were cultured from one individual one year apart. Results The SAG genomes were most closely related to S. gordonii and S. sanguinis, based on shared orthologs and harbor a similar number of proteins within each COG category as other Streptococcus species. Numerous characterized streptococcus virulence factor homologs were identified within the SAG genomes including; adherence, invasion, spreading factors, LPxTG cell wall proteins, and two component histidine kinases known to be involved in virulence gene regulation. Mobile elements, primarily integrative conjugative elements and bacteriophage, account for greater than 10% of the SAG genomes. S. anginosus was the most variable species sequenced in this study, yielding both the smallest and the largest SAG genomes containing multiple genomic rearrangements, insertions and deletions. In contrast, within the S. constellatus and S. intermedius species, there was extensive continuous synteny, with only slight differences in genome size between strains. Within S. constellatus we were able to determine important SNPs and changes in

  5. Complete mtDNA genomes of Filipino ethnolinguistic groups: a melting pot of recent and ancient lineages in the Asia-Pacific region.

    PubMed

    Delfin, Frederick; Min-Shan Ko, Albert; Li, Mingkun; Gunnarsdóttir, Ellen D; Tabbada, Kristina A; Salvador, Jazelyn M; Calacal, Gayvelline C; Sagum, Minerva S; Datar, Francisco A; Padilla, Sabino G; De Ungria, Maria Corazon A; Stoneking, Mark

    2014-02-01

    The Philippines is a strategic point in the Asia-Pacific region for the study of human diversity, history and origins, as it is a cross-road for human migrations and consequently exhibits enormous ethnolinguistic diversity. Following on a previous in-depth study of Y-chromosome variation, here we provide new insights into the maternal genetic history of Filipino ethnolinguistic groups by surveying complete mitochondrial DNA (mtDNA) genomes from a total of 14 groups (11 groups in this study and 3 groups previously published) including previously published mtDNA hypervariable segment (HVS) data from Filipino regional center groups. Comparison of HVS data indicate genetic differences between ethnolinguistic and regional center groups. The complete mtDNA genomes of 14 ethnolinguistic groups reveal genetic aspects consistent with the Y-chromosome, namely: diversity and heterogeneity of groups, no support for a simple dichotomy between Negrito and non-Negrito groups, and different genetic affinities with Asia-Pacific groups that are both ancient and recent. Although some mtDNA haplogroups can be associated with the Austronesian expansion, there are others that associate with South Asia, Near Oceania and Australia that are consistent with a southern migration route for ethnolinguistic group ancestors into the Asia-Pacific, with a timeline that overlaps with the initial colonization of the Asia-Pacific region, the initial colonization of the Philippines and a possible separate post-colonization migration into the Philippine archipelago. PMID:23756438

  6. Complete mtDNA genomes of Filipino ethnolinguistic groups: a melting pot of recent and ancient lineages in the Asia-Pacific region

    PubMed Central

    Delfin, Frederick; Min-Shan Ko, Albert; Li, Mingkun; Gunnarsdóttir, Ellen D; Tabbada, Kristina A; Salvador, Jazelyn M; Calacal, Gayvelline C; Sagum, Minerva S; Datar, Francisco A; Padilla, Sabino G; De Ungria, Maria Corazon A; Stoneking, Mark

    2014-01-01

    The Philippines is a strategic point in the Asia-Pacific region for the study of human diversity, history and origins, as it is a cross-road for human migrations and consequently exhibits enormous ethnolinguistic diversity. Following on a previous in-depth study of Y-chromosome variation, here we provide new insights into the maternal genetic history of Filipino ethnolinguistic groups by surveying complete mitochondrial DNA (mtDNA) genomes from a total of 14 groups (11 groups in this study and 3 groups previously published) including previously published mtDNA hypervariable segment (HVS) data from Filipino regional center groups. Comparison of HVS data indicate genetic differences between ethnolinguistic and regional center groups. The complete mtDNA genomes of 14 ethnolinguistic groups reveal genetic aspects consistent with the Y-chromosome, namely: diversity and heterogeneity of groups, no support for a simple dichotomy between Negrito and non-Negrito groups, and different genetic affinities with Asia-Pacific groups that are both ancient and recent. Although some mtDNA haplogroups can be associated with the Austronesian expansion, there are others that associate with South Asia, Near Oceania and Australia that are consistent with a southern migration route for ethnolinguistic group ancestors into the Asia-Pacific, with a timeline that overlaps with the initial colonization of the Asia-Pacific region, the initial colonization of the Philippines and a possible separate post-colonization migration into the Philippine archipelago. PMID:23756438

  7. Complete Genome Sequence of the Wild-Type Commensal Escherichia coli Strain SE15, Belonging to Phylogenetic Group B2▿

    PubMed Central

    Toh, Hidehiro; Oshima, Kenshiro; Toyoda, Atsushi; Ogura, Yoshitoshi; Ooka, Tadasuke; Sasamoto, Hiroyuki; Park, Sang-Hee; Iyoda, Sunao; Kurokawa, Ken; Morita, Hidetoshi; Itoh, Kikuji; Taylor, Todd D.; Hayashi, Tetsuya; Hattori, Masahira

    2010-01-01

    Escherichia coli SE15 (O150:H5) is a human commensal bacterium recently isolated from feces of a healthy adult and classified into E. coli phylogenetic group B2, which includes the majority of extraintestinal pathogenic E. coli. Here, we report the finished and annotated genome sequence of this organism. PMID:20008064

  8. Whole-Genome Sequences of Four Strains Closely Related to Members of the Mycobacterium chelonae Group, Isolated from Biofilms in a Drinking Water Distribution System Simulator

    EPA Science Inventory

    We report the draft genome sequences of four Mycobacterium chelonae group strains from biofilms obtained after a ‘chlorine burn’ in a chloraminated drinking water distribution system simulator. These opportunistic pathogens have been detected in drinking and hospital water distr...

  9. Genome-Wide Association Study of Ureide Concentration in Diverse Maturity Group IV Soybean [Glycine max (L.) Merr.] Accessions

    PubMed Central

    Ray, Jeffery D.; Dhanapal, Arun Prabhu; Singh, Shardendu K.; Hoyos-Villegas, Valerio; Smith, James R.; Purcell, Larry C.; King, C. Andy; Boykin, Debbie; Cregan, Perry B.; Song, Qijian; Fritschi, Felix B.

    2015-01-01

    Ureides are the N-rich products of N-fixation that are transported from soybean nodules to the shoot. Ureides are known to accumulate in leaves in response to water-deficit stress, and this has been used to identify genotypes with reduced N-fixation sensitivity to drought. Our objectives in this research were to determine shoot ureide concentrations in 374 Maturity Group IV soybean accessions and to identify genomic regions associated with shoot ureide concentration. The accessions were grown at two locations (Columbia, MO, and Stuttgart, AR) in 2 yr (2009 and 2010) and characterized for ureide concentration at beginning flowering to full bloom. Average shoot ureide concentrations across all four environments (two locations and two years) and 374 accessions ranged from 12.4 to 33.1 µmol g−1 and were comparable to previously reported values. SNP–ureide associations within and across the four environments were assessed using 33,957 SNPs with a MAF ≥0.03. In total, 53 putative loci on 18 chromosomes were identified as associated with ureide concentration. Two of the putative loci were located near previously reported QTL associated with ureide concentration and 30 loci were located near genes associated with ureide metabolism. The remaining putative loci were not near chromosomal regions previously associated with shoot ureide concentration and may mark new genes involved in ureide metabolism. Ultimately, confirmation of these putative loci will provide new sources of variation for use in soybean breeding programs. PMID:26374596

  10. Molecular classification of diffuse cerebral WHO grade II/III gliomas using genome- and transcriptome-wide profiling improves stratification of prognostically distinct patient groups.

    PubMed

    Weller, Michael; Weber, Ruthild G; Willscher, Edith; Riehmer, Vera; Hentschel, Bettina; Kreuz, Markus; Felsberg, Jörg; Beyer, Ulrike; Löffler-Wirth, Henry; Kaulich, Kerstin; Steinbach, Joachim P; Hartmann, Christian; Gramatzki, Dorothee; Schramm, Johannes; Westphal, Manfred; Schackert, Gabriele; Simon, Matthias; Martens, Tobias; Boström, Jan; Hagel, Christian; Sabel, Michael; Krex, Dietmar; Tonn, Jörg C; Wick, Wolfgang; Noell, Susan; Schlegel, Uwe; Radlwimmer, Bernhard; Pietsch, Torsten; Loeffler, Markus; von Deimling, Andreas; Binder, Hans; Reifenberger, Guido

    2015-05-01

    Cerebral gliomas of World Health Organization (WHO) grade II and III represent a major challenge in terms of histological classification and clinical management. Here, we asked whether large-scale genomic and transcriptomic profiling improves the definition of prognostically distinct entities. We performed microarray-based genome- and transcriptome-wide analyses of primary tumor samples from a prospective German Glioma Network cohort of 137 patients with cerebral gliomas, including 61 WHO grade II and 76 WHO grade III tumors. Integrative bioinformatic analyses were employed to define molecular subgroups, which were then related to histology, molecular biomarkers, including isocitrate dehydrogenase 1 or 2 (IDH1/2) mutation, 1p/19q co-deletion and telomerase reverse transcriptase (TERT) promoter mutations, and patient outcome. Genomic profiling identified five distinct glioma groups, including three IDH1/2 mutant and two IDH1/2 wild-type groups. Expression profiling revealed evidence for eight transcriptionally different groups (five IDH1/2 mutant, three IDH1/2 wild type), which were only partially linked to the genomic groups. Correlation of DNA-based molecular stratification with clinical outcome allowed to define three major prognostic groups with characteristic genomic aberrations. The best prognosis was found in patients with IDH1/2 mutant and 1p/19q co-deleted tumors. Patients with IDH1/2 wild-type gliomas and glioblastoma-like genomic alterations, including gain on chromosome arm 7q (+7q), loss on chromosome arm 10q (-10q), TERT promoter mutation and oncogene amplification, displayed the worst outcome. Intermediate survival was seen in patients with IDH1/2 mutant, but 1p/19q intact, mostly astrocytic gliomas, and in patients with IDH1/2 wild-type gliomas lacking the +7q/-10q genotype and TERT promoter mutation. This molecular subgrouping stratified patients into prognostically distinct groups better than histological classification. Addition of gene expression

  11. Complete genome sequence of the bacteriochlorophyll a-containing Roseibacterium elongatum type strain (DSM 19469(T)), a representative of the Roseobacter group isolated from Australian coast sand.

    PubMed

    Riedel, Thomas; Fiebig, Anne; Göker, Markus; Klenk, Hans-Peter

    2014-06-15

    Roseibacterium elongatum Suzuki et al. 2006 is a pink-pigmented and bacteriochlorophyll a-producing representative of the Roseobacter group within the alphaproteobacterial family Rhodobacteraceae. Representatives of the marine 'Roseobacter group' were found to be abundant in the ocean and play an important role in global and biogeochemical processes. In the present study we describe the features of R. elongatum strain OCh 323(T) together with its genome sequence and annotation. The 3,555,102 bp long genome consists of one circular chromosome with no extrachromosomal elements and is one of the smallest known Roseobacter genomes. It contains 3,540 protein-coding genes and 59 RNA genes. Genome analysis revealed the presence of a photosynthetic gene cluster, which putatively enables a photoheterotrophic lifestyle. Gene sequences associated with quorum sensing, motility, surface attachment, and thiosulfate and carbon monoxide oxidation could be detected. The genome was sequenced as part of the activities of the Transregional Collaborative Research Centre 51 (TRR51) funded by the German Research Foundation (DFG). PMID:25197467

  12. Phylogenomic network and comparative genomics reveal a diverged member of the ΦKZ-related group, marine vibrio phage ΦJM-2012.

    PubMed

    Jang, Ho Bin; Fagutao, Fernand F; Nho, Seong Won; Park, Seong Bin; Cha, In Seok; Yu, Jong Earn; Lee, Jung Seok; Im, Se Pyeong; Aoki, Takashi; Jung, Tae Sung

    2013-12-01

    Bacteriophages are the largest reservoir of genetic diversity. Here we describe the novel phage ΦJM-2012. This natural isolate from marine Vibrio cyclitrophicus possesses very few gene contents relevant to other well-studied marine Vibrio phages. To better understand its evolutionary history, we built a mathematical model of pairwise relationships among 1,221 phage genomes, in which the genomes (nodes) are linked by edges representing the normalized number of shared orthologous protein families. This weighted network revealed that ΦJM-2012 was connected to only five members of the Pseudomonas ΦKZ-like phage family in an isolated network, strongly indicating that it belongs to this phage group. However, comparative genomic analyses highlighted an almost complete loss of colinearity with the ΦKZ-related genomes and little conservation of gene order, probably reflecting the action of distinct evolutionary forces on the genome of ΦJM-2012. In this phage, typical conserved core genes, including six RNA polymerase genes, were frequently displaced and the hyperplastic regions were rich in both unique genes and predicted unidirectional promoters with highly correlated orientations. Further, analysis of the ΦJM-2012 genome showed that segments of the conserved N-terminal parts of ΦKZ tail fiber paralogs exhibited evidence of combinatorial assortment, having switched transcriptional orientation, and there was recruitment and/or structural changes among phage endolysins and tail spike protein. Thus, this naturally occurring phage appears to have branched from a common ancestor of the ΦKZ-related groups, showing a distinct genomic architecture and unique genes that most likely reflect adaptation to its chosen host and environment. PMID:24067958

  13. The Genomics Education Partnership: Successful Integration of Research into Laboratory Classes at a Diverse Group of Undergraduate Institutions

    PubMed Central

    Shaffer, Christopher D.; Alvarez, Consuelo; Bailey, Cheryl; Barnard, Daron; Bhalla, Satish; Chandrasekaran, Chitra; Chandrasekaran, Vidya; Chung, Hui-Min; Dorer, Douglas R.; Du, Chunguang; Eckdahl, Todd T.; Poet, Jeff L.; Frohlich, Donald; Goodman, Anya L.; Gosser, Yuying; Hauser, Charles; Hoopes, Laura L.M.; Johnson, Diana; Jones, Christopher J.; Kaehler, Marian; Kokan, Nighat; Kopp, Olga R.; Kuleck, Gary A.; McNeil, Gerard; Moss, Robert; Myka, Jennifer L.; Nagengast, Alexis; Morris, Robert; Overvoorde, Paul J.; Shoop, Elizabeth; Parrish, Susan; Reed, Kelynne; Regisford, E. Gloria; Revie, Dennis; Rosenwald, Anne G.; Saville, Ken; Schroeder, Stephanie; Shaw, Mary; Skuse, Gary; Smith, Christopher; Smith, Mary; Spana, Eric P.; Spratt, Mary; Stamm, Joyce; Thompson, Jeff S.; Wawersik, Matthew; Wilson, Barbara A.; Youngblom, Jim; Leung, Wilson; Buhler, Jeremy; Mardis, Elaine R.; Lopatto, David

    2010-01-01

    Genomics is not only essential for students to understand biology but also provides unprecedented opportunities for undergraduate research. The goal of the Genomics Education Partnership (GEP), a collaboration between a growing number of colleges and universities around the country and the Department of Biology and Genome Center of Washington University in St. Louis, is to provide such research opportunities. Using a versatile curriculum that has been adapted to many different class settings, GEP undergraduates undertake projects to bring draft-quality genomic sequence up to high quality and/or participate in the annotation of these sequences. GEP undergraduates have improved more than 2 million bases of draft genomic sequence from several species of Drosophila and have produced hundreds of gene models using evidence-based manual annotation. Students appreciate their ability to make a contribution to ongoing research, and report increased independence and a more active learning approach after participation in GEP projects. They show knowledge gains on pre- and postcourse quizzes about genes and genomes and in bioinformatic analysis. Participating faculty also report professional gains, increased access to genomics-related technology, and an overall positive experience. We have found that using a genomics research project as the core of a laboratory course is rewarding for both faculty and students. PMID:20194808

  14. Draft Genome Sequence of Dietzia maris DSM 43672, a Gram-Positive Bacterium of the Mycolata Group

    PubMed Central

    Ganguly, Sonalli; Jimenez-Galisteo, Guadalupe; Pletzer, Daniel; Winterhalter, Mathias; Benz, Roland

    2016-01-01

    Here, we report the draft genome sequence of Dietzia maris, known previously as Rhodococcus maris. It is 3,505,372 bp in size with a G+C content of 73%. The draft genome sequence will improve our understanding of Dietzia maris related to other mycolata species and constitutes a basic tool for exploring the cell wall proteins. PMID:27284155

  15. Draft Genome Sequence of Dietzia maris DSM 43672, a Gram-Positive Bacterium of the Mycolata Group.

    PubMed

    Ganguly, Sonalli; Jimenez-Galisteo, Guadalupe; Pletzer, Daniel; Winterhalter, Mathias; Benz, Roland; Viñas, Miguel

    2016-01-01

    Here, we report the draft genome sequence of Dietzia maris, known previously as Rhodococcus maris It is 3,505,372 bp in size with a G+C content of 73%. The draft genome sequence will improve our understanding of Dietzia maris related to other mycolata species and constitutes a basic tool for exploring the cell wall proteins. PMID:27284155

  16. Three groups of transposable elements with contrasting copy number dynamics and host responses in the maize (Zea mays ssp. mays) genome.

    PubMed

    Diez, Concepcion M; Meca, Esteban; Tenaillon, Maud I; Gaut, Brandon S

    2014-04-01

    Most angiosperm nuclear DNA is repetitive and derived from silenced transposable elements (TEs). TE silencing requires substantial resources from the plant host, including the production of small interfering RNAs (siRNAs). Thus, the interaction between TEs and siRNAs is a critical aspect of both the function and the evolution of plant genomes. Yet the co-evolutionary dynamics between these two entities remain poorly characterized. Here we studied the organization of TEs within the maize (Zea mays ssp mays) genome, documenting that TEs fall within three groups based on the class and copy numbers. These groups included DNA elements, low copy RNA elements and higher copy RNA elements. The three groups varied statistically in characteristics that included length, location, age, siRNA expression and 24:22 nucleotide (nt) siRNA targeting ratios. In addition, the low copy retroelements encompassed a set of TEs that had previously been shown to decrease expression within a 24 nt siRNA biogenesis mutant (mop1). To investigate the evolutionary dynamics of the three groups, we estimated their abundance in two landraces, one with a genome similar in size to that of the maize reference and the other with a 30% larger genome. For all three accessions, we assessed TE abundance as well as 22 nt and 24 nt siRNA content within leaves. The high copy number retroelements are under targeted similarly by siRNAs among accessions, appear to be born of a rapid bust of activity, and may be currently transpositionally dead or limited. In contrast, the lower copy number group of retrolements are targeted more dynamically and have had a long and ongoing history of transposition in the maize genome. PMID:24743518

  17. Genomic Testing

    MedlinePlus

    ... Working Group Independent Web site Informing the effective integration of genomics into health practice—Lynch syndrome ACCE Model for Evaluating Genetic Tests Recommendations by the EGAPP Working Group Top of ... ...

  18. Insights into the history of a bacterial group II intron remnant from the genomes of the nitrogen-fixing symbionts Sinorhizobium meliloti and Sinorhizobium medicae

    PubMed Central

    Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F

    2014-01-01

    Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3′ terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species. PMID:24736785

  19. Insights into the history of a bacterial group II intron remnant from the genomes of the nitrogen-fixing symbionts Sinorhizobium meliloti and Sinorhizobium medicae.

    PubMed

    Toro, N; Martínez-Rodríguez, L; Martínez-Abarca, F

    2014-10-01

    Group II introns are self-splicing catalytic RNAs that act as mobile retroelements. In bacteria, they are thought to be tolerated to some extent because they self-splice and home preferentially to sites outside of functional genes, generally within intergenic regions or in other mobile genetic elements, by mechanisms including the divergence of DNA target specificity to prevent target site saturation. RmInt1 is a mobile group II intron that is widespread in natural populations of Sinorhizobium meliloti and was first described in the GR4 strain. Like other bacterial group II introns, RmInt1 tends to evolve toward an inactive form by fragmentation, with loss of the 3' terminus. We identified genomic evidence of a fragmented intron closely related to RmInt1 buried in the genome of the extant S. meliloti/S. medicae species. By studying this intron, we obtained evidence for the occurrence of intron insertion before the divergence of ancient rhizobial species. This fragmented group II intron has thus existed for a long time and has provided sequence variation, on which selection can act, contributing to diverse genetic rearrangements, and to generate pan-genome divergence after strain differentiation. The data presented here suggest that fragmented group II introns within intergenic regions closed to functionally important neighboring genes may have been microevolutionary forces driving adaptive evolution of these rhizobial species. PMID:24736785

  20. Mitotic index, microvascular proliferation, and necrosis define 3 groups of 1p/19q codeleted anaplastic oligodendrogliomas associated with different genomic alterations

    PubMed Central

    Figarella-Branger, Dominique; Mokhtari, Karima; Dehais, Caroline; Jouvet, Anne; Uro-Coste, Emmanuelle; Colin, Carole; Carpentier, Catherine; Forest, Fabien; Maurage, Claude-Alain; Vignaud, Jean-Michel; Polivka, Marc; Lechapt-Zalcman, Emmanuelle; Eimer, Sandrine; Viennet, Gabriel; Quintin-Roué, Isabelle; Aubriot-Lorton, Marie-Hélène; Diebold, Marie-Danièle; Loussouarn, Delphine; Lacroix, Catherine; Rigau, Valérie; Laquerrière, Annie; Vandenbos, Fanny; Michalak, Sophie; Sevestre, Henri; Peoch, Michel; Labrousse, François; Christov, Christo; Kemeny, Jean-Louis; Chenard, Marie-Pierre; Chiforeanu, Danchristian; Ducray, François; Idbaih, Ahmed; Desenclos, Christine; Menei, Philippe; Al Nader, Edmond; Godard, Joel; Servagi-Vernat, Stéphanie; Carpentier, Antoine; Loiseau, Hugues; Dam-Hieu, Phong; Guillamo, Jean Sebastien; Emery, Evelyne; Verelle, Pierre; Durando, Xavier; Faillot, Thierry; Le Guerinel, Caroline; Ghiringhelli, François; Parker, Fabrice; Adam, Clovis; Dubois, François; Ramirez, Carole; Gueye, Edouard Marcel; Honnorat, Jerome; Chinot, Olivier; Bauchet, Luc; Beauchesne, Patrick; Campone, Mario; Frenel, Jean Sébastien; Fontaine, Denys; Campello, Chantal; Roger, Pascal; Heitzmann, Anne; Fesneau, Mélanie; Delattre, Jean Yves; Elouadhani-Hamdi, Selma; Ricard, Damien; Colin, Philippe; Vauléon, Elodie; Langlois, Olivier; Fotso, Marie Janette Motsuo; Andraud, Marie; Mouton, Servane; Noel, Georges; Desse, Nicolas; Soulard, Raoulin; Cohen-Moyal, Elisabeth; Lubrano, Vincent; Dhermain, Frederic

    2014-01-01

    Background The aim of this study was to correlate histological features and molecular characteristics in anaplastic oligodendrogliomas (AOs). Methods The histological characteristics of 203 AO patients, enrolled in the French national network POLA, were analyzed. The genomic profiles of 191 cases were studied using genomic arrays. IDH mutational status was assessed by immunohistochemistry and direct sequencing. Results 1p/19q codeletion was present in 79% of cases and was associated with alpha-internexin expression (P < 10−4), IDH1/2 mutation (P < 10−4), chromosome 4 loss (P < 10−3), and better overall survival (P < 10−4). Based on mitotic index, microvascular proliferation (MVP), and necrosis, 3 groups of 1p/19q codeleted AOs were identified: (group 1) AO with more than 5 mitoses per 10-HPF, no MVP, and no necrosis; (group 2) AO with MVP and no necrosis; and (group 3) AO with MVP and necrosis. Compared with group 1, groups 2 and 3 AOs had a higher mean Ki-67 proliferation index and a higher rate of 9p and 9q losses. Compared with group 2, group 3 AOs had a higher number of chromosomal alterations including chromosome 4 loss. In the subgroup of 157 1p/19q codeleted AOs, chromosomal instability was associated with shorter progression-free survival (P = .024) and shorter overall survival (P = .023). Conclusions The present study shows that oligodendrogliomas with classic histological features remain a molecularly heterogeneous entity and should be stratified according to 1p/19q status because of its major prognostic relevance. Moreover, 1p/19q codeleted AOs are also heterogeneous. Interestingly, mitotic index, MVP, and necrosis help to classify them into 3 groups associated with distinct genomic alterations. PMID:24723566

  1. Complete genome sequence of the bacteriochlorophyll a-containing Roseibacterium elongatum type strain (DSM 19469T), a representative of the Roseobacter group isolated from Australian coast sand

    PubMed Central

    Riedel, Thomas; Fiebig, Anne; Göker, Markus; Klenk, Hans-Peter

    2014-01-01

    Roseibacterium elongatum Suzuki et al. 2006 is a pink-pigmented and bacteriochlorophyll a-producing representative of the Roseobacter group within the alphaproteobacterial family Rhodobacteraceae. Representatives of the marine ‘Roseobacter group’ were found to be abundant in the ocean and play an important role in global and biogeochemical processes. In the present study we describe the features of R. elongatum strain OCh 323T together with its genome sequence and annotation. The 3,555,102 bp long genome consists of one circular chromosome with no extrachromosomal elements and is one of the smallest known Roseobacter genomes. It contains 3,540 protein-coding genes and 59 RNA genes. Genome analysis revealed the presence of a photosynthetic gene cluster, which putatively enables a photoheterotrophic lifestyle. Gene sequences associated with quorum sensing, motility, surface attachment, and thiosulfate and carbon monoxide oxidation could be detected. The genome was sequenced as part of the activities of the Transregional Collaborative Research Centre 51 (TRR51) funded by the German Research Foundation (DFG). PMID:25197467

  2. Genome sequence of the Wenxinia marina type strain (DSM 24838T), a representative of the Roseobacter group isolated from oilfield sediments

    PubMed Central

    Riedel, Thomas; Fiebig, Anne; Han, James; Huntemann, Marcel; Spring, Stefan; Petersen, Jörn; Ivanova, Natalia N.; Markowitz, Victor; Woyke, Tanja; Göker, Markus; Kyrpides, Nikos C.; Klenk, Hans-Peter

    2014-01-01

    Wenxinia marina Ying et al. 2007 is the type species of the genus Wenxinia, a representative of the Roseobacter group within the alphaproteobacterial family Rhodobacteraceae, isolated from oilfield sediments of the South China Sea. This family was shown to harbor the most abundant bacteria especially from coastal and polar waters, but was also found in microbial mats, sediments and attached to different kind of surfaces. Here we describe the features of W. marina strain HY34T together with the genome sequence and annotation of strain DSM 24838T and novel aspects of its phenotype. The 4,181,754 bp containing genome sequence encodes 4,047 protein-coding genes and 59 RNA genes. The genome of W. marina DSM 24838T was sequenced as part of the activities of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project funded by the DoE and the Transregional Collaborative Research Centre 51 (TRR51) funded by the German Research Foundation (DFG). PMID:25197468

  3. Draft Genome Sequence of the Plant-Pathogenic Soil Fungus Rhizoctonia solani Anastomosis Group 3 Strain Rhs1AP

    PubMed Central

    Cubeta, Marc A.; Dean, Ralph A.; Jabaji, Suha; Neate, Stephen M.; Tavantzis, Stellos; Toda, Takeshi; Vilgalys, Rytas; Bharathan, Narayanaswamy; Fedorova-Abrams, Natalie; Pakala, Suman B.; Pakala, Suchitra M.; Zafar, Nikhat; Joardar, Vinita; Losada, Liliana; Nierman, William C.

    2014-01-01

    The soil fungus Rhizoctonia solani is a pathogen of agricultural crops. Here, we report on the 51,705,945 bp draft consensus genome sequence of R. solani strain Rhs1AP. A comprehensive understanding of the heterokaryotic genome complexity and organization of R. solani may provide insight into the plant disease ecology and adaptive behavior of the fungus. PMID:25359908

  4. Draft Genome Sequence of the Plant-Pathogenic Soil Fungus Rhizoctonia solani Anastomosis Group 3 Strain Rhs1AP.

    PubMed

    Cubeta, Marc A; Thomas, Elizabeth; Dean, Ralph A; Jabaji, Suha; Neate, Stephen M; Tavantzis, Stellos; Toda, Takeshi; Vilgalys, Rytas; Bharathan, Narayanaswamy; Fedorova-Abrams, Natalie; Pakala, Suman B; Pakala, Suchitra M; Zafar, Nikhat; Joardar, Vinita; Losada, Liliana; Nierman, William C

    2014-01-01

    The soil fungus Rhizoctonia solani is a pathogen of agricultural crops. Here, we report on the 51,705,945 bp draft consensus genome sequence of R. solani strain Rhs1AP. A comprehensive understanding of the heterokaryotic genome complexity and organization of R. solani may provide insight into the plant disease ecology and adaptive behavior of the fungus. PMID:25359908

  5. Genome sequence of the Roseovarius mucosus type strain (DSM 17069T), a bacteriochlorophyll a-containing representative of the marine Roseobacter group isolated from the dinoflagellate Alexandrium ostenfeldii

    PubMed Central

    2015-01-01

    Roseovarius mucosus Biebl et al. 2005 is a bacteriochlorophyll a-producing representative of the marine Roseobacter group within the alphaproteobacterial family Rhodobacteraceae, which was isolated from the dinoflagellate Alexandrium ostenfeldii. The marine Roseobacter group was found to be abundant in the ocean and plays an important role for global and biogeochemical processes. Here we describe the features of the R. mucosus strain DFL-24T together with its genome sequence and annotation generated from a culture of DSM 17069T. The 4,247,724 bp containing genome sequence encodes 4,194 protein-coding genes and 57 RNA genes. In addition to the presence of four plasmids, genome analysis revealed the presence of genes associated with host colonization, DMSP utilization, cytotoxins, and quorum sensing that could play a role in the interrelationship of R. mucosus with the dinoflagellate A. ostenfeldii and other marine organisms. Furthermore, the genome encodes genes associated with mixotrophic growth, where both reduced inorganic compounds for lithotrophic growth and a photoheterotrophic lifestyle using light as additional energy source could be used. PMID:26203330

  6. The Complete Genome Phylogeny of Geographically Distinct Dengue Virus Serotype 2 Isolates (1944-2013) Supports Further Groupings within the Cosmopolitan Genotype

    PubMed Central

    Ali, Akhtar; Ali, Ijaz

    2015-01-01

    Dengue virus serotype 2 (DENV-2) isolates have been implicated in deadly outbreaks of dengue fever (DF) and dengue hemorrhagic fever (DHF) in several regions of the world. Phylogenetic analysis of DENV-2 isolates collected from particular countries has been performed using partial or individual genes but only a few studies have examined complete whole-genome sequences collected worldwide. Herein, 50 complete genome sequences of DENV-2 isolates, reported over the past 70 years from 19 different countries, were downloaded from GenBank. Phylogenetic analysis was conducted and evolutionary distances of the 50 DENV-2 isolates were determined using maximum likelihood (ML) trees or Bayesian phylogenetic analysis created from complete genome nucleotide (nt) and amino acid (aa) sequences or individual gene sequences. The results showed that all DENV-2 isolates fell into seven main groups containing five previously defined genotypes. A Cosmopolitan genotype showed further division into three groups (C-I, C-II, and C-III) with the C-I group containing two subgroups (C-IA and C-IB). Comparison of the aa sequences showed specific mutations among the various groups of DENV-2 isolates. A maximum number of aa mutations was observed in the NS5 gene, followed by the NS2A, NS3 and NS1 genes, while the smallest number of aa substitutions was recorded in the capsid gene, followed by the PrM/M, NS4A, and NS4B genes. Maximum evolutionary distances were found in the NS2A gene, followed by the NS4A and NS4B genes. Based on these results, we propose that genotyping of DENV-2 isolates in future studies should be performed on entire genome sequences in order to gain a complete understanding of the evolution of various isolates reported from different geographical locations around the world. PMID:26414178

  7. Evolutionary Dynamics of the Mitochondrial Genome in the Evaniomorpha (Hymenoptera)—A Group with an Intermediate Rate of Gene Rearrangement

    PubMed Central

    Mao, Meng; Gibson, Tracey; Dowton, Mark

    2014-01-01

    We determined the complete mitochondrial (mt) genomes of three evaniomorph species, Ceraphron sp. (Ceraphronoidea), Gasteruption sp. (Evanioidea), and Orthogonalys pulchella (Trigonalyoidea) as well as the nearly complete mt genome from another evaniomorph species, Megalyra sp. (Megalyroidea). Each of them possesses dramatic gene rearrangements, including protein-coding or rRNA genes. Gene inversions were identified in all of these mt genomes; for example, the two rRNA genes have inverted and moved into the nad2-cox1 junction in the Megalyra sp. mt genome. In addition, we found two copies of a 10-bp complementary repeat at the beginning of rrnS and at the end of trnL2 in the Gasteruption sp. mt genome, consistent with recombination as the possible mechanism for gene inversion and long-range movement. Although each of the genomes contains a number of repeats of varying size, there was no consistent association of the size or number of repeats with the extent or type of gene rearrangement. The breakpoint distance analysis showed the Evaniomorpha has an intermediate rate of gene rearrangement. Sequence-based phylogenetic analyses of 13 protein-coding and 2 rRNA genes in 22 hymenopteran taxa recovered a paraphyletic Evaniomorpha with the Aculeata nested within it. Within the Evaniomorpha, our analyses confirmed the Trigonalyoidea + Megalyroidea as the sister group to the Aculeata and recovered a novel clade, Ceraphronoidea + Evanioidea. In contrast to previous hymenopteran phylogenetic studies, the internal relationships of the Evaniomorpha were highly supported and robust to the variation of alignment approach and phylogenetic inference approach. PMID:25115010

  8. Comparative analysis of three Magnaporthaceae mitochondrial genomes reveals group I introns in the soil-inhabiting pathogens

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A comparison of the mitochrondrial genomes of two soilborne grass pathogens, M. poae of turfgrass and Gaemannomyces graminis var. triciti (Ggt) of wheat, was made to the foliar rice blast pathogen, Magnaporthe oryzae. We sought to extend observations from nuclear-coded genes that the soilborne patho...

  9. First Genome Sequence of Wild Onion Symptomless Virus, a Novel Member of Potyvirus in the Turnip Mosaic Virus Phylogenetic Group.

    PubMed

    Ohshima, Kazusato; Korkmaz, Savas; Mitoma, Shinichiro; Nomiyama, Rei; Honda, Yuki

    2016-01-01

    The nearly complete genome sequence of a new species of potyvirus was obtained from the symptomless wild onion (Allium sp.) in Turkey. This virus has less than 67% nucleotide sequence identities over the polyprotein to other known potyviruses. We propose the name wild onion symptomless virus for this novel potyvirus. PMID:27540073

  10. First Genome Sequence of Wild Onion Symptomless Virus, a Novel Member of Potyvirus in the Turnip Mosaic Virus Phylogenetic Group

    PubMed Central

    Korkmaz, Savas; Mitoma, Shinichiro; Nomiyama, Rei; Honda, Yuki

    2016-01-01

    The nearly complete genome sequence of a new species of potyvirus was obtained from the symptomless wild onion (Allium sp.) in Turkey. This virus has less than 67% nucleotide sequence identities over the polyprotein to other known potyviruses. We propose the name wild onion symptomless virus for this novel potyvirus. PMID:27540073

  11. Population Structure and Comparative Genome Hybridization of European Flor Yeast Reveal a Unique Group of Saccharomyces cerevisiae Strains with Few Gene Duplications in Their Genome

    PubMed Central

    Legras, Jean-Luc; Erny, Claude; Charpentier, Claudine

    2014-01-01

    Wine biological aging is a wine making process used to produce specific beverages in several countries in Europe, including Spain, Italy, France, and Hungary. This process involves the formation of a velum at the surface of the wine. Here, we present the first large scale comparison of all European flor strains involved in this process. We inferred the population structure of these European flor strains from their microsatellite genotype diversity and analyzed their ploidy. We show that almost all of these flor strains belong to the same cluster and are diploid, except for a few Spanish strains. Comparison of the array hybridization profile of six flor strains originating from these four countries, with that of three wine strains did not reveal any large segmental amplification. Nonetheless, some genes, including YKL221W/MCH2 and YKL222C, were amplified in the genome of four out of six flor strains. Finally, we correlated ICR1 ncRNA and FLO11 polymorphisms with flor yeast population structure, and associate the presence of wild type ICR1 and a long Flo11p with thin velum formation in a cluster of Jura strains. These results provide new insight into the diversity of flor yeast and show that combinations of different adaptive changes can lead to an increase of hydrophobicity and affect velum formation. PMID:25272156

  12. The finding of a group IIE phospholipase A2 gene in a specified segment of Protobothrops flavoviridis genome and its possible evolutionary relationship to group IIA phospholipase A2 genes.

    PubMed

    Yamaguchi, Kazuaki; Chijiwa, Takahito; Ikeda, Naoki; Shibata, Hiroki; Fukumaki, Yasuyuki; Oda-Ueda, Naoko; Hattori, Shosaku; Ohno, Motonori

    2014-01-01

    The genes encoding group IIE phospholipase A2, abbreviated as IIE PLA2, and its 5' and 3' flanking regions of Crotalinae snakes such as Protobothrops flavoviridis, P. tokarensis, P. elegans, and Ovophis okinavensis, were found and sequenced. The genes consisted of four exons and three introns and coded for 22 or 24 amino acid residues of the signal peptides and 134 amino acid residues of the mature proteins. These IIE PLA2s show high similarity to those from mammals and Colubridae snakes. The high expression level of IIE PLA2s in Crotalinae venom glands suggests that they should work as venomous proteins. The blast analysis indicated that the gene encoding OTUD3, which is ovarian tumor domain-containing protein 3, is located in the 3' downstream of IIE PLA2 gene. Moreover, a group IIA PLA2 gene was found in the 5' upstream of IIE PLA2 gene linked to the OTUD3 gene (OTUD3) in the P. flavoviridis genome. It became evident that the specified arrangement of IIA PLA2 gene, IIE PLA2 gene, and OTUD3 in this order is common in the genomes of humans to snakes. The present finding that the genes encoding various secretory PLA2s form a cluster in the genomes of humans to birds is closely related to the previous finding that six venom PLA2 isozyme genes are densely clustered in the so-called NIS-1 fragment of the P. flavoviridis genome. It is also suggested that venom IIA PLA2 genes may be evolutionarily derived from the IIE PLA2 gene. PMID:25529307

  13. The Finding of a Group IIE Phospholipase A2 Gene in a Specified Segment of Protobothrops flavoviridis Genome and Its Possible Evolutionary Relationship to Group IIA Phospholipase A2 Genes

    PubMed Central

    Yamaguchi, Kazuaki; Chijiwa, Takahito; Ikeda, Naoki; Shibata, Hiroki; Fukumaki, Yasuyuki; Oda-Ueda, Naoko; Hattori, Shosaku; Ohno, Motonori

    2014-01-01

    The genes encoding group IIE phospholipase A2, abbreviated as IIE PLA2, and its 5' and 3' flanking regions of Crotalinae snakes such as Protobothrops flavoviridis, P. tokarensis, P. elegans, and Ovophis okinavensis, were found and sequenced. The genes consisted of four exons and three introns and coded for 22 or 24 amino acid residues of the signal peptides and 134 amino acid residues of the mature proteins. These IIE PLA2s show high similarity to those from mammals and Colubridae snakes. The high expression level of IIE PLA2s in Crotalinae venom glands suggests that they should work as venomous proteins. The blast analysis indicated that the gene encoding OTUD3, which is ovarian tumor domain-containing protein 3, is located in the 3' downstream of IIE PLA2 gene. Moreover, a group IIA PLA2 gene was found in the 5' upstream of IIE PLA2 gene linked to the OTUD3 gene (OTUD3) in the P. flavoviridis genome. It became evident that the specified arrangement of IIA PLA2 gene, IIE PLA2 gene, and OTUD3 in this order is common in the genomes of humans to snakes. The present finding that the genes encoding various secretory PLA2s form a cluster in the genomes of humans to birds is closely related to the previous finding that six venom PLA2 isozyme genes are densely clustered in the so-called NIS-1 fragment of the P. flavoviridis genome. It is also suggested that venom IIA PLA2 genes may be evolutionarily derived from the IIE PLA2 gene. PMID:25529307

  14. Metagenomic identification of novel enteric viruses in urban wild rats and genome characterization of a group A rotavirus

    PubMed Central

    Sachsenröder, Jana; Braun, Anne; Machnowska, Patrycja; Ng, Terry Fei Fan; Deng, Xutao; Guenther, Sebastian; Bernstein, Samuel; Ulrich, Rainer G.; Delwart, Eric

    2014-01-01

    Rats are known as reservoirs and vectors for several zoonotic pathogens. However, information on the viruses shed by urban wild rats that could pose a zoonotic risk to human health is scare. Here, intestinal contents from 20 wild Norway rats (Rattus norvegicus) collected in the city of Berlin, Germany, were subjected to metagenomic analysis of viral nucleic acids. The determined faecal viromes of rats consisted of a variety of known and unknown viruses, and were highly variable among the individuals. Members of the families Parvoviridae and Picobirnaviridae represented the most abundant species. Novel picornaviruses, bocaviruses, sapoviruses and stool-associated circular ssDNA viruses were identified, which showed only low sequence identity to known representatives of the corresponding taxa. In addition, noroviruses and rotaviruses were detected as potential zoonotic gastroenteritis viruses. However, partial-genome sequence analyses indicated that the norovirus was closely related to the recently identified rat norovirus and the rotavirus B was closely related to the rat rotavirus strain IDIR; both viruses clustered separately from respective human virus strains in phylogenetic trees. In contrast, the rotavirus A sequences showed high identity to human and animal strains. Analysis of the nearly complete genome of this virus revealed the known genotypes G3, P[3] and N2 for three of the genome segments, whereas the remaining eight genome segments represented the novel genotypes I20–R11–C11–M10–A22–T14–E18–H13. Our results indicated a high heterogeneity of enteric viruses present in urban wild rats; their ability to be transmitted to humans remains to be assessed in the future. PMID:25121550

  15. New Group in the Leptospirillum Clade: Cultivation-Independent Community Genomics, Proteomics, and Transcriptomics of the New Species “Leptospirillum Group IV UBA BS”

    PubMed Central

    Dasari, Mauna; Thomas, Brian C.; Shah, Manesh B.; VerBerkmoes, Nathan C.; Hettich, Robert L.; Banfield, Jillian F.

    2013-01-01

    Leptospirillum spp. are widespread members of acidophilic microbial communities that catalyze ferrous iron oxidation, thereby increasing sulfide mineral dissolution rates. These bacteria play important roles in environmental acidification and are harnessed for bioleaching-based metal recovery. Known members of the Leptospirillum clade of the Nitrospira phylum are Leptospirillum ferrooxidans (group I), Leptospirillum ferriphilum and “Leptospirillum rubarum” (group II), and Leptospirillum ferrodiazotrophum (group III). In the Richmond Mine acid mine drainage (AMD) system, biofilm formation is initiated by L. rubarum; L. ferrodiazotrophum appears in later developmental stages. Here we used community metagenomic data from unusual, thick floating biofilms to identify distinguishing metabolic traits in a rare and uncultivated community member, the new species “Leptospirillum group IV UBA BS.” These biofilms typically also contain a variety of Archaea, Actinobacteria, and a few other Leptospirillum spp. The Leptospirillum group IV UBA BS species shares 98% 16S rRNA sequence identity and 70% average amino acid identity between orthologs with its closest relative, L. ferrodiazotrophum. The presence of nitrogen fixation and reverse tricarboxylic acid (TCA) cycle proteins suggest an autotrophic metabolism similar to that of L. ferrodiazotrophum, while hydrogenase proteins suggest anaerobic metabolism. Community transcriptomic and proteomic analyses demonstrate expression of a multicopper oxidase unique to this species, as well as hydrogenases and core metabolic genes. Results suggest that the Leptospirillum group IV UBA BS species might play important roles in carbon fixation, nitrogen fixation, hydrogen metabolism, and iron oxidation in some acidic environments. PMID:23645189

  16. A new group in the Leptospirillum clade: cultivation-independent community genomics, proteomics and transcriptomics of the new species Leptospirillum group IV UBA BS.

    SciTech Connect

    Goltsman, Daniela; Dasari, Mauna; Thomas, BC; Shah, Manesh B; Verberkmoes, Nathan C; Hettich, Robert {Bob} L; Banfield, Jillian F.

    2013-01-01

    Leptospirillum spp. are widespread members of acidophilic microbial communities that catalyze ferrous iron oxidation, thereby increasing sulfide mineral dissolution rates. These bacteria play important roles in environmental acidification and are harnessed for bioleaching-based metal recovery. Known members of the Leptospirillum clade of the Nitrospira phylum are Leptospirillum ferrooxidans (group I), Leptospirillum ferriphilum and Leptospirillum rubarum (group II), and Leptospirillum ferrodiazotrophum (group III). In the Richmond Mine acid mine drainage (AMD) system, biofilm formation is initiated by L. rubarum; L. ferrodiazotrophum appears in later developmental stages. Here we used community metagenomic data from unusual, thick floating biofilms to identify distinguishing metabolic traits in a rare and uncultivated community member, the new species Leptospirillum group IV UBA BS. These biofilms typically also contain a variety of Archaea, Actinobacteria, and a few other Leptospirillum spp. The Leptospirillum group IV UBA BS species shares 98% 16S rRNA sequence identity and 70% average amino acid identity between orthologs with its closest relative, L. ferrodiazotrophum. The presence of nitrogen fixation and reverse tricarboxylic acid (TCA) cycle proteins suggest an autotrophic metabolism similar to that of L. ferrodiazotrophum, while hydrogenase proteins suggest anaerobic metabolism. Community transcriptomic and proteomic analyses demonstrate expression of a multicopper oxidase unique to this species, as well as hydrogenases and core metabolic genes. Results suggest that the Leptospirillum group IV UBA BS species might play important roles in carbon fixation, nitrogen fixation, hydrogen metabolism, and iron oxidation in some acidic environments.

  17. NIH working group report—using genomic information to guide weight management: From universal to precision treatment

    PubMed Central

    Bray, Molly S; Loos, Ruth JF; McCaffery, Jeanne M; Ling, Charlotte; Franks, Paul W; Weinstock, George M; Snyder, Michael P; Vassy, Jason L; Agurs-Collins, Tanya

    2016-01-01

    Objective Precision medicine utilizes genomic and other data to optimize and personalize treatment. Although more than 2,500 genetic tests are currently available, largely for extreme and/or rare phenotypes, the question remains whether this approach can be used for the treatment of common, complex conditions like obesity, inflammation, and insulin resistance, which underlie a host of metabolic diseases. Methods This review, developed from a Trans-NIH Conference titled “Genes, Behaviors, and Response to Weight Loss Interventions,” provides an overview of the state of genetic and genomic research in the area of weight change and identifies key areas for future research. Results Although many loci have been identified that are associated with cross-sectional measures of obesity/body size, relatively little is known regarding the genes/loci that influence dynamic measures of weight change over time. Although successful short-term weight loss has been achieved using many different strategies, sustainable weight loss has proven elusive for many, and there are important gaps in our understanding of energy balance regulation. Conclusions Elucidating the molecular basis of variability in weight change has the potential to improve treatment outcomes and inform innovative approaches that can simultaneously take into account information from genomic and other sources in devising individualized treatment plans. PMID:26692578

  18. Two distinct groups of porcine enteropathogenic Escherichia coli strains of serogroup O45 are revealed by comparative genomic hybridization and virulence gene microarray

    PubMed Central

    Bruant, Guillaume; Zhang, Yongxiang; Garneau, Philippe; Wong, Justin; Laing, Chad; Fairbrother, John M; Gannon, Victor PJ; Harel, Josée

    2009-01-01

    Background Porcine enteropathogenic Escherichia coli (PEPEC) strains of serogroup O45 cause post-weaning diarrhea and produce characteristic attaching and effacing (A/E) lesions. Most O45 PEPEC strains possess the locus of enterocyte effacement (LEE), encoding the virulence factors required for production of A/E lesions, and often possess the paa gene, which is thought to contribute to the early stages of PEPEC pathogenicity. In this study, nine O45 PEPEC strains and a rabbit enteropathogenic (REPEC) strain, known to produce A/E lesions in vivo, were characterized using an E. coli O157-E. coli K12 whole genome microarray and a virulence gene-specific microarray, and by PCR experiments. Results Based on their virulence gene profiles, the 10 strains were considered to be atypical EPEC. The differences in their genomes pointed to the identification of two distinct evolutionary groups of O45 PEPEC, Groups I and II, and provided evidence for a contribution of these genetic differences to their virulence in pigs. Group I included the REPEC strain and four O45 PEPEC strains known to induce severe A/E lesions in challenged pigs whereas Group II was composed of the five other O45 PEPEC strains, which induced less severe or no A/E lesions in challenged pigs. Significant differences between Groups I and II were found with respect to the presence or absence of 50 O-Islands (OIs) or S-loops and 13 K-islands (KIs) or K-loops, including the virulence-associated islands OI#1 (S-loop#1), OI#47 (S-loop#71), OI#57 (S-loop#85), OI#71 (S-loop#108), OI#115, OI#122, and OI#154 (S-loop#253). Conclusion We have genetically characterized a collection of O45 PEPEC strains and classified them into two distinct groups. The differences in their virulence gene and genomic island content may influence the pathogenicity of O45 PEPEC strains, and explain why Group I O45 PEPEC strains induced more severe A/E lesions in explants and challenged pigs than Group II strains. PMID:19709428

  19. The Mitochondrial Genome of the Prasinophyte Prasinoderma coloniale Reveals Two Trans-Spliced Group I Introns in the Large Subunit rRNA Gene

    PubMed Central

    Pombert, Jean-François; Otis, Christian; Turmel, Monique; Lemieux, Claude

    2013-01-01

    Organelle genes are often interrupted by group I and or group II introns. Splicing of these mobile genetic occurs at the RNA level via serial transesterification steps catalyzed by the introns'own tertiary structures and, sometimes, with the help of external factors. These catalytic ribozymes can be found in cis or trans configuration, and although trans-arrayed group II introns have been known for decades, trans-spliced group I introns have been reported only recently. In the course of sequencing the complete mitochondrial genome of the prasinophyte picoplanktonic green alga Prasinoderma coloniale CCMP 1220 (Prasinococcales, clade VI), we uncovered two additional cases of trans-spliced group I introns. Here, we describe these introns and compare the 54,546 bp-long mitochondrial genome of Prasinoderma with those of four other prasinophytes (clades II, III and V). This comparison underscores the highly variable mitochondrial genome architecture in these ancient chlorophyte lineages. Both Prasinoderma trans-spliced introns reside within the large subunit rRNA gene (rnl) at positions where cis-spliced relatives, often containing homing endonuclease genes, have been found in other organelles. In contrast, all previously reported trans-spliced group I introns occur in different mitochondrial genes (rns or coxI). Each Prasinoderma intron is fragmented into two pieces, forming at the RNA level a secondary structure that resembles those of its cis-spliced counterparts. As observed for other trans-spliced group I introns, the breakpoint of the first intron maps to the variable loop L8, whereas that of the second is uniquely located downstream of P9.1. The breakpoint In each Prasinoderma intron corresponds to the same region where the open reading frame (ORF) occurs when present in cis-spliced orthologs. This correlation between the intron breakpoint and the ORF location in cis-spliced orthologs also holds for other trans-spliced introns; we discuss the possible implications

  20. Description of Bacillus toyonensis sp. nov., a novel species of the Bacillus cereus group, and pairwise genome comparisons of the species of the group by means of ANI calculations.

    PubMed

    Jiménez, Guillermo; Urdiain, Mercedes; Cifuentes, Ana; López-López, Aránzazu; Blanch, Anicet R; Tamames, Javier; Kämpfer, Peter; Kolstø, Anne-Brit; Ramón, Daniel; Martínez, Juan F; Codoñer, Francisco M; Rosselló-Móra, Ramon

    2013-09-01

    Strain BCT-7112(T) was isolated in 1966 in Japan from a survey designed to obtain naturally occurring microorganisms as pure cultures in the laboratory for use as probiotics in animal nutrition. This strain, which was primarily identified as Bacillus cereus var toyoi, has been in use for more than 30 years as the active ingredient of the preparation TOYOCERIN(®), an additive for use in animal nutrition (e.g. swine, poultry, cattle, rabbits and aquaculture). Despite the fact that the strain was initially classified as B. cereus, it showed significant genomic differences from the type strains of the B. cereus group that were large enough (ANI values below 92%) to allow it to be considered as a different species within the group. The polyphasic taxonomic study presented here provides sufficient discriminative parameters to classify BCT-7112(T) as a new species for which the name Bacillus toyonensis sp. nov. is proposed, with BCT-7112(T) (=CECT 876(T); =NCIMB 14858(T)) being designated as the type strain. In addition, a pairwise comparison between the available genomes of the whole B. cereus group by means of average nucleotide identity (ANI) calculations indicated that besides the eight classified species (including B. toyonensis), additional genomospecies could be detected, and most of them also had ANI values below 94%. ANI values were on the borderline of a species definition only in the cases of representatives of B. cereus versus B. thuringiensis, and B. mycoides and B. weihenstephanensis. PMID:23791203

  1. Genome sequence and analysis of a broad-host range lytic bacteriophage that infects the Bacillus cereus group

    PubMed Central

    2013-01-01

    Background Comparatively little information is available on members of the Myoviridae infecting low G+C content, Gram-positive host bacteria of the family Firmicutes. While numerous Bacillus phages have been isolated up till now only very few Bacillus cereus phages have been characterized in detail. Results Here we present data on the large, virulent, broad-host-range B. cereus phage vB_BceM_Bc431v3 (Bc431v3). Bc431v3 features a 158,618 bp dsDNA genome, encompassing 239 putative open reading frames (ORFs) and, 20 tRNA genes encoding 17 different amino acids. Since pulsed-field gel electrophoresis indicated that the genome of this phage has a mass of 155-158 kb Bc431v3 DNA appears not to contain long terminal repeats that are found in the genome of Bacillus phage SPO1. Conclusions Bc431v3 displays significant sequence similarity, at the protein level, to B. cereus phage BCP78, Listeria phage A511 and Enterococcus phage ØEF24C and other morphologically related phages infecting Firmicutes such as Staphylococcus phage K and Lactobacillus phage LP65. Based on these data we suggest that Bc431v3 should be included as a member of the Spounavirinae; however, because of all the diverse taxonomical information has been addressed recently, it is difficult to determine the genus. The Bc431v3 phage contains some highly unusual genes such as gp143 encoding putative tRNAHis guanylyltransferase. In addition, it carries some genes that appear to be related to the host sporulation regulators. These are: gp098, which encodes a putative segregation protein related to FstK/SpoIIIE DNA transporters; gp105, a putative segregation protein; gp108, RNA polymerase sigma factor F/B; and, gp109 encoding RNA polymerase sigma factor G. PMID:23388049

  2. Whole genomic analysis of human G1P[8] rotavirus strains from different age groups in China.

    PubMed

    Shintani, Tsuzumi; Ghosh, Souvik; Wang, Yuan-Hong; Zhou, Xuan; Zhou, Dun-Jin; Kobayashi, Nobumichi

    2012-08-01

    G1P[8] rotaviruses are an important cause of diarrhea in humans in China. To date, there are no reports on the whole genomic analysis of the Chinese G1P[8] rotaviruses. To determine the origin and overall genetic makeup of the recent Chinese G1P[8] strains, the whole genomes of three strains, RVA/Human-wt/CHN/E1911/2009/G1P[8], RVA/Human-tc/CHN/R588/2005/G1P[8] and RVA/Human-tc/CHN/Y128/2004/G1P[8], detected in an infant, a child and an adult, respectively, were analyzed. Strains E1911, R588 and Y128 exhibited a typical Wa-like genotype constellation. Except for the NSP3 gene of E1911, the whole genomes of strains E1911, R588 and Y128 were found to be more closely related to those of the recent Wa-like common human strains from different countries than those of the prototype G1P[8] strain, or other old strains. On the other hand, the NSP3 gene of E1911 was genetically distinct from those of Y128, R588, or other Wa-like common human strains, and appeared to share a common origin with those of the porcine-like human G9 strains, providing evidence for intergenotype reassortment events. Comparisons of the amino acid residues defining the VP7 and VP4 antigenic domains revealed several mismatches between these Chinese G1P[8] strains and the G1 and P[8] strains contained in the currently licensed rotavirus vaccines Rotarix(TM )and RotaTeq(TM). PMID:23012626

  3. Genome sequence and virulence factors of a group G Streptococcus dysgalactiae subsp. equisimilis strain with a new element carrying erm(B)

    PubMed Central

    Wang, Xiaohui; Zhang, Xiaoxia; Zong, Zhiyong

    2016-01-01

    A Streptococcus dysgalactiae subsp. equisimilis (SDSE) strain WCHSDSE-1, which caused an outbreak of tonsillopharyngitis among healthcare workers in China, was subjected to genome sequencing and analysis. WCHSDSE-1 belongs to the Lancefield group G, emm type stG211.1 and sequence type 44. WCHSDSE-1 has virulence factors for adherence, impairing the recruitment of neutrophils to infection sites and toxins including streptolysins O and S and exotoxin G. WCHSDSE-1 has a 45.4-kb element resembling a conjugative transposon. This element is absent from other known SDSE genomes and contains the macrolide-resistant gene erm(B). Conjugative transfer of erm(B) was not successful in mating experiments, suggesting that the element might have lost its ability of conjugation. An almost identical element, which contains the tetracycline-resistant gene tet(M) instead of erm(B), is present on the genome of Filifactor alocis ATCC 35896. The boundaries and insertion sites of the two elements were identified and both were flanked by a 3-bp direct repeat, which is characteristic of transposition. In conclusion, the spectrum of virulence factors of WCHSDSE-1 is similar to other SDSE strains causing invasive diseases. WCHSDSE-1 possesses a new transposable element encoding macrolide resistance, which could pick up different resistance genes and could be transferred across species in oral microflora. PMID:26843282

  4. Novel Association of ABO Histo-Blood Group Antigen with Soluble ICAM-1: Results of a Genome-Wide Association Study of 6,578 Women

    PubMed Central

    Paré, Guillaume; Chasman, Daniel I.; Kellogg, Mark; Zee, Robert Y. L.; Rifai, Nader; Badola, Sunita; Miletich, Joseph P.; Ridker, Paul M.

    2008-01-01

    While circulating levels of soluble Intercellular Adhesion Molecule 1 (sICAM-1) have been associated with diverse conditions including myocardial infarction, stroke, malaria, and diabetes, comprehensive analysis of the common genetic determinants of sICAM-1 is not available. In a genome-wide association study conducted among 6,578 participants in the Women's Genome Health Study, we find that three SNPs at the ICAM1 (19p13.2) locus (rs1799969, rs5498 and rs281437) are non-redundantly associated with plasma sICAM-1 concentrations at a genome-wide significance level (P<5×10−8), thus extending prior results from linkage and candidate gene studies. We also find that a single SNP (rs507666, P = 5.1×10−29) at the ABO (9q34.2) locus is highly correlated with sICAM-1 concentrations. The novel association at the ABO locus provides evidence for a previously unknown regulatory role of histo-blood group antigens in inflammatory adhesion processes. PMID:18604267

  5. Prophage Genomics

    PubMed Central

    Canchaya, Carlos; Proux, Caroline; Fournous, Ghislain; Bruttin, Anne; Brüssow, Harald

    2003-01-01

    The majority of the bacterial genome sequences deposited in the National Center for Biotechnology Information database contain prophage sequences. Analysis of the prophages suggested that after being integrated into bacterial genomes, they undergo a complex decay process consisting of inactivating point mutations, genome rearrangements, modular exchanges, invasion by further mobile DNA elements, and massive DNA deletion. We review the technical difficulties in defining such altered prophage sequences in bacterial genomes and discuss theoretical frameworks for the phage-bacterium interaction at the genomic level. The published genome sequences from three groups of eubacteria (low- and high-G+C gram-positive bacteria and γ-proteobacteria) were screened for prophage sequences. The prophages from Streptococcus pyogenes served as test case for theoretical predictions of the role of prophages in the evolution of pathogenic bacteria. The genomes from further human, animal, and plant pathogens, as well as commensal and free-living bacteria, were included in the analysis to see whether the same principles of prophage genomics apply for bacteria living in different ecological niches and coming from distinct phylogenetical affinities. The effect of selection pressure on the host bacterium is apparently an important force shaping the prophage genomes in low-G+C gram-positive bacteria and γ-proteobacteria. PMID:12794192

  6. Kelp Fly Virus: a Novel Group of Insect Picorna-Like Viruses as Defined by Genome Sequence Analysis and a Distinctive Virion Structure

    PubMed Central

    Hartley, C. J.; Greenwood, D. R.; Gilbert, R. J. C.; Masoumi, A.; Gordon, K. H. J.; Hanzlik, T. N.; Fry, E. E.; Stuart, D. I.; Scotti, P. D.

    2005-01-01

    The complete genomic sequence of kelp fly virus (KFV), originally isolated from the kelp fly, Chaetocoelopa sydneyensis, has been determined. Analyses of its genomic and structural organization and phylogeny show that it belongs to a hitherto undescribed group within the picorna-like virus superfamily. The single-stranded genomic RNA of KFV is 11,035 nucleotides in length and contains a single large open reading frame encoding a polypeptide of 3,436 amino acids with 5′ and 3′ untranslated regions of 384 and 343 nucleotides, respectively. The predicted amino acid sequence of the polypeptide shows that it has three regions. The N-terminal region contains sequences homologous to the baculoviral inhibitor of apoptosis repeat domain, an inhibitor of apoptosis commonly found in animals and in viruses with double-stranded DNA genomes. The second region contains at least two capsid proteins. The third region has three sequence motifs characteristic of replicase proteins of many plant and animal viruses, including a helicase, a 3C chymotrypsin-like protease, and an RNA-dependent RNA polymerase. Phylogenetic analysis of the replicase motifs shows that KFV forms a distinct and distant taxon within the picorna-like virus superfamily. Cryoelectron microscopy and image reconstruction of KFV to a resolution of 15 Å reveals an icosahedral structure, with each of its 12 fivefold vertices forming a turret from the otherwise smooth surface of the 20-Å-thick capsid. The architecture of the KFV capsid is unique among the members of the picornavirus superfamily for which structures have previously been determined. PMID:16227260

  7. Complete genome analyses of the first porcine rotavirus group H identified from a South African pig does not provide evidence for recent interspecies transmission events.

    PubMed

    Nyaga, Martin M; Peenze, Ina; Potgieter, Christiaan A; Seheri, L Mapaseka; Page, Nicola A; Yinda, Claude K; Steele, A Duncan; Matthijnssens, Jelle; Mphahlele, M Jeffrey

    2016-03-01

    Rotaviruses (RVs) are classified into eight species/groups (RVA-RVH) according to the migration patterns of their 11 genome segments, as well as by serological and molecular properties of Viral Protein 6 (VP6). In 1997 a new unclassified RV was reported infecting adults in Bangladesh and China. This virus was initially named novel adult diarrhoea rotavirus (ADRV-N), but later renamed as RVH. Since then, RVH has been detected in humans only very sporadically. However, RVH is increasingly being detected in pig populations in the USA, Brazil and Japan, but not yet in Africa. Unfortunately, whole genome sequence data of porcine RVH strains in GenBank is currently restricted to a single strain (SKA-1) from Japan. Porcine diarrhoeic samples were collected in South Africa and analysed for rotavirus using an RVA ELISA and electropherotyping by PAGE. One sample displayed a 4:2:1:1:1:1:1 migration pattern, typical for RVH. In order to further investigate this strain, sequence-independent amplification followed by random sequencing using the 454/Roche GS FLX Sequencer was performed, resulting in the second complete porcine RVH strain (MRC-DPRU1575) available in databases. Phylogenetically, all segments of MRC-DPRU1575 clustered closely with the SKA-1 strain and in some segments with known porcine RVH strains from Brazil and the USA. In contrast, the porcine RVH strains were only distantly related to human RVH strains from Asia and a partial RVH-like strain recently detected in bats from Cameroon. Overall, strain MRC-DPRU1575 is the first complete genome of a porcine RVH from Africa and allows for the development of improved RVH screening methods. Our analyses indicate that RVH strains cluster according to their host species, not suggesting any evidence of recent interspecies transmission events. However, more RVH genomes from a wider host range are needed to better understand their evolutionary pathways and zoonotic potential. PMID:26658066

  8. Whole-genome phylogenies of the family Bacillaceae and expansion of the sigma factor gene family in the Bacillus cereus species-group

    PubMed Central

    2011-01-01

    Background The Bacillus cereus sensu lato group consists of six species (B. anthracis, B. cereus, B. mycoides, B. pseudomycoides, B. thuringiensis, and B. weihenstephanensis). While classical microbial taxonomy proposed these organisms as distinct species, newer molecular phylogenies and comparative genome sequencing suggests that these organisms should be classified as a single species (thus, we will refer to these organisms collectively as the Bc species-group). How do we account for the underlying similarity of these phenotypically diverse microbes? It has been established for some time that the most rapidly evolving and evolutionarily flexible portions of the bacterial genome are regulatory sequences and transcriptional networks. Other studies have suggested that the sigma factor gene family of these organisms has diverged and expanded significantly relative to their ancestors; sigma factors are those portions of the bacterial transcriptional apparatus that control RNA polymerase recognition for promoter selection. Thus, examining sigma factor divergence in these organisms would concurrently examine both regulatory sequences and transcriptional networks important for divergence. We began this examination by comparison to the sigma factor gene set of B. subtilis. Results Phylogenetic analysis of the Bc species-group utilizing 157 single-copy genes of the family Bacillaceae suggests that several taxonomic revisions of the genus Bacillus should be considered. Within the Bc species-group there is little indication that the currently recognized species form related sub-groupings, suggesting that they are members of the same species. The sigma factor gene family encoded by the Bc species-group appears to be the result of a dynamic gene-duplication and gene-loss process that in previous analyses underestimated the true heterogeneity of the sigma factor content in the Bc species-group. Conclusions Expansion of the sigma factor gene family appears to have preferentially

  9. Comparative genomic analysis of catfish linkage group 8 reveals two homologous chromosomes in zebrafish and other teleosts with extensive inter-chromosomal rearrangements

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background Comparative genomics is a powerful tool to transfer genomic information from model species to related non-model species. Channel catfish (Ictalurus punctatus) is the primary aquaculture species in the United States. Its existing genome resources such as genomic sequences generated from n...

  10. Genome-Wide SNP Analysis of Southern African Populations Provides New Insights into the Dispersal of Bantu-Speaking Groups

    PubMed Central

    González-Santos, Miguel; Montinaro, Francesco; Oosthuizen, Ockie; Oosthuizen, Erica; Busby, George B.J.; Anagnostou, Paolo; Destro-Bisol, Giovanni; Pascali, Vincenzo; Capelli, Cristian

    2015-01-01

    The expansion of Bantu-speaking agropastoralist populations had a great impact on the genetic, linguistic, and cultural variation of sub-Saharan Africa. It is generally accepted that Bantu languages originated in an area around the present border between Cameroon and Nigeria approximately 5,000 years ago, from where they spread South and East becoming the largest African linguistic branch. The demic consequences of this event are reflected in the relatively high genetic homogeneity observed across most of sub-Saharan Africa populations. In this work, we explored genome-wide single nucleotide polymorphism data from 28 populations to characterize the genetic components present in sub-Saharan African populations. Combining novel data from four Southern African populations with previously published results, we reject the hypothesis that the “non-Bantu” genetic component reported in South-Eastern Africa (Mozambique) reflects extensive gene flow between incoming agriculturalist and resident hunter-gatherer communities. We alternatively suggest that this novel component is the result of demographic dynamics associated with the Bantu dispersal. PMID:26363465

  11. Seed colour loci, homoeology and linkage groups of the C genome chromosomes revealed in Brassica rapa–B. oleracea monosomic alien addition lines

    PubMed Central

    Heneen, Waheeb K.; Geleta, Mulatu; Brismar, Kerstin; Xiong, Zhiyong; Pires, J. Chris; Hasterok, Robert; Stoute, Andrew I.; Scott, Roderick J.; King, Graham J.; Kurup, Smita

    2012-01-01

    Background and Aims Brassica rapa and B. oleracea are the progenitors of oilseed rape B. napus. The addition of each chromosome of B. oleracea to the chromosome complement of B. rapa results in a series of monosomic alien addition lines (MAALs). Analysis of MAALs determines which B. oleracea chromosomes carry genes controlling specific phenotypic traits, such as seed colour. Yellow-seeded oilseed rape is a desirable breeding goal both for food and livestock feed end-uses that relate to oil, protein and fibre contents. The aims of this study included developing a missing MAAL to complement an available series, for studies on seed colour control, chromosome homoeology and assignment of linkage groups to B. oleracea chromosomes. Methods A new batch of B. rapa–B. oleracea aneuploids was produced to generate the missing MAAL. Seed colour and other plant morphological features relevant to differentiation of MAALs were recorded. For chromosome characterization, Snow's carmine, fluorescence in situ hybridization (FISH) and genomic in situ hybridization (GISH) were used. Key Results The final MAAL was developed. Morphological traits that differentiated the MAALs comprised cotyledon number, leaf morphology, flower colour and seed colour. Seed colour was controlled by major genes on two B. oleracea chromosomes and minor genes on five other chromosomes of this species. Homoeologous pairing was largely between chromosomes with similar centromeric positions. FISH, GISH and a parallel microsatellite marker analysis defined the chromosomes in terms of their linkage groups. Conclusions A complete set of MAALs is now available for genetic, genomic, evolutionary and breeding perspectives. Defining chromosomes that carry specific genes, physical localization of DNA markers and access to established genetic linkage maps contribute to the integration of these approaches, manifested in the confirmed correspondence of linkage groups with specific chromosomes. Applications include marker

  12. Genomic Analysis Reveals Multi-Drug Resistance Clusters in Group B Streptococcus CC17 Hypervirulent Isolates Causing Neonatal Invasive Disease in Southern Mainland China.

    PubMed

    Campisi, Edmondo; Rosini, Roberto; Ji, Wenjing; Guidotti, Silvia; Rojas-López, Maricarmen; Geng, Guozhu; Deng, Qiulian; Zhong, Huamin; Wang, Weidong; Liu, Haiying; Nan, Cassandra; Margarit, Immaculada; Rinaudo, C D

    2016-01-01

    Neonatal invasive disease caused by group B Streptococcus (GBS) represents a significant public health care concern globally. However, data related to disease burden, serotype distribution, and molecular epidemiology in China and other Asian countries are very few and specifically relative to confined regions. The aim of this study was to investigate the genetic characteristics of GBS isolates recovered from neonates with invasive disease during 2013-2014 at Guangzhou and Changsha hospitals in southern mainland China. We assessed the capsular polysaccharide type, pilus islands (PIs) distribution and hvgA gene presence in a panel of 26 neonatal clinical isolates, of which 8 were recovered from Early Onset Disease and 18 from Late Onset Disease (LOD). Among 26 isolates examined, five serotypes were identified. Type III was the most represented (15 cases), particularly among LOD strains (n = 11), followed by types Ib (n = 5), V (n = 3), Ia (n = 2) and II (n = 1). We performed whole-genome sequencing analysis and antimicrobial susceptibility testing on the 14 serotype III isolates belonging to the hypervirulent Clonal Complex 17 (serotype III-CC17). The presence of PI-2b alone was associated with 13 out of 14 serotype III-CC17 strains. Genome analysis led us to identify two multi-drug resistance gene clusters harbored in two new versions of integrative and conjugative elements (ICEs), carrying five or eight antibiotic resistance genes, respectively. These ICEs replaced the 16 kb-locus that normally contains the PI-1 operon. All isolates harboring the identified ICEs showed multiple resistances to aminoglycoside, macrolide, and tetracycline antibiotic classes. In conclusion, we report the first whole-genome sequence analysis of 14 GBS serotype III-CC17 strains isolated in China, representing the most prevalent lineage causing neonatal invasive disease. The acquisition of newly identified ICEs conferring multiple antibiotic resistance could in part explain the spread of

  13. Genomic Analysis Reveals Multi-Drug Resistance Clusters in Group B Streptococcus CC17 Hypervirulent Isolates Causing Neonatal Invasive Disease in Southern Mainland China

    PubMed Central

    Campisi, Edmondo; Rosini, Roberto; Ji, Wenjing; Guidotti, Silvia; Rojas-López, Maricarmen; Geng, Guozhu; Deng, Qiulian; Zhong, Huamin; Wang, Weidong; Liu, Haiying; Nan, Cassandra; Margarit, Immaculada; Rinaudo, C. D.

    2016-01-01

    Neonatal invasive disease caused by group B Streptococcus (GBS) represents a significant public health care concern globally. However, data related to disease burden, serotype distribution, and molecular epidemiology in China and other Asian countries are very few and specifically relative to confined regions. The aim of this study was to investigate the genetic characteristics of GBS isolates recovered from neonates with invasive disease during 2013–2014 at Guangzhou and Changsha hospitals in southern mainland China. We assessed the capsular polysaccharide type, pilus islands (PIs) distribution and hvgA gene presence in a panel of 26 neonatal clinical isolates, of which 8 were recovered from Early Onset Disease and 18 from Late Onset Disease (LOD). Among 26 isolates examined, five serotypes were identified. Type III was the most represented (15 cases), particularly among LOD strains (n = 11), followed by types Ib (n = 5), V (n = 3), Ia (n = 2) and II (n = 1). We performed whole-genome sequencing analysis and antimicrobial susceptibility testing on the 14 serotype III isolates belonging to the hypervirulent Clonal Complex 17 (serotype III-CC17). The presence of PI-2b alone was associated with 13 out of 14 serotype III-CC17 strains. Genome analysis led us to identify two multi-drug resistance gene clusters harbored in two new versions of integrative and conjugative elements (ICEs), carrying five or eight antibiotic resistance genes, respectively. These ICEs replaced the 16 kb-locus that normally contains the PI-1 operon. All isolates harboring the identified ICEs showed multiple resistances to aminoglycoside, macrolide, and tetracycline antibiotic classes. In conclusion, we report the first whole-genome sequence analysis of 14 GBS serotype III-CC17 strains isolated in China, representing the most prevalent lineage causing neonatal invasive disease. The acquisition of newly identified ICEs conferring multiple antibiotic resistance could in part explain the spread

  14. Sinorhizobium meliloti Phage ΦM9 Defines a New Group of T4 Superfamily Phages with Unusual Genomic Features but a Common T=16 Capsid

    PubMed Central

    Johnson, Matthew C.; Tatum, Kelsey B.; Lynn, Jason S.; Brewer, Tess E.; Lu, Stephen; Washburn, Brian K.

    2015-01-01

    ABSTRACT Relatively little is known about the phages that infect agriculturally important nitrogen-fixing rhizobial bacteria. Here we report the genome and cryo-electron microscopy structure of the Sinorhizobium meliloti-infecting T4 superfamily phage ΦM9. This phage and its close relative Rhizobium phage vB_RleM_P10VF define a new group of T4 superfamily phages. These phages are distinctly different from the recently characterized cyanophage-like S. meliloti phages of the ΦM12 group. Structurally, ΦM9 has a T=16 capsid formed from repeating units of an extended gp23-like subunit that assemble through interactions between one subunit and the adjacent E-loop insertion domain. Though genetically very distant from the cyanophages, the ΦM9 capsid closely resembles that of the T4 superfamily cyanophage Syn9. ΦM9 also has the same T=16 capsid architecture as the very distant phage SPO1 and the herpesviruses. Despite their overall lack of similarity at the genomic and structural levels, ΦM9 and S. meliloti phage ΦM12 have a small number of open reading frames in common that appear to encode structural proteins involved in interaction with the host and which may have been acquired by horizontal transfer. These proteins are predicted to encode tail baseplate proteins, tail fibers, tail fiber assembly proteins, and glycanases that cleave host exopolysaccharide. IMPORTANCE Despite recent advances in the phylogenetic and structural characterization of bacteriophages, only a small number of phages of plant-symbiotic nitrogen-fixing soil bacteria have been studied at the molecular level. The effects of phage predation upon beneficial bacteria that promote plant growth remain poorly characterized. First steps in understanding these soil bacterium-phage dynamics are genetic, molecular, and structural characterizations of these groups of phages. The T4 superfamily phages are among the most complex phages; they have large genomes packaged within an icosahedral head and a long

  15. Genome-wide RNA-Seq analysis of breast muscles of two broiler chicken groups differing in shear force.

    PubMed

    Piórkowska, K; Żukowski, K; Nowak, J; Połtowicz, K; Ropka-Molik, K; Gurgul, A

    2016-02-01

    In this study, a whole transcriptome analysis of breast muscles was conducted in broiler chicken groups differing in shear force. Shear force is a determinant of tenderness, which in turn is one of the most important parameters of meat quality in chickens. In our analysis, a total of 11,560 transcripts and 9824 genes per sample were identified. In chickens with more tender meat, up-regulation of 19 genes and down-regulation of 49 genes was observed. The up-regulated gene group included the ASB2 gene, which is probably involved in the meat conversion process, as its product results in the degradation of filamins, proteins which form muscle fibres. In the down-regulated gene group, genes which play a role in lipogenesis (THRSP, PLIN1) and in collagen synthesis (P4HA3, LEPREL4, PCOLCE2, COL16A1, COL20A1, VWA1) were detected. Their presence suggests the involvement of the extracellular matrix in the determination of meat tenderness. Thus, our study identified a pool of genes that may participate in the tenderisation process in broiler chickens. PMID:26592359

  16. The mitochondrial genome of Paraspadella gotoi is highly reduced and reveals that chaetognaths are a sister-group to protostomes

    SciTech Connect

    Helfenbein, Kevin G.; Fourcade, H. Matthew; Vanjani, Rohit G.; Boore, Jeffrey L.

    2004-05-01

    We report the first complete mitochondrial (mt) DNA sequence from a member of the phylum Chaetognatha (arrow worms). The Paraspadella gotoi mtDNA is highly unusual, missing 23 of the genes commonly found in animal mtDNAs, including atp6, which has otherwise been found universally to be present. Its 14 genes are unusually arranged into two groups, one on each strand. One group is punctuated by numerous non-coding intergenic nucleotides, while the other group is tightly packed, having no non-coding nucleotides, leading to speculation that there are two transcription units with differing modes of expression. The phylogenetic position of the Chaetognatha within the Metazoa has long been uncertain, with conflicting or equivocal results from various morphological analyses and rRNA sequence comparisons. Comparisons here of amino acid sequences from mitochondrially encoded proteins gives a single most parsimonious tree that supports a position of Chaetognatha as sister to the protostomes studied here. From this, one can more clearly interpret the patterns of evolution of various developmental features, especially regarding the embryological fate of the blastopore.

  17. The mitochondrial genome of Paraspadella gotoi is highly reduced and reveals that chaetognaths are a sister group to protostomes.

    PubMed

    Helfenbein, Kevin G; Fourcade, H Matthew; Vanjani, Rohit G; Boore, Jeffrey L

    2004-07-20

    We report the complete mtDNA sequence from a member of the phylum Chaetognatha (arrow worms). The Paraspadella gotoi mtDNA is highly unusual, missing 23 of the genes commonly found in animal mtDNAs, including atp6, which has otherwise been found universally to be present. Its 14 genes are unusually arranged into two groups, one on each strand. One group is punctuated by numerous noncoding intergenic nucleotides although the other group is tightly packed, having no noncoding nucleotides, leading to speculation that there are two transcription units with differing modes of expression. The phylogenetic position of the Chaetognatha within the Metazoa has long been uncertain, with conflicting or equivocal results from various morphological analyses and rRNA sequence comparisons. Comparisons here of amino acid sequences from mitochondrially encoded proteins give a single most parsimonious tree that supports a position of Chaetognatha as sister to the protostomes studied here. From this analysis, one can more clearly interpret the patterns of evolution of various developmental features, especially regarding the embryological fate of the blastopore. PMID:15249679

  18. Complete mitochondrial genome of Blue-crowned Parakeet (Aratinga acuticaudata)--phylogenetic position of the species among parrots group called Conures.

    PubMed

    Urantowka, Adam Dawid; Grabowski, Krzysztof Aleksander; Strzała, Tomasz

    2013-08-01

    Blue-crowned Parakeet (Aratinga acuticaudata) is a South American parrot species with a taxonomic position not confirmed by molecular studies. We sequenced full mitochondrial genome and constructed phylogenetic tree using sequences of mitochondrial ND2 gene from A. acuticaudata and some other representatives of Conures group. Our results confirmed previously described distribution of Aratinga species into three clades, but surprisingly did not classify Blue-crowned Parakeet to any of them. We found that A. acuticaudata shares the closest relationship with Diopsittaca nobilis and forms a separate clade together with Guaruba guarouba and Leptosittaca branickii species with a significant node. Our results confirm lack of monophyly of the genus Aratinga and underline the need of its taxonomic revision. PMID:23351080

  19. Evidence of Bacillus thuringiensis intra-serovar diversity revealed by Bacillus cereus group-specific repetitive extragenic palindromic sequence-based PCR genomic fingerprinting.

    PubMed

    Sauka, Diego H; Basile, Juan I; Benintende, Graciela

    2011-01-01

    Bacillus thuringiensis is classified into serovars on the basis of H-flagellar antigens. Several alternative typing methods have been described. Among them, a B. cereus group-specific repetitive extragenic palindromic (Rep)-PCR fingerprinting technique was shown to be discriminative and able to identify B. thuringiensis serovars. The aim of this study was to investigate the genomic diversity and relationship among B. thuringiensis strains collected from different Argentinean ecosystems. Thirty-seven B. thuringiensis reference strains and 131 Argentinean isolates were analyzed using a B. cereus group-specific Rep-PCR. Fourteen different patterns were identified among the Argentinean isolates. Eight could not be associated to any pattern obtained from a reference strain. The pattern identical to the serovar kurstaki HD-1 strain was the most frequently identified in 68 native isolates. The profiles allowed tracing a single dendrogram with two groups and eight main lineages. Some strains showed distinctive patterns despite belonging to the same serovar. An intraspecific diversity resulted from this analysis that was highlighted by this technique since strains from a given serovar showed distinct profiles. This study may help to establish a system of B. thuringiensis classification with a higher discrimination level than established by the H antigen serotyping. PMID:22286045

  20. Complete Genome Sequence of Pelosinus fermentans JBW45, a Member of a Remarkably Competitive Group of Negativicutes in the Firmicutes Phylum

    PubMed Central

    De León, Kara B.; Utturkar, Sagar M.; Camilleri, Laura B.; Elias, Dwayne A.; Arkin, Adam P.; Fields, Matthew W.; Brown, Steven D.

    2015-01-01

    The genome of Pelosinus fermentans JBW45, isolated from a chromium-contaminated site in Hanford, Washington, USA, has been completed with PacBio sequencing. Nine copies of the rRNA gene operon and multiple transposase genes with identical sequences resulted in breaks in the original draft genome and may suggest genomic instability of JBW45. PMID:26404608

  1. Complete Genome Sequence of Pelosinus fermentans JBW45, a Member of a Remarkably Competitive Group of Negativicutes in the Firmicutes Phylum

    DOE PAGESBeta

    De León, Kara B.; Utturkar, Sagar M.; Camilleri, Laura B.; Elias, Dwayne A.; Arkin, Adam P.; Fields, Matthew W.; Brown, Steven D.; Wall, Judy D.

    2015-01-01

    The genome of Pelosinus fermentans JBW45, isolated from a chromium-contaminated site in Hanford, Washington, USA, has been completed with PacBio sequencing. Finally, nine copies of the rRNA gene operon and multiple transposase genes with identical sequences resulted in breaks in the original draft genome and may suggest genomic instability of JBW45.

  2. Complete Genome Sequence of Pelosinus fermentans JBW45, a Member of a Remarkably Competitive Group of Negativicutes in the Firmicutes Phylum

    SciTech Connect

    De León, Kara B.; Utturkar, Sagar M.; Camilleri, Laura B.; Elias, Dwayne A.; Arkin, Adam P.; Fields, Matthew W.; Brown, Steven D.; Wall, Judy D.

    2015-09-24

    The genome of Pelosinus fermentans JBW45, isolated from a chromium-contaminated site in Hanford, Washington, USA, has been completed with PacBio sequencing. Finally, nine copies of the rRNA gene operon and multiple transposase genes with identical sequences resulted in breaks in the original draft genome and may suggest genomic instability of JBW45.

  3. Genomics:GTL Contractor-Grantee Workshop IV and Metabolic Engineering Working Group Inter-Agency Conference on Metabolic Engineering 2006

    SciTech Connect

    Mansfield, Betty Kay; Martin, Sheryl A

    2006-02-01

    Welcome to the 2006 joint meeting of the fourth Genomics:GTL Contractor-Grantee Workshop and the six Metabolic Engineering Working Group Inter-Agency Conference. The vision and scope of the Genomics:GTL program continue to expand and encompass research and technology issues from diverse scientific disciplines, attracting broad interest and support from researchers at universities, DOE national laboratories, and industry. Metabolic engineering's vision is the targeted and purposeful alteration of metabolic pathways to improve the understanding and use of cellular pathways for chemical transformation, energy transduction, and supramolecular assembly. These two programs have much complementarity in both vision and technological approaches, as reflected in this joint workshop. GLT's challenge to the scientific community remains the further development and use of a broad array of innovative technologies and computational tools to systematically leverage the knowledge and capabilities brought to us by DNA sequencing projects. The goal is to seek a broad and predictive understanding of the functioning and control of complex systems--individual microbes, microbial communities, and plants. GTL's prominent position at the interface of the physical, computational, and biological sciences is both a strength and challenge. Microbes remain GTL's principal biological focus. In the complex 'simplicity' of microbes, they find capabilities needed by DOE and the nation for clean and secure energy, cleanup of environmental contamination, and sequestration of atmospheric carbon dioxide that contributes to global warming. An ongoing challenge for the entire GTL community is to demonstrate that the fundamental science conducted in each of your research projects brings us a step closer to biology-based solutions for these important national energy and environmental needs.

  4. Whole-genomic analysis of 12 porcine group A rotaviruses isolated from symptomatic piglets in Brazil during the years of 2012-2013.

    PubMed

    Silva, Fernanda D F; Espinoza, Luis R L; Tonietti, Paloma O; Barbosa, Bruna R P; Gregori, Fabio

    2015-06-01

    Group A rotaviruses (RVAs) are leading causes of viral diarrhea in children and in the young of many animal species, particularly swine. In the current study, porcine RVAs were found in fecal specimens from symptomatic piglets on 4 farms in Brazil during the years of 2012-2013. Using RT-PCR, Sanger nucleotide sequencing, and phylogenetic analyses, the whole genomes of 12 Brazilian porcine RVA strains were analyzed. Specifically, the full-length open reading frame (ORF) sequences were determined for the NSP2-, NSP3-, and VP6-coding genes, and partial ORF sequences were determined for the VP1-, VP2-, VP3-, VP4-, VP7-, NSP1-, NSP4-, and NSP5/6-coding genes. The results indicate that all 12 strains had an overall porcine-RVA-like backbone with most segments being designated as genotype 1, with the exception of the VP6- and NSP1-coding genes, which were genotypes I5 and A8, respectively. These results add to our growing understanding of porcine RVA genetic diversity and will provide a platform for monitoring the role of animals as genetic reservoirs of emerging human RVAs strains. PMID:25796358

  5. A taxonomic framework for emerging groups of ecologically important marine gammaproteobacteria based on the reconstruction of evolutionary relationships using genome-scale data.

    PubMed

    Spring, Stefan; Scheuner, Carmen; Göker, Markus; Klenk, Hans-Peter

    2015-01-01

    In recent years a large number of isolates were obtained from saline environments that are phylogenetically related to distinct clades of oligotrophic marine gammaproteobacteria, which were originally identified in seawater samples using cultivation independent methods and are characterized by high seasonal abundances in coastal environments. To date a sound taxonomic framework for the classification of these ecologically important isolates and related species in accordance with their evolutionary relationships is missing. In this study we demonstrate that a reliable allocation of members of the oligotrophic marine gammaproteobacteria (OMG) group and related species to higher taxonomic ranks is possible by phylogenetic analyses of whole proteomes but also of the RNA polymerase beta subunit, whereas phylogenetic reconstructions based on 16S rRNA genes alone resulted in unstable tree topologies with only insignificant bootstrap support. The identified clades could be correlated with distinct phenotypic traits illustrating an adaptation to common environmental factors in their evolutionary history. Genome wide gene-content analyses revealed the existence of two distinct ecological guilds within the analyzed lineage of marine gammaproteobacteria which can be distinguished by their trophic strategies. Based on our results a novel order within the class Gammaproteobacteria is proposed, which is designated Cellvibrionales ord. nov. and comprises the five novel families Cellvibrionaceae fam. nov., Halieaceae fam. nov., Microbulbiferaceae fam. nov., Porticoccaceae fam. nov., and Spongiibacteraceae fam. nov. PMID:25914684

  6. A taxonomic framework for emerging groups of ecologically important marine gammaproteobacteria based on the reconstruction of evolutionary relationships using genome-scale data

    PubMed Central

    Spring, Stefan; Scheuner, Carmen; Göker, Markus; Klenk, Hans-Peter

    2015-01-01

    In recent years a large number of isolates were obtained from saline environments that are phylogenetically related to distinct clades of oligotrophic marine gammaproteobacteria, which were originally identified in seawater samples using cultivation independent methods and are characterized by high seasonal abundances in coastal environments. To date a sound taxonomic framework for the classification of these ecologically important isolates and related species in accordance with their evolutionary relationships is missing. In this study we demonstrate that a reliable allocation of members of the oligotrophic marine gammaproteobacteria (OMG) group and related species to higher taxonomic ranks is possible by phylogenetic analyses of whole proteomes but also of the RNA polymerase beta subunit, whereas phylogenetic reconstructions based on 16S rRNA genes alone resulted in unstable tree topologies with only insignificant bootstrap support. The identified clades could be correlated with distinct phenotypic traits illustrating an adaptation to common environmental factors in their evolutionary history. Genome wide gene-content analyses revealed the existence of two distinct ecological guilds within the analyzed lineage of marine gammaproteobacteria which can be distinguished by their trophic strategies. Based on our results a novel order within the class Gammaproteobacteria is proposed, which is designated Cellvibrionales ord. nov. and comprises the five novel families Cellvibrionaceae fam. nov., Halieaceae fam. nov., Microbulbiferaceae fam. nov., Porticoccaceae fam. nov., and Spongiibacteraceae fam. nov. PMID:25914684

  7. Microbial genomic taxonomy.

    PubMed

    Thompson, Cristiane C; Chimetto, Luciane; Edwards, Robert A; Swings, Jean; Stackebrandt, Erko; Thompson, Fabiano L

    2013-01-01

    A need for a genomic species definition is emerging from several independent studies worldwide. In this commentary paper, we discuss recent studies on the genomic taxonomy of diverse microbial groups and a unified species definition based on genomics. Accordingly, strains from the same microbial species share >95% Average Amino Acid Identity (AAI) and Average Nucleotide Identity (ANI), >95% identity based on multiple alignment genes, <10 in Karlin genomic signature, and > 70% in silico Genome-to-Genome Hybridization similarity (GGDH). Species of the same genus will form monophyletic groups on the basis of 16S rRNA gene sequences, Multilocus Sequence Analysis (MLSA) and supertree analysis. In addition to the established requirements for species descriptions, we propose that new taxa descriptions should also include at least a draft genome sequence of the type strain in order to obtain a clear outlook on the genomic landscape of the novel microbe. The application of the new genomic species definition put forward here will allow researchers to use genome sequences to define simultaneously coherent phenotypic and genomic groups. PMID:24365132

  8. Genome-Wide Association Study Identifies That the ABO Blood Group System Influences Interleukin-10 Levels and the Risk of Clinical Events in Patients with Acute Coronary Syndrome

    PubMed Central

    Johansson, Åsa; Alfredsson, Jenny; Eriksson, Niclas; Wallentin, Lars; Siegbahn, Agneta

    2015-01-01

    Introduction Acute coronary syndrome (ACS) is a major cause of mortality worldwide. We have previously shown that increased interleukin-10 (IL-10) levels are associated with poor outcome in ACS patients. Method We performed a genome-wide association study in 2864 ACS patients and 408 healthy controls, to identify genetic variants associated with IL-10 levels. Then haplotype analyses of the identified loci were done and comparisons to levels of IL-10 and other known ACS related biomarkers. Results Genetic variants at the ABO blood group locus associated with IL-10 levels (top SNP: rs676457, P = 4.4 × 10−10) were identified in the ACS patients. Haplotype analysis, using SNPs tagging the four main ABO antigens (A1, A2, B and O), showed that O and A2 homozygous individuals, or O/A2 heterozygotes have much higher levels of IL-10 compared to individuals with other antigen combinations. In the ACS patients, associations between ABO antigens and von Willebrand factor (VWF, P = 9.2 × 10−13), and soluble tissue factor (sTF, P = 8.6 × 10−4) were also found. In the healthy control cohort, the associations with VWF and sTF were similar to those in ACS patients (P = 1.2 × 10−15 and P = 1.0 × 10−5 respectively), but the healthy cohort showed no association with IL-10 levels (P>0.05). In the ACS patients, the O antigen was also associated with an increased risk of cardiovascular death, all causes of death, and recurrent myocardial infarction (odds ratio [OR] = 1.24–1.29, P = 0.029–0.00067). Conclusion Our results suggest that the ABO antigens play important roles, not only for the immunological response in ACS patients, but also for the outcome of the disease. PMID:26600159

  9. Genome sequence of the pink to light reddish-pigmented Rubellimicrobium mesophilum type strain (DSM 19309T), a representative of the Roseobacter group isolated from soil, and emended description of the species

    PubMed Central

    Riedel, Thomas; Spring, Stefan; Fiebig, Anne; Petersen, Jörn; Göker, Markus; Klenk, Hans-Peter

    2014-01-01

    Rubellimicrobium mesophilum Dastager et al. 2008 is a mesophilic and light reddish-pigmented representative of the Roseobacter group within the alphaproteobacterial family Rhodobacteraceae. Representatives of the Roseobacter group play an important role in the marine biogeochemical cycles and were found in a broad variety of marine environments associated with algal blooms, different kinds of sediments, and surfaces of invertebrates and vertebrates. Roseobacters were shown to be widely distributed, especially within the total bacterial community found in coastal waters, as well as in mixed water layers of the open ocean. Here we describe the features of R. mesophilum strain MSL-20T together with its genome sequence and annotation generated from a culture of DSM 19309T. The 4,927,676 bp genome sequence consists of one chromosome and probably one extrachromosomal element. It contains 5,082 protein-coding genes and 56 RNA genes. As previously reported, the G+C content is significantly different from the actual genome sequence-based G+C content and as the type strain tests positively for oxidase, the species description is emended accordingly. The genome was sequenced as part of the activities of the Transregional Collaborative Research Centre 51 (TRR51) funded by the German Research Foundation (DFG). PMID:25197472

  10. Complete Genome Characterization of Recent and Ancient Belgian Pig Group A Rotaviruses and Assessment of Their Evolutionary Relationship with Human Rotaviruses

    PubMed Central

    Heylen, Elisabeth; Zeller, Mark; Roukaerts, Inge D. M.; Desmarets, Lowiese M. B.; Van Ranst, Marc; Nauwynck, Hans J.; Matthijnssens, Jelle

    2014-01-01

    ABSTRACT Group A rotaviruses (RVAs) are an important cause of diarrhea in young pigs and children. An evolutionary relationship has been suggested to exist between pig and human RVAs. This hypothesis was further investigated by phylogenetic analysis of the complete genomes of six recent (G2P[27], G3P[6], G4P[7], G5P[7], G9P[13], and G9P[23]) and one historic (G1P[7]) Belgian pig RVA strains and of all completely characterized pig RVAs from around the globe. In contrast to the large diversity of genotypes found for the outer capsid proteins VP4 and VP7, a relatively conserved genotype constellation (I5-R1-C1-M1-A8-N1-T7-E1-H1) was found for the other 9 genes in most pig RVA strains. VP1, VP2, VP3, NSP2, NSP4, and NSP5 genes of porcine RVAs belonged to genotype 1, which is shared with human Wa-like RVAs. However, for most of these gene segments, pig strains clustered distantly from human Wa-like RVAs, indicating that viruses from both species have entered different evolutionary paths. However, VP1, VP2, and NSP3 genes of some archival human strains were moderately related to pig strains. Phylogenetic analysis of the VP6, NSP1, and NSP3 genes, as well as amino acid analysis of the antigenic regions of VP7, further confirmed this evolutionary segregation. The present results also indicate that the species barrier is less strict for pig P[6] strains but that chances for successful spread of these strains in the human population are hampered by the better adaptation of pig RVAs to pig enterocytes. However, future surveillance of pig and human RVA strains is warranted. IMPORTANCE Rotaviruses are an important cause of diarrhea in many species, including pigs and humans. Our understanding of the evolutionary relationship between rotaviruses from both species is limited by the lack of genomic data on pig strains. In this study, recent and ancient Belgian pig rotavirus isolates were sequenced, and their evolutionary relationship with human Wa-like strains was investigated

  11. UCSC genome browser tutorial.

    PubMed

    Zweig, Ann S; Karolchik, Donna; Kuhn, Robert M; Haussler, David; Kent, W James

    2008-08-01

    The University of California Santa Cruz (UCSC) Genome Bioinformatics website consists of a suite of free, open-source, on-line tools that can be used to browse, analyze, and query genomic data. These tools are available to anyone who has an Internet browser and an interest in genomics. The website provides a quick and easy-to-use visual display of genomic data. It places annotation tracks beneath genome coordinate positions, allowing rapid visual correlation of different types of information. Many of the annotation tracks are submitted by scientists worldwide; the others are computed by the UCSC Genome Bioinformatics group from publicly available sequence data. It also allows users to upload and display their own experimental results or annotation sets by creating a custom track. The suite of tools, downloadable data files, and links to documentation and other information can be found at http://genome.ucsc.edu/. PMID:18514479

  12. Draft Genome Sequence of a Dominant, Multidrug-Resistant Neisseria gonorrhoeae Strain, TCDC-NG08107, from a Sexual Group at High Risk of Acquiring Human Immunodeficiency Virus Infection and Syphilis▿

    PubMed Central

    Chen, Chun-Chen; Hsia, Ko-Chiang; Huang, Chung-Ter; Wong, Wing-Wai; Yen, Muh-Yong; Li, Lan-Hui; Lin, Kun-Yen; Chen, Kuo-Wei; Li, Shu-Ying

    2011-01-01

    Neisseria gonorrhoeae infection is the second major cause of sexually transmitted diseases worldwide. Development of resistance to multiple classes of antimicrobials in N. gonorrhoeae has compromised treatment and disease control. Herein, we report the availability of the draft genome sequence of a multidrug-resistant N. gonorrhoeae isolate, TCDC-NG08107, which spread in groups of men who have sex with men (MSM) in Taiwan. PMID:21257765

  13. Analysis of ATP6 sequence diversity in the Triticum-Aegilops group of species reveals the crucial role of rearrangement in mitochondrial genome evolution

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Mutation and chromosomal rearrangements are the two main forces of increasing genetic diversity for natural selection to act upon, and ultimately drive the evolutionary process. Although genome evolution is a function of both forces, simultaneously, the ratio of each can be varied among different ge...

  14. Matrix-Assisted Laser Desorption Ionization–Time of Flight and Comparative Genomic Analysis of M-18 Group A Streptococcus Strains Associated with an Acute Rheumatic Fever Outbreak in Northeast Italy in 2012 and 2013

    PubMed Central

    Scaltriti, Erika; Foschi, Claudio; Baggio, Enrico; Tamburini, Maria Vittoria; Creti, Roberta; Pascucci, Maria Grazia; Fagioni, Marco; Ambretti, Simone; Comandatore, Francesco; Landini, Maria Paola

    2015-01-01

    Acute rheumatic fever (ARF) is a postsuppurative sequela caused by Streptococcus pyogenes infections affecting school-age children. We describe here the occurrence of an ARF outbreak that occurred in Bologna province, northeastern Italy, between November 2012 and May 2013. Molecular analysis revealed that ARF-related group A Streptococcus (GAS) strains belonged to the M-18 serotype, including subtypes emm18.29 and emm18.32. All M-18 GAS strains shared the same antigenic profile, including SpeA, SpeB, SpeC, SpeL, SpeM, and SmeZ. Matrix-assisted laser desorption ionization–time of flight (MALDI-TOF) analysis revealed that M-18 GAS strains grouped separately from other serotypes, suggesting a different S. pyogenes lineage. Single nucleotide polymorphisms and phylogenetic analysis based on whole-genome sequencing showed that emm18.29 and emm18.32 GAS strains clustered in two distinct groups, highlighting genetic variations between these subtypes. Comparative analysis revealed a similar genome architecture between emm18.29 and emm18.32 strains that differed from noninvasive emm18.0 strains. The major sources of differences between M-18 genomes were attributable to the prophage elements. Prophage regions contained several virulence factors that could have contributed to the pathogenic potential of emm18.29 and emm18.32 strains. Notably, phage ΦSPBO.1 carried erythrogenic toxin A gene (speA1) in six ARF-related M-18 GAS strains but not in emm18.0 strains. In addition, a phage-encoded hyaluronidase gene (hylP.2) presented different variants among M-18 GAS strains by showing internal deletions located in the α-helical and TSβH regions. In conclusion, our study yielded insights into the genome structure of M-18 GAS strains responsible for the ARF outbreak in Italy, thus expanding our knowledge of this serotype. PMID:25740772

  15. The African Turquoise Killifish Genome Provides Insights into Evolution and Genetic Architecture of Lifespan.

    PubMed

    Valenzano, Dario Riccardo; Benayoun, Bérénice A; Singh, Param Priya; Zhang, Elisa; Etter, Paul D; Hu, Chi-Kuo; Clément-Ziza, Mathieu; Willemsen, David; Cui, Rongfeng; Harel, Itamar; Machado, Ben E; Yee, Muh-Ching; Sharp, Sabrina C; Bustamante, Carlos D; Beyer, Andreas; Johnson, Eric A; Brunet, Anne

    2015-12-01

    Lifespan is a remarkably diverse trait ranging from a few days to several hundred years in nature, but the mechanisms underlying the evolution of lifespan differences remain elusive. Here we de novo assemble a reference genome for the naturally short-lived African turquoise killifish, providing a unique resource for comparative and experimental genomics. The identification of genes under positive selection in this fish reveals potential candidates to explain its compressed lifespan. Several aging genes are under positive selection in this short-lived fish and long-lived species, raising the intriguing possibility that the same gene could underlie evolution of both compressed and extended lifespans. Comparative genomics and linkage analysis identify candidate genes associated with lifespan differences between various turquoise killifish strains. Remarkably, these genes are clustered on the sex chromosome, suggesting that short lifespan might have co-evolved with sex determination. Our study provides insights into the evolutionary forces that shape lifespan in nature. PMID:26638078

  16. Genome sequence of the pink–pigmented marine bacterium Loktanella hongkongensis type strain (UST950701–009PT), a representative of the Roseobacter group

    SciTech Connect

    Lau, Stanley CK; Riedel, Thomas; Fiebig, Anne; Han, James; Huntemann, Marcel; Petersen, Jörn; Ivanova, Natalia N.; Markowitz, Victor; Woyke, Tanja; Göker, Markus; Kyrpides, Nikos C.; Klenk, Hans-Peter; Qian, Pei-Yuan

    2015-08-11

    Loktanella hongkongensis UST950701-009PT is a Gram-negative, non-motile and rod-shaped bacterium isolated from a marine biofilm in the subtropical seawater of Hong Kong. When growing as a monospecies biofilm on polystyrene surfaces, this bacterium is able to induce larval settlement and metamorphosis of a ubiquitous polychaete tubeworm Hydroides elegans. The inductive cues are low-molecular weight compounds bound to the exopolymeric matrix of the bacterial cells. In the present study we describe the features of L. hongkongensis strain DSM 17492T together with its genome sequence and annotation and novel aspects of its phenotype. The 3,198,444 bp long genome sequence encodes 3104 protein-coding genes and 57 RNA genes. Lastly, the two unambiguously identified extrachromosomal replicons contain replication modules of the RepB and the Rhodobacteraceae-specific DnaA-like type, respectively.

  17. Genome sequence of the pink–pigmented marine bacterium Loktanella hongkongensis type strain (UST950701–009PT), a representative of the Roseobacter group

    DOE PAGESBeta

    Lau, Stanley CK; Riedel, Thomas; Fiebig, Anne; Han, James; Huntemann, Marcel; Petersen, Jörn; Ivanova, Natalia N.; Markowitz, Victor; Woyke, Tanja; Göker, Markus; et al

    2015-08-11

    Loktanella hongkongensis UST950701-009PT is a Gram-negative, non-motile and rod-shaped bacterium isolated from a marine biofilm in the subtropical seawater of Hong Kong. When growing as a monospecies biofilm on polystyrene surfaces, this bacterium is able to induce larval settlement and metamorphosis of a ubiquitous polychaete tubeworm Hydroides elegans. The inductive cues are low-molecular weight compounds bound to the exopolymeric matrix of the bacterial cells. In the present study we describe the features of L. hongkongensis strain DSM 17492T together with its genome sequence and annotation and novel aspects of its phenotype. The 3,198,444 bp long genome sequence encodes 3104 protein-codingmore » genes and 57 RNA genes. Lastly, the two unambiguously identified extrachromosomal replicons contain replication modules of the RepB and the Rhodobacteraceae-specific DnaA-like type, respectively.« less

  18. Genome sequence of the pink-pigmented marine bacterium Loktanella hongkongensis type strain (UST950701-009P(T)), a representative of the Roseobacter group.

    PubMed

    Lau, Stanley Ck; Riedel, Thomas; Fiebig, Anne; Han, James; Huntemann, Marcel; Petersen, Jörn; Ivanova, Natalia N; Markowitz, Victor; Woyke, Tanja; Göker, Markus; Kyrpides, Nikos C; Klenk, Hans-Peter; Qian, Pei-Yuan

    2015-01-01

    Loktanella hongkongensis UST950701-009P(T) is a Gram-negative, non-motile and rod-shaped bacterium isolated from a marine biofilm in the subtropical seawater of Hong Kong. When growing as a monospecies biofilm on polystyrene surfaces, this bacterium is able to induce larval settlement and metamorphosis of a ubiquitous polychaete tubeworm Hydroides elegans. The inductive cues are low-molecular weight compounds bound to the exopolymeric matrix of the bacterial cells. In the present study we describe the features of L. hongkongensis strain DSM 17492(T) together with its genome sequence and annotation and novel aspects of its phenotype. The 3,198,444 bp long genome sequence encodes 3104 protein-coding genes and 57 RNA genes. The two unambiguously identified extrachromosomal replicons contain replication modules of the RepB and the Rhodobacteraceae-specific DnaA-like type, respectively. PMID:26380639

  19. Genomics of sorghum.

    PubMed

    Paterson, Andrew H

    2008-01-01

    Sorghum (Sorghum bicolor (L.) Moench) is a subject of plant genomics research based on its importance as one of the world's leading cereal crops, a biofuels crop of high and growing importance, a progenitor of one of the world's most noxious weeds, and a botanical model for many tropical grasses with complex genomes. A rich history of genome analysis, culminating in the recent complete sequencing of the genome of a leading inbred, provides a foundation for invigorating progress toward relating sorghum genes to their functions. Further characterization of the genomes other than Saccharinae cereals may shed light on mechanisms, levels, and patterns of evolution of genome size and structure, laying the foundation for further study of sugarcane and other economically important members of the group. PMID:18483564

  20. A Preliminary Genome-Wide Association Study of Acute Mountain Sickness Susceptibility in a Group of Nepalese Pilgrims Ascending to 4380 m.

    PubMed

    MacInnis, Martin J; Widmer, Nadia; Timulsina, Utsav; Subedi, Ankita; Siwakoti, Ashmita; Pandit, Bidur Prasad; Freeman, Michael G; Carter, Eric A; Manokhina, Irina; Thapa, Ghan Bahadur; Koehle, Michael S

    2015-12-01

    There is significant interindividual variation in acute mountain sickness (AMS) susceptibility in humans. To identify genes related to AMS susceptibility, we used a genome-wide association study (GWAS) to simultaneously test associations between genetic variants dispersed throughout the genome and the presence and severity of AMS. DNA samples were collected from subjects who ascended rapidly to Gosainkunda, Nepal (4380 m), as part of the 2005, 2010, and 2012 Janai Purnima festivals. The Lake Louise Score was used to measure AMS severity. The primary analysis was based on 99 male subjects (43 with AMS; 56 without AMS). Genotyping for the GWAS was performed using Infinium Human Core Exome Bead Chips (542,556 single-nucleotide polymorphisms were assayed), and validation genotyping was performed with pyrosequencing in two additional cohorts (n = 101 for each). In total, 270,389 single nucleotide polymorphisms (SNPs) passed quality control, and 4 SNPs (one intronic, three nonsynonymous) in the FAM149A gene were associated with AMS severity after correcting for multiple hypothesis testing (p = 1.8E-7); however, in the validation cohorts, FAM149A was not associated with the presence or severity of AMS. No other genes were associated with AMS susceptibility at the genome-wide level. Due to the large influence of environmental factors (i.e., ascent rate and altitude attained) and the difficulties associated with the AMS phenotype (i.e., low repeatability, nonspecific symptoms, potentially independent ailments), we suggest that future studies addressing the variation in the acute human hypoxia response should focus on objective responses to acute hypoxia instead of AMS. PMID:26600424

  1. Experimental RNomics and genomic comparative analysis reveal a large group of species-specific small non-message RNAs in the silkworm Bombyx mori

    PubMed Central

    Li, Dandan; Wang, Yanhong; Zhang, Kun; Jiao, Zhujin; Zhu, Xiaopeng; Skogerboe, Geir; Guo, Xiangqian; Chinnusamy, Viswanathan; Bi, Lijun; Huang, Yongping; Dong, Shuanglin; Chen, Runsheng; Kan, Yunchao

    2011-01-01

    Accumulating evidences show that small non-protein coding RNAs (ncRNAs) play important roles in development, stress response and other cellular processes. The silkworm is an important model for studies on insect genetics and control of lepidopterous pests. Here, we have performed the first systematic identification and analysis of intermediate size ncRNAs (50–500 nt) in the silkworm. We identified 189 novel ncRNAs, including 141 snoRNAs, six snRNAs, three tRNAs, one SRP and 38 unclassified ncRNAs. Forty ncRNAs showed significantly altered expression during silkworm development or across specific stage transitions. Genomic comparisons revealed that 123 of these ncRNAs are potentially silkworm-specific. Analysis of the genomic organization of the ncRNA loci showed that 32.62% of the novel snoRNA loci are intergenic, and that all the intronic snoRNAs follow the pattern of one-snoRNA-per-intron. Target site analysis predicted a total of 95 2′-O-methylation and pseudouridylation modification sites of rRNAs, snRNAs and tRNAs. Together, these findings provide new clues for future functional study of ncRNA during insect development and evolution. PMID:21227919

  2. Genomic analyses of cherry rusty mottle group and cherry twisted leaf-associated viruses reveal a possible new genus within the family betaflexiviridae.

    PubMed

    Villamor, D E V; Susaimuthu, J; Eastwell, K C

    2015-03-01

    It is demonstrated that closely related viruses within the family Betaflexiviridae are associated with a number of diseases that affect sweet cherry (Prunus avium) and other Prunus spp. Cherry rusty mottle-associated virus (CRMaV) is correlated with the appearance of cherry rusty mottle disease (CRMD), and Cherry twisted leaf-associated virus (CTLaV) is linked to cherry twisted leaf disease (CTLD) and apricot ringpox disease (ARPD). Comprehensive analysis of previously reported full genomic sequences plus those determined in this study representing isolates of CTLaV, CRMaV, Cherry green ring mottle virus, and Cherry necrotic rusty mottle virus revealed segregation of sequences into four clades corresponding to distinct virus species. High-throughput sequencing of RNA from representative source trees for CRMD, CTLD, and ARPD did not reveal additional unique virus sequences that might be associated with these diseases, thereby further substantiating the association of CRMaV and CTLaV with CRMD and CTLD or ARPD, respectively. Based on comparison of the nucleotide and amino acid sequence identity values, phylogenetic relationships with other triple-gene block-coding viruses within the family Betaflexiviridae, genome organization, and natural host range, a new genus (Robigovirus) is suggested. PMID:25496302

  3. The complete nucleotide sequence of the cassava (Manihot esculenta) chloroplast genome and the evolution of atpF in Malpighiales: RNA editing and multiple losses of a group II intron

    PubMed Central

    Wurdack, Kenneth J.; Kanagaraj, Anderson; Lee, Seung-Bum; Saski, Christopher; Jansen, Robert K.

    2008-01-01

    The complete sequence of the chloroplast genome of cassava (Manihot esculenta, Euphorbiaceae) has been determined. The genome is 161,453 bp in length and includes a pair of inverted repeats (IR) of 26,954 bp. The genome includes 128 genes; 96 are single copy and 16 are duplicated in the IR. There are four rRNA genes and 30 distinct tRNAs, seven of which are duplicated in the IR. The infA gene is absent; expansion of IRb has duplicated 62 amino acids at the 3′ end of rps19 and a number of coding regions have large insertions or deletions, including insertions within the 23S rRNA gene. There are 17 intron-containing genes in cassava, 15 of which have a single intron while two (clpP, ycf3) have two introns. The usually conserved atpF group II intron is absent and this is the first report of its loss from land plant chloroplast genomes. The phylogenetic distribution of the atpF intron loss was determined by a PCR survey of 251 taxa representing 34 families of Malpighiales and 16 taxa from closely related rosids. The atpF intron is not only missing in cassava but also from closely related Euphorbiaceae and other Malpighiales, suggesting that there have been at least seven independent losses. In cassava and all other sequenced Malphigiales, atpF gene sequences showed a strong association between C-to-T substitutions at nucleotide position 92 and the loss of the intron, suggesting that recombination between an edited mRNA and the atpF gene may be a possible mechanism for the intron loss. PMID:18214421

  4. Genomics and functional genomics with haloarchaea.

    PubMed

    Soppa, J; Baumann, A; Brenneis, M; Dambeck, M; Hering, O; Lange, C

    2008-09-01

    The first haloarchaeal genome was published in 2000 and today five genome sequences are available. Transcriptome and proteome analyses have been established for two and three haloarchaeal species, respectively, and more than 20 studies using these functional genomic approaches have been published in the last two years. These studies gave global overviews of metabolic regulation (aerobic and anaerobic respiration, phototrophy, carbon source usage), stress response (UV, X-rays, transition metals, osmotic and temperature stress), cell cycle-dependent transcript level regulation, and transcript half-lives. The only translatome analysis available for any prokaryotic species revealed that 10 and 20% of all transcripts are translationally regulated in Haloferax volcanii and Halobacterium salinarum, respectively. Very effective methods for the construction of in frame deletion mutants have been established recently for haloarchaea and are intensively used to unravel the biological roles of genes in this group. Bioinformatic analyses include both cross-genome comparisons as well as integration of genomic data with experimental results. The first systems biology approaches have been performed that used experimental data to construct predictive models of gene expression and metabolism, respectively. In this contribution the current status of genomics, functional genomics, and molecular genetics of haloarchaea is summarized and selected examples are discussed. PMID:18493745

  5. Genome Data Mining and Soil Survey for the Novel Group 5 [NiFe]-Hydrogenase To Explore the Diversity and Ecological Importance of Presumptive High-Affinity H2-Oxidizing Bacteria ▿†

    PubMed Central

    Constant, Philippe; Chowdhury, Soumitra Paul; Hesse, Laura; Pratscher, Jennifer; Conrad, Ralf

    2011-01-01

    Streptomyces soil isolates exhibiting the unique ability to oxidize atmospheric H2 possess genes specifying a putative high-affinity [NiFe]-hydrogenase. This study was undertaken to explore the taxonomic diversity and the ecological importance of this novel functional group. We propose to designate the genes encoding the small and large subunits of the putative high-affinity hydrogenase hhyS and hhyL, respectively. Genome data mining revealed that the hhyL gene is unevenly distributed in the phyla Actinobacteria, Proteobacteria, Chloroflexi, and Acidobacteria. The hhyL gene sequences comprised a phylogenetically distinct group, namely, the group 5 [NiFe]-hydrogenase genes. The presumptive high-affinity H2-oxidizing bacteria constituting group 5 were shown to possess a hydrogenase gene cluster, including the genes encoding auxiliary and structural components of the enzyme and four additional open reading frames (ORFs) of unknown function. A soil survey confirmed that both high-affinity H2 oxidation activity and the hhyL gene are ubiquitous. A quantitative PCR assay revealed that soil contained 106 to 108 hhyL gene copies g (dry weight)−1. Assuming one hhyL gene copy per genome, the abundance of presumptive high-affinity H2-oxidizing bacteria was higher than the maximal population size for which maintenance energy requirements would be fully supplied through the H2 oxidation activity measured in soil. Our data indicate that the abundance of the hhyL gene should not be taken as a reliable proxy for the uptake of atmospheric H2 by soil, because high-affinity H2 oxidation is a facultatively mixotrophic metabolism, and microorganisms harboring a nonfunctional group 5 [NiFe]-hydrogenase may occur. PMID:21742924

  6. Haemonchus contortus: Genome Structure, Organization and Comparative Genomics.

    PubMed

    Laing, R; Martinelli, A; Tracey, A; Holroyd, N; Gilleard, J S; Cotton, J A

    2016-01-01

    One of the first genome sequencing projects for a parasitic nematode was that for Haemonchus contortus. The open access data from the Wellcome Trust Sanger Institute provided a valuable early resource for the research community, particularly for the identification of specific genes and genetic markers. Later, a second sequencing project was initiated by the University of Melbourne, and the two draft genome sequences for H. contortus were published back-to-back in 2013. There is a pressing need for long-range genomic information for genetic mapping, population genetics and functional genomic studies, so we are continuing to improve the Wellcome Trust Sanger Institute assembly to provide a finished reference genome for H. contortus. This review describes this process, compares the H. contortus genome assemblies with draft genomes from other members of the strongylid group and discusses future directions for parasite genomics using the H. contortus model. PMID:27238013

  7. Direct Measurement of Hg(II) Removal from Organomercurial Lyase (MerB) by Tryptophan Fluorescence: NmerA Domain of Co-evolved γ -Proteobacterial Mercuric Ion Reductase (MerA) Is More Efficient than MerA Catalytic Core or Glutathione†

    PubMed Central

    Hong, Baoyu; Nauss, Rachel; Harwood, Ian; Miller, Susan M.

    2011-01-01

    Aerobic and facultative bacteria and archaea harboring mer loci exhibit resistance to the toxic effects of Hg(II) and organomercurials [RHg(I)]. In broad spectrum resistance, RHg(I) is converted to less toxic Hg(0) in the cytosol by the sequential action of organomercurial lyase (MerB: RHg(I) --> RH + Hg(II)) and mercuric ion reductase (MerA: Hg(II) --> Hg(0)) enzymes, requiring transfer of Hg(II) from MerB to MerA. Although previous studies with γ-proteobacterial versions of MerA and a non-physiological Hg(II)-DTT-MerB complex qualitatively support a pathway for direct transfer between proteins, assessment of the relative efficiencies of Hg(II) transfer to the two different di-cysteine motifs in γ-proteobacterial MerA and to competing cellular thiol is lacking. Here we show the intrinsic tryptophan fluorescence of γ-proteobacterial MerB is sensitive to Hg(II) binding and use this to probe the kinetics of Hg(II) removal from MerB by the N-terminal domain (NmerA) and catalytic core C-terminal cysteine pairs of its co-evolved MerA, and by glutathione (GSH), the major competing cellular thiol in γ-proteobacteria. At physiologically relevant concentrations, reaction with a 10-fold excess NmerA over HgMerB removes ≥ 92% Hg(II), while similar extents of reaction require more than 1000-fold excess of GSH. Kinetically, the apparent second order rate constant for Hg(II) transfer from MerB to NmerA, at 2.3 ± 0.1 × 104 M−1 s−1 is ~ 100-fold greater than that for GSH (1.2 ± 0.2 × 102 M−1 s−1) or the MerA catalytic core (1.2 × 102 M−1 s−1), establishing transfer to the metallochaperone-like NmerA domain as the kinetically favored pathway in this co-evolved system. PMID:20722420

  8. High Mobility Group Protein N5 (HMGN5) and Lamina-associated Polypeptide 2α (LAP2α) Interact and Reciprocally Affect Their Genome-wide Chromatin Organization*

    PubMed Central

    Zhang, Shaofei; Schones, Dustin E.; Malicet, Cedric; Rochman, Mark; Zhou, Ming; Foisner, Roland; Bustin, Michael

    2013-01-01

    The interactions of nuclear lamins with the chromatin fiber play an important role in regulating nuclear architecture and chromatin function; however, the full spectrum of these interactions is not known. We report that the N-terminal domain of the nucleosome-binding protein HMGN5 interacts with the C-terminal domain of the lamin-binding protein LAP2α and that these proteins reciprocally alter their interaction with chromatin. Chromatin immunoprecipitation analysis of cells lacking either HMGN5 or LAP2α reveals that loss of either protein affects the genome-wide distribution of the remaining partner. Our study identifies a new functional link between chromatin-binding and lamin-binding proteins. PMID:23673662

  9. Quantifying dynamical spillover in co-evolving multiplex networks

    PubMed Central

    Vijayaraghavan, Vikram S.; Noël, Pierre-André; Maoz, Zeev; D’Souza, Raissa M.

    2015-01-01

    Multiplex networks (a system of multiple networks that have different types of links but share a common set of nodes) arise naturally in a wide spectrum of fields. Theoretical studies show that in such multiplex networks, correlated edge dynamics between the layers can have a profound effect on dynamical processes. However, how to extract the correlations from real-world systems is an outstanding challenge. Here we introduce the Multiplex Markov chain to quantify correlations in edge dynamics found in longitudinal data of multiplex networks. By comparing the results obtained from the multiplex perspective to a null model which assumes layers in a network are independent, we can identify real correlations as distinct from simultaneous changes that occur due to random chance. We use this approach on two different data sets: the network of trade and alliances between nation states, and the email and co-commit networks between developers of open source software. We establish the existence of “dynamical spillover” showing the correlated formation (or deletion) of edges of different types as the system evolves. The details of the dynamics over time provide insight into potential causal pathways. PMID:26459949

  10. Quantifying dynamical spillover in co-evolving multiplex networks

    NASA Astrophysics Data System (ADS)

    Vijayaraghavan, Vikram S.; Noël, Pierre-André; Maoz, Zeev; D'Souza, Raissa M.

    2015-10-01

    Multiplex networks (a system of multiple networks that have different types of links but share a common set of nodes) arise naturally in a wide spectrum of fields. Theoretical studies show that in such multiplex networks, correlated edge dynamics between the layers can have a profound effect on dynamical processes. However, how to extract the correlations from real-world systems is an outstanding challenge. Here we introduce the Multiplex Markov chain to quantify correlations in edge dynamics found in longitudinal data of multiplex networks. By comparing the results obtained from the multiplex perspective to a null model which assumes layers in a network are independent, we can identify real correlations as distinct from simultaneous changes that occur due to random chance. We use this approach on two different data sets: the network of trade and alliances between nation states, and the email and co-commit networks between developers of open source software. We establish the existence of “dynamical spillover” showing the correlated formation (or deletion) of edges of different types as the system evolves. The details of the dynamics over time provide insight into potential causal pathways.

  11. Genome Size and Species Diversification

    PubMed Central

    2010-01-01

    Theoretically, there are reasons to believe that large genome size should favour speciation. Several major factors contributing to genome size, such as duplications and transposable element activity have been proposed to facilitate the formation of new species. However, it is also possible that small genome size promotes speciation. For example, selection for genome reduction may be resolved in different ways in incipient species, leading to incompatibilities. Mutations and chromosomal rearrangements may also be more stably inherited in smaller genomes. Here I review the following lines of empirical evidence bearing on this question: (i) Correlations between genome size and species richness of taxa are often negative. (ii) Fossil evidence in lungfish shows that the accumulation of DNA in the genomes of this group coincided with a reduction in species diversity. (iii) Estimates of speciation interval in mammals correlate positively with genome size. (iv) Genome reductions are inferred at the base of particular species radiations and genome expansions at the base of others. (v) Insect clades that have been increasing in diversity up to the present have smaller genomes than clades that have remained stable or have decreased in diversity. The general pattern emerging from these observations is that higher diversification rates are generally found in small-genome taxa. Since diversification rates are the net effect of speciation and extinction, large genomes may thus either constrain speciation rate, increase extinction rate, or both. I argue that some of the cited examples are unlikely to be explained by extinction alone. PMID:22140283

  12. Genomic taxonomy of vibrios

    PubMed Central

    Thompson, Cristiane C; Vicente, Ana Carolina P; Souza, Rangel C; Vasconcelos, Ana Tereza R; Vesth, Tammi; Alves, Nelson; Ussery, David W; Iida, Tetsuya; Thompson, Fabiano L

    2009-01-01

    Background Vibrio taxonomy has been based on a polyphasic approach. In this study, we retrieve useful taxonomic information (i.e. data that can be used to distinguish different taxonomic levels, such as species and genera) from 32 genome sequences of different vibrio species. We use a variety of tools to explore the taxonomic relationship between the sequenced genomes, including Multilocus Sequence Analysis (MLSA), supertrees, Average Amino Acid Identity (AAI), genomic signatures, and Genome BLAST atlases. Our aim is to analyse the usefulness of these tools for species identification in vibrios. Results We have generated four new genome sequences of three Vibrio species, i.e., V. alginolyticus 40B, V. harveyi-like 1DA3, and V. mimicus strains VM573 and VM603, and present a broad analyses of these genomes along with other sequenced Vibrio species. The genome atlas and pangenome plots provide a tantalizing image of the genomic differences that occur between closely related sister species, e.g. V. cholerae and V. mimicus. The vibrio pangenome contains around 26504 genes. The V. cholerae core genome and pangenome consist of 1520 and 6923 genes, respectively. Pangenomes might allow different strains of V. cholerae to occupy different niches. MLSA and supertree analyses resulted in a similar phylogenetic picture, with a clear distinction of four groups (Vibrio core group, V. cholerae-V. mimicus, Aliivibrio spp., and Photobacterium spp.). A Vibrio species is defined as a group of strains that share > 95% DNA identity in MLSA and supertree analysis, > 96% AAI, ≤ 10 genome signature dissimilarity, and > 61% proteome identity. Strains of the same species and species of the same genus will form monophyletic groups on the basis of MLSA and supertree. Conclusion The combination of different analytical and bioinformatics tools will enable the most accurate species identification through genomic computational analysis. This endeavour will culminate in the birth of the online

  13. Insights into conifer giga-genomes.

    PubMed

    De La Torre, Amanda R; Birol, Inanc; Bousquet, Jean; Ingvarsson, Pär K; Jansson, Stefan; Jones, Steven J M; Keeling, Christopher I; MacKay, John; Nilsson, Ove; Ritland, Kermit; Street, Nathaniel; Yanchuk, Alvin; Zerbe, Philipp; Bohlmann, Jörg

    2014-12-01

    Insights from sequenced genomes of major land plant lineages have advanced research in almost every aspect of plant biology. Until recently, however, assembled genome sequences of gymnosperms have been missing from this picture. Conifers of the pine family (Pinaceae) are a group of gymnosperms that dominate large parts of the world's forests. Despite their ecological and economic importance, conifers seemed long out of reach for complete genome sequencing, due in part to their enormous genome size (20-30 Gb) and the highly repetitive nature of their genomes. Technological advances in genome sequencing and assembly enabled the recent publication of three conifer genomes: white spruce (Picea glauca), Norway spruce (Picea abies), and loblolly pine (Pinus taeda). These genome sequences revealed distinctive features compared with other plant genomes and may represent a window into the past of seed plant genomes. This Update highlights recent advances, remaining challenges, and opportunities in light of the publication of the first conifer and gymnosperm genomes. PMID:25349325

  14. Molecular cloning, genomic structure, and tissue distribution of EW135, a novel chicken egg white protein with group B scavenger receptor cysteine-rich domains.

    PubMed

    Yoo, Whayoung; Nakamura, Tomohiro; Asanuma, Hideki; Matsushita, Misao

    2013-11-01

    Approximately 80 proteins are reported to be present in chicken egg white. The major function of egg white proteins isolated so far is to defend the egg yolk against infections. We recently isolated a novel protein termed EW135 from chicken egg white. In this paper, we have determined the complete amino acid sequence of EW135 based on cDNA cloning. EW135 consists of 970 amino acids with a putative signal peptide of 17 amino acids. It is composed exclusively of tandem repeats of nine group B scavenger receptor cysteine-rich (SRCR) domains separated by eight seven-amino acid peptides. The features of consensus sequences found in the group B SRCR domain were well conserved in EW135. The EW135 gene consists of putative 11 exons, with each SRCR domain being encoded by a single exon. Reverse transcription PCR showed that EW135 is expressed in only the oviduct among the 11 types of tissues tested. EW135 is a second soluble protein belonging to the group B SRCR domain superfamily identified in chickens. One of the important functions of proteins belonging to the group B SRCR domain superfamily is to recognize pathogens in innate immunity. It is, therefore, conceivable that EW135 could be involved in host defense in egg white. PMID:23913278

  15. Comparison of the prevalence and genomic characteristics of Clostridium difficile isolated from various production groups in a vertically integrated swine operation

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The objective of this study was to compare the prevalence of Clostridium difficile among different age and production groups of swine in a vertically integrated swine operation in Texas in 2006. Isolation of C. difficile was performed utilizing an enrichment technique and restrictive media. Prelim...

  16. Single-Cell Genome and Group-Specific dsrAB Sequencing Implicate Marine Members of the Class Dehalococcoidia (Phylum Chloroflexi) in Sulfur Cycling

    PubMed Central

    Cooper, Myriel; Schreiber, Lars; Lloyd, Karen G.; Baker, Brett J.; Petersen, Dorthe G.; Jørgensen, Bo Barker; Stepanauskas, Ramunas; Reinhardt, Richard; Schramm, Andreas; Loy, Alexander; Adrian, Lorenz

    2016-01-01

    ABSTRACT The marine subsurface sediment biosphere is widely inhabited by bacteria affiliated with the class Dehalococcoidia (DEH), phylum Chloroflexi, and yet little is known regarding their metabolisms. In this report, genomic content from a single DEH cell (DEH-C11) with a 16S rRNA gene that was affiliated with a diverse cluster of 16S rRNA gene sequences prevalent in marine sediments was obtained from sediments of Aarhus Bay, Denmark. The distinctive gene content of this cell suggests metabolic characteristics that differ from those of known DEH and Chloroflexi. The presence of genes encoding dissimilatory sulfite reductase (Dsr) suggests that DEH could respire oxidized sulfur compounds, although Chloroflexi have never been implicated in this mode of sulfur cycling. Using long-range PCR assays targeting DEH dsr loci, dsrAB genes were amplified and sequenced from various marine sediments. Many of the amplified dsrAB sequences were affiliated with the DEH Dsr clade, which we propose equates to a family-level clade. This provides supporting evidence for the potential for sulfite reduction by diverse DEH species. DEH-C11 also harbored genes encoding reductases for arsenate, dimethyl sulfoxide, and halogenated organics. The reductive dehalogenase homolog (RdhA) forms a monophyletic clade along with RdhA sequences from various DEH-derived contigs retrieved from available metagenomes. Multiple facts indicate that this RdhA may not be a terminal reductase. The presence of other genes indicated that nutrients and energy may be derived from the oxidation of substituted homocyclic and heterocyclic aromatic compounds. Together, these results suggest that marine DEH play a previously unrecognized role in sulfur cycling and reveal the potential for expanded catabolic and respiratory functions among subsurface DEH. PMID:27143384

  17. Genetic, comparative genomic, and expression analyses of the Mc1r locus in the polychromatic Midas cichlid fish (Teleostei, Cichlidae Amphilophus sp.) species group.

    PubMed

    Henning, Frederico; Renz, Adina Josepha; Fukamachi, Shoji; Meyer, Axel

    2010-05-01

    Natural populations of the Midas cichlid species in several different crater lakes in Nicaragua exhibit a conspicuous color polymorphism. Most individuals are dark and the remaining have a gold coloration. The color morphs mate assortatively and sympatric population differentiation has been shown based on neutral molecular data. We investigated the color polymorphism using segregation analysis and a candidate gene approach. The segregation patterns observed in a mapping cross between a gold and a dark individual were consistent with a single dominant gene as a cause of the gold phenotype. This suggests that a simple genetic architecture underlies some of the speciation events in the Midas cichlids. We compared the expression levels of several candidate color genes Mc1r, Ednrb1, Slc45a2, and Tfap1a between the color morphs. Mc1r was found to be up regulated in the gold morph. Given its widespread association in color evolution and role on melanin synthesis, the Mc1r locus was further investigated using sequences derived from a genomic library. Comparative analysis revealed conserved synteny in relation to the majority of teleosts and highlighted several previously unidentified conserved non-coding elements (CNEs) in the upstream and downstream regions in the vicinity of Mc1r. The identification of the CNEs regions allowed the comparison of sequences from gold and dark specimens of natural populations. No polymorphisms were found between in the population sample and Mc1r showed no linkage to the gold phenotype in the mapping cross, demonstrating that it is not causally related to the color polymorphism in the Midas cichlid. PMID:20449580

  18. 77 FR 28888 - National Human Genome Research Institute Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-05-16

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... applications. Place: National Human Genome Research Institute, 3635 Fishers Lane, Suite 4076, ] Rockville,...

  19. 76 FR 28056 - National Human Genome Research Institute; Notice of Closed Meetings

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-05-13

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group, Genome Research Review... Scientific Review, National Human Genome Research Institute, National Institutes of Health, Bethesda,...

  20. 78 FR 64222 - National Human Genome Research Institute; Notice of Closed Meetings

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-10-28

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review..., Ph.D., Scientific Review Officer, Office of Scientific Review, National Human Genome...

  1. 76 FR 58023 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-09-19

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... Review, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892,...

  2. Comparative genomics of Brassicaceae crops

    PubMed Central

    Sharma, Ashutosh; Li, Xiaonan; Lim, Yong Pyo

    2014-01-01

    The family Brassicaceae is one of the major groups of the plant kingdom and comprises diverse species of great economic, agronomic and scientific importance, including the model plant Arabidopsis. The sequencing of the Arabidopsis genome has revolutionized our knowledge in the field of plant biology and provides a foundation in genomics and comparative biology. Genomic resources have been utilized in Brassica for diversity analyses, construction of genetic maps and identification of agronomic traits. In Brassicaceae, comparative sequence analysis across the species has been utilized to understand genome structure, evolution and the detection of conserved genomic segments. In this review, we focus on the progress made in genetic resource development, genome sequencing and comparative mapping in Brassica and related species. The utilization of genomic resources and next-generation sequencing approaches in improvement of Brassica crops is also discussed. PMID:24987286

  3. Comparative genomics of Brassicaceae crops.

    PubMed

    Sharma, Ashutosh; Li, Xiaonan; Lim, Yong Pyo

    2014-05-01

    The family Brassicaceae is one of the major groups of the plant kingdom and comprises diverse species of great economic, agronomic and scientific importance, including the model plant Arabidopsis. The sequencing of the Arabidopsis genome has revolutionized our knowledge in the field of plant biology and provides a foundation in genomics and comparative biology. Genomic resources have been utilized in Brassica for diversity analyses, construction of genetic maps and identification of agronomic traits. In Brassicaceae, comparative sequence analysis across the species has been utilized to understand genome structure, evolution and the detection of conserved genomic segments. In this review, we focus on the progress made in genetic resource development, genome sequencing and comparative mapping in Brassica and related species. The utilization of genomic resources and next-generation sequencing approaches in improvement of Brassica crops is also discussed. PMID:24987286

  4. Importance of anchor genomes for any plant genome project

    PubMed Central

    Messing, Joachim; Llaca, Victor

    1998-01-01

    Progress in agricultural and environmental technologies is hampered by a slower rate of gene discovery in plants than animals. The vast pool of genes in plants, however, will be an important resource for insertion of genes, via biotechnological procedures, into an array of plants, generating unique germ plasms not achievable by conventional breeding. It just became clear that genomes of grasses have evolved in a manner analogous to Lego blocks. Large chromosome segments have been reshuffled and stuffer pieces added between genes. Although some genomes have become very large, the genome with the fewest stuffer pieces, the rice genome, is the Rosetta Stone of all the bigger grass genomes. This means that sequencing the rice genome as anchor genome of the grasses will provide instantaneous access to the same genes in the same relative physical position in other grasses (e.g., corn and wheat), without the need to sequence each of these genomes independently. (i) The sequencing of the entire genome of rice as anchor genome for the grasses will accelerate plant gene discovery in many important crops (e.g., corn, wheat, and rice) by several orders of magnitudes and reduce research and development costs for government and industry at a faster pace. (ii) Costs for sequencing entire genomes have come down significantly. Because of its size, rice is only 12% of the human or the corn genome, and technology improvements by the human genome project are completely transferable, translating in another 50% reduction of the costs. (iii) The physical mapping of the rice genome by a group of Japanese researchers provides a jump start for sequencing the genome and forming an international consortium. Otherwise, other countries would do it alone and own proprietary positions. PMID:9482827

  5. Genome walking.

    PubMed

    Shapter, Frances M; Waters, Daniel L E

    2014-01-01

    Genome walking is a method for determining the DNA sequence of unknown genomic regions flanking a region of known DNA sequence. The Genome walking has the potential to capture 6-7 kb of sequence in a single round. Ideal for identifying gene promoter regions where only the coding region. Genome walking also has significant utility for capturing homologous genes in new species when there are areas in the target gene with strong sequence conservation to the characterized species. The increasing use of next-generation sequencing technologies will see the principles of genome walking adapted to in silico methods. However, for smaller projects, PCR-based genome walking will remain an efficient method of characterizing unknown flanking sequence. PMID:24243201

  6. Complete genomic sequence analyses of the first group A giraffe rotavirus reveals close evolutionary relationship with rotaviruses infecting other members of the Artiodactyla.

    PubMed

    O'Shea, Helen; Mulherin, Emily; Matthijnssens, Jelle; McCusker, Matthew P; Collins, P J; Cashman, Olivia; Gunn, Lynda; Beltman, Marijke E; Fanning, Séamus

    2014-05-14

    Group A Rotaviruses (RVA) have been established as significant contributory agents of acute gastroenteritis in young children and many animal species. In 2008, we described the first RVA strain detected in a giraffe calf (RVA/Giraffe-wt/IRL/GirRV/2008/G10P[11]), presenting with acute diarrhoea. Molecular characterisation of the VP7 and VP4 genes revealed the bovine-like genotypes G10 and P[11], respectively. To further investigate the origin of this giraffe RVA strain, the 9 remaining gene segments were sequenced and analysed, revealing the following genotype constellation: G10-P[11]-I2-R2-C2-M2-A3-N2-T6-E2-H3. This genotype constellation is very similar to RVA strains isolated from cattle or other members of the artiodactyls. Phylogenetic analyses confirmed the close relationship between GirRV and RVA strains with a bovine-like genotype constellation detected from several host species, including humans. These results suggest that RVA strain GirRV was the result of an interspecies transmission from a bovine host to the giraffe calf. However, we cannot rule out completely that this bovine-like RVA genotype constellation may be enzootic in giraffes. Future RVA surveillance in giraffes may answer this intriguing question. PMID:24582453

  7. Genomics of Volvocine Algae

    PubMed Central

    Umen, James G.; Olson, Bradley J.S.C.

    2015-01-01

    Volvocine algae are a group of chlorophytes that together comprise a unique model for evolutionary and developmental biology. The species Chlamydomonas reinhardtii and Volvox carteri represent extremes in morphological diversity within the Volvocine clade. Chlamydomonas is unicellular and reflects the ancestral state of the group, while Volvox is multicellular and has evolved numerous innovations including germ-soma differentiation, sexual dimorphism, and complex morphogenetic patterning. The Chlamydomonas genome sequence has shed light on several areas of eukaryotic cell biology, metabolism and evolution, while the Volvox genome sequence has enabled a comparison with Chlamydomonas that reveals some of the underlying changes that enabled its transition to multicellularity, but also underscores the subtlety of this transition. Many of the tools and resources are in place to further develop Volvocine algae as a model for evolutionary genomics. PMID:25883411

  8. Comparative mitochondrial genomics in zygomycetes: bacteria-like RNase P RNAs, mobile elements and a close source of the group I intron invasion in angiosperms

    PubMed Central

    Seif, Elias; Leigh, Jessica; Liu, Yu; Roewer, Ingeborg; Forget, Lise; Lang, B. Franz

    2005-01-01

    To generate data for comparative analyses of zygomycete mitochondrial gene expression, we sequenced mtDNAs of three distantly related zygomycetes, Rhizopus oryzae, Mortierella verticillata and Smittium culisetae. They all contain the standard fungal mitochondrial gene set, plus rnpB, the gene encoding the RNA subunit of the mitochondrial RNase P (mtP-RNA) and rps3, encoding ribosomal protein S3 (the latter lacking in R.oryzae). The mtP-RNAs of R.oryzae and of additional zygomycete relatives have the most eubacteria-like RNA structures among fungi. Precise mapping of the 5′ and 3′ termini of the R.oryzae and M.verticillata mtP-RNAs confirms their expression and processing at the exact sites predicted by secondary structure modeling. The 3′ RNA processing of zygomycete mitochondrial mRNAs, SSU-rRNA and mtP-RNA occurs at the C-rich sequence motifs similar to those identified in fission yeast and basidiomycete mtDNAs. The C-rich motifs are included in the mature transcripts, and are likely generated by exonucleolytic trimming of RNA 3′ termini. Zygomycete mtDNAs feature a variety of insertion elements: (i) mtDNAs of R.oryzae and M.verticillata were subject to invasions by double hairpin elements; (ii) genes of all three species contain numerous mobile group I introns, including one that is closest to an intron that invaded angiosperm mtDNAs; and (iii) at least one additional case of a mobile element, characterized by a homing endonuclease insertion between partially duplicated genes [Paquin,B., Laforest,M.J., Forget,L., Roewer,I., Wang,Z., Longcore,J. and Lang,B.F. (1997) Curr. Genet., 31, 380–395]. The combined mtDNA-encoded proteins contain insufficient phylogenetic signal to demonstrate monophyly of zygomycetes. PMID:15689432

  9. Long-term study of an infection with ranaviruses in a group of edible frogs (Pelophylax kl. esculentus) and partial characterization of two viruses based on four genomic regions.

    PubMed

    Stöhr, Anke C; Hoffmann, Alexandra; Papp, Tibor; Robert, Nadia; Pruvost, Nicolas B M; Reyer, Heinz-Ulrich; Marschang, Rachel E

    2013-08-01

    Several edible frogs (Pelophylax kl. esculentus) collected into a single group from various ponds in Europe died suddenly with reddening of the skin (legs, abdomen) and haemorrhages in the gastrointestinal tract. Ranavirus was detected in some of the dead frogs using PCR, and virus was also isolated in cell culture. Over the following 3 years, another two outbreaks occurred with low to high mortality in between asymptomatic periods. In the first 2 years, the same ranavirus was detected repeatedly, but a new ranavirus was isolated in association with the second mass-mortality event. The two different ranaviruses were characterized based on nucleotide sequences from four genomic regions, namely, major capsid protein, DNA polymerase, ribonucleoside diphosphate reductase alpha and beta subunit genes. The sequences showed slight variations to each other or GenBank entries and both clustered to the Rana esculenta virus (REV-like) clade in the phylogenetic analysis. Furthermore, a quiescent infection was demonstrated in two individuals. By comparing samples taken before and after transport and caging in groups it was possible to identify the pond of origin and a ranavirus was detected for the first time in wild amphibians in Germany. PMID:23535222

  10. Opinion dynamics in a group-based society

    NASA Astrophysics Data System (ADS)

    Gargiulo, F.; Huet, S.

    2010-09-01

    Many models have been proposed to analyze the evolution of opinion structure due to the interaction of individuals in their social environment. Such models analyze the spreading of ideas both in completely interacting backgrounds and on social networks, where each person has a finite set of interlocutors. In this paper we analyze the reciprocal feedback between the opinions of the individuals and the structure of the interpersonal relationships at the level of community structures. For this purpose we define a group-based random network and we study how this structure co-evolves with opinion dynamics processes. We observe that the adaptive network structure affects the opinion dynamics process helping the consensus formation. The results also show interesting behaviors in regards to the size distribution of the groups and their correlation with opinion structure.

  11. Human Genome Annotation

    NASA Astrophysics Data System (ADS)

    Gerstein, Mark

    A central problem for 21st century science is annotating the human genome and making this annotation useful for the interpretation of personal genomes. My talk will focus on annotating the 99% of the genome that does not code for canonical genes, concentrating on intergenic features such as structural variants (SVs), pseudogenes (protein fossils), binding sites, and novel transcribed RNAs (ncRNAs). In particular, I will describe how we identify regulatory sites and variable blocks (SVs) based on processing next-generation sequencing experiments. I will further explain how we cluster together groups of sites to create larger annotations. Next, I will discuss a comprehensive pseudogene identification pipeline, which has enabled us to identify >10K pseudogenes in the genome and analyze their distribution with respect to age, protein family, and chromosomal location. Throughout, I will try to introduce some of the computational algorithms and approaches that are required for genome annotation. Much of this work has been carried out in the framework of the ENCODE, modENCODE, and 1000 genomes projects.

  12. Ebolavirus comparative genomics

    DOE PAGESBeta

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; et al

    2015-07-14

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. We examine the dynamics of this genome, comparing more than one hundred currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus, and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of themore » same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP), and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. In conclusion, this information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.« less

  13. Aquaculture Genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The genomics chapter covers the basics of genome mapping and sequencing and the current status of several relevant species. The chapter briefly describes the development and use of (cDNA, BAC, etc.) libraries for mapping and obtaining specific sequence information. Other topics include comparative ...

  14. THE PHYLOGENY AND GENOME OF TRICHINELLA SPECIES

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In 2004, funding was received by Washington University’s Genome Sequencing Center through NHGRI, to completely sequence several nematode genomes as part of a holistic effort to advance our understanding of the human genome. Trichinella spiralis was among this group because of its strategic ...

  15. Genetics, Genomics and Breeding in Soybean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The soybean (Glycine max) genome sequencing project began as an interagency project with the DOE’s Joint Genome Institute providing the production sequencing throughput with the NSF and USDA funded groups providing genomic resources and soybean expertise to the project (Jackson et al, 2006). The go...

  16. THE PHYLOGENY AND GENOME OF TRICHINELLA SPECIES

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In 2004, funding was received by Washington University’s Genome Sequencing Center through NHGRI, to completely sequence several nematode genomes as part of a holistic effort to advance our understanding of the human genome. Trichinella spiralis was among this group because of its strategic ...

  17. Genome-to-Watershed Predictive Understanding of Terrestrial Environments

    NASA Astrophysics Data System (ADS)

    Hubbard, S. S.; Agarwal, D.; Banfield, J. F.; Beller, H. R.; Brodie, E.; Long, P.; Nico, P. S.; Steefel, C. I.; Tokunaga, T. K.; Williams, K. H.

    2014-12-01

    Although terrestrial environments play a critical role in cycling water, greenhouse gasses, and other life-critical elements, the complexity of interactions among component microbes, plants, minerals, migrating fluids and dissolved constituents hinders predictive understanding of system behavior. The 'Sustainable Systems 2.0' project is developing genome-to-watershed scale predictive capabilities to quantify how the microbiome affects biogeochemical watershed functioning, how watershed-scale hydro-biogeochemical processes affect microbial functioning, and how these interactions co-evolve with climate and land-use changes. Development of such predictive capabilities is critical for guiding the optimal management of water resources, contaminant remediation, carbon stabilization, and agricultural sustainability - now and with global change. Initial investigations are focused on floodplains in the Colorado River Basin, and include iterative model development, experiments and observations with an early emphasis on subsurface aspects. Field experiments include local-scale experiments at Rifle CO to quantify spatiotemporal metabolic and geochemical responses to O2and nitrate amendments as well as floodplain-scale monitoring to quantify genomic and biogeochemical response to natural hydrological perturbations. Information obtained from such experiments are represented within GEWaSC, a Genome-Enabled Watershed Simulation Capability, which is being developed to allow mechanistic interrogation of how genomic information stored in a subsurface microbiome affects biogeochemical cycling. This presentation will describe the genome-to-watershed scale approach as well as early highlights associated with the project. Highlights include: first insights into the diversity of the subsurface microbiome and metabolic roles of organisms involved in subsurface nitrogen, sulfur and hydrogen and carbon cycling; the extreme variability of subsurface DOC and hydrological controls on carbon and

  18. A Genome-wide Scan for Selective Sweeps in Racing Horses

    PubMed Central

    Moon, Sunjin; Lee, Jin Woo; Shin, Donghyun; Shin, Kwang-Yun; Kim, Jun; Choi, Ik-Young; Kim, Jaemin; Kim, Heebal

    2015-01-01

    Using next-generation sequencing, we conducted a genome-wide scan of selective sweeps associated with selection toward genetic improvement in Thoroughbreds. We investigated potential phenotypic consequence of putative candidate loci by candidate gene association mapping for the finishing time in 240 Thoroughbred horses. We found a significant association with the trait for Ral GApase alpha 2 (RALGAP2) that regulates a variety of cellular processes of signal trafficking. Neighboring genes around RALGAP2 included insulinoma-associated 1 (INSM1), pallid (PLDN), and Ras and Rab interactor 2 (RIN2) genes have similar roles in signal trafficking, suggesting that a co-evolving gene cluster located on the chromosome 22 is under strong artificial selection in racehorses. PMID:26333666

  19. Antarctic Genomics

    PubMed Central

    Clarke, Andrew; Cockell, Charles S.; Convey, Peter; Detrich III, H. William; Fraser, Keiron P. P.; Johnston, Ian A.; Methe, Barbara A.; Murray, Alison E.; Peck, Lloyd S.; Römisch, Karin; Rogers, Alex D.

    2004-01-01

    With the development of genomic science and its battery of technologies, polar biology stands on the threshold of a revolution, one that will enable the investigation of important questions of unprecedented scope and with extraordinary depth and precision. The exotic organisms of polar ecosystems are ideal candidates for genomic analysis. Through such analyses, it will be possible to learn not only the novel features that enable polar organisms to survive, and indeed thrive, in their extreme environments, but also fundamental biological principles that are common to most, if not all, organisms. This article aims to review recent developments in Antarctic genomics and to demonstrate the global context of such studies. PMID:18629155

  20. Correlation between the genomic o454-nlpD region polymorphisms, virulence gene equipment and phylogenetic group of extraintestinal Escherichia coli (ExPEC) enables pathotyping irrespective of host, disease and source of isolation

    PubMed Central

    2014-01-01

    Background The mutS-rpoS intergenic region in E. coli displays a mosaic structure which revealed pathotype specific patterns. To assess the importance of this region as a surrogate marker for the identification of highly virulent extraintestinal pathogenic E. coli (ExPEC) strains we aimed to: (i) characterize the genetic diversity of the mutS gene and the o454-nlpD genomic region among 510 E. coli strains from animals and humans; (ii) delineate associations between the polymorphism of this region and features such as phylogenetic background of E. coli, pathotype, host species, clinical condition, serogroup and virulence associated genes (VAG)s; and (iii) identify the most important VAGs for classification of the o454-nlpD region. Methods Size variation in the o454-nlpD region was investigated by PCR amplification and sequencing. Phylogenetic relationships were assessed by Ecor- and Multilocus sequence- typing (MLST), and a comparative analysis between mutS gene phylogenetic tree obtained with RAxML and the MLST grouping method was performed. Correlation between o454-nlpD patterns and the features described above were analysed. In addition, the importance of 47 PCR-amplified ExPEC-related VAGs for classification of o454-nlpD patterns was investigated by means of Random Forest algorithm. Results Four main structures (patterns I-IV) of the o454-nlpD region among ExPEC and commensal E. coli strains were identified. Statistical analysis showed a positive and exclusive association between pattern III and the ExPEC strains. A strong association between pattern III and either the Ecor group B2 or the sequence type complexes known to represent the phylogenetic background of highly virulent ExPEC strains (such as STC95, STC73 and STC131) was found as well. RF analyses determined five genes (csgA, malX, chuA, sit, and vat) to be suitable to predict pattern III strains. Conclusion The significant association between pattern III and group B2 strains suggested the o454-nlp

  1. A Genomic Mechanism for Antagonism Between Retinoic Acid and Estrogen Signaling in Breast Cancer

    PubMed Central

    Hua, Sujun; Kittler, Ralf; White, Kevin P.

    2012-01-01

    SUMMARY Retinoic acid (RA) triggers growth-suppressive effects in tumor cells and therefore RA has and its synthetic analogs have great potential as anti-carcinogenic agent. RA effects are mediated by Retinoic Acid Receptors (RARs), which regulate gene expression in an RA-dependent manner. To define the genetic network regulated by RARs in breast cancer, we identified RAR genomic targets using chromatin immunoprecipitation and expression analysis. We found that RAR binding throughout the genome is highly co-incident with estrogen receptor α (ERα) binding, and identified a widespread crosstalk of RA and estrogen signaling to antagonistically regulate breast cancer-associated genes. ERα and RAR binding sites appear to be co-evolved on a large scale throughout the human genome, allowing for competitive binding between these transcription factors via nearby or overlapping cis-regulatory elements. Together these data indicate the existence of a highly coordinated intersection between these two critical nuclear hormone receptor signaling pathways providing a global mechanism for balancing gene expression output via local regulatory interactions dispersed throughout the genome. PMID:19563758

  2. Teaching Residents Genomic Pathology: A Novel Approach for New Technology

    PubMed Central

    Haspel, Richard L.

    2013-01-01

    Genomics-based diagnostics have become part of patient care. As pathologists have the expertise in clinical laboratory testing as well as access to patient samples, all genomic medicine is genomic pathology. This article will review the evidence that there is a critical need for pathology resident training in genomics. Several individual program curricula are described as well as the progress of the Training Residents in Genomic (TRIG) Working Group. This group has made significant advances towards developing, implementing and evaluating a national curriculum in genomics for pathology residents. The novel approach of the TRIG Working Group can be used as a model for training pathology professionals in any new technology. PMID:23399798

  3. Genome databases

    SciTech Connect

    Courteau, J.

    1991-10-11

    Since the Genome Project began several years ago, a plethora of databases have been developed or are in the works. They range from the massive Genome Data Base at Johns Hopkins University, the central repository of all gene mapping information, to small databases focusing on single chromosomes or organisms. Some are publicly available, others are essentially private electronic lab notebooks. Still others limit access to a consortium of researchers working on, say, a single human chromosome. An increasing number incorporate sophisticated search and analytical software, while others operate as little more than data lists. In consultation with numerous experts in the field, a list has been compiled of some key genome-related databases. The list was not limited to map and sequence databases but also included the tools investigators use to interpret and elucidate genetic data, such as protein sequence and protein structure databases. Because a major goal of the Genome Project is to map and sequence the genomes of several experimental animals, including E. coli, yeast, fruit fly, nematode, and mouse, the available databases for those organisms are listed as well. The author also includes several databases that are still under development - including some ambitious efforts that go beyond data compilation to create what are being called electronic research communities, enabling many users, rather than just one or a few curators, to add or edit the data and tag it as raw or confirmed.

  4. Listeria Genomics

    NASA Astrophysics Data System (ADS)

    Cabanes, Didier; Sousa, Sandra; Cossart, Pascale

    The opportunistic intracellular foodborne pathogen Listeria monocytogenes has become a paradigm for the study of host-pathogen interactions and bacterial adaptation to mammalian hosts. Analysis of L. monocytogenes infection has provided considerable insight into how bacteria invade cells, move intracellularly, and disseminate in tissues, as well as tools to address fundamental processes in cell biology. Moreover, the vast amount of knowledge that has been gathered through in-depth comparative genomic analyses and in vivo studies makes L. monocytogenes one of the most well-studied bacterial pathogens. This chapter provides an overview of progress in the exploration of genomic, transcriptomic, and proteomic data in Listeria spp. to understand genome evolution and diversity, as well as physiological aspects of metabolism used by bacteria when growing in diverse environments, in particular in infected hosts.

  5. Genome Informatics

    PubMed Central

    Winslow, Raimond L.; Boguski, Mark S.

    2005-01-01

    This article reviews recent advances in genomics and informatics relevant to cardiovascular research. In particular, we review the status of (1) whole genome sequencing efforts in human, mouse, rat, zebrafish, and dog; (2) the development of data mining and analysis tools; (3) the launching of the National Heart, Lung, and Blood Institute Programs for Genomics Applications and Proteomics Initiative; (4) efforts to characterize the cardiac transcriptome and proteome; and (5) the current status of computational modeling of the cardiac myocyte. In each instance, we provide links to relevant sources of information on the World Wide Web and critical appraisals of the promises and the challenges of an expanding and diverse information landscape. PMID:12750305

  6. Jumbled genomes: missing Apicomplexan synteny.

    PubMed

    DeBarry, Jeremy D; Kissinger, Jessica C

    2011-10-01

    Whole-genome comparisons provide insight into genome evolution by informing on gene repertoires, gene gains/losses, and genome organization. Most of our knowledge about eukaryotic genome evolution is derived from studies of multicellular model organisms. The eukaryotic phylum Apicomplexa contains obligate intracellular protist parasites responsible for a wide range of human and veterinary diseases (e.g., malaria, toxoplasmosis, and theileriosis). We have developed an in silico protein-encoding gene based pipeline to investigate synteny across 12 apicomplexan species from six genera. Genome rearrangement between lineages is extensive. Syntenic regions (conserved gene content and order) are rare between lineages and appear to be totally absent across the phylum, with no group of three genes found on the same chromosome and in the same order within 25 kb up- and downstream of any orthologous genes. Conserved synteny between major lineages is limited to small regions in Plasmodium and Theileria/Babesia species, and within these conserved regions, there are a number of proteins putatively targeted to organelles. The observed overall lack of synteny is surprising considering the divergence times and the apparent absence of transposable elements (TEs) within any of the species examined. TEs are ubiquitous in all other groups of eukaryotes studied to date and have been shown to be involved in genomic rearrangements. It appears that there are different criteria governing genome evolution within the Apicomplexa relative to other well-studied unicellular and multicellular eukaryotes. PMID:21504890

  7. Future Health Applications of Genomics

    PubMed Central

    McBride, Colleen M.; Bowen, Deborah; Brody, Lawrence C.; Condit, Celeste M.; Croyle, Robert T.; Gwinn, Marta; Khoury, Muin J.; Koehly, Laura M.; Korf, Bruce R.; Marteau, Theresa M.; McLeroy, Kenneth; Patrick, Kevin; Valente, Thomas W.

    2014-01-01

    Despite the quickening momentum of genomic discovery, the communication, behavioral, and social sciences research needed for translating this discovery into public health applications has lagged behind. The National Human Genome Research Institute held a 2-day workshop in October 2008 convening an interdisciplinary group of scientists to recommend forward-looking priorities for translational research. This research agenda would be designed to redress the top three risk factors (tobacco use, poor diet, and physical inactivity) that contribute to the four major chronic diseases (heart disease, type 2 diabetes, lung disease, and many cancers) and account for half of all deaths worldwide. Three priority research areas were identified: (1) improving the public’s genetic literacy in order to enhance consumer skills; (2) gauging whether genomic information improves risk communication and adoption of healthier behaviors more than current approaches; and (3) exploring whether genomic discovery in concert with emerging technologies can elucidate new behavioral intervention targets. Important crosscutting themes also were identified, including the need to: (1) anticipate directions of genomic discovery; (2) take an agnostic scientific perspective in framing research questions asking whether genomic discovery adds value to other health promotion efforts; and (3) consider multiple levels of influence and systems that contribute to important public health problems. The priorities and themes offer a framework for a variety of stakeholders, including those who develop priorities for research funding, interdisciplinary teams engaged in genomics research, and policymakers grappling with how to use the products born of genomics research to address public health challenges. PMID:20409503

  8. Ebolavirus comparative genomics

    PubMed Central

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S.; Pedersen, Thomas D.; Wassenaar, Trudy M.; Ussery, David W.

    2015-01-01

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP) and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. This information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies. This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). PMID:26175035

  9. Ebolavirus comparative genomics.

    PubMed

    Jun, Se-Ran; Leuze, Michael R; Nookaew, Intawat; Uberbacher, Edward C; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S; Pedersen, Thomas D; Wassenaar, Trudy M; Ussery, David W

    2015-09-01

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. To examine the dynamics of this genome, we compare more than 100 currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP) and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. This information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.This manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan (http://energy.gov/downloads/doe-public-access-plan). PMID:26175035

  10. Biobanks for Genomics and Genomics for Biobanks

    PubMed Central

    Ducournau, Pascal; Gourraud, Pierre-Antoine; Pontille, David

    2003-01-01

    Biobanks include biological samples and attached databases. Human biobanks occur in research, technological development and medical activities. Population genomics is highly dependent on the availability of large biobanks. Ethical issues must be considered: protecting the rights of those people whose samples or data are in biobanks (information, autonomy, confidentiality, protection of private life), assuring the non-commercial use of human body elements and the optimal use of samples and data. They balance other issues, such as protecting the rights of researchers and companies, allowing long-term use of biobanks while detailed information on future uses is not available. At the level of populations, the traditional form of informed consent is challenged. Other dimensions relate to the rights of a group as such, in addition to individual rights. Conditions of return of results and/or benefit to a population need to be defined. With ‘large-scale biobanking’ a marked trend in genomics, new societal dimensions appear, regarding communication, debate, regulation, societal control and valorization of such large biobanks. Exploring how genomics can help health sector biobanks to become more rationally constituted and exploited is an interesting perspective. For example, evaluating how genomic approaches can help in optimizing haematopoietic stem cell donor registries using new markers and high-throughput techniques to increase immunogenetic variability in such registries is a challenge currently being addressed. Ethical issues in such contexts are important, as not only individual decisions or projects are concerned, but also national policies in the international arena and organization of democratic debate about science, medicine and society. PMID:18629026

  11. Towards a Library of Standard Operating Procedures (SOPs) for (meta)genomic annotation

    SciTech Connect

    Kyrpides, Nikos; Angiuoli, Samuel V.; Cochrane, Guy; Field, Dawn; Garrity, George; Gussman, Aaron; Kodira, Chinnappa D.; Klimke, William; Kyrpides, Nikos; Madupu, Ramana; Markowitz, Victor; Tatusova, Tatiana; Thomson, Nick; White, Owen

    2008-04-01

    Genome annotations describe the features of genomes and accompany sequences in genome databases. The methodologies used to generate genome annotation are diverse and typically vary amongst groups. Descriptions of the annotation procedure are helpful in interpreting genome annotation data. Standard Operating Procedures (SOPs) for genome annotation describe the processes that generate genome annotations. Some groups are currently documenting procedures but standards are lacking for structure and content of annotation SOPs. In addition, there is no central repository to store and disseminate procedures and protocols for genome annotation. We highlight the importance of SOPs for genome annotation and endorse a central online repository of SOPs.

  12. Whither genomics?

    PubMed Central

    Murray, Andrew W

    2000-01-01

    The flood of data from genome-wide analysis is transforming biology. We need to develop new, interdisciplinary approaches to convert these data into information about the components and structures of individual biological pathways and to use the resulting information to yield knowledge about general principles that explain the functions and evolution of life. PMID:11104516

  13. Genomics of Bacillus Species

    NASA Astrophysics Data System (ADS)

    Økstad, Ole Andreas; Kolstø, Anne-Brit

    Members of the genus Bacillus are rod-shaped spore-forming bacteria belonging to the Firmicutes, the low G+C gram-positive bacteria. The Bacillus genus was first described and classified by Ferdinand Cohn in Cohn (1872), and Bacillus subtilis was defined as the type species (Soule, 1932). Several Bacilli may be linked to opportunistic infections. However, pathogenicity among Bacillus spp. is mainly a feature of bacteria belonging to the Bacillus cereus group, including B. cereus, Bacillus anthracis, and Bacillus thuringiensis. Here we review the genomics of B. cereus group bacteria in relation to their roles as etiological agents of two food poisoning syndromes (emetic and diarrhoeal).

  14. The Nostoc punctiforme Genome

    SciTech Connect

    John C. Meeks

    2001-12-31

    Nostoc punctiforme is a filamentous cyanobacterium with extensive phenotypic characteristics and a relatively large genome, approaching 10 Mb. The phenotypic characteristics include a photoautotrophic, diazotrophic mode of growth, but N. punctiforme is also facultatively heterotrophic; its vegetative cells have multiple development alternatives, including terminal differentiation into nitrogen-fixing heterocysts and transient differentiation into spore-like akinetes or motile filaments called hormogonia; and N. punctiforme has broad symbiotic competence with fungi and terrestrial plants, including bryophytes, gymnosperms and an angiosperm. The shotgun-sequencing phase of the N. punctiforme strain ATCC 29133 genome has been completed by the Joint Genome Institute. Annotation of an 8.9 Mb database yielded 7432 open reading frames, 45% of which encode proteins with known or probable known function and 29% of which are unique to N. punctiforme. Comparative analysis of the sequence indicates a genome that is highly plastic and in a state of flux, with numerous insertion sequences and multilocus repeats, as well as genes encoding transposases and DNA modification enzymes. The sequence also reveals the presence of genes encoding putative proteins that collectively define almost all characteristics of cyanobacteria as a group. N. punctiforme has an extensive potential to sense and respond to environmental signals as reflected by the presence of more than 400 genes encoding sensor protein kinases, response regulators and other transcriptional factors. The signal transduction systems and any of the large number of unique genes may play essential roles in the cell differentiation and symbiotic interaction properties of N. punctiforme.

  15. Genome sequencing of the important oilseed crop Sesamum indicum L.

    PubMed

    Zhang, Haiyang; Miao, Hongmei; Wang, Lei; Qu, Lingbo; Liu, Hongyan; Wang, Qiang; Yue, Meiwang

    2013-01-01

    The Sesame Genome Working Group (SGWG) has been formed to sequence and assemble the sesame (Sesamum indicum L.) genome. The status of this project and our planned analyses are described. PMID:23369264

  16. REEF: searching REgionally Enriched Features in genomes

    PubMed Central

    Coppe, Alessandro; Danieli, Gian Antonio; Bortoluzzi, Stefania

    2006-01-01

    Background In Eukaryotic genomes, different features including genes are not uniformly distributed. The integration of annotation information and genomic position of functional DNA elements in the Eukaryotic genomes opened the way to test novel hypotheses of higher order genome organization and regulation of expression. Results REEF is a new tool, aimed at identifying genomic regions enriched in specific features, such as a class or group of genes homogeneous for expression and/or functional characteristics. The method for the calculation of local feature enrichment uses test statistic based on the Hypergeometric Distribution applied genome-wide by using a sliding window approach and adopting the False Discovery Rate for controlling multiplicity. REEF software, source code and documentation are freely available at . Conclusion REEF can aid to shed light on the role of organization of specific genomic regions in the determination of their functional role. PMID:17042935

  17. Citrus Genomics

    PubMed Central

    Talon, Manuel; Gmitter Jr., Fred G.

    2008-01-01

    Citrus is one of the most widespread fruit crops globally, with great economic and health value. It is among the most difficult plants to improve through traditional breeding approaches. Currently, there is risk of devastation by diseases threatening to limit production and future availability to the human population. As technologies rapidly advance in genomic science, they are quickly adapted to address the biological challenges of the citrus plant system and the world's industries. The historical developments of linkage mapping, markers and breeding, EST projects, physical mapping, an international citrus genome sequencing project, and critical functional analysis are described. Despite the challenges of working with citrus, there has been substantial progress. Citrus researchers engaged in international collaborations provide optimism about future productivity and contributions to the benefit of citrus industries worldwide and to the human population who can rely on future widespread availability of this health-promoting and aesthetically pleasing fruit crop. PMID:18509486

  18. Ancient genomics

    PubMed Central

    Der Sarkissian, Clio; Allentoft, Morten E.; Ávila-Arcos, María C.; Barnett, Ross; Campos, Paula F.; Cappellini, Enrico; Ermini, Luca; Fernández, Ruth; da Fonseca, Rute; Ginolhac, Aurélien; Hansen, Anders J.; Jónsson, Hákon; Korneliussen, Thorfinn; Margaryan, Ashot; Martin, Michael D.; Moreno-Mayar, J. Víctor; Raghavan, Maanasa; Rasmussen, Morten; Velasco, Marcela Sandoval; Schroeder, Hannes; Schubert, Mikkel; Seguin-Orlando, Andaine; Wales, Nathan; Gilbert, M. Thomas P.; Willerslev, Eske; Orlando, Ludovic

    2015-01-01

    The past decade has witnessed a revolution in ancient DNA (aDNA) research. Although the field's focus was previously limited to mitochondrial DNA and a few nuclear markers, whole genome sequences from the deep past can now be retrieved. This breakthrough is tightly connected to the massive sequence throughput of next generation sequencing platforms and the ability to target short and degraded DNA molecules. Many ancient specimens previously unsuitable for DNA analyses because of extensive degradation can now successfully be used as source materials. Additionally, the analytical power obtained by increasing the number of sequence reads to billions effectively means that contamination issues that have haunted aDNA research for decades, particularly in human studies, can now be efficiently and confidently quantified. At present, whole genomes have been sequenced from ancient anatomically modern humans, archaic hominins, ancient pathogens and megafaunal species. Those have revealed important functional and phenotypic information, as well as unexpected adaptation, migration and admixture patterns. As such, the field of aDNA has entered the new era of genomics and has provided valuable information when testing specific hypotheses related to the past. PMID:25487338

  19. Genomic Imprinting

    PubMed Central

    Bajrami, Emirjeta; Spiroski, Mirko

    2016-01-01

    BACKGROUND: Genomic imprinting is the inheritance out of Mendelian borders. Many of inherited diseases and human development violates Mendelian law of inheritance, this way of inheriting is studied by epigenetics. AIM: The aim of this review is to analyze current opinions and options regarding to this way of inheriting. RESULTS: Epigenetics shows that gene expression undergoes changes more complex than modifications in the DNA sequence; it includes the environmental influence on the gametes before conception. Humans inherit two alleles from mother and father, both are functional for the majority of the genes, but sometimes one is turned off or “stamped” and doesn’t show in offspring, that gene is imprinted. Imprinting means that that gene is silenced, and gene from other parent is expressed. The mechanisms for imprinting are still incompletely defined, but they involve epigenetic modifications that are erased and then reset during the creation of eggs and sperm. Genomic imprinting is a process of silencing genes through DNA methylation. The repressed allele is methylated, while the active allele is unmethylated. The most well-known conditions include Prader-Willi syndrome, and Angelman syndrome. Both of these syndromes can be caused by imprinting or other errors involving genes on the long arm of chromosome 15. CONCLUSIONS: Genomic imprinting and other epigenetic mechanisms such as environment is shown that plays role in offspring neurodevelopment and autism spectrum disorder. PMID:27275355

  20. Avian papillomaviruses: the parrot Psittacus erithacus papillomavirus (PePV) genome has a unique organization of the early protein region and is phylogenetically related to the chaffinch papillomavirus

    PubMed Central

    Tachezy, Ruth; Rector, Annabel; Havelkova, Marta; Wollants, Elke; Fiten, Pierre; Opdenakker, Ghislain; Jenson, A Bennett; Sundberg, John P; Van Ranst, Marc

    2002-01-01

    Background An avian papillomavirus genome has been cloned from a cutaneous exophytic papilloma from an African grey parrot (Psittacus erithacus). The nucleotide sequence, genome organization, and phylogenetic position of the Psittacus erithacus papillomavirus (PePV) were determined. This PePV sequence represents the first complete avian papillomavirus genome defined. Results The PePV genome (7304 basepairs) differs from other papillomaviruses, in that it has a unique organization of the early protein region lacking classical E6 and E7 open reading frames. Phylogenetic comparison of the PePV sequence with partial E1 and L1 sequences of the chaffinch (Fringilla coelebs) papillomavirus (FPV) reveals that these two avian papillomaviruses form a monophyletic cluster with a common branch that originates near the unresolved center of the papillomavirus evolutionary tree. Conclusions The PePV genome has a unique layout of the early protein region which represents a novel prototypic genomic organization for avian papillomaviruses. The close relationship between PePV and FPV, and between their Psittaciformes and Passeriformes hosts, supports the hypothesis that papillomaviruses have co-evolved and speciated together with their host species throughout evolution. PMID:12110158

  1. Reannotation of Shewanella oneidensis genome.

    PubMed

    Daraselia, N; Dernovoy, D; Tian, Y; Borodovsky, M; Tatusov, R; Tatusova, T

    2003-01-01

    As more and more complete bacterial genome sequences become available, the genome annotation of previously sequenced genomes may become quickly outdated. This is primarily due to the discovery and functional characterization of new genes. We have reannotated the recently published genome of Shewanella oneidensis with the following results: 51 new genes have been identified, and functional annotation has been added to the 97 genes, including 15 new and 82 existing ones with previously unassigned function. The identification of new genes was achieved by predicting the protein coding regions using the HMM-based program GeneMark.hmm. Subsequent comparison of the predicted gene products to the non-redundant protein database using BLAST and the COG (Clusters of Orthologous Groups) database using COGNITOR provided for the functional annotation. PMID:14506846

  2. Development of genome viewer (Web Omics Viewer) for managing databases of cucumber genome

    NASA Astrophysics Data System (ADS)

    Wojcieszek, M.; RóŻ, P.; Pawełkowicz, M.; Nowak, R.; Przybecki, Z.

    Cucumber is an important plant in horticulture and science world. Sequencing projects of C. sativus genome enable new methodological aproaches in further investigation of this species. Accessibility is crucial to fully exploit obtained information about detail structure of genes, markers and other characteristic features such contigs, scaffolds and chromosomes. Genome viewer is one of tools providing plain and easy way for presenting genome data for users and for databases administration. Gbrowse - the main viewer has several very useful features but lacks in managing simplicity. Our group developed new genome browser Web Omics Viewer (WOV), keeping functionality but improving utilization and accessibility to cucumber genome data.

  3. Patterns of genome size variation in snapping shrimp.

    PubMed

    Jeffery, Nicholas W; Hultgren, Kristin; Chak, Solomon Tin Chi; Gregory, T Ryan; Rubenstein, Dustin R

    2016-06-01

    Although crustaceans vary extensively in genome size, little is known about how genome size may affect the ecology and evolution of species in this diverse group, in part due to the lack of large genome size datasets. Here we investigate interspecific, intraspecific, and intracolony variation in genome size in 39 species of Synalpheus shrimps, representing one of the largest genome size datasets for a single genus within crustaceans. We find that genome size ranges approximately 4-fold across Synalpheus with little phylogenetic signal, and is not related to body size. In a subset of these species, genome size is related to chromosome size, but not to chromosome number, suggesting that despite large genomes, these species are not polyploid. Interestingly, there appears to be 35% intraspecific genome size variation in Synalpheus idios among geographic regions, and up to 30% variation in Synalpheus duffyi genome size within the same colony. PMID:27171678

  4. Update on RefSeq microbial genomes resources

    PubMed Central

    Tatusova, Tatiana; Ciufo, Stacy; Federhen, Scott; Fedorov, Boris; McVeigh, Richard; O'Neill, Kathleen; Tolstoy, Igor; Zaslavsky, Leonid

    2015-01-01

    NCBI RefSeq genome collection http://www.ncbi.nlm.nih.gov/genome represents all three major domains of life: Eukarya, Bacteria and Archaea as well as Viruses. Prokaryotic genome sequences are the most rapidly growing part of the collection. During the year of 2014 more than 10 000 microbial genome assemblies have been publicly released bringing the total number of prokaryotic genomes close to 30 000. We continue to improve the quality and usability of the microbial genome resources by providing easy access to the data and the results of the pre-computed analysis, and improving analysis and visualization tools. A number of improvements have been incorporated into the Prokaryotic Genome Annotation Pipeline. Several new features have been added to RefSeq prokaryotic genomes data processing pipeline including the calculation of genome groups (clades) and the optimization of protein clusters generation using pan-genome approach. PMID:25510495

  5. Insights into Conifer Giga-Genomes1

    PubMed Central

    De La Torre, Amanda R.; Birol, Inanc; Bousquet, Jean; Ingvarsson, Pär K.; Jansson, Stefan; Jones, Steven J.M.; Keeling, Christopher I.; MacKay, John; Nilsson, Ove; Ritland, Kermit; Street, Nathaniel; Yanchuk, Alvin; Zerbe, Philipp; Bohlmann, Jörg

    2014-01-01

    Insights from sequenced genomes of major land plant lineages have advanced research in almost every aspect of plant biology. Until recently, however, assembled genome sequences of gymnosperms have been missing from this picture. Conifers of the pine family (Pinaceae) are a group of gymnosperms that dominate large parts of the world’s forests. Despite their ecological and economic importance, conifers seemed long out of reach for complete genome sequencing, due in part to their enormous genome size (20–30 Gb) and the highly repetitive nature of their genomes. Technological advances in genome sequencing and assembly enabled the recent publication of three conifer genomes: white spruce (Picea glauca), Norway spruce (Picea abies), and loblolly pine (Pinus taeda). These genome sequences revealed distinctive features compared with other plant genomes and may represent a window into the past of seed plant genomes. This Update highlights recent advances, remaining challenges, and opportunities in light of the publication of the first conifer and gymnosperm genomes. PMID:25349325

  6. Genome size: a novel genomic signature in support of Afrotheria.

    PubMed

    Redi, Carlo Alberto; Garagna, Silvia; Zuccotti, Maurizio; Capanna, Ernesto

    2007-04-01

    Molecular phylogenetic analyses suggest an emerging phylogeny for the extant Placentalia (eutherian) that radically departs from morphologically based constructions of the past. Placental mammals are partitioned into four supraordinal clades: Afrotheria, Xenarthra, Laurasiatheria, and Euarchontoglires. Afrotheria form an endemic African clade that includes elephant shrews, golden moles, tenrecs, aardvarks, hyraxes, elephants, dugongs, and manatees. Datamining databases of genome size (GS) shows that till today just one afrotherian GS has been evaluated, that of the aardvark Orycteropus afer. We show that the GSs of six selected representatives across the Afrotheria supraordinal group are among the highest for the extant Placentalia, providing a novel genomic signature of this enigmatic group. The mean GS value of Afrotheria, 5.3 +/- 0.7 pg, is the highest reported for the extant Placentalia. This should assist in planning new genome sequencing initiatives. PMID:17479346

  7. Phylogenomic grouping of Listeria monocytogenes.

    PubMed

    Doijad, Swapnil; Weigel, Markus; Barbuddhe, Sukhadeo; Blom, Jochen; Goesmann, Alexander; Hain, Torsten; Chakraborty, Trinad

    2015-09-01

    The precise delineation of lineages and clonal groups are a prerequisite to examine within-species genetic variations, particularly with respect to pathogenic potential. A whole-genome-based approach was used to subtype and subgroup isolates of Listeria monocytogenes. Core-genome typing was performed, employing 3 different approaches: total core genes (CG), high-scoring segment pairs (HSPs), and average nucleotide identity (ANI). Examination of 113 L. monocytogenes genomes available in-house and in public domains revealed 33 phylogenomic groups (PGs). Each PG could be differentiated into a number of genomic types (GTs), depending on the approach used: HSPs (n = 57 GTs), CG (n = 71 GTs), and ANI (n = 83 GTs). Demarcation of the PGs was concordant with the 4 known lineages and led to the identification of sublineages in the lineage groups I, II, and III. In addition, PG assignments had discriminatory power similar to multi-virulence-locus sequence typing types and clonal complexes of multilocus sequence typing. Clustering of genomically highly similar isolates from different countries, sources, and isolation dates using whole-genome-based PG suggested that dispersion of phylogenomic clones of L. monocytogenes preceded their subsequent evolution. Classification according to PG may act as a guideline for future epidemiological studies. PMID:26245135

  8. Genomics - the new rock and roll?

    PubMed

    Dunham, I

    2000-10-01

    The end of the beginning of the Human Genome Project was announced on 26 June when the working draft or first assembly was announced. Here, Ian Dunham who led the group at the Sanger Centre that produced the first complete sequence of a human chromosome reflects on how it felt to be with the genome project from the beginning. PMID:11050332

  9. Exploring genomes for glycosyltransferases.

    PubMed

    Hansen, Sara Fasmer; Bettler, Emmanuel; Rinnan, Asmund; Engelsen, Søren B; Breton, Christelle

    2010-10-01

    Glycosyltransferases are one of the largest and most diverse enzyme groups in Nature. They catalyse the synthesis of glycosidic linkages by the transfer of a sugar residue from a donor to an acceptor substrate. These enzymes have been classified into families on the basis of amino acid sequence similarity that are kept updated in the Carbohydrate Active enZyme database (CAZy, ). The repertoire of glycosyltransferases in genomes is believed to determine the diversity of cellular glycan structures, and current estimates suggest that for most genomes about 1% of the coding regions are glycosyltransferases. However, plants tend to have far more glycosyltransferase genes than any other organism sequenced to date, and this can be explained by the highly complex polysaccharide network that form the cell wall and also by the numerous glycosylated secondary metabolites. In recent years, various bioinformatics strategies have been used to search bacterial and plant genomes for new glycosyltransferase genes. These are based on the use of remote homology detection methods that act at the 1D, 2D, and 3D level. The combined use of methods such as profile Hidden Markov Model (HMM) and fold recognition appears to be appropriate for this class of enzyme. Chemometric tools are also particularly well suited for obtaining an overview of multivariate data and revealing hidden latent information when dealing with large and highly complex datasets. PMID:20556308

  10. Group Counseling

    ERIC Educational Resources Information Center

    Mahler, Clarence A.

    1971-01-01

    This article reviews the major concerns of group counseling and differentiates among group guidance, group counseling, and group therapy. It also evaluates the research status of group counseling and presents implications for the future of this approach. Comment by Carl E. Thoresen follows. (Author)

  11. Genome Sequence of Spizellomyces punctatus

    PubMed Central

    Russ, Carsten; Lang, B. Franz; Chen, Zehua; Gujja, Sharvari; Shea, Terrance; Zeng, Qiandong; Young, Sarah; Nusbaum, Chad

    2016-01-01

    Spizellomyces punctatus is a basally branching chytrid fungus that is found in the Chytridiomycota phylum. Spizellomyces species are common in soil and of importance in terrestrial ecosystems. Here, we report the genome sequence of S. punctatus, which will facilitate the study of this group of early diverging fungi. PMID:27540072

  12. Genomes on ice.

    PubMed

    Parkhill, Julian

    2016-03-01

    This month's Genome Watch discusses the analysis of a Helicobacter pylori genome from the preserved Copper-Age mummy known as the Iceman and how ancient genomes shed light on the history of bacterial pathogens. PMID:26853114

  13. Whole Genome Sequencing

    MedlinePlus

    ... you want to learn. Search form Search Whole Genome Sequencing You are here Home Testing & Services Testing ... the full story, click here . What is whole genome sequencing? Whole genome sequencing is the mapping out ...

  14. Comparative genomic analyses in Asparagus.

    PubMed

    Kuhl, Joseph C; Havey, Michael J; Martin, William J; Cheung, Foo; Yuan, Qiaoping; Landherr, Lena; Hu, Yi; Leebens-Mack, James; Town, Christopher D; Sink, Kenneth C

    2005-12-01

    Garden asparagus (Asparagus officinalis L.) belongs to the monocot family Asparagaceae in the order Asparagales. Onion (Allium cepa L.) and Asparagus officinalis are 2 of the most economically important plants of the core Asparagales, a well supported monophyletic group within the Asparagales. Coding regions in onion have lower GC contents than the grasses. We compared the GC content of 3374 unique expressed sequence tags (ESTs) from A. officinalis with Lycoris longituba and onion (both members of the core Asparagales), Acorus americanus (sister to all other monocots), the grasses, and Arabidopsis. Although ESTs in A. officinalis and Acorus had a higher average GC content than Arabidopsis, Lycoris, and onion, all were clearly lower than the grasses. The Asparagaceae have the smallest nuclear genomes among all plants in the core Asparagales, which typically have huge genomes. Within the Asparagaceae, European Asparagus species have approximately twice the nuclear DNA of that of southern African Asparagus species. We cloned and sequenced 20 genomic amplicons from European A. officinalis and the southern African species Asparagus plumosus and observed no clear evidence for a recent genome doubling in A. officinalis relative to A. plumosus. These results indicate that members of the genus Asparagus with smaller genomes may be useful genomic models for plants in the core Asparagales. PMID:16391674

  15. Genome sequence of Coxiella burnetii strain Namibia

    PubMed Central

    2014-01-01

    We present the whole genome sequence and annotation of the Coxiella burnetii strain Namibia. This strain was isolated from an aborting goat in 1991 in Windhoek, Namibia. The plasmid type QpRS was confirmed in our work. Further genomic typing placed the strain into a unique genomic group. The genome sequence is 2,101,438 bp long and contains 1,979 protein-coding and 51 RNA genes, including one rRNA operon. To overcome the poor yield from cell culture systems, an additional DNA enrichment with whole genome amplification (WGA) methods was applied. We describe a bioinformatics pipeline for improved genome assembly including several filters with a special focus on WGA characteristics. PMID:25593636

  16. Complete Genome Sequence and Comparative Genomics of a Novel Myxobacterium Myxococcus hansupus

    PubMed Central

    Sharma, Gaurav; Narwani, Tarun; Subramanian, Srikrishna

    2016-01-01

    Myxobacteria, a group of Gram-negative aerobes, belong to the class δ-proteobacteria and order Myxococcales. Unlike anaerobic δ-proteobacteria, they exhibit several unusual physiogenomic properties like gliding motility, desiccation-resistant myxospores and large genomes with high coding density. Here we report a 9.5 Mbp complete genome of Myxococcus hansupus that encodes 7,753 proteins. Phylogenomic and genome-genome distance based analysis suggest that Myxococcus hansupus is a novel member of the genus Myxococcus. Comparative genome analysis with other members of the genus Myxococcus was performed to explore their genome diversity. The variation in number of unique proteins observed across different species is suggestive of diversity at the genus level while the overrepresentation of several Pfam families indicates the extent and mode of genome expansion as compared to non-Myxococcales δ-proteobacteria. PMID:26900859

  17. Reconciling ecological and genomic divergence among lineages of listeria under an ‘‘extended mosaic genome concept’’

    Technology Transfer Automated Retrieval System (TEKTRAN)

    There is growing evidence for a discontinuity between genomic and ecological divergence in several groups of bacteria. Such evidence is difficult to reconcile with the traditional “species genome concept”; i.e., the concept that genomes of ecologically divergent lineages maintain a cohesive gene po...

  18. Ensembl Genomes 2016: more genomes, more complexity.

    PubMed

    Kersey, Paul Julian; Allen, James E; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello-Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M; Howe, Kevin L; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M

    2016-01-01

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces. PMID:26578574

  19. Ensembl genomes 2016: more genomes, more complexity

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent...

  20. Ensembl Genomes 2016: more genomes, more complexity

    PubMed Central

    Kersey, Paul Julian; Allen, James E.; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J.; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J.; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K.; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D.; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello–Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M.; Howe, Kevin L.; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M.

    2016-01-01

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces. PMID:26578574

  1. Comparative Genomic and Phylogenomic Analyses Reveal a Conserved Core Genome Shared by Estuarine and Oceanic Cyanopodoviruses

    PubMed Central

    Huang, Sijun; Zhang, Si; Jiao, Nianzhi; Chen, Feng

    2015-01-01

    Podoviruses are among the major viral groups that infect marine picocyanobacteria Prochlorococcus and Synechococcus. Here, we reported the genome sequences of five Synechococcus podoviruses isolated from the estuarine environment, and performed comparative genomic and phylogenomic analyses based on a total of 20 cyanopodovirus genomes. The genomes of all the known marine cyanopodoviruses are highly syntenic. A pan-genome of 349 clustered orthologous groups was determined, among which 15 were core genes. These core genes make up nearly half of each genome in length, reflecting the high level of genome conservation among this cyanophage type. The whole genome phylogenies based on concatenated core genes and gene content were highly consistent and confirmed the separation of two discrete marine cyanopodovirus clusters MPP-A and MPP-B. The genomes within cluster MPP-B grouped into subclusters mainly corresponding to Prochlorococcus or Synechococcus host types. Auxiliary metabolic genes tend to occur in a specific phylogenetic group of these cyanopodoviruses. All the MPP-B phages analyzed here encode the photosynthesis gene psbA, which are absent in all the MPP-A genomes thus far. Interestingly, all the MPP-B and two MPP-A Synechococcus podoviruses encode the thymidylate synthase gene thyX, while at the same genome locus all the MPP-B Prochlorococcus podoviruses encode the transaldolase gene talC. Both genes are hypothesized to have the potential to facilitate the biosynthesis of deoxynucleotide for phage replication. Inheritance of specific functional genes could be important to the evolution and ecological fitness of certain cyanophage genotypes. Our analyses demonstrate that cyanopodoviruses of estuarine and oceanic origins share a conserved core genome and suggest that accessory genes may be related to environmental adaptation. PMID:26569403

  2. Whole-genome sequence of the Tibetan frog Nanorana parkeri and the comparative evolution of tetrapod genomes

    PubMed Central

    Sun, Yan-Bo; Xiong, Zi-Jun; Xiang, Xue-Yan; Liu, Shi-Ping; Zhou, Wei-Wei; Tu, Xiao-Long; Zhong, Li; Wang, Lu; Wu, Dong-Dong; Zhang, Bao-Lin; Zhu, Chun-Ling; Yang, Min-Min; Chen, Hong-Man; Li, Fang; Zhou, Long; Feng, Shao-Hong; Huang, Chao; Zhang, Guo-Jie; Irwin, David; Hillis, David M.; Murphy, Robert W.; Yang, Huan-Ming; Che, Jing; Wang, Jun; Zhang, Ya-Ping

    2015-01-01

    The development of efficient sequencing techniques has resulted in large numbers of genomes being available for evolutionary studies. However, only one genome is available for all amphibians, that of Xenopus tropicalis, which is distantly related from the majority of frogs. More than 96% of frogs belong to the Neobatrachia, and no genome exists for this group. This dearth of amphibian genomes greatly restricts genomic studies of amphibians and, more generally, our understanding of tetrapod genome evolution. To fill this gap, we provide the de novo genome of a Tibetan Plateau frog, Nanorana parkeri, and compare it to that of X. tropicalis and other vertebrates. This genome encodes more than 20,000 protein-coding genes, a number similar to that of Xenopus. Although the genome size of Nanorana is considerably larger than that of Xenopus (2.3 vs. 1.5 Gb), most of the difference is due to the respective number of transposable elements in the two genomes. The two frogs exhibit considerable conserved whole-genome synteny despite having diverged approximately 266 Ma, indicating a slow rate of DNA structural evolution in anurans. Multigenome synteny blocks further show that amphibians have fewer interchromosomal rearrangements than mammals but have a comparable rate of intrachromosomal rearrangements. Our analysis also identifies 11 Mb of anuran-specific highly conserved elements that will be useful for comparative genomic analyses of frogs. The Nanorana genome offers an improved understanding of evolution of tetrapod genomes and also provides a genomic reference for other evolutionary studies. PMID:25733869

  3. Whole-genome sequence of the Tibetan frog Nanorana parkeri and the comparative evolution of tetrapod genomes.

    PubMed

    Sun, Yan-Bo; Xiong, Zi-Jun; Xiang, Xue-Yan; Liu, Shi-Ping; Zhou, Wei-Wei; Tu, Xiao-Long; Zhong, Li; Wang, Lu; Wu, Dong-Dong; Zhang, Bao-Lin; Zhu, Chun-Ling; Yang, Min-Min; Chen, Hong-Man; Li, Fang; Zhou, Long; Feng, Shao-Hong; Huang, Chao; Zhang, Guo-Jie; Irwin, David; Hillis, David M; Murphy, Robert W; Yang, Huan-Ming; Che, Jing; Wang, Jun; Zhang, Ya-Ping

    2015-03-17

    The development of efficient sequencing techniques has resulted in large numbers of genomes being available for evolutionary studies. However, only one genome is available for all amphibians, that of Xenopus tropicalis, which is distantly related from the majority of frogs. More than 96% of frogs belong to the Neobatrachia, and no genome exists for this group. This dearth of amphibian genomes greatly restricts genomic studies of amphibians and, more generally, our understanding of tetrapod genome evolution. To fill this gap, we provide the de novo genome of a Tibetan Plateau frog, Nanorana parkeri, and compare it to that of X. tropicalis and other vertebrates. This genome encodes more than 20,000 protein-coding genes, a number similar to that of Xenopus. Although the genome size of Nanorana is considerably larger than that of Xenopus (2.3 vs. 1.5 Gb), most of the difference is due to the respective number of transposable elements in the two genomes. The two frogs exhibit considerable conserved whole-genome synteny despite having diverged approximately 266 Ma, indicating a slow rate of DNA structural evolution in anurans. Multigenome synteny blocks further show that amphibians have fewer interchromosomal rearrangements than mammals but have a comparable rate of intrachromosomal rearrangements. Our analysis also identifies 11 Mb of anuran-specific highly conserved elements that will be useful for comparative genomic analyses of frogs. The Nanorana genome offers an improved understanding of evolution of tetrapod genomes and also provides a genomic reference for other evolutionary studies. PMID:25733869

  4. Funding Opportunity: Genomic Data Centers

    Cancer.gov

    Funding Opportunity CCG, Funding Opportunity Center for Cancer Genomics, CCG, Center for Cancer Genomics, CCG RFA, Center for cancer genomics rfa, genomic data analysis network, genomic data analysis network centers,

  5. Enabling functional genomics with genome engineering

    PubMed Central

    Hilton, Isaac B.; Gersbach, Charles A.

    2015-01-01

    Advances in genome engineering technologies have made the precise control over genome sequence and regulation possible across a variety of disciplines. These tools can expand our understanding of fundamental biological processes and create new opportunities for therapeutic designs. The rapid evolution of these methods has also catalyzed a new era of genomics that includes multiple approaches to functionally characterize and manipulate the regulation of genomic information. Here, we review the recent advances of the most widely adopted genome engineering platforms and their application to functional genomics. This includes engineered zinc finger proteins, TALEs/TALENs, and the CRISPR/Cas9 system as nucleases for genome editing, transcription factors for epigenome editing, and other emerging applications. We also present current and potential future applications of these tools, as well as their current limitations and areas for future advances. PMID:26430154

  6. Group X

    SciTech Connect

    Fields, Susannah

    2007-08-16

    This project is currently under contract for research through the Department of Homeland Security until 2011. The group I was responsible for studying has to remain confidential so as not to affect the current project. All dates, reference links and authors, and other distinguishing characteristics of the original group have been removed from this report. All references to the name of this group or the individual splinter groups has been changed to 'Group X'. I have been collecting texts from a variety of sources intended for the use of recruiting and radicalizing members for Group X splinter groups for the purpose of researching the motivation and intent of leaders of those groups and their influence over the likelihood of group radicalization. This work included visiting many Group X websites to find information on splinter group leaders and finding their statements to new and old members. This proved difficult because the splinter groups of Group X are united in beliefs, but differ in public opinion. They are eager to tear each other down, prove their superiority, and yet remain anonymous. After a few weeks of intense searching, a list of eight recruiting texts and eight radicalizing texts from a variety of Group X leaders were compiled.

  7. Exploring Other Genomes: Bacteria.

    ERIC Educational Resources Information Center

    Flannery, Maura C.

    2001-01-01

    Points out the importance of genomes other than the human genome project and provides information on the identified bacterial genomes Pseudomonas aeuroginosa, Leprosy, Cholera, Meningitis, Tuberculosis, Bubonic Plague, and plant pathogens. Considers the computer's use in genome studies. (Contains 14 references.) (YDS)

  8. Navigating yeast genome maintenance with functional genomics.

    PubMed

    Measday, Vivien; Stirling, Peter C

    2016-03-01

    Maintenance of genome integrity is a fundamental requirement of all organisms. To address this, organisms have evolved extremely faithful modes of replication, DNA repair and chromosome segregation to combat the deleterious effects of an unstable genome. Nonetheless, a small amount of genome instability is the driver of evolutionary change and adaptation, and thus a low level of instability is permitted in populations. While defects in genome maintenance almost invariably reduce fitness in the short term, they can create an environment where beneficial mutations are more likely to occur. The importance of this fact is clearest in the development of human cancer, where genome instability is a well-established enabling characteristic of carcinogenesis. This raises the crucial question: what are the cellular pathways that promote genome maintenance and what are their mechanisms? Work in model organisms, in particular the yeast Saccharomyces cerevisiae, has provided the global foundations of genome maintenance mechanisms in eukaryotes. The development of pioneering genomic tools inS. cerevisiae, such as the systematic creation of mutants in all nonessential and essential genes, has enabled whole-genome approaches to identifying genes with roles in genome maintenance. Here, we review the extensive whole-genome approaches taken in yeast, with an emphasis on functional genomic screens, to understand the genetic basis of genome instability, highlighting a range of genetic and cytological screening modalities. By revealing the biological pathways and processes regulating genome integrity, these analyses contribute to the systems-level map of the yeast cell and inform studies of human disease, especially cancer. PMID:26323482

  9. Group Flow and Group Genius

    ERIC Educational Resources Information Center

    Sawyer, Keith

    2015-01-01

    Keith Sawyer views the spontaneous collaboration of group creativity and improvisation actions as "group flow," which organizations can use to function at optimum levels. Sawyer establishes ideal conditions for group flow: group goals, close listening, complete concentration, being in control, blending egos, equal participation, knowing…

  10. Genome size and longevity in fish.

    PubMed

    Griffith, O L; Moodie, G E E; Civetta, A

    2003-03-01

    The wide variety of genome sizes (measured as C-value) observed across taxa is not related to organismal complexity or number of coding genes. Partial answers to this C-value enigma have been found by establishing associations between C-value and particular phenotypic characteristics. One such controversial association has been recently suggested between genome size and longevity in birds. In order to determine whether genome size is a general predictor of longevity, we have extended the analysis to the Actinoptergyian fish, a widely divergent group in terms of both longevity and genome size. We collected data on genome size, longevity and body mass for species covering fourteen orders of bony fish. Analysis of covariance using order as a cofactor shows a significant effect of genome size on longevity (corrected for body mass), with lifespan increasing as a function of genome size. Analysis of phylogenetically independent contrasts for orders with a large number of species with a well resolved phylogenetic relationship (Acipenseriformes, Cypriniformes, and Salmoniformes) found the same trend of longer lifespan with increases in genome size but the relationship was not significant. Our results consistently show an increase in lifespan for fish with larger genomes. PMID:12581799

  11. Whole genome analysis of a Vietnamese trio.

    PubMed

    Hai, Dang Thanh; Thanh, Nguyen Dai; Trang, Pham Thi Minh; Quang, Le Si; Hang, Phan Thi Thu; Cuong, Dang Cao; Phuc, Hoang Kim; Duc, Nguyen Huu; Dong, Do Duc; Minh, Bui Quang; Son, Pham Bao; Vinh, Le Sy

    2015-03-01

    We here present the first whole genome analysis of an anonymous Kinh Vietnamese (KHV) trio whose genomes were deeply sequenced to 30-fold average coverage. The resulting short reads covered 99.91 percent of the human reference genome (GRCh37d5). We identified 4,719,412 SNPs and 827,385 short indels that satisfied the Mendelian inheritance law. Among them, 109,914 (2.3 percent) SNPs and 59,119 (7.1 percent) short indels were novel. We also detected 30,171 structural variants of which 27,604 (91.5 percent) were large indels. There were 6,681 large indels in the range 0.1-100 kbp occurring in the child genome that were also confirmed in either the father or mother genome. We compared these large indels against the DGV database and found that 1,499 (22.44 percent) were KHV specific. De novo assembly of high-quality unmapped reads yielded 789 contigs with the length greater than or equal to 300 bp. There were 235 contigs from the child genome of which 199 (84.7 percent) were significantly matched with at least one contig from the father or mother genome. Blasting these 199 contigs against other alternative human genomes revealed 4 novel contigs. The novel variants identified from our study demonstrated the necessity of conducting more genome-wide studies not only for Kinh but also for other ethnic groups in Vietnam. PMID:25740146

  12. Genome size evolution in pufferfish: an insight from BAC clone-based Diodon holocanthus genome sequencing

    PubMed Central

    2010-01-01

    Background Variations in genome size within and between species have been observed since the 1950 s in diverse taxonomic groups. Serving as model organisms, smooth pufferfish possess the smallest vertebrate genomes. Interestingly, spiny pufferfish from its sister family have genome twice as large as smooth pufferfish. Therefore, comparative genomic analysis between smooth pufferfish and spiny pufferfish is useful for our understanding of genome size evolution in pufferfish. Results Ten BAC clones of a spiny pufferfish Diodon holocanthus were randomly selected and shotgun sequenced. In total, 776 kb of non-redundant sequences without gap representing 0.1% of the D. holocanthus genome were identified, and 77 distinct genes were predicted. In the sequenced D. holocanthus genome, 364 kb is homologous with 265 kb of the Takifugu rubripes genome, and 223 kb is homologous with 148 kb of the Tetraodon nigroviridis genome. The repetitive DNA accounts for 8% of the sequenced D. holocanthus genome, which is higher than that in the T. rubripes genome (6.89%) and that in the Te. nigroviridis genome (4.66%). In the repetitive DNA, 76% is retroelements which account for 6% of the sequenced D. holocanthus genome and belong to known families of transposable elements. More than half of retroelements were distributed within genes. In the non-homologous regions, repeat element proportion in D. holocanthus genome increased to 10.6% compared with T. rubripes and increased to 9.19% compared with Te. nigroviridis. A comparison of 10 well-defined orthologous genes showed that the average intron size (566 bp) in D. holocanthus genome is significantly longer than that in the smooth pufferfish genome (435 bp). Conclusion Compared with the smooth pufferfish, D. holocanthus has a low gene density and repeat elements rich genome. Genome size variation between D. holocanthus and the smooth pufferfish exhibits as length variation between homologous region and different accumulation of non

  13. Isopermutation group

    SciTech Connect

    Muktibodh, A. S.

    2015-03-10

    The concept of ‘Isotopy’ as formulated by Ruggero Maria Santilli [1, 2, 3] plays a vital role in the development of Iso mathematics. Santilli defined iso-fields of characteristic zero. In this paper we extend this definition to define Iso-Galois fields [4] which are essentially of non-zero characteristic. Isotopically isomorphic realizations of a group define isopermutation group which gives a clear cut distinction between automorphic groups and isotopic groups.

  14. 78 FR 24223 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-04-24

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... applications. Place: National Human Genome Research Institute, 3rd floor Conf. Room 3146, 5635 Fishers...

  15. 76 FR 3643 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-01-20

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: January...

  16. 75 FR 26762 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-05-12

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS) Dated: May 3,...

  17. 75 FR 2148 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-01-14

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group, Genome Research Review... Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS)...

  18. 75 FR 52537 - National Human Genome Research Institute; Notice of Closed Meeting

    Federal Register 2010, 2011, 2012, 2013, 2014

    2010-08-26

    ... HUMAN SERVICES National Institutes of Health National Human Genome Research Institute; Notice of Closed... of Committee: National Human Genome Research Institute Initial Review Group; Genome Research Review... Domestic Assistance Program Nos. 93.172, Human Genome Research, National Institutes of Health, HHS)...

  19. Separation Group.

    ERIC Educational Resources Information Center

    Addington, Jean

    1992-01-01

    Describes eight-week short-term group designed to help separated or divorced men and women move through related adjustment phase in focused group setting. Discusses constructs that form the foundations of this short-term psychoeducational and support group and presents brief overview of psychological difficulties that occur as result of marital…

  20. Genome Maps, a new generation genome browser.

    PubMed

    Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín

    2013-07-01

    Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org. PMID:23748955

  1. Genome Maps, a new generation genome browser

    PubMed Central

    Medina, Ignacio; Salavert, Francisco; Sanchez, Rubén; de Maria, Alejandro; Alonso, Roberto; Escobar, Pablo; Bleda, Marta; Dopazo, Joaquín

    2013-01-01

    Genome browsers have gained importance as more genomes and related genomic information become available. However, the increase of information brought about by new generation sequencing technologies is, at the same time, causing a subtle but continuous decrease in the efficiency of conventional genome browsers. Here, we present Genome Maps, a genome browser that implements an innovative model of data transfer and management. The program uses highly efficient technologies from the new HTML5 standard, such as scalable vector graphics, that optimize workloads at both server and client sides and ensure future scalability. Thus, data management and representation are entirely carried out by the browser, without the need of any Java Applet, Flash or other plug-in technology installation. Relevant biological data on genes, transcripts, exons, regulatory features, single-nucleotide polymorphisms, karyotype and so forth, are imported from web services and are available as tracks. In addition, several DAS servers are already included in Genome Maps. As a novelty, this web-based genome browser allows the local upload of huge genomic data files (e.g. VCF or BAM) that can be dynamically visualized in real time at the client side, thus facilitating the management of medical data affected by privacy restrictions. Finally, Genome Maps can easily be integrated in any web application by including only a few lines of code. Genome Maps is an open source collaborative initiative available in the GitHub repository (https://github.com/compbio-bigdata-viz/genome-maps). Genome Maps is available at: http://www.genomemaps.org. PMID:23748955

  2. Galaxy groups

    SciTech Connect

    Brent Tully, R.

    2015-02-01

    Galaxy groups can be characterized by the radius of decoupling from cosmic expansion, the radius of the caustic of second turnaround, and the velocity dispersion of galaxies within this latter radius. These parameters can be a challenge to measure, especially for small groups with few members. In this study, results are gathered pertaining to particularly well-studied groups over four decades in group mass. Scaling relations anticipated from theory are demonstrated and coefficients of the relationships are specified. There is an update of the relationship between light and mass for groups, confirming that groups with mass of a few times 10{sup 12}M{sub ⊙} are the most lit up while groups with more and less mass are darker. It is demonstrated that there is an interesting one-to-one correlation between the number of dwarf satellites in a group and the group mass. There is the suggestion that small variations in the slope of the luminosity function in groups are caused by the degree of depletion of intermediate luminosity systems rather than variations in the number per unit mass of dwarfs. Finally, returning to the characteristic radii of groups, the ratio of first to second turnaround depends on the dark matter and dark energy content of the universe and a crude estimate can be made from the current observations of Ω{sub matter}∼0.15 in a flat topology, with a 68% probability of being less than 0.44.

  3. Galaxy Groups

    NASA Astrophysics Data System (ADS)

    Tully, R. Brent

    2015-02-01

    Galaxy groups can be characterized by the radius of decoupling from cosmic expansion, the radius of the caustic of second turnaround, and the velocity dispersion of galaxies within this latter radius. These parameters can be a challenge to measure, especially for small groups with few members. In this study, results are gathered pertaining to particularly well-studied groups over four decades in group mass. Scaling relations anticipated from theory are demonstrated and coefficients of the relationships are specified. There is an update of the relationship between light and mass for groups, confirming that groups with mass of a few times {{10}12}{{M}⊙ } are the most lit up while groups with more and less mass are darker. It is demonstrated that there is an interesting one-to-one correlation between the number of dwarf satellites in a group and the group mass. There is the suggestion that small variations in the slope of the luminosity function in groups are caused by the degree of depletion of intermediate luminosity systems rather than variations in the number per unit mass of dwarfs. Finally, returning to the characteristic radii of groups, the ratio of first to second turnaround depends on the dark matter and dark energy content of the universe and a crude estimate can be made from the current observations of {{Ω}matter}˜ 0.15 in a flat topology, with a 68% probability of being less than 0.44.

  4. Analysis of the biologic differences between porcine circovirus type 2 Group-1 and Group-2 viruses

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Phylogenetically, porcine circovirus type 2 (PCV2) isolates have been divided into two subgroups: Group-1 and Group-2. Although the genomic nucleotide sequences of Group-1 and Group-2 viruses differ by less than 5 percent, distinct amino acid residue motifs have been noted in the coding sequences o...

  5. Genomic definition of species. Revision 2

    SciTech Connect

    Crkvenjakov, R.; Drmanac, R.

    1993-03-01

    A genome is the sum total of the DNA sequences in the cells of an individual organism. The common usage that species possess genomes comes naturally to biochemists, who have shown that all protein and nucleic acid molecules are at the same time species- and individual-specific, with minor individual variations being superimposed on a consensus sequence that is constant for a species. By extension, this property is attributed to the common features of DNA in the chromosomes of members of a given species and is called species genome. Our proposal for the definition of a biological species is as follows: A species comprises a group of actual and potential biological organisms built according to a unique genome program that is recorded, and at least in part expressed, in the structures of their genomic nucleic acid molecule(s), having intragroup sequence differences which can be fully interconverted in the process of organismal reproduction.

  6. Nuclear DNA content in Sinningia (Gesneriaceae); intraspecific genome size variation and genome characterization in S. speciosa.

    PubMed

    Zaitlin, David; Pierce, Andrew J

    2010-12-01

    The Gesneriaceae (Lamiales) is a family of flowering plants comprising >3000 species of mainly tropical origin, the most familiar of which is the cultivated African violet (Saintpaulia spp.). Species of Gesneriaceae are poorly represented in the lists of taxa sampled for genome size estimation; measurements are available for three species of Ramonda and one each of Haberlea, Saintpaulia, and Streptocarpus, all species of Old World origin. We report here nuclear genome size estimates for 10 species of Sinningia, a neotropical genus largely restricted to Brazil. Flow cytometry of leaf cell nuclei showed that holoploid genome size in Sinningia is very small (approximately two times the size of the Arabidopsis genome), and is small compared to the other six species of Gesneriaceae with genome size estimates. We also documented intraspecific genome size variation of 21%-26% within a group of wild Sinningia speciosa (Lodd.) Hiern collections. In addition, we analyzed 1210 genome survey sequences from S. speciosa to characterize basic features of the nuclear genome such as guanine-cytosine content, types of repetitive elements, numbers of protein-coding sequences, and sequences unique to S. speciosa. We included several other angiosperm species as genome size standards, one of which was the snapdragon (Antirrhinum majus L.; Veronicaceae, Lamiales). Multiple measurements on three accessions indicated that the genome size of A. majus is ~633 × 10⁶ base pairs, which is approximately 40% of the previously published estimate. PMID:21164539

  7. Genomic Encyclopedia of Fungi

    SciTech Connect

    Grigoriev, Igor

    2012-08-10

    Genomes of fungi relevant to energy and environment are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 150 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such parts suggested by comparative genomics and functional analysis in these areas are presented here.

  8. JGI Fungal Genomics Program

    SciTech Connect

    Grigoriev, Igor V.

    2011-03-14

    Genomes of energy and environment fungi are in focus of the Fungal Genomic Program at the US Department of Energy Joint Genome Institute (JGI). Its key project, the Genomics Encyclopedia of Fungi, targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts), and explores fungal diversity by means of genome sequencing and analysis. Over 50 fungal genomes have been sequenced by JGI to date and released through MycoCosm (www.jgi.doe.gov/fungi), a fungal web-portal, which integrates sequence and functional data with genome analysis tools for user community. Sequence analysis supported by functional genomics leads to developing parts list for complex systems ranging from ecosystems of biofuel crops to biorefineries. Recent examples of such 'parts' suggested by comparative genomics and functional analysis in these areas are presented here

  9. Genomics and Health Impact Update

    MedlinePlus

    ... Genomics in Practice Newborn Screening Pharmacogenomics Reproductive Health Tools and Databases About the Genomics & Health Impact Update The Office of Public Health Genomics provides updated and credible ...

  10. Phylogenomics and the Dynamic Genome Evolution of the Genus Streptococcus

    PubMed Central

    Richards, Vincent P.; Palmer, Sara R.; Pavinski Bitar, Paulina D.; Qin, Xiang; Weinstock, George M.; Highlander, Sarah K.; Town, Christopher D.; Burne, Robert A.; Stanhope, Michael J.

    2014-01-01

    The genus Streptococcus comprises important pathogens that have a severe impact on human health and are responsible for substantial economic losses to agriculture. Here, we utilize 46 Streptococcus genome sequences (44 species), including eight species sequenced here, to provide the first genomic level insight into the evolutionary history and genetic basis underlying the functional diversity of all major groups of this genus. Gene gain/loss analysis revealed a dynamic pattern of genome evolution characterized by an initial period of gene gain followed by a period of loss, as the major groups within the genus diversified. This was followed by a period of genome expansion associated with the origins of the present extant species. The pattern is concordant with an emerging view that genomes evolve through a dynamic process of expansion and streamlining. A large proportion of the pan-genome has experienced lateral gene transfer (LGT) with causative factors, such as relatedness and shared environment, operating over different evolutionary scales. Multiple gene ontology terms were significantly enriched for each group, and mapping terms onto the phylogeny showed that those corresponding to genes born on branches leading to the major groups represented approximately one-fifth of those enriched. Furthermore, despite the extensive LGT, several biochemical characteristics have been retained since group formation, suggesting genomic cohesiveness through time, and that these characteristics may be fundamental to each group. For example, proteolysis: mitis group; urea metabolism: salivarius group; carbohydrate metabolism: pyogenic group; and transcription regulation: bovis group. PMID:24625962

  11. AGAPE (Automated Genome Analysis PipelinE) for Pan-Genome Analysis of Saccharomyces cerevisiae

    PubMed Central

    Song, Giltae; Dickins, Benjamin J. A.; Demeter, Janos; Engel, Stacia; Dunn, Barbara; Cherry, J. Michael

    2015-01-01

    The characterization and public release of genome sequences from thousands of organisms is expanding the scope for genetic variation studies. However, understanding the phenotypic consequences of genetic variation remains a challenge in eukaryotes due to the complexity of the genotype-phenotype map. One approach to this is the intensive study of model systems for which diverse sources of information can be accumulated and integrated. Saccharomyces cerevisiae is an extensively studied model organism, with well-known protein functions and thoroughly curated phenotype data. To develop and expand the available resources linking genomic variation with function in yeast, we aim to model the pan-genome of S. cerevisiae. To initiate the yeast pan-genome, we newly sequenced or re-sequenced the genomes of 25 strains that are commonly used in the yeast research community using advanced sequencing technology at high quality. We also developed a pipeline for automated pan-genome analysis, which integrates the steps of assembly, annotation, and variation calling. To assign strain-specific functional annotations, we identified genes that were not present in the reference genome. We classified these according to their presence or absence across strains and characterized each group of genes with known functional and phenotypic features. The functional roles of novel genes not found in the reference genome and associated with strains or groups of strains appear to be consistent with anticipated adaptations in specific lineages. As more S. cerevisiae strain genomes are released, our analysis can be used to collate genome data and relate it to lineage-specific patterns of genome evolution. Our new tool set will enhance our understanding of genomic and functional evolution in S. cerevisiae, and will be available to the yeast genetics and molecular biology community. PMID:25781462

  12. Development and characterization of genomics resources for leafy spurge: A model perennial weed for functional genomics studies

    Technology Transfer Automated Retrieval System (TEKTRAN)

    High throughput genomics approaches to study weed biology have so far been limited to a small number of research groups within the weed science community. In most cases, these groups have relied on heterologous approaches, since resources needed for functional genomics studies within desired species...

  13. Genome size diversity in orchids: consequences and evolution

    PubMed Central

    Leitch, I. J.; Kahandawala, I.; Suda, J.; Hanson, L.; Ingrouille, M. J.; Chase, M. W.; Fay, M. F.

    2009-01-01

    Background The amount of DNA comprising the genome of an organism (its genome size) varies a remarkable 40 000-fold across eukaryotes, yet most groups are characterized by much narrower ranges (e.g. 14-fold in gymnosperms, 3- to 4-fold in mammals). Angiosperms stand out as one of the most variable groups with genome sizes varying nearly 2000-fold. Nevertheless within angiosperms the majority of families are characterized by genomes which are small and vary little. Species with large genomes are mostly restricted to a few monocots families including Orchidaceae. Scope A survey of the literature revealed that genome size data for Orchidaceae are comparatively rare representing just 327 species. Nevertheless they reveal that Orchidaceae are currently the most variable angiosperm family with genome sizes ranging 168-fold (1C = 0·33–55·4 pg). Analysing the data provided insights into the distribution, evolution and possible consequences to the plant of this genome size diversity. Conclusions Superimposing the data onto the increasingly robust phylogenetic tree of Orchidaceae revealed how different subfamilies were characterized by distinct genome size profiles. Epidendroideae possessed the greatest range of genome sizes, although the majority of species had small genomes. In contrast, the largest genomes were found in subfamilies Cypripedioideae and Vanilloideae. Genome size evolution within this subfamily was analysed as this is the only one with reasonable representation of data. This approach highlighted striking differences in genome size and karyotype evolution between the closely related Cypripedium, Paphiopedilum and Phragmipedium. As to the consequences of genome size diversity, various studies revealed that this has both practical (e.g. application of genetic fingerprinting techniques) and biological consequences (e.g. affecting where and when an orchid may grow) and emphasizes the importance of obtaining further genome size data given the considerable

  14. Integrating sequence, evolution and functional genomics in regulatory genomics

    PubMed Central

    Vingron, Martin; Brazma, Alvis; Coulson, Richard; van Helden, Jacques; Manke, Thomas; Palin, Kimmo; Sand, Olivier; Ukkonen, Esko

    2009-01-01

    With genome analysis expanding from the study of genes to the study of gene regulation, 'regulatory genomics' utilizes sequence information, evolution and functional genomics measurements to unravel how regulatory information is encoded in the genome. PMID:19226437

  15. MaizeGDB's New Genome Browser Project

    Technology Transfer Automated Retrieval System (TEKTRAN)

    MaizeGDB (http://www.maizegdb.org) is the community database for maize genetics and genomics. Based upon the 2006 MaizeGDB Working Group Report (available at http://www.maizegdb.org/working_group.php) and the Allerton Report (http://www.maizegdb.org/AllertonReport.doc), it has become evident that th...

  16. Complete Genome Sequencing of Trivittatus virus

    PubMed Central

    Groseth, Allison; Vine, Veronica; Weisend, Carla; Ebihara, Hideki

    2015-01-01

    Trivittatus virus (family Bunyaviridae, genus Orthobunyavirus) represents an important genetic intermediate between the California encephalitis group, and Bwamba/Pongola and Nyando groups. Here, we report the first complete genome sequence of the prototype (Eklund) strain, isolated in 1948, which interestingly shows only few differences compared to partial sequences of modern strains. PMID:26212363

  17. Pathway and network analysis of cancer genomes.

    PubMed

    2015-07-01

    Genomic information on tumors from 50 cancer types cataloged by the International Cancer Genome Consortium (ICGC) shows that only a few well-studied driver genes are frequently mutated, in contrast to many infrequently mutated genes that may also contribute to tumor biology. Hence there has been large interest in developing pathway and network analysis methods that group genes and illuminate the processes involved. We provide an overview of these analysis techniques and show where they guide mechanistic and translational investigations. PMID:26125594

  18. Pathway and Network Analysis of Cancer Genomes

    PubMed Central

    Haider, Syed; Wu, Guanming; Shibata, Tatsuhiro; Vazquez, Miguel; Mustonen, Ville; Gonzalez-Perez, Abel; Pearson, John; Sander, Chris; Raphael, Benjamin J.; Marks, Debora S.; Ouellette, B.F. Francis; Valencia, Alfonso; Bader, Gary D.; Boutros, Paul C.; Stuart, Joshua M.; Linding, Rune; Lopez-Bigas, Nuria; Stein, Lincoln D.

    2016-01-01

    Genomic information on tumors from 50 cancer types catalogued by The International Cancer Genome Consortium (ICGC) shows that only few well-studied driver genes are frequently mutated, in contrast to many infrequently mutated genes that may also contribute to tumor biology. Hence there has been large interest in developing pathway and network analysis methods that group genes and illuminate the processes involved. We provide an overview of these analysis techniques and show where they guide mechanistic and translational investigations. PMID:26125594

  19. Genomic Data Commons | Office of Cancer Genomics

    Cancer.gov

    The NCI’s Center for Cancer Genomics launches the Genomic Data Commons (GDC), a unified data sharing platform for the cancer research community. The mission of the GDC is to enable data sharing across the entire cancer research community, to ultimately support precision medicine in oncology.

  20. Harvesting rice's dispensable genome.

    PubMed

    Wing, Rod A

    2015-01-01

    A rapid and cost-effective approach has been developed to harvest and map the dispensable genome, that is, population-level natural sequence variation within a species that is not present in static genome assemblies. PMID:26429765

  1. Libraries for genomic SELEX.

    PubMed Central

    Singer, B S; Shtatland, T; Brown, D; Gold, L

    1997-01-01

    An increasing number of proteins are being identified that regulate gene expression by binding specific nucleic acidsin vivo. A method termed genomic SELEX facilitates the rapid identification of networks of protein-nucleic acid interactions by identifying within the genomic sequences of an organism the highest affinity sites for any protein of the organism. As with its progenitor, SELEX of random-sequence nucleic acids, genomic SELEX involves iterative binding, partitioning, and amplification of nucleic acids. The two methods differ in that the variable region of the nucleic acid library for genomic SELEX is derived from the genome of an organism. We have used a quick and simple method to construct Escherichia coli, Saccharomyces cerevisiae, and human genomic DNA PCR libraries that can be transcribed with T7 RNA polymerase. We present evidence that the libraries contain overlapping inserts starting at most of the positions within the genome, making these libraries suitable for genomic SELEX. PMID:9016629

  2. Genomic Data Commons launches

    Cancer.gov

    The Genomic Data Commons (GDC), a unified data system that promotes sharing of genomic and clinical data between researchers, launched today with a visit from Vice President Joe Biden to the operations center at the University of Chicago.

  3. GENOMICS AND ENVIRONMENTAL RESEARCH

    EPA Science Inventory

    The impact of recently developed and emerging genomics technologies on environmental sciences has significant implications for human and ecological risk assessment issues. The linkage of data generated from genomics, transcriptomics, proteomics, metabalomics, and ecology can be ...

  4. Observing copepods through a genomic lens

    PubMed Central

    2011-01-01

    Background Copepods outnumber every other multicellular animal group. They are critical components of the world's freshwater and marine ecosystems, sensitive indicators of local and global climate change, key ecosystem service providers, parasites and predators of economically important aquatic animals and potential vectors of waterborne disease. Copepods sustain the world fisheries that nourish and support human populations. Although genomic tools have transformed many areas of biological and biomedical research, their power to elucidate aspects of the biology, behavior and ecology of copepods has only recently begun to be exploited. Discussion The extraordinary biological and ecological diversity of the subclass Copepoda provides both unique advantages for addressing key problems in aquatic systems and formidable challenges for developing a focused genomics strategy. This article provides an overview of genomic studies of copepods and discusses strategies for using genomics tools to address key questions at levels extending from individuals to ecosystems. Genomics can, for instance, help to decipher patterns of genome evolution such as those that occur during transitions from free living to symbiotic and parasitic lifestyles and can assist in the identification of genetic mechanisms and accompanying physiological changes associated with adaptation to new or physiologically challenging environments. The adaptive significance of the diversity in genome size and unique mechanisms of genome reorganization during development could similarly be explored. Genome-wide and EST studies of parasitic copepods of salmon and large EST studies of selected free-living copepods have demonstrated the potential utility of modern genomics approaches for the study of copepods and have generated resources such as EST libraries, shotgun genome sequences, BAC libraries, genome maps and inbred lines that will be invaluable in assisting further efforts to provide genomics tools for

  5. Exploiting the genome

    SciTech Connect

    Block, S.; Cornwall, J.; Dyson, F.; Koonin, S.; Lewis, N.; Schwitters, R.

    1998-09-11

    In 1997, JASON conducted a DOE-sponsored study of the human genome project with special emphasis on the areas of technology, quality assurance and quality control, and informatics. The present study has two aims: first, to update the 1997 Report in light of recent developments in genome sequencing technology, and second, to consider possible roles for the DOE in the ''post-genomic" era, following acquisition of the complete human genome sequence.

  6. Genomic and transcriptomic alterations following hybridisation and genome doubling in trigenomic allohexaploid Brassica carinata × Brassica rapa.

    PubMed

    Xu, Y; Zhao, Q; Mei, S; Wang, J

    2012-09-01

    Allopolyploidisation is a prominent evolutionary force that involves two major events: interspecific hybridisation and genome doubling. Both events have important functional consequences in shaping the genomic architecture of the neo-allopolyploids. The respective effects of hybridisation and genome doubling upon genomic and transcriptomic changes in Brassica allopolyploids are unresolved. In this study, amplified fragment length polymorphism (AFLP), methylation-sensitive amplification polymorphism (MSAP) and cDNA-AFLP approaches were used to track genetic, epigenetic and transcriptional changes in both allohexaploid Brassica (ArArBcBcCcCc genome) and triploid hybrids (ArBcCc genome). Results from these groups were compared with each other and also to their parents Brassica carinata (BBCC genome) and Brassica rapa (AA genome). Rapid and dramatic genetic, DNA methylation and gene expression changes were detected in the triploid hybrids. During the shift from triploidy to allohexaploidy, some of the hybridisation-induced alterations underwent reversion. Additionally, novel genetic, epigenetic and transcriptional alterations were also detected. The proportions of A-genome-specific DNA methylation and gene expression alterations were significantly greater than those of BC-genome-specific alterations in the triploid hybrids. However, the two parental genomes were equally affected during the ploidy shift. Hemi-CCG methylation changes induced by hybridisation were recovered after genome doubling. Full-CG methylation changes were a more general process initiated in the hybrid and continued after genome doubling. These results indicate that genome doubling could ameliorate genomic and transcriptomic alterations induced by hybridisation and instigate additional alterations in trigenomic Brassica allohexaploids. Moreover, genome doubling also modified hybridisation-induced progenitor genome-biased alterations and epigenetic alteration characteristics. PMID:22309095

  7. COMPARATIVE GENOMICS IN LEGUMES

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The legume plant family will soon include three sequenced genomes. The majority of the gene-containing portions of the model legumes Medicago truncatula and Lotus japonicus have been sequenced in clone-by-clone projects, and the sequencing of the soybean genome is underway in a whole-genome shotgun ...

  8. Whole Genome Selection

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Whole genome selection (WGS) is an approach to using DNA markers that are distributed throughout the entire genome. Genes affecting most economically-important traits are distributed throughout the genome and there are relatively few that have large effects with many more genes with progressively sm...

  9. Group Grammar

    ERIC Educational Resources Information Center

    Adams, Karen

    2015-01-01

    In this article Karen Adams demonstrates how to incorporate group grammar techniques into a classroom activity. In the activity, students practice using the target grammar to do something they naturally enjoy: learning about each other.

  10. Sequencing three crocodilian genomes to illuminate the evolution of archosaurs and amniotes

    PubMed Central

    2012-01-01

    The International Crocodilian Genomes Working Group (ICGWG) will sequence and assemble the American alligator (Alligator mississippiensis), saltwater crocodile (Crocodylus porosus) and Indian gharial (Gavialis gangeticus) genomes. The status of these projects and our planned analyses are described. PMID:22293439

  11. Population genomics of the symbiotic plasmids of sympatric nitrogen-fixing Rhizobium species associated with Phaseolus vulgaris.

    PubMed

    Pérez Carrascal, Olga M; VanInsberghe, David; Juárez, Soledad; Polz, Martin F; Vinuesa, Pablo; González, Víctor

    2016-09-01

    Cultivated common beans are the primary protein source for millions of people around the world who subsist on low-input agriculture, enabled by the symbiotic N2 -fixation these legumes perform in association with rhizobia. Within a single agricultural plot, multiple Rhizobium species can nodulate bean roots, but it is unclear how genetically isolated these species remain in sympatry. To better understand this issue, we sequenced and compared the genomes of 33 strains isolated from the rhizosphere and root nodules of a particular bean variety grown in the same agricultural plot. We found that the Rhizobium species we observed coexist with low genetic recombination across their core genomes. Accessory plasmids thought to be necessary for the saprophytic lifestyle in soil show similar levels of genetic isolation, but with higher rates of recombination than the chromosomes. However, the symbiotic plasmids are extremely similar, with high rates of recombination and do not appear to have co-evolved with the chromosome or accessory plasmids. Therefore, while Rhizobium species are genetically isolated units within the microbial community, a common symbiotic plasmid allows all Rhizobium species to engage in symbiosis with the same host in a single agricultural plot. PMID:27312778

  12. Mitochondrial respiration and genomic analysis provide insight into the influence of the symbiotic bacterium on host trypanosomatid oxygen consumption.

    PubMed

    Azevedo-Martins, A C; Machado, A C L; Klein, C C; Ciapina, L; Gonzaga, L; Vasconcelos, A T R; Sagot, M F; DE Souza, W; Einicker-Lamas, M; Galina, A; Motta, M C M

    2015-02-01

    Certain trypanosomatids co-evolve with an endosymbiotic bacterium in a mutualistic relationship that is characterized by intense metabolic exchanges. Symbionts were able to respire for up to 4 h after isolation from Angomonas deanei. FCCP (carbonyl cyanide-4-(trifluoromethoxy)phenylhydrazone) similarly increased respiration in wild-type and aposymbiotic protozoa, though a higher maximal O2 consumption capacity was observed in the symbiont-containing cells. Rotenone, a complex I inhibitor, did not affect A. deanei respiration, whereas TTFA (thenoyltrifluoroacetone), a complex II activity inhibitor, completely blocked respiration in both strains. Antimycin A and cyanide, inhibitors of complexes III and IV, respectively, abolished O2 consumption, but the aposymbiotic protozoa were more sensitive to both compounds. Oligomycin did not affect cell respiration, whereas carboxyatractyloside (CAT), an inhibitor of the ADP-ATP translocator, slightly reduced O2 consumption. In the A. deanei genome, sequences encoding most proteins of the respiratory chain are present. The symbiont genome lost part of the electron transport system (ETS), but complex I, a cytochrome d oxidase, and FoF1-ATP synthase remain. In conclusion, this work suggests that the symbiont influences the mitochondrial respiration of the host protozoan. PMID:25160925

  13. Tetranucleotide usage highlights genomic heterogeneity among mycobacteriophages

    PubMed Central

    Siranosian, Benjamin; Perera, Sudheesha; Williams, Edward; Ye, Chen; de Graffenried, Christopher; Shank, Peter

    2015-01-01

    Background The genomic sequences of mycobacteriophages, phages infecting mycobacterial hosts, are diverse and mosaic. Mycobacteriophages often share little nucleotide similarity, but most of them have been grouped into lettered clusters and further into subclusters. Traditionally, mycobacteriophage genomes are analyzed based on sequence alignment or knowledge of gene content. However, these approaches are computationally expensive and can be ineffective for significantly diverged sequences. As an alternative to alignment-based genome analysis, we evaluated tetranucleotide usage in mycobacteriophage genomes. These methods make it easier to characterize features of the mycobacteriophage population at many scales. Description We computed tetranucleotide usage deviation (TUD), the ratio of observed counts of 4-mers in a genome to the expected count under a null model. TUD values are comparable between members of a phage subcluster and distinct between subclusters. With few exceptions, neighbor joining phylogenetic trees and hierarchical clustering dendrograms constructed using TUD values place phages in a monophyletic clade with members of the same subcluster. Regions in a genome with exceptional TUD values can point to interesting features of genomic architecture. Finally, we found that subcluster B3 mycobacteriophages contain significantly overrepresented 4-mers and 6-mers that are atypical of phage genomes. Conclusions Statistics based on tetranucleotide usage support established clustering of mycobacteriophages and can uncover interesting relationships within and between sequenced phage genomes. These methods are efficient to compute and do not require sequence alignment or knowledge of gene content. The code to download mycobacteriophage genome sequences and reproduce our analysis is freely available at https://github.com/bsiranosian/tango_final. PMID:27134721

  14. Chromium and Genomic Stability

    PubMed Central

    Wise, Sandra S.; Wise, John Pierce

    2014-01-01

    Many metals serve as micronutrients which protect against genomic instability. Chromium is most abundant in its trivalent and hexavalent forms. Trivalent chromium has historically been considered an essential element, though recent data indicate that while it can have pharmacological effects and value, it is not essential. There are no data indicating that trivalent chromium promotes genomic stability and, instead may promote genomic instability. Hexavalent chromium is widely accepted as highly toxic and carcinogenic with no nutritional value. Recent data indicate that it causes genomic instability and also has no role in promoting genomic stability. PMID:22192535

  15. The Genomic Medicine Game.

    PubMed

    Tran, Elvis; de Andrés-Galiana, Enrique J; Benitez, Sonia; Martin-Sanchez, Fernando; Lopez-Campos, Guillermo H

    2016-01-01

    With advancements in genomics technology, health care has been improving and new paradigms of medicine such as genomic medicine have evolved. The education of clinicians, researchers and students to face the challenges posed by these new approaches, however, has been often lagging behind. From this the Genomic Medicine Game, an educational tool, was created for the purpose of conceptualizing the key components of Genomic Medicine. A number of phenotype-genotype associations were found through a literature review, which was used to be a base for the concepts the Genomic Medicine Game would focus on. Built in Java, the game was successfully tested with promising results. PMID:27577486

  16. Advancing Eucalyptus Genomics: Cytogenomics Reveals Conservation of Eucalyptus Genomes.

    PubMed

    Ribeiro, Teresa; Barrela, Ricardo M; Bergès, Hélène; Marques, Cristina; Loureiro, João; Morais-Cecílio, Leonor; Paiva, Jorge A P

    2016-01-01

    The genus Eucalyptus encloses several species with high ecological and economic value, being the subgenus Symphyomyrtus one of the most important. Species such as E. grandis and E. globulus are well characterized at the molecular level but knowledge regarding genome and chromosome organization is very scarce. Here we characterized and compared the karyotypes of three economically important species, E. grandis, E. globulus, and E. calmadulensis, and three with ecological relevance, E. pulverulenta, E. cornuta, and E. occidentalis, through an integrative approach including genome size estimation, fluorochrome banding, rDNA FISH, and BAC landing comprising genes involved in lignin biosynthesis. All karyotypes show a high degree of conservation with pericentromeric 35S and 5S rDNA loci in the first and third pairs, respectively. GC-rich heterochromatin was restricted to the 35S rDNA locus while the AT-rich heterochromatin pattern was species-specific. The slight differences in karyotype formulas and distribution of AT-rich heterochromatin, along with genome sizes estimations, support the idea of Eucalyptus genome evolution by local expansions of heterochromatin clusters. The unusual co-localization of both rDNA with AT-rich heterochromatin was attributed mainly to the presence of silent transposable elements in those loci. The cinnamoyl CoA reductase gene (CCR1) previously assessed to linkage group 10 (LG10) was clearly localized distally at the long arm of chromosome 9 establishing an unexpected correlation between the cytogenetic chromosome 9 and the LG10. Our work is novel and contributes to the understanding of Eucalyptus genome organization which is essential to develop successful advanced breeding strategies for this genus. PMID:27148332

  17. Advancing Eucalyptus Genomics: Cytogenomics Reveals Conservation of Eucalyptus Genomes

    PubMed Central

    Ribeiro, Teresa; Barrela, Ricardo M.; Bergès, Hélène; Marques, Cristina; Loureiro, João; Morais-Cecílio, Leonor; Paiva, Jorge A. P.

    2016-01-01

    The genus Eucalyptus encloses several species with high ecological and economic value, being the subgenus Symphyomyrtus one of the most important. Species such as E. grandis and E. globulus are well characterized at the molecular level but knowledge regarding genome and chromosome organization is very scarce. Here we characterized and compared the karyotypes of three economically important species, E. grandis, E. globulus, and E. calmadulensis, and three with ecological relevance, E. pulverulenta, E. cornuta, and E. occidentalis, through an integrative approach including genome size estimation, fluorochrome banding, rDNA FISH, and BAC landing comprising genes involved in lignin biosynthesis. All karyotypes show a high degree of conservation with pericentromeric 35S and 5S rDNA loci in the first and third pairs, respectively. GC-rich heterochromatin was restricted to the 35S rDNA locus while the AT-rich heterochromatin pattern was species-specific. The slight differences in karyotype formulas and distribution of AT-rich heterochromatin, along with genome sizes estimations, support the idea of Eucalyptus genome evolution by local expansions of heterochromatin clusters. The unusual co-localization of both rDNA with AT-rich heterochromatin was attributed mainly to the presence of silent transposable elements in those loci. The cinnamoyl CoA reductase gene (CCR1) previously assessed to linkage group 10 (LG10) was clearly localized distally at the long arm of chromosome 9 establishing an unexpected correlation between the cytogenetic chromosome 9 and the LG10. Our work is novel and contributes to the understanding of Eucalyptus genome organization which is essential to develop successful advanced breeding strategies for this genus. PMID:27148332

  18. Group dynamics.

    PubMed

    Scandiffio, A L

    1990-12-01

    Group dynamics play a significant role within any organization, culture, or unit. The important thing to remember with any of these structures is that they are made up of people--people with different ideas, motivations, background, and sometimes different agendas. Most groups, formal or informal, look for a leader in an effort to maintain cohesiveness of the unit. At times, that cultural bond must be developed; once developed, it must be nurtured. There are also times that one of the group no longer finds the culture comfortable and begins to act out behaviorally. It is these times that become trying for the leader as she or he attempts to remain objective when that which was once in the building phase of group cohesiveness starts to fall apart. At all times, the manager must continue to view the employee creating the disturbance as an integral part of the group. It is at this time that it is beneficial to perceive the employee exhibiting problem behaviors as a special employee, as one who needs the benefit of your experience and skills, as one who is still part of the group. It is also during this time that the manager should focus upon her or his own views in the area of power, communication, and the corporate culture of the unit that one has established before attempting to understand another's point of view. Once we understand our own motivation and accept ourselves, it is then that we may move on to offer assistance to another. Once we understand our insecurities recognizing staff dysfunction as a symptom of system dysfunction will not be so threatening to the concept of the manager that we perceive ourselves to be. It takes a secure person to admit that she or he favors staff before deciding to do something to change things. The important thing to know is that it can be done. The favored staff can find a new way of relating to others, the special employee can find new modes of behavior (and even find self-esteem in the process), the group can find new ways

  19. The Bluejay genome browser.

    PubMed

    Soh, Jung; Gordon, Paul M K; Sensen, Christoph W

    2012-03-01

    The Bluejay genome browser is a stand-alone visualization tool for the multi-scale viewing of annotated genomes and other genomic elements. Bluejay allows users to customize display features to suit their needs, and produces publication-quality graphics. Bluejay provides a multitude of ways to interrelate biological data at the genome scale. Users can load gene expression data into a genome display for expression visualization in context. Multiple genomes can be compared concurrently, including time series expression data, based on Gene Ontology labels. External, context-sensitive biological Web Services are linked to the displayed genomic elements ad hoc for in-depth genomic data analysis and interpretation. Users can mark multiple points of interest in a genome by creating waypoints, and exploit them for easy navigation of single or multiple genomes. Using this comprehensive visual environment, users can study a gene not just in relation to its genome, but also its transcriptome and evolutionary origins. Written in Java, Bluejay is platform-independent and is freely available from http://bluejay.ucalgary.ca. PMID:22389011

  20. Bacterial Genome Instability

    PubMed Central

    Darmon, Elise

    2014-01-01

    SUMMARY Bacterial genomes are remarkably stable from one generation to the next but are plastic on an evolutionary time scale, substantially shaped by horizontal gene transfer, genome rearrangement, and the activities of mobile DNA elements. This implies the existence of a delicate balance between the maintenance of genome stability and the tolerance of genome instability. In this review, we describe the specialized genetic elements and the endogenous processes that contribute to genome instability. We then discuss the consequences of genome instability at the physiological level, where cells have harnessed instability to mediate phase and antigenic variation, and at the evolutionary level, where horizontal gene transfer has played an important role. Indeed, this ability to share DNA sequences has played a major part in the evolution of life on Earth. The evolutionary plasticity of bacterial genomes, coupled with the vast numbers of bacteria on the planet, substantially limits our ability to control disease. PMID:24600039

  1. Variations in genome mass.

    PubMed

    Wachtel, S S; Tiersch, T R

    1993-02-01

    1. Genome size varies considerably among vertebrates, ranging from less than 1 pg to more than 200 pg; the amount of DNA differing among individuals in a population can equal the amount in the entire structural gene complement. 2. Recent technological advances permit evaluation of genome size variation at several levels including sub-chromosomal, chromosomal and cellular. 3. Genome size variation may also be viewed from taxonomic levels, and across evolutionary time frames. 4. As sources of genome size variation are identified and studied, the conundrum of the C-value paradox (lack of correlations among genome size, genomic complexity and phylogenetic status of organisms) may prove to be more apparent than real. 5. For example, the limited and relatively constant genome size of avians may be related to the physiological constraints of flight. PMID:8462275

  2. The genomes of root-knot nematodes.

    PubMed

    Bird, David McK; Williamson, Valerie M; Abad, Pierre; McCarter, James; Danchin, Etienne G J; Castagnone-Sereno, Philippe; Opperman, Charles H

    2009-01-01

    Plant-parasitic nematodes are the most destructive group of plant pathogens worldwide and are extremely challenging to control. The recent completion of two root-knot nematode genomes opens the way for a comparative genomics approach to elucidate the success of these parasites. Sequencing revealed that Meloidogyne hapla, a diploid that reproduces by facultative, meiotic parthenogenesis, encodes approximately 14,200 genes in a compact, 54 Mpb genome. Indeed, this is the smallest metazoan genome completed to date. By contrast, the 86 Mbp Meloidogyne incognita genome encodes approximately 19,200 genes. This species reproduces by obligate mitotic parthenogenesis and exhibits a complex pattern of aneuploidy. The genome includes triplicated regions and contains allelic pairs with exceptionally high degrees of sequence divergence, presumably reflecting adaptations to the strictly asexual reproductive mode. Both root-knot nematode genomes have compacted gene families compared with the free-living nematode Caenorhabditis elegans, and both encode large suites of enzymes that uniquely target the host plant. Acquisition of these genes, apparently via horizontal gene transfer, and their subsequent expansion and diversification point to the evolutionary history of these parasites. It also suggests new routes to their control. PMID:19400640

  3. A genomic perspective on hybridization and speciation

    PubMed Central

    Payseur, Bret A.; Rieseberg, Loren H.

    2016-01-01

    Hybridization among diverging lineages is common in nature. Genomic data provide a special opportunity to characterize the history of hybridization and the genetic basis of speciation. We review existing methods and empirical studies to identify recent advances in the genomics of hybridization, as well as issues that need to be addressed. Notable progress has been made in the development of methods for detecting hybridization and inferring individual ancestries. However, few approaches reconstruct the magnitude and timing of gene flow, estimate the fitness of hybrids or incorporate knowledge of recombination rate. Empirical studies indicate that the genomic consequences of hybridization are complex, including a highly heterogeneous landscape of differentiation. Inferred characteristics of hybridization differ substantially among species groups. Loci showing unusual patterns – which may contribute to reproductive barriers – are usually scattered throughout the genome, with potential enrichment in sex chromosomes and regions of reduced recombination. We caution against the growing trend of interpreting genomic variation in summary statistics across genomes as evidence of differential gene flow. We argue that converting genomic patterns into useful inferences about hybridization will ultimately require models and methods that directly incorporate key ingredients of speciation, including the dynamic nature of gene flow, selection acting in hybrid populations and recombination rate variation. PMID:26836441

  4. A Distance Measure for Genome Phylogenetic Analysis

    NASA Astrophysics Data System (ADS)

    Cao, Minh Duc; Allison, Lloyd; Dix, Trevor

    Phylogenetic analyses of species based on single genes or parts of the genomes are often inconsistent because of factors such as variable rates of evolution and horizontal gene transfer. The availability of more and more sequenced genomes allows phylogeny construction from complete genomes that is less sensitive to such inconsistency. For such long sequences, construction methods like maximum parsimony and maximum likelihood are often not possible due to their intensive computational requirement. Another class of tree construction methods, namely distance-based methods, require a measure of distances between any two genomes. Some measures such as evolutionary edit distance of gene order and gene content are computational expensive or do not perform well when the gene content of the organisms are similar. This study presents an information theoretic measure of genetic distances between genomes based on the biological compression algorithm expert model. We demonstrate that our distance measure can be applied to reconstruct the consensus phylogenetic tree of a number of Plasmodium parasites from their genomes, the statistical bias of which would mislead conventional analysis methods. Our approach is also used to successfully construct a plausible evolutionary tree for the γ-Proteobacteria group whose genomes are known to contain many horizontally transferred genes.

  5. Group Learning.

    ERIC Educational Resources Information Center

    Black, Susan

    1992-01-01

    Research suggests that cooperative learning works best when students are first taught group-processing skills, such as leadership, decision making, communication, trust building, and conflict management. Inadequate teacher training and boring assignments can torpedo cooperative learning efforts. Administrators should reassure teachers with…

  6. Molecular Pathology: Prognostic and Diagnostic Genomic Markers for Myeloid Neoplasms.

    PubMed

    Kuo, Frank C

    2016-09-01

    Application of next-generation sequencing (NGS) on myeloid neoplasms has expanded our knowledge of genomic alterations in this group of diseases. Genomic alterations in myeloid neoplasms are complex, heterogeneous, and not specific to a disease entity. NGS-based panel testing of myeloid neoplasms can complement existing diagnostic modalities and is gaining acceptance in the clinics and diagnostic laboratories. Prospective, randomized trials to evaluate the prognostic significance of genomic markers in myeloid neoplasms are under way in academic medical centers. PMID:27523973

  7. Characterization of fHbp, nhba (gna2132), nadA, porA, Sequence Type (ST), and Genomic Presence of IS1301 in Group B Meningococcal ST269 Clonal Complex Isolates from England and Wales▿

    PubMed Central

    Lucidarme, Jay; Comanducci, Maurizio; Findlow, Jamie; Gray, Stephen J.; Kaczmarski, Edward B.; Guiver, Malcolm; Kugelberg, Elisabeth; Vallely, Pamela J.; Oster, Philipp; Pizza, Mariagrazia; Bambini, Stefania; Muzzi, Alessandro; Tang, Christoph M.; Borrow, Ray

    2009-01-01

    Highly effective glycoconjugate vaccines exist against four of the five major pathogenic groups of meningococci: A, C, W-135, and Y. An equivalent vaccine against group B meningococci (menB) has remained elusive due to the poorly immunogenic capsular polysaccharide. A promising alternative, the investigational recombinant menB (rMenB)- outer membrane vesicle (OMV) vaccine, contains fHBP, NHBA (previously GNA2132), NadA, and outer membrane vesicles (OMVs) from the New Zealand MeNZB vaccine. MenB currently accounts for 90% of meningococcal disease in England and Wales, where the multilocus sequence type (ST) 269 (ST269) clonal complex (cc269) has recently expanded to account for a third of menB cases. To assess the potential cc269 coverage of the rMenB-OMV vaccine, English and Welsh cc269 isolates from the past decade were genetically characterized with respect to fHBP, NHBA, and NadA. All of the isolates harbored fHbp and nhba alleles, while 98% of the cc269 isolates were devoid of nadA. Subvariant profiling of fHbp, nhba, and porA against STs revealed the presence of two broadly distinct and well-defined clusters of isolates, centered around ST269 and ST275, respectively. An additional molecular marker, insertion sequence IS1301, was found to be present in 100% and <2% of isolates of the respective clusters. On the basis of the genetic data, the potential rMenB-OMV coverage of cc269 in England and Wales is high (up to 100%) within both clusters. Expression studies and serum bactericidal antibody assays will serve to enhance predictions of coverage and will augment ongoing studies regarding the significance of IS1301 within the ST269 cluster. PMID:19759227

  8. Joint assembly and genetic mapping of the Atlantic horseshoe crab genome reveals ancient whole genome duplication

    PubMed Central

    2014-01-01

    Background Horseshoe crabs are marine arthropods with a fossil record extending back approximately 450 million years. They exhibit remarkable morphological stability over their long evolutionary history, retaining a number of ancestral arthropod traits, and are often cited as examples of “living fossils.” As arthropods, they belong to the Ecdysozoa, an ancient super-phylum whose sequenced genomes (including insects and nematodes) have thus far shown more divergence from the ancestral pattern of eumetazoan genome organization than cnidarians, deuterostomes and lophotrochozoans. However, much of ecdysozoan diversity remains unrepresented in comparative genomic analyses. Results Here we apply a new strategy of combined de novo assembly and genetic mapping to examine the chromosome-scale genome organization of the Atlantic horseshoe crab, Limulus polyphemus. We constructed a genetic linkage map of this 2.7 Gbp genome by sequencing the nuclear DNA of 34 wild-collected, full-sibling embryos and their parents at a mean redundancy of 1.1x per sample. The map includes 84,307 sequence markers grouped into 1,876 distinct genetic intervals and 5,775 candidate conserved protein coding genes. Conclusions Comparison with other metazoan genomes shows that the L. polyphemus genome preserves ancestral bilaterian linkage groups, and that a common ancestor of modern horseshoe crabs underwent one or more ancient whole genome duplications 300 million years ago, followed by extensive chromosome fusion. These results provide a counter-example to the often noted correlation between whole genome duplication and evolutionary radiations. The new, low-cost genetic mapping method for obtaining a chromosome-scale view of non-model organism genomes that we demonstrate here does not require laboratory culture, and is potentially applicable to a broad range of other species. PMID:24987520

  9. Genomic Evolution of 11 Type Strains within Family Planctomycetaceae

    PubMed Central

    Zhou, Yizhuang; Yang, Linfeng; Liu, Tianxiang; Yang, Jinlong; Chen, Yanling; Su, Longxiang; Xu, Jin; Chen, Jing; Liu, Feng; Chen, Jiapeng; Dai, Wenkui; Ni, Peixiang; Fang, Chengxiang; Yang, Ruifu

    2014-01-01

    The species in family Planctomycetaceae are ideal groups for investigating the origin of eukaryotes. Their cells are divided by a lipidic intracytoplasmic membrane and they share a number of eukaryote-like molecular characteristics. However, their genomic structures, potential abilities, and evolutionary status are still unknown. In this study, we searched for common protein families and a core genome/pan genome based on 11 sequenced species in family Planctomycetaceae. Then, we constructed phylogenetic tree based on their 832 common protein families. We also annotated the 11 genomes using the Clusters of Orthologous Groups database. Moreover, we predicted and reconstructed their core/pan metabolic pathways using the KEGG (Kyoto Encyclopedia of Genes and Genomes) orthology system. Subsequently, we identified genomic islands (GIs) and structural variations (SVs) among the five complete genomes and we specifically investigated the integration of two Planctomycetaceae plasmids in all 11 genomes. The results indicate that Planctomycetaceae species share diverse genomic variations and unique genomic characteristics, as well as have huge potential for human applications. PMID:24489782

  10. Genomic and Genetic Diversity within the Pseudomonas fluorescens Complex

    PubMed Central

    Garrido-Sanz, Daniel; Meier-Kolthoff, Jan P.; Göker, Markus; Martín, Marta; Rivilla, Rafael; Redondo-Nieto, Miguel

    2016-01-01

    The Pseudomonas fluorescens complex includes Pseudomonas strains that have been taxonomically assigned to more than fifty different species, many of which have been described as plant growth-promoting rhizobacteria (PGPR) with potential applications in biocontrol and biofertilization. So far the phylogeny of this complex has been analyzed according to phenotypic traits, 16S rDNA, MLSA and inferred by whole-genome analysis. However, since most of the type strains have not been fully sequenced and new species are frequently described, correlation between taxonomy and phylogenomic analysis is missing. In recent years, the genomes of a large number of strains have been sequenced, showing important genomic heterogeneity and providing information suitable for genomic studies that are important to understand the genomic and genetic diversity shown by strains of this complex. Based on MLSA and several whole-genome sequence-based analyses of 93 sequenced strains, we have divided the P. fluorescens complex into eight phylogenomic groups that agree with previous works based on type strains. Digital DDH (dDDH) identified 69 species and 75 subspecies within the 93 genomes. The eight groups corresponded to clustering with a threshold of 31.8% dDDH, in full agreement with our MLSA. The Average Nucleotide Identity (ANI) approach showed inconsistencies regarding the assignment to species and to the eight groups. The small core genome of 1,334 CDSs and the large pan-genome of 30,848 CDSs, show the large diversity and genetic heterogeneity of the P. fluorescens complex. However, a low number of strains were enough to explain most of the CDSs diversity at core and strain-specific genomic fractions. Finally, the identification and analysis of group-specific genome and the screening for distinctive characters revealed a phylogenomic distribution of traits among the groups that provided insights into biocontrol and bioremediation applications as well as their role as PGPR. PMID:26915094

  11. Genomic and Genetic Diversity within the Pseudomonas fluorescens Complex.

    PubMed

    Garrido-Sanz, Daniel; Meier-Kolthoff, Jan P; Göker, Markus; Martín, Marta; Rivilla, Rafael; Redondo-Nieto, Miguel

    2016-01-01

    The Pseudomonas fluorescens complex includes Pseudomonas strains that have been taxonomically assigned to more than fifty different species, many of which have been described as plant growth-promoting rhizobacteria (PGPR) with potential applications in biocontrol and biofertilization. So far the phylogeny of this complex has been analyzed according to phenotypic traits, 16S rDNA, MLSA and inferred by whole-genome analysis. However, since most of the type strains have not been fully sequenced and new species are frequently described, correlation between taxonomy and phylogenomic analysis is missing. In recent years, the genomes of a large number of strains have been sequenced, showing important genomic heterogeneity and providing information suitable for genomic studies that are important to understand the genomic and genetic diversity shown by strains of this complex. Based on MLSA and several whole-genome sequence-based analyses of 93 sequenced strains, we have divided the P. fluorescens complex into eight phylogenomic groups that agree with previous works based on type strains. Digital DDH (dDDH) identified 69 species and 75 subspecies within the 93 genomes. The eight groups corresponded to clustering with a threshold of 31.8% dDDH, in full agreement with our MLSA. The Average Nucleotide Identity (ANI) approach showed inconsistencies regarding the assignment to species and to the eight groups. The small core genome of 1,334 CDSs and the large pan-genome of 30,848 CDSs, show the large diversity and genetic heterogeneity of the P. fluorescens complex. However, a low number of strains were enough to explain most of the CDSs diversity at core and strain-specific genomic fractions. Finally, the identification and analysis of group-specific genome and the screening for distinctive characters revealed a phylogenomic distribution of traits among the groups that provided insights into biocontrol and bioremediation applications as well as their role as PGPR. PMID:26915094

  12. Multiple genome alignment for identifying the core structure among moderately related microbial genomes

    PubMed Central

    Uchiyama, Ikuo

    2008-01-01

    Background Identifying the set of intrinsically conserved genes, or the genomic core, among related genomes is crucial for understanding prokaryotic genomes where horizontal gene transfers are common. Although core genome identification appears to be obvious among very closely related genomes, it becomes more difficult when more distantly related genomes are compared. Here, we consider the core structure as a set of sufficiently long segments in which gene orders are conserved so that they are likely to have been inherited mainly through vertical transfer, and developed a method for identifying the core structure by finding the order of pre-identified orthologous groups (OGs) that maximally retains the conserved gene orders. Results The method was applied to genome comparisons of two well-characterized families, Bacillaceae and Enterobacteriaceae, and identified their core structures comprising 1438 and 2125 OGs, respectively. The core sets contained most of the essential genes and their related genes, which were primarily included in the intersection of the two core sets comprising around 700 OGs. The definition of the genomic core based on gene order conservation was demonstrated to be more robust than the simpler approach based only on gene conservation. We also investigated the core structures in terms of G+C content homogeneity and phylogenetic congruence, and found that the core genes primarily exhibited the expected characteristic, i.e., being indigenous and sharing the same history, more than the non-core genes. Conclusion The results demonstrate that our strategy of genome alignment based on gene order conservation can provide an effective approach to identify the genomic core among moderately related microbial genomes. PMID:18976470

  13. Use of Whole Genome Sequence Data To Infer Baculovirus Phylogeny

    PubMed Central

    Herniou, Elisabeth A.; Luque, Teresa; Chen, Xinwen; Vlak, Just M.; Winstanley, Doreen; Cory, Jennifer S.; O'Reilly, David R.

    2001-01-01

    Several phylogenetic methods based on whole genome sequence data were evaluated using data from nine complete baculovirus genomes. The utility of three independent character sets was assessed. The first data set comprised the sequences of the 63 genes common to these viruses. The second set of characters was based on gene order, and phylogenies were inferred using both breakpoint distance analysis and a novel method developed here, termed neighbor pair analysis. The third set recorded gene content by scoring gene presence or absence in each genome. All three data sets yielded phylogenies supporting the separation of the Nucleopolyhedrovirus (NPV) and Granulovirus (GV) genera, the division of the NPVs into groups I and II, and species relationships within group I NPVs. Generation of phylogenies based on the combined sequences of all 63 shared genes proved to be the most effective approach to resolving the relationships among the group II NPVs and the GVs. The history of gene acquisitions and losses that have accompanied baculovirus diversification was visualized by mapping the gene content data onto the phylogenetic tree. This analysis highlighted the fluid nature of baculovirus genomes, with evidence of frequent genome rearrangements and multiple gene content changes during their evolution. Of more than 416 genes identified in the genomes analyzed, only 63 are present in all nine genomes, and 200 genes are found only in a single genome. Despite this fluidity, the whole genome-based methods we describe are sufficiently powerful to recover the underlying phylogeny of the viruses. PMID:11483757

  14. Underrepresented groups

    NASA Technical Reports Server (NTRS)

    Peters, David A.

    1990-01-01

    The problem with the shortage of under represented groups in science and engineering is absolutely crucial, especially considering that U.S. will experience a shortage of 560,000 science and engineering personnel by the year 2010. Most studies by the National Science Foundation also concluded that projected shortages cannot be alleviated without significant increases in the involvement of Blacks, Hispanics, Native Americans, handicapped persons, and women.

  15. Cantor Groups

    ERIC Educational Resources Information Center

    Mathes, Ben; Dow, Chris; Livshits, Leo

    2011-01-01

    The Cantor subset of the unit interval [0, 1) is "large" in cardinality and also "large" algebraically, that is, the smallest subgroup of [0, 1) generated by the Cantor set (using addition mod 1 as the group operation) is the whole of [0, 1). In this paper, we show how to construct Cantor-like sets which are "large" in cardinality but "small"…

  16. Figure 2 from Integrative Genomics Viewer: Visualizing Big Data | Office of Cancer Genomics

    Cancer.gov

    Grouping and sorting genomic data in IGV. The IGV user interface displaying 202 glioblastoma samples from TCGA. Samples are grouped by tumor subtype (second annotation column) and data type (first annotation column) and sorted by copy number of the EGFR locus (middle column). Adapted from Figure 1; Robinson et al. 2011

  17. A Comparison of 14 Erythrobacter Genomes Provides Insights into the Genomic Divergence and Scattered Distribution of Phototrophs

    PubMed Central

    Zheng, Qiang; Lin, Wenxin; Liu, Yanting; Chen, Chang; Jiao, Nianzhi

    2016-01-01

    Aerobic anoxygenic phototrophic bacteria (AAPB) are bacteriochlorophyll a (Bchl a)-containing microbial functional population. Erythrobacter is the first genus that was identified to contain AAPB species. Here, we compared 14 Erythrobacter genomes: seven phototrophic strains and seven non- phototrophic strains. Interestingly, AAPB strains are scattered in this genus based on their phylogenetic relationships. All 14 strains could be clustered into three groups based on phylo-genomic analysis, average genomic nucleotide identity and the phylogeny of signature genes (16S rRNA and virB4 genes). The AAPB strains were distributed in three groups, and gain and loss of phototrophic genes co-occurred in the evolutionary history of the genus Erythrobacter. The organization and structure of photosynthesis gene clusters (PGCs) in seven AAPB genomes displayed high synteny of major regions except for few insertions. The 14 Erythrobacter genomes had a large range of genome sizes, from 2.72 to 3.60 M, and the sizes of the core and pan- genomes were 1231 and 8170 orthologous clusters, respectively. Integrative and conjugative elements (ICEs) were frequently identified in genomes we studied, which might play significant roles in shaping or contributing to the pan-genome of Erythrobacter. Our findings suggest the ongoing evolutionary divergence of Erythrobacter genomes and the scattered distribution characteristic of PGC. PMID:27446024

  18. A Comparison of 14 Erythrobacter Genomes Provides Insights into the Genomic Divergence and Scattered Distribution of Phototrophs.

    PubMed

    Zheng, Qiang; Lin, Wenxin; Liu, Yanting; Chen, Chang; Jiao, Nianzhi

    2016-01-01

    Aerobic anoxygenic phototrophic bacteria (AAPB) are bacteriochlorophyll a (Bchl a)-containing microbial functional population. Erythrobacter is the first genus that was identified to contain AAPB species. Here, we compared 14 Erythrobacter genomes: seven phototrophic strains and seven non- phototrophic strains. Interestingly, AAPB strains are scattered in this genus based on their phylogenetic relationships. All 14 strains could be clustered into three groups based on phylo-genomic analysis, average genomic nucleotide identity and the phylogeny of signature genes (16S rRNA and virB4 genes). The AAPB strains were distributed in three groups, and gain and loss of phototrophic genes co-occurred in the evolutionary history of the genus Erythrobacter. The organization and structure of photosynthesis gene clusters (PGCs) in seven AAPB genomes displayed high synteny of major regions except for few insertions. The 14 Erythrobacter genomes had a large range of genome sizes, from 2.72 to 3.60 M, and the sizes of the core and pan- genomes were 1231 and 8170 orthologous clusters, respectively. Integrative and conjugative elements (ICEs) were frequently identified in genomes we studied, which might play significant roles in shaping or contributing to the pan-genome of Erythrobacter. Our findings suggest the ongoing evolutionary divergence of Erythrobacter genomes and the scattered distribution characteristic of PGC. PMID:27446024

  19. Quality scores for 32,000 genomes

    PubMed Central

    2014-01-01

    Background More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). We have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences. Results Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes had quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes. Conclusions The score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here. PMID:25780509

  20. Quality scores for 32,000 genomes

    SciTech Connect

    Land, Miriam L.; Hyatt, Doug; Jun, Se-Ran; Kora, Guruprasad H.; Hauser, Loren J.; Lukjancenko, Oksana; Ussery, David W.

    2014-12-08

    More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). In this study, we have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences. Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes had quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes. The score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Finally and unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here.

  1. Quality scores for 32,000 genomes

    DOE PAGESBeta

    Land, Miriam L.; Hyatt, Doug; Jun, Se-Ran; Kora, Guruprasad H.; Hauser, Loren J.; Lukjancenko, Oksana; Ussery, David W.

    2014-12-08

    More than 80% of the microbial genomes in GenBank are of ‘draft’ quality (12,553 draft vs. 2,679 finished, as of October, 2013). In this study, we have examined all the microbial DNA sequences available for complete, draft, and Sequence Read Archive genomes in GenBank as well as three other major public databases, and assigned quality scores for more than 30,000 prokaryotic genome sequences. Scores were assigned using four categories: the completeness of the assembly, the presence of full-length rRNA genes, tRNA composition and the presence of a set of 102 conserved genes in prokaryotes. Most (~88%) of the genomes hadmore » quality scores of 0.8 or better and can be safely used for standard comparative genomics analysis. We compared genomes across factors that may influence the score. We found that although sequencing depth coverage of over 100x did not ensure a better score, sequencing read length was a better indicator of sequencing quality. With few exceptions, most of the 30,000 genomes have nearly all the 102 essential genes. The score can be used to set thresholds for screening data when analyzing “all published genomes” and reference data is either not available or not applicable. The scores highlighted organisms for which commonly used tools do not perform well. This information can be used to improve tools and to serve a broad group of users as more diverse organisms are sequenced. Finally and unexpectedly, the comparison of predicted tRNAs across 15,000 high quality genomes showed that anticodons beginning with an ‘A’ (codons ending with a ‘U’) are almost non-existent, with the exception of one arginine codon (CGU); this has been noted previously in the literature for a few genomes, but not with the depth found here.« less

  2. The tiniest tiny genomes.

    PubMed

    Moran, Nancy A; Bennett, Gordon M

    2014-01-01

    Starting in 2006, surprisingly tiny genomes have been discovered from numerous bacterial symbionts of insect hosts. Despite their size, each retains some genes that enable provisioning of limiting nutrients or other capabilities required by hosts. Genome sequence analyses show that genome reduction is an ongoing process, resulting in a continuum of sizes, with the smallest genome currently known at 112 kilobases. Genome reduction is typical in host-restricted symbionts and pathogens, but the tiniest genomes are restricted to symbionts required by hosts and restricted to specialized host cells, resulting from long coevolution with hosts. Genes are lost in all functional categories, but core genes for central informational processes, including genes encoding ribosomal proteins, are mostly retained, whereas genes underlying production of cell envelope components are especially depleted. Thus, these entities retain cell-like properties but are heavily dependent on coadaptation of hosts, which continuously evolve to support the symbionts upon which they depend. PMID:24995872

  3. Genomic landscape of pancreatic neuroendocrine tumors

    PubMed Central

    Gebauer, Niklas; Schmidt-Werthern, Christian; Bernard, Veronica; Feller, Alfred C; Keck, Tobias; Begum, Nehara; Rades, Dirk; Lehnert, Hendrik; Brabant, Georg; Thorns, Christoph

    2014-01-01

    AIM: To investigate the prognostic role of genomic stability and copy number alterations (CNAs) pancreatic neuroendocrine tumors (PanNETs). METHODS: A high-resolution array-based comparative genomic hybridization approach was utilized in order to investigate and quantify chromosomal aberrations in a panel of 37 primary PanNET and 11 metastatic samples. DNA samples were extracted from formalin-fixed and paraffin-embedded tumor specimen. Genomic findings were correlated with histopathological and immunohistochemical data. Moreover, the dataset was subjected to employing an unsupervised hierarchical clustering analysis approach utilizing Euclidean distance and average linkage and associations between genomically defined tumor groups and recurrent CNAs or clinicopathological features of the study group were assessed. RESULTS: Numerous chromosomal aberrations were recurrently detected in both, primary tumor samples and metastases. Copy number gains were most frequently observed at 06p22.2-p22.1 (27.1%), 17p13.1 (20.8%), 07p21.3-p21.2 (18.8%), 09q34.11 (18.8%). Genomic losses were significantly less frequent and the only recurrent aberration affected 08q24.3 (6.3%). Moreover, we detected a high degree of genomic heterogeneity between primary tumors and metastatic lesions. Unsupervised hierarchical clustering of loci affected by CNAs in more than 3 primary tumor samples revealed two genetically distinct tumor groups as well as two chromosomal clusters of genomic imbalances indicating a small subset of tumors with common molecular features (13.5%). Aberrations affecting 6p22.2-22.1, 8q24.3, 9q34.11 and 17p13.1 (P = 0.011; 0.003; 0.003; 0.001), were significantly associated with a poorer survival prognosis. CONCLUSION: This study suggests that several frequent CNAs in numerous candidate regions are involved in the pathogenesis and metastatic progression of PanNET. PMID:25516664

  4. Querying genomic databases

    SciTech Connect

    Baehr, A.; Hagstrom, R.; Joerg, D.; Overbeek, R.

    1991-09-01

    A natural-language interface has been developed that retrieves genomic information by using a simple subset of English. The interface spares the biologist from the task of learning database-specific query languages and computer programming. Currently, the interface deals with the E. coli genome. It can, however, be readily extended and shows promise as a means of easy access to other sequenced genomic databases as well.

  5. Genome Aliquoting Revisited

    NASA Astrophysics Data System (ADS)

    Warren, Robert; Sankoff, David

    We prove that the genome aliquoting problem, the problem of finding a recent polyploid ancestor of a genome, with breakpoint distance can be solved in polynomial time. We propose an aliquoting algorithm that is a 2-approximation for the genome aliquoting problem with double cut and join distance, improving upon the previous best solution to this problem, Feijão and Meidanis' 4-approximation algorithm.

  6. Physician Assistant Genomic Competencies.

    PubMed

    Goldgar, Constance; Michaud, Ed; Park, Nguyen; Jenkins, Jean

    2016-09-01

    Genomic discoveries are increasingly being applied to the clinical care of patients. All physician assistants (PAs) need to acquire competency in genomics to provide the best possible care for patients within the scope of their practice. In this article, we present an updated version of PA genomic competencies and learning outcomes in a framework that is consistent with the current medical education guidelines and the collaborative nature of PAs in interprofessional health care teams. PMID:27490287

  7. Comparative and Functional Genomics in Identifying Aflatoxin Biosynthetic Genes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Identification of genes involved in aflatoxin biosynthesis through Aspergillus flavus genomics has been actively pursued. A. flavus Expressed Sequence Tags (EST’s) and whole genome sequencing have been completed. Groups of genes that are potentially involved in aflatoxin production have been profi...

  8. The International Cotton Genome Initiative: Opportunities and Challenges

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The International Cotton Genome Initiative (ICGI) was conceived at the dawn of this century by a group of cotton scientists and industry representatives who envisioned a need of expanded genomic resources and researches for sustainable cotton genetic improvement through enhanced collaboration and co...

  9. Linking environmental risk assessment and communication: An experiment in co-evolving scientific and social knowledge

    USGS Publications Warehouse

    Graffy, E.A.; Booth, N.L.

    2008-01-01

    Dissemination of information to decision-makers and enhanced methods of public participation are often put forward as antidotes to a perceived disconnect between risk assessment and risk communication in the public domain. However, mechanisms that support both the provision of routine, timely and relevant technical knowledge to the public and meaningful opportunities for public participation in the evaluation and management of risk are few. We argue for the need to re-conceptualise the institutional context in which risk research and communication occur as one in which scientific knowledge and public understanding are co-evolutionary instead of independent or sequential. Here, we report on an experiment to promote coevolution of environmental risk assessment and risk communication through the instrumental use of a web-based platform that dynamically links expert and public discourses through common information sources, linked scenario evaluations, and opportunities for iterative dialogue. On the basis of technical feasibility, research value and public communication capacity, we conclude that there is potential for further refinement of the methodologies presented here. Copyright ?? 2008 Inderscience Enterprises Ltd.

  10. Beyond Dyadic Interdependence: Actor-Oriented Models for Co-Evolving Social Networks and Individual Behaviors

    ERIC Educational Resources Information Center

    Burk, William J.; Steglich, Christian E. G.; Snijders, Tom A. B.

    2007-01-01

    Actor-oriented models are described as a longitudinal strategy for examining the co-evolution of social networks and individual behaviors. We argue that these models provide advantages over conventional approaches due to their ability to account for inherent dependencies between individuals embedded in a social network (i.e., reciprocity,…

  11. Filarial and Wolbachia genomics.

    PubMed

    Scott, A L; Ghedin, E; Nutman, T B; McReynolds, L A; Poole, C B; Slatko, B E; Foster, J M

    2012-01-01

    Filarial nematode parasites, the causative agents for a spectrum of acute and chronic diseases including lymphatic filariasis and river blindness, threaten the well-being and livelihood of hundreds of millions of people in the developing regions of the world. The 2007 publication on a draft assembly of the 95-Mb genome of the human filarial parasite Brugia malayi- representing the first helminth parasite genome to be sequenced - has been followed in rapid succession by projects that have resulted in the genome sequencing of six additional filarial species, seven nonfilarial nematode parasites of animals and nearly 30 plant parasitic and free-living species. Parallel to the genomic sequencing, transcriptomic and proteomic projects have facilitated genome annotation, expanded our understanding of stage-associated gene expression and provided a first look at the role of epigenetic regulation of filarial genomes through microRNAs. The expansion in filarial genomics will also provide a significant enrichment in our knowledge of the diversity and variability in the genomes of the endosymbiotic bacterium Wolbachia leading to a better understanding of the genetic principles that govern filarial-Wolbachia mutualism. The goal here is to provide an overview of the trends and advances in filarial and Wolbachia genomics. PMID:22098559

  12. Fungal Genomics Program

    SciTech Connect

    Grigoriev, Igor

    2012-03-12

    The JGI Fungal Genomics Program aims to scale up sequencing and analysis of fungal genomes to explore the diversity of fungi important for energy and the environment, and to promote functional studies on a system level. Combining new sequencing technologies and comparative genomics tools, JGI is now leading the world in fungal genome sequencing and analysis. Over 120 sequenced fungal genomes with analytical tools are available via MycoCosm (www.jgi.doe.gov/fungi), a web-portal for fungal biologists. Our model of interacting with user communities, unique among other sequencing centers, helps organize these communities, improves genome annotation and analysis work, and facilitates new larger-scale genomic projects. This resulted in 20 high-profile papers published in 2011 alone and contributing to the Genomics Encyclopedia of Fungi, which targets fungi related to plant health (symbionts, pathogens, and biocontrol agents) and biorefinery processes (cellulose degradation, sugar fermentation, industrial hosts). Our next grand challenges include larger scale exploration of fungal diversity (1000 fungal genomes), developing molecular tools for DOE-relevant model organisms, and analysis of complex systems and metagenomes.

  13. Genomics of Clostridium tetani.

    PubMed

    Brüggemann, Holger; Brzuszkiewicz, Elzbieta; Chapeton-Montes, Diana; Plourde, Lucile; Speck, Denis; Popoff, Michel R

    2015-05-01

    Genomic information about Clostridium tetani, the causative agent of the tetanus disease, is scarce. The genome of strain E88, a strain used in vaccine production, was sequenced about 10 years ago. One additional genome (strain 12124569) has recently been released. Here we report three new genomes of C. tetani and describe major differences among all five C. tetani genomes. They all harbor tetanus-toxin-encoding plasmids that contain highly conserved genes for TeNT (tetanus toxin), TetR (transcriptional regulator of TeNT) and ColT (collagenase), but substantially differ in other plasmid regions. The chromosomes share a large core genome that contains about 85% of all genes of a given chromosome. The non-core chromosome comprises mainly prophage-like genomic regions and genes encoding environmental interaction and defense functions (e.g. surface proteins, restriction-modification systems, toxin-antitoxin systems, CRISPR/Cas systems) and other fitness functions (e.g. transport systems, metabolic activities). This new genome information will help to assess the level of genome plasticity of the species C. tetani and provide the basis for detailed comparative studies. PMID:25638019

  14. Between two fern genomes.

    PubMed

    Sessa, Emily B; Banks, Jo Ann; Barker, Michael S; Der, Joshua P; Duffy, Aaron M; Graham, Sean W; Hasebe, Mitsuyasu; Langdale, Jane; Li, Fay-Wei; Marchant, D Blaine; Pryer, Kathleen M; Rothfels, Carl J; Roux, Stanley J; Salmi, Mari L; Sigel, Erin M; Soltis, Douglas E; Soltis, Pamela S; Stevenson, Dennis W; Wolf, Paul G

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves. PMID:25324969

  15. [Landscape and ecological genomics].

    PubMed

    2013-10-01

    Landscape genomics is the modern version of landscape genetics, a discipline that arose approximately 10 years ago as a combination of population genetics, landscape ecology, and spatial statistics. It studies the effects of environmental variables on gene flow and other microevolutionary processes that determine genetic connectivity and variations in populations. In contrast to population genetics, it operates at the level of individual specimens rather than at the level of population samples. Another important difference between landscape genetics and genomics and population genetics is that, in the former, the analysis of gene flow and local adaptations takes quantitative account of landforms and features of the matrix, i.e., hostile spaces that separate species habitats. Landscape genomics is a part of population ecogenomics, which, along with community genomics, is a major part of ecological genomics. One of the principal purposes of landscape genomics is the identification and differentiation of various genome-wide and locus-specific effects. The approaches and computation tools developed for combined analysis of genomic and landscape variables make it possible to detect adaptation-related genome fragments, which facilitates the planning of conservation efforts and the prediction of species' fate in response to expected changes in the environment. PMID:25508669

  16. [Landscape and ecological genomics].

    PubMed

    Tetushkin, E Ia

    2013-10-01

    Landscape genomics is the modern version of landscape genetics, a discipline that arose approximately 10 years ago as a combination of population genetics, landscape ecology, and spatial statistics. It studies the effects of environmental variables on gene flow and other microevolutionary processes that determine genetic connectivity and variations in populations. In contrast to population genetics, it operates at the level of individual specimens rather than at the level of population samples. Another important difference between landscape genetics and genomics and population genetics is that, in the former, the analysis of gene flow and local adaptations takes quantitative account of landforms and features of the matrix, i.e., hostile spaces that separate species habitats. Landscape genomics is a part of population ecogenomics, which, along with community genomics, is a major part of ecological genomics. One of the principal purposes of landscape genomics is the identification and differentiation of various genome-wide and locus-specific effects. The approaches and computation tools developed for combined analysis of genomic and landscape variables make it possible to detect adaptation-related genome fragments, which facilitates the planning of conservation efforts and the prediction of species' fate in response to expected changes in the environment. PMID:25474890

  17. Between Two Fern Genomes

    PubMed Central

    2014-01-01

    Ferns are the only major lineage of vascular plants not represented by a sequenced nuclear genome. This lack of genome sequence information significantly impedes our ability to understand and reconstruct genome evolution not only in ferns, but across all land plants. Azolla and Ceratopteris are ideal and complementary candidates to be the first ferns to have their nuclear genomes sequenced. They differ dramatically in genome size, life history, and habit, and thus represent the immense diversity of extant ferns. Together, this pair of genomes will facilitate myriad large-scale comparative analyses across ferns and all land plants. Here we review the unique biological characteristics of ferns and describe a number of outstanding questions in plant biology that will benefit from the addition of ferns to the set of taxa with sequenced nuclear genomes. We explain why the fern clade is pivotal for understanding genome evolution across land plants, and we provide a rationale for how knowledge of fern genomes will enable progress in research beyond the ferns themselves. PMID:25324969

  18. Timing and Scope of Genomic Expansion within Annelida: Evidence from Homeoboxes in the Genome of the Earthworm Eisenia fetida.

    PubMed

    Zwarycz, Allison S; Nossa, Carlos W; Putnam, Nicholas H; Ryan, Joseph F

    2016-01-01

    Annelida represents a large and morphologically diverse group of bilaterian organisms. The recently published polychaete and leech genome sequences revealed an equally dynamic range of diversity at the genomic level. The availability of more annelid genomes will allow for the identification of evolutionary genomic events that helped shape the annelid lineage and better understand the diversity within the group. We sequenced and assembled the genome of the common earthworm, Eisenia fetida. As a first pass at understanding the diversity within the group, we classified 363 earthworm homeoboxes and compared them with those of the leech Helobdella robusta and the polychaete Capitella teleta. We inferred many gene expansions occurring in the lineage connecting the most recent common ancestor (MRCA) of Capitella and Eisenia to the Eisenia/Helobdella MRCA. Likewise, the lineage leading from the Eisenia/Helobdella MRCA to the leech H. robusta has experienced substantial gains and losses. However, the lineage leading from Eisenia/Helobdella MRCA to E. fetida is characterized by extraordinary levels of homeobox gain. The evolutionary dynamics observed in the homeoboxes of these lineages are very likely to be generalizable to all genes. These genome expansions and losses have likely contributed to the remarkable biology exhibited in this group. These results provide a new perspective from which to understand the diversity within these lineages, show the utility of sub-draft genome assemblies for understanding genomic evolution, and provide a critical resource from which the biology of these animals can be studied. PMID:26659921

  19. Timing and Scope of Genomic Expansion within Annelida: Evidence from Homeoboxes in the Genome of the Earthworm Eisenia fetida

    PubMed Central

    Zwarycz, Allison S.; Nossa, Carlos W.; Putnam, Nicholas H.; Ryan, Joseph F.

    2016-01-01

    Annelida represents a large and morphologically diverse group of bilaterian organisms. The recently published polychaete and leech genome sequences revealed an equally dynamic range of diversity at the genomic level. The availability of more annelid genomes will allow for the identification of evolutionary genomic events that helped shape the annelid lineage and better understand the diversity within the group. We sequenced and assembled the genome of the common earthworm, Eisenia fetida. As a first pass at understanding the diversity within the group, we classified 363 earthworm homeoboxes and compared them with those of the leech Helobdella robusta and the polychaete Capitella teleta. We inferred many gene expansions occurring in the lineage connecting the most recent common ancestor (MRCA) of Capitella and Eisenia to the Eisenia/Helobdella MRCA. Likewise, the lineage leading from the Eisenia/Helobdella MRCA to the leech H. robusta has experienced substantial gains and losses. However, the lineage leading from Eisenia/Helobdella MRCA to E. fetida is characterized by extraordinary levels of homeobox gain. The evolutionary dynamics observed in the homeoboxes of these lineages are very likely to be generalizable to all genes. These genome expansions and losses have likely contributed to the remarkable biology exhibited in this group. These results provide a new perspective from which to understand the diversity within these lineages, show the utility of sub-draft genome assemblies for understanding genomic evolution, and provide a critical resource from which the biology of these animals can be studied. PMID:26659921

  20. A Plant-Associated Microbe Genome Initiative

    SciTech Connect

    Jan E. Leach; Scott Gold; Sue Tolin; Kellye Eversole

    2003-03-06

    effort for genome analysis of plant-associated microbes. The goals are to (i) obtain genome sequence information for several representative groups of microbes; (ii) identify and determine function for the genes/proteins and other genomic elements involved in plant-microbe interactions; (iii) develop and implement standardized bioinformatic tools and a database system that is applicable across all microbes; and (iv) educate and train scientists with skills and knowledge of biological and computational sciences who will apply the information to the protection of our food sources and environment.

  1. A plant-associated microbe genome initiative.

    PubMed

    Leach, Jan E; Gold, Scott; Tolin, Sue; Eversole, Kellye

    2003-05-01

    international public effort for genome analysis of plant-associated microbes. The goals are to (i) obtain genome sequence information for several representative groups of microbes; (ii) identify and determine function for the genes/proteins and other genomic elements involved in plant-microbe interactions; (iii) develop and implement standardized bioinformatic tools and a database system that is applicable across all microbes; and (iv) educate and train scientists with skills and knowledge of biological and computational sciences who will apply the information to the protection of our food sources and environment. PMID:18942973

  2. Cardiovascular group

    NASA Technical Reports Server (NTRS)

    Blomqvist, Gunnar

    1989-01-01

    As a starting point, the group defined a primary goal of maintaining in flight a level of systemic oxygen transport capacity comparable to each individual's preflight upright baseline. The goal of maintaining capacity at preflight levels would seem to be a reasonable objective for several different reasons, including the maintenance of good health in general and the preservation of sufficient cardiovascular reserve capacity to meet operational demands. It is also important not to introduce confounding variables in whatever other physiological studies are being performed. A change in the level of fitness is likely to be a significant confounding variable in the study of many organ systems. The principal component of the in-flight cardiovascular exercise program should be large-muscle activity such as treadmill exercise. It is desirable that at least one session per week be monitored to assure maintenance of proper functional levels and to provide guidance for any adjustments of the exercise prescription. Appropriate measurements include evaluation of the heart-rate/workload or the heart-rate/oxygen-uptake relationship. Respiratory gas analysis is helpful by providing better opportunities to document relative workload levels from analysis of the interrelationships among VO2, VCO2, and ventilation. The committee felt that there is no clear evidence that any particular in-flight exercise regimen is protective against orthostatic hypotension during the early readaptation phase. Some group members suggested that maintenance of the lower body muscle mass and muscle tone may be helpful. There is also evidence that late in-flight interventions to reexpand blood volume to preflight levels are helpful in preventing or minimizing postflight orthostatic hypotension.

  3. EGN: a wizard for construction of gene and genome similarity networks

    PubMed Central

    2013-01-01

    Background Increasingly, similarity networks are being used for evolutionary analyses of molecular datasets. These networks are very useful, in particular for the analysis of gene sharing, lateral gene transfer and for the detection of distant homologs. Currently, such analyses require some computer programming skills due to the limited availability of user-friendly freely distributed software. Consequently, although appealing, the construction and analyses of these networks remain less familiar to biologists than do phylogenetic approaches. Results In order to ease the use of similarity networks in the community of evolutionary biologists, we introduce a software program, EGN, that runs under Linux or MacOSX. EGN automates the reconstruction of gene and genome networks from nucleic and proteic sequences. EGN also implements statistics describing genetic diversity in these samples, for various user-defined thresholds of similarities. In the interest of studying the complexity of evolutionary processes affecting microbial evolution, we applied EGN to a dataset of 571,044 proteic sequences from the three domains of life and from mobile elements. We observed that, in Borrelia, plasmids play a different role than in most other eubacteria. Rather than being genetic couriers involved in lateral gene transfer, Borrelia’s plasmids and their genes act as private genetic goods, that contribute to the creation of genetic diversity within their parasitic hosts. Conclusion EGN can be used for constructing, analyzing, and mining molecular datasets in evolutionary studies. The program can help increase our knowledge of the processes through which genes from distinct sources and/or from multiple genomes co-evolve in lineages of cellular organisms. PMID:23841456

  4. Genome comparison of Pseudomonas aeruginosa large phages.

    PubMed

    Hertveldt, Kirsten; Lavigne, Rob; Pleteneva, Elena; Sernova, Natalia; Kurochkina, Lidia; Korchevskii, Roman; Robben, Johan; Mesyanzhinov, Vadim; Krylov, Victor N; Volckaert, Guido

    2005-12-01

    Pseudomonas aeruginosa phage EL is a dsDNA phage related to the giant phiKZ-like Myoviridae. The EL genome sequence comprises 211,215 bp and has 201 predicted open reading frames (ORFs). The EL genome does not share DNA sequence homology with other viruses and micro-organisms sequenced to date. However, one-third of the predicted EL gene products (gps) shares similarity (Blast alignments of 17-55% amino acid identity) with phiKZ proteins. Comparative EL and phiKZ genomics reveals that these giant phages are an example of substantially diverged genetic mosaics. Based on the position of similar EL and phiKZ predicted gene products, five genome regions can be delineated in EL, four of which are relatively conserved between EL and phiKZ. Region IV, a 17.7 kb genome region with 28 predicted ORFs, is unique to EL. Fourteen EL ORFs have been assigned a putative function based on protein similarity. Assigned proteins are involved in DNA replication and nucleotide metabolism (NAD+-dependent DNA ligase, ribonuclease HI, helicase, thymidylate kinase), host lysis and particle structure. EL-gp146 is the first chaperonin GroEL sequence identified in a viral genome. Besides a putative transposase, EL harbours predicted mobile endonucleases related to H-N-H and LAGLIDADG homing endonucleases associated with group I intron and intein intervening sequences. PMID:16256135

  5. Endogenous Viral Elements in Animal Genomes

    PubMed Central

    Katzourakis, Aris; Gifford, Robert J.

    2010-01-01

    Integration into the nuclear genome of germ line cells can lead to vertical inheritance of retroviral genes as host alleles. For other viruses, germ line integration has only rarely been documented. Nonetheless, we identified endogenous viral elements (EVEs) derived from ten non-retroviral families by systematic in silico screening of animal genomes, including the first endogenous representatives of double-stranded RNA, reverse-transcribing DNA, and segmented RNA viruses, and the first endogenous DNA viruses in mammalian genomes. Phylogenetic and genomic analysis of EVEs across multiple host species revealed novel information about the origin and evolution of diverse virus groups. Furthermore, several of the elements identified here encode intact open reading frames or are expressed as mRNA. For one element in the primate lineage, we provide statistically robust evidence for exaptation. Our findings establish that genetic material derived from all known viral genome types and replication strategies can enter the animal germ line, greatly broadening the scope of paleovirological studies and indicating a more significant evolutionary role for gene flow from virus to animal genomes than has previously been recognized. PMID:21124940

  6. Genomic contributions in livestock gene introgression programmes

    PubMed Central

    Wall, Eileen; Visscher, Peter M; Hospital, Frédéric; Woolliams, John A

    2005-01-01

    The composition of the genome after introgression of a marker gene from a donor to a recipient breed was studied using analytical and simulation methods. Theoretical predictions of proportional genomic contributions, including donor linkage drag, from ancestors used at each generation of crossing after an introgression programme agreed closely with simulated results. The obligate drag, the donor genome surrounding the target locus that cannot be removed by subsequent selection, was also studied. It was shown that the number of backcross generations and the length of the chromosome affected proportional genomic contributions to the carrier chromosomes. Population structure had no significant effect on ancestral contributions and linkage drag but it did have an effect on the obligate drag whereby larger offspring groups resulted in smaller obligate drag. The implications for an introgression programme of the number of backcross generations, the population structure and the carrier chromosome length are discussed. The equations derived describing contributions to the genome from individuals from a given generation provide a framework to predict the genomic composition of a population after the introgression of a favourable donor allele. These ancestral contributions can be assigned a value and therefore allow the prediction of genetic lag. PMID:15823237

  7. Ebolavirus comparative genomics

    SciTech Connect

    Jun, Se-Ran; Leuze, Michael R.; Nookaew, Intawat; Uberbacher, Edward C.; Land, Miriam; Zhang, Qian; Wanchai, Visanu; Chai, Juanjuan; Nielsen, Morten; Trolle, Thomas; Lund, Ole; Buzard, Gregory S.; Pedersen, Thomas D.; Ussery, David W.

    2015-07-14

    The 2014 Ebola outbreak in West Africa is the largest documented for this virus. We examine the dynamics of this genome, comparing more than one hundred currently available ebolavirus genomes to each other and to other viral genomes. Based on oligomer frequency analysis, the family Filoviridae forms a distinct group from all other sequenced viral genomes. All filovirus genomes sequenced to date encode proteins with similar functions and gene order, although there is considerable divergence in sequences between the three genera Ebolavirus, Cuevavirus, and Marburgvirus within the family Filoviridae. Whereas all ebolavirus genomes are quite similar (multiple sequences of the same strain are often identical), variation is most common in the intergenic regions and within specific areas of the genes encoding the glycoprotein (GP), nucleoprotein (NP), and polymerase (L). We predict regions that could contain epitope-binding sites, which might be good vaccine targets. In conclusion, this information, combined with glycosylation sites and experimentally determined epitopes, can identify the most promising regions for the development of therapeutic strategies.

  8. Life History Evolution and Genome Size in Subtribe Oncidiinae (Orchidaceae)

    PubMed Central

    CHASE, MARK W.; HANSON, LYNDA; ALBERT, VICTOR A.; WHITTEN, W. MARK; WILLIAMS, NORRIS H.

    2005-01-01

    • Background and Aims Within Oncidiinae, there are several groups of species that are effectively annuals, and we wished to see if these species had smaller genome sizes than average for the subtribe. • Methods Fifty-four genome size estimates (50 of which are new) for species in subtribe Oncidiinae (Orchidaceae) were examined for the first time in a phylogenetic context to evaluate hypotheses concerning genome sizes and life history traits. • Results and Conclusions Within the limits of still relatively sparse sampling, the species that are effectively annuals do appear to have smaller genome sizes than average. However, the genome sizes of their immediate sister group are also small, indicating that changes in genome size preceded the change in life history traits. Genome sizes and chromosome numbers also do not correlate; some slowly growing species have lower chromosome numbers but large genomes and vice versa. Based on a survey of the literature on orchids, it is also clear that epiphytic species have smaller genome sizes than do terrestrial species, which could be an effect of different water relations or the fact that most terrestrial orchids are geophytic or have distinct growth and dormancy phases. PMID:15596466

  9. Genomics of Disease

    Technology Transfer Automated Retrieval System (TEKTRAN)

    This edited book represents the 23rd symposium in the Stadler Genetics Symposia series, and the general theme of this conference was "The Genomics of Disease." The 24 national and international speakers were invited to discuss their world-class research into the advances that genomics has made on c...

  10. Genomics for Weed Science

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Numerous genomic-based studies have provided insight to the physiological and evolutionary processes involved in developmental and environmental processes of model plants such as arabidopsis and rice. However, far fewer efforts have been attempted to use genomic resources to study physiological and ...

  11. Unlocking the bovine genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The draft genome sequence of cattle (Bos taurus) has now been analyzed by the Bovine Genome Sequencing and Analysis Consortium and the Bovine HapMap Consortium, which together represent an extensive collaboration involving more than 300 scientists from 25 different countries. ...

  12. Genetics and Genomics

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Good progress is being made on genetics and genomics of sugar beet, however it is in process and the tools are now being generated and some results are being analyzed. The GABI BeetSeq project released a first draft of the sugar beet genome of KWS2320, a dihaploid (see http://bvseq.molgen.mpg.de/Gen...

  13. Development of Genomic GMACE

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The use of genomics to enhance national genetic evaluation systems of dairy cattle is quickly becoming standard practice. The current MACE procedure used by Interbull may not accommodate these new “genomically-enhanced” national evaluations. An important assumption in MACE may no longer be valid in ...

  14. GENOME OF HORSEPOX VIRUS

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Here we present the genomic sequence of horsepox virus (HSPV) isolate MNR-76, an orthopoxvirus (OPV) isolated in 1976 from diseased Mongolian horses. The 212 kbp genome contained 7.5 kbp inverted terminal repeats (ITR) and lacked extensive terminal tandem repetition. HSPV contained 236 ORFs with sim...

  15. Sequencing of Porcine Enterovirus Groups II and III Reveals Unique Features of Both Virus Groups

    PubMed Central

    Krumbholz, Andi; Dauber, Malte; Henke, Andreas; Birch-Hirschfeld, Eckhard; Knowles, Nick J.; Stelzner, Axel; Zell, Roland

    2002-01-01

    The molecular classification of the porcine enterovirus (PEV) groups II and III was investigated. The sequence of the almost complete PEV-8 (group II) genome reveals that this virus has unique L and 2A gene regions. A reclassification of this group into a new picornavirus genus is suggested. PEV group III viruses are typical enteroviruses. They differ from other enteroviruses by a prolonged stem-loop D of the 5′-cloverleaf structure. PMID:11992011

  16. Genomic insights into the emerging human pathogen Mycobacterium massiliense.

    PubMed

    Tettelin, Hervé; Sampaio, Elizabeth P; Daugherty, Sean C; Hine, Erin; Riley, David R; Sadzewicz, Lisa; Sengamalay, Naomi; Shefchek, Kent; Su, Qi; Tallon, Luke J; Conville, Patricia; Olivier, Kenneth N; Holland, Steven M; Fraser, Claire M; Zelazny, Adrian M

    2012-10-01

    Mycobacterium massiliense (Mycobacterium abscessus group) is an emerging pathogen causing pulmonary disease and skin and soft tissue infections. We report the genome sequence of the type strain CCUG 48898. PMID:22965080

  17. Genome Sequence of the Immunomodulatory Strain Bifidobacterium bifidum LMG 13195

    PubMed Central

    Gueimonde, Miguel; Ventura, Marco; Margolles, Abelardo

    2012-01-01

    In this work, we report the genome sequences of Bifidobacterium bifidum strain LMG13195. Results from our research group show that this strain is able to interact with human immune cells, generating functional regulatory T cells. PMID:23209243

  18. INTEGRATING GENOMICS AND PHYLOGENETICS IN UNDERSTANDING THE HISTORY OF TRICHINELLA SPECIES

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In 2004, funding was received by Washington University’s Genome Sequencing Center through NHGRI, to completely sequence several nematode genomes as part of a holistic effort to advance our understanding of the human genome and evolution within the Metazoa. Trichinella spiralis was among this group o...

  19. BeetleBase in 2010: Revisions to Provide Comprehensive Genomic Information for Tribolium castaneum

    Technology Transfer Automated Retrieval System (TEKTRAN)

    BeetleBase (http://www.beetlebase.org) has been updated to provide more comprehensive genomic information for the red flour beetle Tribolium castaneum. The database contains genomic sequence scaffolds mapped to 10 linkage groups (genome assembly release Tcas_3.0), genetic linkage maps, the official ...

  20. The amphioxus genome and the evolution of the chordate karyotype

    SciTech Connect

    Putnam, Nicholas H.; Butts, Thomas; Ferrier, David E.K.; Furlong, Rebecca F.; Hellsten, Uffe; Kawashima, Takeshi; Robinson-Rechavi, Marc; Shoguchi, Eiichi; Terry, Astrid; Yu, Jr-Kai; Benito-Gutierrez, Elia; Dubchak, Inna; Garcia-Fernandez, Jordi; Gibson-Brown, Jeremy J.; Grigoriev, Igor V.; Horton, Amy C.; de Jong, Pieter J.; Jurka, Jerzy; Kapitonov, Vladimir; Kohara, Yuji; Kuroki, Yoko; Lindquist, Erika; Lucas, Susan; Osoegawa, Kazutoyo; Pennacchio, Len A.; Salamov, Asaf A.; Satou, Yutaka; Sauka-Spengler, Tatjana; Schmutz [, Jeremy; Shin-I, Tadasu; Toyoda, Atsushi; Bronner-Fraser, Marianne; Fujiyama, Asao; Holland, Linda Z.; Holland, Peter W. H.; Satoh, Nori; Rokhsar, Daniel S.

    2008-04-01

    Lancelets ('amphioxus') are the modern survivors of an ancient chordate lineage with a fossil record dating back to the Cambrian. We describe the structure and gene content of the highly polymorphic {approx}520 million base pair genome of the Florida lancelet Branchiostoma floridae, and analyze it in the context of chordate evolution. Whole genome comparisons illuminate the murky relationships among the three chordate groups (tunicates, lancelets, and vertebrates), and allow reconstruction of not only the gene complement of the last common chordate ancestor, but also a partial reconstruction of its genomic organization, as well as a description of two genome-wide duplications and subsequent reorganizations in the vertebrate lineage. These genome-scale events shaped the vertebrate genome and provided additional genetic variation for exploitation during vertebrate evolution.

  1. Genomic Instability and Cancer

    PubMed Central

    Yao, Yixin; Dai, Wei

    2014-01-01

    Genomic instability is a characteristic of most cancer cells. It is an increased tendency of genome alteration during cell division. Cancer frequently results from damage to multiple genes controlling cell division and tumor suppressors. It is known that genomic integrity is closely monitored by several surveillance mechanisms, DNA damage checkpoint, DNA repair machinery and mitotic checkpoint. A defect in the regulation of any of these mechanisms often results in genomic instability, which predisposes the cell to malignant transformation. Posttranslational modifications of the histone tails are closely associated with regulation of the cell cycle as well as chromatin structure. Nevertheless, DNA methylation status is also related to genomic integrity. We attempt to summarize recent developments in this field and discuss the debate of driving force of tumor initiation and progression. PMID:25541596

  2. Microbial Genomes Multiply

    NASA Technical Reports Server (NTRS)

    Doolittle, Russell F.

    2002-01-01

    The publication of the first complete sequence of a bacterial genome in 1995 was a signal event, underscored by the fact that the article has been cited more than 2,100 times during the intervening seven years. It was a marvelous technical achievement, made possible by automatic DNA-sequencing machines. The feat is the more impressive in that complete genome sequencing has now been adopted in many different laboratories around the world. Four years ago in these columns I examined the situation after a dozen microbial genomes had been completed. Now, with upwards of 60 microbial genome sequences determined and twice that many in progress, it seems reasonable to assess just what is being learned. Are new concepts emerging about how cells work? Have there been practical benefits in the fields of medicine and agriculture? Is it feasible to determine the genomic sequence of every bacterial species on Earth? The answers to these questions maybe Yes, Perhaps, and No, respectively.

  3. Comparative genomic analysis of hyperthermophilic archaeal fuselloviridae viruses

    SciTech Connect

    B. Wiedenheft; K. Stedman; F. Roberto; D. Willits; A. K. Gleske; L. Zoeller; J. Snyder; T. Douglas; M. Young

    2004-02-01

    The complete genome sequences of two Sulfolobus spindle-shaped viruses (SSVs) from acidic hot springs in Kamchatka (Russia) and Yellowstone National Park (United States) have been determined. These nonlytic temperate viruses were isolated from hyperthermophilic Sulfolobus hosts, and both viruses share the spindleshaped morphology characteristic of the Fuselloviridae family. These two genomes, in combination with the previously determined SSV1 genome from Japan and the SSV2 genome from Iceland, have allowed us to carry out a phylogenetic comparison of these geographically distributed hyperthermal viruses. Each virus contains a circular double-stranded DNA genome of _15 kbp with approximately 34 open reading frames (ORFs). These Fusellovirus ORFs show little or no similarity to genes in the public databases. In contrast, 18 ORFs are common to all four isolates and may represent the minimal gene set defining this viral group. In general, ORFs on one half of the genome are colinear and highly conserved, while ORFs on the other half are not. One shared ORF among all four genomes is an integrase of the tyrosine recombinase family. All four viral genomes integrate into their host tRNA genes. The specific tRNA gene used for integration varies, and one genome integrates into multiple loci. Several unique ORFs are found in the genome of each isolate.

  4. Barcode Server: A Visualization-Based Genome Analysis System

    PubMed Central

    Mao, Fenglou; Olman, Victor; Wang, Yan; Xu, Ying

    2013-01-01

    We have previously developed a computational method for representing a genome as a barcode image, which makes various genomic features visually apparent. We have demonstrated that this visual capability has made some challenging genome analysis problems relatively easy to solve. We have applied this capability to a number of challenging problems, including (a) identification of horizontally transferred genes, (b) identification of genomic islands with special properties and (c) binning of metagenomic sequences, and achieved highly encouraging results. These application results inspired us to develop this barcode-based genome analysis server for public service, which supports the following capabilities: (a) calculation of the k-mer based barcode image for a provided DNA sequence; (b) detection of sequence fragments in a given genome with distinct barcodes from those of the majority of the genome, (c) clustering of provided DNA sequences into groups having similar barcodes; and (d) homology-based search using Blast against a genome database for any selected genomic regions deemed to have interesting barcodes. The barcode server provides a job management capability, allowing processing of a large number of analysis jobs for barcode-based comparative genome analyses. The barcode server is accessible at http://csbl1.bmb.uga.edu/Barcode. PMID:23457606

  5. Barcode server: a visualization-based genome analysis system.

    PubMed

    Mao, Fenglou; Olman, Victor; Wang, Yan; Xu, Ying

    2013-01-01

    We have previously developed a computational method for representing a genome as a barcode image, which makes various genomic features visually apparent. We have demonstrated that this visual capability has made some challenging genome analysis problems relatively easy to solve. We have applied this capability to a number of challenging problems, including (a) identification of horizontally transferred genes, (b) identification of genomic islands with special properties and (c) binning of metagenomic sequences, and achieved highly encouraging results. These application results inspired us to develop this barcode-based genome analysis server for public service, which supports the following capabilities: (a) calculation of the k-mer based barcode image for a provided DNA sequence; (b) detection of sequence fragments in a given genome with distinct barcodes from those of the majority of the genome, (c) clustering of provided DNA sequences into groups having similar barcodes; and (d) homology-based search using Blast against a genome database for any selected genomic regions deemed to have interesting barcodes. The barcode server provides a job management capability, allowing processing of a large number of analysis jobs for barcode-based comparative genome analyses. The barcode server is accessible at http://csbl1.bmb.uga.edu/Barcode. PMID:23457606

  6. Genomics education for medical professionals - the current UK landscape.

    PubMed

    Slade, Ingrid; Subramanian, Deepak N; Burton, Hilary

    2016-08-01

    Genomics education in the UK is at an early stage of development, and its pace of evolution has lagged behind that of the genomics research upon which it is based. As a result, knowledge of genomics and its applications remains limited among non-specialist clinicians. In this review article, we describe the complex landscape for genomics education within the UK, and highlight the large number and variety of organisations that can influence, direct and provide genomics training to medical professionals. Postgraduate genomics education is being shaped by the work of the Health Education England (HEE) Genomics Education Programme, working in conjunction with the Joint Committee on Genomics in Medicine. The success of their work will be greatly enhanced by the full cooperation and engagement of the many groups, societies and organisations involved with medical education and training (such as the royal colleges). Without this cooperation, there is a risk of poor coordination and unnecessary duplication of work. Leadership from an organisation such as the HEE Genomics Education Programme will have a key role in guiding the formulation and delivery of genomics education policy by various stakeholders among the different disciplines in medicine. PMID:27481379

  7. Comparative genomic analysis of hyperthermophilic archaeal Fuselloviridae viruses.

    PubMed

    Wiedenheft, Blake; Stedman, Kenneth; Roberto, Francisco; Willits, Deborah; Gleske, Anne-Kathrin; Zoeller, Luisa; Snyder, Jamie; Douglas, Trevor; Young, Mark

    2004-02-01

    The complete genome sequences of two Sulfolobus spindle-shaped viruses (SSVs) from acidic hot springs in Kamchatka (Russia) and Yellowstone National Park (United States) have been determined. These nonlytic temperate viruses were isolated from hyperthermophilic Sulfolobus hosts, and both viruses share the spindle-shaped morphology characteristic of the Fuselloviridae family. These two genomes, in combination with the previously determined SSV1 genome from Japan and the SSV2 genome from Iceland, have allowed us to carry out a phylogenetic comparison of these geographically distributed hyperthermal viruses. Each virus contains a circular double-stranded DNA genome of approximately 15 kbp with approximately 34 open reading frames (ORFs). These Fusellovirus ORFs show little or no similarity to genes in the public databases. In contrast, 18 ORFs are common to all four isolates and may represent the minimal gene set defining this viral group. In general, ORFs on one half of the genome are colinear and highly conserved, while ORFs on the other half are not. One shared ORF among all four genomes is an integrase of the tyrosine recombinase family. All four viral genomes integrate into their host tRNA genes. The specific tRNA gene used for integration varies, and one genome integrates into multiple loci. Several unique ORFs are found in the genome of each isolate. PMID:14747560

  8. Hawaiian Drosophila genomes: size variation and evolutionary expansions.

    PubMed

    Craddock, Elysse M; Gall, Joseph G; Jonas, Mark

    2016-02-01

    This paper reports genome sizes of one Hawaiian Scaptomyza and 16 endemic Hawaiian Drosophila species that include five members of the antopocerus species group, one member of the modified mouthpart group, and ten members of the picture wing clade. Genome size expansions have occurred independently multiple times among Hawaiian Drosophila lineages, and have resulted in an over 2.3-fold range of genome sizes among species, with the largest observed in Drosophila cyrtoloma (1C = 0.41 pg). We find evidence that these repeated genome size expansions were likely driven by the addition of significant amounts of heterochromatin and satellite DNA. For example, our data reveal that the addition of seven heterochromatic chromosome arms to the ancestral haploid karyotype, and a remarkable proportion of ~70 % satellite DNA, account for the greatly expanded size of the D. cyrtoloma genome. Moreover, the genomes of 13/17 Hawaiian picture wing species are composed of substantial proportions (22-70 %) of detectable satellites (all but one of which are AT-rich). Our results suggest that in this tightly knit group of recently evolved species, genomes have expanded, in large part, via evolutionary amplifications of satellite DNA sequences in centric and pericentric domains (especially of the X and dot chromosomes), which have resulted in longer acrocentric chromosomes or metacentrics with an added heterochromatic chromosome arm. We discuss possible evolutionary mechanisms that may have shaped these patterns, including rapid fixation of novel expanded genomes during founder-effect speciation. PMID:26790663

  9. Supergenes: The Genomic Architecture of a Bird with Four Sexes.

    PubMed

    Campagna, Leonardo

    2016-02-01

    Supergenes are clusters of physically linked, co-evolving genes that often control complex traits. A new study clarifies the origin and possible fate of a fascinating supergene that determines the coloration and mating behavior of a widespread North American bird. PMID:26859263

  10. Phytozome Comparative Plant Genomics Portal

    SciTech Connect

    Goodstein, David; Batra, Sajeev; Carlson, Joseph; Hayes, Richard; Phillips, Jeremy; Shu, Shengqiang; Schmutz, Jeremy; Rokhsar, Daniel

    2014-09-09

    The Dept. of Energy Joint Genome Institute is a genomics user facility supporting DOE mission science in the areas of Bioenergy, Carbon Cycling, and Biogeochemistry. The Plant Program at the JGI applies genomic, analytical, computational and informatics platforms and methods to: 1. Understand and accelerate the improvement (domestication) of bioenergy crops 2. Characterize and moderate plant response to climate change 3. Use comparative genomics to identify constrained elements and infer gene function 4. Build high quality genomic resource platforms of JGI Plant Flagship genomes for functional and experimental work 5. Expand functional genomic resources for Plant Flagship genomes

  11. Group evaporation

    NASA Technical Reports Server (NTRS)

    Shen, Hayley H.

    1991-01-01

    Liquid fuel combustion process is greatly affected by the rate of droplet evaporation. The heat and mass exchanges between gas and liquid couple the dynamics of both phases in all aspects: mass, momentum, and energy. Correct prediction of the evaporation rate is therefore a key issue in engineering design of liquid combustion devices. Current analytical tools for characterizing the behavior of these devices are based on results from a single isolated droplet. Numerous experimental studies have challenged the applicability of these results in a dense spray. To account for the droplets' interaction in a dense spray, a number of theories have been developed in the past decade. Herein, two tasks are examined. One was to study how to implement the existing theoretical results, and the other was to explore the possibility of experimental verifications. The current theoretical results of group evaporation are given for a monodispersed cluster subject to adiabatic conditions. The time evolution of the fluid mechanic and thermodynamic behavior in this cluster is derived. The results given are not in the form of a subscale model for CFD codes.

  12. The Genome Portal of the Department of Energy Joint Genome Institute

    SciTech Connect

    Nordberg, Henrik; Cantor, Michael; Dushekyo, Serge; Hua, Susan; Poliakov, Alexander; Smirnova, Tatyana; Dubchak, Inna

    2014-03-14

    The JGI Genome Portal (http://genome.jgi.doe.gov) provides unified access to all JGI genomic databases and analytical tools. A user can search, download and explore multiple data sets available for all DOE JGI sequencing projects including their status, assemblies and annotations of sequenced genomes. Genome Portal in the past 2 years was significantly updated, with a specific emphasis on efficient handling of the rapidly growing amount of diverse genomic data accumulated in JGI. A critical aspect of handling big data in genomics is the development of visualization and analysis tools that allow scientists to derive meaning from what are otherwise terrabases of inert sequence. An interactive visualization tool developed in the group allows us to explore contigs resulting from a single metagenome assembly. Implemented with modern web technologies that take advantage of the power of the computer's graphical processing unit (gpu), the tool allows the user to easily navigate over a 100,000 data points in multiple dimensions, among many biologically meaningful parameters of a dataset such as relative abundance, contig length, and G+C content.

  13. MorusDB: a resource for mulberry genomics and genome biology

    PubMed Central

    Li, Tian; Qi, Xiwu; Zeng, Qiwei; Xiang, Zhonghuai; He, Ningjia

    2014-01-01

    Mulberry is an important cultivated plant that has received the attention of biologists interested in sericulture and plant–insect interaction. Morus notabilis, a wild mulberry species with a minimal chromosome number is an ideal material for whole-genome sequencing and assembly. The genome and transcriptome of M. notabilis were sequenced and analyzed. In this article, a web-based and open-access database, the Morus Genome Database (MorusDB), was developed to enable easy-to-access and data mining. The MorusDB provides an integrated data source and an easy accession of mulberry large-scale genomic sequencing and assembly, predicted genes and functional annotations, expressed sequence tags (ESTs), transposable elements (TEs), Gene Ontology (GO) terms, horizontal gene transfers between mulberry and silkworm and ortholog and paralog groups. Transcriptome sequencing data for M. notabilis root, leaf, bark, winter bud and male flower can also be searched and downloaded. Furthermore, MorusDB provides an analytical workbench with some built-in tools and pipelines, such as BLAST, Search GO, Mulberry GO and Mulberry GBrowse, to facilitate genomic studies and comparative genomics. The MorusDB provides important genomic resources for scientists working with mulberry and other Moraceae species, which include many important fruit crops. Designed as a basic platform and accompanied by the SilkDB, MorusDB strives to be a comprehensive platform for the silkworm–mulberry interaction studies. Database URL: http://morus.swu.edu.cn/morusdb. PMID:24923822

  14. Comparative Genome Mapping of Sorghum and Maize

    PubMed Central

    Whitkus, R.; Doebley, J.; Lee, M.

    1992-01-01

    Linkage relationships were determined among 85 maize low copy number nuclear DNA probes and seven isozyme loci in an F(2) population derived from a cross of Sorghum bicolor ssp. bicolor X S. bicolor ssp. arundinaceum. Thirteen linkage groups were defined, three more than the 10 chromosomes of sorghum. Use of maize DNA probes to produce the sorghum linkage map allowed us to make several inferences concerning processes involved in the evolutionary divergence of the maize and sorghum genomes. The results show that many linkage groups are conserved between these two genomes and that the amount of recombination in these conserved linkage groups is roughly equivalent in maize and sorghum. Estimates of the proportions of duplicated loci suggest that a larger proportion of the loci are duplicated in the maize genome than in the sorghum genome. This result concurs with a prior estimate that the nuclear DNA content of maize is three to four times greater than that of sorghum. The pattern of conserved linkages between maize and sorghum is such that most sorghum linkage groups are composed of loci that map to two maize chromosomes. This pattern is consistent with the hypothesized ancient polyploid origin of maize and sorghum. There are nine cases in which locus order within shared linkage groups is inverted in sorghum relative to maize. These may have arisen from either inversions or intrachromosomal translocations. We found no evidence for large interchromosomal translocations. Overall, the data suggest that the primary processes involved in divergence of the maize and sorghum genomes were duplications (either by polyploidy or segmental duplication) and inversions or intrachromosomal translocations. PMID:1360933

  15. Reassessment of the Genome Size in Elaeis guineensis and Elaeis oleifera, and Its Interspecific Hybrid

    PubMed Central

    Camillo, Julceia; Leão, André P; Alves, Alexandre A; Formighieri, Eduardo F; Azevedo, Ana LS; Nunes, Juliana D; de Capdeville, Guy; de A Mattos, Jean K; Souza, Manoel T

    2014-01-01

    Aiming at generating a comprehensive genomic database on Elaeis spp., our group is leading several R&D initiatives with Elaeis guineensis (African oil palm) and Elaeis oleifera (American oil palm), including the whole-genome sequencing of the last. Genome size estimates currently available for this genus are controversial, as they indicate that American oil palm genome is about half the size of the African oil palm genome and that the genome of the interspecific hybrid is bigger than both the parental species genomes. We estimated the genome size of three E. guineensis genotypes, five E. oleifera genotypes, and two interspecific hybrids genotypes. On average, the genome size of E. guineensis is 4.32 ± 0.173 pg, while that of E. oleifera is 4.43 ± 0.018 pg. This indicates that both genomes are similar in size, even though E. oleifera is in fact bigger. As expected, the hybrid genome size is around the average of the two genomes, 4.40 ± 0.016 pg. Additionally, we demonstrate that both species present around 38% of GC content. As our results contradict the currently available data on Elaeis spp. genome sizes, we propose that the actual genome size of the Elaeis species is around 4 pg and that American oil palm possesses a larger genome than African oil palm. PMID:26203259

  16. Reassessment of the Genome Size in Elaeis guineensis and Elaeis oleifera, and Its Interspecific Hybrid.

    PubMed

    Camillo, Julceia; Leão, André P; Alves, Alexandre A; Formighieri, Eduardo F; Azevedo, Ana Ls; Nunes, Juliana D; de Capdeville, Guy; de A Mattos, Jean K; Souza, Manoel T

    2014-01-01

    Aiming at generating a comprehensive genomic database on Elaeis spp., our group is leading several R&D initiatives with Elaeis guineensis (African oil palm) and Elaeis oleifera (American oil palm), including the whole-genome sequencing of the last. Genome size estimates currently available for this genus are controversial, as they indicate that American oil palm genome is about half the size of the African oil palm genome and that the genome of the interspecific hybrid is bigger than both the parental species genomes. We estimated the genome size of three E. guineensis genotypes, five E. oleifera genotypes, and two interspecific hybrids genotypes. On average, the genome size of E. guineensis is 4.32 ± 0.173 pg, while that of E. oleifera is 4.43 ± 0.018 pg. This indicates that both genomes are similar in size, even though E. oleifera is in fact bigger. As expected, the hybrid genome size is around the average of the two genomes, 4.40 ± 0.016 pg. Additionally, we demonstrate that both species present around 38% of GC content. As our results contradict the currently available data on Elaeis spp. genome sizes, we propose that the actual genome size of the Elaeis species is around 4 pg and that American oil palm possesses a larger genome than African oil palm. PMID:26203259

  17. Genome size evolution: sizing mammalian genomes.

    PubMed

    Redi, C A; Capanna, E

    2012-01-01

    The study of genome size (GS) and its variation is so fascinating to the scientific community because it constitutes the link between the present-day analytical and molecular studies of the genome and the old trunk of the holistic and synthetic view of the genome. The GS of several taxa vary over a broad range and do not correlate with the complexity of the organisms (the C-value paradox). However, the biology of transposable elements has let us reach a satisfactory view of the molecular mechanisms that give rise to GS variation and novelties, providing a less perplexing view of the significance of the GS (C-enigma). The knowledge of the composition and structure of a genome is a pre-requisite for trying to understand the evolution of the main genome signature: its size. The radiation of mammals provides an approximately 180-million-year test case for theories of how GS evolves. It has been found from data-mining GS databases that GS is a useful cyto-taxonomical instrument at the level of orders/superorders, providing genomic signatures characterizing Monotremata, Marsupialia, Afrotheria, Xenarthra, Laurasiatheria, and Euarchontoglires. A hypothetical ancestral mammalian-like GS of 2.9-3.7 pg has been suggested. This value appears compatible with the average values calculated for the high systematic levels of the extant Monotremata (∼2.97 pg) and Marsupialia (∼4.07 pg), suggesting invasion of mobile DNA elements concurrently with the separation of the older clades of Afrotheria (∼5.5 pg) and Xenarthra (∼4.5 pg) with larger GS, leaving the Euarchontoglires (∼3.4 pg) and Laurasiatheria (∼2.8 pg) genomes with fewer transposable elements. However, the paucity of GS data (546 mammalian species sized from 5,488 living species) for species, genera, and families calls for caution. Considering that mammalian species may be vanished even before they are known, GS data are sorely needed to phenotype the effects brought about by their variation and to validate any

  18. Evolution of genome architecture.

    PubMed

    Koonin, Eugene V

    2009-02-01

    Charles Darwin believed that all traits of organisms have been honed to near perfection by natural selection. The empirical basis underlying Darwin's conclusions consisted of numerous observations made by him and other naturalists on the exquisite adaptations of animals and plants to their natural habitats and on the impressive results of artificial selection. Darwin fully appreciated the importance of heredity but was unaware of the nature and, in fact, the very existence of genomes. A century and a half after the publication of the "Origin", we have the opportunity to draw conclusions from the comparisons of hundreds of genome sequences from all walks of life. These comparisons suggest that the dominant mode of genome evolution is quite different from that of the phenotypic evolution. The genomes of vertebrates, those purported paragons of biological perfection, turned out to be veritable junkyards of selfish genetic elements where only a small fraction of the genetic material is dedicated to encoding biologically relevant information. In sharp contrast, genomes of microbes and viruses are incomparably more compact, with most of the genetic material assigned to distinct biological functions. However, even in these genomes, the specific genome organization (gene order) is poorly conserved. The results of comparative genomics lead to the conclusion that the genome architecture is not a straightforward result of continuous adaptation but rather is determined by the balance between the selection pressure, that is itself dependent on the effective population size and mutation rate, the level of recombination, and the activity of selfish elements. Although genes and, in many cases, multigene regions of genomes possess elaborate architectures that ensure regulation of expression, these arrangements are evolutionarily volatile and typically change substantially even on short evolutionary scales when gene sequences diverge minimally. Thus, the observed genome

  19. Detecting uber-operons in prokaryotic genomes

    PubMed Central

    Che, Dongsheng; Li, Guojun; Mao, Fenglou; Wu, Hongwei; Xu, Ying

    2006-01-01

    We present a study on computational identification of uber-operons in a prokaryotic genome, each of which represents a group of operons that are evolutionarily or functionally associated through operons in other (reference) genomes. Uber-operons represent a rich set of footprints of operon evolution, whose full utilization could lead to new and more powerful tools for elucidation of biological pathways and networks than what operons have provided, and a better understanding of prokaryotic genome structures and evolution. Our prediction algorithm predicts uber-operons through identifying groups of functionally or transcriptionally related operons, whose gene sets are conserved across the target and multiple reference genomes. Using this algorithm, we have predicted uber-operons for each of a group of 91 genomes, using the other 90 genomes as references. In particular, we predicted 158 uber-operons in Escherichia coli K12 covering 1830 genes, and found that many of the uber-operons correspond to parts of known regulons or biological pathways or are involved in highly related biological processes based on their Gene Ontology (GO) assignments. For some of the predicted uber-operons that are not parts of known regulons or pathways, our analyses indicate that their genes are highly likely to work together in the same biological processes, suggesting the possibility of new regulons and pathways. We believe that our uber-operon prediction provides a highly useful capability and a rich information source for elucidation of complex biological processes, such as pathways in microbes. All the prediction results are available at our Uber-Operon Database: , the first of its kind. PMID:16682449

  20. A Distinct Group II Alphabaculovirus Isolated from a Peridroma Species.

    PubMed

    Rohrmann, George F; Erlandson, Martin A; Theilmann, David A

    2015-01-01

    The genome sequence of an alphabaculovirus isolated from a Peridroma species indicated that it is a novel member of a group II lineage most closely related to alphabaculoviruses from Spodoptera exigua and Agrotis segetum. It contains a genome of 151,110 nucleotides (nt), with a G+C content of 53.3%. PMID:25838477

  1. Lactobacillus paracasei comparative genomics: towards species pan-genome definition and exploitation of diversity.

    PubMed

    Smokvina, Tamara; Wels, Michiel; Polka, Justyna; Chervaux, Christian; Brisse, Sylvain; Boekhorst, Jos; van Hylckama Vlieg, Johan E T; Siezen, Roland J

    2013-01-01

    Lactobacillus paracasei is a member of the normal human and animal gut microbiota and is used extensively in the food industry in starter cultures for dairy products or as probiotics. With the development of low-cost, high-throughput sequencing techniques it has become feasible to sequence many different strains of one species and to determine its "pan-genome". We have sequenced the genomes of 34 different L. paracasei strains, and performed a comparative genomics analysis. We analysed genome synteny and content, focussing on the pan-genome, core genome and variable genome. Each genome was shown to contain around 2800-3100 protein-coding genes, and comparative analysis identified over 4200 ortholog groups that comprise the pan-genome of this species, of which about 1800 ortholog groups make up the conserved core. Several factors previously associated with host-microbe interactions such as pili, cell-envelope proteinase, hydrolases p40 and p75 or the capacity to produce short branched-chain fatty acids (bkd operon) are part of the L. paracasei core genome present in all analysed strains. The variome consists mainly of hypothetical proteins, phages, plasmids, transposon/conjugative elements, and known functions such as sugar metabolism, cell-surface proteins, transporters, CRISPR-associated proteins, and EPS biosynthesis proteins. An enormous variety and variability of sugar utilization gene cassettes were identified, with each strain harbouring between 25-53 cassettes, reflecting the high adaptability of L. paracasei to different niches. A phylogenomic tree was constructed based on total genome contents, and together with an analysis of horizontal gene transfer events we conclude that evolution of these L. paracasei strains is complex and not always related to niche adaptation. The results of this genome content comparison was used, together with high-throughput growth experiments on various carbohydrates, to perform gene-trait matching analysis, in order to link

  2. Evolution of Genome Size and Complexity in the Rhabdoviridae

    PubMed Central

    Walker, Peter J.; Firth, Cadhla; Widen, Steven G.; Blasdell, Kim R.; Guzman, Hilda; Wood, Thomas G.; Paradkar, Prasad N.; Holmes, Edward C.; Tesh, Robert B.; Vasilakis, Nikos

    2015-01-01

    RNA viruses exhibit substantial structural, ecological and genomic diversity. However, genome size in RNA viruses is likely limited by a high mutation rate, resulting in the evolution of various mechanisms to increase complexity while minimising genome expansion. Here we conduct a large-scale analysis of the genome sequences of 99 animal rhabdoviruses, including 45 genomes which we determined de novo, to identify patterns of genome expansion and the evolution of genome complexity. All but seven of the rhabdoviruses clustered into 17 well-supported monophyletic groups, of which eight corresponded to established genera, seven were assigned as new genera, and two were taxonomically ambiguous. We show that the acquisition and loss of new genes appears to have been a central theme of rhabdovirus evolution, and has been associated with the appearance of alternative, overlapping and consecutive ORFs within the major structural protein genes, and the insertion and loss of additional ORFs in each gene junction in a clade-specific manner. Changes in the lengths of gene junctions accounted for as much as 48.5% of the variation in genome size from the smallest to the largest genome, and the frequency with which new ORFs were observed increased in the 3’ to 5’ direction along the genome. We also identify several new families of accessory genes encoded in these regions, and show that non-canonical expression strategies involving TURBS-like termination-reinitiation, ribosomal frame-shifts and leaky ribosomal scanning appear to be common. We conclude that rhabdoviruses have an unusual capacity for genomic plasticity that may be linked to their discontinuous transcription strategy from the negative-sense single-stranded RNA genome, and propose a model that accounts for the regular occurrence of genome expansion and contraction throughout the evolution of the Rhabdoviridae. PMID:25679389

  3. The Banana Genome Hub

    PubMed Central

    Droc, Gaëtan; Larivière, Delphine; Guignon, Valentin; Yahiaoui, Nabila; This, Dominique; Garsmeur, Olivier; Dereeper, Alexis; Hamelin, Chantal; Argout, Xavier; Dufayard, Jean-François; Lengelle, Juliette; Baurens, Franc-Christophe; Cenci, Alberto; Pitollat, Bertrand; D’Hont, Angélique; Ruiz, Manuel; Rouard, Mathieu; Bocs, Stéphanie

    2013-01-01

    Banana is one of the world’s favorite fruits and one of the most important crops for developing countries. The banana reference genome sequence (Musa acuminata) was recently released. Given the taxonomic position of Musa, the completed genomic sequence has particular comparative value to provide fresh insights about the evolution of the monocotyledons. The study of the banana genome has been enhanced by a number of tools and resources that allows harnessing its sequence. First, we set up essential tools such as a Community Annotation System, phylogenomics resources and metabolic pathways. Then, to support post-genomic efforts, we improved banana existing systems (e.g. web front end, query builder), we integrated available Musa data into generic systems (e.g. markers and genetic maps, synteny blocks), we have made interoperable with the banana hub, other existing systems containing Musa data (e.g. transcriptomics, rice reference genome, workflow manager) and finally, we generated new results from sequence analyses (e.g. SNP and polymorphism analysis). Several uses cases illustrate how the Banana Genome Hub can be used to study gene families. Overall, with this collaborative effort, we discuss the importance of the interoperability toward data integration between existing information systems. Database URL: http://banana-genome.cirad.fr/ PMID:23707967

  4. Genomic Insights into Bifidobacteria

    PubMed Central

    Lee, Ju-Hoon; O'Sullivan, Daniel J.

    2010-01-01

    Summary: Since the discovery in 1899 of bifidobacteria as numerically dominant microbes in the feces of breast-fed infants, there have been numerous studies addressing their role in modulating gut microflora as well as their other potential health benefits. Because of this, they are frequently incorporated into foods as probiotic cultures. An understanding of their full interactions with intestinal microbes and the host is needed to scientifically validate any health benefits they may afford. Recently, the genome sequences of nine strains representing four species of Bifidobacterium became available. A comparative genome analysis of these genomes reveals a likely efficient capacity to adapt to their habitats, with B. longum subsp. infantis exhibiting more genomic potential to utilize human milk oligosaccharides, consistent with its habitat in the infant gut. Conversely, B. longum subsp. longum exhibits a higher genomic potential for utilization of plant-derived complex carbohydrates and polyols, consistent with its habitat in an adult gut. An intriguing observation is the loss of much of this genome potential when strains are adapted to pure culture environments, as highlighted by the genomes of B. animalis subsp. lactis strains, which exhibit the least potential for a gut habitat and are believed to have evolved from the B. animalis species during adaptation to dairy fermentation environments. PMID:20805404

  5. Genomic signatures of evolutionary transitions from solitary to group living

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Eusociality has evolved rarely, but repeatedly, in vertebrates and invertebrates, and resulted inconvergent morphological, physiological, and behavioural innovations. It is unknown whether similar evolutionary processes are responsible for the repeated origins and further elaborations of eusociality...

  6. Ensembl comparative genomics resources

    PubMed Central

    Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J.; Searle, Stephen M. J.; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

    2016-01-01

    Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org. PMID:26896847

  7. Genome instability and aging.

    PubMed

    Vijg, Jan; Suh, Yousin

    2013-01-01

    Genome instability has long been implicated as the main causal factor in aging. Somatic cells are continuously exposed to various sources of DNA damage, from reactive oxygen species to UV radiation to environmental mutagens. To cope with the tens of thousands of chemical lesions introduced into the genome of a typical cell each day, a complex network of genome maintenance systems acts to remove damage and restore the correct base pair sequence. Occasionally, however, repair is erroneous, and such errors, as well as the occasional failure to correctly replicate the genome during cell division, are the basis for mutations and epimutations. There is now ample evidence that mutations accumulate in various organs and tissues of higher animals, including humans, mice, and flies. What is not known, however, is whether the frequency of these random changes is sufficient to cause the phenotypic effects generally associated with aging. The exception is cancer, an age-related disease caused by the accumulation of mutations and epimutations. Here, we first review current concepts regarding the relationship between DNA damage, repair, and mutation, as well as the data regarding genome alterations as a function of age. We then describe a model for how randomly induced DNA sequence and epigenomic variants in the somatic genomes of animals can result in functional decline and disease in old age. Finally, we discuss the genetics of genome instability in relation to longevity to address the importance of alterations in the somatic genome as a causal factor in aging and to underscore the opportunities provided by genetic approaches to develop interventions that attenuate genome instability, reduce disease risk, and increase life span. PMID:23398157

  8. Ensembl comparative genomics resources.

    PubMed

    Herrero, Javier; Muffato, Matthieu; Beal, Kathryn; Fitzgerald, Stephen; Gordon, Leo; Pignatelli, Miguel; Vilella, Albert J; Searle, Stephen M J; Amode, Ridwan; Brent, Simon; Spooner, William; Kulesha, Eugene; Yates, Andrew; Flicek, Paul

    2016-01-01

    Evolution provides the unifying framework with which to understand biology. The coherent investigation of genic and genomic data often requires comparative genomics analyses based on whole-genome alignments, sets of homologous genes and other relevant datasets in order to evaluate and answer evolutionary-related questions. However, the complexity and computational requirements of producing such data are substantial: this has led to only a small number of reference resources that are used for most comparative analyses. The Ensembl comparative genomics resources are one such reference set that facilitates comprehensive and reproducible analysis of chordate genome data. Ensembl computes pairwise and multiple whole-genome alignments from which large-scale synteny, per-base conservation scores and constrained elements are obtained. Gene alignments are used to define Ensembl Protein Families, GeneTrees and homologies for both protein-coding and non-coding RNA genes. These resources are updated frequently and have a consistent informatics infrastructure and data presentation across all supported species. Specialized web-based visualizations are also available including synteny displays, collapsible gene tree plots, a gene family locator and different alignment views. The Ensembl comparative genomics infrastructure is extensively reused for the analysis of non-vertebrate species by other projects including Ensembl Genomes and Gramene and much of the information here is relevant to these projects. The consistency of the annotation across species and the focus on vertebrates makes Ensembl an ideal system to perform and support vertebrate comparative genomic analyses. We use robust software and pipelines to produce reference comparative data and make it freely available. Database URL: http://www.ensembl.org. PMID:26896847

  9. Center for Cancer Genomics | Office of Cancer Genomics

    Cancer.gov

    The Center for Cancer Genomics (CCG) was established to unify the National Cancer Institute's activities in cancer genomics, with the goal of advancing genomics research and translating findings into the clinic to improve the precise diagnosis and treatment of cancers. In addition to promoting genomic sequencing approach

  10. Human Genome Project

    SciTech Connect

    Block, S.; Cornwall, J.; Dally, W.; Dyson, F.; Fortson, N.; Joyce, G.; Kimble, H. J.; Lewis, N.; Max, C.; Prince, T.; Schwitters, R.; Weinberger, P.; Woodin, W. H.

    1998-01-04

    The study reviews Department of Energy supported aspects of the United States Human Genome Project, the joint National Institutes of Health/Department of Energy program to characterize all human genetic material, to discover the set of human genes, and to render them accessible for further biological study. The study concentrates on issues of technology, quality assurance/control, and informatics relevant to current effort on the genome project and needs beyond it. Recommendations are presented on areas of the genome program that are of particular interest to and supported by the Department of Energy.

  11. Human Genome Program

    SciTech Connect

    Not Available

    1993-01-01

    The DOE Human Genome program has grown tremendously, as shown by the marked increase in the number of genome-funded projects since the last workshop held in 1991. The abstracts in this book describe the genome research of DOE-funded grantees and contractors and invited guests, and all projects are represented at the workshop by posters. The 3-day meeting includes plenary sessions on ethical, legal, and social issues pertaining to the availability of genetic data; sequencing techniques, informatics support; and chromosome and cDNA mapping and sequencing.

  12. What Is a Genome?

    PubMed Central

    Goldman, Aaron David; Landweber, Laura F.

    2016-01-01

    The genome is often described as the information repository of an organism. Whether millions or billions of letters of DNA, its transmission across generations confers the principal medium for inheritance of organismal traits. Several emerging areas of research demonstrate that this definition is an oversimplification. Here, we explore ways in which a deeper understanding of genomic diversity and cell physiology is challenging the concepts of physical permanence attached to the genome as well as its role as the sole information source for an organism. PMID:27442251

  13. Insights from twenty years of bacterial genome sequencing

    SciTech Connect

    Land, Miriam L; Hauser, Loren John; Jun, Se Ran; Nookaew, Intawat; Leuze, Michael Rex; Ahn, Tae-Hyuk; Karpinets, Tatiana V; Lund, Ole; Kora, Guruprasad H; Wassenaar, Trudy; Poudel, Suresh; Ussery, David W

    2015-01-01

    Since the first two complete bacterial genome sequences were published in 1995, the science of bacteria has dramatically changed. Using third-generation DNA sequencing, it is possible to completely sequence a bacterial genome in a few hours and identify some types of methylation sites along the genome as well. Sequencing of bacterial genome sequences is now a standard procedure, and the information from tens of thousands of bacterial genomes has had a major impact on our views of the bacterial world. In this review, we explore a series of questions to highlight some insights that comparative genomics has produced. To date, there are genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. However, the distribution is quite skewed towards a few phyla that contain model organisms. But the breadth is continuing to improve, with projects dedicated to filling in less characterized taxonomic groups. The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system provides bacteria with immunity against viruses, which outnumber bacteria by tenfold. How fast can we go? Second-generation sequencing has produced a large number of draft genomes (close to 90 % of bacterial genomes in GenBank are currently not complete); third-generation sequencing can potentially produce a finished genome in a few hours, and at the same time provide methlylation sites along the entire chromosome. The diversity of bacterial communities is extensive as is evident from the genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. Genome sequencing can help in classifying an organism, and in the case where multiple genomes of the same species are available, it is possible to calculate the pan- and core genomes; comparison of more than 2000 Escherichia coli genomes finds an E. coli core genome of about 3100 gene families and a total of about 89,000 different gene families. Why do we care about bacterial genome

  14. Insights from 20 years of bacterial genome sequencing.

    PubMed

    Land, Miriam; Hauser, Loren; Jun, Se-Ran; Nookaew, Intawat; Leuze, Michael R; Ahn, Tae-Hyuk; Karpinets, Tatiana; Lund, Ole; Kora, Guruprased; Wassenaar, Trudy; Poudel, Suresh; Ussery, David W

    2015-03-01

    Since the first two complete bacterial genome sequences were published in 1995, the science of bacteria has dramatically changed. Using third-generation DNA sequencing, it is possible to completely sequence a bacterial genome in a few hours and identify some types of methylation sites along the genome as well. Sequencing of bacterial genome sequences is now a standard procedure, and the information from tens of thousands of bacterial genomes has had a major impact on our views of the bacterial world. In this review, we explore a series of questions to highlight some insights that comparative genomics has produced. To date, there are genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. However, the distribution is quite skewed towards a few phyla that contain model organisms. But the breadth is continuing to improve, with projects dedicated to filling in less characterized taxonomic groups. The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system provides bacteria with immunity against viruses, which outnumber bacteria by tenfold. How fast can we go? Second-generation sequencing has produced a large number of draft genomes (close to 90 % of bacterial genomes in GenBank are currently not complete); third-generation sequencing can potentially produce a finished genome in a few hours, and at the same time provide methlylation sites along the entire chromosome. The diversity of bacterial communities is extensive as is evident from the genome sequences available from 50 different bacterial phyla and 11 different archaeal phyla. Genome sequencing can help in classifying an organism, and in the case where multiple genomes of the same species are available, it is possible to calculate the pan- and core genomes; comparison of more than 2000 Escherichia coli genomes finds an E. coli core genome of about 3100 gene families and a total of about 89,000 different gene families. Why do we care about bacterial genome

  15. Comparative primate genomics: emerging patterns of genome content and dynamics

    PubMed Central

    Rogers, Jeffrey; Gibbs, Richard A.

    2014-01-01

    Preface Advances in genome sequencing technologies have created new opportunities for comparative primate genomics. Genome assemblies have been published for several primates, with analyses of several others underway. Whole genome assemblies for the great apes provide remarkable new information about the evolutionary origins of the human genome and the processes involved. Genomic data for macaques and other nonhuman primates provide valuable insight into genetic similarities and differences among species used as models for disease-related research. This review summarizes current knowledge regarding primate genome content and dynamics and offers a series of goals for the near future. PMID:24709753

  16. Comparative primate genomics: emerging patterns of genome content and dynamics.

    PubMed

    Rogers, Jeffrey; Gibbs, Richard A

    2014-05-01

    Advances in genome sequencing technologies have created new opportunities for comparative primate genomics. Genome assemblies have been published for various primate species, and analyses of several others are underway. Whole-genome assemblies for the great apes provide remarkable new information about the evolutionary origins of the human genome and the processes involved. Genomic data for macaques and other non-human primates offer valuable insights into genetic similarities and differences among species that are used as models for disease-related research. This Review summarizes current knowledge regarding primate genome content and dynamics, and proposes a series of goals for the near future. PMID:24709753

  17. GenomeView: a next-generation genome browser

    PubMed Central

    Abeel, Thomas; Van Parys, Thomas; Saeys, Yvan; Galagan, James; Van de Peer, Yves

    2012-01-01

    Due to ongoing advances in sequencing technologies, billions of nucleotide sequences are now produced on a daily basis. A major challenge is to visualize these data for further downstream analysis. To this end, we present GenomeView, a stand-alone genome browser specifically designed to visualize and manipulate a multitude of genomics data. GenomeView enables users to dynamically browse high volumes of aligned short-read data, with dynamic navigation and semantic zooming, from the whole genome level to the single nucleotide. At the same time, the tool enables visualization of whole genome alignments of dozens of genomes relative to a reference sequence. GenomeView is unique in its capability to interactively handle huge data sets consisting of tens of aligned genomes, thousands of annotation features and millions of mapped short reads both as viewer and editor. GenomeView is freely available as an open source software package. PMID:22102585

  18. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine

    PubMed Central

    Elsik, Christine G.; Tayal, Aditi; Diesh, Colin M.; Unni, Deepak R.; Emery, Marianne L.; Nguyen, Hung N.; Hagen, Darren E.

    2016-01-01

    We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search. PMID:26578564

  19. Hymenoptera Genome Database: integrating genome annotations in HymenopteraMine.

    PubMed

    Elsik, Christine G; Tayal, Aditi; Diesh, Colin M; Unni, Deepak R; Emery, Marianne L; Nguyen, Hung N; Hagen, Darren E

    2016-01-01

    We report an update of the Hymenoptera Genome Database (HGD) (http://HymenopteraGenome.org), a model organism database for insect species of the order Hymenoptera (ants, bees and wasps). HGD maintains genomic data for 9 bee species, 10 ant species and 1 wasp, including the versions of genome and annotation data sets published by the genome sequencing consortiums and those provided by NCBI. A new data-mining warehouse, HymenopteraMine, based on the InterMine data warehousing system, integrates the genome data with data from external sources and facilitates cross-species analyses based on orthology. New genome browsers and annotation tools based on JBrowse/WebApollo provide easy genome navigation, and viewing of high throughput sequence data sets and can be used for collaborative genome annotation. All of the genomes and annotation data sets are combined into a single BLAST server that allows users to select and combine sequence data sets to search. PMID:26578564

  20. Genomes and geography: genomic insights into the evolution and phylogeography of the genus Schistosoma

    PubMed Central

    2011-01-01

    Blood flukes within the genus Schistosoma still remain a major cause of disease in the tropics and subtropics and the study of their evolution has been an area of major debate and research. With the advent of modern molecular and genomic approaches deeper insights have been attained not only into the divergence and speciation of these worms, but also into the historic movement of these parasites from Asia into Africa, via migration and dispersal of definitive and snail intermediate hosts. This movement was subsequently followed by a radiation of Schistosoma species giving rise to the S. mansoni and S. haematobium groups, as well as the S. indicum group that reinvaded Asia. Each of these major evolutionary events has been marked by distinct changes in genomic structure evident in differences in mitochondrial gene order and nuclear chromosomal architecture between the species associated with Asia and Africa. Data from DNA sequencing, comparative molecular genomics and karyotyping are indicative of major constitutional genomic events which would have become fixed in the ancestral populations of these worms. Here we examine how modern genomic techniques may give a more in depth understanding of the evolution of schistosomes and highlight the complexity of speciation and divergence in this group. PMID:21736723